AdilShamim8/ML-Roadmap-and-Notes

Machine Learning Roadmap and Notes — 2026 Update


A carefully curated collection of machine learning notes, resources, projects, and datasets designed to guide you through the ML landscape effectively.




Learning Journey Overview

This roadmap is organized into seven progressive levels:

Level Focus Description
1 Testing the waters Familiarize yourself with the ML universe
2 Gaining Conceptual depth Learn core ML concepts and algorithms
3 Learning Practical Concepts Apply ML in real-world scenarios
4 Diving into different domains Explore specialized ML fields
5 Pushing it with Projects Build comprehensive ML projects
6 Senior / Expert-Level ML Engineering 🆕 Production systems, MLOps, cloud, system design
7 Emerging & Frontier Topics 🆕 LLMs, Agents, LLMOps, Responsible AI

1. Testing the waters

This level aims to familiarize you with the ML universe. You will learn a bit about everything.

Learn Python

Click to expand Python resources
  1. Basics of Python - View Notes
  2. OOP in Python - View Notes
  3. Advanced Topics - View Notes
  4. Practice Problems - View Notes

Learn NumPy

Click to expand NumPy resources
  1. NumPy - View Notes
  2. NumPy Practice Problems - View Exercises

Learn Pandas

Click to expand Pandas resources
  1. Pandas - View Notes
  2. Pandas Problems - View Exercises

Learn Data Visualization

Click to expand Data Visualization resources
  1. Matplotlib - View Notes
  2. Seaborn - View Notes

Fundamentals of Statistics

Click to expand Statistics resources
  1. Statistics - View Notes

Learn Data Analysis Process

Click to expand Data Analysis Process resources
  1. Learn Data Analysis Process - View Notes

Learn Exploratory Data Analysis (EDA)

Click to expand EDA resources
  1. Learn Exploratory Data Analysis (EDA) Notes - View Notes

Learn Machine Learning Basics

Click to expand ML Basics resources
  1. Learn Machine Learning Basics Notes - View Notes

2. Gaining Conceptual depth

The goal of this level is to learn the core machine learning concepts and algorithms.

Mathematics for Machine Learning

Click to expand Mathematics resources

Learn about tensors

Click to expand Tensor resources
  1. What are Tensors? - View Notes

Advanced Statistics

Click to expand Advanced Statistics resources
  1. Advanced Statistics Notes - View Notes

Fundamentals of Probability

Click to expand Probability resources
  1. Probability Basics Notes - View Notes

Fundamentals of Linear Algebra

Click to expand Linear Algebra resources
  1. Linear Algebra Basics Notes - View Notes

Fundamentals of Calculus

Click to expand Calculus resources
  1. Basics of Calculus Notes - View Notes

Machine Learning Algorithms

Click to expand ML Algorithms resources

Machine Learning — All Models Link

Algorithm Notes Link
Linear Regression View Notes
Gradient Descent View Notes
Logistic Regression View Notes
Support Vector Machines View Notes
Naive Bayes View Notes
K Nearest Neighbors View Notes
Decision Trees View Notes
Random Forest View Notes
Bagging View Notes
AdaBoost View Notes
Gradient Boosting View Notes
XGBoost View Notes
PCA View Notes
K-Means Clustering View Notes
Hierarchical Clustering View Notes
DBSCAN View Notes
t-SNE Coming Soon
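To make the gradient-descent entry above concrete, here is a minimal NumPy sketch that fits a straight line by batch gradient descent; the learning rate and epoch count are arbitrary illustrative choices:

```python
import numpy as np

def fit_linear(x, y, lr=0.1, epochs=500):
    """Fit y = w*x + b by minimizing mean squared error with gradient descent."""
    w, b = 0.0, 0.0
    n = len(x)
    for _ in range(epochs):
        y_hat = w * x + b
        dw = (2 / n) * np.sum((y_hat - y) * x)  # dL/dw for MSE loss
        db = (2 / n) * np.sum(y_hat - y)        # dL/db
        w -= lr * dw
        b -= lr * db
    return w, b

x = np.array([0.0, 1.0, 2.0, 3.0])
y = 2 * x + 1  # ground truth: w = 2, b = 1
w, b = fit_linear(x, y)
print(round(w, 2), round(b, 2))
```

The same update rule generalizes to any differentiable loss; libraries like scikit-learn hide this loop behind `fit()`.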

Machine Learning Metrics

Click to expand ML Metrics resources

Regularization

Click to expand Regularization resources

3. Learning Practical Concepts

This level aims to introduce you to the practical side of machine learning. What you learn at this level will help you out there in the wild.

Data Acquisition

Click to expand Data Acquisition resources
  1. Data Acquisition - View Notes

Working with missing values

Click to expand Missing Values resources
Technique Notes Link
Complete Case Analysis View Notes
Handling missing numerical data View Notes
Handling missing categorical data View Notes
Missing indicator View Notes
KNN Imputer View Notes
MICE View Notes

Practice Resources: Kaggle Notebooks and Practice Datasets
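A short sketch of two imputation techniques from the table above, using scikit-learn's SimpleImputer and KNNImputer on an invented toy matrix:

```python
import numpy as np
from sklearn.impute import KNNImputer, SimpleImputer

X = np.array([[1.0, 2.0],
              [np.nan, 3.0],
              [7.0, 6.0]])

X_mean = SimpleImputer(strategy="mean").fit_transform(X)  # nan -> column mean
X_knn = KNNImputer(n_neighbors=2).fit_transform(X)        # nan -> mean of nearest rows
print(X_mean[1, 0], X_knn[1, 0])
```

With only three rows the two methods coincide here; on real data KNN imputation adapts to local structure while mean imputation flattens it.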

Feature Scaling/Normalization

Click to expand Feature Scaling resources
  1. Standardization / Normalization - View Notes
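A minimal example of the two most common scalers in scikit-learn:

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler, StandardScaler

X = np.array([[1.0], [2.0], [3.0]])

Xs = StandardScaler().fit_transform(X)  # standardization: zero mean, unit variance
Xm = MinMaxScaler().fit_transform(X)    # normalization: rescaled to [0, 1]
print(Xs.ravel(), Xm.ravel())
```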

Feature Encoding Techniques

Click to expand Feature Encoding resources
  1. Feature Encoding Techniques - View Notes

Feature Transformation

Click to expand Feature Transformation resources
  1. Function Transformer - View Notes
  2. Power Transformations - View Notes
  3. Binning and Binarization - View Notes

Working with Pipelines

Click to expand Pipelines resources
  1. Column Transformer - View Notes
  2. Sklearn Pipelines - View Notes
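The two tools above combine naturally: a ColumnTransformer routes each column type to its own preprocessing, and a Pipeline chains that into the model. A small sketch (the column names and data are invented for illustration):

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

df = pd.DataFrame({
    "age": [25, 32, 47, 51],
    "city": ["NY", "SF", "NY", "LA"],
    "bought": [0, 1, 1, 0],
})

pre = ColumnTransformer([
    ("num", StandardScaler(), ["age"]),                           # scale numeric columns
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["city"]),    # encode categoricals
])

pipe = Pipeline([("preprocess", pre), ("model", LogisticRegression())])
pipe.fit(df[["age", "city"]], df["bought"])
preds = pipe.predict(df[["age", "city"]])
print(preds)
```

Because preprocessing lives inside the pipeline, the same object can be cross-validated, tuned, and pickled as one unit, which also helps prevent data leakage.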

Handling Time and Date

Click to expand Time and Date resources
  1. Working with time and date data - View Notes

Working with Outliers

Click to expand Outliers resources
  1. Working with Outliers - View Notes

Feature Construction

Click to expand Feature Construction resources
  1. Feature Construction - View Notes

Feature Selection

Click to expand Feature Selection resources
  1. Feature selection - View Notes

Cross Validation

Click to expand Cross Validation resources
  1. Cross-validation - View Notes
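A minimal k-fold cross-validation example with scikit-learn, scoring a classifier on five splits instead of a single train/test split:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

# cv=5 -> five train/validation splits; returns one accuracy per fold
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)
print(scores.mean())
```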

Modelling - Stacking and Blending

Click to expand Modelling resources
  1. Stacking - View Notes
  2. Blending - View Notes
  3. LightGBM - View Notes
  4. CatBoost - View Notes

Model Tuning

Click to expand Model Tuning resources
  1. GridSearchCV - View Notes
  2. RandomizedSearchCV - View Notes
  3. Hyperparameter Tuning - View Notes
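A small GridSearchCV sketch; the parameter grid below is an arbitrary illustration, not a recommendation:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Exhaustively tries every (C, kernel) pair with 3-fold cross-validation
grid = GridSearchCV(SVC(), {"C": [0.1, 1, 10], "kernel": ["linear", "rbf"]}, cv=3)
grid.fit(X, y)
print(grid.best_params_, round(grid.best_score_, 3))
```

RandomizedSearchCV has the same interface but samples a fixed number of candidates, which scales better to large search spaces.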

Working with imbalanced data

Click to expand Imbalanced Data resources
  1. How to handle imbalanced data - View Notes

Handling Multicollinearity

Click to expand Multicollinearity resources
  1. Handling Multicollinearity - View Notes

Data Leakage

Click to expand Data Leakage resources
  1. Data Leakage - View Notes

Serving your model

Click to expand Model Serving resources

Coming Soon:

  • Pickling your model
  • Flask
  • Streamlit
  • Deploy model on Heroku
  • Deploy model on AWS
  • Deploy model to GCP
  • Deploy model to Azure
  • ML model to Android App
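Pending those notes, a minimal sketch of the first item — pickling a trained model so it can be loaded later by a serving process:

```python
import pickle

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
model = DecisionTreeClassifier(random_state=0).fit(X, y)

# Serialize the trained model to bytes; in practice, write these to a .pkl file
blob = pickle.dumps(model)
restored = pickle.loads(blob)

# The restored model reproduces the original predictions exactly
match = bool((restored.predict(X) == model.predict(X)).all())
print(match)
```

joblib.dump/load is the usual alternative for large NumPy-heavy models; never unpickle files from untrusted sources.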

Working with Large Datasets

Click to expand Large Datasets resources
  1. Working with Large Datasets - View Notes

4. Diving into different domains

This is the level where you dive into the different domains of Machine Learning. Mastering these will make you a true Data Scientist.

SQL

Click to expand SQL resources
  1. SQL learning resources - View Resources

Recommendation Systems

Click to expand Recommendation Systems resources
  1. Movie Recommendation System - View Project
  2. Book Recommender System - View Project

Association Rule Learning

Click to expand Association Rule Learning resources

Coming Soon:

  • Association Rule Mining (Apriori Algorithm)
  • Eclat Algorithm
  • Market Basket Analysis

Anomaly Detection

Click to expand Anomaly Detection resources

Coming Soon:

  • Anomaly Detection Lecture from Microsoft Research
  • Novelty Detection Lecture

NLP

Click to expand NLP resources
  1. NLP-Introduction - View Notebook
  2. NLP Notes - (Coming Soon)
  3. Email Spam Classifier Project - View Project

Time Series

Coming Soon

Computer Vision

Click to expand Computer Vision resources
  1. Introduction to Computer Vision - View Notebook
  2. Cat vs Dog Classification Project - View Project

Fundamentals of Neural Network

Coming Soon


5. Pushing it with Projects

The objective of this level is to sharpen the knowledge you have accumulated in the previous four levels.

Project Collections

Click to expand Project resources

6. Senior / Expert-Level ML Engineering 🆕 (April 2026)

The high-value MLE in 2026 is an MLOps expert who can deploy and monitor models at scale, not just an algorithm tuner. This level covers the production engineering, infrastructure, and system-design skills that separate senior ML engineers from juniors.

6.1 Production-Grade Python & Software Engineering

Click to expand

In 2026, basic scripting isn't enough — you need to write clean, modular, production-grade code.

Topic What to Learn Recommended Resources
Clean Code & Design Patterns SOLID principles, factory patterns, dependency injection for ML Clean Code by Robert C. Martin; Fluent Python by Luciano Ramalho
Type Hints & Static Analysis mypy, pydantic for data validation, runtime checks Pydantic Docs
Testing ML Code pytest, property-based testing with hypothesis, data-validation with great_expectations Great Expectations
Async & Concurrency asyncio, multiprocessing, ray for distributed workloads Ray Docs
Packaging & Reproducibility pyproject.toml, uv/poetry, virtual environments, reproducible builds Python Packaging Guide
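As a small illustration of the pydantic row above, validating an inference request at runtime (the schema and field names are invented for illustration):

```python
from pydantic import BaseModel, Field, ValidationError

# Hypothetical input schema for a prediction endpoint
class PredictionRequest(BaseModel):
    age: int = Field(ge=0, le=120)   # constrained integer
    income: float = Field(ge=0)      # non-negative float

ok = PredictionRequest(age=30, income=55000.0)
print(ok.age)

rejected = False
try:
    PredictionRequest(age=-5, income=1000.0)  # violates age >= 0
except ValidationError:
    rejected = True
print("rejected invalid input:", rejected)
```

Validating at the boundary like this turns silent bad-data bugs into explicit, testable errors.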

6.2 MLOps — Machine Learning Operations

Click to expand

MLOps blends machine learning with software-engineering practice to bridge the gap between experimental ML and production systems, enabling the scalable, reliable, and automated deployment of models in enterprise environments.

6.2.1 Core MLOps Concepts

In 2026, MLOps has evolved beyond basic CI/CD for models. It now covers model lifecycle management, data versioning and governance, continuous training and retraining, infrastructure automation, monitoring, observability, and compliance, plus integration with enterprise systems.

Topic Tools & Frameworks Description
Experiment Tracking MLflow, Weights & Biases, Neptune Track experiments, reproduce results, and manage model versions systematically
Model Registry & Versioning MLflow Model Registry, DVC, LakeFS Track model artifacts, datasets, and lineage
CI/CD for ML GitHub Actions, Jenkins, GitLab CI Automate retraining, testing, and deployment so model updates ship reliably with minimal downtime
Pipeline Orchestration Apache Airflow, Kubeflow Pipelines, Prefect, Dagster Build automated, reproducible training & serving pipelines
Model Monitoring & Drift Detection Evidently AI, WhyLabs, Prometheus, Datadog Detect data and prediction drift so models maintain performance over time and teams are alerted when retraining becomes necessary
Feature Stores Feast, Tecton, Hopsworks Serve consistent features in training and production; training/serving feature mismatch is a common failure point

6.2.2 MLOps 2026 Key Trends

The field is moving toward hyper-automation, with workflows that retrain and redeploy models autonomously, learning and adapting without human intervention. Edge computing is also taking center stage.

  • Hyper-Automation: Tools now enable automated retraining triggered by data changes or model drift, ensuring models stay accurate and relevant in dynamic environments. Automated deployment pipelines further enhance efficiency.
  • Policy-as-Code Governance: Embedding executable governance rules into MLOps pipelines is a trend on the rise. Organizations are pursuing systems that automatically integrate fairness, data lineage, versioning, and compliance with regulations.
  • AgentOps: AgentOps has emerged as the new "evolution" of MLOps practices, defined as the discipline to manage, deploy, and monitor AI systems based on autonomous agents. This novel trend defines its own set of operational practices, tooling, and pipelines that accommodate stateful, multi-step AI agent lifecycles.
  • Edge MLOps: As edge devices become more powerful, deploying ML models directly on these devices is gaining traction. Edge MLOps enables real-time decision-making in environments with limited connectivity, such as autonomous vehicles or IoT sensors.

6.2.3 MLOps Platforms to Know in 2026

Platform Best For
AWS SageMaker End-to-end ML on AWS (training, serving, monitoring)
Google Vertex AI Tight integration with Google Cloud; consistently listed alongside SageMaker as a top enterprise MLOps solution
Azure Machine Learning Enterprise compliance & Microsoft ecosystem
Databricks (MLflow) Lakehouse architecture, unified analytics + ML
MLflow (Open Source) One of the most widely used open-source platforms for tracking and managing ML experiments; flexibility is its primary advantage
Kubeflow Kubernetes-native ML pipelines
TrueFoundry TrueFoundry is a modern MLOps and LLMOps platform built for teams that want to deploy, scale, and monitor machine learning and generative AI models in production. It abstracts away infrastructure complexity while offering complete control.

6.3 Containerization & Cloud Infrastructure

Click to expand

Learn Docker and work through a basic tutorial to containerize a simple web application. This is your gateway to MLOps.

Topic Tools What to Learn
Containerization Docker, Docker Compose Build reproducible ML environments; multi-stage builds for lean images
Container Orchestration Kubernetes, Helm Deploy and scale containerized training and serving workloads; Kubernetes (alongside Airflow, Terraform, and Kafka) appears in most senior MLE job postings in 2026
Infrastructure as Code Terraform, Pulumi, CloudFormation Reproducible infrastructure for training clusters and serving endpoints
Serverless ML Serving AWS Lambda, Google Cloud Functions, BentoML Cost-effective inference for low-traffic or bursty workloads
GPU Management NVIDIA CUDA, RAPIDS, cloud GPU instances Efficient use of GPU resources for training and inference

Cloud Platforms — Pick One to Master

Fluency in one major cloud provider (AWS, Azure, or GCP) is now expected for most machine learning engineer roles.

Provider Key ML Services
AWS SageMaker, Bedrock, S3, EC2, Lambda, Step Functions
GCP Vertex AI, BigQuery ML, Cloud Run — AWS leads the market, but GCP often has the most intuitive and modern MLOps tools
Azure Azure ML, Azure OpenAI Service, Cognitive Services

6.4 Model Serving & Deployment

Click to expand

This fills in the "Coming Soon" serving section from Level 3, now with the 2026 production stack.

Topic Tools Description
Model Serialization Pickle, Joblib, ONNX, TorchScript Export models for production with optimized formats
REST API Serving FastAPI, Flask, BentoML Build production APIs around your models
Real-Time Serving TensorFlow Serving, Triton Inference Server, TorchServe Low-latency, high-throughput model serving
Batch Inference Apache Spark MLlib, Dask, SageMaker Batch Transform Process large volumes offline
Web Apps & Demos Streamlit, Gradio Rapid prototyping and internal tools
Model Optimization Quantization (INT8/FP16), Pruning, Distillation, ONNX Runtime Reduce latency and cost in production
Deployment Targets Heroku, AWS (ECS/Lambda), GCP (Cloud Run), Azure (App Service), Hugging Face Spaces Full spectrum from MVP to enterprise

6.5 ML System Design

Click to expand

Modern ML engineers often build on pretrained models instead of training from scratch — a key productivity shift in 2026.

Senior ML engineers are expected to design end-to-end ML systems. Key areas:

Design Component What to Master
Problem Framing Technical excellence alone no longer suffices. Engineers must understand business contexts, translate technical capabilities into business value, and align ML initiatives with organizational objectives.
Data System Design Data lakes, data warehouses, streaming pipelines (Kafka, Kinesis), ETL
Training System Design Distributed training, hyperparameter optimization at scale, compute budgeting
Serving System Design Online vs. batch, caching, A/B testing, shadow deployment, canary releases
Feedback Loops The loop of Raw Data → Data Preprocessing → Feature Store → Model Training → Model Registry → Deployment → Monitoring & Feedback is the heart of MLOps — continuously improving models based on real-world feedback.
Cost Optimization Spot instances, auto-scaling, model distillation, right-sizing infrastructure

Recommended Resources:

  • Designing Machine Learning Systems by Chip Huyen
  • ML System Design Interview resources
  • Explore real-world ML system design case studies — over 300 are publicly shared across 80+ companies.

6.6 Data Engineering Fundamentals for ML Engineers

Click to expand

SQL & Data Engineering: Data is the fuel. If you can't fetch, clean, and pipe your own fuel, your engine (model) won't run.

Topic Tools Description
Advanced SQL PostgreSQL, BigQuery, Snowflake, Redshift Window functions, CTEs, query optimization
Data Pipelines Apache Airflow, Dagster, Prefect Orchestrate ETL/ELT workflows
Streaming Apache Kafka, Apache Flink, Kinesis Real-time data ingestion for online ML
Data Quality Great Expectations, dbt tests, Soda Validate data before it enters your pipeline
Data Versioning DVC, LakeFS, Delta Lake Version datasets alongside code

6.7 Business Acumen & Soft Skills for Senior Engineers

Click to expand

The ability to frame business problems as machine-learning challenges is a critical skill. Engineers must identify situations where ML provides appropriate solutions versus contexts where simpler approaches suffice. This judgment prevents wasted effort on ML implementations that don't deliver commensurate value.

Skill Why It Matters
ROI & Cost-Benefit Analysis ROI calculation and project scoping abilities enable engineers to set realistic expectations and demonstrate value delivery.
Technical Communication Present ML results to non-technical stakeholders; write design docs
Mentorship & Leadership At the senior stage, you may lead teams or design ML platforms.
Cross-Functional Collaboration Work with product, design, data engineering, and DevOps teams
Continuous Learning The tools will change next year, but the ability to learn and adapt is the only skill that never depreciates.

7. Emerging & Frontier Topics 🆕 (April 2026)

The future of ML goes hand in hand with GenAI, LLMs, and AutoML enabling more autonomous model creation and intelligent systems. This level covers the most in-demand emerging skills for ML engineers in 2026.

7.1 Deep Learning Foundations (Prerequisite for LLMs)

Click to expand
Topic What to Learn
Neural Network Architectures Feedforward, CNN, RNN, LSTM, GRU
Training Deep Networks Backpropagation, optimizers (Adam, AdamW), learning rate schedulers, gradient clipping
Frameworks PyTorch (dominant in 2026), TensorFlow/Keras, JAX
Transfer Learning Pretrained models, fine-tuning, feature extraction
Transformer Architecture Transformers are the backbone of modern NLP models. They allow for parallel processing of data, enhancing model efficiency and scalability. Self-attention, multi-head attention, positional encoding
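The self-attention operation at the core of the Transformer can be sketched in a few lines of NumPy (single head, no masking or learned projections):

```python
import numpy as np

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(QK^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # query-key similarity, scaled for stability
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
    return weights @ V  # each output row is a weighted sum of value rows

rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 4))  # 3 tokens, dimension 4
K = rng.normal(size=(3, 4))
V = rng.normal(size=(3, 4))
out = attention(Q, K, V)
print(out.shape)
```

Multi-head attention runs several of these in parallel on learned linear projections of the input and concatenates the results.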

Resources:

  • Deep Learning by Goodfellow, Bengio, Courville (free online)
  • Stanford CS231n (CNNs), CS224n (NLP with Deep Learning)
  • PyTorch Official Tutorials

7.2 Large Language Models (LLMs)

Click to expand

The new reality with LLMs is fundamentally different. In 2026, production AI systems are not single models but complex orchestrations of multiple components: foundation models, fine-tuned adapters, retrieval systems, guardrails, routing logic, and feedback mechanisms.

Topic What to Learn
LLM Fundamentals Tokenization, attention mechanisms, scaling laws, pretraining vs. fine-tuning
Working with LLM APIs OpenAI, Anthropic (Claude), Google (Gemini), open-source (Llama, Mistral, DeepSeek)
Prompt Engineering Zero-shot, few-shot, and chain-of-thought prompting techniques to achieve consistent and controlled outputs.
Fine-Tuning LoRA, QLoRA, PEFT with Hugging Face trl; as smaller, more efficient models become common, fine-tuning is quickly turning into a core skill for AI engineers
Model Selection Evaluate open-source vs. frontier models for your use case; practice by taking an open-weights base model, curating instruction-response pairs, and running a LoRA fine-tune
Quantization & Optimization GPTQ, AWQ, GGUF, INT8/FP16, vLLM for high-throughput serving

7.3 Retrieval-Augmented Generation (RAG)

Click to expand

The gap between a 10-minute tutorial RAG and a production RAG that users trust is enormous: chunking strategy (fixed-size vs. semantic vs. hierarchical), overlap settings, embedding-model choice, retrieval mode (dense, sparse, hybrid), and cross-encoder reranking all matter.

Component What to Learn
Vector Databases Pinecone, Weaviate, Qdrant, ChromaDB, pgvector
Embedding Models OpenAI Embeddings, Sentence-Transformers, Cohere Embed, fine-tuning embeddings
Chunking Strategies Fixed-size, semantic, hierarchical, overlap tuning
Retrieval Dense retrieval, sparse (BM25), hybrid search, re-ranking
Advanced RAG Query augmentation, multi-hop retrieval, metadata filtering, parent-child chunks
Evaluation RAGAS metrics, faithfulness, answer relevancy, context precision
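A toy fixed-size chunker with overlap, to make the first strategy in the table concrete; the sizes are arbitrary, and production systems chunk by tokens and semantic boundaries rather than characters:

```python
def chunk_text(text, size=20, overlap=5):
    """Split text into fixed-size character chunks with a fixed overlap."""
    chunks = []
    step = size - overlap  # advance by size minus overlap each iteration
    for start in range(0, len(text), step):
        chunk = text[start:start + size]
        if chunk:
            chunks.append(chunk)
        if start + size >= len(text):
            break  # last chunk already covers the tail
    return chunks

doc = "a" * 50
chunks = chunk_text(doc)
print(len(chunks), [len(c) for c in chunks])
```

The overlap ensures a sentence split across a boundary still appears whole in at least one chunk, at the cost of some index redundancy.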


7.4 AI Agents & Agentic Systems

Click to expand

AI agents powered by large language models and other agentic architectures have recently gained a significant presence in production environments. As a result, organizations need dedicated operational frameworks.

Topic What to Learn
Agent Fundamentals Take your RAG pipeline and give the model tools: web search, document retrieval, a calculator, a code executor. Implement the ReAct loop manually.
Agent Frameworks Teams starting new agent projects in 2026 are choosing LangGraph, CrewAI, or PydanticAI. Also: OpenAI Agents SDK, n8n
Model Context Protocol (MCP) MCP has become the de facto standard for connecting AI agents to external systems.
Multi-Agent Systems Orchestration of multiple specialized agents, handoffs, shared state
Tool Use & Function Calling Define tools, parse LLM function calls, handle errors gracefully
Guardrails & Safety Prevent prompt injection, harmful outputs, infinite loops
Observability Use Grafana, Pydantic Logfire and OpenTelemetry for observability and safety.
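The ReAct loop mentioned in the table can be sketched with a stubbed decision step standing in for the LLM call; the tool names and hard-coded logic are invented purely to show the thought → action → observation cycle:

```python
def calculator(expression: str) -> str:
    # Toy tool; never eval untrusted input in a real agent
    return str(eval(expression, {"__builtins__": {}}))

TOOLS = {"calculator": calculator}

def decide(question, observations):
    """Stub for the LLM: pick an action given the question and past observations."""
    if not observations:
        return ("calculator", "6 * 7")  # first step: call a tool
    return ("final", f"The answer is {observations[-1]}")  # then answer

def react(question, max_steps=5):
    observations = []
    for _ in range(max_steps):
        action, arg = decide(question, observations)
        if action == "final":
            return arg
        observations.append(TOOLS[action](arg))  # observe the tool's output
    return "gave up"

print(react("What is 6 * 7?"))
```

Frameworks like LangGraph replace `decide()` with a real model call and add state persistence, retries, and guardrails around the same loop.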

7.5 LLM Orchestration Frameworks

Click to expand

Many production stacks in 2026 use LlamaIndex as the knowledge/retrieval layer and LangChain as the orchestration layer — the two are no longer direct competitors.

Framework Purpose Link
LangChain Orchestration, chains, LCEL composability langchain.com
LlamaIndex Data ingestion, indexing, retrieval llamaindex.ai
LangGraph Stateful, graph-based agent workflows LangGraph Docs
CrewAI Multi-agent orchestration crewai.io
PydanticAI Type-safe agent building pydantic.dev/pydantic-ai
OpenAI Agents SDK Official OpenAI agent tooling OpenAI Docs
Hugging Face Transformers Model hub, fine-tuning, inference huggingface.co

7.6 LLMOps — Operating LLM Systems in Production

Click to expand

The year 2026 marks a pivotal moment where traditional MLOps — focused primarily on model training pipelines, experiment tracking, and batch inference — is evolving into something far more sophisticated and demanding.

New skills specific to LLM systems include prompt engineering and optimization, RAG system design, LLM evaluation strategies, guardrail implementation, and cost optimization techniques.

Topic Tools Description
Prompt Management LangSmith, Langfuse, PromptLayer Version, test, and A/B test prompts
LLM Evaluation RAGAS, Evidently, LangWatch Improve through offline evaluation and golden test sets; use LLMs as judges to compare approaches
Cost Monitoring Langfuse, LangSmith Trace requests, implement evaluation metrics, and build a cost-per-request dashboard
Guardrails Guardrails AI, NeMo Guardrails, custom validators Content safety, factuality checks, PII filtering
Caching Semantic caching (GPTCache), Redis Reduce API costs and latency
Gateway / Routing LiteLLM, Portkey Route between models, implement fallbacks

7.7 Explainable AI (XAI) & Responsible ML

Click to expand

As AI systems become more prevalent in critical decision-making processes, the ability to understand and explain their behavior becomes essential. Explainable AI (XAI) encompasses techniques and tools that make AI model behaviors and decisions understandable to humans.

Industry and regulatory trends indicate that explainable and ethical AI capabilities will become essential job skills by 2026, as organizations face increasing scrutiny over AI decision-making processes.

Topic What to Learn
Model Interpretability SHAP, LIME, Attention Visualization, Feature Importance
Bias Detection & Mitigation Building fair, transparent ML systems requires understanding data privacy regulations, implementing bias mitigation strategies, and ensuring responsible AI content generation. Fairlearn, AIF360
AI Governance & Compliance Compliance frameworks like Europe's AI Act forced organizations to rethink how they managed governance and accountability in their AI systems. EU AI Act, NIST AI RMF
Model Cards & Documentation Standardized model documentation for transparency
Red Teaming for LLMs Adversarial testing of LLM systems for safety

7.8 Generative AI & Multimodal Systems

Click to expand

In 2026, MLOps is no longer just about CI/CD for models — it now encompasses classical ML models, LLMs, retrieval-augmented generation pipelines, vector search, and increasingly agent-based applications.

Topic What to Learn
Text Generation GPT-4, Claude, Gemini, Llama, Mistral — API usage and self-hosting
Image Generation Stable Diffusion, DALL-E, Midjourney APIs
Multimodal Models GPT-4V, Gemini multimodal, LLaVA — combining text, image, audio
Audio/Speech Whisper (speech-to-text), TTS models, audio embeddings
Code Generation GitHub Copilot integration, code LLMs, AI-assisted development

7.9 Certifications & Career Growth (2026)

Click to expand

The average salary for a Senior Machine Learning Engineer is $212,689 per year in the United States.

Recommended Certifications

Certification Provider Notes
AWS Machine Learning — Specialty AWS AWS's ML Specialty exam is being retired in March 2026, so plan accordingly. Look for the new AWS AI/ML certifications replacing it.
AWS Certified AI Practitioner AWS New 2026 entry-level AI certification
Google Cloud Professional ML Engineer GCP Strong focus on Vertex AI
Azure AI Engineer Associate Microsoft Azure ML + Azure OpenAI
TensorFlow Developer Certificate Google Proves hands-on TF proficiency

Interview Preparation for Senior Roles

Engineering teams at FAANG and AI-native companies are testing system design for LLM systems — e.g., "Design a document Q&A system for 10 million documents." Interviewers want to hear you reason about chunking strategy, embedding-model tradeoffs, vector index design, query routing, latency budgets, and cost. They are checking whether you understand RAG as a system, not just a library call.

Key interview focus areas in 2026:

  • ML System Design (end-to-end)
  • LLM System Design (RAG, agents, serving)
  • Debugging prompts: You'll get a bad prompt and a set of failure cases and be asked to fix it. This tests systematic thinking: can you identify whether the failure is a prompt issue, a context issue, a model capability issue, or a retrieval issue?
  • Coding (LeetCode medium-level DSA)
  • ML fundamentals & statistics

Summary: What Changed in the April 2026 Update

Section What Was Added Why
Level 6 MLOps, Containerization, Cloud, Model Serving, System Design, Data Engineering, Business Skills In 2026, MLOps is no longer optional. It is the foundation that enables scalable, secure, compliant, and business-ready AI systems.
Level 7 Deep Learning, LLMs, RAG, AI Agents, LLMOps, XAI, Generative AI, Certifications In 2026, companies expect engineers to build production-ready AI systems that combine large language models with RAG pipelines, vector databases, agent frameworks, fine-tuning, and scalable backends.

As AI becomes more capable, human skills become your competitive advantage. The code you write is important, but the problem you solve is vital. In 2026, the best ML Engineers will actually be Product Engineers who use AI.


GitHub Profile   LinkedIn Profile

⭐ If you find this repository helpful, please consider giving it a star! ⭐
