ai-infra-curriculum/ai-infra-engineer-learning
A production-grade learning path for becoming an AI Infrastructure Engineer, covering MLOps, LLM deployment, and distributed ML systems at scale.

This repository provides a complete curriculum for AI infrastructure engineering spanning 10 modules across 500+ hours of hands-on learning. It covers building ML infrastructure from scratch, deploying production ML systems with auto-scaling and monitoring, implementing end-to-end MLOps pipelines, deploying LLM infrastructure using vLLM and RAG, and scaling distributed training with GPU clusters. The curriculum includes production-grade code stubs with TODO comments and real-world projects aligned with industry standards from FAANG and top tech companies.