HuaizhengZhang/AI-Infra-from-Zero-to-Hero
A curated collection of machine learning systems research papers, tutorials, and resources covering LLMs, generative AI, model serving, and AI infrastructure from top systems conferences.

This repository aggregates research and resources in AI systems engineering, covering topics from foundational ML systems to large language model infrastructure. It compiles papers from major systems conferences (OSDI, NSDI, MLSys, SIGCOMM), includes video tutorials, and documents industry practices for building and serving AI models including Llama3 and Mistral. The project serves as both a learning roadmap and reference library for AI system design, training optimization, and inference deployment.