AmberLJC/LLMSys-PaperList
A curated list of academic papers, tutorials, and technical reports covering LLM training systems, serving infrastructure, and agent frameworks.

This repository aggregates academic papers and resources on Large Language Model systems, spanning topics like distributed training (Megatron-LM, FlashAttention), inference serving (vLLM, llama.cpp), agent systems, multi-modal training, and ML frameworks. It organizes references by category including pre-training, post-training, RLHF, benchmarking, and industrial technical reports. The list serves as a reference bibliography for researchers and engineers tracking developments in LLM system infrastructure.