mryab/efficient-dl-systems
A university course on efficient deep learning systems covering GPU optimization, distributed training, model serving, and inference acceleration.

This repository contains lecture slides, seminars, and assignments for a graduate-level course on efficient deep learning systems taught at HSE University and Yandex School of Data Analysis. The syllabus covers CUDA programming, mixed-precision training, data-parallel and tensor-parallel training, FSDP, gradient checkpointing, model serving, and inference optimization using PyTorch. Materials include benchmarking exercises, profiling labs, and distributed training practicals.