codefuse-ai/Awesome-Code-LLM
An academic survey and curated collection of language modeling research focused on code and software engineering.

This repository hosts a TMLR-published survey on Code LLMs, maintaining a comprehensive curated list of related research papers, datasets, benchmarks, and model releases. It aggregates work from venues like EMNLP, ICML, and various AI organizations, organizing resources across code generation, code search, code understanding, and multilingual code embeddings. The list is actively maintained with new papers added regularly from major ML/NLP conferences.