albertan017/LLM4Decompile
LLM4Decompile is a research project that applies large language models to reverse engineer and decompile binary executables back into readable source code.

The project trains specialized LLMs on binary-source code pairs to perform decompilation, enabling recovery of human-readable code from compiled executables. It provides trained models (e.g., SK-Decompile for two-phase binary decompilation), evaluation benchmarks (decompile-bench with 70K function pairs), and datasets for training. The work focuses on improving decompilation accuracy through model fine-tuning and structured approaches combining structure recovery with identifier naming.