shibing624/MedicalGPT
MedicalGPT trains medical-domain large language models with a ChatGPT-style training pipeline that includes RLHF, DPO, and on-policy distillation.

Velocity (7d): +5.0 ★/day · Trend: steady
MedicalGPT provides a complete training pipeline for building medical GPT models. It implements incremental pretraining, supervised fine-tuning, RLHF (reward modeling followed by reinforcement learning), preference optimization methods such as DPO and ORPO, and GRPO, a group-relative reinforcement learning method. The project supports training on medical-domain data and is aimed at researchers and developers building healthcare AI applications.
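To make the DPO stage more concrete, here is a minimal sketch of the standard DPO loss for a single preference pair, written in plain Python. It is not taken from the MedicalGPT codebase. The function name `dpo_loss`, its parameter names, and the default `beta=0.1` are illustrative assumptions. The inputs are summed log-probabilities of the chosen and rejected responses under the policy being trained and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    Hypothetical helper for illustration only; names and the default
    beta are assumptions, not the project's API.
    """
    # How much more (or less) likely each response is under the policy
    # than under the frozen reference model, in log space.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled margin: positive when the policy favors the chosen response
    # more strongly than the reference does.
    margin = beta * (chosen_logratio - rejected_logratio)

    # loss = -log(sigmoid(margin)) = log(1 + exp(-margin)),
    # computed in a numerically stable way for either sign of margin.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Policy identical to the reference: margin is 0, so the loss is log(2).
baseline = dpo_loss(-12.0, -15.0, -12.0, -15.0)

# Policy has shifted probability toward the chosen response: lower loss.
improved = dpo_loss(-10.0, -17.0, -12.0, -15.0)
```

The loss falls as the policy raises the chosen response's likelihood relative to the reference and lowers the rejected one's. This is why DPO needs no separate reward model, unlike the RLHF stage.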