← all repositories

shibing624/MedicalGPT

MedicalGPT trains medical domain large language models using ChatGPT-style training pipelines including RLHF, DPO, and on-policy distillation.

MedicalGPT
Velocity · 7d
+5.0
★ / day
Trend
steady
star history

MedicalGPT provides a complete training pipeline for building medical GPT models. It implements incremental pretraining, supervised fine-tuning, RLHF with reward modeling and reinforcement learning, and direct preference optimization methods including DPO, ORPO, and GRPO. The project supports training on medical domain data and is designed for researchers and developers building healthcare AI applications.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.