← all repositories

dmis-lab/biobert

A BERT-based language model pre-trained on biomedical corpora (PubMed and PMC) for biomedical text mining tasks.

2.2k stars Python Language ModelsDomain Apps
biobert
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

BioBERT provides pre-trained language representation models specialized for the biomedical domain, based on Google’s original BERT architecture. It offers multiple model sizes (base and large) with checkpoints trained on large biomedical corpora including PubMed abstracts and PMC full-text articles. The models can be fine-tuned for downstream biomedical NLP tasks including named entity recognition, relation extraction, and question answering.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.