dmis-lab/biobert
A BERT-based language model pre-trained on biomedical corpora (PubMed and PMC) for biomedical text mining tasks.

Velocity · 7d
+0.8
★ / day
Trend
→steady
star history
BioBERT provides pre-trained language representation models specialized for the biomedical domain, based on Google’s original BERT architecture. It offers multiple model sizes (base and large) with checkpoints trained on large biomedical corpora including PubMed abstracts and PMC full-text articles. The models can be fine-tuned for downstream biomedical NLP tasks including named entity recognition, relation extraction, and question answering.