← all repositories

zzy99/epidemic-sentence-pair

A competition-winning solution that uses NLP models to classify whether COVID-19 patient question pairs are semantically similar.

435 stars Python Domain AppsLanguage Models
epidemic-sentence-pair
Velocity · 7d
+0.2
★ / day
Trend
steady
star history

This repository contains the first-place solution for the Tianchi COVID-19 similar sentence pair classification competition. The project applies natural language processing techniques to identify semantically similar patient questions in a medical Q&A context. It employs k-fold cross-validation, binary classification with cross-entropy loss, and model ensembling using pre-trained language models to achieve high accuracy on sentence pair similarity detection.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.