zzy99/epidemic-sentence-pair
A competition-winning solution that uses NLP models to classify whether COVID-19 patient question pairs are semantically similar.

This repository contains the first-place solution for the Tianchi COVID-19 similar sentence pair classification competition. The project applies natural language processing techniques to identify semantically similar patient questions in a medical Q&A context. It employs k-fold cross-validation, binary classification with cross-entropy loss, and model ensembling using pre-trained language models to achieve high accuracy on sentence pair similarity detection.