lucasjinreal/weibo_terminater
A Weibo web scraper that collects posts, comments, and followers to build NLP training corpora.

This repository provides a Python-based scraper for Weibo (Chinese social media platform) that collects posts, comments, user followers, and other content. It is described as an NLP corpus preparation tool, intended to gather text data for natural language processing research. The tool uses Selenium with Firefox geckodriver to navigate Weibo pages and extract content. While the author has moved to autonomous driving work, they maintain the project for corpus collection purposes.