← all repositories

raznem/parsera

Python library that scrapes websites by using LLMs to extract structured data from web pages.

1.3k stars Python Data Tooling
parsera
Velocity · 7d
+1.9
★ / day
Trend
steady
star history

Parsera is a lightweight web scraping library that leverages LLMs to extract structured data from websites. It uses an LLM-powered approach instead of traditional HTML parsing, allowing users to define elements they want extracted and receive structured JSON output. The library integrates with Playwright for page rendering and supports custom LLM models.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.