← all repositories

alex000kim/nsfw_data_scraper

Shell scripts that scrape and organize images for training an NSFW content classifier using deep learning.

12.6k stars Shell Data Tooling
nsfw_data_scraper
Velocity · 7d
+4.6
★ / day
Trend
steady
star history

This repository provides a set of shell scripts for automatically collecting tens of thousands of images across five categories (porn, hentai, sexy, neutral, and drawings) from sources like Reddit and public datasets. The collected images are organized into train and test directories suitable for training a deep learning image classifier. It uses the Ripme tool to scrape albums and integrates with datasets like Danbooru2018 and Caltech256.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.