Is raccoon_dataset open source?

Yes — datitran/raccoon_dataset is open source, released under the MIT license.

What language is raccoon_dataset written in?

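GitHub lists Jupyter Notebook as the primary language of datitran/raccoon_dataset; the conversion scripts themselves (generate_tfrecord.py, xml_to_csv.py) are Python.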

How popular is raccoon_dataset?

datitran/raccoon_dataset has 1.3k stars on GitHub.

Where can I find raccoon_dataset?

datitran/raccoon_dataset is on GitHub at https://github.com/datitran/raccoon_dataset.

datitran/raccoon_dataset

200 raccoons walk into a TensorFlow pipeline

A small, scrappy image dataset for anyone who wants to train an object detector without wrangling data formats first.

★ 1.3k stars · Jupyter Notebook · Data Tooling · Computer Vision

What it does

This repo packages 200 raccoon images scraped from Google and Pixabay into a ready-to-ingest bundle for TensorFlow’s Object Detection API. The author used it to train a raccoon detector and wrote about the process on Medium. You get PASCAL VOC annotations, TFRecord generators, CSV converters, and a pre-baked train/validation split (160/40).

The interesting bit

The value isn’t the raccoons; it’s the plumbing. Most object-detection tutorials drown you in format-shuffling. This repo hands you working scripts (generate_tfrecord.py, xml_to_csv.py) and a folder layout that the TensorFlow Object Detection API expects out of the box. It’s a minimum viable dataset, not a benchmark.
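
To make the plumbing concrete, here is a minimal sketch of the XML to CSV step using only Python's standard library. It is not the repo's xml_to_csv.py: the sample annotation, the column order, and the helper names voc_to_rows and rows_to_csv are all assumptions made for illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical annotation in PASCAL VOC layout; the values are made up.
SAMPLE_XML = """<annotation>
  <filename>raccoon-1.jpg</filename>
  <size><width>650</width><height>417</height><depth>3</depth></size>
  <object>
    <name>raccoon</name>
    <bndbox><xmin>81</xmin><ymin>88</ymin><xmax>522</xmax><ymax>408</ymax></bndbox>
  </object>
</annotation>"""

# Assumed column order for the CSV; adjust to whatever your pipeline expects.
COLUMNS = ["filename", "width", "height", "class", "xmin", "ymin", "xmax", "ymax"]


def voc_to_rows(xml_text):
    """Return one dict per <object> element in a PASCAL VOC annotation."""
    root = ET.fromstring(xml_text)
    size = root.find("size")
    rows = []
    for obj in root.findall("object"):
        box = obj.find("bndbox")
        rows.append({
            "filename": root.findtext("filename"),
            "width": int(size.findtext("width")),
            "height": int(size.findtext("height")),
            "class": obj.findtext("name"),
            "xmin": int(box.findtext("xmin")),
            "ymin": int(box.findtext("ymin")),
            "xmax": int(box.findtext("xmax")),
            "ymax": int(box.findtext("ymax")),
        })
    return rows


def rows_to_csv(rows):
    """Serialize rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    print(rows_to_csv(voc_to_rows(SAMPLE_XML)))
```

Each object element becomes one row, so an image with two raccoons yields two rows that share a filename.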

Key highlights

200 annotated images with bounding boxes in PASCAL VOC XML format
Ready-made 80/20 train-validation split
Includes Jupyter notebooks for visualizing boxes and re-splitting labels
Scripts to convert XML → CSV → TFRecord without leaving the repo
Frozen model and pipeline config from the author’s training run included
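
If you want to redo the 80/20 split yourself, a sketch along these lines would work. It splits by image filename, so every box from one image lands on the same side. The function name split_by_image, the seed, and the file names are hypothetical, and the repo's own notebook may split differently.

```python
import random


def split_by_image(filenames, train_fraction=0.8, seed=0):
    """Shuffle unique filenames deterministically and cut them into two lists."""
    unique = sorted(set(filenames))      # one entry per image, stable order
    rng = random.Random(seed)            # local RNG so the split is reproducible
    rng.shuffle(unique)
    cut = int(round(len(unique) * train_fraction))
    return unique[:cut], unique[cut:]


if __name__ == "__main__":
    # Hypothetical file names standing in for the 200 images.
    names = [f"raccoon-{i}.jpg" for i in range(1, 201)]
    train, val = split_by_image(names)
    print(len(train), len(val))  # 160 40
```

Splitting on filenames rather than on annotation rows avoids leaking boxes from one image into both the training and validation sets.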

Caveats

Image provenance is given only as “Google and Pixabay”; per-image licensing is not documented, and image quality is uneven
200 images is tiny by modern standards; expect overfitting without heavy augmentation or pre-trained weights
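
On the augmentation point: any geometric transform has to move the boxes along with the pixels. As a sketch of the simplest case, a horizontal flip mirrors the x coordinates around the image width. The helper name flip_box_horizontally is hypothetical, and the code assumes boxes are (xmin, ymin, xmax, ymax) tuples measured as edge positions from the left and top of the image.

```python
def flip_box_horizontally(box, image_width):
    """Mirror an (xmin, ymin, xmax, ymax) box for a left-right flipped image."""
    xmin, ymin, xmax, ymax = box
    # The old right edge becomes the new left edge, and vice versa.
    return (image_width - xmax, ymin, image_width - xmin, ymax)


if __name__ == "__main__":
    # Made-up box in a 650-pixel-wide image.
    print(flip_box_horizontally((81, 88, 522, 408), 650))  # (128, 88, 569, 408)
```

Flipping twice returns the original box, which makes a handy sanity check when wiring augmentation into a pipeline.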

Verdict

Grab this if you’re learning the TF Object Detection API and want a working end-to-end example that isn’t COCO. Skip it if you need a production-ready detector or care about clean licensing provenance.
