← all repositories
inception-project/inception

A web-based annotation tool that actually tries to help you

INCEpTION is a collaborative text-annotation platform with built-in recommender systems and knowledge-base integration for NLP and ML workflows.

698 stars Java Data ToolingOther AI
inception
Velocity · 7d
+0.2
★ / day
Trend
steady
star history

What it does

INCEpTION is a web application where multiple users annotate text together for linguistic or machine-learning tasks. It suggests annotations through a recommender system, lets you build corpora by pulling documents from external repositories, and hooks into knowledge bases for entity linking. The project also provides Python notebooks for pre- and post-processing data.

The interesting bit

The recommender system is the hook: instead of just giving you colored boxes to click, it actively proposes annotations to speed up the grind. The knowledge-base integration means you’re not just tagging text in a vacuum—you can link entities to structured data via SPARQL.

Key highlights

  • Web-based multi-user annotation with project isolation
  • Built-in recommender system for annotation suggestions
  • Knowledge-base support for entity linking (SPARQL-backed)
  • Python Jupyter notebooks for data prep and export pipelines
  • Active development with tutorial videos and a live demo server

Caveats

  • The README notes the project is “still in development,” so expect moving parts
  • 699 stars suggests a niche academic/NLP audience rather than broad adoption
  • Java-based and UIMA-aligned, which may feel heavyweight if you’re not already in that ecosystem

Verdict

Worth a look if you’re doing serious NLP annotation at scale and want machine assistance, not just a highlighter tool. Skip it if you need something lightweight or already have a Python-native pipeline you’re happy with.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.