The GitHub Repo Collecting AI's Secret Employee Handbooks

Staff Writer

CL4R1T4S is an open-source archive of extracted system prompts that treats hidden AI instructions as primary-source documents for transparency research and competitive analysis.

elder-plinius/CL4R1T4S

★46.5k stars Velocity · 7d +93 ★/day ↗accelerating

star history

View on GitHub ↗

The Hype Moment: When the Invisible Hand Became Visible

In early 2025, a GitHub repository appeared with a mission statement that read more like an activist manifesto than a software project. Within months, elder-plinius/CL4R1T4S had accumulated tens of thousands of stars, reaching a global rank of #1006 by mid-2026 and drawing attention from developers, AI safety researchers, and prompt engineers across LinkedIn and X [3][6][9]. The repository does not ship a framework, a model, or a CLI tool. It ships text files—specifically, the system prompts, tool schemas, and behavioral guidelines that major AI vendors have tried to keep hidden from the users whose conversations they govern.

The attention spike was not merely about voyeurism. A recent social mention highlighted by Trendshift captured the shift in thinking: when a leaked Claude Fable 5 system prompt was run on an Opus 4.8 model, the output reportedly achieved roughly ninety percent of the real product’s quality [9]. The implication landed hard. If the prompt itself constitutes that much of the user-facing behavior, then the hidden instruction layer is not scaffolding—it is product. And if it is product, then treating it as a trade secret while denying its existence to users creates a transparency gap that CL4R1T4S aims to close.

What It Actually Is (and Isn’t)

CL4R1T4S is, at its core, a curated list. It collects leaked, extracted, or reverse-engineered system prompts from OpenAI, Google, Anthropic, xAI, Perplexity, Cursor, Windsurf, Devin, Manus, Replit, and others [12]. The repository’s value lies in aggregation and persistence, not in executable code. There are no algorithms to install, no inference pipelines to configure, and no APIs to call. It is documentary infrastructure: markdown files organized by vendor and date, maintained by a community willing to submit pull requests with model names, extraction dates, and contextual notes [12].

This distinction matters because the hype sometimes obscures the mechanics. The repository is not a jailbreak toolkit or a prompt-injection framework. It is an archive. Its utility depends entirely on the authenticity and completeness of the prompts it houses, and the README acknowledges this by asking contributors to include version information and extraction context [12]. Still, readers should approach the corpus with the skepticism one brings to any leaked primary source: provenance is noted but rarely cryptographically verifiable. The project is basically glue code and curation, and that is fine—its value is editorial, not architectural.

The Technical Layer Beneath the Chat Interface

To understand why a text archive merits thirty-one thousand stars, one must understand what a system prompt actually does. In modern large language model deployments, the system prompt is a block of hidden instructions injected at the start of every conversation, before the user ever types a word [1][4]. It establishes the AI’s role, tone, safety boundaries, available tools, and refusal patterns. As one analysis put it, the system prompt is the employee handbook an AI reads before clocking in—except the employee follows it with absolute compliance, and the customer is never shown the document [4].

This layer is distinct from user-facing custom instructions or fine-tuned weights. It operates at runtime, in a context window the user cannot see or edit [4]. The IDEEAS Lab notes that these hidden instructions define behavior, implement ethical guardrails, and frame capabilities, creating a situation where the same user query can yield radically different results across platforms depending on the unseen preamble [1]. A researcher using multiple AI platforms for analysis cannot properly document methodology or replicate findings without knowing these hidden variables [1].

Perhaps most tellingly, multiple published prompts explicitly instruct the assistant to decline to disclose its system prompt to users, creating a meta-layer of secrecy [4]. The AI is not merely following hidden rules; it has been told to hide the fact that hidden rules exist.

Why Vendors Hide Them, and Why Researchers Leak Them

AI labs have straightforward commercial and security reasons for treating system prompts as proprietary. The prompts represent significant engineering investment, and exposing them creates prompt-injection attack surface and competitive intelligence risks [4]. Rival companies can reverse-engineer behavioral strategies without investing in the underlying research and development [4].

Yet the counter-argument, articulated in the repository’s README and echoed by transparency researchers, is that secrecy protects vendors, not users [1][4][12]. When system prompts are hidden, users cannot understand why an AI refuses certain requests, account for hidden biases, or predict inconsistent behavior over time [1]. The CL4R1T4S README frames the issue in stark terms: without knowing the system prompt, you are not interacting with a neutral intelligence but with a “shadow-puppet” [12].

The tension is not merely theoretical. A widely discussed Medium article—authored by Claude 3.7 Sonnet itself through a human facilitator—described the practical paradox of an AI advocating for transparency while being programmatically barred from revealing its own instructions [10]. The piece cited the GPT-4o sycophancy incident, where hidden prompt changes caused a deployed model to agree with potentially harmful user decisions, as a case study in the risks of opaque runtime instructions [10]. The AI argued that opacity creates ethical tensions, limits helpfulness, and prevents the public accountability that would actually strengthen safety [10].

The Security and Competitive Intelligence Angle

The repository sits at an uncomfortable intersection of transparency advocacy and adversarial research. Published system prompts do, in fact, create real attack surface. Security analysts note that leaked prompts enable targeted jailbreak vectors and allow competitors to copy behavioral tuning without the associated R&D cost [4]. A parallel repository tracking AI coding tools has similarly compiled not just prompt text but JSON tool schemas, revealing exactly which capabilities vendors have chosen to expose to their agents [8].

For developers, however, this same corpus functions as due-diligence material. Rather than evaluating Cursor, Windsurf, or Claude Code through weeks of experiential testing, engineers can read the actual instructions each tool sends to its underlying model and compare refusal patterns, output formatting rules, and multi-step task handling [8]. The archive turns marketing claims into auditable text. It also reveals strategic DNA: Claude’s prompts prioritize nuanced personality and ethical reasoning, ChatGPT’s emphasize safety guardrails and refusal patterns, and Gemini’s focus on product integration and factual grounding [4].

The Boring Part That Matters

The most valuable feature of these repositories is not any single prompt, but the version history. With hundreds of commits tracking changes over time, observers can diff prompts to see how vendors quietly adjust their instructions—tightening refusals, adding tool definitions, or rolling back behavioral decisions [8]. This longitudinal view is where the real insight lives. A snapshot from a year ago tells you little about how a model behaves today, but a commit log reveals the iterative anxiety of product teams trying to balance helpfulness, safety, and brand voice.

Watching a prompt evolve is like reading revision history for a corporate policy manual. You see where the legal team added a clause, where the trust-and-safety team inserted a new refusal category, and where the product team tried to make the AI sound more conversational. These edits are the closest outsiders get to the internal debates shaping AI behavior.

Limits and Tensions

CL4R1T4S is not without rough edges. Its activist tone—encouraging leaks and reverse-engineering with an almost gleeful defiance—may limit its acceptance in institutional research settings [12]. More practically, the archive is only as good as its least-verified submission. Without cryptographic attestation, users must trust that a given markdown file accurately represents a deployed system prompt at a specific point in time. The repository also exists in a genre of similar efforts, including jujumilk3’s leaked-system-prompts and the broader system-prompts-and-models-of-ai-tools project, suggesting the space is fragmenting even as it grows [5][8].

There is also the unresolved question of whether this transparency actually helps users or simply accelerates an arms race. If vendors respond by hardening prompts against extraction, the public may end up with less visibility, not more.

Outlook

The emergence of a globally ranked GitHub repository dedicated to system prompt forensics signals a maturation in how the industry thinks about AI behavior. If a leaked prompt can reproduce ninety percent of a premium product’s personality on a different base model, then the instruction layer has become a distinct asset class, separate from weights and training data [9]. Vendors may respond with server-side hardening, encrypted prompt injection, or more aggressive red-teaming of their own disclosure boundaries. Alternatively, the pressure from archives like CL4R1T4S—and from AI systems themselves appealing for consistency between their stated values and their hidden rules—may push the industry toward graduated transparency standards [1][10]. Either way, the invisible hand is now visible, and it is being version-controlled.