When AI Agents Need a Style Guide, Not Just Tokens

DESIGN.md is an open-source format that marries machine-readable tokens with human-readable rationale so coding agents can generate interfaces that match a brand's intent rather than its guess.
The Hype: From Google Labs to the Enterprise Stage
Every few months, a repository escapes the Google Labs incubator and immediately attracts enterprise attention. DESIGN.md is the latest. Originally created for Stitch, an internal AI UI design tool, the draft specification was open-sourced by Google and announced by software engineer Cassia Xu with a demonstration from Google Labs’ David East [6]. The format is deliberately simple: a single Markdown file that encodes both the arithmetic of a design system—colors, type scales, spacing—and the narrative logic behind them. Within weeks, Atlassian had generated its own file from its structured content pipeline and used it during the Team ‘26 keynote to spin up custom dashboards through Figma Make [9]. Meanwhile, a third-party catalog called getdesign.md began offering over three hundred pre-analyzed design systems derived from real websites, from Apple’s white-space minimalism to Nintendo’s early-2000s console chrome [3].

The timing is not accidental. The industry has spent a decade refining framework-specific component libraries and token-driven themes, an architecture that works until the framework changes or the build process calcifies [1]. Simultaneously, the act of writing code is shifting; by 2026, engineers are expected to spend more time reviewing agent-generated output than typing implementations by hand [5]. In that environment, a portable, human-and-machine-readable design contract starts to look like infrastructure rather than documentation.
The Core Idea: Intent, Not Just Values
What separates DESIGN.md from a Tailwind config or a W3C Design Tokens Community Group JSON file is its insistence on prose. The format is bisected into a machine-readable YAML front matter section—exact hex codes, font stacks, spacing increments—and a Markdown body that explains why those values exist [9]. An agent reading the file does not merely learn a primary hex code; it learns the semantic role of that color—whether it anchors headlines, signals interactivity, or provides a neutral canvas—and the aesthetic philosophy that binds the choices together [6][9]. The tokens are normative; the prose is instructional.
This matters because large language models are pattern-completion engines that excel at syntax and fail at inference when prompts are ambiguous [8]. The practice of “vibe coding”—describing a desired outcome in natural language and hoping the agent hallucinates the right structure—produces inconsistent results at scale [4][8]. DESIGN.md is essentially an anti-vibe device. It treats the specification as the primary artifact and the generated interface as a secondary, verifiable output, a philosophy that aligns with the broader Spec-Driven Development movement [8]. By giving agents a persistent, structured understanding of a design system, the file turns brand identity from a vague prompt into a compile-time constraint.
The specification is opinionated about structure. Sections must appear in a fixed order—Overview, Colors, Typography, Layout, and so on—and the linter enforces rules such as broken token references, orphaned colors, and WCAG contrast ratios [6]. The accompanying tooling can diff two versions of a system to catch regressions, or export the token layer to Tailwind v3, Tailwind v4, or the W3C DTCG format [6]. These characteristics position the format as a bridge between design tools and agentic coding pipelines, not merely a style guide.
The Agentic Infrastructure Gap
Coding agents do not possess the tacit, “vibe-based” pattern recognition of human developers [5]. They require explicit instructions on separation of concerns, naming conventions, and fundamental principles like DRY [5]. When an engineer builds a forty-thousand-line SaaS platform using an AI agent, the critical investment is not the prompts but the top-down documentation generated before any code appears [2]. DESIGN.md applies that same discipline to visual design. It functions as a coding standard for an agent’s aesthetic output, specifying not only which values to use but how to reason about them.
The format also sits within a growing ecosystem of agent specifications. The U.S. General Services Administration, for example, maintains a template for AI agent mandates that includes organizational placement, core responsibilities, and non-negotiable strategic objectives [11]. DESIGN.md is narrower—it cares only about visual identity—but it shares the same premise: agents need contracts, not suggestions. As design systems evolve from static libraries into generative substrates, the bottleneck shifts from component availability to intent communication [1]. A flat variable file cannot communicate intent; a structured Markdown file with enforced prose sections can.
The Limits: A Flat File in a Live System
For all its theoretical elegance, DESIGN.md has already run into the wall that separates prototyping from production. Atlassian’s experiments provide the most rigorous public test. When used for one-shot prototypes—generating a dashboard through Figma Make or customizing a Tailwind and Shadcn library—the format performed well, shifting output from generic UI to something recognizably Atlassian in color, spacing, and elevation [9].
In production environments, however, the picture darkened. Against Atlassian’s own integrated MCP server and agent skills, a DESIGN.md file required approximately ninety-two percent more tokens to process, took longer to execute, and exhibited roughly 2.7 times the variance in token consumption between runs [9]. The reason is straightforward: a portable Markdown snapshot, however well-structured, cannot replicate a live context engine connected to structured content models, internal component libraries, and strict coding standards. It captures design intent, but not the full technical specifications, linters, or Figma details that an enterprise agent needs to ship code [9].
The format is also still at version alpha, with an evolving token schema and a rigid section order that may prove too constraining for complex design systems [6]. Unknown section headings are preserved rather than rejected, which is forgiving, but duplicate headings trigger a hard error—a choice that suggests the spec is optimizing for machine parsing over human flexibility [6].
Outlook: Specifications as the Primary Artifact
DESIGN.md arrives at a moment when software engineering is renegotiating the relationship between specification and implementation. The Spec-Driven Development framework argues that code should be the implementation detail of the specification, not the reverse [8]. DESIGN.md is a small, focused experiment in that inversion: it asks whether a Markdown file can become the authoritative source of truth for how an interface should look and feel.
The early answer is partial. The ecosystem around the format—catalogs, linters, export adapters—suggests genuine demand for a portable design contract [3][6]. Yet Atlassian’s findings suggest that in high-scale production, the future likely belongs not to flat files but to context engines that can ingest, extend, and serve DESIGN.md as one layer among many [9]. The unresolved tension is between portability and depth. A file you can email to an external contractor is valuable; a file that requires ninety-two percent more tokens than an integrated MCP server is a bottleneck.
If agentic coding becomes the default modality, as several industry observers predict [5], the scarce resource will not be implementation speed but clarity of intent. DESIGN.md’s real contribution may not be its specific schema or its CLI, but the proof that design systems need narrative. Tokens tell an agent what to render; prose tells it why. In an era where AI writes the code, the humans may end up writing the style guide—and DESIGN.md is one of the first formal attempts to make that guide executable.
Sources
- How AI-integrated design systems and generative component ...
- Coding Standards for AI Agents
- getdesign.md — DESIGN.md collection for AI coding agents
- Best Practices I Learned for AI Assisted Coding | by Claire Longo
- Building shared coding guidelines for AI (and people too)
- Stitch app's DESIGN.md format is now open-source for designers
- How are you using AI tools alongside your own design system?
- Spec-Driven Development: From Code to Contract in the ...
- Atlassian's DESIGN.md is here: what we learned testing portable ...
- 15 of the best AI coding tools and assistants for developers - Figma
- AI Agent Specification Template.md
- DesignMD: DESIGN.md catalog for AI agents