Drop one folder into your project. Claude Code now must plan what every AI feature looks like before writing code — what shows while it's loading, what shows when something goes wrong, how someone using a screen reader experiences it, and how the design matches your other components.
Your design system handles fonts, spacing, and colours. It has no rules for AI. What shows while the AI is thinking? What if someone using a screen reader can't hear the response arrive? What does the input box actually need? None of that is covered.
Streaming looks broken without a cursor. Error is a blank screen. Empty is a void with a text box. Six of the seven states were never designed — they just happened.
A textarea and a div are not AI-native components. Streaming output needs role="log", a blinking cursor, auto-scroll that respects user position, and live markdown rendering.
The AI response arrives word by word. Without the right setup, a screen reader hears nothing. The blinking cursor announces itself on every word unless you explicitly hide it from assistive technology.
A prompt input is not a textarea. It needs auto-resize, Cmd+Enter semantics, a suggestion system, a feedback loop (idle → submitting → streaming → complete), and a token counter.
Rendered markdown without typography is a wall of text. Responses with hedging language look identical to confident responses. There is no progressive disclosure for 2000-word outputs.
#3b82f6 hardcoded in six componentsAI components built under deadline hardcode colour, spacing, and type. Each new component copies the pattern. A rebrand becomes a multi-file migration instead of three token values.
The difference between streaming output that works for everyone and one that doesn't is three settings and a cursor. This pack makes sure they're always there.
Two rules require design work before any code is written. Three rules verify everything is correct before the feature ships.
Each rule requires a specific output with real values. Claude cannot say 'all states are designed' without showing a file where all 7 states are actually written out.
Claude cannot write design/ai-states/document-assistant.md ✓
with real copy for all 7 states without actually doing the state design. That is the enforcement.
Each specialist has a narrow focus. Reviewers give PASS / CONDITIONAL / BLOCK verdicts and write reports to real files — not chat messages.
Plans and builds every AI component from written spec to working code. Writes the spec and state maps before any code. Never hardcodes a colour, size, or spacing value.
Reviews every AI feature against all 5 rules. Writes the report to a real file. Verdict: pass, fix these things, or blocked — never 'looks good.'
Writes exactly what the user sees in all 7 visual states — the words, the transitions, and 3 edge cases — before any component is built.
Designs the input area: how it grows, what suggestions appear, how the visual state changes from idle to sending to receiving to done. Never just a fixed-height text box.
Reviews screen reader support, keyboard navigation, colour contrast, and readability. Writes the report to file. Will not approve streaming output without proper screen reader announcements.
The difference is not catching issues in review. It is the work existing before implementation starts.
#3b82f6 hardcoded in 6 components. Rebrand = migration sprint.role="log" aria-live="polite", streaming cursor aria-hidden, scroll follows user position.Sparse-clone installs only skills/ and .claude/agents/. Non-destructive — won't overwrite existing files.
ai-states-required before writing any component code, then design-before-code to write the spec. Run design-token-audit and accessible-ai-output before any AI component PR merges.
All packs work standalone. All share the same enforcement model. Use one, some, or all.
25 pre-configured engineering specialists, 18 workflow skills, a lead orchestrator, and a Pipeline Auditor. The base layer.
AI UI design enforcement. States, streaming UI, prompt UX, accessibility, and design tokens.
Eval, prompt versioning, fallback design, RAG pipelines, safety review. Three hard gates that block ship.
Product quality gates. PRD, feature scoping, metric definition, research synthesis, A/B test design, AI feature validation.
Growth quality gates. Positioning, copy, funnel analysis, experiments, retention design, AI messaging review.