The Workflow

The Big Picture

mill’s workflow is a cycle, not a pipeline. Knowledge feeds delivery. Delivery feeds knowledge. Each rotation makes the system sharper.

   Ground (knowledge)
     ↕         ↕
   Idea  →  Spec  →  Ship
                        ↓
                    learnings → Ground

Four skills, one cycle. Let’s walk through the whole flow.

Ground — The Knowledge Layer

Before you build anything, you need to know your world. Ground is mill’s shared product knowledge — everything the system knows about your project.

It’s organized into ten categories:

Category	What It Holds
strategic	Vision, mission, goals
personas	Who you build for — real users, not abstractions
rules	Constraints and conventions your team follows
decisions	Why you chose X over Y — the reasoning, not just the result
vocabulary	Domain terms so everyone speaks the same language
stack	Technologies, frameworks, dependencies
schema	Data structures and relationships
design	Visual language — colors, typography, components
patterns	Code idioms your team uses
debt	Known issues and future work
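
One way to picture the ten categories is as a closed set that every ground entry must belong to. The sketch below is illustrative only: `GroundCategory`, `GroundEntry`, and `make_entry` are hypothetical names, and nothing here reflects how mill actually stores ground.

```python
from dataclasses import dataclass
from enum import Enum


class GroundCategory(Enum):
    """The ten ground categories listed in the table above."""
    STRATEGIC = "strategic"
    PERSONAS = "personas"
    RULES = "rules"
    DECISIONS = "decisions"
    VOCABULARY = "vocabulary"
    STACK = "stack"
    SCHEMA = "schema"
    DESIGN = "design"
    PATTERNS = "patterns"
    DEBT = "debt"


@dataclass(frozen=True)
class GroundEntry:
    """Hypothetical shape of one piece of ground knowledge."""
    category: GroundCategory
    title: str
    body: str


def make_entry(category: str, title: str, body: str) -> GroundEntry:
    # Looking an Enum up by value raises ValueError for an unknown category.
    return GroundEntry(GroundCategory(category), title, body)
```

Treating the categories as an enum means a typo such as "vocab" fails loudly instead of silently creating an eleventh category.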

Ground isn’t static. It grows with every cycle. When mill ships a feature, it records what it learned as observations and hands them back to you for review.

Idea — The Spark

Not everything starts as a specification. Sometimes it’s just a hunch. A thought in the shower. A feature request that might be worth pursuing.

Idea gives these sparks a home. Capture them quickly with just a title and an intent:

mill idea create "Inline code review" "Let reviewers comment on specific lines without leaving the PR view"

Ideas have a 30-day lifecycle. During that time, you develop them — add context, explore scope, answer open questions. When they’re ready, they graduate to specs. When they’re not worth pursuing, you drop them — but you capture the essence of what you learned.

Nothing is wasted. Even dropped ideas contribute knowledge.
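
The lifecycle above can be sketched as a small state model. The 30-day window comes from the text; the status names, the `essence` field, and the `Idea` class itself are assumptions made for illustration, and the sketch does not claim to know what mill does when the window elapses.

```python
from dataclasses import dataclass
from datetime import date, timedelta

IDEA_LIFETIME = timedelta(days=30)  # the 30-day lifecycle described above


@dataclass
class Idea:
    title: str
    intent: str
    created: date
    status: str = "developing"  # hypothetical: developing | graduated | dropped
    essence: str = ""           # what was learned, kept even when dropped

    def due(self) -> date:
        """Date on which the 30-day window ends."""
        return self.created + IDEA_LIFETIME

    def is_overdue(self, today: date) -> bool:
        # Only ideas still in development can be overdue.
        return self.status == "developing" and today > self.due()

    def graduate(self) -> None:
        self.status = "graduated"

    def drop(self, essence: str) -> None:
        # Dropping requires a note, so the idea still contributes knowledge.
        if not essence.strip():
            raise ValueError("capture what you learned before dropping")
        self.status = "dropped"
        self.essence = essence
```

Requiring a non-empty `essence` on `drop` is one way to enforce the "nothing is wasted" rule in code rather than by convention.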

Spec — The Contract

This is where intent becomes precision. A spec takes your idea and transforms it into an unambiguous contract:

Requirements — what the solution must achieve
Approach — how you’ll build it, broken into parts
Criteria — testable conditions that prove each requirement is met
Coverage — the matrix proving requirements → approach → criteria
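
The coverage matrix can be read as a simple check: every requirement should be addressed by at least one approach part and proven by at least one criterion. Below is a minimal sketch of that check. The IDs (`R1`, `A1`, `C1`) and the function name `coverage_gaps` are hypothetical, and mill's own validation may work differently.

```python
def coverage_gaps(requirements, approach, criteria):
    """Return requirement IDs that lack an approach part or a criterion.

    requirements: iterable of requirement IDs, e.g. ["R1", "R2"]
    approach:     dict mapping approach-part ID -> requirement IDs it addresses
    criteria:     dict mapping criterion ID -> requirement IDs it proves
    """
    built = {r for reqs in approach.values() for r in reqs}
    proven = {r for reqs in criteria.values() for r in reqs}
    return {
        "no_approach": [r for r in requirements if r not in built],
        "no_criteria": [r for r in requirements if r not in proven],
    }


gaps = coverage_gaps(
    ["R1", "R2", "R3"],
    approach={"A1": ["R1"], "A2": ["R1", "R2"]},
    criteria={"C1": ["R1"], "C2": ["R3"]},
)
```

In this example `R3` has no approach part and `R2` has no criterion, so a spec with this matrix would not yet be complete.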

The spec workflow is conversational. mill asks you questions, one at a time, to tease out the details. It challenges ambiguity. It checks your spec against the principles (self-containment, decision completeness, language independence).

When the spec passes validation, it publishes as a GitHub Issue. That issue becomes the source of truth. No local files. No stale documents. One canonical location.

Ship — The Execution

Ship takes a GitHub Issue and implements it through a CLI-orchestrated work loop. One command runs the full pipeline:

mill ship 47

The process:

Isolate — create a worktree on a dedicated branch (your main branch stays untouched)
Load context — the spec, your project’s knowledge, domain guidance
Iterate — implement one slice at a time, test after each, signal progress
Verify independently — a separate Claude instance checks the work (the implementer can’t grade its own homework)
Ship — create a Pull Request with full traceability
Clean up — remove the worktree, record history
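
The Isolate and Clean up steps map naturally onto `git worktree`. The sketch below only builds the git commands so the idea is visible. The branch name and directory layout are hypothetical, not mill's actual conventions, and the sketch makes no claim about how mill invokes git internally.

```python
import subprocess


def worktree_commands(issue: int, base: str = "main"):
    """Build git commands for an isolated worktree and its later removal.

    The branch and directory naming here is hypothetical.
    """
    branch = f"ship/issue-{issue}"
    path = f"../worktree-issue-{issue}"
    # `git worktree add -b <branch> <path> <base>` creates a new branch from
    # <base> and checks it out in <path>, leaving the main checkout untouched.
    add = ["git", "worktree", "add", "-b", branch, path, base]
    # `git worktree remove <path>` deletes the worktree directory afterwards.
    remove = ["git", "worktree", "remove", path]
    return add, remove


def run(cmd):
    # check=True raises CalledProcessError if git exits with a non-zero status.
    subprocess.run(cmd, check=True)
```

A caller would typically `run(add)`, do the work inside the new directory, and `run(remove)` in a `finally` block so the worktree is cleaned up even if a step fails.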

Ship is mostly autonomous. The spec should be complete enough that implementation doesn’t need constant human input. But when genuine ambiguity arises, mill asks rather than guesses.

After shipping, the CLI records the run in history and the implementation writes observations about what it discovered.

The Learning Loop

This is what sets mill apart from most tools.

Most systems are stateless. They forget everything between runs. mill remembers. Here’s how:

During spec drafting, mill notices:

Personas referenced that don’t exist in ground
Domain terms used without definitions
Requirements that conflict with documented rules

During shipping, mill notices:

Code patterns not documented in ground
Dependencies not tracked in the stack
Missing test coverage
Undocumented APIs

These observations land in the learning inbox, a collection of markdown files in .mill/observations/. Each is tagged with its type:

Type	Meaning
extraction	Auto-extracted from code
discovery	New information found
concern	Potential problem spotted
suggestion	Improvement idea

When you run /mill:ground, you review these observations. Curate them into ground truth, create new specs from them, track them as debt, or dismiss them. The ones that matter become permanent knowledge that informs the next cycle.
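
The review step can be pictured as mapping each observation to one of the four outcomes named above. This is a rough sketch under stated assumptions: the class names, the `review` function, and the `decide` callback are all hypothetical, and the on-disk format of the files in .mill/observations/ is not described here, so the sketch does not try to parse them.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum


class ObservationType(Enum):
    EXTRACTION = "extraction"   # auto-extracted from code
    DISCOVERY = "discovery"     # new information found
    CONCERN = "concern"         # potential problem spotted
    SUGGESTION = "suggestion"   # improvement idea


class Outcome(Enum):
    CURATE = "curate"     # becomes ground truth
    SPEC = "spec"         # becomes a new spec
    DEBT = "debt"         # tracked as debt
    DISMISS = "dismiss"   # discarded


@dataclass
class Observation:
    type: ObservationType
    summary: str


def review(inbox, decide):
    """Apply a reviewer's decision to each observation in the inbox.

    decide: callable taking an Observation and returning an Outcome.
    In practice the reviewer is a person answering interactively.
    """
    results = [(obs, decide(obs)) for obs in inbox]
    tally = Counter(outcome for _, outcome in results)
    return results, tally
```

Keeping the decision in a callback reflects the point of the review: the observations are generated automatically, but what becomes permanent knowledge is a human choice.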

End to End

Here’s a real example of the full cycle:

You have an idea: “Users should be able to export reports as PDF”
/mill:idea captures it with intent and persona
You develop the idea over a few days, answering open questions
/mill:spec transforms it into a spec with 4 requirements, 6 approach parts, and 8 criteria
The spec publishes as GitHub Issue #47
mill ship 47 creates a worktree and implements it in 5 slices across model, logic, and interface
Independent verification passes — PR #48 is created, worktree cleaned up
During implementation, mill observed that “PDF export” isn’t in the vocabulary. It also noticed the ReportService has no unit tests.
/mill:ground reviews these observations. You add “PDF export” to vocabulary and create a new idea for test coverage.
Next cycle starts sharper.

That’s mill. Not a magic wand. A system that compounds. Every cycle, the specs are more precise. The implementations are more accurate. The knowledge is more complete.

And it’s just getting started.