Назад към всички

war-room

// Multi-LLM deliberation for strategic decisions via expert pressure-testing and consensus building. Use for critical, irreversible, or high-stakes architecture choices and conflicts. Skip for trivial or reversible decisions.

$ git log --oneline --stat
stars:201
forks:38
updated:March 4, 2026
SKILL.mdreadonly
SKILL.md Frontmatter
namewar-room
descriptionMulti-LLM deliberation for high-stakes decisions via pressure-testing. Use for critical choices. Skip for reversible ones.
model_preferenceclaude-opus-4
categorystrategic-planning
tagsdeliberation,multi-llm,strategy,decision-making,council,reversibility
complexityadvanced
estimated_tokens2500
progressive_loadingtrue
modulesmodules/reversibility-assessment.md,modules/expert-roles.md,modules/deliberation-protocol.md,modules/merkle-dag.md,modules/discussion-publishing.md,modules/deferred-capture.md
dependenciesconjure:delegation-core,memory-palace:strategeion,leyline:git-platform
toolsBash,Read,Write

Table of Contents

War Room Skill

Orchestrate multi-LLM deliberation for complex strategic decisions.

Overview

The War Room convenes multiple AI experts to analyze problems from diverse perspectives, challenge assumptions through adversarial review, and synthesize optimal approaches under the guidance of a Supreme Commander.

Philosophy

"The trick is that there is no trick. The power of intelligence stems from our vast diversity, not from any single, perfect principle."

  • Marvin Minsky, Society of Mind

Reversibility-Based Routing

Before deliberation, assess the Reversibility Score (RS) to determine appropriate resource allocation:

RS = (Reversal Cost + Time Lock-In + Blast Radius + Information Loss + Reputation Impact) / 25
RS RangeTypeModeResources
0.04 - 0.40Type 2Express1 expert, < 2 min
0.41 - 0.60Type 1BLightweight3 experts, 5-10 min
0.61 - 0.80Type 1AFull Council7 experts, 15-30 min
0.81 - 1.00Type 1A+Delphi7 experts, 30-60 min

Quick Heuristics:

  • Can be A/B tested? → Type 2
  • Requires data migration? → Type 1
  • Public commitment required? → Type 1A+

See modules/reversibility-assessment.md for full scoring guide.

When To Use

  • Architectural decisions with major trade-offs
  • Multi-stakeholder problems requiring diverse perspectives
  • High-stakes choices with significant consequences (RS > 0.60)
  • Novel problems without clear precedent
  • When brainstorming produces multiple strong competing approaches

When NOT To Use

  • Simple questions with obvious answers
  • Routine implementation tasks
  • Well-documented patterns with clear solutions
  • Time-critical decisions requiring immediate action
  • Type 2 decisions (RS ≤ 0.40) — use Express mode or skip War Room entirely

Expert Panel

Default (Lightweight Mode)

RoleModelPurpose
Supreme CommanderClaude OpusFinal synthesis, escalation decisions
Chief StrategistClaude SonnetApproach generation, trade-off analysis
Red TeamGemini FlashAdversarial challenge, failure modes

Full Council (Escalated)

RoleModelPurpose
Supreme CommanderClaude OpusFinal synthesis
Chief StrategistClaude SonnetApproach generation
Intelligence OfficerGemini 2.5 ProLarge context analysis (1M+)
Field TacticianGLM-4.7Implementation feasibility
ScoutQwen TurboQuick data gathering
Red Team CommanderGemini FlashAdversarial challenge
Logistics OfficerQwen MaxResource estimation

Deliberation Protocol

Two-Round Default

Round 1: Generation
  - Phase 1: Intelligence Gathering (Scout, Intel Officer)
  - Phase 2: Situation Assessment (Chief Strategist)
  - Phase 3: COA Development (Multiple experts, parallel)
  - Commander Escalation Check

Round 2: Pressure Testing
  - Phase 4: Red Team Review (all COAs)
  - Phase 5: Voting + Narrowing (top 2-3)
  - Phase 6: Premortem Analysis (selected COA)
  - Phase 7: Supreme Commander Synthesis
  - Phase 8: Discussion Publishing

Delphi Extension (High-Stakes)

For high-stakes decisions, extend to iterative Delphi convergence:

  • Multiple rounds until expert consensus
  • Convergence threshold: 0.85

Integration

With Brainstorm

War Room is AUTOMATICALLY INVOKED from Skill(attune:project-brainstorming) after Phase 3 (Approach Generation).

The brainstorm skill passes all context to War Room:

  • Problem statement and constraints
  • Generated approaches with pros/cons
  • Comparison matrix
  • Reversibility assessment (automatically calculated)

Bypass conditions (only if ALL true):

  • RS ≤ 0.40 (Type 2 decision - clearly reversible)
  • Single obvious approach with no meaningful trade-offs
  • Low complexity with well-documented pattern
  • User explicitly declines after seeing RS assessment
# Automatic invocation from brainstorm (do not skip)
/attune:war-room --from-brainstorm

# Direct invocation (standalone)
/attune:war-room "Should we use microservices or monolith for this system?"

With Memory Palace

Sessions persist to the Strategeion (War Palace):

~/.claude/memory-palace/strategeion/
  - war-table/      # Active sessions
  - campaign-archive/  # Historical decisions
  - doctrine/       # Learned patterns
  - armory/         # Expert configurations

With Conjure

Experts are invoked via conjure delegation:

  • conjure:gemini-delegation for Gemini models
  • conjure:qwen-delegation for Qwen models
  • Direct CLI for GLM-4.7 (ccgd or claude-glm --dangerously-skip-permissions)

Usage

Basic Invocation

/attune:war-room "What architecture should we use for the new payment system?"

With Context

/attune:war-room "Best approach for API versioning" --files src/api/**/*.py

Reversibility Assessment Only

Quick assessment without full deliberation:

/attune:war-room "Database migration to MongoDB" --assess-only

Output:

Reversibility Assessment
========================
Decision: Database migration to MongoDB

Dimensions:
  Reversal Cost:      5/5 (months of rework)
  Time Lock-In:       4/5 (migration path hardens)
  Blast Radius:       5/5 (all services affected)
  Information Loss:   4/5 (query patterns, ACID)
  Reputation Impact:  2/5 (internal unless downtime)

Reversibility Score: 0.80
Decision Type: Type 1A (One-Way Door)
Recommended Mode: Full Council

Proceed with full deliberation? [Y/n]

Force Express Mode (Type 2)

Skip to rapid decision for clearly reversible choices:

/attune:war-room "Which logging library to use" --express

Force Full Council

Override RS assessment for critical decisions:

/attune:war-room "Migration strategy" --full-council

Delphi Mode

For highest-stakes irreversible decisions:

/attune:war-room "Long-term platform decision" --delphi

Resume Session

/attune:war-room --resume war-room-20260120-153022

Output

Decision Document

The War Room produces a Supreme Commander Decision document:

## SUPREME COMMANDER DECISION: {session_id}

### Reversibility Assessment
| Dimension | Score | Rationale |
|-----------|-------|-----------|
| Reversal Cost | X/5 | ... |
| Time Lock-In | X/5 | ... |
| Blast Radius | X/5 | ... |
| Information Loss | X/5 | ... |
| Reputation Impact | X/5 | ... |

**RS: 0.XX | Type: [1A+/1A/1B/2] | Mode: [delphi/full_council/lightweight/express]**

### Decision
**Selected Approach**: [Name]

### Rationale
[Why this approach was selected]

### Implementation Orders
1. [ ] Immediate actions
2. [ ] Short-term actions

### Watch Points
[From Premortem - what to monitor]

### Reversal Plan (for Type 1 decisions)
[If this decision proves wrong, here's the exit strategy]

### Dissenting Views
[For the record]

Session Artifacts

Saved to Strategeion:

  • Intelligence reports
  • Situation assessment
  • All COAs (with full attribution after unsealing)
  • Red Team challenges
  • Premortem analysis
  • Final decision

Anonymization

Expert contributions are anonymized during deliberation using Merkle-DAG:

  • Responses labeled as "Response A, B, C..." during review
  • Attribution revealed only after decision is made
  • Hash verification ensures integrity

See modules/merkle-dag.md for details.

Escalation

Automatic (Reversibility-Based)

Deliberation mode is automatically selected based on Reversibility Score:

RS ScoreAutomatic Mode
≤ 0.40Express (bypass full War Room)
0.41 - 0.60Lightweight panel
0.61 - 0.80Full Council
> 0.80Full Council + Delphi

Manual Override

The Supreme Commander may override automatic classification when:

  • High complexity detected (multiple architectural trade-offs)
  • Significant disagreement between initial experts
  • Novel problem domain requiring specialized analysis
  • Precedent-setting decision (future decisions will follow pattern)
  • Political/organizational sensitivity beyond technical scope

Escalation requires written justification with RS assessment.

De-escalation

Equally important: identify decisions being over-deliberated:

  • If RS ≤ 0.40, recommend Express mode or immediate execution
  • Challenge "false irreversibility" ("we can't change this later" without evidence)
  • Track de-escalation rate as team health metric

Configuration

User Settings

{
  "war_room": {
    "default_mode": "lightweight",
    "auto_escalate": true,
    "delphi_threshold": 0.85,
    "max_delphi_rounds": 5
  }
}

Hook Auto-Trigger

War Room can be auto-suggested via hook when:

  • Keywords detected ("strategic decision", "trade-off", etc.)
  • Complexity score exceeds threshold (0.7)
  • User has opted in via settings

Agent Teams Execution Mode

Overview

When --agent-teams is specified (or auto-selected for Full Council / Delphi modes), the War Room uses Claude Code Agent Teams instead of sequential conjure delegation. Each expert runs as a persistent teammate with bidirectional messaging, enabling real-time deliberation instead of batch request/response cycles.

Requires: Claude Code 2.1.32+, CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1, tmux installed.

When Agent Teams Helps

ModeWithout Agent TeamsWith Agent TeamsBenefit
ExpressSonnet direct callN/A (overkill)None — skip
Lightweight3 sequential delegationsN/A (overhead exceeds benefit)None — skip
Full Council7 sequential/parallel delegations7 teammates with live inbox messagingExperts can react to each other's COAs in real-time
DelphiMultiple delegation roundsPersistent team iterates until convergenceNo re-invocation cost per round; state preserved across rounds

Rule of thumb: Use agent teams only for Full Council and Delphi modes. Lightweight and Express modes don't generate enough inter-expert traffic to justify the coordination overhead.

Team Configuration

# War Room agent team structure
Team: war-room-{session-id}
  Lead: supreme-commander (Opus) — orchestrates phases, final synthesis
  Teammates:
    chief-strategist (Sonnet) — approach generation
    intel-officer (Sonnet) — deep context analysis
    field-tactician (Sonnet) — implementation feasibility
    scout (Haiku) — rapid reconnaissance
    red-team (Sonnet) — adversarial challenge
    logistics (Haiku) — resource estimation

Note: In agent teams mode, all teammates run as Claude Code instances (Opus/Sonnet/Haiku). External LLM experts (Gemini, Qwen, GLM) are not used because agent teams requires the Claude CLI. The trade-off is losing model diversity but gaining real-time inter-expert messaging.

Deliberation Flow with Agent Teams

  1. Lead creates team → spawns teammates in tmux panes
  2. Phase 1 (Intel): Lead assigns intel tasks to scout + intel-officer via inbox
  3. Phase 3 (COA): Lead broadcasts situation assessment; teammates develop COAs independently; messaging allows clarifying questions mid-development
  4. Phase 4 (Red Team): Red-team teammate receives all COAs, posts challenges; other teammates can respond to challenges in real-time
  5. Phase 5 (Voting): Lead broadcasts ballot; teammates rank via inbox messages
  6. Phase 6 (Premortem): All teammates receive selected COA; can build on each other's failure scenarios
  7. Phase 7 (Synthesis): Lead collects all artifacts, produces decision
  8. Phase 8 (Discussion Publishing): After the Supreme Commander Decision document is finalized, follow modules/discussion-publishing.md to publish the decision to GitHub Discussions. The user can decline, but publishing is the default. If the user declines or publishing fails, the local strategeion record is unaffected.

Falling Back to Conjure Delegation

If agent teams fails (tmux unavailable, team creation error), the War Room automatically falls back to standard conjure delegation. The deliberation protocol is identical — only the execution backend differs.

Cost Considerations

Agent teams is significantly more token-intensive than conjure delegation (each teammate maintains its own context window). Use only when the coordination value justifies the cost — typically Delphi mode where multiple rounds of revision make persistent teammates worthwhile.

Discussion Publishing

After Phase 7 synthesis completes (in any execution mode), invoke the modules/discussion-publishing.md workflow. Publishing is the default action. This module:

  1. Notifies the user that the decision will be published (they can opt out with "n")
  2. Creates a Discussion in the "Decisions" category
  3. Posts deliberation phases as threaded comments
  4. Updates the local strategeion with the Discussion URL

If GitHub Discussions are unavailable (non-GitHub platform, Discussions disabled, gh not authenticated), the module skips silently. Publishing failures never block the war room workflow.

Related Skills

  • Skill(attune:project-brainstorming) - Pre-War Room ideation
  • Skill(imbue:scope-guard) - Scope management
  • Skill(imbue:rigorous-reasoning) - Reasoning methodology
  • Skill(conjure:delegation-core) - Expert dispatch
  • Skill(conjure:agent-teams) - Agent teams coordination (Full Council / Delphi)

Related Commands

  • /attune:war-room - Invoke this skill
  • /attune:brainstorm - Pre-War Room ideation
  • /memory-palace:strategeion - Access War Room history

References

Strategic Foundations

  • Sun Tzu - Art of War (intelligence gathering)
  • Clausewitz - On War (friction and fog)
  • Robert Greene - 33 Strategies of War (unity of command)
  • MDMP - U.S. Army (structured decision process)
  • Gary Klein - Premortem (failure mode analysis)
  • Karpathy - LLM Council (anonymized peer review)

Reversibility Framework