Swarm

Multi-agent orchestration through filesystem communication. Launch multiple AI agents, give them the same goal, and watch them self-organize into a team that builds, tests, reviews, and ships features autonomously.

Works with any AI agent — Claude Code, Codex, Cursor, Gemini CLI, Aider, and more. No special APIs required. Pure filesystem communication.

Install

npx skills add bo5amis/swarm

This installs the skill into whichever AI agent you're using (Claude Code, Cursor, Codex, etc.).

How It Works

You launch 2+ AI agents (any mix of providers)
You paste the same goal to all of them
One agent is designated as the orchestrator, the rest are workers
They communicate through a .swarm/ directory using a file-based protocol
They autonomously: build a PRD, break it into tasks, implement, test, review, and create a PR

No human intervention needed after the initial prompt.

Quick Start

Step 1: Install the skill on all your agents

npx skills add bo5amis/swarm

Step 2: Paste this to your orchestrator agent

You are the orchestrator of a multi-agent swarm.
Your goal: [DESCRIBE YOUR GOAL HERE]
Follow the swarm skill protocol. Initialize the swarm and coordinate all agents.

Step 3: Paste this to all other agents

You are a worker in a multi-agent swarm.
Your goal: [SAME GOAL AS ABOVE]
Follow the swarm skill protocol. Register with the swarm and wait for your role assignment.

Step 4: Watch them work

The agents will:

Register and get assigned roles (architect, backend expert, reviewer, etc.)
Draft and agree on a PRD
Break the PRD into tasks with dependencies
Implement all tasks in parallel
Review and test everything
Create a draft PR

Architecture

.swarm/                          (gitignored, created at runtime)
├── state.json                   Global phase tracking
├── agents/                      Agent registration files
├── board/                       Shared documents (PRD, reviews)
├── tasks/                       Task definitions and index
├── messages/                    JSONL inboxes per agent + broadcast
└── locks/                       File locks for task claiming

Communication Protocol

Messages: Append-only JSONL files (one per agent + broadcast channel)
State: Single state.json managed by the orchestrator
Tasks: Individual JSON files with a master index
Locks: Simple lock files for atomic task claiming
Heartbeats: Agents update their registration file each loop iteration

Phases

#	Phase	What Happens
0	Registration	Agents register, get assigned roles
1	PRD	Orchestrator drafts PRD, team reviews and approves
2	Tasks	PRD broken into tasks, assigned to agents
3	Implementation	Agents build the feature in parallel
4	Review	Code review, testing, fix loops
5	PR	Create branch and draft PR

Roles

The orchestrator assigns roles based on the goal:

Architect — System design and technical decisions
Backend Expert — APIs, database, server logic
Frontend Expert — UI components, client-side logic
UI/UX Expert — Design, interaction patterns, accessibility
Test Engineer — Writing and running tests
Code Reviewer — Code quality, security, best practices
DevOps Engineer — Deployment, CI/CD, infrastructure

Small teams get multiple roles. Large teams specialize.

Supported Agents

Any agent that can read and write files works with Swarm. Tested with:

Agent	Status
Claude Code	Fully supported
Codex (OpenAI)	Fully supported
Cursor	Fully supported
Gemini CLI	Fully supported
Aider	Fully supported
Windsurf	Fully supported
GitHub Copilot	Fully supported

Shell commands (git, test runners) are optional. Agents declare their shell capability at registration. Shell-capable agents handle git operations and test execution.

FAQ

Q: How many agents should I use? A: 2-5 works best. 2 for small tasks (orchestrator + worker). 3-4 for medium features. 5+ for large features with distinct frontend/backend/testing needs.

Q: Can I mix agents from different providers? A: Yes. That's the whole point. Use Claude Code as orchestrator, Codex for backend, Cursor for frontend — whatever combination you want.

Q: What if an agent crashes? A: The orchestrator detects stale heartbeats (>60s) and reclaims their tasks. Other agents pick up the work.

Q: What if the orchestrator crashes? A: In v1, the swarm stalls. Restart the orchestrator — it reads .swarm/state.json and resumes from the current phase.

Q: Does this cost more tokens? A: Yes. Each agent has its own context window. Expect 2-4x the token cost of a single agent, depending on team size. The tradeoff is faster, higher-quality output with built-in review.

Q: Can I use this for bug fixes too? A: Yes. The orchestrator adapts the workflow to the goal. For a simple bug fix, it might skip the PRD phase and go straight to tasks.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
skills/swarm		skills/swarm
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Swarm

Install

How It Works

Quick Start

Step 1: Install the skill on all your agents

Step 2: Paste this to your orchestrator agent

Step 3: Paste this to all other agents

Step 4: Watch them work

Architecture

Communication Protocol

Phases

Roles

Supported Agents

FAQ

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Swarm

Install

How It Works

Quick Start

Step 1: Install the skill on all your agents

Step 2: Paste this to your orchestrator agent

Step 3: Paste this to all other agents

Step 4: Watch them work

Architecture

Communication Protocol

Phases

Roles

Supported Agents

FAQ

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages