DevClaw — Testing Guide

DevClaw uses Node.js built-in test runner (node:test) with node:assert/strict for all tests.

Quick Start

# Run all tests
npx tsx --test lib/**/*.test.ts

# Run a specific test file
npx tsx --test lib/roles/registry.test.ts

# Run E2E tests only
npx tsx --test lib/services/*.e2e.test.ts

# Build (also type-checks all test files)
npm run build

Test Files

Unit Tests

File	What it tests
lib/roles/registry.test.ts	Role registry: role lookup, level resolution, model defaults
lib/projects/projects.test.ts	Project state: read/write, worker state, atomic file operations
lib/dispatch/bootstrap-hook.test.ts	Bootstrap hook: role instruction loading, source tracking, overloads
lib/dispatch/attachments.test.ts	File attachment handling
lib/tools/tasks/task-set-level.test.ts	Task set level tool: level hints, label detection
lib/tools/tasks/task-edit-body.test.ts	Task edit body tool: title/body editing, state guards
lib/tools/tasks/research-task.test.ts	Research task tool: architect dispatch
lib/tools/admin/queue-status.test.ts	Queue status formatting
lib/tools/worker/group-isolation.test.ts	Worker group isolation
lib/services/heartbeat/health.test.ts	Heartbeat health pass: zombie detection, stale workers
lib/providers/provider-pr-status.test.ts	PR status polling and detection
lib/setup/version.test.ts	Version tracking and comparison
lib/setup/workspace.test.ts	Workspace initialization and write-once defaults
lib/setup/migrate-layout.test.ts	Workspace layout migration: `projects/` → `devclaw/`
lib/names.test.ts	Deterministic name generation
lib/instance.test.ts	Instance name management

E2E Tests

File	What it tests
lib/services/pipeline.e2e.test.ts	Full pipeline: completion rules, label transitions, actions
lib/services/bootstrap.e2e.test.ts	Bootstrap hook chain: session key → parse → load instructions → inject

Test Infrastructure

Test Harness (`lib/testing/`)

The lib/testing/ module provides E2E test infrastructure:

import { createTestHarness } from "../testing/index.js";

const h = await createTestHarness({
  projectName: "my-project",
  groupId: "-1234567890",
  workflow: DEFAULT_WORKFLOW,
  workers: {
    developer: { active: true, issueId: "42", level: "medior" },
  },
});
try {
  // ... run tests against h.provider, h.commands, etc.
} finally {
  await h.cleanup();
}

createTestHarness() scaffolds:

Temporary workspace directory with devclaw/ data dir and log/ subdirectory
projects.json with test project and configurable worker state
Mock runCommand via CommandInterceptor (captures all CLI calls)
TestProvider — in-memory IssueProvider with call tracking

TestProvider

In-memory implementation of IssueProvider for testing. Tracks all provider method calls and maintains in-memory issue state:

const h = await createTestHarness();
h.provider.seedIssue(42, {
  title: "Fix the bug",
  labels: ["Doing"],
  state: "open",
});

// After running pipeline code:
const calls = h.provider.calls;  // All method invocations

CommandInterceptor

Captures all runCommand calls during tests. Provides filtering and extraction helpers:

// All captured commands
h.commands.commands;

// Filter by command name
h.commands.commandsFor("openclaw");

// Extract task messages dispatched to workers
h.commands.taskMessages();

// Extract session creation patches
h.commands.sessionPatches();

// Reset between test cases
h.commands.reset();

simulateBootstrap

Tests the full bootstrap hook chain without a live OpenClaw gateway:

// Write a project-specific prompt
await h.writePrompt("developer", "Custom dev instructions", "my-project");

// Simulate bootstrap for a developer session
const files = await h.simulateBootstrap(
  "agent:orchestrator:subagent:my-project-developer-medior"
);

// Verify injected bootstrap files
assert.strictEqual(files.length, 1);
assert.strictEqual(files[0].content, "Custom dev instructions");

Writing Tests

Pattern: Unit Test

import { describe, it } from "node:test";
import assert from "node:assert/strict";

describe("my feature", () => {
  it("should do something", () => {
    const result = myFunction("input");
    assert.strictEqual(result, "expected");
  });
});

Pattern: E2E Pipeline Test

import { describe, it, afterEach } from "node:test";
import assert from "node:assert/strict";
import { createTestHarness, type TestHarness } from "../testing/index.js";
import { executeCompletion } from "./pipeline.js";

describe("pipeline completion", () => {
  let h: TestHarness;

  afterEach(async () => {
    if (h) await h.cleanup();
  });

  it("developer:done transitions Doing → To Review", async () => {
    h = await createTestHarness({
      workers: {
        developer: { active: true, issueId: "42", level: "medior" },
      },
    });
    h.provider.seedIssue(42, { labels: ["Doing"], state: "open" });

    const result = await executeCompletion({
      workspaceDir: h.workspaceDir,
      groupId: h.groupId,
      project: h.project,
      workflow: h.workflow,
      provider: h.provider,
      role: "developer",
      result: "done",
    });

    assert.strictEqual(result.rule.to, "To Review");
  });
});

Pattern: Bootstrap Hook Test

import { describe, it, afterEach } from "node:test";
import assert from "node:assert/strict";
import { createTestHarness, type TestHarness } from "../testing/index.js";

describe("bootstrap instructions", () => {
  let h: TestHarness;

  afterEach(async () => {
    if (h) await h.cleanup();
  });

  it("injects project-specific prompt for developer", async () => {
    h = await createTestHarness({ projectName: "webapp" });
    await h.writePrompt("developer", "Build with React", "webapp");

    const files = await h.simulateBootstrap(
      "agent:orchestrator:subagent:webapp-developer-medior"
    );

    assert.strictEqual(files.length, 1);
    assert.ok(files[0].content?.includes("React"));
  });
});

CI/CD Integration

GitHub Actions

name: Test
on: [push, pull_request]
jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version: 20
      - run: npm ci
      - run: npm run build
      - run: npx tsx --test lib/**/*.test.ts

GitLab CI

test:
  image: node:20
  script:
    - npm ci
    - npm run build
    - npx tsx --test lib/**/*.test.ts

Debugging Tests

Run specific test

# Run by file
npx tsx --test lib/roles/registry.test.ts

# Run by name pattern
npx tsx --test --test-name-pattern "should have all expected roles" lib/**/*.test.ts

Debug with Node inspector

node --inspect-brk node_modules/.bin/tsx --test lib/roles/registry.test.ts

Then open Chrome DevTools at chrome://inspect.

Best Practices

Use node:test + node:assert/strict — no test framework dependencies
Use createTestHarness() for any test that needs workspace state, providers, or command interception
Always call h.cleanup() in afterEach to remove temp directories
Seed provider state with h.provider.seedIssue() before testing pipeline operations
Use h.commands to verify what CLI commands were dispatched without actually running them
One assertion focus per test — test one behavior, not the whole pipeline
Test error cases — invalid roles, missing projects, bad state transitions

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DevClaw — Testing Guide

Quick Start

Test Files

Unit Tests

E2E Tests

Test Infrastructure

Test Harness (`lib/testing/`)

TestProvider

CommandInterceptor

simulateBootstrap

Writing Tests

Pattern: Unit Test

Pattern: E2E Pipeline Test

Pattern: Bootstrap Hook Test

CI/CD Integration

GitHub Actions

GitLab CI

Debugging Tests

Run specific test

Debug with Node inspector

Best Practices

FilesExpand file tree

TESTING.md

Latest commit

History

TESTING.md

File metadata and controls

DevClaw — Testing Guide

Quick Start

Test Files

Unit Tests

E2E Tests

Test Infrastructure

Test Harness (lib/testing/)

TestProvider

CommandInterceptor

simulateBootstrap

Writing Tests

Pattern: Unit Test

Pattern: E2E Pipeline Test

Pattern: Bootstrap Hook Test

CI/CD Integration

GitHub Actions

GitLab CI

Debugging Tests

Run specific test

Debug with Node inspector

Best Practices

Test Harness (`lib/testing/`)