T-021: Add multi-execution support to exec command #125

ali90h · 2025-09-13T15:12:16Z

🚀 T-021: Multi-execution (exec --all/--indexes/--until-success) + JSONL Summary

This PR implements comprehensive multi-execution support for the exec command, enabling users to execute multiple candidate commands from a plan in a single invocation with structured logging and flexible execution control.

✨ New Features

Multi-execution Flags

--all: Execute all candidate commands in their original order
--indexes "N,M-P": Execute specific commands by indices/ranges (comma-separated)
--until-success: Stop after the first command that exits with code 0
--summary FILE.json: Write final execution summary to JSON file

JSONL Streaming Output

Run records: One JSONL record per executed command with timing, exit codes, and metadata
Summary record: Final JSONL record with execution statistics
Structured logging: Complete audit trail of all executions

Command Selection Logic

Precedence handling: --indexes > --all > single --index
Range parsing: Support for ranges like "0,2-4,6" with validation
Error handling: Clear error messages for invalid configurations

🔧 Implementation Details

Enhanced CLI Interface

Extended ExecConfig with new multi-execution fields
Updated argument parsing with comprehensive validation
Maintained backward compatibility with existing single command execution

JSONL Output Schema

{"type": "run", "index": 0, "cmd": "pytest", "start_ts": "2025-09-13T12:00:00Z", "end_ts": "2025-09-13T12:00:05Z", "exit_code": 1, "duration_ms": 5000}
{"type": "summary", "schema_version": 1, "tool": "autorepro", "runs": 2, "successes": 1, "first_success_index": 1}

Execution Pipeline

Single execution: Maintains existing behavior for backward compatibility
Multi-execution: New pipeline with JSONL streaming and summary generation
Environment handling: Proper setup and cleanup for each command execution

🧪 Testing & Quality

Comprehensive Test Suite

19 unit tests covering all new functionality
Config validation tests: Index parsing, mutual exclusivity, error handling
Command selection tests: Single/all/indexes selection, precedence rules
JSONL record tests: Run records, summary records, field validation
Integration tests: Multi-execution scenarios with proper mocking

Manual Testing

All acceptance criteria from issue T-021: Multi-execution (exec --all/--indexes/--until-success) + JSONL Summary #111 validated
Demo plugin with deterministic fail/success/fail pattern
JSONL output format verification
Exit code behavior confirmation

📚 Documentation

Updated README.md

New dedicated "Exec Command (Multi-execution Support)" section
Comprehensive usage examples for all new flags
JSONL output format specification
Multi-execution features explanation
Complete option reference and exit behavior documentation

CLI Help Text

Clear descriptions for all new flags
Usage examples and parameter explanations
Comprehensive option reference

🔄 Backward Compatibility

Single command execution: Unchanged behavior and interface
Existing flags: All existing functionality preserved
Exit codes: Consistent with existing behavior
Configuration: No breaking changes to existing config options

🎯 Acceptance Criteria Met

✅ --all executes all candidate commands in original order
✅ --until-success stops immediately after first successful command
✅ --indexes accepts single indices and ranges with proper validation
✅ --summary writes JSON file matching final JSONL summary line
✅ JSONL output streams per-run and summary records in order
✅ Backward compatibility maintained (no JSONL emitted without --jsonl)
✅ Invalid indices result in clear error messages and no execution
✅ Command precedence properly implemented and tested

🔗 Related Issues

fix #111

📊 Files Changed

autorepro/cli.py: Core multi-execution implementation
tests/test_exec_multi.py: Comprehensive test suite (new)
README.md: Updated documentation
Cleanup: Removed temporary test files

🚀 Ready for Review

This implementation is production-ready with comprehensive testing, documentation, and maintains full backward compatibility. All linting checks pass and the code follows project standards.

- Add --all, --indexes, --until-success, --summary flags to exec command - Implement JSONL streaming output with run records and summary - Add comprehensive command selection logic with precedence handling - Support index ranges (e.g., '0,2-3') and validation - Maintain backward compatibility with single command execution - Add 19 comprehensive unit tests covering all new functionality - Update README.md with detailed multi-execution documentation - Implement proper error handling and validation for all new features Resolves #111

- Fix mypy command in CI workflow to specify autorepro tests paths - Fix JSON output parsing in exec CLI tests for multi-execution format - Fix exit code mismatch: return 2 for out-of-range index errors - Update unit tests to expect correct exit codes - Prevent duplicate JSONL records in multi-execution mode - Add noqa comment for PLR0912 complexity warning Resolves CI test failures for T-021

ali90h added 2 commits September 13, 2025 18:11

ali90h merged commit fd4408c into main Sep 13, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

T-021: Add multi-execution support to exec command #125

T-021: Add multi-execution support to exec command #125

Uh oh!

ali90h commented Sep 13, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

T-021: Add multi-execution support to exec command #125

T-021: Add multi-execution support to exec command #125

Uh oh!

Conversation

ali90h commented Sep 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🚀 T-021: Multi-execution (exec --all/--indexes/--until-success) + JSONL Summary

✨ New Features

Multi-execution Flags

JSONL Streaming Output

Command Selection Logic

🔧 Implementation Details

Enhanced CLI Interface

JSONL Output Schema

Execution Pipeline

🧪 Testing & Quality

Comprehensive Test Suite

Manual Testing

📚 Documentation

Updated README.md

CLI Help Text

🔄 Backward Compatibility

🎯 Acceptance Criteria Met

🔗 Related Issues

📊 Files Changed

🚀 Ready for Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ali90h commented Sep 13, 2025 •

edited

Loading