
fix: remove stale health-gate args and add done-gate passthrough in core4_eval.py #111

Merged
abrichr merged 1 commit into main from fix/core4-eval-args on Mar 6, 2026

abrichr commented Mar 6, 2026

Summary

  • Remove --transport-error-threshold, --health-samples, --health-min-success, and --health-sample-delay from core4_eval.py. run_dc_eval.py does not define these args, so passing them fails with an "unrecognized arguments" error
  • Add --done-gate, --done-gate-max-overrides, and --done-gate-threshold passthrough to match PR #110 (feat: add done-gate to prevent premature task completion)
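
The passthrough pattern described in the summary might look roughly like the sketch below. This is an illustration only, not the code from core4_eval.py: the helper names `build_parser` and `build_child_args`, the flag types (boolean, int, float), and the `None` defaults are all assumptions. Only the three flag names come from this PR.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical wrapper parser; only the flag names are taken from the PR.
    parser = argparse.ArgumentParser(prog="core4_eval.py")
    parser.add_argument("--done-gate", action="store_true")
    # Types and defaults below are assumed for illustration.
    parser.add_argument("--done-gate-max-overrides", type=int, default=None)
    parser.add_argument("--done-gate-threshold", type=float, default=None)
    return parser


def build_child_args(args: argparse.Namespace) -> list[str]:
    """Translate parsed wrapper options into argv tokens for the child script."""
    child: list[str] = []
    if args.done_gate:
        child.append("--done-gate")
    # Forward optional values only when the user supplied them, so the
    # child script's own defaults apply otherwise.
    if args.done_gate_max_overrides is not None:
        child += ["--done-gate-max-overrides", str(args.done_gate_max_overrides)]
    if args.done_gate_threshold is not None:
        child += ["--done-gate-threshold", str(args.done_gate_threshold)]
    return child
```

Forwarding only the flags that were explicitly set keeps the wrapper from pinning values the child script may later change on its own.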

Test plan

  • uv run python scripts/core4_eval.py run --trials 1 --controller --done-gate --max-steps 30 --vm-ip 172.173.66.131 --dry-run succeeds

🤖 Generated with Claude Code

…ore4_eval.py

core4_eval.py was passing --transport-error-threshold, --health-samples,
--health-min-success, and --health-sample-delay to run_dc_eval.py, but
run_dc_eval.py does not define those args (they came from uncommitted Codex
changes). This commit also adds --done-gate passthrough to match PR #110.
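
The failure mode described in this commit message can be reproduced in isolation with the sketch below. It assumes run_dc_eval.py parses its arguments with argparse, which the "unrecognized arguments" wording suggests but this PR does not confirm. The stand-in parser and the `accepts` helper are hypothetical.

```python
import argparse


def make_child_parser() -> argparse.ArgumentParser:
    # Stand-in for the child script's parser (assumed); it defines only
    # --max-steps, so any health-gate flag is unknown to it.
    parser = argparse.ArgumentParser(prog="run_dc_eval.py")
    parser.add_argument("--max-steps", type=int, default=30)
    return parser


def accepts(argv: list[str]) -> bool:
    """Return True if the stand-in parser accepts argv, False if it rejects it."""
    try:
        make_child_parser().parse_args(argv)
    except SystemExit:
        # argparse reports unknown flags on stderr and exits the process,
        # which surfaces here as SystemExit.
        return False
    return True
```

Under these assumptions, `accepts(["--max-steps", "5"])` passes while `accepts(["--health-samples", "3"])` is rejected, which is why a wrapper that forwards a flag the child never defined fails before any evaluation starts.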

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@abrichr abrichr merged commit 38f8e33 into main Mar 6, 2026
1 check passed