feat: Add hosted evals options. #303

d42me · 2026-01-13T23:09:37Z

Note

Introduces a platform-hosted evaluation flow alongside local runs, with CLI ergonomics and async polling/log streaming.

New --hosted mode in prime eval run and deprecated prime env eval; requires owner/name slug and resolves environment ID from hub
Adds hosted run options: --poll-interval, --no-stream-logs, --timeout-minutes, --allow-sandbox-access, --allow-instances-access, --custom-secrets, --eval-name
Implements utils/hosted_eval.py providing HostedEvalConfig, HostedEvalResult, run_hosted_evaluation (create via /hosted-evaluations, poll /evaluations/{id}, stream logs, fetch final stats) and print_hosted_result
Updates env.run_eval to parse JSON args (--env-args, --custom-secrets), create hosted config, run via asyncio, print results, and treat non-COMPLETED as failure; retains existing local eval path and installation behavior

^{Written by Cursor Bugbot for commit 1d4747c. This will update automatically on new commits. Configure here.}

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.}

cursor · 2026-01-16T17:38:01Z

packages/prime/src/prime_cli/utils/hosted_eval.py

+            console=console,
+        ) as live:
+            while True:
+                await asyncio.sleep(poll_interval)


Missing validation for negative poll interval causes crash

Low Severity

The poll_interval parameter is passed directly to asyncio.sleep() without validation. If a user provides a negative value via --poll-interval, Python raises ValueError: sleep length must be non-negative, resulting in a traceback rather than a graceful CLI error. The parameter is exposed as a CLI option in evals.py without any bounds checking.

Additional Locations (1)

packages/prime/src/prime_cli/commands/evals.py#L648-L652

Add hosted evals options.

9ce2026

d42me marked this pull request as ready for review January 15, 2026 15:24

Update endpoints:

1d4747c

cursor bot reviewed Jan 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add hosted evals options. #303

feat: Add hosted evals options. #303

Uh oh!

d42me commented Jan 13, 2026 •

edited by cursor bot

Loading

Uh oh!

cursor bot left a comment

Uh oh!

cursor bot Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: Add hosted evals options. #303

Are you sure you want to change the base?

feat: Add hosted evals options. #303

Uh oh!

Conversation

d42me commented Jan 13, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Jan 16, 2026

Choose a reason for hiding this comment

Missing validation for negative poll interval causes crash

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

d42me commented Jan 13, 2026 •

edited by cursor bot

Loading