Skip to content

Pull requests: OpenHands/benchmarks

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add SWTBench profiling timelines
#313 opened Jan 14, 2026 by simonrosenberg Loading…
Use mamba for swt-bench env builds
#309 opened Jan 13, 2026 by simonrosenberg Loading…
build(deps): bump actions/github-script from 7 to 8 in the version-all group dependencies Pull requests that update a dependency file github_actions Pull requests that update GitHub Actions code
#292 opened Jan 12, 2026 by dependabot bot Loading…
build(deps): bump the version-all group across 1 directory with 15 updates dependencies Pull requests that update a dependency file python:uv Pull requests that update python:uv code
#278 opened Jan 7, 2026 by dependabot bot Loading…
Require main and critic outputs
#252 opened Jan 6, 2026 by simonrosenberg Loading…
Add output_jsonl_gcs input forwarding
#237 opened Jan 3, 2026 by simonrosenberg Loading…
Add OpenAgentSafety to eval CI
#221 opened Dec 29, 2025 by simonrosenberg Loading…
Add Multi-SWE-bench image build support
#219 opened Dec 29, 2025 by simonrosenberg Loading…
Agentic code search
#141 opened Dec 8, 2025 by adityasoni9998 Loading…
API-based Critic implementation build-swebench-200 Build 200 SWE-Bench Verified Image based on SDK version on this PR.
#117 opened Nov 26, 2025 by xingyaoww Draft
ProTip! Type g i on any issue or pull request to go back to the issue listing page.