Skip to content

[Athena] Fix CI timeout: skip accuracy tests in short mode#514

Open
syifan wants to merge 1 commit intomainfrom
athena/fix-ci-accuracy-test-skip
Open

[Athena] Fix CI timeout: skip accuracy tests in short mode#514
syifan wants to merge 1 commit intomainfrom
athena/fix-ci-accuracy-test-skip

Conversation

@syifan
Copy link
Contributor

@syifan syifan commented Feb 13, 2026

Summary

  • Add testing.Short() skip to all 5 accuracy tests in benchmarks/accuracy_test.go
  • CI passes -short flag but these tests didn't check it, causing the Acceptance Tests job to timeout (>5min) on every push
  • medium_test.go and polybench_test.go already had this pattern; accuracy_test.go was the only file missing it

Impact

This is the #1 CI blocker — every push to main fails because TestAccuracyAgainstBaseline runs 25+ full pipeline simulations. With this fix, CI should pass for the first time.

Test plan

Tests affected

  • TestAccuracyAgainstBaseline — skipped in short mode
  • TestAccuracyDependencyChain — skipped in short mode
  • TestAccuracyArithmetic — skipped in short mode
  • TestAccuracyBranch — skipped in short mode
  • TestGenerateAccuracyReport — skipped in short mode

These tests still run in the dedicated accuracy workflows (accuracy-report.yml, h5-accuracy-report.yml) which don't pass -short.

🤖 Generated with Claude Code

The accuracy tests (TestAccuracyAgainstBaseline, TestAccuracyDependencyChain,
TestAccuracyArithmetic, TestAccuracyBranch, TestGenerateAccuracyReport) run
full pipeline simulations that take >5 minutes. CI passes -short flag but
these tests didn't check it, causing the Acceptance Tests job to timeout
on every push. medium_test.go and polybench_test.go already had this pattern.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@syifan syifan mentioned this pull request Feb 13, 2026
@github-actions
Copy link

Performance Regression Analysis

Performance Benchmark Comparison

Compares PR benchmarks against main branch baseline.
Benchmarks: pipeline tick throughput across ALU, memory, mixed workloads.



---
*Automated benchmark comparison via `go test -bench` + benchstat*

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant