-
Notifications
You must be signed in to change notification settings - Fork 199
[AURON #1658] Add tpch suite #1662
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
c7853b1 to
a336113
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull request overview
This PR adds a comprehensive TPC-H benchmark test suite to the Auron Spark extension project. The implementation includes a test framework for running TPC-H queries (q1-q22), verifying results against expected outputs, and validating query execution plans for stability testing.
Key Changes:
- Introduces
AuronTPCHSuiteabstract test class with result and plan verification capabilities - Adds
AuronTPCHV1Suitevariant for testing with V1 Parquet data source - Includes 22 TPC-H SQL queries, expected results, and plan stability files for Spark 3.5
Reviewed changes
Copilot reviewed 68 out of 76 changed files in this pull request and generated no comments.
Show a summary per file
| File | Description |
|---|---|
| AuronTPCHSuite.scala | Test framework implementing TPC-H query execution and verification logic |
| tpch-queries/*.sql | 22 TPC-H benchmark SQL queries (q1-q22) |
| tpch-query-results/*.out | Expected query results for verification |
| tpch-plan-stability/*.txt | Expected query execution plans for Spark 3.5 |
| tpch-data-parquet/* | Sample TPC-H data in Parquet format |
| .rat-excludes | Updated to exclude TPC-H resource files from license checks |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
| @@ -0,0 +1,151 @@ | |||
| == Physical Plan == | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there any way to update query plan and query result in batches?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've added a switch: set REGEN_TPCH_GOLDEN_FILES to 1 to regenerate query plans and results.
733ddd9 to
981e991
Compare
75c2bc3 to
3eba587
Compare
3eba587 to
7f74e23
Compare
|
@cxzl25 PTAL |
Which issue does this PR close?
Closes #1658 .
Rationale for this change
What changes are included in this PR?
Are there any user-facing changes?
How was this patch tested?