@R-Palazzo
Contributor

@R-Palazzo R-Palazzo commented Dec 22, 2025

Resolve #516
CU-86b7w17ze

@amontanez24 From what I've experienced, we can currently run at most 4 GPU machines in the same region:

RuntimeError: GCP instance creation failed: Quota 'NVIDIA_T4_GPUS' exceeded.  Limit: 4.0 in region us-central1.

The initial plan is to have 6 instances running on January 1st (4 for single-table, 2 for multi-table), so I wanted to discuss the options with you:

  • Run only 4 machines at a time (start single-table on January 1st and multi-table on January 5th, once the machines are available again).
  • Run the 6 machines in a different location, if that's allowed.
  • Increase the number of machines we are allowed to run, if possible.

There may be a better solution, which I'm happy to discuss.
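
To make the first option concrete, here is a minimal sketch of splitting the six planned instances into waves that each fit under the quota. It assumes one T4 GPU per instance; `plan_waves` and the job labels are hypothetical names used only for illustration and are not part of SDGym or the GCP API.

```python
def plan_waves(jobs, gpu_quota):
    """Split jobs into consecutive waves that each fit within the GPU quota.

    Assumes every job needs exactly one GPU (an assumption, not confirmed here).
    """
    if gpu_quota < 1:
        raise ValueError('gpu_quota must be at least 1')

    # Slice the job list into chunks of at most `gpu_quota` jobs.
    return [jobs[i:i + gpu_quota] for i in range(0, len(jobs), gpu_quota)]


# Hypothetical labels: 4 single-table instances, then 2 multi-table instances.
jobs = [f'single_table_{i}' for i in range(4)] + [f'multi_table_{i}' for i in range(2)]
waves = plan_waves(jobs, gpu_quota=4)

for number, wave in enumerate(waves, start=1):
    print(f'Wave {number}: {wave}')
```

With a quota of 4, the single-table instances fill the first wave and the multi-table instances fall into the second, which matches the January 1st / January 5th split described above.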

@R-Palazzo R-Palazzo self-assigned this Dec 22, 2025
@R-Palazzo R-Palazzo requested a review from a team as a code owner December 22, 2025 17:48
@sdv-team
Contributor

@R-Palazzo R-Palazzo removed the request for review from a team December 22, 2025 17:50
   workflow_dispatch:
   schedule:
     - cron: '0 5 1 * *'
+  push:
Contributor Author

This is for testing the workflows; I will remove it before merging.

OUTPUT_DESTINATION_AWS = 's3://sdgym-benchmark/Benchmarks/'
UPLOAD_DESTINATION_AWS = 's3://sdgym-benchmark/Benchmarks/'
OUTPUT_DESTINATION_AWS = (
's3://sdgym-benchmark/Debug/GCP_Github/' # 's3://sdgym-benchmark/Benchmarks/'
Contributor Author

For testing purposes. TODO: update it before merging.

 def post_benchmark_launch_message(date_str, compute_service='AWS', modality='single_table'):
     """Post a message to the SDV Alerts Slack channel when the benchmark is launched."""
-    channel = SLACK_CHANNEL
+    channel = DEBUG_SLACK_CHANNEL
Contributor Author

For testing purposes. TODO: update it before merging.

 def post_benchmark_uploaded_message(folder_name, commit_url=None, modality='single_table'):
     """Post benchmark uploaded message to sdv-alerts slack channel."""
-    channel = SLACK_CHANNEL
+    channel = DEBUG_SLACK_CHANNEL
Contributor Author

For testing purposes. TODO: update it before merging.

@codecov

codecov bot commented Dec 22, 2025

Codecov Report

❌ Patch coverage is 92.06349% with 5 lines in your changes missing coverage. Please review.
✅ Project coverage is 78.93%. Comparing base (279a867) to head (7f02321).

Files with missing lines        Patch %   Lines
sdgym/run_benchmark/utils.py    89.65%    3 Missing ⚠️
sdgym/benchmark.py              0.00%     2 Missing ⚠️
Additional details and impacted files
@@                               Coverage Diff                                @@
##           issue-515-_benchmark_multi_table_compute_gcp     #518      +/-   ##
================================================================================
+ Coverage                                         78.76%   78.93%   +0.16%     
================================================================================
  Files                                                33       33              
  Lines                                              2793     2825      +32     
================================================================================
+ Hits                                               2200     2230      +30     
- Misses                                              593      595       +2     
Flag          Coverage Δ
integration   54.69% <0.00%> (-0.56%) ⬇️
unit          73.45% <92.06%> (+0.16%) ⬆️


@R-Palazzo R-Palazzo force-pushed the issue-516-add-workflows branch from 223d2b2 to 9f10efc Compare December 23, 2025 11:48
     compute_quality_score=True,
     compute_diagnostic_score=True,
-    compute_privacy_score=True,
+    compute_privacy_score=False,
Member

Is this for testing purposes only, or will it stay this way from now on?

Contributor Author

For the benchmark, we don't compute the privacy_score for now:

compute_privacy_score=False,

Comment on lines +52 to +55
MODALITY_TO_GDRIVE_LINK = {
    'single_table': 'https://docs.google.com/spreadsheets/d/1W3tsGOOtbtTw3g0EVE0irLgY_TN_cy2W4ONiZQ57OPo/edit?usp=sharing',
    'multi_table': 'https://docs.google.com/spreadsheets/d/1R13RktVvKnxRecYIge07OBpbX1vbEkE2D1_2idNAKSY/edit?usp=sharing',
}
Member

I know this probably won't change, but could we reuse the ID from sdgym/run_benchmark/upload_benchmark_results.py, if that doesn't cause a circular dependency? Alternatively, we could define the IDs in a constants.py file. That way we would only need to change an ID in one file.

Contributor

If we do this, let's move them to a constants file.
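
One possible shape for such a constants file is sketched below. The module name `constants.py`, the names `GDRIVE_SPREADSHEET_IDS` and `GDRIVE_LINK_TEMPLATE`, and the idea of deriving the links from the IDs are assumptions made for illustration; the IDs are the ones embedded in the links quoted above, and the actual layout in SDGym may differ.

```python
# constants.py (hypothetical module; names are illustrative, not SDGym's actual API)

# Spreadsheet IDs, copied from the links quoted in this review thread.
GDRIVE_SPREADSHEET_IDS = {
    'single_table': '1W3tsGOOtbtTw3g0EVE0irLgY_TN_cy2W4ONiZQ57OPo',
    'multi_table': '1R13RktVvKnxRecYIge07OBpbX1vbEkE2D1_2idNAKSY',
}

# Single template, so the URL format is defined in one place only.
GDRIVE_LINK_TEMPLATE = 'https://docs.google.com/spreadsheets/d/{file_id}/edit?usp=sharing'

# Links are derived from the IDs, so changing an ID updates its link automatically.
MODALITY_TO_GDRIVE_LINK = {
    modality: GDRIVE_LINK_TEMPLATE.format(file_id=file_id)
    for modality, file_id in GDRIVE_SPREADSHEET_IDS.items()
}
```

Other modules could then import either mapping from this one file, which avoids importing between the upload and Slack-message modules and so sidesteps the circular dependency concern.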

Contributor

@amontanez24 amontanez24 left a comment

LGTM!

5 participants