Bump gha resources to avoid over-committing splice cluster#4170

Open
julientinguely-da wants to merge 3 commits intomainfrom
julien/7390-bump-gha-resources

Conversation

@julientinguely-da
Contributor

fixes https://github.com/DACH-NY/cn-test-failures/issues/7390

see https://grafana.splice.network.canton.global/d/ae9lqwimiigw0d/resource-utilization?orgId=1&from=now-12h&to=now&timezone=UTC&var-test_suite=$__all

Pull Request Checklist

Cluster Testing

  • If a cluster test is required, comment /cluster_test on this PR to request it, and ping someone with access to the DA-internal system to approve it.
  • If a hard-migration test is required (from the latest release), comment /hdm_test on this PR to request it, and ping someone with access to the DA-internal system to approve it.

PR Guidelines

  • Include any change that might be observable by our partners or affect their deployment in the release notes.
  • Specify fixed issues with Fixes #n, and mention issues worked on using #n
  • Include a screenshot for frontend-related PRs - see README or use your favorite screenshot tool

Merge Guidelines

  • Make the git commit message look sensible when squash-merging on GitHub (most likely: just copy your PR description).


Signed-off-by: Julien Tinguely <julien.tinguely@digitalasset.com>
```diff
 requests: {
   cpu: '4',
-  memory: '10Gi',
+  memory: '8Gi',
```
Contributor

can we make this stuff configurable through config instead of hardcoding it? That way changing it becomes much easier than having to jump between splice and internal
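For reference, a configurable layout could look something like this (a hypothetical sketch only; the key names `runnerSpecs` and `resources` are illustrative and not the actual splice config schema, though the values match the ones discussed in this PR):

```yaml
# Hypothetical config fragment: runner resources become data that
# internal can override, instead of values hardcoded in splice.
runnerSpecs:
  default:
    nodeType: c4-standard-16
    resources:
      requests:
        cpu: '4'
        memory: '10Gi'
      limits:
        cpu: '4'
        memory: '10Gi'
```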

Contributor Author

good idea

Contributor Author

Now we have to be careful and first define the runnerSpecs in internal before merging this PR, right?

Contributor Author


Signed-off-by: Julien Tinguely <julien.tinguely@digitalasset.com>
Contributor

@moritzkiefer-da left a comment

The configurability makes sense. For actually changing resources, I suggest reading the discussion in #4047 and checking with Itai and Nicu what makes sense. I haven't followed this closely enough, and some of what you seem to do here goes against what Nicu suggested there, which I believe was to reduce some of the requests, while you are increasing them. Maybe for now just merge only the configurability but keep the values as before.

```yaml
nodeType: c4-standard-16
minNodes: 0
maxNodes: 1
gha:
```
Contributor

why not put this in the default config.yaml? Then we can override it in internal, but it still works if we don't override
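As a sketch of the default-plus-override idea (hypothetical layout; only the `gha`, `nodeType`, `minNodes`, and `maxNodes` keys appear in this PR):

```yaml
# Hypothetical default config.yaml: these values apply whenever the
# internal repo does not override them. An internal overlay declaring
# the same keys would take precedence under a simple deep-merge scheme,
# so nothing breaks if internal stays silent.
gha:
  nodeType: c4-standard-16
  minNodes: 0
  maxNodes: 1
```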

Contributor Author

because it seems kind of weird to have half of the config in internal/main and the other half in the default. I usually just paste it in mock to have a reference, but here it doesn't update the expected mock output

Contributor Author

@julientinguely-da Feb 27, 2026

Oh thanks for the refs

> Maybe for now just merge only the configurability but keep the values as before.

I'll just wait until Monday to see what @nicu says on that. To me, the Grafana dashboard shows that the resources are tight

Contributor Author

@julientinguely-da Feb 27, 2026

ok yes, we could indeed lower the requests a bit more while keeping the limits 👍

Contributor

> because it seems kind of weird to have half of the config in internal/main and the other half in the default. I usually just paste it in mock to have a reference, but here it doesn't update the expected mock output

fair

Contributor

@julientinguely-da also check the discussion in #4047 around changing any resource requests. I think it makes sense but maybe check with @isegall-da as well

Contributor

I'm a bit worried that requests != limits means we are not dedicating resources, and things might start failing in non-deterministic and hard-to-investigate ways
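For context, the concern maps to Kubernetes QoS classes: when every container's requests equal its limits, the pod gets the "Guaranteed" QoS class, so the scheduler reserves exactly what the container may use and it is the last to be evicted under pressure. A minimal sketch with the values discussed in this PR:

```yaml
# Sketch: requests == limits for all resources gives the pod the
# Kubernetes "Guaranteed" QoS class. With requests < limits the pod is
# only "Burstable", so the node can be overcommitted and workloads may
# fail non-deterministically under contention.
resources:
  requests:
    cpu: '4'
    memory: '10Gi'
  limits:
    cpu: '4'
    memory: '10Gi'
```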

Contributor Author

@julientinguely-da Mar 2, 2026

I've made the changes to make them equal and bump their value: https://github.com/DACH-NY/canton-network-internal/pull/3879

@julientinguely-da julientinguely-da requested review from nicu-da and removed request for OriolMunoz-da February 27, 2026 13:49

Signed-off-by: Julien Tinguely <julien.tinguely@digitalasset.com>