@siegfriedweber
Member

Description

Use unique names for cluster-wide Prometheus resources

Prometheus was installed with the Helm parameter --take-ownership, which should have prevented the following error but did not:

Error: INSTALLATION FAILED: 3 errors occurred:
    * clusterroles.rbac.authorization.k8s.io "prometheus-stack-kube-prom-prometheus" already exists
    * clusterrolebindings.rbac.authorization.k8s.io "prometheus-stack-kube-prom-prometheus" already exists
    * mutatingwebhookconfigurations.admissionregistration.k8s.io "prometheus-stack-kube-prom-admission" already exists

Therefore, the namespace is now added to the Helm release name and, in turn, to the names of the cluster-wide resources, which are derived from the release name.
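
A minimal sketch of the naming scheme, not the exact change in this PR: the release name is built from a fixed prefix plus the test namespace, so two installations in different namespaces no longer collide on ClusterRole, ClusterRoleBinding, or webhook configuration names. The `release_name` helper, the `prometheus-stack` prefix (inferred from the resource names in the error above), the default namespace, and the chart reference are all assumptions for illustration. The command is printed rather than executed so the sketch runs without a cluster.

```shell
#!/bin/sh
# Hypothetical helper: derive a per-namespace Helm release name.
# The "prometheus-stack" prefix is an assumption based on the error output.
release_name() {
  printf 'prometheus-stack-%s' "$1"
}

# The namespace would normally be supplied by the test framework;
# the default here is a placeholder.
NAMESPACE="${NAMESPACE:-example-namespace}"
# Assumed chart reference; substitute the chart the test actually installs.
CHART="${CHART:-prometheus-community/kube-prometheus-stack}"

RELEASE="$(release_name "$NAMESPACE")"

# Print the install command instead of running it.
# --wait blocks until the released resources are ready.
echo helm install "$RELEASE" "$CHART" \
  --namespace "$NAMESPACE" --wait --take-ownership
```

Because cluster-wide resource names are derived from the release name, a unique release name per namespace yields unique cluster-wide names without relying on --take-ownership.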

Definition of Done Checklist

  • Not all of these items are applicable to all PRs; the author should update this template to leave only the boxes that are relevant
  • Please make sure all these things are done and tick the boxes

Author

  • Changes are OpenShift compatible
  • Integration tests passed (for non trivial changes)

Reviewer

  • Code contains useful comments
  • Code contains useful logging statements
  • (Integration-)Test cases added
  • Documentation added or updated. Follows the style guide.
  • Changelog updated
  • Cargo.toml only contains references to git tags (not specific commits or branches)

Acceptance

  • Feature Tracker has been updated
  • Proper release label has been added
  • Links to generated (nightly) docs added
  • Release note snippet added
  • Add type/deprecation label & add to the deprecation schedule
  • Add type/experimental label & add to the experimental features tracker

@siegfriedweber siegfriedweber requested a review from a team January 14, 2026 09:51
@siegfriedweber siegfriedweber self-assigned this Jan 14, 2026
@siegfriedweber siegfriedweber moved this to Development: Waiting for Review in Stackable Engineering Jan 14, 2026
Member

@xeniape xeniape left a comment

  • Is the assert of Prometheus starting up no longer needed?
  • The test fails locally for me, a ServiceMonitor is created a step earlier than Prometheus being installed (which has the ServiceMonitor CRD)

@xeniape xeniape moved this from Development: Waiting for Review to Development: In Review in Stackable Engineering Jan 14, 2026
@siegfriedweber
Member Author

> The test fails locally for me, a ServiceMonitor is created a step earlier than Prometheus being installed (which has the ServiceMonitor CRD)

Argh. I should have tested the change in a fresh Kubernetes cluster.

> Is the assert of Prometheus starting up no longer needed?

helm install ... --wait waits until all resources are in a ready state. Therefore, the assertion was never required. I removed it because the StatefulSet name is now also suffixed with the namespace, which cannot be expressed in the assertion.

Member

@xeniape xeniape left a comment

Passes now 👍 thanks :)

@siegfriedweber siegfriedweber added this pull request to the merge queue Jan 14, 2026
Merged via the queue into main with commit c0275bb Jan 14, 2026
10 checks passed
@siegfriedweber siegfriedweber deleted the test/fix-metrics-test branch January 14, 2026 13:53
@siegfriedweber siegfriedweber moved this from Development: In Review to Development: Done in Stackable Engineering Jan 14, 2026
@lfrancke lfrancke moved this from Development: Done to Done in Stackable Engineering Jan 19, 2026
