[release-4.19] OCPBUGS-76962: Extends the time for the extractor liveness probe #1230
jmesnil wants to merge 1 commit into openshift:release-4.19
@jmesnil: This pull request references Jira Issue OCPBUGS-66996, which is invalid:
The bug has been updated to refer to the pull request using the external bug tracker.
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.
Upstream is #1198
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: jmesnil
The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
Before:

* `crictl info` timed out after 2 seconds
* the command was executed every 10 seconds
* 2 failures made the container unhealthy
* => 10 seconds of unavailability made the pod crash

This was too constraining, as there are occasions where `crictl` can be unavailable for a longer period of time (e.g. when a TLS CA bundle update requires some pods to be restarted).

Now:

* `crictl info` times out after 10 seconds
* the command is executed every 30 seconds
* 3 failures (the default) make the container unhealthy
* => 1m30s of unavailability makes the pod crash

Note: The liveness probe is used instead of the readiness probe because the container MUST crash if the crictl connection has changed (e.g. following a TLS CA bundle update); at that point, the pod must be recreated to be able to connect to the cri-o socket with an updated TLS certificate.

This fixes https://issues.redhat.com/browse/OCPBUGS-76962
Upstream issue is https://issues.redhat.com/browse/OCPBUGS-66996

Signed-off-by: Jeff Mesnil <jmesnil@redhat.com>
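
For readers who want to see how these timings map onto a probe definition, here is a minimal sketch of the new settings using the standard Kubernetes probe fields. It is not copied from the diff: the exact command array and the choice of `timeoutSeconds` to express the timeout are assumptions, and only the numeric values come from the description above.

```yaml
# Illustrative fragment of a container spec (assumed layout, not the actual manifest).
livenessProbe:
  exec:
    # Assumed command form; the description only says that `crictl info` is run.
    command: ["crictl", "info"]
  timeoutSeconds: 10     # was 2
  periodSeconds: 30      # was 10
  failureThreshold: 3    # Kubernetes default; was 2
```

With these values the kubelet restarts the container only after three consecutive failed probes, 30 seconds apart, which corresponds to the roughly 1m30s window quoted above.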
12e9898 to 4da1836
/jira refresh
@jmesnil: This pull request references Jira Issue OCPBUGS-76962, which is invalid:
The bug has been updated to refer to the pull request using the external bug tracker.
/jira refresh
@jmesnil: The following test failed, say
Full PR test history. Your PR dashboard.
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.