
0.5.0 load test anomaly #63

@erkolson


Something similar to the "stuck connections" issue we see in production occurred during the 0.5.0 load test. However, because bb8 handles connections differently, it was not readily apparent which pod was "stuck".

Connection pools looked like this:
Screen Shot 2020-07-20 at 10 43 10 AM

It appears that one pod (...-nr48p) was unable to use all of the idle connections, was very slow to handle requests, and was returning 503s to clients.
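
For illustration, the symptom pattern described above (idle connections available, yet slow handling and 5xx responses) can be turned into a rough per-pod check. This is only a sketch: the `PodStats` fields, the `find_stuck_pods` helper, the thresholds, and the sample numbers are all hypothetical and are not taken from the service or from the load test dashboards.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class PodStats:
    """Hypothetical per-pod snapshot; field names are illustrative only."""
    name: str
    idle_connections: int  # connections sitting unused in the pool
    p99_seconds: float     # request handling duration
    rate_5xx: float        # fraction of responses that were 5xx


def find_stuck_pods(pods, latency_factor=3.0, min_5xx=0.01):
    """Flag pods that hold idle connections yet are slow and returning 5xx."""
    # Compare each pod against the fleet median so one outlier stands out.
    baseline = median(p.p99_seconds for p in pods)
    return [
        p.name
        for p in pods
        if p.idle_connections > 0
        and p.p99_seconds > latency_factor * baseline
        and p.rate_5xx >= min_5xx
    ]


# Made-up numbers for illustration only; not taken from the load test.
pods = [
    PodStats("pod-a", idle_connections=2, p99_seconds=0.05, rate_5xx=0.0),
    PodStats("pod-b", idle_connections=3, p99_seconds=0.06, rate_5xx=0.0),
    PodStats("pod-c", idle_connections=9, p99_seconds=2.50, rate_5xx=0.20),
]
stuck = find_stuck_pods(pods)
```

Comparing against the median rather than the mean keeps a single slow pod from dragging the baseline up and hiding itself.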

Request handling durations:
Screen Shot 2020-07-20 at 10 45 57 AM

5xx rate:
Screen Shot 2020-07-20 at 10 46 09 AM

After deleting that one pod, performance returned to normal.

Metadata


Labels

8 (Estimate - xl - Moderately complex, medium effort, some uncertainty.), bug (Something isn't working), p1
