Tortoise scales unreasonably when application misbehaves #405

randytqwjp · 2024-06-05T05:34:24Z

https://mercari.slack.com/archives/C6HC4JBKM/p1717478614923889

When pods are stuck in crashbackloopoff, tortoise recommendation algorithm does not consider this and causes recommendations to scale HPA max replicas to a unreasonable amount

sanposhiho · 2024-06-11T07:20:09Z

causes recommendations to scale HPA max replicas to a unreasonable amount

Does it mean the tortoise lowered the target utilization of HPA too much and consequently HPA increased the replica number?

randytqwjp · 2024-08-23T05:48:16Z

there was a bug in application logic and caused some pods to be stuck in crashbackloop while remaining pods utilization went up. Tortoise then increased maxreplica for this service but since new pods also get stuck in crashbackloop, tortoise kept increasing maxreplica

sanposhiho · 2024-08-23T09:39:00Z

My suggestion is that we can improve Tortoise to check all Pods' status, and then if the ratio of such crashed Pods is higher than the criterion (50% etc), stop changing the max replica (or maybe stop changing any parameters until the situation is stable).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tortoise scales unreasonably when application misbehaves #405

Tortoise scales unreasonably when application misbehaves #405

randytqwjp commented Jun 5, 2024

sanposhiho commented Jun 11, 2024

randytqwjp commented Aug 23, 2024

sanposhiho commented Aug 23, 2024

Tortoise scales unreasonably when application misbehaves #405

Tortoise scales unreasonably when application misbehaves #405

Comments

randytqwjp commented Jun 5, 2024

sanposhiho commented Jun 11, 2024

randytqwjp commented Aug 23, 2024

sanposhiho commented Aug 23, 2024