I am encountering frequent, random job cancellations in a Kubernetes environment. The job status changes to Canceled with the error message:
Canceled: Job canceled: http connection broke by
this happen usually occurs in less than 1 second of execution time.
Environment Context
-
Deployment: Kubernetes (Helm/Manifest)
-
Worker Type: TypeScript (Deno)
-
Resource Status: * No OOMKilled events in kubectl describe pod.
-
High CPU/Memory limits provided (resource starvation is not the cause).
-
No Pod restarts or liveness probe failures observed during the error.
-
Frequency: Intermittent, but frequent on short-lived flows.
Has anybody faced this issue before?