MachineLearningKubernetesNonCriticalPodCrashLooping

Reference information for MachineLearningKubernetesNonCriticalPodCrashLooping used as part of Pulse in CDP Private Cloud Data Services.

Summary
Machine Learning Pod is crash looping
PromQL Expression
increase(kube_pod_container_status_restarts_total{pod!~"{{ .Values.alerts.mlCriticalPodsRegexp }}", appType="ml"} [15m]) > 2
Time Period
0s
Severity
warning
Source
environments/ml