stacks-instances/otc/observability.buildth.ing/stacks
Daniel Sy 0316eefa43
fix(observability): 🐛 disable false-positive control-plane alerts and fix empty cluster_environment label
The hub defaultRules groups kubernetesSystemControllerManager, kubeScheduler, and
kubernetesSystemScheduler used the wrong key 'enabled: false'; the chart expects 'create: false',
so the groups were never actually disabled. This caused KubeControllerManagerDown/KubeSchedulerDown
to fire as false positives, because OTC CCE managed Kubernetes does not expose the control plane for scraping.
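
The key change described above can be sketched as a values fragment. This is a minimal sketch, not the actual diff: the group names and the 'create: false' key come from the commit message, while the exact nesting under defaultRules.groups is an assumption about the chart's values layout.

```yaml
# Hypothetical hub values fragment; the nesting under defaultRules.groups is assumed.
defaultRules:
  groups:
    kubernetesSystemControllerManager:
      create: false   # replaces the wrong key 'enabled: false'
    kubeScheduler:
      create: false
    kubernetesSystemScheduler:
      create: false
```

With the groups skipped at render time, the KubeControllerManagerDown and KubeSchedulerDown rules should no longer be created for a cluster whose control plane cannot be scraped.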

The dev local vmagent had empty externalLabels, so the backup-alert rules evaluated by the local
vmalert saw no cluster_environment label on kube_job_status_failed metrics. Added
cluster_environment=dev to match what the vm-client-stack vmagent adds for hub shipping.
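
The label fix above might look roughly like the fragment below. It is a sketch under assumptions: the label name and value come from the commit message, and the vmagent.spec nesting is assumed to mirror the VMAgent custom resource's spec.externalLabels field rather than taken from this repository.

```yaml
# Hypothetical dev values fragment; the vmagent.spec nesting is assumed.
vmagent:
  spec:
    externalLabels:
      cluster_environment: dev   # same label the vm-client-stack vmagent adds for hub shipping
```

Setting the label on the local vmagent means locally scraped series such as kube_job_status_failed carry cluster_environment, so rules evaluated by the local vmalert see the same label set as the hub.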
2026-06-19 12:42:21 +02:00
coder Automated upload for observability.buildth.ing 2026-03-04 09:55:46 +00:00
core fix: add ServerSideApply for argocd CRDs, remove deprecated vector playground field 2026-06-02 09:57:05 +01:00
forgejo fix(forgejo): ⏱️ increase s3-backup activeDeadlineSeconds 1350→7200 2026-06-19 12:35:41 +02:00
garm Automated upload for observability.buildth.ing 2026-03-04 09:55:46 +00:00
observability fix(observability): 🐛 disable false-positive control-plane alerts and fix empty cluster_environment label 2026-06-19 12:42:21 +02:00
observability-client revert(kepler): remove Kepler, incompatible with OTC CCE proc mount restrictions 2026-06-02 16:12:06 +01:00
otc upgrade chart versions: argocd, dex, cloudnative-pg, cert-manager, ingress-nginx, vector, metrics-server 2026-06-02 09:50:04 +01:00
terralist Automated upload for observability.buildth.ing 2026-03-04 09:55:46 +00:00