stacks-instances/otc/observability.buildth.ing/stacks
Daniel Sy 7a6f96a8b4
feat(observability): add cluster heartbeat dead-man switch alerts
ClusterMetricsSilent: fires if no kubelet metrics for >10m (catches vmagent outages).
ClusterAPIServerDown: fires if apiserver scrape fails for >5m.
Replaces silenced KubeControllerManagerDown/KubeSchedulerDown which never fire on managed K8s.
2026-06-22 11:05:48 +02:00
..
coder Automated upload for observability.buildth.ing 2026-03-04 09:55:46 +00:00
core fix: add ServerSideApply for argocd CRDs, remove deprecated vector playground field 2026-06-02 09:57:05 +01:00
forgejo fix(observability): 🔇 silence managed-K8s false alerts + bump backup deadline to 4h 2026-06-22 10:46:01 +02:00
garm Automated upload for observability.buildth.ing 2026-03-04 09:55:46 +00:00
observability feat(observability): add cluster heartbeat dead-man switch alerts 2026-06-22 11:05:48 +02:00
observability-client revert(kepler): remove Kepler, incompatible with OTC CCE proc mount restrictions 2026-06-02 16:12:06 +01:00
otc upgrade chart versions: argocd, dex, cloudnative-pg, cert-manager, ingress-nginx, vector, metrics-server 2026-06-02 09:50:04 +01:00
terralist Automated upload for observability.buildth.ing 2026-03-04 09:55:46 +00:00