Commit graph

50 commits

Author SHA1 Message Date
zowoq
92c55595d0 format tree 2024-07-24 10:27:26 +00:00
zowoq
5ce9567be2 darwin03: remove 2024-07-16 09:07:47 +00:00
zowoq
29ccc15750 serve logs on nixpkgs-update-logs.nix-community.org 2024-05-14 09:32:14 +00:00
zowoq
cdd225cc94 modules/nixos/monitoring/alert-rules: increase comin commit alert to 90m 2024-04-21 02:45:17 +00:00
zowoq
7172164bbe modules/nixos/monitoring/alert-rules: check comin deployments are on the same commit 2024-04-20 01:24:04 +00:00
zowoq
536c5e38be modules/nixos/monitoring/alert-rules: update comin 2024-04-09 00:12:52 +00:00
zowoq
e82b00eee9 Revert "modules/nixos/monitoring: ofborg: telegraf -> prometheus"
This reverts commit 2d3f246125.
2024-04-04 04:51:34 +00:00
zowoq
d100c8aac5 modules/nixos/monitoring/telegraf: reenable nixpkgs-update http check
also use `~supervisor` directory
2024-04-02 21:33:29 +00:00
zowoq
b7d0c7a4c5 modules/nixos/monitoring: remove grafana 2024-03-22 23:35:57 +00:00
zowoq
6bdb32d87d modules/nixos/monitoring/alert-rules: add comin 2024-03-22 06:10:00 +00:00
zowoq
2d3f246125 modules/nixos/monitoring: ofborg: telegraf -> prometheus
scraping this target with telegraf isn't working since 1.30.0
2024-03-14 23:52:55 +00:00
zowoq
1ff767bded darwin01: init 2024-03-08 07:38:14 +00:00
zowoq
9e026e0366 modules/nixos/monitoring: add ofborg prometheus and eval queue alert 2024-02-04 10:51:26 +00:00
zowoq
50fa6f0686 modules/nixos/monitoring/prometheus: set retention time to 30 days
default is 15 days
2024-01-23 22:56:05 +00:00
zowoq
c03246f531 add wants to services using network-online.target
c2853e2588
2024-01-22 03:39:59 +00:00
zowoq
4143922c6b build02: switch to new hardware 2023-12-13 05:53:33 +00:00
Jörg Thalheim
b01aa3a7e2 monitoring: build03 -> build01 for smart errors 2023-12-04 08:20:28 +00:00
zowoq
e55dafbe9d modules/nixos/monitoring/grafana: ensurePermissions -> ensureDBOwnership 2023-11-20 00:24:48 +00:00
zowoq
5f03801844 remove web01 and lemmy 2023-11-19 22:44:53 +00:00
zowoq
5c7bab039b modules/nixos/monitoring/alert-rules: alert at 90% disk usage 2023-11-14 23:20:22 +00:00
zowoq
a668626fcf Revert "nur-update: build03 -> web01"
This reverts commit 0fe327bce4.
2023-11-12 00:09:23 +00:00
zowoq
e24a3ac8cb modules/nixos/monitoring: switch to srvos alert rules 2023-11-11 07:48:49 +00:00
zowoq
8e1423592b modules/nixos/monitoring/alert-rules: drop localhost_reboot 2023-11-06 22:32:03 +00:00
zowoq
d90801d01f add buildbot 2023-11-04 08:05:37 +00:00
zowoq
ffe723781b modules/nixos/monitoring/telegraf: disable nixpkgs-update http check 2023-10-17 05:28:29 +00:00
zowoq
912a7b27c1 modules/nixos/monitoring: put alertmanager behind basic_auth 2023-09-26 21:38:24 +00:00
zowoq
4293c51090 modules/nixos/monitoring: add grafana 2023-09-26 21:31:58 +00:00
zowoq
f97526c4ee modules/nixos/monitoring/alert-rules: update filesystem_full rules 2023-09-26 21:19:34 +00:00
zowoq
0fe327bce4 nur-update: build03 -> web01 2023-09-15 06:44:04 +00:00
zowoq
6cbfb10f9a modules/nixos/monitoring/matrix-hook: use package from nixpkgs 2023-09-14 03:17:35 +00:00
zowoq
d4343f7ebe move alertmanager, prometheus under monitoring.nix-community.org 2023-09-06 05:26:57 +00:00
zowoq
ac4a067c2b Revert "move alertmanager, prometheus under monitoring.nix-community.org"
This reverts commit 2e480a6b62.
2023-09-05 00:12:43 +00:00
zowoq
2e480a6b62 move alertmanager, prometheus under monitoring.nix-community.org 2023-09-04 22:29:03 +00:00
zowoq
8da79f024c modules/nixos/monitoring: disable alertmanager clustering 2023-08-31 08:21:22 +00:00
zowoq
8b8a4ca8b3 modules/nixos/monitoring/matrix-hook: after network-online 2023-08-28 13:36:23 +00:00
zowoq
af1e5359a7 modules/nixos/monitoring/prometheus: set after for services 2023-08-28 13:36:23 +00:00
zowoq
f5744e67e9 Revert "modules/nixos/monitoring: import matrix-hook template"
This reverts commit d9bb98aa79.
2023-08-26 01:16:50 +00:00
zowoq
0aabe7310f Revert "modules/nixos/monitoring/message.html.tmpl: various"
This reverts commit 287ae73ad2.
2023-08-26 01:16:50 +00:00
zowoq
287ae73ad2 modules/nixos/monitoring/message.html.tmpl: various 2023-08-26 00:29:25 +00:00
zowoq
d9bb98aa79 modules/nixos/monitoring: import matrix-hook template 2023-08-26 00:29:25 +00:00
zowoq
d883b923d5 modules/nixos/monitoring/alert-rules: various
- instance -> host

- simplify telegraf_down

- also exclude darwin from load15
2023-08-21 23:58:51 +00:00
zowoq
e9a020cfd5 modules/nixos/monitoring: matrix-alertmanager-receiver -> matrix-hook 2023-08-18 00:54:11 +00:00
zowoq
b1483a1ead modules/nixos/monitoring/alert-rules: revert reboot to 300, add localhost_reboot 2023-08-16 22:36:57 +00:00
zowoq
7062fc07e4 modules/nixos/monitoring: scrape prometheus and alertmanager 2023-08-16 22:36:57 +00:00
zowoq
e3fb48904c modules/nixos/monitoring: refactor hosts 2023-08-16 22:36:57 +00:00
zowoq
bdcb90a8df modules/nixos/monitoring/prometheus: drop match_re 2023-08-14 11:37:24 +00:00
zowoq
f68b4d7a02 modules/nixos/monitoring/alert-rules: add matrix-alertmanager-receiver 2023-08-14 11:26:44 +00:00
zowoq
0db08b6881 modules/nixos/monitoring: add alertmanager, matrix-alertmanager 2023-08-14 05:16:29 +00:00
zowoq
a5e91e20b4 modules/nixos/monitoring/telegraf: add hydra and nixpkgs-update logs 2023-08-11 05:48:27 +00:00
zowoq
9d8bffd0fd modules/nixos: monitoring 2023-08-08 06:16:48 +00:00