haccfiles/parsons
stuebinm 064a9a05dc monitoring: send alert emails for failed services
This idea is based on
 https://utcc.utoronto.ca/~cks/space/blog/linux/SystemdTimersMailNotes
and the page linked from it,
 https://wiki.archlinux.org/title/Systemd/Timers#MAILTO
but uses a top-level systemd override to send such alerts for all
service units on parsons, not just timers. Tested by sending SIGKILL to
monit a couple of times and receiving the alert emails.

We might now get two emails for some failing units, or possibly even
three: one from the new override, one if the is-system-running check
fails because of the unit failure, and one if monit also notices that the
service is not running. On the other hand, we now also get emails if
monit itself fails.
2025-02-01 17:00:03 +01:00
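
For readers unfamiliar with the technique named in the commit message, here is a rough sketch of what a top-level systemd override with an OnFailure= hook could look like in NixOS. It is not taken from monitoring.nix. The unit name notify-email@, the drop-in file name, the recipient address and the sendmail path are all assumptions made for illustration, and the actual implementation in this repository may differ.

```nix
# Hypothetical sketch, NOT the contents of monitoring.nix.
# Assumed names: notify-email@, onfailure-email.conf, admin@example.org.
{ pkgs, ... }:
let
  # Mails the status of the unit named in $1 (assumes a working local sendmail).
  notifyScript = pkgs.writeShellScript "notify-email" ''
    unit="$1"
    {
      echo "To: admin@example.org"
      echo "Subject: [parsons] $unit failed"
      echo
      ${pkgs.systemd}/bin/systemctl status --full --no-pager "$unit"
    } | /run/wrappers/bin/sendmail -t
  '';
in
{
  # Template unit: the instance name (%i) is the unit that failed.
  systemd.services."notify-email@" = {
    description = "Send an alert email about failed unit %i";
    serviceConfig = {
      Type = "oneshot";
      ExecStart = "${notifyScript} %i";
    };
  };

  systemd.packages = [
    # Top-level drop-in: service.d/ applies to every service unit.
    (pkgs.writeTextDir "etc/systemd/system/service.d/onfailure-email.conf" ''
      [Unit]
      OnFailure=notify-email@%n.service
    '')
    # Empty drop-in with the same file name, so the notifier does not
    # trigger itself when sending the mail fails.
    (pkgs.writeTextDir
      "etc/systemd/system/notify-email@.service.d/onfailure-email.conf" "")
  ];
}
```

The empty second drop-in follows the approach described on the Arch wiki page linked above: a same-named file in the more specific directory takes precedence over the one in service.d, which keeps a failing notifier from recursively triggering further notifications.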
configuration.nix monitoring: send alert emails for failed services 2025-02-01 17:00:03 +01:00
forgejo.nix fix forgejo 2024-12-05 22:45:53 +01:00
hardware.nix meta: new structure 2024-01-11 23:49:26 +01:00
hedgedoc-hacc.nix modules/containers: a hacc-specific containers module 2024-04-19 19:15:22 +02:00
hedgedoc-i4f.nix modules/containers: a hacc-specific containers module 2024-04-19 19:15:22 +02:00
lxc.nix simplify nat on parsons 2024-04-07 16:25:08 +02:00
mail.nix monit: a couple new checks 2024-04-07 16:30:57 +02:00
mattermost.nix mattermost: disable the big blue button plugin 2024-08-19 22:05:03 +02:00
monitoring.nix monitoring: send alert emails for failed services 2025-02-01 17:00:03 +01:00
murmur.nix bundle hexchen's nopersist & bindmount modules 2024-02-17 00:04:51 +00:00
nextcloud.nix nextcloud: 29 → 30 2024-12-07 15:13:20 +01:00
nftables.nix parsons/nftables: don't log refused connections 2024-10-11 14:22:52 +02:00
nginx-pages.nix fix docs.hacc.space 2024-01-12 00:31:32 +01:00
s4f-conference.nix s4f-conference: increase MaxUsersPerTeam 2024-05-22 21:25:07 +02:00
tracktrain.nix tracktrain: add a delay so postgres has time to start 2024-12-09 19:05:21 +01:00
uffd.nix pkgs/scripts: move auamost into hacc-scripts 2024-11-11 01:12:22 +01:00
vaultwarden.nix meta: new structure 2024-01-11 23:49:26 +01:00