As for reporting I do 3 things
- nagios - off course - with a pone app. When things are gone, I want to know that now
- cacti - I want graphs of a normal situation. That makes it easier to find a time frame from when it started to go not so normal anymore, but not alarming enough to spark off Nagios.
- email cron-jobs to a filtered account: first job every morning reading those logs: are the certificates updated that need updating, (certbot log), did all the updates install successfully, did backups run without an error message, aren't there a ridiculous number of mails in the mailq of the mailserver, ... all that stuff.
When ik goes wrong, I want to be the first to know and preferably a few hours before it will go wrong
- nagios - off course - with a pone app. When things are gone, I want to know that now
- cacti - I want graphs of a normal situation. That makes it easier to find a time frame from when it started to go not so normal anymore, but not alarming enough to spark off Nagios.
- email cron-jobs to a filtered account: first job every morning reading those logs: are the certificates updated that need updating, (certbot log), did all the updates install successfully, did backups run without an error message, aren't there a ridiculous number of mails in the mailq of the mailserver, ... all that stuff.
When ik goes wrong, I want to be the first to know and preferably a few hours before it will go wrong