Alerting on Cron Jobs That Finish Too Fast
As engineers, we often set up cron jobs and other scheduled tasks to automate crucial parts of our infrastructure. We meticulously configure them, test them, and then, typically, we monitor them. Most of the time, our monitoring focuses on the obvious failures: jobs that never start, jobs that hang indefinitely, or jobs that crash with an error. We want to know if a job isn't running or isn't finishing.
But what about the silent killer? The job that does run, does finish