Alerting on Cron Jobs That Run Too Fast vs. Heartfly
As engineers, we're well-versed in the common pitfalls of cron jobs and scheduled tasks: they fail to run, they run too long, or they encounter an error and exit with a non-zero status code. We've built robust monitoring around these scenarios, often relying on simple checks: "Did the job start?" "Did it finish within X minutes?" "What was its exit code?"
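The three conventional checks above can be sketched in a few lines of Python. This is a minimal illustration, not a description of any particular monitoring tool: the `JobRun` record, its field names, and the `conventional_alerts` function are hypothetical, and the sketch assumes the scheduler records start time, finish time, and exit code for each run.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional


@dataclass
class JobRun:
    """Hypothetical record of one scheduled run; field names are illustrative."""
    started_at: Optional[datetime]   # None if the job never started
    finished_at: Optional[datetime]  # None if the job has not finished yet
    exit_code: Optional[int]         # None if no exit code was recorded


def conventional_alerts(run: JobRun, max_duration: timedelta, now: datetime) -> List[str]:
    """Return alert messages for the three conventional checks."""
    # Check 1: did the job start?
    if run.started_at is None:
        return ["job did not start"]

    alerts = []

    # Check 2: did it finish within the allowed time?
    # For a run that is still in progress, measure elapsed time up to `now`.
    end = run.finished_at if run.finished_at is not None else now
    if end - run.started_at > max_duration:
        alerts.append("job exceeded max duration")

    # Check 3: what was its exit code?
    if run.exit_code is not None and run.exit_code != 0:
        alerts.append(f"job exited with status {run.exit_code}")

    return alerts
```

Under these assumptions, a run that finishes in two seconds with exit code 0 produces an empty list: none of the three checks has anything to say about a job that ends sooner than expected.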
But what about the more insidious problem: when a cron job runs too fast? It completes seemingly