How We Improved Our Monitoring Stack With Only a Few Small Changes
The article discusses the process of improving the monitoring system at Riskified. The team identified pain points and goals, including bottleneck on changes in the monorepo, crashing Prometheus, inability to silence alerts easily, and removing hardcoded secrets in Alertmanager config. They consid..