At 3:17 AM on a Tuesday in Q3 2024, our production Kotlin 2.0 microservice fleet hit a 92% memory utilization threshold across 140 nodes, traced to a silent coroutine leak in Ktor 2.2’s request pipeline that had been bleeding 12MB of heap per second for 72 hours. We lost $14k in SLO credits before we found the root cause. A Couple Million Lines of Haskell: Production Engineering at Mercury (78 p
Postmortem: How a LangGraph 0.1 Multi-Agent Bug Broke Our 2026 Customer Support Bot Executive Summary On October 12, 2026, our production customer support bot experienced a 4-hour partial outage caused by an unpatched edge case in LangGraph 0.1’s multi-agent orchestration layer. The bug triggered infinite agent handoff loops for 18% of inbound customer queries, leading to SLA breaches