Energy utility, DACH
Migrating a monolithic ERP landscape to the cloud
Zero-downtime migration of 14 legacy services to Kubernetes — with a GitOps pipeline and 70% lower infrastructure costs.
Mid-cap fintech
Built an SLO framework, observability stack and chaos-engineering practice — mean time to recovery down from 47 to 6 minutes.
A mid-sized fintech runs a real-time trading system where every minute of downtime directly costs money. Incidents were handled reactively and ad hoc, with no clear metrics and no dependable alerting.
We established an SLO framework that makes availability measurable and built an observability stack that surfaces causes instead of symptoms. Runbooks, on-call processes and regular chaos-engineering exercises keep the team ready to act before an emergency occurs.
Mean time to recovery reduced from 47 to 6 minutes
99.98% availability in the first year of operation
Alert volume cut by 80%
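To illustrate how an SLO makes availability measurable, the following minimal Python sketch converts an availability target into an annual error budget, meaning the downtime allowed before the target is missed. It treats the 99.98% figure above as a hypothetical target and assumes a 365-day window and incidents that each last exactly the mean time to recovery. The function names are illustrative and not part of the project's actual tooling.

```python
# Illustrative sketch only: names and assumptions are hypothetical,
# not taken from the project's actual SLO tooling.

MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes; assumes a 365-day window


def error_budget_minutes(slo_target: float,
                         window_minutes: int = MINUTES_PER_YEAR) -> float:
    """Return the downtime allowed in the window before the target is missed."""
    if not 0.0 < slo_target <= 1.0:
        raise ValueError("slo_target must be in the range (0, 1]")
    return (1.0 - slo_target) * window_minutes


def incidents_within_budget(budget_minutes: float, mttr_minutes: float) -> int:
    """Return how many incidents fit in the budget.

    Assumes every incident lasts exactly the mean time to recovery.
    """
    if mttr_minutes <= 0:
        raise ValueError("mttr_minutes must be positive")
    return int(budget_minutes // mttr_minutes)


# Assumption: 99.98% is used as the availability target for one year.
budget = error_budget_minutes(0.9998)
incidents_at_6_min = incidents_within_budget(budget, 6)
incidents_at_47_min = incidents_within_budget(budget, 47)

print(f"Annual error budget: {budget:.2f} minutes")
print(f"Incidents absorbed at 6 min MTTR: {incidents_at_6_min}")
print(f"Incidents absorbed at 47 min MTTR: {incidents_at_47_min}")
```

Under these assumptions, a 99.98% target leaves roughly 105 minutes of downtime per year, enough for 17 incidents at 6 minutes each but only 2 at 47 minutes.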