99.99% Uptime. Always.

We monitor every layer of our infrastructure in real time — from individual pods to API endpoints — so transit agencies never have to worry.
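For a sense of scale, a 99.99% uptime target leaves only a small downtime budget. The sketch below works out that budget. The 30-day and 365-day windows and the function name are assumptions chosen for illustration, not terms of our SLA.

```python
def downtime_budget_minutes(uptime_percent: float, window_days: int) -> float:
    """Minutes of downtime allowed in a window at the given uptime target."""
    total_minutes = window_days * 24 * 60
    return total_minutes * (1 - uptime_percent / 100)


# Illustrative windows only; the actual SLA measurement period may differ.
monthly = downtime_budget_minutes(99.99, 30)
yearly = downtime_budget_minutes(99.99, 365)
print(f"{monthly:.2f} minutes per 30 days")
print(f"{yearly:.2f} minutes per 365 days")
```

At 99.99%, that is roughly four minutes a month, or under an hour a year.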

99.99%

Uptime SLA

17+ Weeks

Continuous pod uptime

100%

Endpoint success rate

Unplanned restarts

Monitoring Stack

Five tools. Every layer covered.

Real screenshots from our live monitoring dashboards — this is what our infrastructure looks like right now.

Grafana: Pod Health

Tracks individual pod uptime, health status, and restart count across all production services.

17.60 weeks continuous pod uptime

Grafana: Cluster Resources

Monitors cluster-wide CPU and memory usage to detect resource pressure before it affects service.

0.43% CPU average cluster load

Kibana / Elastic APM: API Performance

Measures API response latency and throughput for every endpoint, with full request tracing.

0% error rate across all endpoints

Firebase: Endpoint Success

Tracks success rate for all Firebase endpoints serving the Chartr mobile and rider-facing apps.

100% endpoint success rate

AWS CloudWatch + Slack: Infrastructure Alerts

AWS CloudWatch watches Kubernetes in production and fires instant Slack alerts for any anomaly.

Real-time alert delivery to Slack

Philosophy

How it works

Multi-layer observability

Every layer is watched — from individual Kubernetes pods to database connections, API response times, and mobile app endpoints. Nothing runs dark.

Real-time alerting to Slack

The moment anything crosses a threshold, the engineering team is notified via Slack. No waiting for a dashboard — alerts come to where the team already works.
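As a rough illustration of threshold-based alerting, the sketch below builds a Slack message when a metric crosses its limit. It is a simplified stand-in, not our production alerting path (which runs through AWS CloudWatch). The metric names, thresholds, and function names are hypothetical. The only external detail relied on is that Slack incoming webhooks accept a JSON body with a `text` field.

```python
import json
import urllib.request
from typing import Optional


def build_alert(metric: str, value: float, threshold: float) -> Optional[dict]:
    """Return a Slack message payload if value exceeds threshold, else None."""
    if value <= threshold:
        return None
    return {"text": f"ALERT: {metric} is {value:g}, above threshold {threshold:g}"}


def send_to_slack(webhook_url: str, payload: dict) -> int:
    """POST the payload to a Slack incoming webhook and return the HTTP status."""
    request = urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status


# Hypothetical readings: two restarts against a limit of zero triggers an alert,
# while 0.43% CPU against an assumed 80% limit does not.
alert = build_alert("pod_restarts_last_hour", 2, 0)
quiet = build_alert("cluster_cpu_percent", 0.43, 80)
```

Calling `send_to_slack` with a real webhook URL and the `alert` payload would post the message to the configured channel. The example stops short of that so it can run without credentials.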

No single point of failure

Multiple independent monitoring systems (Grafana, Kibana, Firebase, AWS CloudWatch) mean that monitoring does not depend on any single tool. If one tool misses something, another can catch it.
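One way to picture overlapping monitors is a check that flags a problem when any source reports trouble, and also when a source goes quiet. This is a minimal sketch under assumed names: the `Reading` type, the five-minute staleness limit, and the sample values are all hypothetical and do not describe our actual tooling.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Reading:
    source: str         # which monitoring tool produced the reading
    healthy: bool       # that tool's own verdict
    age_seconds: float  # time since the tool last reported


def needs_attention(readings: List[Reading], max_age_seconds: float = 300) -> List[str]:
    """List reasons to investigate: a stale monitor or an unhealthy verdict."""
    reasons = []
    for reading in readings:
        if reading.age_seconds > max_age_seconds:
            # A silent monitor is itself a signal, so it is reported first.
            reasons.append(f"{reading.source}: no fresh data")
        elif not reading.healthy:
            reasons.append(f"{reading.source}: reports a problem")
    return reasons


# Hypothetical snapshot: one tool has gone quiet, another sees a problem.
snapshot = [
    Reading("Grafana", healthy=True, age_seconds=20),
    Reading("Kibana", healthy=True, age_seconds=900),
    Reading("CloudWatch", healthy=False, age_seconds=15),
]
reasons = needs_attention(snapshot)
```

Treating stale data as a finding is the design choice that matters here: a monitor that stops reporting is surfaced rather than silently trusted.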

Want to know more about our infrastructure?

We are happy to walk you through our reliability practices and SLAs.

Get in touch