All Systems Operational · Last updated:
All Systems Operational

MetricsHub Platform Status

Real-time health and uptime history for the MetricsHub API Gateway.

Last checked:
Core Services
Ingest API
POST /v2/ingest · POST /v2/metrics · POST /v2/logs
99.98% uptime (90d)
● Operational
Stream Processor
Enrichment, routing rules, transformation pipeline
99.99% uptime (90d)
● Operational
Query API
GET /v2/query · aggregation, time-range queries
99.95% uptime (90d)
● Operational
Time-Series Storage
Durable event store, write-ahead log, hot/warm/cold tiers
100.00% uptime (90d)
● Operational
Platform Services
Authentication & API Keys
Token validation, rate limit enforcement
100.00% uptime (90d)
● Operational
Developer Portal
Dashboard, API key management, usage analytics
99.93% uptime (90d)
● Operational
Webhook Delivery
Outbound event webhooks, retry queue
99.97% uptime (90d)
● Operational
Regional Ingest Nodes
🇩🇪 EU West — Frankfurt (fra1)
Primary region · fra1-a, fra1-b, fra1-c
99.97% uptime (90d)
● Operational
🇺🇸 US East — Ashburn (iad1)
iad1-a, iad1-b
99.99% uptime (90d)
● Operational
🇸🇬 APAC — Singapore (sin1)
sin1-a, sin1-b
100.00% uptime (90d)
● Operational

Subscribe to status updates

Get notified by email when incidents are created, updated, or resolved.

Incident History
✓ No incidents in the past 30 days All systems have been operating normally. See below for historical incidents.
Elevated error rates on EU West Ingest API
Incident #INC-20250122  ·  Started: 22 Jan 2025, 03:14 UTC  ·  Duration: 11 min
Resolved
22 Jan 03:25 UTC
Resolved
The issue has been fully resolved. All error rates have returned to normal levels. A post-mortem will be published within 48 hours. Impact was limited to EU West (fra1-b node) — fra1-a and fra1-c were unaffected. No data loss occurred.
22 Jan 03:19 UTC
Monitoring
A fix has been deployed to the affected node. Error rates are dropping. We are monitoring to confirm full recovery.
22 Jan 03:14 UTC
Identified
The root cause has been identified: a connection pool exhaustion on fra1-b caused by a misconfigured upstream timeout following the v2.4.0 deployment. A rollback is being applied to the affected node.
22 Jan 03:11 UTC
Investigating
We are investigating elevated HTTP 500 error rates on the EU West Ingest API endpoint. Other regions (US East, APAC) are operating normally. Impact: approximately 3% of ingest requests to fra1 returning 500 errors.
Scheduled maintenance — Storage tier compaction
Maintenance #MNT-20241215  ·  15 Dec 2024, 02:00 – 03:30 UTC (90 min)
Completed
15 Dec 03:28 UTC
Completed
Scheduled maintenance completed 2 minutes ahead of schedule. Storage compaction was successful. Query performance improvements of 8–12% are expected for time-range queries over 30-day windows. All services are fully operational.
15 Dec 02:00 UTC
In progress
Scheduled maintenance window has begun. The Query API may experience degraded performance (increased latency) for the duration. Ingest API is unaffected. This maintenance was announced 72 hours in advance.
Developer Portal — intermittent login failures
Incident #INC-20241108  ·  8 Nov 2024, 16:42 UTC  ·  Duration: 23 min
Resolved
8 Nov 17:05 UTC
Resolved
The session token signing issue has been resolved by restarting the affected auth service replica. All logins are completing normally. API functionality was not impacted — only the developer portal web interface was affected.
8 Nov 16:52 UTC
Identified
Root cause identified: a session token signing key rotation triggered by our 90-day auto-rotation policy was not propagated correctly to all auth service replicas. Fix in progress.
8 Nov 16:42 UTC
Investigating
We are investigating reports of intermittent login failures on the developer portal. Users may see "Invalid session" errors when signing in. Affected: approximately 15% of portal login attempts. API access via API keys is unaffected.