Status & incident communication

Where to watch operational state and how we communicate during incidents.

Status page

Live at status.xcity.one. Components:

  • Gateway (tokenhub) — inference API availability + p95 latency
  • Identity (auth.xcity.one) — login, session, registration
  • Billing — Stripe webhook ingestion and plan-state sync
  • Consolexcity.one/dashboard
  • Docsxcity.one/docs

Each component publishes a 90-day uptime history. JSON feed at /status.json; RSS at /status.rss.

Incident severity

LevelDefinitionPublic update cadence
InvestigatingAnomaly detected, scope unconfirmedwithin 15 min of detection
IdentifiedRoot cause known, fix in progressevery 30 min
MonitoringFix deployed, watchingevery hour
ResolvedService restored, postmortem pendingpostmortem within 5 business days

Enterprise customers get the same updates pushed to their dedicated Slack Connect channel within 60 seconds of the public post.

Postmortems

Within 5 business days of any S1 incident. We publish a redacted version on the status page and share the full version (including timelines, alerts, and corrective actions) with affected Enterprise customers.

Our template includes: detection → diagnosis → mitigation → root cause → corrective action → action items with owners. We follow blameless review practice — naming systems and processes, not individuals.

Scheduled maintenance

Announced ≥48h in advance via:

  • Status page (banner)
  • Email to billing-admin contacts
  • Slack Connect for Enterprise

Emergency security patches with <15 min duration may go out without advance notice. We document them retroactively on the status page.

Last updated: