
Multi-cloud · failover

Your app shouldn't die because AWS us-east-1 is having a bad day.

Cendriix orchestrates across AWS, GCP, Azure, and OCI. Pin to one cloud, span two, or let Cendriix decide based on live cost and uptime data. We handle DNS, secrets, and data residency for you. Your app stays up when a region doesn't.

Four failover modes.

Every mode is configured via one policy file. No bespoke runbooks. No on-call engineer in the failover path: Cendriix executes it.
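As a rough illustration of what a single policy might capture for the four modes, here is a minimal Python sketch. The names `Mode`, `FailoverPolicy`, `clouds`, and the validation rules are assumptions made for this example; they are not Cendriix's actual policy schema.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    PIN = "pin"
    ACTIVE_PASSIVE = "active-passive"
    ACTIVE_ACTIVE = "active-active"
    SOVEREIGN = "sovereign"


# Number of clouds each mode spans, taken from the mode descriptions.
# SOVEREIGN is left unconstrained because no count is stated for it.
CLOUDS_REQUIRED = {Mode.PIN: 1, Mode.ACTIVE_PASSIVE: 2, Mode.ACTIVE_ACTIVE: 2}


@dataclass(frozen=True)
class FailoverPolicy:
    """Hypothetical in-memory form of one policy file."""
    mode: Mode
    clouds: tuple[str, ...]

    def validate(self) -> None:
        if not self.clouds:
            raise ValueError("at least one cloud is required")
        if len(set(self.clouds)) != len(self.clouds):
            raise ValueError("clouds must be distinct")
        required = CLOUDS_REQUIRED.get(self.mode)
        if required is not None and len(self.clouds) != required:
            raise ValueError(
                f"{self.mode.value} expects {required} cloud(s), "
                f"got {len(self.clouds)}"
            )


# Usage: an active-passive policy spanning two clouds passes validation.
policy = FailoverPolicy(Mode("active-passive"), ("aws", "gcp"))
policy.validate()
```

Validating the policy up front means a misconfiguration, such as a Pin policy that lists two clouds, is caught before any failover is attempted.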

Pin (recommended for early-stage)
Single cloud · single region
One cloud, one region: the simplest topology. Recommended for early-stage products that want maximum development velocity and minimum operational surface area.

Active-passive
Two clouds · ~90s RTO
Two clouds. Cendriix continuously replicates state to the passive side. If the active region goes down, Cendriix promotes the passive side and re-routes DNS in under 90 seconds, with no human intervention.

Active-active
Two clouds · sub-5s RTO
Both clouds serve live traffic simultaneously. Cendriix splits traffic via weighted DNS, reconciles writes using a CRDT-backed merge layer, and routes each request to the nearest region to keep latency low.

Sovereign
Customer-region pinning
EU users hit eu-west-1; US users hit us-east-1. Data residency is enforced at the DNS layer, and GDPR compliance is maintained without bespoke engineering.
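The active-active mode mentions a CRDT-backed merge layer without saying which CRDT is used. To show the general idea, the sketch below implements a grow-only counter (G-Counter), one of the simplest CRDTs. It is an illustrative stand-in, not Cendriix's merge layer, and the class and replica names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class GCounter:
    """Grow-only counter: one slot per replica, merged by element-wise max."""
    counts: dict[str, int] = field(default_factory=dict)

    def increment(self, replica: str, amount: int = 1) -> None:
        # Each replica only ever increases its own slot.
        if amount < 0:
            raise ValueError("a grow-only counter cannot decrease")
        self.counts[replica] = self.counts.get(replica, 0) + amount

    def merge(self, other: "GCounter") -> "GCounter":
        # Taking the max per slot makes merge commutative, associative,
        # and idempotent, so replicas converge in any merge order.
        keys = self.counts.keys() | other.counts.keys()
        return GCounter(
            {k: max(self.counts.get(k, 0), other.counts.get(k, 0)) for k in keys}
        )

    @property
    def value(self) -> int:
        return sum(self.counts.values())


# Two clouds accept writes independently, then reconcile.
aws_side = GCounter()
gcp_side = GCounter()
aws_side.increment("aws", 3)
gcp_side.increment("gcp", 2)
merged = aws_side.merge(gcp_side)
```

Because the merge is order-independent, both clouds reach the same state no matter which side syncs first, which is the property that lets two regions take writes at the same time.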

What Cendriix does automatically when a cloud goes down.

Seven steps, none of which needs one of your engineers awake at 3 AM. Every action is logged to the immutable audit trail.

1. Detect
Three independent health sources are cross-referenced: the provider status page, Cendriix synthetic probes, and your app-level metrics.
agent.sre.healthcheck.multi_source(region="us-east-1")

2. Drain
In-flight requests are allowed to complete. New connections stop routing to the failing region. Connection draining window: 30s.
agent.sre.traffic.drain(region="us-east-1", grace_s=30)

3. Promote passive
The standby environment is promoted to active. Read replicas become writers. Sequence counters are reconciled.
agent.sre.failover.promote(target="eu-west-1", mode="active-passive")

4. Re-route DNS
Route 53 / Cloud DNS / Traffic Manager health checks are updated. DNS TTL flushes in < 30 seconds. Users see no interruption.
agent.sre.dns.failover(ttl=30, provider="route53")

5. Notify
Slack, PagerDuty, and email alerts fire simultaneously. Each alert includes the severity, affected region, estimated RTO, and a link to the Cendriix ops console.
agent.notify.incident(severity="p1", channels=["slack","pagerduty","email"])

6. Draft postmortem
Cendriix writes a structured postmortem draft: timeline, root-cause hypothesis, action items. The draft is placed in your incident tracker for human review.
agent.docs.postmortem.draft(incident_id="inc-2026-0519-01")

7. Rebalance
When the failed cloud recovers, Cendriix re-balances traffic on your configured schedule, not automatically, to avoid flapping.
agent.sre.traffic.rebalance(schedule="business_hours", confirm=true)
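The steps above can be read as an ordered pipeline gated by a detection check. The Python sketch below assumes a 2-of-3 quorum across the three health sources (the page says they are cross-referenced, not how many must agree) and uses stub handlers in place of the real `agent.*` calls. All function and step names here are hypothetical.

```python
from typing import Callable

STEPS = [
    "detect", "drain", "promote", "reroute_dns",
    "notify", "draft_postmortem", "rebalance",
]


def region_is_down(status_page_down: bool, probes_down: bool,
                   app_metrics_down: bool, quorum: int = 2) -> bool:
    """Assumed rule: treat the region as down when `quorum` sources agree."""
    return sum([status_page_down, probes_down, app_metrics_down]) >= quorum


def run_failover(signals: tuple[bool, bool, bool],
                 handlers: dict[str, Callable[[], None]]) -> list[str]:
    """Run steps 2-6 in order if detection passes; return an audit log."""
    audit_log: list[str] = []
    if not region_is_down(*signals):
        audit_log.append("detect: no quorum, no action")
        return audit_log
    audit_log.append("detect: quorum reached")
    for step in STEPS[1:-1]:
        handlers[step]()
        audit_log.append(f"{step}: done")
    # Step 7 is not run inline: it waits for recovery and the
    # configured schedule, which avoids flapping.
    audit_log.append("rebalance: deferred to configured schedule")
    return audit_log


# Usage with stub handlers that only record that they were called.
calls: list[str] = []
stub_handlers = {s: (lambda s=s: calls.append(s)) for s in STEPS}
log = run_failover((True, True, False), stub_handlers)
```

Requiring agreement between sources means a single stale status page or one noisy probe does not trigger a failover on its own.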

See this in action: open the sandboxes console

Data residency.

Choose a region at workspace creation. Orchestrator state and audit events stay in that region. Your source code, secrets, and production data never leave your cloud.
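One way to picture the residency rule is a guard that refuses to write orchestrator state or audit events to a store outside the workspace's region. This is a minimal Python sketch; `Workspace`, `ResidencyError`, and `check_write` are hypothetical names, not part of any Cendriix API.

```python
from dataclasses import dataclass

# The two kinds of data the page says stay in the workspace region.
REGION_BOUND_KINDS = {"orchestrator_state", "audit_event"}


class ResidencyError(Exception):
    """Raised when a write would leave the workspace's region."""


@dataclass(frozen=True)
class Workspace:
    name: str
    region: str  # chosen at creation; frozen, so it cannot be reassigned


def check_write(workspace: Workspace, store_region: str, kind: str) -> None:
    """Allow a write only if the target store is in the workspace region."""
    if kind not in REGION_BOUND_KINDS:
        raise ValueError(f"unknown data kind: {kind!r}")
    if store_region != workspace.region:
        raise ResidencyError(
            f"{kind} for {workspace.name} must stay in {workspace.region}, "
            f"not {store_region}"
        )


# Usage: a Dublin workspace may write audit events only to eu-west-1.
ws = Workspace(name="acme-prod", region="eu-west-1")
check_write(ws, "eu-west-1", "audit_event")
```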

Region          Data center       GDPR  CCPA  HIPAAⁱ  Available clouds
us-east-1       N. Virginia       Yes   Yes   -       AWS, Cendriix
us-west-2       Oregon            Yes   Yes   -       AWS, GCP, Cendriix
eu-west-1       Dublin            Yes   -     -       AWS, Cendriix
eu-central-1    Frankfurt         Yes   -     -       AWS, Azure, GCP
ap-southeast-1  Singapore         -     -     -       AWS, Azure
ap-northeast-1  Tokyo             -     -     -       AWS, GCP
gov-cloud       US GovCloud West  Yes   Yes   -       AWS GovCloud

ⁱ HIPAA BAA is on the roadmap, not yet available. No HIPAA-covered data should be processed until a BAA is executed.

Ready to stop worrying about us-east-1?

Cendriix handles the failover. You ship the product.