AI Reliability Leaderboard
Compare 90-day reliability signals across 800+ AI services: confirmed incidents, community reports, monitoring confidence, and verified availability. Not a model-speed benchmark.
When most AI services have no confirmed hard outage, a single “most reliable” ranking would be misleading. DownForAI shows the underlying reliability signals instead.
Last updated: Sun, 05 Jul 2026 15:47:50 GMT
Key signals (90 days)
Most reported AI services (90d)
Raw community reports over the last 90 days. Not normalized by user base — popular services may receive more reports.
| Service | Category | Reports 90d | Community-detected outages | Confirmed incidents | Availability | Confidence |
|---|---|---|---|---|---|---|
| Civitai | Image | 93 | 3 | 0 | 100.00% | Status page |
| Chub AI | Roleplay AI | 75 | 3 | 0 | 100.00% | Basic probe |
| Ollama | LLM | 62 | — | 0 | 100.00% | Basic probe |
| OpenAI | LLM | 39 | 2 | 1 | 100.00% | Official API |
| SpicyChat AI | Roleplay AI | 39 | — | 0 | 100.00% | Basic probe |
| Google Gemini | LLM | 36 | — | 4 | 100.00% | Basic probe |
| HiWaifu | Roleplay AI | 32 | — | 0 | 100.00% | Basic probe |
| Chai AI | Roleplay AI | 29 | — | 0 | 100.00% | Basic probe |
| Kiro | Dev Tools | 20 | — | 0 | 100.00% | Basic probe |
| Devin (Cognition) | Dev Tools | 19 | — | 0 | 100.00% | Basic probe |
| NVIDIA NIM | Infrastructure | 19 | — | 0 | 100.00% | Status page |
| GitHub Copilot | Dev Tools | 19 | — | 0 | 100.00% | Official API |
| Talkie AI | Roleplay AI | 14 | — | 0 | 100.00% | Basic probe |
| Voicemod | Audio | 14 | — | 0 | 100.00% | Basic probe |
| Candy AI | Roleplay AI | 13 | — | 0 | 99.31% | Basic probe |
| Microsoft Copilot | LLM | 13 | — | 0 | 100.00% | Basic probe |
| Anthropic | LLM | 13 | — | 0 | 100.00% | Official API |
| LMArena | LLM | 9 | — | 0 | 100.00% | Basic probe |
| Nomi.ai | Roleplay AI | 9 | — | 0 | 100.00% | Basic probe |
| DeepSeek | LLM | 8 | — | 0 | 100.00% | Basic probe |
Confirmed incidents (90d)
Services with confirmed outage or degradation incidents observed by DownForAI. Incident minutes sum the duration of confirmed incidents.
| Service | Category | Incidents | Incident minutes | Degraded checks | Availability | Source |
|---|---|---|---|---|---|---|
| Elastic AI Search | Search | 6 | 21,829 | 0 | 0.00% | Official API |
| OpenAI Sora | Video | 2 | 7,653 | 88 | 100.00% | Official API |
| ChatGPT | LLM | 1 | 3,570 | 108 | 100.00% | Official API |
| OpenAI Operator | Agents | 1 | 3,570 | 108 | 100.00% | Official API |
| OpenAI | LLM | 1 | 3,570 | 433 | 100.00% | Official API |
| GPT Image (OpenAI) | Image | 1 | 3,525 | 109 | 100.00% | Official API |
| OpenAI API | Dev Tools | 1 | 3,525 | 109 | 100.00% | Official API |
| Google Gemini | LLM | 4 | 696 | 0 | 100.00% | Basic probe |
| Writecream | Marketing AI | 1 | 300 | 0 | 96.09% | Basic probe |
| Intercom Fin | Support AI | 1 | 180 | 0 | 100.00% | Official API |
| Snowflake Cortex | Vector DB | 2 | 120 | 0 | 99.44% | Official API |
| Linear | Productivity | 1 | 120 | 0 | 99.45% | Official API |
| MarsCode | Dev Tools | 1 | 60 | 0 | 98.01% | Basic probe |
| HeyGen | Video | 1 | 60 | 2 | 98.90% | Official API |
| Canva AI | Image | 1 | 60 | 0 | 98.87% | Official API |
| ElevenLabs | Audio | 1 | 60 | 0 | 100.00% | Official API |
| Applitools | Dev Tools | 1 | 60 | 0 | 98.25% | Basic probe |
| Stable Video Diffusion | Video | 1 | 30 | 0 | 100.00% | Official API |
| Stability AI | Image | 1 | 24 | 0 | 100.00% | Official API |
Officially monitored reliability leaders
Restricted to the 132 services with an official status API or status page, where availability is measured comparably. Grouped by outcome rather than ranked — most have no confirmed hard outage.
No confirmed incidents or degradations (90d) — 95
Confirmed degradations / incidents, no hard outage — 29
| Service | Source | Availability | Incidents | Degraded checks | Reports |
|---|---|---|---|---|---|
| OpenAI | Official API | 100.00% | 1 | 433 | 39 |
| Supabase | Official API | 100.00% | 0 | 210 | 2 |
| Cloudflare Workers AI | Official API | 100.00% | 0 | 180 | 0 |
| Couchbase Capella | Official API | 100.00% | 0 | 177 | 0 |
| Supabase Vector | Official API | 100.00% | 0 | 114 | 0 |
| GPT Image (OpenAI) | Official API | 100.00% | 1 | 109 | 0 |
| OpenAI API | Official API | 100.00% | 1 | 109 | 0 |
| ChatGPT | Official API | 100.00% | 1 | 108 | 2 |
| OpenAI Operator | Official API | 100.00% | 1 | 108 | 0 |
| Retool AI | Official API | 100.00% | 0 | 94 | 0 |
| OpenAI Sora | Official API | 100.00% | 2 | 88 | 0 |
| OpenAI Whisper | Official API | 100.00% | 0 | 88 | 0 |
| Zapier AI | Official API | 100.00% | 0 | 60 | 0 |
| Zoom AI Companion | Official API | 100.00% | 0 | 15 | 0 |
| GitHub Copilot | Official API | 100.00% | 0 | 7 | 19 |
| Anthropic | Official API | 100.00% | 0 | 6 | 13 |
| AI21 Labs | Official API | 100.00% | 0 | 4 | 0 |
| GitHub Models | Official API | 100.00% | 0 | 3 | 0 |
| v0 by Vercel | Official API | 100.00% | 0 | 2 | 0 |
| Vercel | Official API | 100.00% | 0 | 2 | 0 |
| Claude Chat | Official API | 100.00% | 0 | 2 | 3 |
| MongoDB Atlas Vector | Official API | 100.00% | 0 | 2 | 0 |
| Airtable AI | Official API | 100.00% | 0 | 1 | 0 |
| Render AI | Official API | 100.00% | 0 | 1 | 0 |
| Make (ex-Integromat) | Official API | 100.00% | 0 | 1 | 0 |
| Stable Video Diffusion | Official API | 100.00% | 1 | 0 | 0 |
| Stability AI | Official API | 100.00% | 1 | 0 | 0 |
| ElevenLabs | Official API | 100.00% | 1 | 0 | 0 |
| Intercom Fin | Official API | 100.00% | 1 | 0 | 0 |
Confirmed outages (90d) — 8
| Service | Source | Availability | Incidents | Degraded checks | Reports |
|---|---|---|---|---|---|
| Elastic AI Search | Official API | 0.00% | 6 | 0 | 0 |
| Canva AI | Official API | 98.87% | 1 | 0 | 0 |
| n8n | Status page | 98.90% | 0 | 0 | 0 |
| HeyGen | Official API | 98.90% | 1 | 2 | 0 |
| Writesonic | Status page | 99.44% | 0 | 0 | 0 |
| Snowflake Cortex | Official API | 99.44% | 2 | 0 | 0 |
| Datadog LLM | Official API | 99.44% | 0 | 4 | 0 |
| Linear | Official API | 99.45% | 1 | 0 | 0 |
Reliability leaders by category
Browse like-for-like reliability rankings within each category.
LLM · 60 tracked
View full LLM reliability rankings →Image · 55 tracked
View full Image reliability rankings →Video · 53 tracked
View full Video reliability rankings →Audio · 52 tracked
View full Audio reliability rankings →Dev Tools · 75 tracked
View full Dev Tools reliability rankings →Infrastructure · 50 tracked
View full Infrastructure reliability rankings →Search · 23 tracked
View full Search reliability rankings →Productivity · 55 tracked
View full Productivity reliability rankings →How reliable is our signal?
Monitoring confidence reflects how a service is observed — the quality of our measurement, not the service's reliability.
How this leaderboard works
- Availability = non-outage rate over a rolling 90-day window. “Down” means a hard OUTAGE only. A degraded period is not downtime — it is counted as a separate signal. A blocked or rate-limited probe is never counted as an outage.
- Confirmed incidents exclude false positives. Community reports are user-submitted, counted as raw totals not normalized by user base, and shown as their own independent signal — a popular service naturally receives more.
- Monitoring confidence (official API, status page, basic probe, limited, unverifiable) describes the quality of our measurement, not the service's reliability — we never rank by it.
- Each surface is re-checked roughly every 75 minutes. Any response-time figures shown elsewhere reflect the monitored surface (often a homepage or status page) — not model inference speed or tokens-per-second.