Overview
Incident name: Ineffective planning of isolated database queries led to degraded app experience
Date and time: 2025-10-02 13:47–14:01 ET
Affected areas: Air’s web app
Status: Resolved
Customer impact
- Some users experienced slow loading or were unable to load the Air app for roughly 15 minutes. The issue was fully resolved the same day.
- Time window: Approximately 1:47–2:01 PM ET on October 2, 2025
- Data and security: No data loss or security exposure occurred
What happened
Multiple long-running queries caused contention on the primary database, which made the app unavailable for some users and raised error rates. Once mitigation reduced the load, query performance returned to normal and the app recovered.
Root cause
- Primary cause: On the writer (the primary database instance), the query planner failed to use an index on the clip table for a frequently executed query. Each execution therefore scanned the full table, and under load those scans spilled to disk.
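For readers who want to see the difference between a full table scan and an index lookup, the sketch below reproduces the pattern on a small scale. It is an illustration only, not Air's actual schema, query, or database engine: it uses Python's built-in SQLite as a stand-in, and the column names (workspace_id, title), the index name (clip_workspace_idx), and the helper functions are hypothetical.

```python
import sqlite3


def plan_for(conn, sql, params=()):
    """Return the planner's description of how it would run `sql`."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # Each row is (id, parent, notused, detail); the detail text names the strategy.
    return " | ".join(row[3] for row in rows)


def is_full_scan(plan):
    """SQLite reports a full table scan with a detail line starting with SCAN."""
    return plan.startswith("SCAN")


# Stand-in table: the real clip table and its columns are not described
# in this report, so these columns are assumptions for illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE clip (id INTEGER PRIMARY KEY, workspace_id INTEGER, title TEXT)"
)
conn.executemany(
    "INSERT INTO clip (workspace_id, title) VALUES (?, ?)",
    [(i % 50, f"clip {i}") for i in range(1000)],
)

query = "SELECT title FROM clip WHERE workspace_id = ?"

# With no usable index, the planner has to read every row.
plan_without_index = plan_for(conn, query, (7,))

# With an index on the filtered column, the planner can jump to matching rows.
conn.execute("CREATE INDEX clip_workspace_idx ON clip (workspace_id)")
plan_with_index = plan_for(conn, query, (7,))

print(plan_without_index)
print(plan_with_index)
```

In the incident the index already existed and the planner did not choose it, so the first plan above corresponds to what the database was doing during the incident window. Inspecting plans like this for frequently executed queries is one way to catch that situation before it causes contention.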
Timeline (high level)
- 13:47: Degradation reported; app fails to load.
- 13:48: Elevated error rate confirmed; incident channel opened.
- 14:00: Error rates decrease and app loads.
- 14:01: Huddle begins; root cause investigation continues.
- 2025-10-03 04:56: Database parameter change applied to reduce spill risk for heavy queries.
Preventative actions
Frequently asked questions
Need help?
If you notice anything unexpected, please reach out to your Air contact or reply to your most recent support thread and we’ll follow up immediately.