Discovered: Apr 14, 2025 19:45 UTC
Resolved: Apr 15, 2025 04:05 UTC
A configuration change related to Meraki Devices.
About 55% of tenants in US4 became inaccessible due to increased traffic and system load.
Action taken
All times are in UTC
04/14/2025
19:45 - Auvik receives internal alerts for abnormal CPU usage on its backend systems for the US4 cluster.
19:50 - Engineering begins an investigation into the issue, actively taking measures to stabilize the system.
20:42 - A large number of sites become inaccessible, and Auvik implements its incident response.
20:42-21:45 - Engineering continues to investigate.
21:45 - A possible root cause of the issue is identified, and Engineering begins recovering sites.
04/14/25-04/15/25
21:45 - 00:10 - Engineering continues to bring most of the affected sites back online.
04/15/25
00:10 - All sites, except one client, are back up and accessible.
00:10-01:00 - Auvik continues to work on bringing the last client tenants online and getting them up and running.
01:00 - A root cause is determined for the cause of the incident. Engineering creates mitigation steps.
01:00-03:05 - Mitigation steps are implemented, and the remaining sites of the last client are brought online and accessible.