Discovered: Oct 26, 2023, 05:30 - UTC
Resolved: Oct 27, 2023, 18:52 - UTC
Disk space ran out on the processing disks for Syslog on the US2 cluster.
Syslog message delivery was stopped to clients on the US2 cluster.
All times in UTC
10/26/2023
05:30 - An internal alert was created that Syslog messaging was not working on the US2 cluster.
07:15 - Auvik Engineering begins its investigation.
08:30 - Engineering begins action to increase disk space to be able to process Syslog messages.
09:20 - Engineering alters data retention policy to ensure no data is lost due to the delay.
10:02 - Engineering triggers the new policy to test rollout.
11:10 - Engineering validates new settings and proceeds to see data lag continue to shrink and customer information now flows appropriately.
11:15 - The initial incident is marked as closed.
10/27/2023
09:10 - Data was checked for the cluster as part of standard operating procedure. Data restored by the policy implementation was no longer there.
09:20 - The Auvik Engineering team proceeds to launch an investigation.
09:45 - Engineering confirms that Syslog data from the last 20 days was absent for US2 cluster clients.
10:10 - 10:35 - The log entry for why the Syslog data was deleted was located. The location of the backup of the data was also obtained.
10:43 - Engineering begins to restore the absent Syslog data to the US2 cluster.
10:43 - 18:50 - The data for the Syslog messages is restored to the US2 cluster for clients.
18:52 - The restoration is finished. The incident is closed.