As there have been no further cases of the incident in the past week, we are now closing this incident.
Posted about 1 month ago. Jan 14, 2019 - 13:39 EST
We've completed what we hope is the final deployment for lowering overall database load. We will continue to monitor for the next few hours before closing this incident.
Posted about 1 month ago. Jan 11, 2019 - 15:36 EST
Our monitors indicate that older manifests are now working as expected.
Posted about 1 month ago. Jan 08, 2019 - 15:25 EST
We've deployed additional changes to further reduce database load. Unfortunately, we've also encountered a bug in validation of some older manifests and are currently working to deploy a quick fix for that. During this time, pulls of these older manifests will block.
Posted about 1 month ago. Jan 08, 2019 - 14:29 EST
We are currently making changes to our database and expect to be back in service shortly.
Posted about 1 month ago. Jan 02, 2019 - 22:43 EST
We've been monitoring the occasional database outages over the last week. At this point, we're planning on making further changes to better support load in hopes that the lockups will stop.
Posted about 2 months ago. Jan 02, 2019 - 15:09 EST
We have seen no additional recurrences of the issue in the past 10 hours or so, but we will continue to monitor as we await a response from our database provider.
Posted about 2 months ago. Dec 21, 2018 - 16:52 EST
We've had another instance of the database losing its ability to process queries. We have deployed further changes to reduce database load and will continue to monitor.
Posted about 2 months ago. Dec 21, 2018 - 00:33 EST
We've deployed a change to reduce load on the database in the hope of preventing the massive lockups that we've been seeing. We'll continue to monitor and report.
At this time we'd like to once again apologize to our customers for these problems. We know you trust us to manage and deploy your software, and we recognize how extremely frustrating it must be to encounter these issues.
Posted about 2 months ago. Dec 20, 2018 - 19:51 EST
We're continuing to see spikes of timeouts, locks and overall erratic behavior on the database server. We're currently investigating workarounds to reduce load on the system while simultaneously looking for issues on the database server itself.
Posted about 2 months ago. Dec 20, 2018 - 19:34 EST
All locks have resolved. We're continuing to monitor and to search for the cause.
Posted about 2 months ago. Dec 20, 2018 - 18:23 EST
Unfortunately, the database restart did not solve the problem. We're continuing to investigate why the database locks up for random intervals. We'll continue updating as we find more information.
Posted about 2 months ago. Dec 20, 2018 - 17:24 EST
We've failed our primary database over to its secondary and are currently rescaling the fleet. We will continue to monitor for any unusual spikes or problems.
Posted about 2 months ago. Dec 20, 2018 - 16:43 EST
The database problems we thought we mitigated earlier have returned. We're currently investigating.
Posted about 2 months ago. Dec 20, 2018 - 16:35 EST
This incident affected: Registry, API, Build System, Frontend, and Security Scanning.