Partial Outage for a Subset of Classic LMS Customers
Created Wed, 09 May 2018 00:00:00 +0000
Post-Mortem
Summary of impact: Between 11:05 and 11:55 BST on 9 May 2018, a subset of customers may have experienced difficulties connecting to Kallidus Suite applications.
Root cause and mitigation: Engineers determined that our in-memory data structure store, used primarily for caching, encountered an error. This affected several sites in different ways, depending on how and why they were accessing the cache at the time of the error. Engineers received numerous notifications from monitoring and immediately remediated the issue by clearing the data structure and restarting the website processes.
Next steps: We sincerely apologize for the impact to affected customers. We are continuously taking steps to improve the Kallidus Suite Platform and our processes to help ensure such incidents do not occur in the future. In this case, this includes (but is not limited to) ongoing work to improve integration with our Redis solution, and adding more monitored data points to expand our early warning system.
Posted: Wed, 09 May 2018 11:25:00 +0000