Issue preventing access to application

Incident Report for Digital Pigeon

Postmortem

An incident at AWS resulted in the majority of the application servers being taken out of action. The remaining application servers took up the load and were able to respond to most requests, with elevated response times, but a configuration issue meant it took 45 minutes before the replacement application servers were available.

We will be deploying infrastructure fixes this weekend to prevent this scenario from recurring.

Posted Nov 02, 2021 - 16:31 MDT

Resolved

The incident is resolved. We will provide a triage report soon.

Posted Nov 01, 2021 - 02:48 MDT

Monitoring

We have identified and rectified a server issue that caused slow response times and, in some cases, loss of access to the application. Access is restored, and we are closely monitoring the servers.

Posted Nov 01, 2021 - 01:44 MDT

Investigating

We are currently investigating this issue.

Posted Oct 31, 2021 - 23:54 MDT

This incident affected: Application and API Servers.