Such and such service is probably having X, that means Y.
No users will be able to do anything.
It runs command XYZ
Unicorns jumping in to the cloud
- Ack the incident in pager duty if you haven't already
- Ack in the #ops channel that you are working on the incident
- If we are completely down, engage your secondary immediately
- Log in to X
- Run command Y
- Check X
- If X > B, then do C
Wait 15 minutes after everything looks green to call the event resolved.
Make a personal note to yourself on the incident and anything you learned.