To reduce the risks associated with hindsight bias, develop post-problem questionnaires in advance for each system, or class of system. When incidents happen and it is time to interview and observe the team, use the questionnaires as guides to templates.
Here are some questions that should give you a few starting points:
It is always beneficial to learn from mistakes and outages. Problem review boards analyzing an incident after the fact need to beware of allowing their knowledge of outcomes to bias their examination of the steps that led up to the event. They must pay appropriate attention to the processes and human factors that could create fertile environments for failure, not just the technical elements.
In this age of ever increasing complexity, there will always be incidents and underlying problems that must be addressed with proper organizational learning and corrective actions to keep the problem from popping up again.
One of the ways around the issues of security and control that make some businesses wary of cloud computing is to build a private cloud -- one that remains within the corporate firewall and is wholly controlled internally. Private clouds also increase the agility of IT an organization's IT infrastructure and make it easier to roll out new technology projects. Download this eBook to get the facts behind the private cloud and learn how your organization can get started.