Incidents are encoded messages your system is sending you
about how it really works.

We help you decode them.


We work alongside the Consortium for Resilient Internet-Facing Business IT (“SNAFUCatchers”) at Ohio State University. You might know our work from the Stella Report; our keynote video on it is here.

John Allspaw

John Allspaw has worked in software systems engineering and operations for over twenty years in many different environments: biotech, government, online media, social networking, and e-commerce. John’s publications include the books The Art of Capacity Planning (2009) and Web Operations (2010) as well as the foreword to “The DevOps Handbook”. His 2009 Velocity talk with Paul Hammond, 10+ Deploys Per Day: Dev and Ops Cooperation, helped start the DevOps movement.

John served as SVP of Infrastructure and Operations and then CTO at Etsy, and holds an MSc in Human Factors and Systems Safety from Lund University.

Richard Cook

Dr. Richard Cook is a research scientist, physician, and pioneer in Resilience Engineering for safety in complex risk-critical worlds, and author of the seminal paper “How Complex Systems Fail” (link) (video) as well as Behind Human Error (2010).

Richard is presently a research scientist in the Department of Integrated Systems Engineering at the Ohio State University and emeritus professor of healthcare systems safety at Sweden’s KTH. (publications)

David Woods

Dr. David Woods founded Resilience Engineering as an approach to safety in complex systems in 2000-2003 as part of the response to several NASA accidents.

David is currently a professor in the Department of Integrated Systems Engineering at the Ohio State University. For almost 40 years he has done pioneering research on the interaction between humans and technology in risk-critical activities. His books on safety and resilience engineering include Resilience Engineering: Concepts and Precepts (2006), Behind Human Error (2010), and Resilience Engineering in Practice (2011). (publications)