Sticking Together while Staying Apart
Resilience in the time of global pandemic
Aaron Aldrich @CrayZeigh

Million-to-one chances… crop up nine times out of ten. —Sir Terry Pratchett GNU

Everything’s a little bit broken all of the time… but it keeps working anyway

Resilience

Resilience
• Rebound
• Graceful Extensibility
• Robustness
• Sustained Adaptability

Rebound
Return to “normal” after a surprise or traumatic incident.
Work done ahead of time

Robustness
The ability to withstand and absorb well-modeled disturbances.
Known-knowns

Graceful Extensibility
The ability to stretch with challenges to operational boundaries.
Opposed to brittleness.

Sustained Adaptability
Recognizing and managing adaptive capabilities over long timescales.
Requires people

Bone
• Continuously created and destroyed
• Reconstruction directed by mechanical strain
• Process directed by signals through layered networks at cell-level

https://youtu.be/8LbePBiOvZ4

• Rebound
• Graceful Extensibility
• Robustness

Socio-Technical Systems

Conway’s Law
Designed systems represent an organization’s communication structure

Blunt End: Removed from experience, upstream decision makers
Sharp End: Closest to the work, practitioners

Sharp End: Closest to the work, practitioners
• Constantly building and destroying systems
• Strong signaling
• Improve systems based on strain
• Will do so naturally given ownership

Teams that do well dealing with impact [surprises/incidents] are those that have a strong common ground —J. Paul Reed (@jpaulreed), Sr Applied Resilience Engineer, Netflix, Failover Conf

If we want to improve a team’s resilience, we must build a strong common ground —Me, Just now.

Common Ground
• Basic Compact
• Goal Alignment/Commitment
• Inter-predictability
• Sustain & Repair

Building Common Ground
• Blameless Postmortems
• Chaos Engineering
• Game Days
• Modeling Vulnerability

Our analysis found that this culture of psychological safety is predictive of software delivery performance, organizational performance and productivity. —Accelerate State of DevOps 2019

https://youtu.be/SgCGD7rutSw

Resilience is about creating the conditions that maximize everyone’s potential —Rein Henrichs, >Code Podcast, 174: Resilience

What happens when governments fail?

It’s left to us

Community Building is Resilience Engineering —Me again, just now again.

Strong Communities
• Diverse
• High Trust & Safety
• Sustain & Repair
• Inter-predictability
• Loosely Coupled, layered networks

https://bit.ly/2Ym7Tp9

https://desertedislanddevops.com

https://youtu.be/L9A6ZauhOhg

Enable potential and get out of the way

Slides & Resources: speaking.crayzeigh.com
OSMIhelp.org
Aaron Aldrich, Managed OpenShift Black Belt
EmotionalAPI.com
@CrayZeigh
devopsdays.org

twitch.tv/desertedislandtv discord.gg/CPM5Jcg

I love you
Do Good out there
We’re all in this together

Watching/Listening
• Four concepts for resilience and the implications for the future of resilience engineering - David Woods
  https://bit.ly/3bITTdc
• The Marvelous Resilience of Bone - Dr. Richard Cook, REdeploy 2019
  https://www.youtube.com/watch?v=8LbePBiOvZ4
• Greater Than Code, 174: Resilience
  https://www.greaterthancode.com/resilience
• The Worst Year Ever, How to Save your Community When The Government Fails
  https://ihr.fm/3eVNFbI

Watching/Listening
• Behind Human Error (2nd Edition) - Woods, Dekker, Cook, Johannesen, Sarter
• The Woolworths Experiment
  https://safetydifferently.com/the-woolworths-experiment/
• The Field Guide to Understanding Human Error - Sidney Dekker
• Literally every video from REdeploy
  https://www.youtube.com/channel/UCHbJcI6Kfyx Rqdv26b3Qw
• On Borrowing From Yourself - Aaron Aldrich
  https://dev.to/crayzeigh/a-reflection-on-borrowing-from-yourself-3jhf

Watching/Listening
• Kick ‘Em or Keep ‘Em - Collaborating on our own Deserted Islands - Matt Stratton
  https://youtu.be/SgCGD7rutSw
• ACCELERATE State of DevOps 2019
  https://services.google.com/fh/files/misc/state-of-devops-2019.pdf