Sticking Together while Staying Apart: Resilience in the time of global pandemic

A presentation at Austin DevOps - September Virtual Meetup in September 2020 in Austin, TX, USA by Aaron Aldrich

Slide 1

Slide 1

Sticking Together while Staying Apart Resilience in the time of global pandemic Aaron Aldrich @CrayZeigh

Slide 2

Slide 2

Million-to-one chances… crop up nine times out of ten. ̶Sir Terry Pratchet GNU

Slide 3

Slide 3

Everythingʼs a little bit broken all of the time… but it keeps working anyway

Slide 4

Slide 4

Resilience @CrayZeigh

Slide 5

Slide 5

Resilience @CrayZeigh

Slide 6

Slide 6

Resilience Rebound @CrayZeigh

Slide 7

Slide 7

Resilience Rebound Robustness @CrayZeigh

Slide 8

Slide 8

Resilience Graceful Extensibility Rebound Robustness @CrayZeigh

Slide 9

Slide 9

Resilience Graceful Extensibility Rebound Robustness Sustained Adaptability @CrayZeigh

Slide 10

Slide 10

Rebound Return to “normal” after a surprise or traumatic incident. Work done ahead of time. @CrayZeigh

Slide 11

Slide 11

Robustness The ability to withstand and absorb well-modeled disturbances. Knownknowns @CrayZeigh

Slide 12

Slide 12

Graceful Extensibility The ability to stretch with challenges to operational boundaries. Opposed to brittleness. @CrayZeigh

Slide 13

Slide 13

Sustained Adaptability Recognizing and managing adaptive capabilities over long timescales @CrayZeigh

Slide 14

Slide 14

@CrayZeigh

Slide 15

Slide 15

Bone @CrayZeigh

Slide 16

Slide 16

Bone • Continuously created and destroyed @CrayZeigh

Slide 17

Slide 17

Bone • Continuously created and destroyed • Reconstruction directed by mechanical strain @CrayZeigh

Slide 18

Slide 18

Bone • Continuously created and destroyed • Reconstruction directed by mechanical strain • Process directed by signals through layered networks at cell-level @CrayZeigh

Slide 19

Slide 19

@CrayZeigh

Slide 20

Slide 20

@CrayZeigh

Slide 21

Slide 21

Rebound Graceful Extensibility Robustness @CrayZeigh

Slide 22

Slide 22

Socio-Technical Systems @CrayZeigh

Slide 23

Slide 23

Conway’s Law Designed systems represent an organizationʼs communication structure @CrayZeigh

Slide 24

Slide 24

@CrayZeigh

Slide 25

Slide 25

Blunt end Sharp end @CrayZeigh

Slide 26

Slide 26

Blunt end Sharp end Closest to the work, practitioners @CrayZeigh

Slide 27

Slide 27

Blunt end Removed from experience, upstream decision makers Sharp end Closest to the work, practitioners @CrayZeigh

Slide 28

Slide 28

Sharp end Closest to the work, practitioners @CrayZeigh

Slide 29

Slide 29

• Constantly building and destroying systems Sharp end Closest to the work, practitioners @CrayZeigh

Slide 30

Slide 30

• Constantly building and destroying systems • Strong signaling Sharp end Closest to the work, practitioners @CrayZeigh

Slide 31

Slide 31

• Constantly building and destroying systems • Strong signaling • Improve systems based on strain Sharp end Closest to the work, practitioners @CrayZeigh

Slide 32

Slide 32

• Constantly building and destroying systems • Strong signaling • Improve systems based on strain Sharp end • Will do so naturally given ownership Closest to the work, practitioners @CrayZeigh

Slide 33

Slide 33

Teams that do well dealing with impact [surprises/incidents] are those that have a strong common ground ̶J. Paul Reed (@jpaulreed), Failover Conf

Slide 34

Slide 34

If we want to improve a teamʼs resilience, we must build a strong common ground ̶Me, Just now.

Slide 35

Slide 35

Common Ground @CrayZeigh

Slide 36

Slide 36

Common Ground • Basic Compact @CrayZeigh

Slide 37

Slide 37

Common Ground • • Basic Compact Goal Alignment/ Commitment @CrayZeigh

Slide 38

Slide 38

Common Ground • • • Basic Compact Goal Alignment/ Commitment Inter-predictability @CrayZeigh

Slide 39

Slide 39

Common Ground • • • • Basic Compact Goal Alignment/ Commitment Inter-predictability Sustain & Repair @CrayZeigh

Slide 40

Slide 40

Building Common Ground @CrayZeigh

Slide 41

Slide 41

Building Common Ground • Blameless Postmortems @CrayZeigh

Slide 42

Slide 42

Building Common Ground • • Blameless Postmortems Chaos Engineering @CrayZeigh

Slide 43

Slide 43

Building Common Ground • • • Blameless Postmortems Chaos Engineering Game Days @CrayZeigh

Slide 44

Slide 44

Building Common Ground • • • • Blameless Postmortems Chaos Engineering Game Days Modeling Vulnerability @CrayZeigh

Slide 45

Slide 45

@CrayZeigh

Slide 46

Slide 46

@CrayZeigh

Slide 47

Slide 47

https://youtu.be/SgCGD7rutSw @CrayZeigh

Slide 48

Slide 48

Resilience is about creating the conditions that maximize everyoneʼs potential ̶Rein Hendrichs, >Code Podcast, 174: Resilience

Slide 49

Slide 49

@CrayZeigh

Slide 50

Slide 50

@CrayZeigh

Slide 51

Slide 51

What happens when governments fail? @CrayZeigh

Slide 52

Slide 52

It’s left to us @CrayZeigh

Slide 53

Slide 53

Community Building is Resilience Engineering ̶Me again, just now again.

Slide 54

Slide 54

Strong Communities @CrayZeigh

Slide 55

Slide 55

Strong Communities • Diverse @CrayZeigh

Slide 56

Slide 56

Strong Communities • • Diverse High Trust & Safety @CrayZeigh

Slide 57

Slide 57

Strong Communities • • • Diverse High Trust & Safety Sustain & Repair @CrayZeigh

Slide 58

Slide 58

Strong Communities • • • • Diverse High Trust & Safety Sustain & Repair Inter-predictability @CrayZeigh

Slide 59

Slide 59

Strong Communities • • • • • Diverse High Trust & Safety Sustain & Repair Inter-predictability Loosely Coupled, layered networks @CrayZeigh

Slide 60

Slide 60

@CrayZeigh

Slide 61

Slide 61

@CrayZeigh

Slide 62

Slide 62

@CrayZeigh

Slide 63

Slide 63

@CrayZeigh

Slide 64

Slide 64

@CrayZeigh

Slide 65

Slide 65

@CrayZeigh

Slide 66

Slide 66

@CrayZeigh

Slide 67

Slide 67

@CrayZeigh

Slide 68

Slide 68

@CrayZeigh

Slide 69

Slide 69

@CrayZeigh

Slide 70

Slide 70

@CrayZeigh

Slide 71

Slide 71

@CrayZeigh

Slide 72

Slide 72

@CrayZeigh

Slide 73

Slide 73

Protest Resilience @CrayZeigh

Slide 74

Slide 74

@CrayZeigh

Slide 75

Slide 75

@CrayZeigh

Slide 76

Slide 76

@CrayZeigh

Slide 77

Slide 77

@CrayZeigh

Slide 78

Slide 78

@CrayZeigh

Slide 79

Slide 79

@CrayZeigh

Slide 80

Slide 80

Community Resilience @CrayZeigh

Slide 81

Slide 81

@CrayZeigh

Slide 82

Slide 82

@CrayZeigh

Slide 83

Slide 83

@CrayZeigh

Slide 84

Slide 84

@CrayZeigh

Slide 85

Slide 85

@CrayZeigh

Slide 86

Slide 86

There were drink coolers sorted and labeled: Water, Sparkling water, Juice, Gatorade. A generator-powered phone-charging station enabled people to stay longer while staying in communication with the outside world. A free library—with no late fees!—was established early on and stocked with the words of Black revolutionaries and poets. By July 1, the occupation was offering free COVID-19 testing, too. -(referencing NYC CHAZ) https://crimethinc.com/2020/07/02/the-cop-free-zone-reflectionsfrom-experiments-in-autonomy-around-the-us

Slide 87

Slide 87

@CrayZeigh

Slide 88

Slide 88

@CrayZeigh

Slide 89

Slide 89

@CrayZeigh

Slide 90

Slide 90

@CrayZeigh

Slide 91

Slide 91

@CrayZeigh

Slide 92

Slide 92

@CrayZeigh

Slide 93

Slide 93

Slide 94

Slide 94

Slide 95

Slide 95

Slide 96

Slide 96

Slide 97

Slide 97

https://bit.ly/2Ym7Tp9 @CrayZeigh

Slide 98

Slide 98

@CrayZeigh

Slide 99

Slide 99

@CrayZeigh

Slide 100

Slide 100

@CrayZeigh

Slide 101

Slide 101

@CrayZeigh

Slide 102

Slide 102

https://youtu.be/L9A6ZauhOhg @CrayZeigh

Slide 103

Slide 103

Enable potential and get out of the way @CrayZeigh

Slide 104

Slide 104

Slides & Resources speaking.crayzeigh.com OSMIhelp.org Aaron Aldrich @CrayZeigh EmotionalAPI.co m devopsdays.org

Slide 105

Slide 105

I love you Do good out there Weʼre all in this together @CrayZeigh

Slide 106

Slide 106

Further Reading/Watching/Listening Four concepts for resilience and the implications for the future of resilience engineering - David Woods https://bit.ly/3bITTdc The Marvelous Resilience of Bone - Dr. Richard Cook, REdeploy 2019 https://www.youtube.com/watch?v=8LbePBiOvZ4 Greater Than Code, 174: Resilience https://www.greaterthancode.com/resilience The Worst Year Ever, How to Save your Community When The Government Fails https://ihr.fm/3eVNFbI @CrayZeigh

Slide 107

Slide 107

Further Reading/Watching/Listening Behind Human Error(2nd Edition) - Woods, Dekker, Cook, Johannessen, Carter The Woolworths Experiment https://safetydifferently.com/the-woolworths-experiment/ The Field Guide to Understanding Human Error - Sydney Dekker Literally every video from REdeploy: https://www.youtube.com/channel/UCHbJcI6KfyxflRqdv26b3Qw On Borrowing From Yourself - Aaron Aldrich https://dev.to/crayzeigh/a-reflection-on-borrowing-from-yourself-3jhf @CrayZeigh

Slide 108

Slide 108

Further Reading/Watching/Listening @CrayZeigh

Slide 109

Slide 109

Further Reading/Watching/Listening support your local journalists @CrayZeigh

Slide 110

Slide 110

Further Reading/Watching/Listening support your local journalists support local political candidates @CrayZeigh

Slide 111

Slide 111

Further Reading/Watching/Listening support your local journalists support local political candidates vote @CrayZeigh