Sticking Together while Staying Apart: Resilience in the time of global pandemic

A presentation at Austin DevOps - September Virtual Meetup in September 2020 in Austin, TX, USA by fen aldrich

Slide 1

Sticking Together while Staying Apart Resilience in the time of global pandemic Aaron Aldrich @CrayZeigh

Slide 2

Million-to-one chances… crop up nine times out of ten. - Sir Terry Pratchett GNU

Slide 3

Everything’s a little bit broken all of the time… but it keeps working anyway

Slide 4

Resilience

Slide 5

Resilience

Slide 6

Resilience Rebound

Slide 7

Resilience Rebound Robustness

Slide 8

Resilience Graceful Extensibility Rebound Robustness

Slide 9

Resilience Graceful Extensibility Rebound Robustness Sustained Adaptability

Slide 10

Rebound: Return to “normal” after a surprise or traumatic incident. Work done ahead of time.

Slide 11

Robustness: The ability to withstand and absorb well-modeled disturbances. Known knowns.

Slide 12

Graceful Extensibility: The ability to stretch with challenges to operational boundaries. Opposed to brittleness.

Slide 13

Sustained Adaptability: Recognizing and managing adaptive capabilities over long timescales.

Slide 14

Slide 15

Bone

Slide 16

Bone • Continuously created and destroyed

Slide 17

Bone • Continuously created and destroyed • Reconstruction directed by mechanical strain

Slide 18

Bone • Continuously created and destroyed • Reconstruction directed by mechanical strain • Process directed by signals through layered networks at the cell level

Slide 19

Slide 20

Slide 21

Rebound Graceful Extensibility Robustness

Slide 22

Socio-Technical Systems

Slide 23

Conway’s Law: Designed systems represent an organization’s communication structure

Slide 24

Slide 25

Blunt end Sharp end

Slide 26

Blunt end Sharp end: Closest to the work, practitioners

Slide 27

Blunt end: Removed from experience, upstream decision makers Sharp end: Closest to the work, practitioners

Slide 28

Sharp end: Closest to the work, practitioners

Slide 29

Sharp end: Closest to the work, practitioners • Constantly building and destroying systems

Slide 30

Sharp end: Closest to the work, practitioners • Constantly building and destroying systems • Strong signaling

Slide 31

Sharp end: Closest to the work, practitioners • Constantly building and destroying systems • Strong signaling • Improve systems based on strain

Slide 32

Sharp end: Closest to the work, practitioners • Constantly building and destroying systems • Strong signaling • Improve systems based on strain • Will do so naturally given ownership

Slide 33

Teams that do well dealing with impact [surprises/incidents] are those that have a strong common ground - J. Paul Reed (@jpaulreed), Failover Conf

Slide 34

If we want to improve a team’s resilience, we must build a strong common ground - Me, just now.

Slide 35

Common Ground

Slide 36

Common Ground • Basic Compact

Slide 37

Common Ground • Basic Compact • Goal Alignment/Commitment

Slide 38

Common Ground • Basic Compact • Goal Alignment/Commitment • Inter-predictability

Slide 39

Common Ground • Basic Compact • Goal Alignment/Commitment • Inter-predictability • Sustain & Repair

Slide 40

Building Common Ground

Slide 41

Building Common Ground • Blameless Postmortems

Slide 42

Building Common Ground • Blameless Postmortems • Chaos Engineering

Slide 43

Building Common Ground • Blameless Postmortems • Chaos Engineering • Game Days

Slide 44

Building Common Ground • Blameless Postmortems • Chaos Engineering • Game Days • Modeling Vulnerability

Slide 45

Slide 46

Slide 47

Slide 48

Resilience is about creating the conditions that maximize everyone’s potential - Rein Henrichs, Greater Than Code podcast, 174: Resilience

Slide 49

Slide 50

Slide 51

What happens when governments fail?

Slide 52

It’s left to us

Slide 53

Community Building is Resilience Engineering - Me again, just now again.

Slide 54

Strong Communities

Slide 55

Strong Communities • Diverse

Slide 56

Strong Communities • Diverse • High Trust & Safety

Slide 57

Strong Communities • Diverse • High Trust & Safety • Sustain & Repair

Slide 58

Strong Communities • Diverse • High Trust & Safety • Sustain & Repair • Inter-predictability

Slide 59

Strong Communities • Diverse • High Trust & Safety • Sustain & Repair • Inter-predictability • Loosely Coupled, layered networks

Slide 60

Slide 61

Slide 62

Slide 63

Slide 64

Slide 65

Slide 66

Slide 67

Slide 68

Slide 69

Slide 70

Slide 71

Slide 72

Slide 73

Protest Resilience

Slide 74

Slide 75

Slide 76

Slide 77

Slide 78

Slide 79

Slide 80

Community Resilience

Slide 81

Slide 82

Slide 83

Slide 84

Slide 85

Slide 86

There were drink coolers sorted and labeled: Water, Sparkling water, Juice, Gatorade. A generator-powered phone-charging station enabled people to stay longer while staying in communication with the outside world. A free library (with no late fees!) was established early on and stocked with the words of Black revolutionaries and poets. By July 1, the occupation was offering free COVID-19 testing, too. - (referencing NYC CHAZ)

Slide 87

Slide 88

Slide 89

Slide 90

Slide 91

Slide 92

Slide 93

Slide 94

Slide 95

Slide 96

Slide 97

Slide 98

Slide 99

Slide 100

Slide 101

Slide 102

Slide 103

Enable potential and get out of the way

Slide 104

Slides & Resources Aaron Aldrich @CrayZeigh

Slide 105

I love you. Do good out there. We’re all in this together.

Slide 106

Further Reading/Watching/Listening • Four concepts for resilience and the implications for the future of resilience engineering - David Woods • The Marvelous Resilience of Bone - Dr. Richard Cook, REdeploy 2019 • Greater Than Code, 174: Resilience • The Worst Year Ever, How to Save your Community When The Government Fails

Slide 107

Further Reading/Watching/Listening • Behind Human Error (2nd Edition) - Woods, Dekker, Cook, Johannesen, Sarter • The Woolworths Experiment • The Field Guide to Understanding Human Error - Sidney Dekker • Literally every video from REdeploy: On Borrowing From Yourself - Aaron Aldrich

Slide 108

Further Reading/Watching/Listening

Slide 109

Further Reading/Watching/Listening • support your local journalists

Slide 110

Further Reading/Watching/Listening • support your local journalists • support local political candidates

Slide 111

Further Reading/Watching/Listening • support your local journalists • support local political candidates • vote