Turning our most significant production outage into a driver for positive lasting change. A DevOps story.
This talk isn’t paywalled. You can support the speaker and the organisers by clicking the button above. Info…
About the talk
Early one Monday morning we had a production deployment that sparked a sequence of events that took our biggest customer offline. Everything went wrong! We were not able to rollback, or failover, or identify the root-cause, and somehow an engineer was able to log into production and make the situation worse. In the end, it took us two weeks to restore service.
As a newcomer to this high-growth organisation, I saw this as a massive opportunity to help the organisation take a quantum leap forward in how it operates and supports its services in production.
This talk outlines the transformative journey we took as we implemented Continuous Deployments and Canaries on our monolith, Blue/Green on our database, Service Level Objectives, and how we improved our DevOps practices backed up by a Site Reliability Engineering team.
Martin is a software engineering leader with ten-years experience in leading, building and transforming teams to deliver well-crafted software effectively. During his recent leadership roles at Serko (Head of Engineering), MYOB (Dev Manager) and nReality (Engineering Coach) he worked with local, remote, and globally distributed teams of 10-130 people building SaaS and mobile products. In this time, he spent most of his energy directed to building highly collaborative engaged teams while provided direction on how to improve the practices, processes, architecture and production operations.During his twenty-year career, he developed a strong technical background having played architect, principal engineer, and engineering coach roles, on systems ranging from mobile and data analytics to high-volume, mission-critical systems in the technology, government and financial services.Martin has spoken on agile, leadership, programming and design at events including LASTConf, Agile USA, Agile New Zealand, Agile Australia, Agile Africa, Scrum Gathering South Africa, Microsoft TechEd Africa and many more.
Title: Turning our most significant production outage into a driver for positive lasting change. A DevOps story.
Date: Wed, 10 June 2020
Duration: 60 minutes
Melbourne, Sydney. 11:30am AEST
Auckland. 1:30pm NZT
Find your local time
Keywords: DevOps, transformation, OKR, SLO, Continuous Deployment
About LAST Anywhere Talks
LAST stands for Lean, Agile and Systems Thinking. It was born out of meetup groups and then a conference, started in Melbourne AU in 2012. Since then, it grew to also be held in Sydney, Brisbane, Canberra and Adelaide. 1st Conference (Organisational Agility) and Spark the Change Melbourne (Meaning and Purpose at work and in society) are related conferences.
LAST Anywhere talks and workshops draw subject matter from the full range of topics covered by LAST, 1st and Spark the Change. Delivered online, they aim to provide non-formal learning and valuable interactions, and extend on the value provided by physical events.
All you need is a good internet connection, a webcam, headphones, and a microphone. Also, you will need the focus to be present and make the most of the session.