For thirteen years Stephen worked as a performance engineering specialist, with a particular interest in data analysis and visualization. He has been blogging, speaking, and sharing content on performance engineering for many years, including through his podcast, Performance Time.
Recently Stephen took up a new role as a Site Reliability Engineer, which has been a challenging journey full of new concepts and skills.
Benchmarking Reliability – When SLOs Aren’t Delivering Outcomes
Say you’ve been asked to work with a team to introduce SRE concepts and practices. The first question you probably want to answer is: what is the current reliability of the system or services that the team owns? Answering that question gives you a clear sense of the current state and identifies opportunities for improvement.