An insider’s view into the past, present, and future of observability & CloudOps.

monitorjain
12 min readAug 29, 2021
Join me on this literary journey of Observability and CloudOps

In the last four year or so, there’s been surge of SREs (site reliability engineers) on job listings. If you are new to the site reliability game, or remotely associated with products that make SREs more productive, read this article for a fascinating literary tour of the last decade of monitoring & observability world.

This article is more critical than a report or dump of all the highlights that we have experienced in the modern Production Engineering and IT Operations space. It aims to serve as pertinent viewpoints into the right way to execute site reliability, customer experience strategy, and even business continuity and resilience programs.

1.0 The Past: The era when we “monitored”

In the past, the focus was on Systems Infrastructure and Networks.

Back then we had monoliths, service-oriented architecture, fewer answers, and almost zero microservices – containers & server-less workloads. If a system went down, it was assumed that every functionality went down due to tight coupling between transactions, application logic, and the machine where the apps were hosted. There was no microservices approach undertaken whereby…

--

--

monitorjain

Value Engineering | SRE, Cloud, and Dev advocate | Tech enthusiast | Kaizen practitioner | Presales coach | Dad