Metablog

SRE vs. Platform Engineering – Differences Explained

An overview of the similarities and differences between Site Reliability Engineering and Platform Engineering, including from a career perspective.

@rootly.com | 1 year ago

The Pros and Cons of Embedded SREs

A comparison of the two main SRE team models: Embedded SREs vs. standalone SRE teams.

@rootly.com | 1 year ago

Tips If You’re the First SRE Hire by Instacart's First SRE

Best practices for “SRE pioneers” – meaning engineers who are the very first SREs hired at an organization.

@rootly.com | 1 year ago

Building an incident management company by ex-Instacart

Our co-founder JJ reflects on building the fastest-growing incident management platform and the surprising learnings.

@rootly.com | 1 year ago

Practical Guide to SRE: Incident Severity Levels

Incident severity levels are a measurement of the impact an incident has on the business. Classifying the severity of an issue is critical to deciding how quickly and efficiently problems get resolved.

@rootly.com | 2 years ago

What SREs Can Learn from Capt. Sully: When to Follow Playbooks

Does it always make sense to stick to your playbooks? There’s no clear answer, but it’s still something you should think about.

@rootly.com | 2 years ago

SLA vs. SLO vs. SLI: Understanding the Similarities and Differences

An explanation of the meaning of SLA, SLO and SLI, and how SREs should use each concept to manage reliability.

@rootly.com | 2 years ago

What Managed Kubernetes Service Is Best for SREs?

A comparison of EKS, AKS, GKE, Rancher and OpenShift from an SRE’s perspective.

@rootly.com | 2 years ago
