Root Cause Analysis in Three Clicks

Today, we’re excited to announce updates that enable developers to identify the root cause of virtually any regression in three simple clicks. This article will take you through the steps on how to identify what went wrong and why, in under 5 minutes. | Continue reading | 13 days ago

Migrating to Microservices: Worst Practices

The reality is that most migrations bog down quickly. This worst practices guide will tell you how you too can end up with a distributed monolith at the end of a multi-year long slog. | Continue reading | 2 months ago

Observability: A Complete Overview for 2020

A practical guide to observability for developers. You’ll learn, you’ll laugh (we hope), you’ll cry (because you learned so much). | Continue reading | 3 months ago

Microservices Architecture: When and How to Move to Microservices

Continue reading | 4 months ago

What Your Observability Software Should Deliver

Continue reading | 4 months ago

What You Could Be Doing Instead of Smashing the Grep Command for Three Hours

Continue reading | 5 months ago

Getting Real with Command Line Arguments and Go-Flags

Continue reading | 5 months ago

Beyond OnClick: Handling Events in React

Continue reading | 5 months ago

We Built This: LightStep’s Slack Integration

In this post I want to walk you through a recent project I worked on at LightStep: building a Slack integration to make it easier for developers to share insights about their distributed systems. | Continue reading | 6 months ago

Making Your System Observable from the Outside In

An Incremental Approach to Observability | Continue reading | 7 months ago

Have a Platform Team

Platform teams require dedicated investment and commitment. The investment is transformative not just to software development but also to the entire business. | Continue reading | 8 months ago

Understanding Tracer Performance with LightStep Benchmarks

Telemetry data should be collected without causing unacceptable levels of overhead for your service, and we’re excited to push the state-of-the-art forward for the observability community as a major contributor to OpenTelemetry. | Continue reading | 8 months ago

How to Build a Platform Team

Platform teams accelerate the speed of a business by making it easier for developers to release more features with less risk. Observability expert James Burns walks you through how to build and grow a platform team. | Continue reading | 8 months ago

Should You Hire a Freelance Developer?

Your developer talent is directly related to the quality of your application and getting functionality out the door. A common path to quickly on-boarding new dev talent is to consider freelance developers. | Continue reading | 8 months ago

A Brief History of “The Span”: Hard to Love, Hard to Kill

To the best of my knowledge, I’m the one to blame for the term “span,” at least as it relates to distributed tracing. There’s the old joke: “There are only two hard things in computer science: naming, cache invalidation, and off-by-one errors.” | Continue reading | 10 months ago

We Write Code at LightStep: Lab Notebooks

Many engineering teams create some form of a Design Document in the early stages of a new project. At LightStep we do things a bit differently. Rather than Design Documents, we’ve created “Lab Notebooks.” | Continue reading | 10 months ago

Kubernetes Logs and Services: Beware These Pitfalls

It’s the third time in a month where your customers are seeing two minutes of error pages — and then it’s just fixed. That’s plenty long enough for them to complain on Twitter, and besides that, it’s just embarrassing ... | Continue reading | 10 months ago

Why Working on Monoliths Is Bad for Your Career

I’m not going to tell you there aren’t successful businesses that are built on monoliths. I will tell you that being on a monolith has or is going to limit their success. The rate of change in a monolith is so much slower ... | Continue reading | 11 months ago

Migrating to Microservices? Here’s How to Have Reliable APIs from Day One

Migrating from a monolith to microservices can be daunting. Do you move a model into a CRUD service and call it using HTTP? Not if you want reliable APIs on day one. | Continue reading | 1 year ago

How to Get Started with Chaos: A Step-by-Step Guide to Gamedays

What’s the best way to run a chaos gameday? How does it help your team? Why should we intentionally create failure? | Continue reading | 1 year ago

Myth of the Server's Terrible, Horrible, No Good, Very Bad Day

Our preferred strategy: use observability to demystify behaviors such as latency and gain a deeper understanding of your system. Then move faster. | Continue reading | 1 year ago

Three Pillars with Zero Answers – Towards a New Scorecard for Observability

These three so-called “Pillars” each have a fatal flaw – we need to understand them and ultimately rethink the way we assess observability as a practice. | Continue reading | 1 year ago