Plumbing at Scale

This article details our journey building and deploying an event sourcing platform in Go, building a stream processing framework over it, and then scaling it (reliably and efficiently) to service over 300 billion events a week. | Continue reading


@engineering.grab.com | 6 years ago

About Being a Principal Engineer at Grab

Curious about what a Principal Engineer role at Grab entails? Our Principal Engineers' responsibilities include solving complex problems, taking care of the system-level architecture, collaborating with cross-functional teams, providing mentorship, and more. | Continue reading


@engineering.grab.com | 6 years ago

Catwalk: Serving Machine Learning Models at Scale

This blog post explains why and how we came up with a machine learning model serving platform to accelerate the use of machine learning in Grab. | Continue reading


@engineering.grab.com | 6 years ago

We Built a Logging Stack at Grab



@engineering.grab.com | 6 years ago

Preventing Pipeline Calls from Crashing Redis Clusters

This blog post describes Grab’s post-mortem findings for the outage caused by the Redis Cluster failure. | Continue reading


@engineering.grab.com | 7 years ago

A Lean and Scalable Data Pipeline to Capture Large Scale Events

This blog post focuses on the lessons we learned while building our batch data pipeline. | Continue reading


@engineering.grab.com | 7 years ago

Querying Big Data in Real-Time with Presto and Grab's TalariaDB

In this article, we focus on TalariaDB, a distributed, highly available, and low-latency time-series database that stores real-time data such as logs, metrics, and click streams generated by mobile apps and backend services that use Grab's Experimentation Platform SDK. It … | Continue reading


@engineering.grab.com | 7 years ago

Designing Resilient Systems: Circuit Breakers or Retries? (Part 2)

Grab designs fault-tolerant systems that can withstand failures, allowing us to continuously provide our customers with the many services they expect from us. | Continue reading


@engineering.grab.com | 7 years ago

Designing Resilient Systems: Circuit Breakers or Retries?

Grab designs fault-tolerant systems that can withstand failures, allowing us to continuously provide our customers with the many services they expect from us. | Continue reading


@engineering.grab.com | 7 years ago

Mockers – overcoming testing challenges at Grab

Sustaining quality in fast-paced development is a challenge. At Grab, we use Mockers, a tool that expands the scope of local box testing and helps us overcome testing challenges in a microservice architecture. | Continue reading


@engineering.grab.com | 7 years ago

How we designed the Quotas microservice to prevent resource abuse

Reliable, scalable, and high-performing solutions for common system-level issues are essential for microservice success, and there is a Grab-wide initiative to provide those common solutions. As an important component of the initiative, we wrote a microservice called Quotas, a hi … | Continue reading


@engineering.grab.com | 7 years ago

Introducing Grab-Kit: Distributed Service Design at Grab

As we evolved from a single monolithic application to a microservices-based architecture, we were faced with a new challenge. How do we support exponential growth while maintaining consistency, coordination, and quality? | Continue reading


@engineering.grab.com | 7 years ago