A Sun Zhu strategy to take on AWS for the hybrid cloud

To win the hybrid cloud you have to win at least four of the five components - public, private, private Kubernetes and edge. This is how MinIO gets it done. | Continue reading


@blog.min.io | 3 years ago

Why Using Cassandra as a metadata database for an object store is a bad choice

Cassandra does many things quite well but has certain limitations that preclude its use for a primary storage object store. Some vendors use it anyway. | Continue reading


@blog.min.io | 3 years ago

Why small objects are such a big deal

Small objects are becoming far more prevalent in the modern data stack. Most implementations don't scale to the challenge or can't perform in the face of it. | Continue reading


@blog.min.io | 3 years ago

The long term costs of object storage in the public cloud

The promise/allure of the public cloud is based on the concept that it is elastic. One can, with little effort, scale up workloads and, if desired, scale down those same workloads. We have written on this subject before - from the perspective of what workloads to consider as you | Continue reading


@blog.min.io | 3 years ago

The Attributes and Types of Edge Storage

Edge computing is a complex topic but this post details the two main approaches: edge storage and edge cache and the storage implications for both. | Continue reading


@blog.min.io | 4 years ago

Active-Active Bucket Replication Across Data Centers

MinIO introduces active-active bucket replication - ensuring that geographically distinct datacenter can by continuously synced. | Continue reading


@blog.min.io | 4 years ago

Combining MinIO and VMware Tanzu for Kubernetes Native Storage

MinIO expands its partnership with VMware by integrating its object storage suite with Cloud Foundation through the new vSAN Data Persistence platform. | Continue reading


@blog.min.io | 4 years ago

Using Apache Arrow to Enhance the Performance of MinIO Data Lakes

With the introduction of Apache Arrow, language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations, MinIO data lakes can be much more powerful. This article explains how to make use of Apache Arrow by using ArrowRDD. | Continue reading


@blog.min.io | 4 years ago

Software defined storage vs. hardware defined storage

While everyone claims to be a software company today - few can pass the smell test. See a comparison of SDS vs. Appliances to find what is right for you. | Continue reading


@blog.min.io | 4 years ago

When companies should go to the public cloud and when they should come back

The public cloud is here to stay - but more as a place to learn, to move quickly and to experiment. The private cloud is where enterprises will scale. | Continue reading


@blog.min.io | 4 years ago

Object Locking, Retention and Legal Holds in MinIO

Object Locking, Versioning, Legal Holds and Modes are the foundational elements of data immutability. Enterprises can use these features to protect their data with MinIO. | Continue reading


@blog.min.io | 4 years ago

MinIO and Apache Nifi

Continue reading


@blog.min.io | 4 years ago

Impact of Intel vs. ARM CPU Performance for Object Storage

This post looks at the relative strengths of Intel and ARM as it relates to object storage. Specific attention is paid to single core vs. multi-core. | Continue reading


@blog.min.io | 4 years ago

Using Apache Nifi for Event Notifications on MinIO

This post covers how to configure event notifications on MinIO using Apache Nifi - resulting in fast, scalable and effective data flow pipelines. | Continue reading


@blog.min.io | 4 years ago

TensorFlow and MinIO: an example of at scale machine intelligence

High performance object storage is the natural partner for machine learning. In this post we pair Google's Tensorflow with MinIO in a hyperscale example. | Continue reading


@blog.min.io | 4 years ago

Using AVX512 optimizations in Golang to accelerate MD5 hashing

Given the footprint of MD5 hashing, any improvements are worthwhile. This post looks as SIMD enhancements (AVX2 + AVX512) to improve performance by 800%. | Continue reading


@blog.min.io | 4 years ago

Using Veeam's V10 Object-Ready Capabilities to Backup to MinIO

Veeam's V10 release adds object storage support in a big way. Find out how these two software stacks play together to deliver performance oriented backup and restore. | Continue reading


@blog.min.io | 4 years ago

Sidekick – An Open Source, High Performance, Ultralight Load Balancer from MinIO

Lightweight, high-performance load balancers for modern workloads don't really exist - so we built one. Introducing Sidekick from MinIO | Continue reading


@blog.min.io | 4 years ago

Kubernetes, Object Storage and Why Appliance Vendors Are in Trouble

Two tectonic forces are reshaping the tech industry: high performance object storage + Kubernetes. They spell doom for SAN/NAS + appliance vendors. | Continue reading


@blog.min.io | 4 years ago

Simdjson-Go: Parsing Gigabyes of JSON per Second in Go

simdjson uses a novel, two stage approach with which it is possible to achieve a parsing performance of gigabytes of JSON per second on a single core. | Continue reading


@blog.min.io | 4 years ago

The stateful/stateless debate when it comes to storage and apps on Kubernetes

Kubernetes is here to stay. Understanding the optimal storage architecture requires a modern approach that is disaggregated, RESTful and scalable. | Continue reading


@blog.min.io | 4 years ago

Developer Friendly Kubernetes Object Storage via MinIO's Operato

The Minio Operator lets developers who don’t know how to manage storage to leverage Kubernetes’ automation tools and declarative syntax. | Continue reading


@blog.min.io | 4 years ago

Show HN: Using mc to migrate data to/from Amazon Snowball

AWS Snowball is a nice product with real value. AWS's CLI, however, is quite limited. MinIO enhanced its mc tool to make Snowball look like any S3 compatible server. | Continue reading


@blog.min.io | 4 years ago

Show HN: Using mc to migrate data to/from Amazon Snowball

Continue reading


@blog.min.io | 4 years ago

Designing Appropriate Failure Domains and Why We Relaxed Our 32 Node Limitation

MinIO just relaxed its 32 node restriction but this post covers why keeping your failure domain small is still a big idea. | Continue reading


@blog.min.io | 4 years ago

Scaling MinIO: Benchmarking Performance from Terabytes to Petabytes

MinIO's performance characteristics are well known, but what does scaling that performance look like across HDD and NVMe? This post details what to expect when creating large systems. | Continue reading


@blog.min.io | 4 years ago

The new drivers of modern object storage

This post originally appeared on The New Stack[https://thenewstack.io/the-new-metrics-of-object-storage/].As a general rule, when people think about object storage they think about onething — the price per TB/GB. Though a legitimate cost metric, it has the effectof making object … | Continue reading


@blog.min.io | 4 years ago

High Performance Object Storage: 1.4 Tb/S Throughput on NVMe

The fact that MinIO is fast is not a secret. We routinely publish our benchmarksand have put out comparision work against HDFS [/hdfsbenchmark/] and AWS (Spark[/benchmarking-apache-spark-vs-aws-s3/] + Presto[/running-presto-on-minio-benchmarking-vs-aws-s3/]) in addition to our HD … | Continue reading


@blog.min.io | 4 years ago

Frictionless Encryption and the Implications of Overhead

If you handle data of any kind, you have to worry about data security. This istrue for multinational banks; it’s also true for small and medium-sizebusinesses. Encryption should never be your only line of defense againstmalicious actors, but it is one of the most basic security b … | Continue reading


@blog.min.io | 5 years ago

HDFS vs. MinIO on 1T MapReduce Benchmark

Few would argue with the statement that Hadoop HDFS is in decline. In fact, theHDFS part of the Hadoop ecosystem is in more than just decline - it is infreefall. At the time of its inception, it had a meaningful role to play as ahigh-throughput, fault-tolerant distributed file sy … | Continue reading


@blog.min.io | 5 years ago

What “cloud native” means when talking storage

The term ‘cloud native’ is widely used in technical circles but doesn’t have aparticularly clear definition. The confusion lies in the fact that being ‘cloudnative’ has little to do with the environment your application is deployedto—the term is equally applicable to on-premise o … | Continue reading


@blog.min.io | 5 years ago

Object Storage's Role in VMware's Kubernetes Centric World

Perhaps the biggest takeaway from VMworld was the “all in” bet that VMware ismaking on Kubernetes. VMware wasn’t shy about its ambition, with CEO PatGelsinger stating that “We will be the leading enabler of Kubernetes.”This reflects the reality of the market. Kubernetes has becom … | Continue reading


@blog.min.io | 5 years ago

Hadoop HDFS's Logical Successor

The demise of Hadoop is probably overblown[https://www.datanami.com/2019/08/12/re-imagining-big-data-in-a-post-hadoop-world/]. It will not suddenly disappear from the enterprise landscape - there aresimply too many clients, too much sunk investment for it to vanish into thenight. … | Continue reading


@blog.min.io | 5 years ago

Show HN: Breaking HDFS Speed Barrier – A First for Object Storage

Few would argue with the statement that Hadoop HDFS is in decline. In fact, theHDFS part of the Hadoop ecosystem is in more than just decline - it is infreefall. At the time of its inception, it had a meaningful role to play as ahigh-throughput, fault-tolerant distributed file sy … | Continue reading


@blog.min.io | 5 years ago

What cloud pricing tells us about the future of storage

The power of scale[https://hbr.org/search?search_type=&term=scale&sort=popularity_score] iswell-documented in the world of business. Cloud providers - Amazon in particular - have amassed extraordinary scale in avery short period of time. The cloud providers are now using this sca … | Continue reading


@blog.min.io | 5 years ago

Benchmarking the New School: MinIO and AWS Running Spark on TPC-H

Apache Spark is a framework for distributed computing. It provides one of thebest mechanisms for distributing data across multiple machines in a cluster andperforming computations on it. Spark achieves this by constructing data structures called RDDs (ResilientDistributed Dataset … | Continue reading


@blog.min.io | 5 years ago

Comparing AWS and MinIO with Presto on the TPC-H Benchmark

The growth of Presto in the enterprise is a function of its speed, SQLcompatibility, extensibility and enterprise feature set. While initiallydesigned to speed up Hadoop, the success of the project has led to much broaderadoption - on S3, Cassandra, MySQL, and more. Presto allows … | Continue reading


@blog.min.io | 5 years ago

Reimagining the B2B Software Customer Experience: From Slack to Subnet

We approach things differently here at MinIO.When we started in 2014, we questioned everything about the object storagemarket as we built our product - thinking more like a data company than astorage company. We did it from scratch, using engineering first principles, anextraordi … | Continue reading


@blog.min.io | 5 years ago

Why true open source software is inherently ready for enterprise environments

Software isn't usually described as bombproof. Particularly the type of software that is responsible for large analytic jobs ormachine learning workloads. The words “finicky”, “complex” or in the case ofgood marketing “professional grade” (meaning you need years of study andmulti … | Continue reading


@blog.min.io | 5 years ago

Running peta-scale Spark jobs on object storage using S3 Select

When one looks at the amazing roster of talks for the Spark + AI Summit, what you don’t see is a lot of discussion on how to leverage object storage. On some level you would expect to —  | Continue reading


@blog.min.io | 5 years ago

Stream Processing with Apache Flink and MinIO

How to use Apache Flink to build a private cloud data pipeline for a variety of use cases. | Continue reading


@blog.min.io | 5 years ago