To win the hybrid cloud you have to win at least four of the five components - public, private, private Kubernetes and edge. This is how MinIO gets it done. | Continue reading
Cassandra does many things quite well but has certain limitations that preclude its use for a primary storage object store. Some vendors use it anyway. | Continue reading
Small objects are becoming far more prevalent in the modern data stack. Most implementations don't scale to the challenge or can't perform in the face of it. | Continue reading
The promise/allure of the public cloud is based on the concept that it is elastic. One can, with little effort, scale up workloads and, if desired, scale down those same workloads. We have written on this subject before - from the perspective of what workloads to consider as you | Continue reading
Edge computing is a complex topic but this post details the two main approaches: edge storage and edge cache and the storage implications for both. | Continue reading
MinIO introduces active-active bucket replication - ensuring that geographically distinct datacenter can by continuously synced. | Continue reading
MinIO expands its partnership with VMware by integrating its object storage suite with Cloud Foundation through the new vSAN Data Persistence platform. | Continue reading
With the introduction of Apache Arrow, language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations, MinIO data lakes can be much more powerful. This article explains how to make use of Apache Arrow by using ArrowRDD. | Continue reading
While everyone claims to be a software company today - few can pass the smell test. See a comparison of SDS vs. Appliances to find what is right for you. | Continue reading
The public cloud is here to stay - but more as a place to learn, to move quickly and to experiment. The private cloud is where enterprises will scale. | Continue reading
Object Locking, Versioning, Legal Holds and Modes are the foundational elements of data immutability. Enterprises can use these features to protect their data with MinIO. | Continue reading
This post looks at the relative strengths of Intel and ARM as it relates to object storage. Specific attention is paid to single core vs. multi-core. | Continue reading
This post covers how to configure event notifications on MinIO using Apache Nifi - resulting in fast, scalable and effective data flow pipelines. | Continue reading
High performance object storage is the natural partner for machine learning. In this post we pair Google's Tensorflow with MinIO in a hyperscale example. | Continue reading
Given the footprint of MD5 hashing, any improvements are worthwhile. This post looks as SIMD enhancements (AVX2 + AVX512) to improve performance by 800%. | Continue reading
Veeam's V10 release adds object storage support in a big way. Find out how these two software stacks play together to deliver performance oriented backup and restore. | Continue reading
Lightweight, high-performance load balancers for modern workloads don't really exist - so we built one. Introducing Sidekick from MinIO | Continue reading
Two tectonic forces are reshaping the tech industry: high performance object storage + Kubernetes. They spell doom for SAN/NAS + appliance vendors. | Continue reading
simdjson uses a novel, two stage approach with which it is possible to achieve a parsing performance of gigabytes of JSON per second on a single core. | Continue reading
Kubernetes is here to stay. Understanding the optimal storage architecture requires a modern approach that is disaggregated, RESTful and scalable. | Continue reading
The Minio Operator lets developers who don’t know how to manage storage to leverage Kubernetes’ automation tools and declarative syntax. | Continue reading
AWS Snowball is a nice product with real value. AWS's CLI, however, is quite limited. MinIO enhanced its mc tool to make Snowball look like any S3 compatible server. | Continue reading
MinIO just relaxed its 32 node restriction but this post covers why keeping your failure domain small is still a big idea. | Continue reading
MinIO's performance characteristics are well known, but what does scaling that performance look like across HDD and NVMe? This post details what to expect when creating large systems. | Continue reading
This post originally appeared on The New Stack[https://thenewstack.io/the-new-metrics-of-object-storage/].As a general rule, when people think about object storage they think about onething — the price per TB/GB. Though a legitimate cost metric, it has the effectof making object … | Continue reading
The fact that MinIO is fast is not a secret. We routinely publish our benchmarksand have put out comparision work against HDFS [/hdfsbenchmark/] and AWS (Spark[/benchmarking-apache-spark-vs-aws-s3/] + Presto[/running-presto-on-minio-benchmarking-vs-aws-s3/]) in addition to our HD … | Continue reading
If you handle data of any kind, you have to worry about data security. This istrue for multinational banks; it’s also true for small and medium-sizebusinesses. Encryption should never be your only line of defense againstmalicious actors, but it is one of the most basic security b … | Continue reading
Few would argue with the statement that Hadoop HDFS is in decline. In fact, theHDFS part of the Hadoop ecosystem is in more than just decline - it is infreefall. At the time of its inception, it had a meaningful role to play as ahigh-throughput, fault-tolerant distributed file sy … | Continue reading
The term ‘cloud native’ is widely used in technical circles but doesn’t have aparticularly clear definition. The confusion lies in the fact that being ‘cloudnative’ has little to do with the environment your application is deployedto—the term is equally applicable to on-premise o … | Continue reading
Perhaps the biggest takeaway from VMworld was the “all in” bet that VMware ismaking on Kubernetes. VMware wasn’t shy about its ambition, with CEO PatGelsinger stating that “We will be the leading enabler of Kubernetes.”This reflects the reality of the market. Kubernetes has becom … | Continue reading
The demise of Hadoop is probably overblown[https://www.datanami.com/2019/08/12/re-imagining-big-data-in-a-post-hadoop-world/]. It will not suddenly disappear from the enterprise landscape - there aresimply too many clients, too much sunk investment for it to vanish into thenight. … | Continue reading
Few would argue with the statement that Hadoop HDFS is in decline. In fact, theHDFS part of the Hadoop ecosystem is in more than just decline - it is infreefall. At the time of its inception, it had a meaningful role to play as ahigh-throughput, fault-tolerant distributed file sy … | Continue reading
The power of scale[https://hbr.org/search?search_type=&term=scale&sort=popularity_score] iswell-documented in the world of business. Cloud providers - Amazon in particular - have amassed extraordinary scale in avery short period of time. The cloud providers are now using this sca … | Continue reading
Apache Spark is a framework for distributed computing. It provides one of thebest mechanisms for distributing data across multiple machines in a cluster andperforming computations on it. Spark achieves this by constructing data structures called RDDs (ResilientDistributed Dataset … | Continue reading
The growth of Presto in the enterprise is a function of its speed, SQLcompatibility, extensibility and enterprise feature set. While initiallydesigned to speed up Hadoop, the success of the project has led to much broaderadoption - on S3, Cassandra, MySQL, and more. Presto allows … | Continue reading
We approach things differently here at MinIO.When we started in 2014, we questioned everything about the object storagemarket as we built our product - thinking more like a data company than astorage company. We did it from scratch, using engineering first principles, anextraordi … | Continue reading
Software isn't usually described as bombproof. Particularly the type of software that is responsible for large analytic jobs ormachine learning workloads. The words “finicky”, “complex” or in the case ofgood marketing “professional grade” (meaning you need years of study andmulti … | Continue reading
When one looks at the amazing roster of talks for the Spark + AI Summit, what you don’t see is a lot of discussion on how to leverage object storage. On some level you would expect to — | Continue reading
How to use Apache Flink to build a private cloud data pipeline for a variety of use cases. | Continue reading