Scalable metadata: the new breed of file systems (em)powering big data companies

HopsFS is an open-source scaleout metadata file system, but its primary use case is not Exabyte storage, rather customizable consistent metadata. | Continue reading


@logicalclocks.com | 2 years ago

Scaleout Metadata: hardest problem in Data. How the Big 3 and a Small 1 solve it

HopsFS is an open-source scaleout metadata file system, but its primary use case is not Exabyte storage, rather customizable consistent metadata. | Continue reading


@logicalclocks.com | 2 years ago

Beyond Brainless AI with a Feature Store

Evolve your models from stateless AI to Total Recall AI with the help of a Feature Store | Continue reading


@logicalclocks.com | 2 years ago

Threading paradox: latency goes down as throughput goes up in RonDB

RonDB enables users to have full control over the assignment of threads to CPUs, how the CPU locking is to be performed and how the thread should be scheduled. | Continue reading


@logicalclocks.com | 3 years ago

You can do better than Redis as a data layer for your models

RonDB shows higher availability and the ability to handle larger data sets in comparison with Redis, paving the way to be the fastest key-value store available. | Continue reading


@logicalclocks.com | 3 years ago

AI/ML needs a key-value store, and Redis is not up to it

RonDB shows higher availability and the ability to handle larger data sets in comparison with Redis, paving the way to be the fastest key-value store available. | Continue reading


@logicalclocks.com | 3 years ago

RonDB: The fastest key-value store, made for AI/ML- now in the cloud

RonDB is a managed key-value store with SQL capabilities. It provides the best low-latency, high throughput, and high availability database available today. | Continue reading


@logicalclocks.com | 3 years ago

Elasticsearch is dead, long live Open Distro for Elasticsearch

Hopsworks now supports dynamic role-based access control to indexes in elasticsearch with no performance penalty by building on Open Distro for Elasticsearch. | Continue reading


@logicalclocks.com | 3 years ago

HopsFS: 100x Times Faster Than AWS S3

HopsFS-S3: cloud-native distributed hierarchical file system that has the same cost as S3, but has 100X the performance of S3 for file move/rename operations. | Continue reading


@logicalclocks.com | 3 years ago

Feature Store for MLOps? Feature Reuse Means Join

Use JOINs for feature reuse to save on infrastructure and the number of feature pipelines needed to maintain models in production. | Continue reading


@logicalclocks.com | 3 years ago

ML Engineer Guide: Feature Store vs. Data Warehouse

A data warehouse is an input to the Feature Store. A data warehouse is a single columnar database, while a feature store is implemented as two databases. | Continue reading


@logicalclocks.com | 3 years ago

One function is all you need for ML Experiments

Hopsworks supports machine learning experiments to track and distribute ML for free and with a built-in TensorBoard. | Continue reading


@logicalclocks.com | 3 years ago

Dynamic RBAC with zero performance overhead in Hopsworks

Integrate with third-party security standards and take advantage from our project-based multi-tenancy model to host data in one single shared cluster. | Continue reading


@logicalclocks.com | 3 years ago

How to build your own Feature store for ML

Given the increasing interest in feature stores, we share our own experience of building one to help others who are considering following us down the same path. | Continue reading


@logicalclocks.com | 3 years ago

A Feature Store for Databricks

This article introduces the Hopsworks Feature Store for Databricks, and how it can accelerate and govern your model development and operations on Databricks. | Continue reading


@logicalclocks.com | 4 years ago

Decompose the ML Pipeline Monolith with a Feature Store

End-to-End ML pipeline with a Feature Store based on MLOps principles | Continue reading


@logicalclocks.com | 4 years ago

GANs revolutionize Anti-Money Laundering – the revolution will not be supervised

Deep learning: State-of-the-art technique for identifying transactions in AML. Less false positives and higher accuracy than traditional rule-based approaches. | Continue reading


@logicalclocks.com | 4 years ago

Guide to File Formats for Machine Learning: Columnar, Training, Inferencing

This is a guide to file formats for machine learning in Python. The Feature Store can training/test data in a file format of choice on a file system of choice. | Continue reading


@logicalclocks.com | 4 years ago

Goodbye Ray, Hello Asynchronous Search for PySpark

Hopsworks supports easy hyperparameter optimization (both synchronous and asynchronous search), distributed training using PySpark, TensorFlow and GPUs. | Continue reading


@logicalclocks.com | 4 years ago

What Is It with European Data Companies?

Continue reading


@logicalclocks.com | 5 years ago

Feature Store: The Missing Data Layer in Machine Learning Pipelines?

Continue reading


@logicalclocks.com | 5 years ago

When Deep Learning with GPUs, Use a Cluster Manager

Continue reading


@logicalclocks.com | 5 years ago

Goodbye Horovod, Hello TensorFlow CollectiveAllReduce

Continue reading


@logicalclocks.com | 5 years ago

When a Resource Scheduler Is Not Enough for GPUs

Continue reading


@logicalclocks.com | 5 years ago

Why You Need a Distributed Filesystem for Deep Learning

Continue reading


@logicalclocks.com | 5 years ago

Integrating NVMe Disks in HopsFS (HDFS)

Continue reading


@logicalclocks.com | 5 years ago