Webinar Lottie

lakeFS Acquires DVC, Uniting Data Version Control Pioneers to Accelerate AI-Ready Data

webcros

Learn from AI, ML & data leaders

March 31, 2026  |  Live

The lakeFS Blog

AWS Endpoint URLs

Messing with AWS Endpoint URLs

It makes perfect sense that if you type aws s3 ls s3://my-bucket to list the contents of an S3 bucket, you would expect to connect

Hudi vs Iceberg vs Delta Lake: Data Lake Table Formats Compared

Introduction to Data Lakehouse When building a data lake, there is perhaps no more consequential decision than the format data will be stored in. The

lakefs-logo

Why I’m Joining lakeFS

Thoughts on a personal journey into the world of developer advocacy at an open-source data project. In March of 2021, I chose to leave the data

3 Data Lake Anti-Patterns to Avoid

Rid yourself of these troubling habits and start the journey towards data lake mastery! Introduction Data lakes offer tantalizing performance upside, which is a major reason

Data Lakes Features

What is a Data Lake? Data Lake vs Data Warehouse

What is a Data Lake? A data lake is a system of technologies that allow for the querying of data in file or blob objects. 

Amazon EMR with lakeFS

Power Amazon EMR Applications with Git-like Operations Using lakeFS

This article will provide a detailed explanation of how to use lakeFS with Amazon EMR. Today, it’s common to manage a data lake using cloud

Write-Audit-Publish for Data using lakeFS Hooks

lakeFS Hooks: Implementing Write-Audit-Publish for Data Using Pre-Merge Hooks

Write-Audit-Publish (continuous integration/continuous deployment of data) is the process of exposing data to consumers only after ensuring it adheres to best practices such as format,

PebbleDB SSTable

Concrete Graveler: Committing Data to Pebble SSTables

Introduction In our recent version of lakeFS, we switched to base metadata storage on immutable files stored on S3 and other common object stores.

Tiers in the Cloud

Tiers in the Cloud: How lakeFS caches immutable data on local-disk

Introduction We recently released the first version of lakeFS supported by Pebble’s sstable library – RocksDB. The release introduced a new data model which is

Airflow and lakeFS

Building Reproducible Data Pipelines with Airflow and lakeFS

Update (May 26th, 2021): We officially released the lakeFS Airflow provider. Read all about it in the latest blog post. In this post, we’ll see

Working with Embed in Go

Working with Embed in Go 1.16 Version

The new Golang v1.16 embed directive helps us keep a single binary and bundle out static content. This post will cover how to work with

We use cookies to improve your experience and understand how our site is used.

Learn more in our Privacy Policy