Transform your object storage into a Git-like repository
lakeFS enables you to manage your data lake the way you manage your code. Run parallel pipelines for experimentation and CI/CD for your data.
Features

Exabytes scale version control

Git-like operations: branch,
commit, merge, revert

Zero copy branching for
frictionless experiments

Full reproducibility of
data and code

Pre-commit/merge hooks for
data CI/CD

Instantly revert changes to data
Features

Petabytes scale version control

Git-like operations: branch,
commit, merge, revert

Zero copy branching for
frictionless experiments

Full reproducibility of
data and code

Pre-commit/merge hooks for
data CI/CD

Instantly revert changes to data
Works seamlessly with all modern data frameworks















Deploy in the cloud or on-prem





Works seamlessly with all modern data frameworks















Deploy in the Cloud or On-Prem





And any S3 Compatible Storage
Add Your Heading Text Here
The latest from our blog
Messing with AWS Endpoint URLs
It makes perfect sense that if you type aws s3 ls s3://my-bucket to list the contents of an S3 bucket, you would...
Hudi, Iceberg and Delta Lake: Data Lake Table Formats Compared
Introduction When building a data lake, there is perhaps no more consequential decision than the format data will be stored in. The...
Why I’m Joining lakeFS
Thoughts on a personal journey into the world of developer advocacy at an open-source data project. In March of 2021, I chose to...