What if you could manage your data lake just like you manage code? With rollback, versioning, and branching capabilities on top of your existing data lake?

lakeFS is an open-source project that provides a Git-like version control interface for data lakes, with seamless integration to most data tools and frameworks. lakeFS enables you to easily implement parallel pipelines for experimentation, reproducibility, and CI/CD for data.

And now you can start playing around with lakeFS in a fully functional lakeFS environment, with your own data and all the tools that you are already using. Get your isolated environment, integrate it with the tools you use, and see how it works in an environment similar to your own.

Please note: playground environments will be deleted after one week.

See these features in action:

Full reproducibility of data and code
Git-like operations: branch, commit, merge and revert
Instant reversion of changes to data
Petabytes scale version control
Zero-copy branching for frictionless experimentation
Seamless integration with most data tools and frameworks (Spark, Hive, AWS Athena, Presto)
Support for AWS S3, Azure Blob Storage, and Google Cloud Storage (GCS)

Start playing here (no email or any additional information required)

One you have experienced the value in lakeFS, go ahead and install lakeFS open source, or sign up for lakeFS Cloud Beta.
And don’t forget to share your feedback with us via our community on Slack or GitHub – this is the best way to let us know if any feature you need is missing!

The lakeFS playground is now live and everybody can play!

What if you could manage your data lake just like you manage code? With rollback, versioning, and branching capabilities on top of your existing data lake?

See these features in action:

Read Related Articles.

AI Ready Data Management: Process, Best Practices & Challenges

Iceberg REST Catalog Alternatives: Top Options & How to Choose The Best One For Your Team

Iceberg Time Travel: Snapshots, Rollbacks & Data Version Control

The lakeFS playground is now live and everybody can play!

What if you could manage your data lake just like you manage code? With rollback, versioning, and branching capabilities on top of your existing data lake?

See these features in action:

Read Related Articles.

AI Ready Data Management: Process, Best Practices & Challenges

Iceberg REST Catalog Alternatives: Top Options & How to Choose The Best One For Your Team

Iceberg Time Travel: Snapshots, Rollbacks & Data Version Control

Related articles

Introducing the AI-Ready Data Summit

What is Metadata Tracking? Types, Tools & Best Practices

How CytoReason Streamlined Nextflow with lakeFS for Smarter Data Pipelines

Pick up the Slack with lakeFS