Ready to dive into the lake?

lakeFS is currently only
available on desktop.

For an optimal experience, provide your email below and one of our lifeguards will send you a link to start swimming in the lake!

lakeFS Community

Media Mentions

Lakefs brings Git-like version control to virtual dataset copies

Jamil Ahmad

Israeli startup Treeverse is developing dataset copy management and version control for data pipeline builders with its open-source Lakefs product. Analytics and AI/ML data supply pipelines depend upon consistent, repeatable and reliable delivery of clean data sets extracted from source data lakes. Such pipelines are equivalent to software programs and they take effort and time …

Lakefs brings Git-like version control to virtual dataset copies Read More »

Data Version Control: The Enabler Of Data Engineering Best Practices

Jamil Ahmad

Data is the backbone of every business organization today, and its importance will only grow in 2023. There have been a lot of discussions lately about adopting version control practices for data. Many engineers believe that data version control is the obvious next step that would transform data pipelines from something that organizations maintain to …

Data Version Control: The Enabler Of Data Engineering Best Practices Read More »

Top Data Version Control Tools for Machine Learning Research in 2022

Jamil Ahmad

All systems used for production must be versioned. A single location where users can access the most recent data. An audit trail must be created for any resource that is often modified, especially when numerous users are making changes at once. To ensure everyone on the team is on the same page, the version control …

Top Data Version Control Tools for Machine Learning Research in 2022 Read More »

LakeFS Open Source – interview with Einat Orr, Co-founder and CEO

Jamil Ahmad

As the value and volume of data continues to surge across industries, the need to improve the management and reliability of open-source data analytics solutions is critical. Developed at the request of its open-source users, lakeFS Cloud relieves users from assigning internal resources to maintain the infrastructure for their extensive technology stacks. Treeverse CEO and …

LakeFS Open Source – interview with Einat Orr, Co-founder and CEO Read More »

LakeFS brings branching to data lakes

Jamil Ahmad

On June 27, the company announced general availability of their service, LakeFS Cloud. Teams will be able to use it to follow the evolution of various versions of their data just as they do with different versions of their code.

Unlock The Full Business Value Of Data With A Better Engineering Process

Jamil Ahmad

The volume of data that organizations handle is growing faster than their engineering capabilities. On top of being resource-consuming and expensive, hiring more engineers doesn’t always solve the problem. In fact, it might make things worse — the more people in a team, the greater the risk of misunderstandings. Optimizing existing processes is a proven …

Unlock The Full Business Value Of Data With A Better Engineering Process Read More »

Git for Data – lakeFS

  • Get Started
    Get Started
  • The annual State of Data Engineering Report is now available. Find out what’s new in 2023 -

    +