lakeFS Acquires DVC, Uniting Data Version Control Pioneers to Accelerate AI-Ready Data

The lakeFS Blog

AI data storage

What is AI Data Storage? Benefits, Challenges & Best Practices

Many companies are modernizing their data storage infrastructure to capitalize on the opportunities of machine learning (ML) and advanced analytics. However, teams face several unique

LLM Development

lakeFS for LLM Development

Training a Large Language Model (LLM) such as ChatGPT or DeepSeek is a complicated, data-intensive process. As a young discipline, best practices and tool chains

AI Agents in Business and Automation

This article discusses AI Agents in business and automation, focusing on building an AI Agent using lakeFS, LangChain, OpenAI, and FAISS (Facebook AI Similarity Search)

metadata management tools

Metadata Management Tools: Types, Features & Benefits

Managing complex and massive data sets is tricky but metadata management tools can help teams keep their data in shape. Metadata management has become critical

Local data preprocessing with lakeFS Mount

Preprocessing Data Locally with Zero Copy Using lakeFS

One of the capabilities of lakeFS is that you can use it to create isolated environments for experimentation or development. Let’s say we want to

Coming full circle: Barak returns to lakeFS

The Road Forward Is the Road Back: My Return to Treeverse

When I first joined lakeFS by Treeverse in 2020, we were just four engineers building an open-source solution for data versioning. It was exhilarating—being part

AI infrastructure in regulated sectors

How lakeFS Solves AI Infrastructure Challenges in Regulated Sectors

As companies race to adopt AI technology, many firms in highly-regulated fields such as Healthcare, Financial Services, and Defence are at risk of being left

What is metadata

What is Metadata? Examples, Benefits & Best Practices

What is the key element that guarantees all data published on portals is discoverable, comprehensible, reusable, and interoperable for people and technology like AI? You

ML reproducibility pillars

The Holy Trinity of ML Reproducibility

Reproducibility is a fundamental challenge in building reliable machine learning (ML) models and AI applications. It’s not just about debugging a model when it fails

Avoid data breaches using RBAC

How to Avoid Data Breaches by using RBAC 

Role-Based Access Control (RBAC) is an effective way to minimize the risk of data breaches by ensuring users only have access to the data
