Webinar Lottie

lakeFS Acquires DVC, Uniting Data Version Control Pioneers to Accelerate AI-Ready Data

webcros

Learn from AI, ML & data leaders from Dell, Lockheed Martin, Red Hat & more

Machine Learning

Data Engineering Machine Learning Thought Leadership

The State of Data and AI Engineering 2025

Einat Orr, PhD

Since 2021, we’ve published the annual State of Data Engineering Report, which includes a summary of all key categories that directly impact data engineering infrastructure. In 2025, we see five primary trends that influence the categories that will be covered in this report. Trend #1: MLOps space is slowly diminishing The MLOps space is slowly […]

Data Engineering Machine Learning

What Is an AI Factory and How Does It Work?

Tal Sofer

During the 2025 Nvidia GTC conference, one of the keywords that drew a lot of attention was “AI factory.” An AI factory is Nvidia’s idea for producing large-scale AI systems. This concept aligns AI development with the industrial process, in which raw data is received, improved through computation, and converted into valuable products via data-driven

Best Practices Machine Learning Product Tutorials

A Single Pane of Glass to Your Data: Multiple Storage Backends Support in lakeFS

Tal Sofer

Today’s organizations don’t just use a single data storage solution – they operate across on-prem servers, multiple cloud providers, and hybrid environments. This distributed approach has become necessary, but it comes with significant costs: teams struggle with siloed tools, duplicated processes, and an endless cycle of environment management that diverts focus from delivering actual value. 

Best Practices Data Engineering Machine Learning

6 Types of Metadata: Examples, Tools & Frameworks

Idan Novogroder

With the volumes of generated data increasing, metadata has become an essential component in organizing and comprehending massive datasets. Metadata plays a key role in any modern data strategy, especially among organizations that treat data as one of their most precious assets. This article dives into all the different metadata types, tools, and frameworks to

Machine Learning Thought Leadership

Distributed Data Management is Broken – Here’s Why You Should Care

Tal Sofer

In today’s data-driven world, businesses don’t just rely on data – they are built on it. But as data infrastructure sprawls across on-prem systems, multiple cloud providers, and third-party platforms, a new challenge is taking center stage: distributed data management. It’s a silent bottleneck with loud consequences. Challenges in Distributed Data Management  Managing data across

Best Practices Machine Learning

What is AI Data Storage? Benefits, Challenges & Best Practices

Tal Sofer

Many companies are modernizing their data storage infrastructure to capitalize on the opportunities of machine learning (ML) and advanced analytics. However, teams face several unique data management challenges such as the increasing time required for AI training and inference workloads, as well as the cost and scarcity and resources, particularly GPUs. Storage is a key

Machine Learning

lakeFS for LLM Development

Nadav Steindler

Training a Large Language Model (LLM) such as ChatGPT or DeepSeek is a complicated, data-intensive process. As a young discipline, best practices and tool chains are still emerging and being formulated for LLM training. lakeFS has a lot to contribute to this field, as a highly scalable data version control system. In this article, we

Best Practices Machine Learning

AI Agents in Business and Automation

Amit Kesarwani

This article discusses AI Agents in business and automation, focusing on building an AI Agent using lakeFS, LangChain, OpenAI, and FAISS (Facebook AI Similarity Search) to answer questions based on documents. It explains what AI Agents and LangChain are, and how lakeFS is used for data version control. The article also provides an example of

Best Practices Machine Learning

Metadata Management Tools: Types, Features & Benefits

Tal Sofer

Managing complex and massive data sets is tricky but metadata management tools can help teams keep their data in shape. Metadata management has become critical in data strategies created by organizations that treat data as an important asset. In this article, we dive into metadata management and give you an overview of tools teams use

We use cookies to improve your experience and understand how our site is used.

Learn more in our Privacy Policy