Webinar Lottie

lakeFS Acquires DVC, Uniting Data Version Control Pioneers to Accelerate AI-Ready Data

webcros

Learn from AI, ML & data leaders

March 31, 2026  |  Live

The lakeFS Blog

Components of Machine Learning

Machine Learning Components: Elements & Classifications

Today, machines are able to emulate human intelligence through the use of artificial intelligence technology. Approaches such as machine learning, deep learning, natural language processing,

DataOps tools

Why Is DataOps So Hard and What Tools Make It Easier?

TL;DR: DataOps complexity arises from unclear R&Rs, a lack of standardization in interfaces, distributed technology complexities, and difficulties in implementing engineering best practices. The solution

How to toggle OpenAI model determinism

How to Toggle OpenAI Model Determinism

TL;DR In the previous blog, Introducing the LangChain lakeFS Loader, and sample notebook, we explained and demonstrated integration of lakeFS with LangChain and LLM models

Anatomy of a lakeFS Repository: Git for data

Anatomy of a lakeFS Repository: Practical Example of Git for Data

Git for data may sound odd at first. But using the Git logic and mechanisms for data lakes makes a lot of sense. After all,

Data preprocessing in machine learning

Data Preprocessing in Machine Learning: Steps & Best Practices

Data is a valuable asset to any company today. But can you really use this massive amount of data in its raw form to train

lakeFS + Unity Catalog integration tutorial

lakeFS + Unity Catalog Integration: Step-by-Step Tutorial

Efficient data management is a critical component of any modern organization.  As data volumes grow and data sources become more diverse, the need for robust

lakeFS Samples Getting Started

lakeFS Samples: The Quickest Way to Get Started

lakeFS is a powerful solution for data version control that enables data practitioners to manage data as code using Git-like operations and achieve reproducible, high-quality

Transactional mirroring (cross-region mirroring)

Introducing lakeFS Transactional Mirroring (Cross-Region Mirroring)

What is mirroring We are pleased to announce a preview of a long-awaited lakeFS feature: transactional mirroring across regions. Mirroring builds on top of S3

Machine Learning Architecture Diagram

Machine Learning Architecture Diagram: Key Elements

Machine learning solutions come in handy for addressing various problems and achieving a wide range of goals. However, if we look at ML applications from

Databricks Workflows and dbt Cloud Jobs

How to Leverage Databricks Workflows to Implement dbt Cloud Jobs

In a previous article, we explored how dbt and Databricks work together. We will now review a specific feature of dbt Cloud – dbt Cloud

lakeFS: where is my data???

lakeFS: Where’s my data?

If you’ve come across our content, you may have noticed blogs diving into the technical details of lakeFS, and this is one of them. These

We use cookies to improve your experience and understand how our site is used.

Learn more in our Privacy Policy