Headless agents are coming for your data. Be ready with lakeFS

Thought Leadership

Data Engineering Machine Learning Thought Leadership

The State of Data and AI Engineering 2025

Einat Orr, PhD

Since 2021, we’ve published the annual State of Data Engineering Report, which includes a summary of all key categories that directly impact data engineering infrastructure. In 2025, we see five primary trends that influence the categories that will be covered in this report. Trend #1: MLOps space is slowly diminishing The MLOps space is slowly

Machine Learning Thought Leadership

Distributed Data Management is Broken – Here’s Why You Should Care

Tal Sofer

In today’s data-driven world, businesses don’t just rely on data – they are built on it. But as data infrastructure sprawls across on-prem systems, multiple cloud providers, and third-party platforms, a new challenge is taking center stage: distributed data management. It’s a silent bottleneck with loud consequences. Challenges in Distributed Data Management  Managing data across

Thought Leadership

The Road Forward Is the Road Back: My Return to Treeverse

Barak Amar

When I first joined lakeFS by Treeverse in 2020, we were just four engineers building an open-source solution for data versioning. It was exhilarating—being part of something from the ground up, shaping the product, and seeing it grow. But after four years, something changed. The excitement faded, and I felt like I was running in

Best Practices Product Thought Leadership

Dataset Versioning in the Age of Open Table Formats

Tal Sofer

Originally presented at Big Data LDN 2024. More than two decades ago, data warehouses outgrew the capacity of single machines, and scaling them started to become costly or inefficient. This prompted the tech industry to rethink the architecture and start to use distributed systems. If we wanted to store more data, we just bought more

Data Engineering Machine Learning Thought Leadership

The State of Data Engineering 2024

Einat Orr, PhD

Since 2021 we’ve been releasing the annual State of Data Engineering Report, a compilation of all the relevant categories that have a direct impact on data engineering infrastructure. In 2024, we see 3 primary trends that influence the categories which will be covered in this report. Trend #1: GenAI influence on software infrastructure As predicted

Data Engineering Machine Learning Thought Leadership

Why Is DataOps So Hard and What Tools Make It Easier?

Einat Orr, PhD

TL;DR: DataOps complexity arises from unclear R&Rs, a lack of standardization in interfaces, distributed technology complexities, and difficulties in implementing engineering best practices. The solution is to define clear responsibilities, address missing requirements, and manage data pipelines efficiently using emerging solutions that enhance the manageability and resilience of DataOps. What makes DataOps so hard is,

Data Engineering Thought Leadership

Data Engineering in 2024: Predictions

Oz Katz

This article was originally published on Datanami and is republished here with permission. As we officially kick off 2024, I realized I have a few thoughts on the direction of the data landscape that might be of interest to others.  This is a recap of my “predictions.”  I will admit that it’s a mix of what I

Product Thought Leadership

Major Milestone: lakeFS 1.0 Is Now Generally Available

Oz Katz

October 24, 2024 — TL;DR lakeFS 1.0 is generally available. You can upgrade to this version via the Assets here. Just over three years ago we publicly released the initial lakeFS. Our mission when we first launched lakeFS was to provide data practitioners with a scalable data version control system that would bring order to

We use cookies to improve your experience and understand how our site is used.

Learn more in our Privacy Policy