Webinar Lottie

lakeFS Acquires DVC, Uniting Data Version Control Pioneers to Accelerate AI-Ready Data

webcros

Thought Leadership

Thought Leadership

What a Year With lakeFS Taught Me

Michal Wosk

A year ago, in November 2021, I took a bold decision to leave my role at Microsoft and join team lakeFS as VP Marketing. Upon joining I wrote an article on why I decided to join. But just before publishing the piece, I had a change of heart; it didn’t feel right to me. It […]

Data Engineering Thought Leadership

4 Ways to Reduce Cloud Data Storage Costs

Oz Katz

In the past year, words like recession, business slowdown and monetary cuttings are being heard more and more often. Not just in the economic press and in the media, these discussions are very much heard also in almost all companies – within boardrooms, in management meetings and when engaging with potential investors and customers. As

Data Engineering Thought Leadership

Data Mesh: What is it and What Does it Mean for Data Engineers?

The lakeFS Team

Organizations have practically always needed data analytics, and they jumped on the analytics bandwagon as soon as the first computers appeared on the scene. In the 80s, businesses built data warehouses using relational databases as their decision-support systems (DSS). However, as companies generated more diverse data at high velocity, relational databases showed their limitations.  This

Data Engineering Thought Leadership

Data versioning as your ‘Get out of jail’ card – DVC vs. Git-LFS vs. dolt vs. lakeFS

Einat Orr, PhD

Data Versioning at Scale: Solutions Overview Back when I was a 23-year-old student, I worked at an Israeli networking company as a BI analyst in the Operations department. My job revolved around modeling the company’s inventory which was quite costly and needed optimization.  At some point, I attended a meeting of the company’s management. When

Data Engineering Thought Leadership

The State of Data Engineering 2022

Einat Orr, PhD

A year has passed since we shared the State of Data Engineering 2021. And since we released that article last May, not much has changed in the data landscape. In fact, we had discussions internally about whether we should even do an update for 2022. We kid. It was another year worthy of its own

Thought Leadership

lakeFS in Search of a Role Model

Einat Orr, PhD

Who needs a role model? When we first launched lakeFS in August of 2020, we asked ourselves a simple question: What does success look like? And how will we know we’re doing the right things to get there?   Of course with thousands of installations, a thriving user community, active developer contributions, and exponential growth on

Thought Leadership

lakeFS’ First Birthday – The Story

Oz Katz, Einat Orr, PhD

The idea About two years ago, we left SimilarWeb, where we had the privilege of managing some of the most interesting and complex data architecture projects in the world. The architecture was centered around a Data Lake – 7 Petabytes of data on Amazon S3. On top, hundreds of hourly, daily and monthly jobs running,

Data Engineering Thought Leadership

The State of Data Engineering in 2021

Einat Orr, PhD

Let’s start with the obvious: the lakeFS project doesn’t exist in isolation. It belongs to a larger ecosystem of data engineering tools and technologies adjacent and complementary to the problems we are solving. What better way to visualize our place in this ecosystem, I thought, than by creating a cross-sectional LUMAscape to depict it. What’s

Data Engineering Thought Leadership

Why Data Versioning as an Infrastructure Matters

Einat Orr, PhD

The demand for infrastructure that contributes to the collection, storage, and analysis of data is growing with the increasing amounts of data managed by organizations. Every organization that manages data pipelines to extract insights from data encounters the need for reproducibility, safe experimentation, and means to ensure data quality. The path to answering these needs

lakeFS