lakeFS Blog

Thought Leadership

MLOps Is Overfitting: Here’s Why

Einat Orr, PhD.

VC surveys show that the MLOps category has significantly expanded in the past few years, with hundreds of companies defining themselves as part of this dynamically evolving niche. MLOps systems provide the infrastructure allowing ML practitioners to manage the lifecycle of their work from development to production in a robust and reproducible manner. An MLOps …

Data Engineering

Managing Structured and Unstructured Data – a Guide for an Effective Synergy

Michal Wosk

Many organizations and companies are rapidly moving from managing only structured data sets to managing both structured and unstructured data. This is due to the growth in the number of sources and data types, which are rooted in the new variety of use …

Case Studies

How Windward Leverages lakeFS for Resilient Data Ingestion

Lior Resisi

Implementing CI/CD-inspired workflows built atop lakeFS operations prevents inconsistent data and brings increased reliability to our analytics platform. This article was originally published in Lior’s Medium Blog. At Windward, our Maritime Artificial Intelligence Analytics (MAIA™️) platform delivers predictive intelligence on global maritime conditions to hundreds of businesses. Customers across different industries — like oil & …

Case Studies

Improving Our Research Velocity With lakeFS

Ryan Green

This post was originally published in the Enigma blog. In every software engineering problem I’ve worked on, I’ve noticed a recurring tension between two highly desirable properties: flexibility and robustness. But in each situation, this tension manifests itself in different ways. At Enigma, our goal is to build a complete set of authoritative profiles of …

Announcements

lakeFS Product Offerings Overview: Open Source vs. Enterprise vs. Cloud

The lakeFS team

What is lakeFS? lakeFS is a platform that helps data engineers build scalable and resilient data lakes running on object storage. It provides version control, branching, and merging capabilities for data at petabyte scale, on or off premises. lakeFS enables teams to collaborate and manage data effectively by applying engineering best practices to data management. …

Tutorials

Authorization (RBAC) in lakeFS: Step-by-Step Configuration Tutorial

Amit Kesarwani

Introduction: Last month, the lakeFS team decided to decouple the security authentication and access control features so that you can plug in your own authentication and security mechanism. Consequently, the team decided to change the architecture to a pluggable one, which enables you to choose your preference without being dependent on the lakeFS solution. …

Tutorials

The Airflow and lakeFS Integration: Step-by-Step Configuration Tutorial

Amit Kesarwani

Introduction: lakeFS makes creating isolated environments for data ingestion instantaneous, so you can run data ingestion jobs without impacting your production data and then merge the ingested data atomically into production. This frees you from spending time on environment maintenance and makes it possible to create as many environments as needed. If ingested data …

Best Practices

How To Maintain Data Quality In Your Data Lake

The lakeFS team

Enterprises use more and more data as the foundation for their decisions and operations. The sheer number of digital goods that collect, analyze, and use data to feed decision-making algorithms in order to improve future services is also rapidly increasing. Because of this, data quality has become the most important asset for businesses in almost …

