Ready to dive into the lake?

lakeFS is currently only
available on desktop.

For an optimal experience, provide your email below and one of our lifeguards will send you a link to start swimming in the lake!

lakeFS Community

Amit Kesarwani

Tutorials

Authorization (RBAC) in lakeFS: Step-by-Step Configuration Tutorial

Amit Kesarwani
March 14, 2023

Introduction Last month, the lakeFS team decided to move from the decoupled security authentication and access control features to enable you to plug your own authentication and security mechanism. Consequently, the team decided to change the architecture to a pluggable one which enables you to choose your preference without being dependent on the lakeFS solution. …

Authorization (RBAC) in lakeFS: Step-by-Step Configuration Tutorial Read More »

Tutorials

The Airflow and lakeFS Integration: Step-by-Step Configuration Tutorial

Amit Kesarwani
March 14, 2023

Introduction lakeFS makes creating isolated environments for data ingestion instantaneous so you can run data ingestion jobs without impacting your production data and merge ingested data atomically to your production data instantaneously. This frees you from spending time on environment maintenance and makes it possible to create as many environments as needed. If ingested data …

The Airflow and lakeFS Integration: Step-by-Step Configuration Tutorial Read More »

Data Engineering Use Cases

How to Develop Spark ETL Pipelines in Isolation

Amit Kesarwani, Vino SD, Iddo Avneri
November 7, 2022

You’re bound to ask yourself this question at some point: Do I need to test the Spark ETLs I’m developing? The answer is yes; you certainly should – and not just with unit testing but also integration, performance, load, and regression testing. Naturally, the scale and complexity  of your data set matters a lot, so …

How to Develop Spark ETL Pipelines in Isolation Read More »

Git for Data – lakeFS

  • Get Started
    Get Started