lakeFS Blog

Project

The lakeFS Katacoda Sandbox Environment – Interactive Data Versioning Learning

Guy Hardonag

If you’re interested in playing around and exploring lakeFS, you can now easily get started using the Katacoda demo, which provides a personalized sandboxed environment, all from your browser, without installing anything. lakeFS is an open source platform that delivers resilience and manageability to object-storage based data lakes. With lakeFS you can build repeatable, …

Data Engineering Project

How to Manage Your Data the Way You Manage Your Code

Einat Orr, PhD.

Fifty years ago, it was very hard to collaborate over code. When developing large-scale software projects, it was difficult to manage changes to source code over time, as revision control tools were only starting to enter mainstream computing. The adoption of version control tools, first centralized and then distributed, changed all that, and now …

Go Project

Improving Postgres Performance Tenfold Using Go Concurrency

Tzahi Yaacobovicz

In this article I will show how Go concurrency enabled us to cut through a daunting DB performance barrier. This blog post continues our journey to big data performance. The first post on this issue discussed in-process caching in Go. The Pain: lakeFS is a versioned directory over object stores like AWS S3 and GCS …

Go Project

In-process Caching In Go: Scaling lakeFS to 100k Requests/Second

Barak Amar

This is the first in a series of posts describing our journey of scaling lakeFS. In this post we describe how adding an in-process cache to our Go server sped up our authorization flow. Background: lakeFS is an open-source layer that delivers resilience and manageability to object-storage based data lakes. With lakeFS you can build …

Data Engineering

How to Pick the Right Postgres for your Application

Ariel Shaqed (Scolnicov)

Lots of applications require a Postgres database. Before you can install them, you will need a Postgres database. How do you pick the right Postgres for your application? There is a bewildering variety of possible ways to acquire a database running on a Postgres instance, but the biggest choice is “build or buy”: whether to …

Data Engineering Project

The Quick Guide for Running Presto Locally on S3

Guy Hardonag

This post aims to cover our experience running Presto in a local environment with the ability to query Amazon S3 and other S3-compatible systems. We will: Describe the components needed and how to configure them. Provide a dockerized environment you can run. Show an example of running the provided environment and querying a publicly …

Git for Data – lakeFS
