The lakeFS Blog
Filter by
Manage your data lifecycle at scale with lakeFS Enterprise
It makes perfect sense that if you type aws s3 ls s3://my-bucket to list the contents of an S3 bucket, you would expect to connect
- Paul Singman
Introduction to Data Lakehouse When building a data lake, there is perhaps no more consequential decision than the format data will be stored in. The
- Oz Katz
Thoughts on a personal journey into the world of developer advocacy at an open-source data project. In March of 2021, I chose to leave the data
- Paul Singman
Rid yourself of these troubling habits and start the journey towards data lake mastery! Introduction Data lakes offer tantalizing performance upside, which is a major reason
- Paul Singman
What is a Data Lake? A data lake is a system of technologies that allow for the querying of data in file or blob objects.
- Paul Singman
This article will provide a detailed explanation of how to use lakeFS with Amazon EMR. Today, it’s common to manage a data lake using cloud
- Itai Admi
Write-Audit-Publish (continuous integration/continuous deployment of data) is the process of exposing data to consumers only after ensuring it adheres to best practices such as format,
- Oz Katz
Introduction In our recent version of lakeFS, we switched to base metadata storage on immutable files stored on S3 and other common object stores.
- Ariel Shaqed (Scolnicov)
Introduction We recently released the first version of lakeFS supported by Pebble’s sstable library – RocksDB. The release introduced a new data model which is
- Itai Admi
Update (May 26th, 2021): We officially released the lakeFS Airflow provider. Read all about it in the latest blog post. In this post, we’ll see
- Guy Hardonag
The new Golang v1.16 embed directive helps us keep a single binary and bundle out static content. This post will cover how to work with
- Barak Amar