The lakeFS Blog
Manage your data lifecycle at scale with lakeFS Enterprise
Data is the foundation of every organization, so ensuring it’s reliable and consistent is essential to driving informed decisions and sustainable growth. This
- Idan Novogroder
Originally presented at Big Data LDN 2024. More than two decades ago, data warehouses outgrew the capacity of single machines, and scaling them started to
- Tal Sofer
In modern software development, Pull Requests (PRs) are a fundamental tool for collaborating on code. They allow teams to review, discuss, and merge changes in
- Oz Katz, Itai Gilo
Data versioning is a central aspect of modern data management, especially in the context of GenAI and machine learning. Teams need a solution to version
- Amit Kesarwani
A team looking to build an application that uses a large language model (LLM) like OpenAI’s GPT-4 or Meta’s Llama 2 will inevitably run into
- Idan Novogroder
This article focuses on how to work with Nessie Catalog. Please note that since its first publication, fundamental support for Iceberg REST Catalog has been
- Tal Sofer
Promoting ETL code for production is a straightforward process. We have our code – usually stored in Git – and want to build and test
- Amit Kesarwani
Once you start using lakeFS, the files on your object store will form a new representation. The names and paths of the files on the
- Iddo Avneri
What is a mount? A filesystem mount is the ability to present a local device or a remote location as a local directory. It is
- Amit Kesarwani
Retrieval Augmented Generation (RAG) is on its way to becoming the dominant framework for implementing enterprise applications based on Large Language Models (LLMs). However, implementing
- Idan Novogroder
Apache Iceberg is the most popular open table format. It originated at Netflix due to the need to provide a table representation for data saved
- Tal Sofer