Messing with AWS Endpoint URLs
- Paul Singman
- April 20, 2021
It makes perfect sense that if you type aws s3 ls s3://my-bucket to list the contents of an S3 bucket, you would expect to
Hudi, Iceberg and Delta Lake: Data Lake Table Formats Compared
- Oz Katz
- April 12, 2021
Introduction When building a data lake, there is perhaps no more consequential decision than the format data will be stored in. The outcome will
Why I’m Joining lakeFS
- Paul Singman
- April 6, 2021
Thoughts on a personal journey into the world of developer advocacy at an open-source data project. In March of 2021, I chose to leave the
3 Data Lake Anti-Patterns to Avoid
- Paul Singman
- March 30, 2021
Rid yourself of these troubling habits and start the journey towards data lake mastery! Introduction Data lakes offer tantalizing performance upside, which is a major
Data Lakes: The Definitive Guide
- Paul Singman
- March 22, 2021
What is a Data Lake? A data lake is a system of technologies that allow for the querying of data in file or blob
Power Amazon EMR Applications with Git-like Operations Using lakeFS
- Itai Admi
- March 9, 2021
This article will provide a detailed explanation on how to use lakeFS with Amazon EMR. Today it’s common to manage a data lake using
Data Engineering
Messing with AWS Endpoint URLs
Paul Singman April 20, 2021
It makes perfect sense that if you type aws s3 ls s3://my-bucket to list the contents of an S3 bucket, you would...
Data Engineering
Hudi, Iceberg and Delta Lake: Data Lake Table Formats Compared
Oz Katz April 12, 2021
Introduction When building a data lake, there is perhaps no more consequential decision than the format data will be stored in. The...
People
Why I’m Joining lakeFS
Paul Singman April 6, 2021
Thoughts on a personal journey into the world of developer advocacy at an open-source data project. In March of 2021, I chose to...