Webinar Lottie

lakeFS Acquires DVC, Uniting Data Version Control Pioneers to Accelerate AI-Ready Data

webcros

Learn from AI, ML & data leaders from Dell, Lockheed Martin, Red Hat & more

Best Practices

Best Practices Machine Learning

Iceberg Branching Best Practices for Reliable Data Operations

Itai Gilo

Modern data operations call for more than just lightning-fast queries and scalable storage. Safety, reproducibility, and control are all key parts of the equation.  As Iceberg becomes the foundation for analytical and AI-driven applications, how you handle changes to your tables will determine whether your data platform is resilient or unstable. Iceberg branching is a

Best Practices Data Engineering Machine Learning

lakeFS Top 10 Defining Product Milestones in 2025

Oz Katz

2025 was a defining year for lakeFS. Across open source and Enterprise editions, we shipped major capabilities that expanded lakeFS from a powerful data versioning layer into a control plane for AI-Ready Data – spanning structured and unstructured data, multiple public and private clouds, and a growing ecosystem of analytics and ML engines. Here’s our

Best Practices Product

How CytoReason Streamlined Nextflow with lakeFS for Smarter Data Pipelines

Ron Poches

TL;DR CytoReason is a technology company transforming biopharma’s decision-making—from trial and error to data-driven—through its AI platform of computational disease models. Leveraging an extensive database of public and proprietary data, the company maps human diseases tissue by tissue and cell by cell. Researchers at leading pharma companies, including Pfizer and Sanofi, rely on CytoReason’s technology

Best Practices Data Engineering Machine Learning

Building a Data Center of Excellence for Modern Data Teams

Einat Orr, PhD

Sooner or later, every data team will reach a point where things stop working – whether it’s due to team growth, changing business requirements, or advancing pipeline complexity. When facing these issues, leaders start considering a different approach that perfectly balances centralized and decentralized organizational models. A Data Center of Excellence (DCoE) is a centralized

Best Practices Machine Learning

Iceberg Tables Management: Processes, Challenges & Best Practices

Itai Gilo

We all love data lakes. They’re just perfect for storing massive volumes of structured, semi-structured, and unstructured data in native file formats. And they let us explore, refine, and analyze petabytes of data constantly pouring in from various sources. But there’s a caveat. The individual files in a data lake lack the necessary information for

We use cookies to improve your experience and understand how our site is used.

Learn more in our Privacy Policy