The lakeFS Blog

Manage your data lifecycle at scale with lakeFS Enterprise

From Glue-on-Pizza to Provenance: A Practical Guide to Reproducible AI

The now-infamous “pizza with glue” AI result is a symptom of something deeper than one bizarre edge case. When AI systems fail, the root cause

Oz Katz
June 29, 2026

Why AI Sovereignty Is Becoming a Strategic Imperative

AI raises a question most organizations haven’t answered yet: who really controls the foundation? In a recent presentation at the AI-Ready Data Summit, Matthew Miller,

Iddo Avneri
June 22, 2026

Unity Catalog and the Quiet Return of Vendor Lock-In

Databricks built its reputation on openness. Spark. Delta Lake. MLflow. A company that rose by betting on open ecosystems over proprietary silos. Which is why

Oz Katz
June 18, 2026

Data Agents: How to Build Reliable Enterprise AI Workflows on Trusted Data

Data agents are fast becoming the operating layer of enterprise AI – automating analysis, managing workflows, obtaining context, and acting across production systems. Headless agents

Tal Sofer
June 17, 2026

Driving End-User Adoption of AI-Ready Data Infrastructure

First presented at the AI-Ready Data Summit, this talk tackled the part of AI-ready data that tooling alone can’t solve: getting busy people to actually

Joe Pringle
June 15, 2026

Data Lake Mount for Efficient Data Sharing & Versioned Lake Management

Mounting object storage as a filesystem is the fastest way to get a notebook or Spark job reading S3, Azure Data Lake Storage, or GCS

Oz Katz
June 11, 2026

Agentic AI Will Make or Break on the Data Layer. Meet lakeFS for Agentic AI

For the past few years, the hard work in AI has gone into models. Organizations spent that time learning, experimenting, and building the best models

Gottfried Sehringer
June 10, 2026

GxP-Aligned by Design: How lakeFS Brings Compliance Discipline to AI-Ready Data in Life Sciences

AI is moving fast in life sciences. GxP is not. The teams that close that gap first get treatments to market faster. Pharma, biotech, and

Vince Antinozzi
June 7, 2026

Multimodal Data Integration: Architecture, Challenges & Best Practices

As AI systems scale, data bottlenecks for AI projects quickly become one of the key barriers to model development and deployment. Slow pipelines, inconsistent datasets,

Idan Novogroder
May 28, 2026

AI-Ready Data Explained: The Pillars, Challenges, and Process

AI-ready data is often misunderstood, dismissed as just another layer of hype on top of familiar practices like data quality. But that assumption misses something

Einat Orr, PhD
May 26, 2026

Lessons Learned Building an AI Factory from Lockheed Martin

Most organizations today are experimenting with AI, but few have built the systems needed to make AI repeatable, scalable, and genuinely useful in production. That’s

Gottfried Sehringer
May 13, 2026
