Headless agents are coming for your data. Be ready with lakeFS

Thought Leadership

Best Practices Thought Leadership

AI-Ready Data Explained: The Pillars, Challenges, and Process

Einat Orr, PhD

AI-ready data is often misunderstood, dismissed as just another layer of hype on top of familiar practices like data quality. But that assumption misses something important. There is a real shift happening in how data needs to be prepared, structured, and managed to support modern AI systems. AI-ready data is critical to the success of […]

Best Practices Thought Leadership

Lessons Learned Building an AI Factory from Lockheed Martin

Gottfried Sehringer

Most organizations today are experimenting with AI, but few have built the systems needed to make AI repeatable, scalable, and genuinely useful in production.  That’s where Lockheed Martin stands apart. In a recent presentation at the AI-Ready Data Summit, Thomas Vander Wal shared how Lockheed Martin built what they call an AI Factory. Not a

Product Thought Leadership

Introducing the AI-Ready Data Summit

Gottfried Sehringer

Free Virtual Event for Enterprise AI Leaders Building AI that works in production is hard. For most enterprise teams, the biggest obstacle isn’t the model, it’s the data behind it. Study after study shows that organizations abandon AI projects due to poor data quality and inadequate data infrastructure. The gap between AI ambition and AI

Best Practices Product Thought Leadership

Git-Style Workflows for Multimodal AI Data Using Dremio and lakeFS

Alex Merced, Tal Sofer

This post recaps a comprehensive tutorial published by Alex Merced from Dremio and Tal Sofer from lakeFS, highlighting how version control transforms multimodal data management for AI teams. The Challenge: Keeping Diverse Data Types in Sync and Queriable Modern AI pipelines consume more than just structured data. Training sets include images, model artifacts, logs, and

Product Thought Leadership

A Celebration of Shared Vision: lakeFS 🫶 DVC

Einat Orr, PhD

From Inspiration to Action When we were still dreaming up lakeFS, one of the projects that inspired us was DVC (Data Version Control). It was one of those moments when you realize – “Ah, others see it too.” We weren’t alone in believing that data should be managed like code. DVC was built by data

Product Thought Leadership

lakeFS Named a Representative Vendor in the 2025 Gartner® Market Guide for DataOps Tools

Gottfried Sehringer

We’re excited to share that lakeFS has been named a Representative Vendor in the 2025 Gartner® Market Guide for DataOps Tools. We believe this recognition reflects what we’re seeing across the industry: the urgent need for data infrastructures that can provide AI-ready data efficiently, repeatably, and safely as organizations build production AI systems. DataOps Market

Best Practices Machine Learning Thought Leadership

OpenAI’s Open Source Revolution: Why Enterprise AI Infrastructure Matters More Than Ever

Gottfried Sehringer

Yesterday, OpenAI launched gpt-oss-120b and gpt-oss-20b, marking the company’s first open-weight models since GPT-2 in 2019. This strategic shift represents far more than a product release—it signals a fundamental transformation in how large organizations, particularly in regulated industries, approach AI infrastructure and data management. OpenAI’s Strategic Return to Open Source The gpt-oss models—gpt-oss-120b and gpt-oss-20b—are

Best Practices Product Thought Leadership

The Evolving Equation: When Do You Move From Open Source to Enterprise with Data Version Control

Tal Sofer

Open source software has fundamentally reshaped technology—delivering unmatched flexibility, low friction, and rapid innovation. For some teams, it’s a philosophical commitment. For others, it’s the fastest path to building. lakeFS supports both models. For most data teams, the journey starts with open source and evolves over time. lakeFS open source offers a robust foundation for

We use cookies to improve your experience and understand how our site is used.

Learn more in our Privacy Policy