lakeFS Acquires DVC, Uniting Data Version Control Pioneers to Accelerate AI-Ready Data

Learn from AI, ML & data leaders from Dell, Lockheed Martin, Red Hat & more

The lakeFS Blog

AI Data Infrastructure: Components, Challenges & Best Practices

A solid AI data infrastructure is a key enabler for teams looking to efficiently deploy ML applications. It delivers the fundamental features required to enable

What is a Data Registry? Benefits, Use Cases & Best Practices

One of the major pain points in ML is the lack of transparency, consistency, and control over data assets. Without a centralized system, teams often

Bound by Physics: Why Data Version Control is Critical for Real-World AI

TL;DR Software-only systems can be rerun from the source, but physics-bound workflows face a tougher challenge. Once a moment is gone, it’s gone. Sensor drift,

Versioning Data Labels: Integrating Labeling Tools with lakeFS

In this post, we explore how lakeFS can integrate with popular data labeling solutions, the differences between labeling tools’ built-in dataset management and lakeFS data

Unified Data Management: Types, Challenges & Best Practices

Historically, companies have developed their IT systems on an ad hoc basis, installing various software and taking on data management approaches as their needs changed.

What is Metadata Filtering? Benefits, Best Practices & Tools

Vector databases are a critical enabler for expanding the use of LLMs. They power applications such as Retrieval Augmented Generation (RAG), pattern matching, anomaly detection,

How lakeFS Helps Ensure Data Compliance

Data compliance is all about adhering to laws, regulations, standards, and internal policies regarding data use. Organizations must comply with regulations like the General Data

Versioned Data with Apache Iceberg Using lakeFS Iceberg REST Catalog

lakeFS Enterprise offers a fully standards-compliant implementation of the Apache Iceberg REST Catalog, enabling Git-style version control for structured data at scale. This integration allows

What is Data Compliance? Tools, Benefits & Key Metrics

Organizations deal with ever-increasing volumes of data. More data translates into more risk, as hackers have a larger target area. This is where data compliance

OpenAI’s Open Source Revolution: Why Enterprise AI Infrastructure Matters More Than Ever

Yesterday, OpenAI launched gpt-oss-120b and gpt-oss-20b, marking the company’s first open-weight models since GPT-2 in 2019. This strategic shift represents far more than a product

How We Built Our lakeFS Iceberg Catalog

A behind-the-scenes look at the design decisions, architecture, and lessons learned while bringing the Apache Iceberg REST Catalog to lakeFS. When we first announced our
