Webinar Lottie

lakeFS Acquires DVC, Uniting Data Version Control Pioneers to Accelerate AI-Ready Data

webcros

Learn from AI, ML & data leaders

March 31, 2026  |  Live

Best Practices

Best Practices Machine Learning

AI Agents in Business and Automation

Amit Kesarwani

This article discusses AI Agents in business and automation, focusing on building an AI Agent using lakeFS, LangChain, OpenAI, and FAISS (Facebook AI Similarity Search) to answer questions based on documents. It explains what AI Agents and LangChain are, and how lakeFS is used for data version control. The article also provides an example of

Best Practices Machine Learning

Metadata Management Tools: Types, Features & Benefits

Tal Sofer

Managing complex and massive data sets is tricky but metadata management tools can help teams keep their data in shape. Metadata management has become critical in data strategies created by organizations that treat data as an important asset. In this article, we dive into metadata management and give you an overview of tools teams use

Best Practices Machine Learning

What is Metadata? Examples, Benefits & Best Practices

Tal Sofer

What is the key element that guarantees all data published on portals is discoverable, comprehensible, reusable, and interoperable for people and technology like AI? You guessed right; it’s metadata. Metadata also plays a key role in data governance and management. According to Gartner,  organizations that fail to adopt a metadata-driven strategy for IT modernization might

Best Practices Machine Learning Product

The Holy Trinity of ML Reproducibility

Oz Katz

Reproducibility is a fundamental challenge in building reliable machine learning (ML) models and AI applications.  It’s not just about debugging a model when it fails in production; it’s also about ensuring that experiments are consistent, avoiding unintended variance, and making incremental progress with confidence.  Without reproducibility, ML teams risk wasting time on unreliable results and

Best Practices Product Tutorials

How to Avoid Data Breaches by using RBAC 

Amit Kesarwani

Introduction Role-Based Access Control (RBAC) is an effective way to minimize the risk of data breaches by ensuring users only have access to the data and systems necessary for their job roles. Here’s how you can use RBAC to avoid data breaches: 1. Principle of Least Privilege (PoLP) 2. Define Clear Roles and Responsibilities 3.

Best Practices Machine Learning

What is GPU Utilization? Benefits & Best Practices

Tal Sofer

GPUs are blazingly fast, but many teams struggle to keep them running at peak performance. A recent poll on AI infrastructure shows that maximizing GPU use is a top priority, and data from Weights & Biases reveals that roughly a third of GPUs are at less than 15% usage, which is low. The good news

Best Practices Product

Easier GDPR With lakeFS

Iddo Avneri

The General Data Protection Regulation (GDPR) imposes strict requirements on how organizations collect, store, and manage personal data. Businesses must ensure data security, auditability, and access control while minimizing unnecessary data duplication. However, traditional data management practices often make compliance challenging—especially when handling large-scale datasets used in AI and analytics. lakeFS, an enterprise-grade data versioning

Best Practices

Top 12 Data Science Tools to Consider in 2026

Idan Novogroder

The growing volume and complexity of organizational data and its critical role in decision-making inspire organizations to invest in people, processes, and technology to unlock value from data assets.  Data science teams can choose from diverse tools and platforms to build their portfolios. Here’s a list of the 12 most widespread data science tools data

Best Practices Data Engineering Machine Learning

Top Data Lineage Tools for 2025 and Their Benefits

Iddo Avneri

Data lineage tools make it easier for teams to track the transfer of data across several systems, databases, and applications. Ultimately, this translates into better capabilities around understanding and handling data.  But how do you choose the best data lineage solution for your organization? This article dives into the most widespread data lineage tools to

We use cookies to improve your experience and understand how our site is used.

Learn more in our Privacy Policy