Use Cases

CI/CD for Data

Jamil Ahmad
January 17, 2023

Dev Test environment | Rollback on Bad data | Continuous validation of data quality

Continuous validation of data quality through automated quality checks

Automate data quality checks within the data pipelines through hooks, so that bad data does not reach production.

The Main Ingredients: Fully automated. Best practice. Production data is protected.

How …

Reproducibility

Jamil Ahmad
January 17, 2023

Atomic rollback on the entire data to recover from bad data issues

Rapidly recover from data quality issues in production with atomic rollback on bad data, not just on a single table but on the entire data lake. …

ETL Testing

Jamil Ahmad
January 17, 2023

ETL Testing using an isolated DevTest Environment with Zero-Copy

Build and test ETLs freely, on top of production data, without copying anything or compromising on sample data.

The Main Ingredients: Zero copy. Test ETLs on entire data. Created instantly.

How it …

Git for Data – lakeFS
