ADP documentation

Date: Mar 18, 2026 Version: 3.0.0

adp is a suite of tools for ingesting, delivering, and maintaining data using open-source PySpark.

The main components of adp include:

  • Core Module: Reusable components for building data products. These components are mainly used by the other three modules but can also be used directly by data engineers and data scientists.

  • Ingestion Module: Tools for ingesting data from various sources into Azure Data Lake Storage (ADLS). Ingestion is configured using YAML files.

  • Delivery Module: Tools for transforming, reading, and writing data in a consistent manner.

  • Maintenance Module: Utilities for optimizing, cleaning, and managing data lifecycle.

Release Notes

Technical Documentation

adp.core

adp.ingest

adp.delivery

adp.maintain