ADP documentation
Date: Mar 18, 2026 Version: 3.0.0
adp is a suite of tools to ingest, deliver and maintain data using OSS pyspark.
The main components of adp include:
Core Module: Reusable components for building data products. These components can are mainly used in the other three modules but can also be directly used by data engineers and data scientists.
Ingestion Module: Tools for ingesting data from various sources into a Azure Data Lake Storage (ADLS). Configurable using YAML files.
Delivery Module: For transforming, reading, and writing data in a consistent manner.
Maintenance Module: Utilities for optimizing, cleaning, and managing data lifecycle.
Release Notes
Functional Documentation
Technical Documentation