Welcome to Spooq’s documentation!¶
Spooq is your PySpark based helper library for ETL data ingestion pipeline in Data Lakes.
Extractors, Transformers, and Loaders are independent components which can be plugged-in into a pipeline instance or used separately.
Table of Content¶
- Installation / Deployment
- Spooq Base
- Setup for Development, Testing, Documenting
- Architecture Overview