Extractors
Extractors are used to fetch, extract and convert a source data set into a PySpark DataFrame. Exemplary extraction sources are JSON Files on file systems like HDFS, DBFS or EXT4 and relational database systems via JDBC.
- class Extractor[source]
Base Class of Extractor Classes.
- logger
Shared, class level logger for all instances.
- Type
Create your own Extractor
Please see the Create your own Extractor for further details.