Extractors
Extractors fetch, extract, and convert a source data set into a PySpark DataFrame. Example sources include JSON files on file systems such as HDFS, DBFS, or EXT4, and relational database systems accessed via JDBC.
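To make the idea concrete, here is a minimal sketch of what an extractor for each of the two source types could look like. The class names (`Extractor`, `JsonFileExtractor`, `JdbcTableExtractor`) and their constructor parameters are hypothetical and chosen for illustration only; they are not the classes shipped in this package. The only library calls assumed are PySpark's `DataFrameReader.json` and `DataFrameReader.jdbc`, reached through an existing `SparkSession` that the caller passes in.

```python
from abc import ABC, abstractmethod


class Extractor(ABC):
    """Hypothetical base class: an extractor returns a PySpark DataFrame."""

    @abstractmethod
    def extract(self):
        """Fetch the source data and return it as a DataFrame."""


class JsonFileExtractor(Extractor):
    """Illustrative extractor for JSON files on a file system (HDFS, DBFS, ...)."""

    def __init__(self, spark, input_path):
        self.spark = spark            # an existing pyspark.sql.SparkSession
        self.input_path = input_path  # path or glob pointing at the JSON files

    def extract(self):
        # DataFrameReader.json expects one JSON object per line by default.
        return self.spark.read.json(self.input_path)


class JdbcTableExtractor(Extractor):
    """Illustrative extractor for a table in a relational database via JDBC."""

    def __init__(self, spark, url, table, properties=None):
        self.spark = spark
        self.url = url                      # JDBC connection URL
        self.table = table                  # table name to load
        self.properties = properties or {}  # e.g. user, password, driver

    def extract(self):
        return self.spark.read.jdbc(
            url=self.url, table=self.table, properties=self.properties
        )
```

With a running `SparkSession` named `spark`, calling `JsonFileExtractor(spark, input_path).extract()` would return the DataFrame for the files at `input_path`. Keeping the session as a constructor argument is a design choice of this sketch: it makes each extractor easy to exercise against a stub session in tests.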
Class Diagram of Extractor Subpackage
Create your own Extractor
Please see the "Create your own Extractor" guide for further details.