Transformers
Transformers take a DataFrame as an input, transform it accordingly
and return a DataFrame.
Each Transformer class has to have a transform method which takes the input DataFrame
and returns a DataFrame.
Possible transformations include selecting the most up-to-date record per ID, exploding an array, filtering (on an exploded array), applying basic threshold cleansing, or mapping the incoming DataFrame to a provided structure.
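The contract described above can be sketched as follows. This is a minimal illustration, not the library's implementation: the `Transformer` base class shown here is a stand-in, `NullIdRemover` and its `id_column` parameter are hypothetical names, and the sketch assumes that `transform` receives the input DataFrame, as the opening sentence describes. The only PySpark call used is `DataFrame.dropna(subset=...)`.

```python
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:  # imported for type hints only, so the module loads without Spark
    from pyspark.sql import DataFrame


class Transformer:
    """Stand-in base class (assumption): defines the transform contract."""

    def transform(self, input_df: DataFrame) -> DataFrame:
        raise NotImplementedError("Subclasses have to implement transform()")


class NullIdRemover(Transformer):
    """Hypothetical transformer: drops rows whose ID column is null."""

    def __init__(self, id_column: str = "id") -> None:
        self.id_column = id_column

    def transform(self, input_df: DataFrame) -> DataFrame:
        # dropna returns a new DataFrame; the input DataFrame is left untouched
        return input_df.dropna(subset=[self.id_column])
```

With a Spark session at hand, usage would look like `NullIdRemover("id").transform(df)`. Because every transformer takes a DataFrame and returns one, the output of one transformer can be passed directly to the next.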
- Annotator (Load and Update Column Comments)
- Exploder
- Sieve (Filter)
- Mapper
  - Class
  - Custom Transformations
    - spooq.transformer.mapper_transformations.as_is
    - spooq.transformer.mapper_transformations.to_num
    - spooq.transformer.mapper_transformations.to_bool
    - spooq.transformer.mapper_transformations.to_timestamp
    - spooq.transformer.mapper_transformations.str_to_array
    - spooq.transformer.mapper_transformations.map_values
    - spooq.transformer.mapper_transformations.meters_to_cm
    - spooq.transformer.mapper_transformations.has_value
    - spooq.transformer.mapper_transformations.apply
    - spooq.transformer.mapper_transformations.to_json_string
    - spooq.transformer.mapper_transformations.to_str
    - spooq.transformer.mapper_transformations.to_int
    - spooq.transformer.mapper_transformations.to_long
    - spooq.transformer.mapper_transformations.to_float
    - spooq.transformer.mapper_transformations.to_double
- Threshold-based Cleaner
- Enumeration-based Cleaner
- Null Cleaner
- Newest by Group (Most current record per ID)
Create your own Transformer
Please see "Create your own Transformer" for further details.