dft

Source code: sensai/data_transformation/dft.py

class DataFrameTransformer[source]#

Bases: ABC, ToStringMixin

Base class for data frame transformers, i.e. objects which transform one data frame into another. A transformer may apply the transformation to the original data frame (in-place transformation) rather than return a new one. A data frame transformer may need to be fitted using training data before it can be applied.
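The fit/apply contract described above can be sketched with a small stand-in class. This is not sensAI code: `MeanCenteringSketch` is a hypothetical class that only mirrors the method names `fit`, `is_fitted`, `apply` and `fit_apply` listed below, and it assumes that applying an unfitted transformer should raise an error.

```python
import pandas as pd


class MeanCenteringSketch:
    """Hypothetical stand-in (not part of sensAI) mirroring the documented method names."""

    def __init__(self, columns):
        self.columns = list(columns)
        self._means = None  # learned during fit

    def fit(self, df: pd.DataFrame):
        # Learn the per-column means from the training data.
        self._means = df[self.columns].mean()

    def is_fitted(self) -> bool:
        return self._means is not None

    def apply(self, df: pd.DataFrame) -> pd.DataFrame:
        if not self.is_fitted():
            raise RuntimeError("fit must be called before apply")
        result = df.copy()  # work on a copy, i.e. no in-place transformation
        result[self.columns] = result[self.columns] - self._means
        return result

    def fit_apply(self, df: pd.DataFrame) -> pd.DataFrame:
        self.fit(df)
        return self.apply(df)


train = pd.DataFrame({"x": [1.0, 2.0, 3.0], "y": [10.0, 20.0, 30.0]})
centered = MeanCenteringSketch(["x"]).fit_apply(train)
```

Here `fit_apply` is simply `fit` followed by `apply` on the same data frame; whether the real base class implements it exactly this way is an assumption.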

get_name() → str[source]#

Returns: the name of this data frame transformer, which may be a default name if no name has been set.

set_name(name: str)[source]#

with_name(name: str)[source]#

apply(df: pandas.DataFrame) → pandas.DataFrame[source]#

info()[source]#

fit(df: pandas.DataFrame)[source]#

is_fitted()[source]#

fit_apply(df: pandas.DataFrame) → pandas.DataFrame[source]#

to_feature_generator(categorical_feature_names: Optional[Union[Sequence[str], str]] = None, normalisation_rules: Sequence[Rule] = (), normalisation_rule_template: Optional[RuleTemplate] = None, add_categorical_default_rules=True)[source]#

chain(*others: DataFrameTransformer) → DataFrameTransformerChain[source]#

get_column_change_tracker() → DataFrameColumnChangeTracker[source]#

class DFTFromFeatureGenerator(fgen: FeatureGenerator, append: bool = False, copy: bool = True)[source]#

Bases: DataFrameTransformer

Transforms a feature generator into a data frame transformer, which returns either the features data frame or the original data frame extended with the features data frame.

Parameters:

fgen – the feature generator from which to generate the features
append – whether to append the columns generated by the feature generator to the input data frame; if False, the transformed data frame will consist only of the generated features data frame
copy – whether, for the case where append=True, the returned data frame shall copy the data of the input data frame (rather than reuse the data)
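The effect of `append` and `copy` can be illustrated with plain pandas. This is a sketch under assumptions, not the library's implementation: `generate_features` is a hypothetical stand-in for a feature generator, `transform` is a hypothetical helper, and "reuse the data" is interpreted here as adding the generated columns directly to the input data frame.

```python
import pandas as pd


def generate_features(df: pd.DataFrame) -> pd.DataFrame:
    # Hypothetical stand-in for a feature generator: derives one new column.
    return pd.DataFrame({"area": df["w"] * df["h"]}, index=df.index)


def transform(df: pd.DataFrame, append: bool = False, copy: bool = True) -> pd.DataFrame:
    features = generate_features(df)
    if not append:
        # Only the generated features are returned.
        return features
    # With append=True, extend either a copy of the input or the input itself
    # (the latter is an assumed reading of "reuse the data").
    target = df.copy() if copy else df
    for col in features.columns:
        target[col] = features[col]
    return target


df = pd.DataFrame({"w": [1, 2], "h": [3, 4]})
only_features = transform(df)          # columns: area
extended = transform(df, append=True)  # columns: w, h, area; df itself is unchanged
```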

is_fitted()[source]#

class InvertibleDataFrameTransformer[source]#

Bases: DataFrameTransformer, ABC

abstract apply_inverse(df: pandas.DataFrame) → pandas.DataFrame[source]#

get_inverse() → InverseDataFrameTransformer[source]#

Returns: a transformer whose (forward) transformation is the inverse transformation of this DFT
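The relationship between `apply_inverse` and `get_inverse` can be sketched as follows. Both classes are hypothetical stand-ins that only mirror the documented method names; the scaling transformation is chosen purely for illustration.

```python
import pandas as pd


class ScaleSketch:
    """Hypothetical invertible transformer: multiplies all values by a factor."""

    def __init__(self, factor: float):
        self.factor = factor

    def apply(self, df: pd.DataFrame) -> pd.DataFrame:
        return df * self.factor

    def apply_inverse(self, df: pd.DataFrame) -> pd.DataFrame:
        return df / self.factor

    def get_inverse(self) -> "InverseSketch":
        return InverseSketch(self)


class InverseSketch:
    """Hypothetical wrapper whose forward transformation is the wrapped inverse."""

    def __init__(self, invertible: ScaleSketch):
        self.invertible = invertible

    def apply(self, df: pd.DataFrame) -> pd.DataFrame:
        return self.invertible.apply_inverse(df)


scale = ScaleSketch(2.0)
original = pd.DataFrame({"x": [1.0, 2.0, 3.0]})
scaled = scale.apply(original)
restored = scale.get_inverse().apply(scaled)  # same values as original
```

Because the wrapper needs no fitting of its own, it fits naturally into the rule-based category, which matches `InverseDataFrameTransformer` deriving from `RuleBasedDataFrameTransformer` below.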

class RuleBasedDataFrameTransformer[source]#

Bases: DataFrameTransformer, ABC

Base class for transformers whose logic is entirely based on rules and which therefore do not need to be fitted to data.

fit(df: pandas.DataFrame)[source]#

is_fitted()[source]#

class InverseDataFrameTransformer(invertible_dft: InvertibleDataFrameTransformer)[source]#

Bases: RuleBasedDataFrameTransformer

class DataFrameTransformerChain(*data_frame_transformers: Union[DataFrameTransformer, List[DataFrameTransformer]])[source]#

Bases: DataFrameTransformer

Supports the application of a chain of data frame transformers. During both fit and apply, each transformer in the chain receives the transformed output of its predecessor.
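The chaining behaviour can be made concrete with a small sketch. All three classes are hypothetical stand-ins, not sensAI classes; the point is that the second step is fitted on the output of the first step rather than on the raw input.

```python
import pandas as pd


class TimesTenSketch:
    """Hypothetical rule-based step: needs no fitting."""

    def fit(self, df: pd.DataFrame):
        pass

    def apply(self, df: pd.DataFrame) -> pd.DataFrame:
        return df * 10


class CenterSketch:
    """Hypothetical fitted step: subtracts the column means learned in fit."""

    def fit(self, df: pd.DataFrame):
        self.means = df.mean()

    def apply(self, df: pd.DataFrame) -> pd.DataFrame:
        return df - self.means


class ChainSketch:
    def __init__(self, *transformers):
        self.transformers = list(transformers)

    def fit(self, df: pd.DataFrame):
        # Each transformer is fitted on the output of its predecessor.
        for t in self.transformers:
            t.fit(df)
            df = t.apply(df)

    def apply(self, df: pd.DataFrame) -> pd.DataFrame:
        # Each transformer receives the transformed output of its predecessor.
        for t in self.transformers:
            df = t.apply(df)
        return df


data = pd.DataFrame({"x": [1.0, 2.0, 3.0]})
chain = ChainSketch(TimesTenSketch(), CenterSketch())
chain.fit(data)
result = chain.apply(data)
```

In this sketch the centering step learns a mean of 20 (from the scaled values 10, 20, 30), not 2, which is what fitting on the predecessor's output implies.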

is_fitted()[source]#

get_names() → List[str][source]#

Returns: the list of names of all contained data frame transformers

info()[source]#

find_first_transformer_by_type(cls) → Optional[DataFrameTransformer][source]#

append(t: DataFrameTransformer)[source]#

class DFTRenameColumns(columns_map: Dict[str, str])[source]#
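The class carries no description here. Assuming, from its name and signature, that `columns_map` maps existing column names to new ones, the effect would correspond to the following plain pandas operation. The column names are made up for illustration, and whether the class returns a new data frame or modifies the input is not stated in this section.

```python
import pandas as pd

# Assumed semantics: keys are old column names, values are the new names.
columns_map = {"temp_c": "temperature", "ts": "timestamp"}

df = pd.DataFrame({"temp_c": [21.5], "ts": ["2024-01-01"], "site": ["A"]})

# Columns not mentioned in the map keep their names.
renamed = df.rename(columns=columns_map)
```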