Flexible feature engineering using a network flow approach

dc.contributor.advisorBlignaut, Renette
dc.contributor.authorBodenstein, Gerhardus
dc.date.accessioned2025-05-29T13:53:11Z
dc.date.available2025-05-29T13:53:11Z
dc.date.issued2024
dc.description.abstractFeature engineering, a critical part of the data preparation and exploration phase in predictive modelling, involves transforming predictor variables to enhance interpretability and better understand their relationship with the response variable. In some cases, it also offers automatic handling of outliers and missing values. Many machine learning and data mining techniques perform better with discretised continuous variables or clustered levels of categorical variables, making feature engineering essential for improving the accuracy and robustness of predictive models. Furthermore, the feature engineering process often needs to incorporate business, operational, or best-practice constraints applicable to the final transformed predictor variables or newly created features. This thesis addresses two significant challenges in feature engineering. The first is the supervised discretisation of continuous predictors, which involves partitioning a predictor's domain into disjoint intervals while preserving a specified trend in the relationship with the response variable and adhering to side constraints.
dc.identifier.urihttps://hdl.handle.net/10566/20466
dc.language.isoen
dc.publisherUniversty of the Western Cape
dc.subjectSupervised discretisation
dc.subjectContinuous predictors
dc.subjectCategorical predictors
dc.subjectScorecards
dc.subjectPredictive modelling
dc.titleFlexible feature engineering using a network flow approach
dc.typeThesis

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
bodenstein_nsc_phd_2024.pdf
Size:
18.46 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: