This workflow compares the performances of three different setups for a classification model that is used to detect fraud in credit card data.
Scenario 1: A classification model is trained on imbalanced data
Scenario 2: A classification model is trained on resampled, balanced data.
Scenario 3: A classification model is trained on resampled, balanced data, and the predicted class probabilities are adjusted according to the class distribution in the original data
Performance is evaluated in terms of cost reduction compared to not using any model.
Workflow
03_Adjusting_Class_Probabilities_after_Resampling
External resources
Used extensions & nodes
Created with KNIME Analytics Platform version 5.1.0
Legal
By using or downloading the workflow, you agree to our terms and conditions.