Function to set parameters before passing into the validation step eg - do data balancing or dropping based on the labels
Parameters set in examining data
Maximum size of dataset want to train on.
Fraction of data to reserve for test Default is 0.1
Seed for data splitting
Function to use to create the training set and test set.
Function to use to prepare the dataset for modeling within the validation step eg - do data balancing or dropping based on the labels
Training set test set
Add a splitter parameter to name the label column