Function to use to prepare the dataset for modeling, e.g., to balance the data or drop examples based on their labels.
Training set and test set.
Fraction of the data to reserve for the test set. Default is 0.1.
Random seed for data splitting.
Function to use to create the training set and test set.
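
The descriptions above can be illustrated with a minimal sketch of a preparation function and a splitting function. This is not the library's implementation: the names `drop_and_balance`, `split_train_test`, `drop_labels`, `test_fraction`, and `seed` are hypothetical, and the example assumes the dataset is a list of `(features, label)` pairs. Balancing is done here by downsampling every class to the size of the smallest one, which is only one of several possible strategies.

```python
import random
from collections import defaultdict


def drop_and_balance(examples, drop_labels=(), seed=0):
    """Hypothetical preparation function: drop examples with unwanted labels,
    then downsample every remaining class to the size of the smallest class."""
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for features, label in examples:
        if label not in drop_labels:
            by_label[label].append((features, label))
    if not by_label:
        return []
    smallest = min(len(group) for group in by_label.values())
    balanced = []
    for label in sorted(by_label):  # assumes labels are sortable
        balanced.extend(rng.sample(by_label[label], smallest))
    return balanced


def split_train_test(examples, test_fraction=0.1, seed=0):
    """Hypothetical splitting function: shuffle with a fixed seed and reserve
    `test_fraction` of the examples for the test set."""
    if not 0.0 <= test_fraction <= 1.0:
        raise ValueError("test_fraction must be between 0 and 1")
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    # Returns (training set, test set).
    return shuffled[n_test:], shuffled[:n_test]


# Toy dataset: 60 "a" examples, 40 "b" examples, 10 "unknown" examples.
data = (
    [(i, "a") for i in range(60)]
    + [(i, "b") for i in range(40)]
    + [(i, "unknown") for i in range(10)]
)
prepared = drop_and_balance(data, drop_labels=("unknown",), seed=42)
train_set, test_set = split_train_test(prepared, test_fraction=0.1, seed=42)
```

Passing the same seed reproduces the same split, which is the usual reason for exposing a seed parameter alongside the test fraction.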