com.salesforce.op.stages.impl.tuning
Maximum size of the dataset to train on. Value must be > 0. Default is 1000000.
Function to set the down-sampling fraction and parameters before the data is passed into the validation step.
Parameters set while examining the data
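The relationship between the maximum training size and the down-sampling fraction can be sketched as follows. This is a minimal illustration, not the library's implementation: the object and method names are hypothetical, and the formula (cap the fraction at 1.0, otherwise keep roughly the maximum number of rows) is an assumption based on the descriptions above.

```scala
object DownSampleFractionSketch {
  // Default taken from the documented "Default is 1000000".
  val MaxTrainingSampleDefault: Int = 1000000

  /** Hypothetical helper: fraction of rows to keep so that about
    * `maxTrainingSample` rows remain. Never exceeds 1.0.
    */
  def downSampleFraction(
    dataSize: Long,
    maxTrainingSample: Int = MaxTrainingSampleDefault
  ): Double = {
    require(maxTrainingSample > 0, "maxTrainingSample must be > 0")
    require(dataSize >= 0L, "dataSize must be non-negative")
    // An empty dataset needs no down-sampling.
    if (dataSize == 0L) 1.0
    else math.min(1.0, maxTrainingSample.toDouble / dataSize)
  }
}
```

Under this assumption, a dataset of 4,000,000 rows with the default maximum gives a fraction of 0.25, while any dataset at or below the maximum is kept whole.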
Fraction of data to reserve for testing. Default is 0.1.
Seed for data splitting
Function that creates the training set and the test set.
(dataTrain, dataTest)
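A seeded split that reserves a fraction of the data for testing might look like the sketch below. It works on an in-memory `Seq` rather than a Spark dataset so that it is self-contained; `SplitSketch` and `split` are hypothetical names, and the shuffle-then-cut strategy is an assumption chosen for clarity, not necessarily how the library splits.

```scala
import scala.util.Random

object SplitSketch {
  /** Hypothetical helper: returns (dataTrain, dataTest).
    * The same seed always yields the same split.
    */
  def split[A](
    data: Seq[A],
    reserveTestFraction: Double = 0.1, // documented default
    seed: Long
  ): (Seq[A], Seq[A]) = {
    require(
      reserveTestFraction >= 0.0 && reserveTestFraction < 1.0,
      "reserveTestFraction must be in [0, 1)"
    )
    // Shuffle deterministically, then cut off the test portion.
    val shuffled = new Random(seed).shuffle(data)
    val testSize = math.round(data.size * reserveTestFraction).toInt
    val (dataTest, dataTrain) = shuffled.splitAt(testSize)
    (dataTrain, dataTest)
  }
}
```

Fixing the seed makes the holdout set reproducible across runs, which is the usual reason to expose it as a parameter.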
Rebalance the training data within the validation step
Data to prepare for model training. The first column must be the label as a double.
balanced training set and a test set
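For a regression splitter, preparing the training data plausibly amounts to down-sampling it by a previously computed fraction. The sketch below illustrates that idea on an in-memory `Seq`; `DownSampleSketch` and `downSample` are hypothetical names, and per-row Bernoulli sampling is an assumption, not a statement of the library's behaviour.

```scala
import scala.util.Random

object DownSampleSketch {
  /** Hypothetical helper: keeps each row independently with
    * probability `fraction`, using a seeded generator.
    */
  def downSample[A](rows: Seq[A], fraction: Double, seed: Long): Seq[A] = {
    require(fraction > 0.0 && fraction <= 1.0, "fraction must be in (0, 1]")
    if (fraction == 1.0) rows // nothing to drop
    else {
      val rng = new Random(seed)
      rows.filter(_ => rng.nextDouble() < fraction)
    }
  }
}
```

Because sampling is per row, the resulting size is only approximately `fraction * rows.size`; the seed makes the outcome repeatable.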
Add a splitter parameter to name the label column
Instance that splits the data into training and holdout sets for regression problems