Default model types and model parameters for problem type
Creates a new Regression Model Selector with Cross Validation
Create the model selector for specified interface
train validation split or cross validator
data preparation class
evaluators to apply to the data
list of models to use
sequence of models and parameters to explore
default model types and model parameters for problem type
model selector with these settings
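
The "sequence of models and parameters to explore" can be pictured as a flat list of candidates, one per model and parameter combination, from which the best one is picked by a validation metric. The following is a minimal, library-free sketch of that idea only. `Candidate`, `expand`, and `selectBest` are hypothetical names introduced for illustration and are not part of the Regression Model Selector API; model names and parameters are plain strings and maps rather than real estimators and `ParamMap`s.

```scala
object GridSelectionSketch {
  // Hypothetical stand-in for one (model, hyperparameters) combination.
  final case class Candidate(modelName: String, params: Map[String, Double])

  /** Flatten (model, parameter grid) pairs into one candidate per combination. */
  def expand(modelsAndParameters: Seq[(String, Seq[Map[String, Double]])]): Seq[Candidate] =
    for {
      (name, grid) <- modelsAndParameters
      params <- grid
    } yield Candidate(name, params)

  /** Pick the candidate with the best score; for an error metric such as RMSE, smaller is better. */
  def selectBest(
      candidates: Seq[Candidate],
      score: Candidate => Double,
      largerIsBetter: Boolean
  ): Candidate = {
    require(candidates.nonEmpty, "at least one candidate is required")
    if (largerIsBetter) candidates.maxBy(score) else candidates.minBy(score)
  }
}
```

In a real selector the `score` function would come from fitting each candidate and evaluating it on validation data; here it is left as a parameter so the sketch stays self-contained.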
Creates a new Regression Model Selector with Cross Validation
instance that will split the data into training set and test set
number of folds for cross validation (>= 2)
metric name used in evaluation, e.g. RMSE, R2
List of evaluators applied to the training and holdout data for evaluation. Default is the standard OpRegressionEvaluator.
random seed
level of parallelism: the number of models that are scheduled to be trained and evaluated concurrently
list of model types to run grid search on; must be from the supported types in RegressionModelsToTry (OpLinearRegression, OpDecisionTreeRegressor, OpRandomForestRegressor, OpGBTRegressor, OpGeneralizedLinearRegression)
pass in an explicit list of pairs of estimators and their accompanying hyperparameters for model selection, Seq[(EstimatorType, Array[ParamMap])], where EstimatorType must be an Estimator that takes in a label (RealNN) and features (OPVector) and returns a prediction (Prediction)
maximum allowable time to wait for a model to finish running (default is 1 day)
Regression Model Selector with Cross Validation
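
To make the "number of folds" and "random seed" parameters concrete, here is a minimal sketch of k-fold splitting over row indices. It is an illustration under stated assumptions, not the selector's implementation: `KFoldSketch` and `kFold` are hypothetical names, the data is reduced to integer indices, and folds are assigned round-robin after a seeded shuffle.

```scala
import scala.util.Random

object KFoldSketch {
  /** Split the indices 0 until n into numFolds (train, validation) pairs. */
  def kFold(n: Int, numFolds: Int, seed: Long): Seq[(Seq[Int], Seq[Int])] = {
    require(numFolds >= 2, "numFolds must be >= 2")
    require(n >= numFolds, "need at least one row per fold")
    // The seed makes the shuffle, and therefore the folds, reproducible.
    val shuffled = new Random(seed).shuffle((0 until n).toVector)
    // Assign each shuffled index to a fold round-robin.
    val folds: Seq[Vector[Int]] = shuffled.zipWithIndex
      .groupBy { case (_, position) => position % numFolds }
      .toSeq
      .sortBy(_._1)
      .map { case (_, members) => members.map(_._1) }
    // Each fold serves once as validation data; the other folds form the training data.
    folds.indices.map { i =>
      val validation = folds(i)
      val train = folds.indices.filter(_ != i).flatMap(j => folds(j))
      (train, validation)
    }
  }
}
```

Every candidate model would be trained once per pair and its validation metric averaged across the folds, which is why the training cost grows with the number of folds.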
Creates a new Regression Model Selector with a Train Validation Split
instance that will split the data into training set and test set
fraction of the data used for training, with the remainder used for validation (>= 0 && <= 1)
metric name used in evaluation, e.g. RMSE, R2
List of evaluators applied to the training and holdout data for evaluation. Default is the standard OpRegressionEvaluator.
random seed
level of parallelism: the number of models that are scheduled to be trained and evaluated concurrently
list of model types to run grid search on; must be from the supported types in RegressionModelsToTry (OpLinearRegression, OpDecisionTreeRegressor, OpRandomForestRegressor, OpGBTRegressor, OpGeneralizedLinearRegression)
pass in an explicit list of pairs of estimators and their accompanying hyperparameters for model selection, Seq[(EstimatorType, Array[ParamMap])], where EstimatorType must be an Estimator that takes in a label (RealNN) and features (OPVector) and returns a prediction (Prediction)
maximum allowable time to wait for a model to finish running (default is 1 day)
Regression Model Selector with a Train Validation Split
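
The train ratio above can be read as the share of rows that go to training. The sketch below shows one way such a split could work on row indices. It is an assumption-laden illustration rather than the selector's implementation: `TrainValidationSplitSketch` and `split` are hypothetical names, and the cut is exact, whereas a distributed implementation may sample rows randomly and only approximate the ratio.

```scala
import scala.util.Random

object TrainValidationSplitSketch {
  /** Split the indices 0 until n into (train, validation) according to trainRatio. */
  def split(n: Int, trainRatio: Double, seed: Long): (Seq[Int], Seq[Int]) = {
    require(trainRatio >= 0.0 && trainRatio <= 1.0, "trainRatio must be in [0, 1]")
    // Seeded shuffle so the same seed always yields the same split.
    val shuffled = new Random(seed).shuffle((0 until n).toVector)
    // The first `cut` shuffled indices are used for training, the rest for validation.
    val cut = math.round(n * trainRatio).toInt
    shuffled.splitAt(cut)
  }
}
```

Unlike cross validation, each candidate model is trained only once here, so this variant is cheaper but gives a noisier estimate of the validation metric.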
A factory for Regression Model Selector