Creates instance that will split data into training and test set filtering out any labels that don't meet the minimum fraction cutoff or fall in the top N labels specified
Creates instance that will split data into training and test set filtering out any labels that don't meet the minimum fraction cutoff or fall in the top N labels specified
set for the random split
fraction of the data used for test
maximum number of label categories to include
minimum fraction of total labels that a category must have to be included
data splitter