Minimum records needed for a label to be included
Set a threshold for the minimum number of records a label must have in your dataset to be included in model training.
Before you begin
Verify that your dataset includes label values suitable for classification.
Role required: ml_admin or admin
About this task
This advanced option helps to eliminate labels that occur infrequently. Setting this minimum threshold can reduce overfitting and unreliable predictions.
To configure this option, provide the minimum number of records. Any class value with fewer records than this minimum is dropped from the training set. Because dropped class values are never seen during training, they can't appear in predictions.
To add this option using a script:
config.setMinLabelCount(50);
In JSON format:
{
  "min_label_count": 50
}
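The effect of the threshold described above can be illustrated with a minimal sketch in plain JavaScript. This is not the platform's implementation: the function name filterByMinLabelCount, the record shape, and the category field are hypothetical and exist only for this example. The sketch counts the records for each label and keeps only the records whose label meets the minimum.

```javascript
// Illustrative only: mimics the documented effect of a minimum label count.
// filterByMinLabelCount and the record shape are hypothetical, not platform APIs.
function filterByMinLabelCount(records, labelField, minLabelCount) {
  // Count how many records carry each label value.
  const counts = new Map();
  for (const record of records) {
    const label = record[labelField];
    counts.set(label, (counts.get(label) || 0) + 1);
  }
  // Keep a record only if its label has at least minLabelCount records.
  return records.filter(
    (record) => counts.get(record[labelField]) >= minLabelCount
  );
}

// Sample data: 3 "network" records, 2 "hardware" records, 1 "printer" record.
const records = [
  { category: 'network' }, { category: 'network' }, { category: 'network' },
  { category: 'hardware' }, { category: 'hardware' },
  { category: 'printer' },
];

// With a minimum of 2, the single "printer" record is dropped.
const kept = filterByMinLabelCount(records, 'category', 2);
const keptLabels = [...new Set(kept.map((r) => r.category))];
console.log(keptLabels);
```

In this sample, a model trained on the kept records would have no "printer" examples, which is why a dropped label can never be predicted.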
Procedure
Result
The setting appears as a row on the Advanced Solution Setting tab on your solution's form.
What to do next
Retrain your solution.