name of the feature
map key associated with distribution (when the feature is a map)
total count of feature seen
number of empties seen in feature
binned counts of feature values (hashed for strings, evenly spaced bins for numerics)
either min and max number of tokens for text data, or splits used for bins for numeric data
total count of feature seen
binned counts of feature values (hashed for strings, evenly spaced bins for numerics)
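The binning described above can be sketched as follows. This is only an illustration: the function names `bin_text` and `bin_numeric`, the use of CRC32 as the hash, and the clamping of out-of-range values into the edge bins are all assumptions, not the documented implementation.

```python
import zlib


def bin_text(values, num_bins):
    """Hash each non-empty string into one of num_bins buckets and count the hits.

    CRC32 is an assumed stand-in for whatever hash the real class uses.
    """
    counts = [0] * num_bins
    for v in values:
        if v:  # skip None and empty strings; those are tracked as empties instead
            counts[zlib.crc32(v.encode("utf-8")) % num_bins] += 1
    return counts


def bin_numeric(values, lo, hi, num_bins):
    """Count numeric values into num_bins evenly spaced bins over [lo, hi]."""
    counts = [0] * num_bins
    width = (hi - lo) / num_bins
    for v in values:
        if v is None:
            continue
        idx = int((v - lo) / width) if width > 0 else 0
        # Clamp so that v == hi lands in the last bin and outliers go to an edge bin.
        counts[min(max(idx, 0), num_bins - 1)] += 1
    return counts
```

In this sketch, empty values never contribute to a bin, so the bin counts sum to the number of non-empty values.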
Get feature key associated with this distribution
Get fill rate of feature
fraction of data that is non empty
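A minimal sketch of the fill rate, assuming it is derived from the total count and the number of empties described earlier. The function name `fill_rate` and the choice to return 0.0 when nothing was seen are assumptions.

```python
def fill_rate(count, nulls):
    """Fraction of observed values that are non-empty.

    Returns 0.0 when count is 0 (an assumed convention for "nothing seen").
    """
    if count == 0:
        return 0.0
    return (count - nulls) / count
```

For example, 10 observations with 2 empties give a fill rate of 8 / 10.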
Jensen-Shannon divergence between this distribution and the other distribution passed in
other feature distribution
the Jensen-Shannon divergence
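For reference, the Jensen-Shannon divergence of two binned distributions can be computed as below. This is a sketch of the standard definition, not the documented code path: the name `js_divergence`, the base-2 logarithm (which bounds the result to [0, 1]), and the error raised for empty inputs are assumptions.

```python
import math


def js_divergence(p_counts, q_counts):
    """Jensen-Shannon divergence (base 2) between two binned count vectors."""
    if len(p_counts) != len(q_counts):
        raise ValueError("distributions must have the same number of bins")
    p_total, q_total = sum(p_counts), sum(q_counts)
    if p_total == 0 or q_total == 0:
        raise ValueError("cannot normalize an empty distribution")

    # Normalize counts to probabilities and build the midpoint distribution.
    p = [c / p_total for c in p_counts]
    q = [c / q_total for c in q_counts]
    m = [(x + y) / 2 for x, y in zip(p, q)]

    def kl(a, b):
        # Terms with a_i == 0 contribute nothing; b_i > 0 wherever a_i > 0.
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)
```

Unlike the KL divergence it is built from, this quantity is symmetric in its two arguments and stays finite when the distributions do not overlap.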
map key associated with distribution (when the feature is a map)
name of the feature
number of empties seen in feature
Combine feature distributions
other feature distribution (from the same feature)
summed distribution information
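Combining two distributions of the same feature amounts to summing their counts, empties, and bins element by element. The sketch below assumes a simplified record, `Dist`, and a function named `combine`; both are hypothetical, and the summary information field is left out for brevity.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Dist:
    """Hypothetical, simplified stand-in for a feature distribution."""
    name: str
    count: int
    nulls: int
    distribution: List[float]


def combine(a: Dist, b: Dist) -> Dist:
    """Sum two distributions that describe the same feature."""
    if a.name != b.name:
        raise ValueError("can only combine distributions of the same feature")
    if len(a.distribution) != len(b.distribution):
        raise ValueError("distributions must use the same bins")
    return Dist(
        name=a.name,
        count=a.count + b.count,
        nulls=a.nulls + b.nulls,
        distribution=[x + y for x, y in zip(a.distribution, b.distribution)],
    )
```

Because the operation is an element-wise sum, it is associative and commutative, which makes it suitable for merging partial results computed on separate partitions of the data.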
Absolute difference in empty rates
feature distribution to compare to
absolute difference of rates
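A small sketch of this comparison, assuming the empty rate is the number of empties divided by the total count. The names `empty_rate` and `empty_rate_difference` are hypothetical, as is treating a distribution with no observations as fully empty.

```python
def empty_rate(count, nulls):
    """Fraction of observed values that are empty.

    A distribution with no observations is treated as fully empty (assumption).
    """
    if count == 0:
        return 1.0
    return nulls / count


def empty_rate_difference(count_a, nulls_a, count_b, nulls_b):
    """Absolute difference between the empty rates of two distributions."""
    return abs(empty_rate(count_a, nulls_a) - empty_rate(count_b, nulls_b))
```

Since the fill rate is one minus the empty rate, the absolute difference in empty rates equals the absolute difference in fill rates.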
Ratio of fill rates between the two distributions, made symmetric by always placing the larger value in the numerator
feature distribution to compare to
ratio of fill rates
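The symmetric ratio can be sketched as below. The function name `fill_ratio` is hypothetical, and returning infinity when the smaller fill rate is zero is an assumed convention.

```python
def fill_ratio(fill_a, fill_b):
    """Ratio of two fill rates with the larger value in the numerator.

    The result is always >= 1.0 and does not depend on argument order.
    """
    larger, smaller = max(fill_a, fill_b), min(fill_a, fill_b)
    if smaller == 0.0:
        return float("inf")  # assumed convention for a zero denominator
    return larger / smaller
```

Putting the larger value on top means a single upper threshold can flag a feature whose fill rate changed substantially in either direction.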
either min and max number of tokens for text data, or splits used for bins for numeric data
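To make the two shapes of this summary concrete, here is a hedged sketch. The names `text_summary` and `numeric_splits`, whitespace tokenization, and the inclusion of both end points in the splits are assumptions made for illustration.

```python
def text_summary(values):
    """Return [min_tokens, max_tokens] over the non-missing text values.

    Tokens are split on whitespace here, which is an assumed tokenizer.
    """
    token_counts = [len(v.split()) for v in values if v is not None]
    if not token_counts:
        return [0, 0]
    return [min(token_counts), max(token_counts)]


def numeric_splits(lo, hi, num_bins):
    """Return the num_bins + 1 evenly spaced bin edges covering [lo, hi]."""
    width = (hi - lo) / num_bins
    return [lo + i * width for i in range(num_bins + 1)]
```

Keeping the splits alongside the binned counts lets a second data set be binned with the same edges, so the two sets of counts are directly comparable.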
feature distribution type: training or scoring
Class containing summary information for a feature