Class/Object

com.salesforce.op.filters

FeatureDistribution

Related Docs: object FeatureDistribution | package filters

Permalink

case class FeatureDistribution(name: String, key: Option[String], count: Long, nulls: Long, distribution: Array[Double], summaryInfo: Array[Double], moments: Option[Moments] = None, cardEstimate: Option[TextStats] = None, type: FeatureDistributionType = FeatureDistributionType.Training) extends FeatureDistributionLike with Product with Serializable

Class containing summary information for a feature

name

name of the feature

key

map key associated with distribution (when the feature is a map)

count

total count of feature seen

nulls

number of empties seen in feature

distribution

binned counts of feature values (hashed for strings, evenly spaced bins for numerics)

summaryInfo

either min and max number of tokens for text data, or splits used for bins for numeric data

Linear Supertypes
Serializable, Serializable, Product, Equals, FeatureDistributionLike, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. FeatureDistribution
  2. Serializable
  3. Serializable
  4. Product
  5. Equals
  6. FeatureDistributionLike
  7. AnyRef
  8. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new FeatureDistribution(name: String, key: Option[String], count: Long, nulls: Long, distribution: Array[Double], summaryInfo: Array[Double], moments: Option[Moments] = None, cardEstimate: Option[TextStats] = None, type: FeatureDistributionType = FeatureDistributionType.Training)

    Permalink

    name

    name of the feature

    key

    map key associated with distribution (when the feature is a map)

    count

    total count of feature seen

    nulls

    number of empties seen in feature

    distribution

    binned counts of feature values (hashed for strings, evenly spaced bins for numerics)

    summaryInfo

    either min and max number of tokens for text data, or splits used for bins for numeric data

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. val cardEstimate: Option[TextStats]

    Permalink
  6. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. val count: Long

    Permalink

    total count of feature seen

    total count of feature seen

    Definition Classes
    FeatureDistributionFeatureDistributionLike
  8. val distribution: Array[Double]

    Permalink

    binned counts of feature values (hashed for strings, evenly spaced bins for numerics)

    binned counts of feature values (hashed for strings, evenly spaced bins for numerics)

    Definition Classes
    FeatureDistributionFeatureDistributionLike
  9. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  10. def equals(that: Any): Boolean

    Permalink
    Definition Classes
    FeatureDistribution → Equals → AnyRef → Any
  11. def featureKey: FeatureKey

    Permalink

    Get feature key associated to this distribution

  12. def fillRate(): Double

    Permalink

    Get fill rate of feature

    Get fill rate of feature

    returns

    fraction of data that is non empty

  13. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  14. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  15. def hashCode(): Int

    Permalink
    Definition Classes
    FeatureDistribution → AnyRef → Any
  16. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  17. def jsDivergence(fd: FeatureDistribution): Double

    Permalink

    Jensen-Shannon divergence from this distribution to the other distribution fed in

    Jensen-Shannon divergence from this distribution to the other distribution fed in

    fd

    other feature distribution

    returns

    the KL divergence

  18. val key: Option[String]

    Permalink

    map key associated with distribution (when the feature is a map)

    map key associated with distribution (when the feature is a map)

    Definition Classes
    FeatureDistributionFeatureDistributionLike
  19. val moments: Option[Moments]

    Permalink
  20. val name: String

    Permalink

    name of the feature

    name of the feature

    Definition Classes
    FeatureDistributionFeatureDistributionLike
  21. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  22. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  23. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  24. val nulls: Long

    Permalink

    number of empties seen in feature

    number of empties seen in feature

    Definition Classes
    FeatureDistributionFeatureDistributionLike
  25. def reduce(fd: FeatureDistribution): FeatureDistribution

    Permalink

    Combine feature distributions

    Combine feature distributions

    fd

    other feature distribution (from the same feature)

    returns

    summed distribution information

  26. def relativeFillRate(fd: FeatureDistribution): Double

    Permalink

    Absolute difference in empty rates

    Absolute difference in empty rates

    fd

    feature distribution to compare to

    returns

    absolute difference of rates

  27. def relativeFillRatio(fd: FeatureDistribution): Double

    Permalink

    Ratio of fill rates between the two distributions symetric with larger value on the top

    Ratio of fill rates between the two distributions symetric with larger value on the top

    fd

    feature distribution to compare to

    returns

    ratio of fill rates

  28. val summaryInfo: Array[Double]

    Permalink

    either min and max number of tokens for text data, or splits used for bins for numeric data

    either min and max number of tokens for text data, or splits used for bins for numeric data

    Definition Classes
    FeatureDistributionFeatureDistributionLike
  29. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  30. def toString(): String

    Permalink
    Definition Classes
    FeatureDistribution → AnyRef → Any
  31. val type: FeatureDistributionType

    Permalink

    feature distribution type: training or scoring

    feature distribution type: training or scoring

    Definition Classes
    FeatureDistributionFeatureDistributionLike
  32. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  33. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  34. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from Serializable

Inherited from Serializable

Inherited from Product

Inherited from Equals

Inherited from FeatureDistributionLike

Inherited from AnyRef

Inherited from Any

Ungrouped