Package

com.salesforce.op.utils

text

Permalink

package text

Visibility
  1. Public
  2. All

Type Members

  1. sealed abstract class Language extends EnumEntry with Serializable

    Permalink

    Language enum

  2. trait LanguageDetector extends Serializable

    Permalink

    Language detector trait

  3. class LuceneTextAnalyzer extends TextAnalyzer

    Permalink

    Text analyzer implementation using a Lucene analyzer

  4. trait NameEntityTagger[Result <: TaggerResult] extends Serializable

    Permalink

    Interface for Name Entity Recognition tagger

    Interface for Name Entity Recognition tagger

    Result

    result of the NameEntityTagger.tag function call

  5. sealed trait NameEntityType extends EnumEntry with Serializable

    Permalink

    Name Entity Recognition entity type

  6. class OpenNLPAnalyzer extends TextAnalyzer

    Permalink

    OpenNLP text analyzer to apply when applying Open NLP TokenNameFinderModel

  7. class OpenNLPNameEntityTagger extends NameEntityTagger[OpenNLPTagResult]

    Permalink

    OpenNLP implementation of NameEntityTagger

  8. class OpenNLPSentenceSplitter extends SentenceSplitter

    Permalink

    Implementation of SentenceSplitter using Open NLP sentence splitter

  9. case class OpenNLPTagResult(tokenTags: Map[String, Set[NameEntityType]]) extends TaggerResult with Product with Serializable

    Permalink

    OpenNLP implementation of TaggerResult

    OpenNLP implementation of TaggerResult

    tokenTags

    token tags map, where keys are token and values are entities matching each token

  10. class OptimaizeLanguageDetector extends LanguageDetector

    Permalink

    Language detector implementation using Optimaize/language-detector library https://github.com/optimaize/language-detector

  11. trait SentenceSplitter extends Serializable

    Permalink

    Interface for Sentence Splitter that allows detecting and splitting text into separate sentences.

  12. trait TaggerResult extends Serializable

    Permalink

    Result of NameEntityTagger.tag function call

  13. trait TextAnalyzer extends Serializable

    Permalink

    Text analyzer trait

Value Members

  1. object Language extends Enum[Language] with Serializable

    Permalink

    Language enum with entryName is ISO 639-1 or 639-3 language code, eg "fr" or "gsw"

  2. object LuceneTextAnalyzer extends Serializable

    Permalink

    Creates a Lucene Analyzer for a specific language or falls back to StandardAnalyzer

  3. object NameEntityType extends Enum[NameEntityType] with Serializable

    Permalink

    Name Entity Recognition entity type

  4. object OpenNLPModels

    Permalink

    A factory to get/create OpenNLP models

  5. object TextUtils extends Product with Serializable

    Permalink

Ungrouped