package com.salesforce.op.utils.text

Type Members

  1. sealed abstract class Language extends EnumEntry with Serializable

    Language enum

  2. trait LanguageDetector extends Serializable

    Language detector trait

  3. class LuceneHtmlStripTextAnalyzer extends LuceneTextAnalyzer

    Text analyzer implementation using a Lucene analyzer with HTML stripping applied

  4. class LuceneRegexTextAnalyzer extends LuceneTextAnalyzer

    Text analyzer implementation using a Lucene analyzer with Pattern Tokenizer matching

  5. class LuceneTextAnalyzer extends TextAnalyzer

    Text analyzer implementation using a Lucene analyzer

  6. trait NameEntityTagger[Result <: TaggerResult] extends Serializable

    Interface for a Named Entity Recognition tagger

    Result: the result type of the NameEntityTagger.tag function call

  7. sealed trait NameEntityType extends EnumEntry with Serializable

    Named Entity Recognition entity type

  8. class OpenNLPAnalyzer extends TextAnalyzer

    OpenNLP text analyzer, for use when applying an OpenNLP TokenNameFinderModel

  9. class OpenNLPNameEntityTagger extends NameEntityTagger[OpenNLPTagResult]

    OpenNLP implementation of NameEntityTagger

  10. class OpenNLPSentenceSplitter extends SentenceSplitter

    Implementation of SentenceSplitter using the OpenNLP sentence splitter

  11. case class OpenNLPTagResult(tokenTags: Map[String, Set[NameEntityType]]) extends TaggerResult with Product with Serializable

    OpenNLP implementation of TaggerResult

    tokenTags: map of token tags, where keys are tokens and values are the entity types matching each token

  12. class OptimaizeLanguageDetector extends LanguageDetector

    Language detector implementation using the Optimaize language-detector library (https://github.com/optimaize/language-detector)

  13. trait SentenceSplitter extends Serializable

    Interface for a sentence splitter that detects sentence boundaries and splits text into separate sentences.

  14. trait TaggerResult extends Serializable

    Result of NameEntityTagger.tag function call

  15. trait TextAnalyzer extends Serializable

    Text analyzer trait
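
The relationship between NameEntityTagger, TaggerResult and OpenNLPTagResult listed above can be illustrated with a minimal, self-contained Scala sketch. The types below are local stand-ins that only mirror the shapes shown in the listing (a result type bounded by TaggerResult, and a map from token to entity types). EntityType, SimpleTagResult, Tagger and DictionaryTagger are hypothetical names, and the single-argument tag signature is an assumption made for illustration, not the library's API.

```scala
object TaggerSketch {
  // Stand-in for NameEntityType (hypothetical, not the library's enum)
  sealed trait EntityType
  case object Person extends EntityType
  case object Location extends EntityType

  // Marker trait for tagger results, mirroring TaggerResult in the listing
  trait TaggerResult extends Serializable

  // Same shape as OpenNLPTagResult: token -> entity types matching that token
  final case class SimpleTagResult(tokenTags: Map[String, Set[EntityType]])
    extends TaggerResult

  // The type parameter is bounded by TaggerResult, as in
  // NameEntityTagger[Result <: TaggerResult]. The `tag` signature is assumed.
  trait Tagger[Result <: TaggerResult] extends Serializable {
    def tag(tokens: Seq[String]): Result
  }

  // Toy dictionary lookup; a real tagger would rely on a trained model
  class DictionaryTagger(dictionary: Map[String, Set[EntityType]])
    extends Tagger[SimpleTagResult] {
    def tag(tokens: Seq[String]): SimpleTagResult =
      // Keep only tokens that have at least one known entity type
      SimpleTagResult(tokens.flatMap(t => dictionary.get(t).map(t -> _)).toMap)
  }

  val tagger = new DictionaryTagger(Map(
    "Alice" -> Set[EntityType](Person),
    "Paris" -> Set[EntityType](Location)
  ))

  val result: SimpleTagResult = tagger.tag(Seq("Alice", "visited", "Paris"))
}
```

In this sketch, TaggerSketch.result.tokenTags contains entries for "Alice" and "Paris" only; "visited" has no dictionary entry and is omitted. Bounding the type parameter lets each tagger implementation return its own concrete result type while callers can still rely on the common TaggerResult supertype.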

Value Members

  1. object Language extends Enum[Language] with Serializable

    Language enum whose entryName is an ISO 639-1 or 639-3 language code, e.g. "fr" or "gsw"

  2. object LuceneTextAnalyzer extends Serializable

    Creates a Lucene Analyzer for a specific language or falls back to StandardAnalyzer

  3. object NameEntityType extends Enum[NameEntityType] with Serializable

    Named Entity Recognition entity type

  4. object OpenNLPModels

    A factory to get/create OpenNLP models

  5. object TextUtils extends Product with Serializable
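
The Language and LuceneTextAnalyzer entries above describe two patterns: an enum whose entry name is an ISO language code, and a per-language lookup that falls back to a default. The sketch below imitates both in plain Scala with no external dependencies. Lang, withNameOption, Tokenizer and tokenizerFor are hypothetical stand-ins chosen for illustration; the EnumEntry types and Lucene analyzers named in the listing are not reproduced here, and their actual APIs may differ.

```scala
object LanguageSketch {
  // Stand-in for an enum entry that carries its ISO code as entryName
  sealed abstract class Lang(val entryName: String) extends Serializable
  case object English extends Lang("en")      // ISO 639-1
  case object French extends Lang("fr")       // ISO 639-1
  case object SwissGerman extends Lang("gsw") // ISO 639-3

  val values: Seq[Lang] = Seq(English, French, SwissGerman)

  // Look up an entry by its ISO code; None if the code is not known
  def withNameOption(code: String): Option[Lang] =
    values.find(_.entryName == code)

  // Hypothetical tokenizers standing in for language-specific analyzers
  type Tokenizer = String => Seq[String]

  // Default: lowercase and split on runs of non-word characters
  val standard: Tokenizer =
    _.toLowerCase.split("\\W+").filter(_.nonEmpty).toSeq

  // French variant: the default plus a tiny illustrative stop-word list
  private val frenchStopWords = Set("le", "la", "les")
  val french: Tokenizer = s => standard(s).filterNot(frenchStopWords)

  private val byLanguage: Map[Lang, Tokenizer] = Map(French -> french)

  // Use the language-specific tokenizer if one exists, else the default
  def tokenizerFor(lang: Lang): Tokenizer =
    byLanguage.getOrElse(lang, standard)
}
```

For example, LanguageSketch.withNameOption("gsw") yields Some(SwissGerman), and tokenizerFor(SwissGerman) returns the default tokenizer because no Swiss German entry is registered in the lookup map.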