Language enum
Language detector trait
Text analyzer implementation using a Lucene analyzer with HTML stripping applied
Text analyzer implementation using a Lucene analyzer with Pattern Tokenizer matching
Text analyzer implementation using a Lucene analyzer
Interface for Name Entity Recognition tagger
Name Entity Recognition entity type
OpenNLP text analyzer to apply when applying Open NLP TokenNameFinderModel
OpenNLP implementation of NameEntityTagger
Implementation of SentenceSplitter using Open NLP sentence splitter
OpenNLP implementation of TaggerResult
OpenNLP implementation of TaggerResult
token tags map, where keys are token and values are entities matching each token
Language detector implementation using Optimaize/language-detector library https://github.com/optimaize/language-detector
Interface for Sentence Splitter that allows detecting and splitting text into separate sentences.
Result of NameEntityTagger.tag function call
Text analyzer trait
Language enum with entryName is ISO 639-1 or 639-3 language code, eg "fr" or "gsw"
Creates a Lucene Analyzer for a specific language or falls back to StandardAnalyzer
Name Entity Recognition entity type
A factory to get/create OpenNLP models
Interface for Name Entity Recognition tagger
result of the NameEntityTagger.tag function call