Analyze a text and produce tokens
input text
assumed text language
sequence of tokens
Lucene analyzer factory to use (defaults to LuceneTextAnalyzer)
desired language
language-specific analyzer
selects the matching group as the token (default: -1, which is equivalent to "split")
the regular expression used for tokenization
Text analyzer implementation using a Lucene analyzer with Pattern Tokenizer matching
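
The group semantics described above can be illustrated with a minimal sketch. It does not call Lucene; it uses Python's `re` module to mimic the documented behavior, and the function name `pattern_tokenize` is hypothetical. It assumes that a group of -1 treats the pattern as a delimiter and drops empty tokens, while a group of 0 or higher emits that capture group from each match. Regular expression dialects differ between Java and Python, so treat this as an approximation of the idea rather than the analyzer's actual implementation.

```python
import re
from typing import List


def pattern_tokenize(text: str, pattern: str, group: int = -1) -> List[str]:
    """Hypothetical helper mimicking pattern-tokenizer group semantics."""
    regex = re.compile(pattern)

    if group < 0:
        # group = -1: the pattern is a delimiter ("split").
        # Collect the text between consecutive matches.
        tokens, last = [], 0
        for m in regex.finditer(text):
            tokens.append(text[last:m.start()])
            last = m.end()
        tokens.append(text[last:])
        # Empty tokens are dropped.
        return [t for t in tokens if t]

    # group >= 0: each match contributes the selected group as a token
    # (group 0 is the whole match). Unmatched or empty groups are skipped.
    return [m.group(group) for m in regex.finditer(text) if m.group(group)]


if __name__ == "__main__":
    print(pattern_tokenize("a, b,c", r",\s*"))                      # ['a', 'b', 'c']
    print(pattern_tokenize("key=val; k2=v2", r"(\w+)=(\w+)", 0))    # ['key=val', 'k2=v2']
    print(pattern_tokenize("key=val; k2=v2", r"(\w+)=(\w+)", 2))    # ['val', 'v2']
```

With the default group, the pattern describes what separates tokens; with a non-negative group, it describes the tokens themselves.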