Matches standard date formats into a provided format
Class to find standarized lemmas from words.
Annotator that cleans out tokens.
Matches regular expressions and maps them to specified values optionally provided Rules are provided from external source file
Hard stemming of words for cut-of into standard word references
Extracts entities out of provided phrases
Tokenizes raw text into word pieces, tokens.