Package

com.johnsnowlabs.nlp

annotators

Permalink

package annotators

Visibility
  1. Public
  2. All

Type Members

  1. class Chunk2Token extends AnnotatorModel[Chunk2Token]

    Permalink

    A feature transformer that converts the input array of strings (annotatorType CHUNK) into an array of chunk-based tokens (annotatorType TOKEN).

    A feature transformer that converts the input array of strings (annotatorType CHUNK) into an array of chunk-based tokens (annotatorType TOKEN).

    When the input is empty, an empty array is returned.

    This Annotator is specially convenient when using NGramGenerator annotations as inputs to WordEmbeddingsModels

  2. class DrugNormalizer extends AnnotatorModel[DrugNormalizer]

    Permalink

    Annotator which normalizes raw text from clinical documents, e.g.

    Annotator which normalizes raw text from clinical documents, e.g. scraped web pages or xml documents, from document type columns into Sentence. Removes all dirty characters from text following one or more input regex patterns. Can apply non wanted character removal which a specific policy. Can apply lower case normalization.

    See DocumentNormalizer test class for examples examples of usage.

Value Members

  1. object DrugNormalizer extends DefaultParamsReadable[DrugNormalizer] with Serializable

    Permalink
  2. package assertion

    Permalink
  3. package chunker

    Permalink
  4. package classification

    Permalink
  5. package context

    Permalink
  6. package datasets

    Permalink
  7. package deid

    Permalink
  8. package disambiguation

    Permalink
  9. package generic_classifier

    Permalink
  10. package merge

    Permalink
  11. package ner

    Permalink
  12. package re

    Permalink
  13. package resolution

    Permalink
  14. package text2sql

    Permalink

Ungrouped