sparknlp.training

Contains helper classes to make training with specific datasets easier.

To load a dataset, instantiate the class for its format, then call that class's readDataset method.
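The two-step loading pattern can be sketched as follows, using CoNLL as the example. The wrapper name `load_conll` and its `path` argument are hypothetical and chosen only for illustration. The imports are deferred into the function body so that defining it does not require Spark NLP to be installed.

```python
def load_conll(path):
    """Read a CoNLL file into a Spark DataFrame (hypothetical wrapper)."""
    # Deferred imports: nothing Spark-related is loaded until the call.
    import sparknlp
    from sparknlp.training import CoNLL

    spark = sparknlp.start()                # start or reuse a Spark session
    reader = CoNLL()                        # step 1: instantiate the helper
    return reader.readDataset(spark, path)  # step 2: load the data
```

The other classes in this module are assumed to follow the same instantiate-then-readDataset shape, although some take extra arguments.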

Classes

CoNLL

Instantiates the class to read a CoNLL dataset.

CoNLLU

Instantiates the class to read a CoNLL-U dataset.

POS

Helper class for creating DataFrames for training a part-of-speech tagger.
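As an illustration of the kind of input such a helper consumes, the sketch below splits one line of token/tag pairs into parallel lists. It assumes a plain-text layout of whitespace-separated `token|TAG` pairs with `|` as the delimiter. The function `split_tagged_line` is hypothetical and is not part of Spark NLP; it only mirrors the idea in pure Python.

```python
def split_tagged_line(line, delimiter="|"):
    """Split 'token|TAG token|TAG ...' into (tokens, tags). Illustrative only."""
    tokens, tags = [], []
    for pair in line.split():
        # Split on the LAST delimiter so tokens may contain the delimiter.
        token, sep, tag = pair.rpartition(delimiter)
        if not sep:
            raise ValueError(f"missing delimiter {delimiter!r} in {pair!r}")
        tokens.append(token)
        tags.append(tag)
    return tokens, tags


# Invented sample line, assuming the layout described above.
tokens, tags = split_tagged_line("A|DT few|JJ months|NNS")
```

The two lists stay aligned by index, which is the shape a tagger needs for training: one tag per token.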

PubTator

Helper class for reading datasets in the PubTator format, which contains medical papers’ titles, abstracts, and tagged chunks.
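To make the three parts of the format concrete, here is a rough pure-Python sketch that separates a title, an abstract, and tagged chunks. It assumes the common PubTator layout: `ID|t|title` and `ID|a|abstract` lines followed by tab-separated annotation lines of the form ID, start, end, mention, type. The function `parse_pubtator` and the sample text are invented for illustration and do not reflect how Spark NLP reads the file.

```python
def parse_pubtator(text):
    """Parse one PubTator-style record into a dict. Illustrative only."""
    doc = {"title": None, "abstract": None, "chunks": []}
    for line in text.splitlines():
        if not line.strip():
            continue
        parts = line.split("|", 2)
        if len(parts) == 3 and parts[1] in ("t", "a"):
            # Title or abstract line: ID|t|text or ID|a|text
            key = "title" if parts[1] == "t" else "abstract"
            doc[key] = parts[2]
            continue
        fields = line.split("\t")
        if len(fields) >= 5:
            # Annotation line: ID, start offset, end offset, mention, type
            _, start, end, mention, label = fields[:5]
            doc["chunks"].append((int(start), int(end), mention, label))
    return doc


# Invented sample record, assuming the layout described above.
sample = (
    "1|t|Aspirin and headache\n"
    "1|a|Aspirin relieves headache.\n"
    "1\t0\t7\tAspirin\tChemical\n"
    "1\t12\t20\theadache\tDisease\n"
)
doc = parse_pubtator(sample)
```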