RoBERTa Large CoNLL-03 NER Pipeline

Description

This pretrained pipeline is built on top of the roberta_large_token_classifier_conll03 model.

How to use

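Python:

# Requires the spark-nlp package and an active Spark NLP session
from sparknlp.pretrained import PretrainedPipeline

pipeline = PretrainedPipeline("roberta_large_token_classifier_conll03_pipeline", lang = "en")

pipeline.annotate("My name is John and I work at John Snow Labs.")

Scala:

// Requires the Spark NLP library on the classpath and an active Spark session
import com.johnsnowlabs.nlp.pretrained.PretrainedPipeline

val pipeline = new PretrainedPipeline("roberta_large_token_classifier_conll03_pipeline", lang = "en")

pipeline.annotate("My name is John and I work at John Snow Labs.")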

Results

+--------------+---------+
|chunk         |ner_label|
+--------------+---------+
|John          |PERSON   |
|John Snow Labs|ORG      |
+--------------+---------+

Model Information

Model Name:	roberta_large_token_classifier_conll03_pipeline
Type:	pipeline
Compatibility:	Spark NLP 3.4.1+
License:	Open Source
Edition:	Official
Language:	en
Size:	1.3 GB

Included Models

DocumentAssembler
SentenceDetectorDLModel
TokenizerModel
RoBertaForTokenClassification
NerConverter
Finisher
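
To show how per-token predictions from the classifier stage could become the chunks in the Results table, here is a minimal plain-Python sketch of IOB tag merging, the kind of grouping a converter stage such as NerConverter performs conceptually. It is an illustration only, not Spark NLP's implementation: the function name merge_iob is hypothetical, and the tag names (B-PERSON, B-ORG, I-ORG) are assumptions chosen to match the labels in the Results table above.

```python
from typing import List, Optional, Tuple


def merge_iob(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group IOB-tagged tokens into (chunk_text, label) pairs.

    Hypothetical helper for illustration; not part of Spark NLP.
    """
    chunks: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_label: Optional[str] = None

    def flush() -> None:
        # Emit the chunk collected so far (if any) and reset the state.
        nonlocal current_tokens, current_label
        if current_tokens:
            chunks.append((" ".join(current_tokens), current_label))
        current_tokens, current_label = [], None

    for token, tag in zip(tokens, tags):
        if tag == "O":
            # Outside any entity: close the open chunk.
            flush()
            continue
        prefix, _, label = tag.partition("-")
        # A "B-" tag, or a change of label, starts a new chunk.
        if prefix == "B" or label != current_label:
            flush()
            current_label = label
        current_tokens.append(token)

    flush()  # Close a chunk that runs to the end of the sentence.
    return chunks


# Assumed token-level tags for the example sentence used above.
tokens = ["My", "name", "is", "John", "and", "I", "work", "at",
          "John", "Snow", "Labs", "."]
tags = ["O", "O", "O", "B-PERSON", "O", "O", "O", "O",
        "B-ORG", "I-ORG", "I-ORG", "O"]

chunks = merge_iob(tokens, tags)
print(chunks)
```

Under these assumed tags, the sketch yields the same two chunks as the Results table: John (PERSON) and John Snow Labs (ORG).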
