RoBERTa Large CoNLL-03 NER Pipeline

Description

This pretrained pipeline is built on top of the roberta_large_token_classifier_conll03 model.

How to use

Python:

from sparknlp.pretrained import PretrainedPipeline

pipeline = PretrainedPipeline("roberta_large_token_classifier_conll03_pipeline", lang = "en")

pipeline.annotate("My name is John and I work at John Snow Labs.")

Scala:

import com.johnsnowlabs.nlp.pretrained.PretrainedPipeline

val pipeline = new PretrainedPipeline("roberta_large_token_classifier_conll03_pipeline", lang = "en")

pipeline.annotate("My name is John and I work at John Snow Labs.")

Results

+--------------+---------+
|chunk         |ner_label|
+--------------+---------+
|John          |PERSON   |
|John Snow Labs|ORG      |
+--------------+---------+

Model Information

Model Name: roberta_large_token_classifier_conll03_pipeline
Type: pipeline
Compatibility: Spark NLP 3.4.1+
License: Open Source
Edition: Official
Language: en
Size: 1.3 GB

Included Models

  • DocumentAssembler
  • SentenceDetectorDLModel
  • TokenizerModel
  • RoBertaForTokenClassification
  • NerConverter
  • Finisher
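
To illustrate what the NerConverter stage contributes, the sketch below merges token-level IOB tags into entity chunks like those in the Results table. It is a minimal pure-Python illustration of the general idea, not Spark NLP's implementation: the function name iob_to_chunks and the sample tag sequence (B-PERSON, B-ORG, I-ORG) are assumptions made for this example and are not taken from the model's actual output.

```python
from typing import List, Optional, Tuple


def iob_to_chunks(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Merge token-level IOB tags into (chunk_text, label) pairs.

    Hypothetical helper for illustration only; not part of Spark NLP.
    """
    chunks: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_label: Optional[str] = None

    def flush() -> None:
        nonlocal current_tokens, current_label
        # Emit the chunk collected so far, if any, then reset the state.
        if current_tokens and current_label is not None:
            chunks.append((" ".join(current_tokens), current_label))
        current_tokens, current_label = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always closes the open chunk and starts a new one.
            flush()
            current_tokens, current_label = [token], tag[2:]
        elif tag.startswith("I-") and current_label == tag[2:]:
            # An I- tag with the same label extends the open chunk.
            current_tokens.append(token)
        else:
            # "O", or an I- tag that does not continue the open chunk
            # (this sketch simply drops such stray I- tokens).
            flush()
    flush()
    return chunks


# Assumed tag sequence for the example sentence (illustrative only).
tokens = ["My", "name", "is", "John", "and", "I", "work", "at",
          "John", "Snow", "Labs", "."]
tags = ["O", "O", "O", "B-PERSON", "O", "O", "O", "O",
        "B-ORG", "I-ORG", "I-ORG", "O"]

print(iob_to_chunks(tokens, tags))
# [('John', 'PERSON'), ('John Snow Labs', 'ORG')]
```

In the pipeline itself this grouping is handled by NerConverter, so the sketch is only useful for understanding how token labels relate to the chunks shown under Results.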