Pipeline to Detect PHI for Deidentification (Generic - Augmented)

Description

This pretrained pipeline is built on top of the ner_deid_generic_augmented model and detects protected health information (PHI) in clinical text for deidentification.


How to use

Python

from sparknlp.pretrained import PretrainedPipeline

pipeline = PretrainedPipeline("ner_deid_generic_augmented_pipeline", "en", "clinical/models")

pipeline.annotate("A. Record date : 2093-01-13, David Hale, M.D., Name : Hendrickson, Ora MR. # 7194334 Date : 01/13/93 PCP : Oliveira, 25 -year-old, Record date : 1-11-2000. Cocke County Baptist Hospital. 0295 Keats Street. Phone +1 (302) 786-5227.")

Scala

import com.johnsnowlabs.nlp.pretrained.PretrainedPipeline

val pipeline = new PretrainedPipeline("ner_deid_generic_augmented_pipeline", "en", "clinical/models")

pipeline.annotate("A. Record date : 2093-01-13, David Hale, M.D., Name : Hendrickson, Ora MR. # 7194334 Date : 01/13/93 PCP : Oliveira, 25 -year-old, Record date : 1-11-2000. Cocke County Baptist Hospital. 0295 Keats Street. Phone +1 (302) 786-5227.")

NLU

import nlu

nlu.load("en.med_ner.deid_generic_augmented.pipeline").predict("""A. Record date : 2093-01-13, David Hale, M.D., Name : Hendrickson, Ora MR. # 7194334 Date : 01/13/93 PCP : Oliveira, 25 -year-old, Record date : 1-11-2000. Cocke County Baptist Hospital. 0295 Keats Street. Phone +1 (302) 786-5227.""")

Results

+-------------------------------------------------+---------+
|chunk                                            |ner_label|
+-------------------------------------------------+---------+
|2093-01-13                                       |DATE     |
|David Hale                                       |NAME     |
|Hendrickson                                      |NAME     |
|Ora MR.                                          |LOCATION |
|7194334                                          |ID       |
|01/13/93                                         |DATE     |
|Oliveira                                         |NAME     |
|25                                               |AGE      |
|1-11-2000                                        |DATE     |
|Cocke County Baptist Hospital. 0295 Keats Street.|LOCATION |
|(302) 786-5227                                   |CONTACT  |
+-------------------------------------------------+---------+
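
The detected chunks are typically used to mask the original text. The following is a minimal sketch of that step, not part of the pipeline itself: it assumes the (chunk, ner_label) pairs from the table above have already been extracted into a Python list, and the helper name mask_entities is hypothetical. It uses plain string replacement only to stay self-contained; matching on the character offsets carried by each annotation would be more reliable when a chunk's text also occurs elsewhere in the note.

```python
# Sample note and (chunk, label) pairs copied from the Results table.
# In practice the pairs would come from the pipeline output.
TEXT = (
    "A. Record date : 2093-01-13, David Hale, M.D., Name : Hendrickson, "
    "Ora MR. # 7194334 Date : 01/13/93 PCP : Oliveira, 25 -year-old, "
    "Record date : 1-11-2000. Cocke County Baptist Hospital. "
    "0295 Keats Street. Phone +1 (302) 786-5227."
)

ENTITIES = [
    ("2093-01-13", "DATE"),
    ("David Hale", "NAME"),
    ("Hendrickson", "NAME"),
    ("Ora MR.", "LOCATION"),
    ("7194334", "ID"),
    ("01/13/93", "DATE"),
    ("Oliveira", "NAME"),
    ("25", "AGE"),
    ("1-11-2000", "DATE"),
    ("Cocke County Baptist Hospital. 0295 Keats Street.", "LOCATION"),
    ("(302) 786-5227", "CONTACT"),
]


def mask_entities(text, entities):
    """Replace every detected chunk with a <LABEL> placeholder (hypothetical helper)."""
    masked = text
    # Longest chunks first, so a short chunk such as "25" cannot
    # break up a longer chunk that happens to contain it.
    for chunk, label in sorted(entities, key=lambda e: len(e[0]), reverse=True):
        masked = masked.replace(chunk, f"<{label}>")
    return masked


masked = mask_entities(TEXT, ENTITIES)
print(masked)
```

Because the replacement is purely textual, a short chunk like "25" would also be masked wherever else that string appears, which is one reason offset-based masking is the safer choice for real notes.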

Model Information

Model Name: ner_deid_generic_augmented_pipeline
Type: pipeline
Compatibility: Healthcare NLP 3.4.1+
License: Licensed
Edition: Official
Language: en
Size: 1.7 GB

Included Models

  • DocumentAssembler
  • SentenceDetectorDLModel
  • TokenizerModel
  • WordEmbeddingsModel
  • MedicalNerModel
  • NerConverter