Pipeline to Extract Cancer Therapies and Posology Information

Description

This pretrained pipeline is built on the top of ner_oncology_unspecific_posology_healthcare model.

Predicted Entities

Cancer_Therapy, Posology_Information

Copy S3 URI

How to use

from sparknlp.pretrained import PretrainedPipeline

pipeline = PretrainedPipeline("ner_oncology_unspecific_posology_healthcare_pipeline", "en", "clinical/models")

text = "
he patient underwent a regimen consisting of adriamycin (60 mg/m2) and cyclophosphamide (600 mg/m2) over six courses. She is currently receiving his second cycle of chemotherapy and is in good overall condition.
"

result = pipeline.fullAnnotate(text)
import com.johnsnowlabs.nlp.pretrained.PretrainedPipeline

val pipeline = new PretrainedPipeline("ner_oncology_unspecific_posology_healthcare_pipeline", "en", "clinical/models")

val text = "
he patient underwent a regimen consisting of adriamycin (60 mg/m2) and cyclophosphamide (600 mg/m2) over six courses. She is currently receiving his second cycle of chemotherapy and is in good overall condition.
"

val result = pipeline.fullAnnotate(text)

Results

|    | chunks           |   begin |   end | entities             |   confidence |
|---:|:-----------------|--------:|------:|:---------------------|-------------:|
|  0 | adriamycin       |      46 |    55 | Cancer_Therapy       |     0.9999   |
|  1 | 60 mg/m2         |      58 |    65 | Posology_Information |     0.807    |
|  2 | cyclophosphamide |      72 |    87 | Cancer_Therapy       |     0.9998   |
|  3 | 600 mg/m2        |      90 |    98 | Posology_Information |     0.9566   |
|  4 | over six courses |     101 |   116 | Posology_Information |     0.689833 |
|  5 | second cycle     |     150 |   161 | Posology_Information |     0.9906   |
|  6 | chemotherapy     |     166 |   177 | Cancer_Therapy       |     0.9997   |

Model Information

Model Name: ner_oncology_unspecific_posology_healthcare_pipeline
Type: pipeline
Compatibility: Healthcare NLP 4.3.0+
License: Licensed
Edition: Official
Language: en
Size: 533.1 MB

Included Models

  • DocumentAssembler
  • SentenceDetectorDLModel
  • TokenizerModel
  • WordEmbeddingsModel
  • MedicalNerModel
  • NerConverterInternalModel