Vaccinations and Infectious Diseases

Description

This pipeline is designed to extract vaccine and related disease/symptom entities from clinical/medical texts.

Predicted entities are:

Bacterial_Vax, Viral_Vax, Cancer_Vax, Bac_Vir_Comb, Other_Vax, Vax_Dose, Infectious_Disease, Other_Disease_Disorder, Sign_Symptom, Toxoid, Adaptive_Immunity, Inactivated, Date, Age

Copy S3 URI

How to use


from sparknlp.pretrained import PretrainedPipeline

ner_pipeline = PretrainedPipeline("vaccine_names", "en", "clinical/models")

result = ner_pipeline.annotate("""
On May 14, 2023, a 57-year-old female presented with fever, joint pain, and fatigue three days after receiving her third dose of the Shingrix vaccine. 
Her history includes rheumatoid arthritis, managed with immunosuppressants, and prior breast cancer in remission. 
She previously received Gardasil 9, an HPV vaccine, and the hepatitis B recombinant vaccine series in 2020. 
Notably, she developed mild aches following her annual influenza shot, which is an inactivated vaccine. 
The patient reported receiving the DTaP vaccine (a toxoid vaccine) as a child. She also had tuberculosis as a teenager and had COVID-19 twice during the pandemic. 
In 2022, she was enrolled in a clinical trial for Stimuvax, a cancer vaccine targeting MUC1-expressing tumors. 
The team is assessing whether the patient's symptoms are due to a flare in her autoimmune disease or a delayed viral vaccine reaction. 
""")


import com.johnsnowlabs.nlp.pretrained.PretrainedPipeline

val ner_pipeline = PretrainedPipeline("vaccine_names", "en", "clinical/models")

val result = ner_pipeline.annotate("""
On May 14, 2023, a 57-year-old female presented with fever, joint pain, and fatigue three days after receiving her third dose of the Shingrix vaccine. 
Her history includes rheumatoid arthritis, managed with immunosuppressants, and prior breast cancer in remission. 
She previously received Gardasil 9, an HPV vaccine, and the hepatitis B recombinant vaccine series in 2020. 
Notably, she developed mild aches following her annual influenza shot, which is an inactivated vaccine. 
The patient reported receiving the DTaP vaccine (a toxoid vaccine) as a child. She also had tuberculosis as a teenager and had COVID-19 twice during the pandemic. 
In 2022, she was enrolled in a clinical trial for Stimuvax, a cancer vaccine targeting MUC1-expressing tumors. 
The team is assessing whether the patient's symptoms are due to a flare in her autoimmune disease or a delayed viral vaccine reaction. 
""")

Results

|    | chunks               |   begin |   end | entities               |
|---:|:---------------------|--------:|------:|:-----------------------|
|  0 | May 14, 2023         |       4 |    15 | Date                   |
|  1 | 57-year-old          |      20 |    30 | Age                    |
|  2 | fever                |      54 |    58 | Sign_Symptom           |
|  3 | joint pain           |      61 |    70 | Sign_Symptom           |
|  4 | fatigue              |      77 |    83 | Sign_Symptom           |
|  5 | third dose           |     116 |   125 | Vax_Dose               |
|  6 | Shingrix             |     134 |   141 | Viral_Vax              |
|  7 | vaccine              |     143 |   149 | Adaptive_Immunity      |
|  8 | rheumatoid arthritis |     174 |   193 | Other_Disease_Disorder |
|  9 | breast cancer        |     239 |   251 | Other_Disease_Disorder |
| 10 | Gardasil 9           |     292 |   301 | Viral_Vax              |
| 11 | HPV                  |     307 |   309 | Infectious_Disease     |
| 12 | vaccine              |     311 |   317 | Adaptive_Immunity      |
| 13 | hepatitis B          |     328 |   338 | Infectious_Disease     |
| 14 | recombinant          |     340 |   350 | Other_Vax              |
| 15 | vaccine              |     352 |   358 | Adaptive_Immunity      |
| 16 | 2020                 |     370 |   373 | Date                   |
| 17 | aches                |     405 |   409 | Sign_Symptom           |
| 18 | influenza            |     432 |   440 | Infectious_Disease     |
| 19 | inactivated          |     460 |   470 | Inactivated            |
| 20 | vaccine              |     472 |   478 | Adaptive_Immunity      |
| 21 | DTaP                 |     517 |   520 | Bacterial_Vax          |
| 22 | vaccine              |     522 |   528 | Adaptive_Immunity      |
| 23 | toxoid               |     533 |   538 | Toxoid                 |
| 24 | vaccine              |     540 |   546 | Adaptive_Immunity      |
| 25 | child                |     554 |   558 | Age                    |
| 26 | tuberculosis         |     574 |   585 | Infectious_Disease     |
| 27 | COVID-19             |     609 |   616 | Infectious_Disease     |
| 28 | 2022                 |     649 |   652 | Date                   |
| 29 | Stimuvax             |     696 |   703 | Cancer_Vax             |
| 30 | cancer               |     708 |   713 | Other_Disease_Disorder |
| 31 | vaccine              |     715 |   721 | Adaptive_Immunity      |
| 32 | tumors               |     749 |   754 | Other_Disease_Disorder |
| 33 | vaccine              |     875 |   881 | Adaptive_Immunity      |

Model Information

Model Name: vaccine_names
Type: pipeline
Compatibility: Healthcare NLP 6.0.3+
License: Licensed
Edition: Official
Language: en
Size: 1.7 GB

Benchmarking


                  label  precision    recall  f1-score   support

      Adaptive_Immunity       0.96      0.98      0.97       659
                    Age       0.87      0.91      0.89       379
           Bac_Vir_Comb       0.94      0.97      0.96        66
          Bacterial_Vax       0.94      0.73      0.82       129
             Cancer_Vax       0.92      0.96      0.94       112
                   Date       0.93      0.92      0.92        99
            Inactivated       0.93      0.78      0.85        32
     Infectious_Disease       0.96      0.98      0.97       772
 Other_Disease_Disorder       0.87      0.90      0.89       441
              Other_Vax       0.77      1.00      0.87        17
           Sign_Symptom       0.87      0.86      0.87       197
                 Toxoid       0.94      1.00      0.97        15
               Vax_Dose       0.78      0.95      0.86       169
              Viral_Vax       0.94      0.94      0.94       186
            

              micro_avg       0.92      0.93      0.92      3273
              macro avg       0.90      0.92      0.91      3273
           weighted avg       0.92      0.93      0.92      3273

Included Models

  • DocumentAssembler
  • SentenceDetectorDLModel
  • TokenizerModel
  • WordEmbeddingsModel
  • TextMatcherInternalModel
  • MedicalNerModel
  • NerConverterInternalModel
  • ChunkMergeModel