Sentence Entity Resolver for Clinical Abbreviations and Acronyms (sbiobert_base_cased_mli embeddings)


This model maps clinical abbreviations and acronyms to their meanings using sbiobert_base_cased_mli Sentence BERT embeddings. It is an improved version of the base model and includes more varied data.
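To illustrate what resolving by sentence embeddings means, here is a minimal sketch of a nearest-neighbour lookup: every candidate meaning has a vector, and the abbreviation chunk is resolved to the candidates closest to its own vector. The toy vectors, the `resolve` helper, and the choice of Euclidean distance are illustrative assumptions; the actual model works on sbiobert_base_cased_mli embeddings and its own internal index.

```python
import math

# Toy 3-dimensional "embeddings" for candidate meanings (made-up numbers,
# NOT produced by sbiobert_base_cased_mli).
CANDIDATES = {
    "Complete Blood Count": [0.9, 0.1, 0.0],
    "carcinoembryonic antigen": [0.2, 0.8, 0.1],
    "clear to auscultation": [0.1, 0.2, 0.9],
}

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def resolve(chunk_vector, candidates, k=2):
    """Return the k candidate meanings closest to chunk_vector, nearest first."""
    ranked = sorted(candidates, key=lambda name: euclidean(chunk_vector, candidates[name]))
    return ranked[:k]

# Hypothetical embedding of the chunk "CBC" in its sentence context.
cbc_vector = [0.8, 0.2, 0.1]
top = resolve(cbc_vector, CANDIDATES)
print(top[0])            # closest meaning
print(":::".join(top))   # ranked candidates joined the way all_k_results is
```

The first element plays the role of the resolved meaning, and the joined string mirrors how the ranked alternatives are reported.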

Predicted Entities

Abbreviation Meanings

How to use

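# Assumes Spark NLP for Healthcare is installed and `spark` is an active session.
from pyspark.ml import Pipeline
from sparknlp.base import *
from sparknlp.annotator import *
from sparknlp_jsl.annotator import *

document_assembler = DocumentAssembler()\
    .setInputCol("text")\
    .setOutputCol("document")

tokenizer = Tokenizer()\
    .setInputCols(["document"])\
    .setOutputCol("token")

word_embeddings = WordEmbeddingsModel.pretrained("embeddings_clinical", "en", "clinical/models")\
    .setInputCols(["document", "token"])\
    .setOutputCol("word_embeddings")

clinical_ner = MedicalNerModel.pretrained("ner_abbreviation_clinical", "en", "clinical/models") \
    .setInputCols(["document", "token", "word_embeddings"]) \
    .setOutputCol("ner")

# Groups the NER tags into abbreviation chunks.
ner_converter = NerConverterInternal() \
    .setInputCols(["document", "token", "ner"]) \
    .setOutputCol("ner_chunk")

sentence_chunk_embeddings = BertSentenceChunkEmbeddings.pretrained("sbiobert_base_cased_mli", "en", "clinical/models")\
    .setInputCols(["document", "ner_chunk"])\
    .setOutputCol("sentence_embeddings")

abbr_resolver = SentenceEntityResolverModel.pretrained("sbiobertresolve_clinical_abbreviation_acronym", "en", "clinical/models") \
    .setInputCols(["sentence_embeddings"]) \
    .setOutputCol("abbr_meaning")

resolver_pipeline = Pipeline(
    stages = [document_assembler, tokenizer, word_embeddings, clinical_ner,
              ner_converter, sentence_chunk_embeddings, abbr_resolver])

# Every stage is pretrained, so fitting on an empty DataFrame only assembles the pipeline.
model = resolver_pipeline.fit(spark.createDataFrame([['']]).toDF("text"))

sample_text = "Gravid with estimated fetal weight of 6-6/12 pounds. LOWER EXTREMITIES: No edema. LABORATORY DATA: Laboratory tests include a CBC which is normal. Blood Type: AB positive. Rubella: Immune. VDRL: Nonreactive. Hepatitis C surface antigen: Negative. HIV: Negative. One-Hour Glucose: 117. Group B strep has not been done as yet."

abbr_result = model.transform(spark.createDataFrame([[sample_text]]).toDF('text'))
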
// Assumes the Spark NLP for Healthcare imports and spark.implicits._ are in scope.
val document_assembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("document")

val tokenizer = new Tokenizer()
    .setInputCols(Array("document"))
    .setOutputCol("token")

val word_embeddings = WordEmbeddingsModel.pretrained("embeddings_clinical", "en", "clinical/models")
    .setInputCols(Array("document", "token"))
    .setOutputCol("word_embeddings")

val clinical_ner = MedicalNerModel.pretrained("ner_abbreviation_clinical", "en", "clinical/models")
    .setInputCols(Array("document", "token", "word_embeddings"))
    .setOutputCol("ner")

val ner_converter = new NerConverterInternal()
    .setInputCols(Array("document", "token", "ner"))
    .setOutputCol("ner_chunk")

val sentence_chunk_embeddings = BertSentenceChunkEmbeddings.pretrained("sbiobert_base_cased_mli", "en", "clinical/models")
    .setInputCols(Array("document", "ner_chunk"))
    .setOutputCol("sentence_embeddings")

val abbr_resolver = SentenceEntityResolverModel.pretrained("sbiobertresolve_clinical_abbreviation_acronym", "en", "clinical/models")
    .setInputCols(Array("sentence_embeddings"))
    .setOutputCol("abbr_meaning")

// setStages expects an Array of pipeline stages.
val resolver_pipeline = new Pipeline().setStages(Array(document_assembler, tokenizer, word_embeddings, clinical_ner, ner_converter, sentence_chunk_embeddings, abbr_resolver))

val sample_text = Seq("""Gravid with estimated fetal weight of 6-6/12 pounds. LOWER EXTREMITIES: No edema. LABORATORY DATA: Laboratory tests include a CBC which is normal. Blood Type: AB positive. Rubella: Immune. VDRL: Nonreactive. Hepatitis C surface antigen: Negative. HIV: Negative. One-Hour Glucose: 117. Group B strep has not been done as yet.""").toDS().toDF("text")

val abbr_result = resolver_pipeline.fit(sample_text).transform(sample_text)

import nlu
nlu.load("en.resolve.clinical_abbreviation_acronym").predict("""Gravid with estimated fetal weight of 6-6/12 pounds. LOWER EXTREMITIES: No edema. LABORATORY DATA: Laboratory tests include a CBC which is normal. Blood Type: AB positive. Rubella: Immune. VDRL: Nonreactive. Hepatitis C surface antigen: Negative. HIV: Negative. One-Hour Glucose: 117. Group B strep has not been done as yet.""")


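Results

|    | chunk   | abbr_meaning                         | all_k_results                                                                                                                                                                                                |
|---:|:--------|:-------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|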
|  0 | CBC     | Complete Blood Count                 | Complete Blood Count:::Complete blood count:::blood group in ABO system:::(complement) component 4:::abortion:::carbohydrate antigen:::clear to auscultation:::carcinoembryonic antigen:::cervical (level) 4 |
|  1 | AB      | blood group in ABO system            | blood group in ABO system:::abortion                                                                                                                                                                         |
|  2 | VDRL    | Venereal disease research laboratory | Venereal disease research laboratory:::venous blood gas:::leukocyte esterase:::vertical banded gastroplasty                                                                                                  |
|  3 | HIV     | human immunodeficiency virus         | human immunodeficiency virus:::blood group in ABO system:::abortion:::fluorescent in situ hybridization                                                                                                      |
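
The all_k_results column packs the ranked candidate meanings into a single string separated by `:::`, and in every row of the table the first candidate is the one reported as abbr_meaning. A minimal sketch for splitting that string into a Python list follows; the `split_candidates` helper name is hypothetical, and the input is copied from the VDRL row above.

```python
def split_candidates(all_k_results: str) -> list:
    """Split a ':::'-delimited all_k_results string into an ordered list of candidates."""
    return [c.strip() for c in all_k_results.split(":::") if c.strip()]

vdrl = ("Venereal disease research laboratory:::venous blood gas"
        ":::leukocyte esterase:::vertical banded gastroplasty")

candidates = split_candidates(vdrl)
print(candidates[0])    # first candidate, i.e. the resolved meaning
print(len(candidates))  # number of candidates returned for this chunk
```

Keeping the candidates as an ordered list makes it straightforward to inspect the alternatives when the first one looks wrong for the clinical context.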

Model Information

Model Name: sbiobertresolve_clinical_abbreviation_acronym
Compatibility: Healthcare NLP 3.3.4+
License: Licensed
Edition: Official
Input Labels: [sentence_embeddings]
Output Labels: [output]
Language: en
Size: 112.3 MB
Case sensitive: true


Trained on an in-house curated dataset.