sparknlp_jsl#
Subpackages#
- sparknlp_jsl.annotator
- sparknlp_jsl.annotator.assertion
- sparknlp_jsl.annotator.chunker
- sparknlp_jsl.annotator.classification
- sparknlp_jsl.annotator.context
- sparknlp_jsl.annotator.deid
- sparknlp_jsl.annotator.disambiguation
- sparknlp_jsl.annotator.embeddings
- sparknlp_jsl.annotator.er
- sparknlp_jsl.annotator.fhir
- sparknlp_jsl.annotator.generic_classifier
- sparknlp_jsl.annotator.matcher
- sparknlp_jsl.annotator.medical_llm
- sparknlp_jsl.annotator.merge
- sparknlp_jsl.annotator.ner
- sparknlp_jsl.annotator.normalizer
- sparknlp_jsl.annotator.params
- sparknlp_jsl.annotator.parser
- sparknlp_jsl.annotator.qa
- sparknlp_jsl.annotator.rag
- sparknlp_jsl.annotator.re
- sparknlp_jsl.annotator.regex
- sparknlp_jsl.annotator.resolution
- sparknlp_jsl.annotator.seq2seq
- sparknlp_jsl.annotator.splitter
- sparknlp_jsl.annotator.windowed
- sparknlp_jsl.annotator.annotation_converter
- sparknlp_jsl.annotator.annotation_merger
- sparknlp_jsl.annotator.chunk2_token
- sparknlp_jsl.annotator.doc2_chunk_internal
- sparknlp_jsl.annotator.document_filterer_by_classifier
- sparknlp_jsl.annotator.document_filterer_by_ner
- sparknlp_jsl.annotator.feature_assembler
- sparknlp_jsl.annotator.filtering_params
- sparknlp_jsl.annotator.flattener
- sparknlp_jsl.annotator.handle_exception_params
- sparknlp_jsl.annotator.metadata_annotation_converter
- sparknlp_jsl.annotator.multi_chunk2_doc
- sparknlp_jsl.annotator.resolution2_chunk
- sparknlp_jsl.annotator.router
- sparknlp_jsl.annotator.source_tracking_metadata_params
- sparknlp_jsl.annotator.tf_graph_builder
- sparknlp_jsl.annotator.white_black_list_params
- sparknlp_jsl.common
- sparknlp_jsl.finance
- sparknlp_jsl.legal
- sparknlp_jsl.llm
- sparknlp_jsl.transpiler
- sparknlp_jsl.utils
- sparknlp_jsl.utils.alab_utils
- sparknlp_jsl.utils.conll_parse
- sparknlp_jsl.utils.database_deidentification
- sparknlp_jsl.utils.date_shift_filler
- sparknlp_jsl.utils.deidentification_utils
- sparknlp_jsl.utils.exception_handling_utils
- sparknlp_jsl.utils.imports
- sparknlp_jsl.utils.java_helper
- sparknlp_jsl.utils.licensed_annotator_type
- sparknlp_jsl.utils.llm_utils
- sparknlp_jsl.utils.log_parse
- sparknlp_jsl.utils.ner_utils
- sparknlp_jsl.utils.ocr_nlp_processor
- sparknlp_jsl.utils.ocr_utils
- sparknlp_jsl.utils.risk_adjustment_utils
- sparknlp_jsl.utils.run_transpiled_code
- sparknlp_jsl.utils.training_log_parser_utils
- sparknlp_jsl.utils.visualner_annotations_parser
Submodules#
- sparknlp_jsl.alab
- sparknlp_jsl.base
- sparknlp_jsl.compatibility
- sparknlp_jsl.custom_transformer
- sparknlp_jsl.deidentification_module
- sparknlp_jsl.eval
- sparknlp_jsl.functions
- sparknlp_jsl.internal
- sparknlp_jsl.modelTracer
- sparknlp_jsl.pipeline_output_parser
- sparknlp_jsl.pipeline_tracer
- sparknlp_jsl.pretrained
- sparknlp_jsl.structured_deidentification
- sparknlp_jsl.text_to_documents_columns
- sparknlp_jsl.training
- sparknlp_jsl.training_log_parser
- sparknlp_jsl.updateModels
- sparknlp_jsl.util
Package Contents#
Functions#
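- get_credentials(spark): Gets John Snow Labs credentials
- library_settings(spark): Gets the library settings
- load_license_validator()
- pub_version(): Gets the public version of Spark NLP
- start(secret, gpu, apple_silicon, aarch64, public, params, fhir_deid): Starts a SparkSession with default parameters for Spark NLP Licensed
- version(): Gets the version of Spark NLP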
Attributes#
- get_credentials(spark)#
Gets John Snow Labs credentials
- Parameters:
spark (SparkSession) – SparkSession
- Returns:
(secretKey, keyId, token)
- Return type:
tuple
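As an illustration of the return shape, the sketch below unpacks the `(secretKey, keyId, token)` tuple in the documented order and masks each value before it is displayed. The `Credentials` field names and the `mask`, `summarize`, and `credentials_summary` helpers are hypothetical and not part of `sparknlp_jsl`; the sketch also assumes all three values are strings.

```python
from typing import NamedTuple, Tuple


class Credentials(NamedTuple):
    """Hypothetical labels for the documented (secretKey, keyId, token) tuple."""
    secret_key: str
    key_id: str
    token: str


def mask(value: str, visible: int = 4) -> str:
    """Hide all but the last `visible` characters of a sensitive string."""
    if len(value) <= visible:
        return "*" * len(value)
    return "*" * (len(value) - visible) + value[-visible:]


def summarize(raw: Tuple[str, str, str]) -> dict:
    """Label the three values in documented order and mask each one."""
    creds = Credentials(*raw)
    return {name: mask(val) for name, val in creds._asdict().items()}


def credentials_summary(spark) -> dict:
    """Fetch credentials from a live session and return a masked summary."""
    # Imported lazily so the helpers above work without the licensed package.
    from sparknlp_jsl import get_credentials
    return summarize(get_credentials(spark))
```

With an active session, `credentials_summary(spark)` would return a dictionary that is safer to log than the raw tuple.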
- library_settings(spark)#
Gets the library settings
- Parameters:
spark (SparkSession) – SparkSession
- Returns:
Library settings
- Return type:
str
- load_license_validator()#
- pub_version()#
Gets the public version of Spark NLP
- Returns:
Public version of Spark NLP
- Return type:
str
- start(secret: str = None, gpu: bool = False, apple_silicon: bool = False, aarch64=False, public: str = '', params: dict = None, fhir_deid: bool = False)#
Starts a SparkSession with default parameters for Spark NLP Licensed
The default parameters would result in the equivalent of:
    SparkSession.builder \
        .appName("Spark NLP Licensed") \
        .master("local[*]") \
        .config("spark.driver.memory", "{{available memory}}") \
        .config("spark.serializer", "org.apache.spark.serializer.KryoSerializer") \
        .config("spark.kryoserializer.buffer.max", "2000M") \
        .config("spark.driver.maxResultSize", "0") \
        .config("spark.extraListeners", "com.johnsnowlabs.license.LicenseLifeCycleManager") \
        .config("spark.jars", "https://pypi.johnsnowlabs.com/|secret|/spark-nlp-jsl-|release|.jar") \
        .config("spark.jars.packages", "com.johnsnowlabs.nlp:spark-nlp_2.12:|release|") \
        .getOrCreate()
- Parameters:
secret (str, optional) – Secret key. If None, the function tries to read it from the environment variable ‘SECRET’
gpu (bool) – Whether to use the GPU build
apple_silicon (bool) – Whether to use the Apple Silicon (M1) build
aarch64 (bool) – Whether to use the aarch64 build
public (str) – Version of the public Spark NLP library
params (dict) – Additional SparkSession configuration parameters
fhir_deid (bool) – Whether to enable FHIR De-identification
- Notes:
By default, spark.driver.memory is set to the available memory.
- Returns:
SparkSession with Spark NLP Licensed
- Return type:
SparkSession
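A minimal sketch of calling `start` with custom Spark settings follows. The helper names `build_start_kwargs` and `start_session` are hypothetical, and the sketch assumes that entries in `params` are passed through as Spark configuration, which is how the parameter is described above. When no secret is supplied, the `secret` argument is left out so that `start` can fall back to the `SECRET` environment variable.

```python
from typing import Optional


def build_start_kwargs(secret: Optional[str] = None,
                       driver_memory: str = "16G",
                       gpu: bool = False) -> dict:
    """Assemble keyword arguments for sparknlp_jsl.start (hypothetical helper)."""
    kwargs = {
        "gpu": gpu,
        # Assumption: keys in `params` are applied as SparkSession config entries.
        "params": {"spark.driver.memory": driver_memory},
    }
    if secret is not None:
        # Omitting `secret` lets start() read the SECRET environment variable.
        kwargs["secret"] = secret
    return kwargs


def start_session(**options):
    """Start a licensed session; requires the sparknlp_jsl package and a license."""
    # Imported lazily so build_start_kwargs can be used and tested on its own.
    import sparknlp_jsl
    return sparknlp_jsl.start(**build_start_kwargs(**options))
```

For example, `start_session(driver_memory="8G")` would request an 8 GB driver and rely on the `SECRET` environment variable for the license secret.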
- version()#
Gets the version of Spark NLP
- Returns:
Version of Spark NLP
- Return type:
str
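Because `version()` and `pub_version()` return plain strings, comparing them directly can mislead (for instance, "4.10.0" sorts before "4.9.0" as text). The sketch below parses a version string into a tuple of integers before comparing. It assumes a dotted `major.minor.patch` prefix; `parse_version` and `at_least` are hypothetical helpers, and the pattern is unrelated to the package's own `version_regex` attribute.

```python
import re
from typing import Tuple

# Assumption: version strings begin with "major.minor.patch".
_VERSION_PATTERN = re.compile(r"^(\d+)\.(\d+)\.(\d+)")


def parse_version(text: str) -> Tuple[int, int, int]:
    """Extract (major, minor, patch) from the start of a version string."""
    match = _VERSION_PATTERN.match(text.strip())
    if match is None:
        raise ValueError(f"Unrecognized version string: {text!r}")
    major, minor, patch = (int(part) for part in match.groups())
    return (major, minor, patch)


def at_least(text: str, minimum: Tuple[int, int, int]) -> bool:
    """Return True if the parsed version is greater than or equal to `minimum`."""
    return parse_version(text) >= minimum
```

With the library installed, a check such as `at_least(sparknlp_jsl.version(), (5, 0, 0))` could guard code that depends on a newer release.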
- annotators#
- size_regex#
- transformer_seq_classification#
- version_regex#