class RelationExtractionModel extends GenericClassifierModel with ParamsAndFeaturesWritable with HandleExceptionParams with HasSafeAnnotate[GenericClassifierModel]

Extracts and classifies instances of relations between named entities. The entity pairs to consider must be defined with setRelationPairs, which specifies between which entity types the extraction should be done.

For available pretrained models, please see the Models Hub.

Example

Relation Extraction between body parts

Define pipeline stages to extract entities

val documenter = new DocumentAssembler()
  .setInputCol("text")
  .setOutputCol("document")

val sentencer = new SentenceDetector()
  .setInputCols("document")
  .setOutputCol("sentences")

val tokenizer = new Tokenizer()
  .setInputCols("sentences")
  .setOutputCol("tokens")

val words_embedder = WordEmbeddingsModel.pretrained("embeddings_clinical", "en", "clinical/models")
  .setInputCols("sentences", "tokens")
  .setOutputCol("embeddings")

val pos_tagger = PerceptronModel.pretrained("pos_clinical", "en", "clinical/models")
  .setInputCols("sentences", "tokens")
  .setOutputCol("pos_tags")

val dependency_parser = DependencyParserModel.pretrained("dependency_conllu", "en")
  .setInputCols("sentences", "pos_tags", "tokens")
  .setOutputCol("dependencies")

val clinical_ner_tagger = MedicalNerModel.pretrained("jsl_ner_wip_greedy_clinical","en","clinical/models")
  .setInputCols("sentences", "tokens", "embeddings")
  .setOutputCol("ner_tags")

val ner_chunker = new NerConverter()
  .setInputCols("sentences", "tokens", "ner_tags")
  .setOutputCol("ner_chunks")

Define the entity pairs between which relations are to be extracted

val relationPairs = Array(
  "direction-external_body_part_or_region",
  "external_body_part_or_region-direction",
  "direction-internal_organ_or_component",
  "internal_organ_or_component-direction")

val re_model = RelationExtractionModel.pretrained("re_bodypart_directions", "en", "clinical/models")
  .setInputCols("embeddings", "pos_tags", "ner_chunks", "dependencies")
  .setOutputCol("relations")
  .setRelationPairs(relationPairs)
  .setMaxSyntacticDistance(4)
  .setPredictionThreshold(0.9f)

val pipeline = new Pipeline().setStages(Array(
  documenter,
  sentencer,
  tokenizer,
  words_embedder,
  pos_tagger,
  clinical_ner_tagger,
  ner_chunker,
  dependency_parser,
  re_model
))

val data = Seq("MRI demonstrated infarction in the upper brain stem , left cerebellum and  right basil ganglia").toDF("text")
val result = pipeline.fit(data).transform(data)

Show results

result.selectExpr("explode(relations) as relations")
 .select(
   "relations.metadata.chunk1",
   "relations.metadata.entity1",
   "relations.metadata.chunk2",
   "relations.metadata.entity2",
   "relations.result"
 )
 .where("result != 0")
 .show(truncate=false)
+------+---------+-------------+---------------------------+------+
|chunk1|entity1  |chunk2       |entity2                    |result|
+------+---------+-------------+---------------------------+------+
|upper |Direction|brain stem   |Internal_organ_or_component|1     |
|left  |Direction|cerebellum   |Internal_organ_or_component|1     |
|right |Direction|basil ganglia|Internal_organ_or_component|1     |
+------+---------+-------------+---------------------------+------+
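The pair whitelist used above can be illustrated with a small, self-contained sketch in plain Scala. This is not the library's internal code; `allowed` is a hypothetical helper showing how dash-separated pairs restrict candidate entity pairs:

```scala
// Hypothetical sketch of how a relation-pair whitelist restricts candidate
// entity pairs; NOT the library's internal implementation.
def allowed(pairs: Set[String], entity1: String, entity2: String,
            caseSensitive: Boolean = false): Boolean = {
  val norm = (s: String) => if (caseSensitive) s else s.toLowerCase
  // Pairs are dash-separated, e.g. "direction-internal_organ_or_component"
  pairs.map(norm).contains(norm(s"$entity1-$entity2"))
}
```

With relationPairsCaseSensitive left at its default (false), an entity pair like ("Direction", "Internal_organ_or_component") matches the pair "direction-internal_organ_or_component" from the list above.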
See also

RelationExtractionApproach to train your own model.

RelationExtractionDLModel for BERT-based extraction

Linear Supertypes
GenericClassifierModel, CheckLicense, HasSafeAnnotate[GenericClassifierModel], HandleExceptionParams, HasSimpleAnnotate[GenericClassifierModel], WriteTensorflowModel, HasStorageRef, GenericClassifierParams, AnnotatorModel[GenericClassifierModel], CanBeLazy, RawAnnotator[GenericClassifierModel], HasOutputAnnotationCol, HasInputAnnotationCols, HasOutputAnnotatorType, ParamsAndFeaturesWritable, HasFeatures, DefaultParamsWritable, MLWritable, Model[GenericClassifierModel], Transformer, PipelineStage, Logging, Params, Serializable, Serializable, Identifiable, AnyRef, Any

Instance Constructors

  1. new RelationExtractionModel()
  2. new RelationExtractionModel(uid: String)

    uid

    a unique identifier for the instantiated AnnotatorModel

Type Members

  1. type AnnotationContent = Seq[Row]
    Attributes
    protected
    Definition Classes
    AnnotatorModel
  2. type AnnotatorType = String
    Definition Classes
    HasOutputAnnotatorType

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def $[T](param: Param[T]): T
    Attributes
    protected
    Definition Classes
    Params
  4. def $$[T](feature: StructFeature[T]): T
    Attributes
    protected
    Definition Classes
    HasFeatures
  5. def $$[K, V](feature: MapFeature[K, V]): Map[K, V]
    Attributes
    protected
    Definition Classes
    HasFeatures
  6. def $$[T](feature: SetFeature[T]): Set[T]
    Attributes
    protected
    Definition Classes
    HasFeatures
  7. def $$[T](feature: ArrayFeature[T]): Array[T]
    Attributes
    protected
    Definition Classes
    HasFeatures
  8. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  9. def _transform(dataset: Dataset[_], recursivePipeline: Option[PipelineModel]): DataFrame
    Attributes
    protected
    Definition Classes
    AnnotatorModel
  10. def afterAnnotate(dataset: DataFrame): DataFrame
    Attributes
    protected
    Definition Classes
    AnnotatorModel
  11. def annotate(annotations: Seq[Annotation]): Seq[Annotation]

    Takes a document and annotations and produces new annotations of this annotator's annotation type.

    annotations

    Annotations that correspond to inputAnnotationCols generated by previous annotators if any

    returns

    Any number of annotations processed for every input annotation; not necessarily a one-to-one relationship.

    Definition Classes
    RelationExtractionModel → GenericClassifierModel → HasSimpleAnnotate
  12. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  13. val batchSize: IntParam

    Batch size

    Definition Classes
    GenericClassifierParams
  14. def beforeAnnotate(dataset: Dataset[_]): Dataset[_]
    Attributes
    protected
    Definition Classes
    GenericClassifierModel → AnnotatorModel
  15. def categorizeRel(relation: RelationInstance): (Long, Float, Array[Float])
    Attributes
    protected
  16. final def checkSchema(schema: StructType, inputAnnotatorType: String): Boolean
    Attributes
    protected
    Definition Classes
    HasInputAnnotationCols
  17. def checkValidEnvironment(spark: Option[SparkSession], scopes: Seq[String], metadata: Option[Map[String, Value]]): Unit
    Definition Classes
    CheckLicense
  18. def checkValidScope(scope: String): Unit
    Definition Classes
    CheckLicense
  19. def checkValidScopeAndEnvironment(scope: String, spark: Option[SparkSession], checkLp: Boolean, metadata: Option[Map[String, Value]]): Unit
    Definition Classes
    CheckLicense
  20. def checkValidScopesAndEnvironment(scopes: Seq[String], spark: Option[SparkSession], checkLp: Boolean, metadata: Option[Map[String, Value]]): Unit
    Definition Classes
    CheckLicense
  21. val classes: StringArrayParam
    Definition Classes
    GenericClassifierModel
  22. final def clear(param: Param[_]): RelationExtractionModel.this.type
    Definition Classes
    Params
  23. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  24. def copy(extra: ParamMap): GenericClassifierModel
    Definition Classes
    RawAnnotator → Model → Transformer → PipelineStage → Params
  25. def copyValues[T <: Params](to: T, extra: ParamMap): T
    Attributes
    protected
    Definition Classes
    Params
  26. def createDatabaseConnection(database: Name): RocksDBConnection
    Definition Classes
    HasStorageRef
  27. var customLabels: MapFeature[String, String]

    Custom relation labels

  28. val datasetInfo: Param[String]

    Descriptive information about the dataset being used.

    Definition Classes
    GenericClassifierParams
  29. final def defaultCopy[T <: Params](extra: ParamMap): T
    Attributes
    protected
    Definition Classes
    Params
  30. def dfAnnotate: UserDefinedFunction
    Definition Classes
    HasSimpleAnnotate
  31. val directionSensitive: BooleanParam

    If true, only relations of the form "ENTITY1-ENTITY2" are considered; if false, both "ENTITY1-ENTITY2" and "ENTITY2-ENTITY1" relations are considered.
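The behavior can be sketched in plain Scala (illustrative only; `pairMatches` is a hypothetical helper, not the library's internals):

```scala
// If directionSensitive is true, only the forward form ENTITY1-ENTITY2
// matches; otherwise the reversed form ENTITY2-ENTITY1 is accepted as well.
def pairMatches(pair: String, e1: String, e2: String,
                directionSensitive: Boolean): Boolean =
  s"$e1-$e2" == pair || (!directionSensitive && s"$e2-$e1" == pair)
```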

  32. val doExceptionHandling: BooleanParam

    If true, exceptions are handled. If exception-causing data is passed to the model, an error annotation is emitted that contains the exception message, and processing continues with the next record. This comes with a performance penalty.

    Definition Classes
    HandleExceptionParams
  33. val dropout: FloatParam

    Dropout coefficient

    Definition Classes
    GenericClassifierParams
  34. val epochsN: IntParam

    Maximum number of epochs to train

    Definition Classes
    GenericClassifierParams
  35. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  36. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  37. def explainParam(param: Param[_]): String
    Definition Classes
    Params
  38. def explainParams(): String
    Definition Classes
    Params
  39. final val extraInputCols: StringArrayParam
    Attributes
    protected
    Definition Classes
    HasInputAnnotationCols
  40. def extraValidate(structType: StructType): Boolean
    Attributes
    protected
    Definition Classes
    RawAnnotator
  41. def extraValidateMsg: String
    Attributes
    protected
    Definition Classes
    RawAnnotator
  42. final def extractParamMap(): ParamMap
    Definition Classes
    Params
  43. final def extractParamMap(extra: ParamMap): ParamMap
    Definition Classes
    Params
  44. val featureScaling: Param[String]

    Feature scaling method. Possible values are 'zscore', 'minmax' or empty (no scaling).

    Definition Classes
    GenericClassifierParams
  45. val features: ArrayBuffer[Feature[_, _, _]]
    Definition Classes
    HasFeatures
  46. val filterByTokenDistance: IntParam

    Filtering criterion based on the number of tokens between entities: the model only finds relations that have fewer than the specified number of tokens between them.
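As a sketch (hypothetical helper, assuming token positions are integer indices):

```scala
// Keep a candidate pair only if fewer than maxTokens tokens lie strictly
// between the end of the first entity and the start of the second.
def withinTokenDistance(firstEntityEnd: Int, secondEntityStart: Int,
                        maxTokens: Int): Boolean =
  (secondEntityStart - firstEntityEnd - 1) < maxTokens
```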

  47. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  48. val fixImbalance: BooleanParam

    Fix the imbalance in the training set by replicating examples of underrepresented categories.

    Definition Classes
    GenericClassifierParams
  49. def get[T](feature: StructFeature[T]): Option[T]
    Attributes
    protected
    Definition Classes
    HasFeatures
  50. def get[K, V](feature: MapFeature[K, V]): Option[Map[K, V]]
    Attributes
    protected
    Definition Classes
    HasFeatures
  51. def get[T](feature: SetFeature[T]): Option[Set[T]]
    Attributes
    protected
    Definition Classes
    HasFeatures
  52. def get[T](feature: ArrayFeature[T]): Option[Array[T]]
    Attributes
    protected
    Definition Classes
    HasFeatures
  53. final def get[T](param: Param[T]): Option[T]
    Definition Classes
    Params
  54. def getBatchSize: Int

    Batch size

    Definition Classes
    GenericClassifierParams
  55. def getCategories(): Array[String]
    Definition Classes
    GenericClassifierModel
  56. def getCategoryName(id: Int): String
    Definition Classes
    GenericClassifierModel
  57. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  58. def getClasses: Array[String]

    Proxy to getCategories

  59. def getCustomLabel(label: String): String
  60. def getCustomLabels: Map[String, String]

    Custom relation labels

  61. def getDatasetInfo: String

    Get descriptive information about the dataset being used.

    Definition Classes
    GenericClassifierParams
  62. final def getDefault[T](param: Param[T]): Option[T]
    Definition Classes
    Params
  63. def getDirectionSensitive: Boolean

    Gets the directionSensitive

  64. def getDropout: Float

    Dropout coefficient

    Definition Classes
    GenericClassifierParams
  65. def getEncoder: GenericClassifierDataEncoder
    Definition Classes
    GenericClassifierModel
  66. def getFeatureScaling: String

    Get feature scaling method

    Definition Classes
    GenericClassifierParams
  67. def getFixImbalance: Boolean

    Fix imbalance in training set

    Definition Classes
    GenericClassifierParams
  68. def getInputCols: Array[String]
    Definition Classes
    HasInputAnnotationCols
  69. def getLabelColumn: String

    Column with one label per document

    Definition Classes
    GenericClassifierParams
  70. def getLazyAnnotator: Boolean
    Definition Classes
    CanBeLazy
  71. def getLearningRate: Float

    Learning rate

    Definition Classes
    GenericClassifierParams
  72. def getMaxEpochs: Int

    Maximum number of epochs to train

    Definition Classes
    GenericClassifierParams
  73. def getMaxSyntacticDistance: Int

    Maximal syntactic distance, as threshold (Default: 0)

  74. def getModelFile: String

    Model file name

    Definition Classes
    GenericClassifierParams
  75. def getMultiClass: Boolean

    Gets the model's multi-class prediction mode

    Definition Classes
    GenericClassifierParams
  76. final def getOrDefault[T](param: Param[T]): T
    Definition Classes
    Params
  77. final def getOutputCol: String
    Definition Classes
    HasOutputAnnotationCol
  78. def getOutputLogsPath: String

    Get output logs path

    Definition Classes
    GenericClassifierParams
  79. def getParam(paramName: String): Param[Any]
    Definition Classes
    Params
  80. def getPredictionThreshold: Float

    Minimal activation of the target unit to encode a new relation instance (Default: 0.5f)

  81. def getRelationPairs: Array[String]

    List of dash-separated pairs of named entities ("ENTITY1-ENTITY2", e.g. "Biomarker-RelativeDay"), which will be processed

  82. def getRelationPairsCaseSensitive: Boolean

    Gets the case sensitivity of relation pairs

  83. def getRelationTypePerPair: Map[String, Array[String]]

    Get the lists of entity pairs allowed for a given relation

  84. def getRelationTypePerPairStr: String

    Get a string representation of the lists of entity pairs allowed for a given relation

  85. def getScopeWindow: (Int, Int)

    Get scope window

  86. def getStorageRef: String
    Definition Classes
    HasStorageRef
  87. def getValidationSplit: Float

    The proportion of the training dataset to be validated against the model on each epoch. The value should be between 0.0 and 1.0; by default it is 0.0 (off).

    Definition Classes
    GenericClassifierParams
  88. final def hasDefault[T](param: Param[T]): Boolean
    Definition Classes
    Params
  89. def hasParam(paramName: String): Boolean
    Definition Classes
    Params
  90. def hasParent: Boolean
    Definition Classes
    Model
  91. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  92. val inExceptionMode: Boolean
    Attributes
    protected
    Definition Classes
    HasSafeAnnotate
  93. def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean): Boolean
    Attributes
    protected
    Definition Classes
    Logging
  94. def initializeLogIfNecessary(isInterpreter: Boolean): Unit
    Attributes
    protected
    Definition Classes
    Logging
  95. val inputAnnotatorTypes: Array[AnnotatorType]

    Input annotator types: WORD_EMBEDDINGS, POS, CHUNK, DEPENDENCY

    Definition Classes
    RelationExtractionModel → GenericClassifierModel → HasInputAnnotationCols
  96. final val inputCols: StringArrayParam
    Attributes
    protected
    Definition Classes
    HasInputAnnotationCols
  97. final def isDefined(param: Param[_]): Boolean
    Definition Classes
    Params
  98. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  99. final def isSet(param: Param[_]): Boolean
    Definition Classes
    Params
  100. def isTraceEnabled(): Boolean
    Attributes
    protected
    Definition Classes
    Logging
  101. val labelColumn: Param[String]

    Column with one label per document

    Definition Classes
    GenericClassifierParams
  102. val lazyAnnotator: BooleanParam
    Definition Classes
    CanBeLazy
  103. val learningRate: FloatParam

    Learning rate

    Definition Classes
    GenericClassifierParams
  104. def loadModel(sparkSession: SparkSession, tfModel: TensorflowWrapper, categories: Array[String], encoder: GenericClassifierDataEncoder, nerTags: Array[String]): RelationExtractionModel.this.type
  105. def log: Logger
    Attributes
    protected
    Definition Classes
    Logging
  106. def logDebug(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  107. def logDebug(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  108. def logError(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  109. def logError(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  110. def logInfo(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  111. def logInfo(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  112. def logName: String
    Attributes
    protected
    Definition Classes
    Logging
  113. def logTrace(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  114. def logTrace(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  115. def logWarning(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  116. def logWarning(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  117. var maxSyntacticDistance: IntParam

    Maximal syntactic distance, as threshold (Default: 0)

  118. def model: TensorflowGenericClassifier
    Definition Classes
    GenericClassifierModel
  119. val modelFile: Param[String]

    Location of the model file used for classification

    Definition Classes
    GenericClassifierParams
  120. def msgHelper(schema: StructType): String
    Attributes
    protected
    Definition Classes
    HasInputAnnotationCols
  121. val multiClass: BooleanParam

    If multiClass is set, the model will return all the labels with corresponding scores. By default, multiClass is false.

    Definition Classes
    GenericClassifierParams
  122. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  123. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  124. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  125. def onWrite(path: String, spark: SparkSession): Unit
    Definition Classes
    GenericClassifierModel → ParamsAndFeaturesWritable
  126. val optionalInputAnnotatorTypes: Array[String]
    Definition Classes
    HasInputAnnotationCols
  127. val outputAnnotatorType: String

    Output annotator type: CATEGORY

    Definition Classes
    RelationExtractionModel → GenericClassifierModel → HasOutputAnnotatorType
  128. final val outputCol: Param[String]
    Attributes
    protected
    Definition Classes
    HasOutputAnnotationCol
  129. val outputLogsPath: Param[String]

    Folder path to save training logs. If no path is specified, the logs won't be stored on disk. The path can be a local file path, a distributed file path (HDFS, DBFS), or cloud storage (S3).

    Definition Classes
    GenericClassifierParams
  130. lazy val params: Array[Param[_]]
    Definition Classes
    Params
  131. var parent: Estimator[GenericClassifierModel]
    Definition Classes
    Model
  132. var predictionThreshold: FloatParam

    Minimal activation of the target unit to encode a new relation instance (Default: 0.5f)

  133. var relationPairs: Param[String]

    List of dash-separated pairs of named entities ("ENTITY1-ENTITY2", e.g. "Biomarker-RelativeDay"), which will be processed

  134. var relationPairsCaseSensitive: BooleanParam

    Determines whether relation pairs are case sensitive

  135. def safeAnnotate(annotations: Seq[Annotation]): Seq[Annotation]

    A protected method designed to safely annotate a sequence of Annotation objects by handling exceptions.

    annotations

    A sequence of Annotation.

    returns

    A sequence of Annotation objects after processing, potentially containing error annotations.

    Attributes
    protected
    Definition Classes
    HasSafeAnnotate
  136. def save(path: String): Unit
    Definition Classes
    MLWritable
    Annotations
    @Since( "1.6.0" ) @throws( ... )
  137. def scaleFeatures(features: Array[Array[Float]]): Array[Array[Float]]
    Attributes
    protected
    Definition Classes
    GenericClassifierModel
  138. val scopeWindow: IntArrayParam

    The scope window of the feature generation for relations

  139. def set[T](feature: StructFeature[T], value: T): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    HasFeatures
  140. def set[K, V](feature: MapFeature[K, V], value: Map[K, V]): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    HasFeatures
  141. def set[T](feature: SetFeature[T], value: Set[T]): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    HasFeatures
  142. def set[T](feature: ArrayFeature[T], value: Array[T]): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    HasFeatures
  143. final def set(paramPair: ParamPair[_]): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    Params
  144. final def set(param: String, value: Any): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    Params
  145. final def set[T](param: Param[T], value: T): RelationExtractionModel.this.type
    Definition Classes
    Params
  146. def setBatchSize(batch: Int): RelationExtractionModel.this.type

    Batch size

    Definition Classes
    GenericClassifierParams
  147. def setCategoryNames(categoryNames: Array[String]): RelationExtractionModel.this.type
    Definition Classes
    GenericClassifierModel
  148. def setCustomLabels(labels: HashMap[String, String]): RelationExtractionModel.this.type

  149. def setCustomLabels(labels: Map[String, String]): RelationExtractionModel.this.type

    Set custom relation labels

  150. def setDatasetInfo(value: String): RelationExtractionModel.this.type

    Set descriptive information about the dataset being used.

    Definition Classes
    GenericClassifierParams
  151. def setDefault[T](feature: StructFeature[T], value: () ⇒ T): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    HasFeatures
  152. def setDefault[K, V](feature: MapFeature[K, V], value: () ⇒ Map[K, V]): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    HasFeatures
  153. def setDefault[T](feature: SetFeature[T], value: () ⇒ Set[T]): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    HasFeatures
  154. def setDefault[T](feature: ArrayFeature[T], value: () ⇒ Array[T]): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    HasFeatures
  155. final def setDefault(paramPairs: ParamPair[_]*): RelationExtractionModel.this.type
    Attributes
    protected
    Definition Classes
    Params
  156. final def setDefault[T](param: Param[T], value: T): RelationExtractionModel.this.type
    Attributes
    protected[org.apache.spark.ml]
    Definition Classes
    Params
  157. def setDirectionSensitive(value: Boolean): RelationExtractionModel.this.type

    If true, only relations of the form "ENTITY1-ENTITY2" are considered; if false, both "ENTITY1-ENTITY2" and "ENTITY2-ENTITY1" relations are considered.

  158. def setDoExceptionHandling(value: Boolean): RelationExtractionModel.this.type

    If true, exceptions are handled. If exception-causing data is passed to the model, an error annotation is emitted that contains the exception message, and processing continues with the next record. This comes with a performance penalty.

    Definition Classes
    HandleExceptionParams
  159. def setDropout(dropout: Float): RelationExtractionModel.this.type

    Dropout coefficient

    Definition Classes
    GenericClassifierParams
  160. def setEncoder(encoder: GenericClassifierDataEncoder): RelationExtractionModel.this.type
    Definition Classes
    GenericClassifierModel
  161. def setEpochsNumber(epochs: Int): RelationExtractionModel.this.type

    Maximum number of epochs to train

    Definition Classes
    GenericClassifierParams
  162. def setExtraInputCols(value: Array[String]): RelationExtractionModel.this.type
    Definition Classes
    HasInputAnnotationCols
  163. def setFeatureScaling(featureScaling: String): RelationExtractionModel.this.type

    Set the feature scaling method. Possible values are 'zscore', 'minmax' or empty (no scaling).

    Definition Classes
    GenericClassifierParams
  164. def setFilterByTokenDistance(value: Int): RelationExtractionModel.this.type

    Filtering criterion based on the number of tokens between entities: the model only finds relations that have fewer than the specified number of tokens between them.

  165. def setFixImbalance(fix: Boolean): RelationExtractionModel.this.type

    Fix imbalance of training set

    Definition Classes
    GenericClassifierParams
  166. final def setInputCols(value: String*): RelationExtractionModel.this.type
    Definition Classes
    HasInputAnnotationCols
  167. def setInputCols(value: Array[String]): RelationExtractionModel.this.type
    Definition Classes
    HasInputAnnotationCols
  168. def setLabelColumn(column: String): RelationExtractionModel.this.type

    Column with one label per document

    Definition Classes
    GenericClassifierParams
  169. def setLazyAnnotator(value: Boolean): RelationExtractionModel.this.type
    Definition Classes
    CanBeLazy
  170. def setMaxSyntacticDistance(maxSyntacticDistance: Int): RelationExtractionModel.this.type

    Maximal syntactic distance, as threshold (Default: 0)

  171. def setModelFile(modelFile: String): RelationExtractionModel.this.type

    Set the model file name

    Definition Classes
    GenericClassifierParams
  172. def setMultiClass(value: Boolean): RelationExtractionModel.this.type

    Sets the model in multi-class prediction mode

    Definition Classes
    GenericClassifierParams
  173. final def setOutputCol(value: String): RelationExtractionModel.this.type
    Definition Classes
    HasOutputAnnotationCol
  174. def setOutputLogsPath(outputLogsPath: String): RelationExtractionModel.this.type

    Set the output log path

    Definition Classes
    GenericClassifierParams
  175. def setParent(parent: Estimator[GenericClassifierModel]): GenericClassifierModel
    Definition Classes
    Model
  176. def setPredictionThreshold(predictionThreshold: Float): RelationExtractionModel.this.type

    Minimal activation of the target unit to encode a new relation instance (Default: 0.5f)

  177. def setRelationPairs(relationPairs: Array[String]): RelationExtractionModel.this.type

    List of dash-separated pairs of named entities ("ENTITY1-ENTITY2", e.g. "Biomarker-RelativeDay"), which will be processed

  178. def setRelationPairsCaseSensitive(value: Boolean): RelationExtractionModel.this.type

    Sets the case sensitivity of relation pairs

  179. def setRelationTypePerPair(categories: HashMap[String, List[String]]): RelationExtractionModel.this.type

    Set the lists of entity pairs allowed for a given relation

  180. def setRelationTypePerPair(categories: Map[String, Array[String]]): RelationExtractionModel.this.type

    Set the lists of entity pairs allowed for a given relation
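For example, the expected argument shape (entity names and pairs are illustrative):

```scala
// Each relation label maps to the dash-separated entity pairs it may link.
val relationTypePerPair: Map[String, Array[String]] = Map(
  "Direction" -> Array(
    "direction-internal_organ_or_component",
    "direction-external_body_part_or_region"))
// re_model.setRelationTypePerPair(relationTypePerPair)  // as documented above
```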

  181. def setScopeWindow(window: (Int, Int)): RelationExtractionModel.this.type

    Sets the scope of the window of the feature generation for relations

  182. def setStorageRef(value: String): RelationExtractionModel.this.type
    Definition Classes
    HasStorageRef
  183. def setTensorflowModel(spark: SparkSession, tf: TensorflowWrapper): RelationExtractionModel.this.type
    Definition Classes
    GenericClassifierModel
  184. def setValidationSplit(validationSplit: Float): RelationExtractionModel.this.type

    The proportion of the training dataset to be validated against the model on each epoch. The value should be between 0.0 and 1.0; by default it is 0.0 (off).

    Definition Classes
    GenericClassifierParams
  185. def setlearningRate(lr: Float): RelationExtractionModel.this.type

    Learning Rate

    Definition Classes
    GenericClassifierParams
  186. val storageRef: Param[String]
    Definition Classes
    HasStorageRef
  187. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  188. def toString(): String
    Definition Classes
    Identifiable → AnyRef → Any
  189. final def transform(dataset: Dataset[_]): DataFrame
    Definition Classes
    AnnotatorModel → Transformer
  190. def transform(dataset: Dataset[_], paramMap: ParamMap): DataFrame
    Definition Classes
    Transformer
    Annotations
    @Since( "2.0.0" )
  191. def transform(dataset: Dataset[_], firstParamPair: ParamPair[_], otherParamPairs: ParamPair[_]*): DataFrame
    Definition Classes
    Transformer
    Annotations
    @Since( "2.0.0" ) @varargs()
  192. final def transformSchema(schema: StructType): StructType
    Definition Classes
    RawAnnotator → PipelineStage
  193. def transformSchema(schema: StructType, logging: Boolean): StructType
    Attributes
    protected
    Definition Classes
    PipelineStage
    Annotations
    @DeveloperApi()
  194. val uid: String
    Definition Classes
    RelationExtractionModel → GenericClassifierModel → Identifiable
  195. def validate(schema: StructType): Boolean
    Attributes
    protected
    Definition Classes
    RawAnnotator
  196. def validateStorageRef(dataset: Dataset[_], inputCols: Array[String], annotatorType: String): Unit
    Definition Classes
    HasStorageRef
  197. val validationSplit: FloatParam

    The proportion of training dataset to be used as validation set.

    The model will be validated against this dataset on each Epoch and will not be used for training. The value should be between 0.0 and 1.0.

    Definition Classes
    GenericClassifierParams
  198. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  199. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  200. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  201. def wrapColumnMetadata(col: Column): Column
    Attributes
    protected
    Definition Classes
    RawAnnotator
  202. def write: MLWriter
    Definition Classes
    ParamsAndFeaturesWritable → DefaultParamsWritable → MLWritable
  203. def writeTensorflowHub(path: String, tfPath: String, spark: SparkSession, suffix: String): Unit
    Definition Classes
    WriteTensorflowModel
  204. def writeTensorflowModel(path: String, spark: SparkSession, tensorflow: TensorflowWrapper, suffix: String, filename: String, configProtoBytes: Option[Array[Byte]]): Unit
    Definition Classes
    WriteTensorflowModel
  205. def writeTensorflowModelV2(path: String, spark: SparkSession, tensorflow: TensorflowWrapper, suffix: String, filename: String, configProtoBytes: Option[Array[Byte]], savedSignatures: Option[Map[String, String]]): Unit
    Definition Classes
    WriteTensorflowModel

Inherited from GenericClassifierModel

Inherited from CheckLicense

Inherited from HandleExceptionParams

Inherited from HasSimpleAnnotate[GenericClassifierModel]

Inherited from WriteTensorflowModel

Inherited from HasStorageRef

Inherited from GenericClassifierParams

Inherited from AnnotatorModel[GenericClassifierModel]

Inherited from CanBeLazy

Inherited from RawAnnotator[GenericClassifierModel]

Inherited from HasOutputAnnotationCol

Inherited from HasInputAnnotationCols

Inherited from HasOutputAnnotatorType

Inherited from ParamsAndFeaturesWritable

Inherited from HasFeatures

Inherited from DefaultParamsWritable

Inherited from MLWritable

Inherited from Model[GenericClassifierModel]

Inherited from Transformer

Inherited from PipelineStage

Inherited from Logging

Inherited from Params

Inherited from Serializable

Inherited from Serializable

Inherited from Identifiable

Inherited from AnyRef

Inherited from Any
