ChunkMergeModel

Companion object ChunkMergeModel

class ChunkMergeModel extends AnnotatorModel[ChunkMergeModel] with CheckLicense with HasMultipleInputAnnotationCols with HasSimpleAnnotate[ChunkMergeModel] with MergeCommonParams with MergeFeatureParams with MergePrioritizationParams with HandleExceptionParams with HasSafeAnnotate[ChunkMergeModel] with HasFeatures with FilteringParams with ResetSentenceIndicesParam

Merges entities coming from different CHUNK annotations

Linear Supertypes

ResetSentenceIndicesParam, FilteringParams, HasSafeAnnotate[ChunkMergeModel], HandleExceptionParams, MergePrioritizationParams, MergeFeatureParams, MergeCommonParams, HasSimpleAnnotate[ChunkMergeModel], HasMultipleInputAnnotationCols, CheckLicense, AnnotatorModel[ChunkMergeModel], CanBeLazy, RawAnnotator[ChunkMergeModel], HasOutputAnnotationCol, HasInputAnnotationCols, HasOutputAnnotatorType, ParamsAndFeaturesWritable, HasFeatures, DefaultParamsWritable, MLWritable, Model[ChunkMergeModel], Transformer, PipelineStage, Logging, Params, Serializable, Serializable, Identifiable, AnyRef, Any

Instance Constructors

new ChunkMergeModel()
new ChunkMergeModel(uid: String)
uid
a unique identifier for the instantiated AnnotatorModel

Type Members

type AnnotationContent = Seq[Row]

Attributes
protected
Definition Classes
AnnotatorModel
type AnnotatorType = String

Definition Classes
HasOutputAnnotatorType

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def $[T](param: Param[T]): T

Attributes
protected
Definition Classes
Params
def $$[T](feature: StructFeature[T]): T

Attributes
protected
Definition Classes
HasFeatures
def $$[K, V](feature: MapFeature[K, V]): Map[K, V]

Attributes
protected
Definition Classes
HasFeatures
def $$[T](feature: SetFeature[T]): Set[T]

Attributes
protected
Definition Classes
HasFeatures
def $$[T](feature: ArrayFeature[T]): Array[T]

Attributes
protected
Definition Classes
HasFeatures
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def _transform(dataset: Dataset[_], recursivePipeline: Option[PipelineModel]): DataFrame

Attributes
protected
Definition Classes
AnnotatorModel
def afterAnnotate(dataset: DataFrame): DataFrame

Attributes
protected
Definition Classes
AnnotatorModel
def annotate(annotations: Seq[Annotation]): Seq[Annotation]
Merges columns of chunk Annotations while considering false positives and replacements.
annotations
a Sequence of chunks to merge
returns
a Sequence of Merged CHUNK Annotations

Definition Classes
ChunkMergeModel → HasSimpleAnnotate
final def asInstanceOf[T0]: T0

Definition Classes
Any
def beforeAnnotate(dataset: Dataset[_]): Dataset[_]

Definition Classes
ChunkMergeModel → AnnotatorModel
val blackList: StringArrayParam
If defined, list of entities to ignore. The rest will be processed

Definition Classes
FilteringParams
val caseSensitive: BooleanParam
Determines whether the definitions of the white-listed and black-listed entities are case sensitive. If filterValue is 'entity', caseSensitive is always false. The default value is true, except for com.johnsnowlabs.nlp.annotators.chunker.AssertionFilterer.

Definition Classes
FilteringParams
final def checkSchema(schema: StructType, inputAnnotatorType: String): Boolean

Attributes
protected
Definition Classes
HasInputAnnotationCols
def checkValidEnvironment(spark: Option[SparkSession], scopes: Seq[String]): Unit

Definition Classes
CheckLicense
def checkValidScope(scope: String): Unit

Definition Classes
CheckLicense
def checkValidScopeAndEnvironment(scope: String, spark: Option[SparkSession], checkLp: Boolean): Unit

Definition Classes
CheckLicense
def checkValidScopesAndEnvironment(scopes: Seq[String], spark: Option[SparkSession], checkLp: Boolean): Unit

Definition Classes
CheckLicense
val chunkPrecedence: Param[String]
When the ChunkPrecedence ordering feature is used, this param contains the comma-separated metadata fields that drive prioritization of overlapping annotations. When used by itself (empty chunkPrecedenceValuePrioritization), annotations are prioritized by the number of metadata fields present. When used together with the chunkPrecedenceValuePrioritization param, annotations are prioritized by the order of its values.

Definition Classes
MergePrioritizationParams
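The "number of metadata fields present" rule can be pictured with a minimal sketch. It does not use the library: PrecedenceSketch, score, and the field names in the usage note are hypothetical, and the assumption that a higher count wins is not confirmed by this page.

```scala
// Illustrative sketch only; PrecedenceSketch is a hypothetical name,
// not part of the library.
object PrecedenceSketch {
  // Counts how many of the comma-separated fields listed in
  // `chunkPrecedence` are present in an annotation's metadata.
  def score(metadata: Map[String, String], chunkPrecedence: String): Int =
    chunkPrecedence
      .split(",")
      .map(_.trim)
      .count(field => field.nonEmpty && metadata.contains(field))
}
```

Under these assumptions, an annotation whose metadata contains one of the two listed fields, for example `PrecedenceSketch.score(Map("entity" -> "DRUG", "field" -> "x"), "field, other")`, gets a score of 1.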
val chunkPrecedenceValuePrioritization: StringArrayParam
When the ChunkPrecedence ordering feature is used, this param contains an Array of comma-separated strings representing the desired order of prioritization for the values of the metadata fields included in chunkPrecedence.

Definition Classes
MergePrioritizationParams
final def clear(param: Param[_]): ChunkMergeModel.this.type

Definition Classes
Params
def clone(): AnyRef

Attributes
protected[lang]
Definition Classes
AnyRef
Annotations
@throws( ... ) @native()
def copy(extra: ParamMap): ChunkMergeModel

Definition Classes
RawAnnotator → Model → Transformer → PipelineStage → Params
def copyValues[T <: Params](to: T, extra: ParamMap): T

Attributes
protected
Definition Classes
Params
val criteria: Param[String]
Defines how black-listed and white-listed values are compared with the result of the Annotation. Possible values: 'isin', 'regex'. Default: 'isin'.
- isin : Filter by the chunk
- regex : Filter by using a regex
Definition Classes
FilteringParams
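The interaction between whiteList, blackList, criteria and caseSensitive can be sketched in plain Scala. This is not the library's implementation: ChunkFilterSketch and keep are hypothetical names, and the exact matching semantics (for instance, that a regex matches when found anywhere in the value, or that the black list wins over the white list) are assumptions.

```scala
// Illustrative sketch only; names and matching semantics are assumptions.
object ChunkFilterSketch {

  // Returns true when `value` matches any pattern under the given criteria.
  private def matches(value: String, patterns: Seq[String],
                      criteria: String, caseSensitive: Boolean): Boolean =
    criteria match {
      case "isin" =>
        if (caseSensitive) patterns.contains(value)
        else patterns.map(_.toLowerCase).contains(value.toLowerCase)
      case "regex" =>
        // Assumption: a pattern matches if it is found anywhere in the value.
        val flags = if (caseSensitive) "" else "(?i)"
        patterns.exists(p => (flags + p).r.findFirstIn(value).isDefined)
      case other =>
        throw new IllegalArgumentException(s"Unsupported criteria: $other")
    }

  // Keeps a value if it passes the white list (when one is defined)
  // and is not caught by the black list.
  def keep(value: String, whiteList: Seq[String], blackList: Seq[String],
           criteria: String = "isin", caseSensitive: Boolean = true): Boolean = {
    val allowed = whiteList.isEmpty || matches(value, whiteList, criteria, caseSensitive)
    val denied = blackList.nonEmpty && matches(value, blackList, criteria, caseSensitive)
    allowed && !denied
  }
}
```

Which string is passed as `value` depends on filterValue: the chunk text for 'result', or the NER label for 'entity'.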
val defaultConfidence: FloatParam
When the ChunkConfidence ordering feature is included and a given annotation has no confidence value, the value of this param is used.

Definition Classes
MergePrioritizationParams
final def defaultCopy[T <: Params](extra: ParamMap): T

Attributes
protected
Definition Classes
Params
def dfAnnotate: UserDefinedFunction

Definition Classes
HasSimpleAnnotate
val doExceptionHandling: BooleanParam
If true, exceptions are handled. If exception-causing data is passed to the model, an error annotation containing the exception message is emitted and processing continues with the next one. This comes with a performance penalty.

Definition Classes
HandleExceptionParams
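The behaviour described for doExceptionHandling can be pictured as a wrapper around the annotate function. The following is a minimal sketch under stated assumptions: Ann is a hypothetical stand-in for the library's Annotation class, and the shape of the emitted error annotation (type "error", message under an "exception" metadata key) is invented for illustration.

```scala
import scala.util.{Failure, Success, Try}

// Illustrative sketch only; not the library's HasSafeAnnotate implementation.
object SafeAnnotateSketch {
  // Hypothetical stand-in for an annotation.
  final case class Ann(annotatorType: String, result: String, metadata: Map[String, String])

  // Wraps `annotate` so that a non-fatal failure yields a single error
  // annotation carrying the exception message instead of aborting.
  def safeAnnotate(annotate: Seq[Ann] => Seq[Ann])(input: Seq[Ann]): Seq[Ann] =
    Try(annotate(input)) match {
      case Success(out) => out
      case Failure(e) =>
        Seq(Ann("error", "", Map("exception" -> String.valueOf(e.getMessage))))
    }
}
```

Wrapping every call in a Try is one plausible reason for the performance penalty mentioned above.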
val entitiesConfidence: MapFeature[String, Float]
Pairs (entity, confidenceThreshold). Chunks whose entity has a confidence lower than its confidence threshold are filtered out.

Definition Classes
FilteringParams
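As a rough illustration of per-entity confidence thresholds, the sketch below filters a list of chunks against a threshold map. Chunk and ConfidenceFilterSketch are hypothetical names, and keeping chunks whose entity has no configured threshold is an assumption.

```scala
// Illustrative sketch only; not the library's implementation.
object ConfidenceFilterSketch {
  // Hypothetical stand-in for a CHUNK annotation.
  final case class Chunk(result: String, entity: String, confidence: Float)

  // Drops chunks whose confidence is below the threshold configured for
  // their entity. Entities without a threshold are kept (an assumption).
  def filter(chunks: Seq[Chunk], thresholds: Map[String, Float]): Seq[Chunk] =
    chunks.filter(c => thresholds.get(c.entity).forall(t => c.confidence >= t))
}
```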
lazy val entitiesConfidenceMap: Map[String, Float]

Definition Classes
FilteringParams
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def explainParam(param: Param[_]): String

Definition Classes
Params
def explainParams(): String

Definition Classes
Params
def extraValidate(structType: StructType): Boolean

Attributes
protected
Definition Classes
RawAnnotator
def extraValidateMsg: String

Attributes
protected
Definition Classes
RawAnnotator
final def extractParamMap(): ParamMap

Definition Classes
Params
final def extractParamMap(extra: ParamMap): ParamMap

Definition Classes
Params
val falsePositives: ArrayFeature[(String, String, String)]
Array of entity pairs that are false positives. If a third value is defined, the pair will be replaced by that value.

Definition Classes
MergeFeatureParams
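One way to read the falsePositives triples is sketched below. The tuple layout (chunk text, entity label, replacement label) and the rule that an empty third value discards the chunk are assumptions, not confirmed by this page; Chunk and FalsePositiveSketch are hypothetical names.

```scala
// Illustrative sketch only; tuple layout and semantics are assumptions.
object FalsePositiveSketch {
  // Hypothetical stand-in for a CHUNK annotation.
  final case class Chunk(result: String, entity: String)

  // Returns None when the chunk is a false positive without a replacement,
  // a relabelled chunk when a replacement is given, or the chunk unchanged.
  def resolve(c: Chunk, falsePositives: Array[(String, String, String)]): Option[Chunk] =
    falsePositives.find { case (text, entity, _) => text == c.result && entity == c.entity } match {
      case Some((_, _, replacement)) if replacement.nonEmpty => Some(c.copy(entity = replacement))
      case Some(_) => None
      case None => Some(c)
    }
}
```

Returning an Option mirrors the return type of resolveMergeFilter listed on this page, though that method's actual logic is not shown here.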
lazy val falsePositivesArray: Array[(String, String, String)]
val features: ArrayBuffer[Feature[_, _, _]]

Definition Classes
HasFeatures
val filterValue: Param[String]
Possible values are 'result' and 'entity'. If the value is 'entity', NER chunks are filtered by their NER label. If the value is 'result', chunks are filtered by the result of the Annotation.

Definition Classes
FilteringParams
def finalize(): Unit

Attributes
protected[lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
def get[T](feature: StructFeature[T]): Option[T]

Attributes
protected
Definition Classes
HasFeatures
def get[K, V](feature: MapFeature[K, V]): Option[Map[K, V]]

Attributes
protected
Definition Classes
HasFeatures
def get[T](feature: SetFeature[T]): Option[Set[T]]

Attributes
protected
Definition Classes
HasFeatures
def get[T](feature: ArrayFeature[T]): Option[Array[T]]

Attributes
protected
Definition Classes
HasFeatures
final def get[T](param: Param[T]): Option[T]

Definition Classes
Params
def getBlackList: Array[String]
Gets blackList parameter

Definition Classes
FilteringParams
def getCaseSensitive: Boolean
Gets caseSensitive parameter

Definition Classes
FilteringParams
def getChunkPrecedence: String

Definition Classes
MergePrioritizationParams
def getChunkPrecedenceValuePrioritization: Array[String]

Definition Classes
MergePrioritizationParams
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
Annotations
@native()
final def getDefault[T](param: Param[T]): Option[T]

Definition Classes
Params
def getDefaultConfidence: Float

Definition Classes
MergePrioritizationParams
def getFalsePositives: Array[(String, String, String)]
Array of entity pairs that are false positives. If a third value is defined, the pair will be replaced by that value.

Definition Classes
MergeFeatureParams
def getInputCols: Array[String]

Definition Classes
HasInputAnnotationCols
def getLazyAnnotator: Boolean

Definition Classes
CanBeLazy
def getMergeOverlapping: Boolean

Definition Classes
MergeCommonParams
final def getOrDefault[T](param: Param[T]): T

Definition Classes
Params
def getOrderingFeatures: Array[String]

Definition Classes
MergePrioritizationParams
final def getOutputCol: String

Definition Classes
HasOutputAnnotationCol
def getParam(paramName: String): Param[Any]

Definition Classes
Params
def getReplaceDict: Map[String, String]
Mapping of replacement strings for entities.

Definition Classes
MergeFeatureParams
def getReplaceDictStr: String
Get the replacement strings for entities

Definition Classes
MergeFeatureParams
def getResetSentenceIndices: Boolean
Gets resetSentenceIndices parameter

Definition Classes
ResetSentenceIndicesParam
def getSelectionStrategy: String

Definition Classes
MergePrioritizationParams
def getWhiteList: Array[String]
Gets whiteList parameter

Definition Classes
FilteringParams
final def hasDefault[T](param: Param[T]): Boolean

Definition Classes
Params
def hasParam(paramName: String): Boolean

Definition Classes
Params
def hasParent: Boolean

Definition Classes
Model
def hashCode(): Int

Definition Classes
AnyRef → Any
Annotations
@native()
val inExceptionMode: Boolean

Attributes
protected
Definition Classes
HasSafeAnnotate
def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean): Boolean

Attributes
protected
Definition Classes
Logging
def initializeLogIfNecessary(isInterpreter: Boolean): Unit

Attributes
protected
Definition Classes
Logging
val inputAnnotatorType: String
Input annotator types: CHUNK, CHUNK

Definition Classes
ChunkMergeModel → HasMultipleInputAnnotationCols
lazy val inputAnnotatorTypes: Array[String]

Definition Classes
HasMultipleInputAnnotationCols → HasInputAnnotationCols
final val inputCols: StringArrayParam

Attributes
protected
Definition Classes
HasInputAnnotationCols
final def isDefined(param: Param[_]): Boolean

Definition Classes
Params
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
final def isSet(param: Param[_]): Boolean

Definition Classes
Params
def isTraceEnabled(): Boolean

Attributes
protected
Definition Classes
Logging
val lazyAnnotator: BooleanParam

Definition Classes
CanBeLazy
def log: Logger

Attributes
protected
Definition Classes
Logging
def logDebug(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logDebug(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logError(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logError(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logInfo(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logInfo(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logName: String

Attributes
protected
Definition Classes
Logging
def logTrace(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logTrace(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logWarning(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logWarning(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
val mergeOverlapping: BooleanParam
Whether to merge overlapping matched chunks. Defaults to true.

Definition Classes
MergeCommonParams
def msgHelper(schema: StructType): String

Attributes
protected
Definition Classes
HasInputAnnotationCols
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
Annotations
@native()
final def notifyAll(): Unit

Definition Classes
AnyRef
Annotations
@native()
def onWrite(path: String, spark: SparkSession): Unit

Attributes
protected
Definition Classes
ParamsAndFeaturesWritable
val optionalInputAnnotatorTypes: Array[String]

Definition Classes
HasInputAnnotationCols
val orderingFeatures: StringArrayParam
Array of strings specifying the ordering features to use for overlapping entities. Possible values are ChunkBegin, ChunkLength, ChunkPrecedence, ChunkConfidence.

Definition Classes
MergePrioritizationParams
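To make the idea of ordering features concrete, here is a minimal sketch that resolves overlaps using only two of them, ChunkBegin and ChunkLength, followed by a sequential pass. The sort directions (earlier begin first, longer chunk first) and the inclusive end offset are assumptions; Chunk and OverlapSketch are hypothetical names and the library's prioritize method may behave differently.

```scala
// Illustrative sketch only; not the library's prioritization logic.
object OverlapSketch {
  // Hypothetical stand-in for a CHUNK annotation with inclusive offsets.
  final case class Chunk(begin: Int, end: Int, result: String, entity: String) {
    def length: Int = end - begin + 1
    def overlaps(o: Chunk): Boolean = begin <= o.end && o.begin <= end
  }

  // Orders chunks by begin (ascending), then length (descending), and keeps
  // each chunk only if it does not overlap one that was already kept.
  def select(chunks: Seq[Chunk]): Seq[Chunk] =
    chunks
      .sortBy(c => (c.begin, -c.length))
      .foldLeft(Vector.empty[Chunk]) { (kept, c) =>
        if (kept.exists(_.overlaps(c))) kept else kept :+ c
      }
}
```

With this ordering, a longer chunk that starts at the same offset as a shorter one wins, and non-overlapping chunks pass through untouched.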
val outputAnnotatorType: AnnotatorType
Output annotator type: CHUNK

Definition Classes
ChunkMergeModel → HasOutputAnnotatorType
final val outputCol: Param[String]

Attributes
protected
Definition Classes
HasOutputAnnotationCol
lazy val params: Array[Param[_]]

Definition Classes
Params
var parent: Estimator[ChunkMergeModel]

Definition Classes
Model
def prioritize(annotations: Seq[Annotation]): Seq[Annotation]

Attributes
protected
Definition Classes
MergePrioritizationParams
val regex: StringArrayParam
If defined, list of regex to process the chunks (Default: Array())

Definition Classes
FilteringParams
val replaceDict: MapFeature[String, String]
Mapping of replacement strings for entities.

Definition Classes
MergeFeatureParams
lazy val replaceDictMap: Map[String, String]
def resetSentenceIndices(metadata: Map[String, String]): Map[String, String]
Reset sentence index in metadata by adding "sentence" -> "0"

Attributes
protected
Definition Classes
ResetSentenceIndicesParam
val resetSentenceIndices: BooleanParam
Whether to reset sentence indices to treat the entire output as if it originates from a single document.
When set to true, the metadata of each entity will be updated by assigning the sentence key a value of 0, effectively treating the entire output as if it comes from a single document, regardless of the original sentence boundaries. Default: False.

Definition Classes
ResetSentenceIndicesParam
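The effect on a single chunk's metadata can be shown in a few lines. This sketch follows the documented description (adding "sentence" -> "0"); ResetSentenceSketch is a hypothetical wrapper name, and the sample metadata keys are illustrative.

```scala
// Illustrative sketch only; mirrors the documented behaviour on plain maps.
object ResetSentenceSketch {
  // Adds or overwrites the "sentence" key so that the chunk
  // appears to belong to sentence 0.
  def resetSentenceIndices(metadata: Map[String, String]): Map[String, String] =
    metadata + ("sentence" -> "0")
}
```

All other metadata keys are left unchanged.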
def resolveFilter(chunkerAnnotations: Seq[Annotation]): Seq[Annotation]

Attributes
protected
Definition Classes
FilteringParams
def resolveMergeFilter(a: Annotation, entityValue: String, falsePositivesArray: Array[(String, String, String)], replaceDictMap: Map[String, String] = Map.empty): Option[Annotation]

Attributes
protected
Definition Classes
FilteringParams
def resolveWhiteListBlackListFilter(annotations: Seq[Annotation]): Seq[Annotation]

Attributes
protected
Definition Classes
FilteringParams
def safeAnnotate(annotations: Seq[Annotation]): Seq[Annotation]
A protected method designed to safely annotate a sequence of Annotation objects by handling exceptions.
annotations
A sequence of Annotation.
returns
A sequence of Annotation objects after processing, potentially containing error annotations.

Attributes
protected
Definition Classes
HasSafeAnnotate
def save(path: String): Unit

Definition Classes
MLWritable
Annotations
@Since( "1.6.0" ) @throws( ... )
val selectionStrategy: Param[String]
Whether to select annotations sequentially based on annotation order (Sequential) or using any other available strategy; currently only Sequential and DiverseLonger are available.

Definition Classes
MergePrioritizationParams
def set[T](feature: StructFeature[T], value: T): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
HasFeatures
def set[K, V](feature: MapFeature[K, V], value: Map[K, V]): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
HasFeatures
def set[T](feature: SetFeature[T], value: Set[T]): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
HasFeatures
def set[T](feature: ArrayFeature[T], value: Array[T]): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
HasFeatures
final def set(paramPair: ParamPair[_]): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
Params
final def set(param: String, value: Any): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
Params
final def set[T](param: Param[T], value: T): ChunkMergeModel.this.type

Definition Classes
Params
def setAllowList(list: String*): ChunkMergeModel.this.type

Definition Classes
FilteringParams
def setAllowList(list: Array[String]): ChunkMergeModel.this.type

Definition Classes
FilteringParams
def setBlackList(list: String*): ChunkMergeModel.this.type

Definition Classes
FilteringParams
def setBlackList(list: Array[String]): ChunkMergeModel.this.type
If defined, list of entities to ignore. The rest will be processed.

Definition Classes
FilteringParams
def setCaseSensitive(value: Boolean): ChunkMergeModel.this.type
Determines whether the definitions of the white-listed and black-listed entities are case sensitive. If filterValue is 'entity', caseSensitive is always false. The default value is true, except for com.johnsnowlabs.nlp.annotators.chunker.AssertionFilterer.

Definition Classes
FilteringParams
def setChunkPrecedence(m: String): ChunkMergeModel.this.type

Definition Classes
MergePrioritizationParams
def setChunkPrecedenceValuePrioritization(m: Array[String]): ChunkMergeModel.this.type

Definition Classes
MergePrioritizationParams
def setCriteria(s: String): ChunkMergeModel.this.type
Sets the criteria for how black-listed and white-listed values are compared with the result of the Annotation. Possible values: 'isin', 'regex'. Default: 'isin'.
- 'isin' : Filter by the chunk.
- 'regex' : Filter by using a regex.
- 'assertion' : Available in com.johnsnowlabs.nlp.annotators.chunker.AssertionFilterer, where it is the default value.
Definition Classes
FilteringParams
def setDefault[T](feature: StructFeature[T], value: () ⇒ T): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
HasFeatures
def setDefault[K, V](feature: MapFeature[K, V], value: () ⇒ Map[K, V]): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
HasFeatures
def setDefault[T](feature: SetFeature[T], value: () ⇒ Set[T]): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
HasFeatures
def setDefault[T](feature: ArrayFeature[T], value: () ⇒ Array[T]): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
HasFeatures
final def setDefault(paramPairs: ParamPair[_]*): ChunkMergeModel.this.type

Attributes
protected
Definition Classes
Params
final def setDefault[T](param: Param[T], value: T): ChunkMergeModel.this.type

Attributes
protected[org.apache.spark.ml]
Definition Classes
Params
def setDefaultConfidence(m: Float): ChunkMergeModel.this.type

Definition Classes
MergePrioritizationParams
def setDenyList(list: String*): ChunkMergeModel.this.type

Definition Classes
FilteringParams
def setDenyList(list: Array[String]): ChunkMergeModel.this.type

Definition Classes
FilteringParams
def setDoExceptionHandling(value: Boolean): ChunkMergeModel.this.type
If true, exceptions are handled. If exception-causing data is passed to the model, an error annotation containing the exception message is emitted and processing continues with the next one. This comes with a performance penalty.

Definition Classes
HandleExceptionParams
def setEntitiesConfidence(value: Map[String, Float]): ChunkMergeModel.this.type

Definition Classes
FilteringParams
def setFalsePositives(value: List[List[String]]): ChunkMergeModel.this.type
Array of entity pairs that are false positives. If a third value is defined, the pair will be replaced by that value.

Definition Classes
MergeFeatureParams
def setFalsePositives(value: Array[(String, String, String)]): ChunkMergeModel.this.type
Array of entity pairs that are false positives. If a third value is defined, the pair will be replaced by that value.

Definition Classes
MergeFeatureParams
def setFilterEntity(v: String): ChunkMergeModel.this.type
Possible values are 'result' and 'entity'. If the value is 'entity', NER chunks are filtered by their NER label. If the value is 'result', chunks are filtered by the result of the Annotation.

Definition Classes
FilteringParams
def setInputCols(value: Array[String]): ChunkMergeModel.this.type

Definition Classes
HasMultipleInputAnnotationCols → HasInputAnnotationCols
final def setInputCols(value: String*): ChunkMergeModel.this.type

Definition Classes
HasInputAnnotationCols
def setLazyAnnotator(value: Boolean): ChunkMergeModel.this.type

Definition Classes
CanBeLazy
def setMergeOverlapping(v: Boolean): ChunkMergeModel.this.type
Whether to merge overlapping matched chunks.

Definition Classes
MergeCommonParams
def setOrderingFeatures(m: Array[String]): ChunkMergeModel.this.type

Definition Classes
MergePrioritizationParams
final def setOutputCol(value: String): ChunkMergeModel.this.type

Definition Classes
HasOutputAnnotationCol
def setParent(parent: Estimator[ChunkMergeModel]): ChunkMergeModel

Definition Classes
Model
def setRegex(list: String*): ChunkMergeModel.this.type
Sets the list of regexes to process the chunks.

Definition Classes
FilteringParams
def setReplaceDict(replaceDict: HashMap[String, String]): ChunkMergeModel.this.type
Sets the mapping of replacement strings for entities.

Definition Classes
MergeFeatureParams
def setReplaceDict(value: Map[String, String]): ChunkMergeModel.this.type
Mapping of replacement strings for entities.

Definition Classes
MergeFeatureParams
def setResetSentenceIndices(value: Boolean): ChunkMergeModel.this.type
Set whether to reset sentence indices to treat the entire output as if it originates from a single document.
When set to true, the metadata of each entity will be updated by assigning the sentence key a value of 0, effectively treating the entire output as if it comes from a single document, regardless of the original sentence boundaries. Default: False.

Definition Classes
ResetSentenceIndicesParam
def setSelectionStrategy(m: String): ChunkMergeModel.this.type

Definition Classes
MergePrioritizationParams
def setWhiteList(list: String*): ChunkMergeModel.this.type

Definition Classes
FilteringParams
def setWhiteList(list: Array[String]): ChunkMergeModel.this.type
Sets the list of entities to process. The rest will be ignored. Do not include IOB prefix on labels.

Definition Classes
FilteringParams
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def toString(): String

Definition Classes
Identifiable → AnyRef → Any
final def transform(dataset: Dataset[_]): DataFrame

Definition Classes
AnnotatorModel → Transformer
def transform(dataset: Dataset[_], paramMap: ParamMap): DataFrame

Definition Classes
Transformer
Annotations
@Since( "2.0.0" )
def transform(dataset: Dataset[_], firstParamPair: ParamPair[_], otherParamPairs: ParamPair[_]*): DataFrame

Definition Classes
Transformer
Annotations
@Since( "2.0.0" ) @varargs()
final def transformSchema(schema: StructType): StructType

Definition Classes
RawAnnotator → PipelineStage
def transformSchema(schema: StructType, logging: Boolean): StructType

Attributes
protected
Definition Classes
PipelineStage
Annotations
@DeveloperApi()
val uid: String

Definition Classes
ChunkMergeModel → Identifiable
def validate(schema: StructType): Boolean

Attributes
protected
Definition Classes
RawAnnotator
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... ) @native()
val whiteList: StringArrayParam
If defined, list of entities to process. The rest will be ignored. Do not include the IOB prefix on labels (Default: Array()).

Definition Classes
FilteringParams
def wrapColumnMetadata(col: Column): Column

Attributes
protected
Definition Classes
RawAnnotator
def write: MLWriter

Definition Classes
ParamsAndFeaturesWritable → DefaultParamsWritable → MLWritable
