General model for table detection

Description

General model for table detection inspired by https://arxiv.org/abs/2004.12629

Predicted Entities

How to use

This model is used by ImageTableDetector.

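# Python
from pyspark.ml import PipelineModel
from sparkocr.transformers import *
from sparkocr.enums import *

# Convert the binary file content into an image column
binary_to_image = BinaryToImage()
binary_to_image.setImageType(ImageType.TYPE_3BYTE_BGR)

# Load the pretrained table detector
table_detector = ImageTableDetector \
    .pretrained("general_model_table_detection_v2", "en", "clinical/ocr") \
    .setInputCol("image") \
    .setOutputCol("table_regions")

pipeline = PipelineModel(stages=[
    binary_to_image,
    table_detector
])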

// Scala
// imagePath is assumed to be defined elsewhere and to point at the input images
val imgDf = spark.read.format("binaryFile").load(imagePath)

// Convert the binary file content into an image column
val bin2imTransformer = new BinaryToImage()
bin2imTransformer.setImageType(ImageType.TYPE_3BYTE_BGR)
val dataFrame = bin2imTransformer.transform(imgDf)

// Load the pretrained table detector
val tableDetector = ImageTableDetector
  .pretrained("general_model_table_detection_v2", "en", "clinical/ocr")
  .setInputCol("image")
  .setOutputCol("table_regions")
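
Once the detector has run, the detected regions usually need to be filtered and turned into bounding boxes before cropping or further processing. The following is a minimal sketch in plain Python, not part of the library: the helper `regions_to_boxes` is hypothetical, and the field names `x`, `y`, `width`, `height`, and `score` are assumptions about the region records, so check them against the actual schema of the detector's output column.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]


def regions_to_boxes(regions: List[Dict[str, float]], min_score: float = 0.5) -> List[Box]:
    """Keep regions scoring at least min_score; return (left, top, right, bottom) boxes."""
    boxes = []
    for region in regions:
        # Assumed field names; verify them against the real output schema.
        if region.get("score", 0.0) < min_score:
            continue
        left, top = region["x"], region["y"]
        boxes.append((left, top, left + region["width"], top + region["height"]))
    return boxes


# Hypothetical records, e.g. collected from the detector's output column.
example = [
    {"x": 10.0, "y": 20.0, "width": 300.0, "height": 150.0, "score": 0.97},
    {"x": 5.0, "y": 400.0, "width": 80.0, "height": 40.0, "score": 0.21},
]
print(regions_to_boxes(example))  # [(10.0, 20.0, 310.0, 170.0)]
```

The 0.5 threshold is an arbitrary default chosen for illustration; a suitable value depends on the documents being processed.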

Model Information

Model Name: ocr_table_detection_general_model
Type: ocr
Compatibility: Visual NLP 3.0.0+
License: Licensed
Edition: Official
Output Labels: [table regions]
Language: en