3.11.0
Release date: 28-02-2022
Overview
We are glad to announce that Spark OCR 3.11.0 has been released! This release comes with new models, new features, bug fixes, and notebook examples.
New Features
- Added ImageTextDetectorV2 Python Spark-OCR Transformer for detecting printed and handwritten text using CRAFT architecture with Refiner Net.
- Added ImageTextRecognizerV2 Python Spark-OCR Transformer for recognizing printed and handwritten text based on Deep Learning Transformer Architecture.
- Added FormRelationExtractor for detecting relations between key and value entities in forms.
- Added the capability of fine tuning VisualDocumentNerV2 models for key-value pairs extraction.
New Models
- ImageTextDetectorV2: this extends the ImageTextDetectorV1 character level text detection model with a refiner net architecture.
- ImageTextRecognizerV2: Text recognition for printed text based on the Deep Learning Transformer Architecture.
New notebooks
- SparkOcrImageToTextV2
- ImageTextDetectorV2
- Visual Document NER v2
- SparkOcrFormRecognition
- SparkOCRVisualDocumentNERv2FineTune
- Creating Rest a API with Synapse to extract text from images, SparkOcrRestApi
- Creating Rest a API with Synapse to extract text from PDFs, SparkOcrRestApiPdf
Versions
- 5.4.1
- 5.4.0
- 5.3.2
- 5.3.1
- 5.3.0
- 5.2.0
- 5.1.2
- 5.1.0
- 5.0.2
- 5.0.1
- 5.0.0
- 4.4.4
- 4.4.3
- 4.4.2
- 4.4.1
- 4.4.0
- 4.3.3
- 4.3.0
- 4.2.4
- 4.2.1
- 4.2.0
- 4.1.0
- 4.0.2
- 4.0.0
- 3.14.0
- 3.13.0
- 3.12.0
- 3.11.0
- 3.10.0
- 3.9.1
- 3.9.0
- 3.8.0
- 3.7.0
- 3.6.0
- 3.5.0
- 3.4.0
- 3.3.0
- 3.2.0
- 3.1.0
- 3.0.0
- 1.11.0
- 1.10.0
- 1.9.0
- 1.8.0
- 1.7.0
- 1.6.0
- 1.5.0
- 1.4.0
- 1.3.0
- 1.2.0
- 1.1.2
- 1.1.1
- 1.1.0
- 1.0.0
PREVIOUSRelease Notes