3.11.0
Release date: 28-02-2022
Overview
We are glad to announce that Spark OCR 3.11.0 has been released! This release comes with new models, new features, bug fixes, and notebook examples.
New Features
- Added ImageTextDetectorV2 Python Spark-OCR Transformer for detecting printed and handwritten text using CRAFT architecture with Refiner Net.
- Added ImageTextRecognizerV2 Python Spark-OCR Transformer for recognizing printed and handwritten text based on Deep Learning Transformer Architecture.
- Added FormRelationExtractor for detecting relations between key and value entities in forms.
- Added the capability of fine tuning VisualDocumentNerV2 models for key-value pairs extraction.
New Models
- ImageTextDetectorV2: this extends the ImageTextDetectorV1 character level text detection model with a refiner net architecture.
- ImageTextRecognizerV2: Text recognition for printed text based on the Deep Learning Transformer Architecture.
New notebooks
- SparkOcrImageToTextV2
- ImageTextDetectorV2
- Visual Document NER v2
- SparkOcrFormRecognition
- SparkOCRVisualDocumentNERv2FineTune
- Creating Rest a API with Synapse to extract text from images, SparkOcrRestApi
- Creating Rest a API with Synapse to extract text from PDFs, SparkOcrRestApiPdf
Versions
- Version 3.12.0
- Version 3.11.0
- Version 3.10.0
PREVIOUSRelease Notes