Spark NLP release notes 3.11.0

 

3.11.0

Release date: 28-02-2022

Overview

We are glad to announce that Spark OCR 3.11.0 has been released! This release comes with new models, new features, bug fixes, and notebook examples.

New Features

  • Added ImageTextDetectorV2 Python Spark-OCR Transformer for detecting printed and handwritten text using CRAFT architecture with Refiner Net.
  • Added ImageTextRecognizerV2 Python Spark-OCR Transformer for recognizing printed and handwritten text based on Deep Learning Transformer Architecture.
  • Added FormRelationExtractor for detecting relations between key and value entities in forms.
  • Added the capability of fine tuning VisualDocumentNerV2 models for key-value pairs extraction.

New Models

  • ImageTextDetectorV2: this extends the ImageTextDetectorV1 character level text detection model with a refiner net architecture.
  • ImageTextRecognizerV2: Text recognition for printed text based on the Deep Learning Transformer Architecture.

New notebooks

Versions

Last updated