Spark NLP release notes 3.2.0

3.2.0

Release date: 28-05-2021

Overview

Multi-modal visual document understanding, built on the LayoutLM architecture. It achieves new state-of-the-art accuracy in several downstream tasks, including form understanding and receipt understanding.

New Features

VisualDocumentNER is a DL model for NER problem using text and layout data. Currently available pre-trained model on the SROIE dataset.

Enhancements

Added support SPARK_OCR_LICENSE env key for read license.
Update dependencies and sync Spark versions with Spark NLP.

Bugfixes

Fixed an issue that some ImageReaderSpi plugins are unavailable in the fat jar.

New notebooks

Visual Document NER

Versions

Version
Version
Version

PREVIOUSRelease Notes