Spark NLP release notes 3.2.0

 

3.2.0

Release date: 28-05-2021

Overview

Multi-modal visual document understanding, built on the LayoutLM architecture. It achieves new state-of-the-art accuracy in several downstream tasks, including form understanding and receipt understanding.

New Features

  • VisualDocumentNER is a DL model for NER problem using text and layout data. Currently available pre-trained model on the SROIE dataset.

Enhancements

  • Added support SPARK_OCR_LICENSE env key for read license.
  • Update dependencies and sync Spark versions with Spark NLP.

Bugfixes

  • Fixed an issue that some ImageReaderSpi plugins are unavailable in the fat jar.

New notebooks

Versions

Last updated