Spark NLP release notes 1.3.0

 

1.3.0

Release date: 22-05-2020

Overview

New functionality for de-identification problem.

Enhancements

  • Renamed TesseractOCR to ImageToText.
  • Simplified installation.
  • Added check license from SPARK_NLP_LICENSE env varibale.

New Features

  • Support storing for binaryFormat. Added support storing Image and PDF files.
  • Support selectable pdf for TextToPdf transformer.
  • Added UpdateTextPosition transformer.

Versions

Last updated