Spark NLP release notes 1.10.0

 

1.10.0

Release date: 20-01-2021

Overview

Support Microsoft Docx documents.

New Features

  • Added DocToText transformer for extract text from DOCX documents.
  • Added DocToTextTable transformer for extract table data from DOCX documents.
  • Added DocToPdf transformer for convert DOCX documents to PDF format.

Bugfixes

  • Fixed issue with loading model data on some cluster configurations

Versions

Last updated