Spark NLP release notes 1.2.0

 

1.2.0

Release date: 08-04-2020

Overview

Improved support for Databricks and for processing selectable PDFs.

Enhancements

  • Adapted Spark OCR to run on Databricks.
  • Added rewriting of positions in ImageToText when it is run together with PdfToText.
  • Added ‘positionsCol’ param to ImageToText.
  • Improved support for Spark NLP. Changed the start function.

New Features

  • Added a showImage implicit to DataFrame for displaying images in Scala Databricks notebooks.
  • Added a display_images function for displaying images in Python Databricks notebooks.
  • Added propagation of the selectable PDF file in TextToPdf. Added ‘inputContent’ param to TextToPdf.
