4.3.1
Release date: 17-02-2023
We’re glad to announce that Visual NLP 😎 4.3.1 has been released.
Highlights
- ImageTextCleaner & ImageTableDetector have improved memory consumption.
- New Annotators supported in LightPipelines.
- Table extraction from Digital PDFs pipeline now entirely supported as a LightPipeline.
ImageTextCleaner & ImageTableDetector improved memory consumption
- ImageTextCleaner & ImageTableDetector improved memory consumption: we reduced about 30% the memory consumption for this annotator making it more memory friendly and enabling running on memory constrained environments like Colab.
New Annotators supported in LightPipelines
Now the following annotators are supported in LightPipelines,
- PdfToHocr,
- HocrTokenizer,
- ImageTableDetector,
- ImageScaler,
- HocrToTextTable,
Table extraction from Digital PDFs pipeline now entirely supported as a LightPipeline.
- Our Table Extraction from digital PDFs pipeline now supports running as a LightPipeline, check the updated notebook: SparkOCRPdfToTable.ipynb
This release is compatible with Spark NLP for Healthcare 4.3.0, and Spark NLP 4.3.0.
Previous versions
- 5.5.0
- 5.4.2
- 5.4.1
- 5.4.0
- 5.3.2
- 5.3.1
- 5.3.0
- 5.2.0
- 5.1.2
- 5.1.0
- 5.0.2
- 5.0.1
- 5.0.0
- 4.4.4
- 4.4.3
- 4.4.2
- 4.4.1
- 4.4.0
- 4.3.3
- 4.3.0
- 4.2.4
- 4.2.1
- 4.2.0
- 4.1.0
- 4.0.2
- 4.0.0
- 3.14.0
- 3.13.0
- 3.12.0
- 3.11.0
- 3.10.0
- 3.9.1
- 3.9.0
- 3.8.0
- 3.7.0
- 3.6.0
- 3.5.0
- 3.4.0
- 3.3.0
- 3.2.0
- 3.1.0
- 3.0.0
- 1.11.0
- 1.10.0
- 1.9.0
- 1.8.0
- 1.7.0
- 1.6.0
- 1.5.0
- 1.4.0
- 1.3.0
- 1.2.0
- 1.1.2
- 1.1.1
- 1.1.0
- 1.0.0
PREVIOUSVersion Compatibility