5.0.1
Release date: 21-09-2023
We are glad to announce that Visual NLP 5.0.1 has been released! 🚀🚀🚀 New features, new models, bug fixes, and more! 📢📢📢
🚨 New Features
- New Dit-based Text Detection Model: continuing our commitment to strengthen Text Extraction and De-identification pipelines, we are delivering a new text detection model trained on the FUNSD dataset. It is used in the same way as other text detection models:
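```python
from sparkocr.transformers import ImageTextDetector

# Load the pretrained Dit-based detector and wire it into the pipeline columns.
ImageTextDetector \
    .pretrained("image_text_detector_dit", "en", "clinical/ocr") \
    .setInputCol("image") \
    .setOutputCol("region") \
    .setScoreThreshold(0.5)
```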
It is currently the best-performing model on the FUNSD dataset, reaching 94% accuracy versus 78.7% for the Craft detector, and is recommended for De-identification and Text Extraction pipelines.
- Dit-based VisualDocumentClassifierV3 now supports fine-tuning: check the new tutorial and notebook on how to fine-tune the Dit-based VisualDocumentClassifierV3 on the RVL-CDIP dataset using a Docker image.
- New Pretrained Pipeline for Table Extraction: the new pipeline, digital_pdf_table_extractor, extracts tables from digital PDFs. Check an end-to-end example in this notebook.
- New notebook explaining how to run inference on RVL-CDIP with VisualDocumentClassifierV3 on Databricks: check this new notebook to see how you can process the entire RVL-CDIP dataset in a few minutes using auto-scaling in Databricks.
- New RvlCdipReader to help read both the training and test parts of the RVL-CDIP document classification dataset. Check this notebook for an example.
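For readers unfamiliar with the dataset layout behind the RVL-CDIP items above, here is a minimal plain-Python sketch of how its label files can be parsed. It does not use or describe the RvlCdipReader API. It assumes the public RVL-CDIP release format, where each line of a label file holds a relative image path followed by a class index from 0 to 15. The function name `parse_label_lines` and the sample paths are hypothetical, chosen only for illustration.

```python
from collections import Counter

# Class names in the label order of the public RVL-CDIP release (indices 0..15).
RVL_CDIP_CLASSES = [
    "letter", "form", "email", "handwritten", "advertisement",
    "scientific report", "scientific publication", "specification",
    "file folder", "news article", "budget", "invoice",
    "presentation", "questionnaire", "resume", "memo",
]


def parse_label_lines(lines):
    """Turn '<relative image path> <class index>' lines into (path, class name) pairs."""
    records = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # The class index is the last whitespace-separated field on the line.
        path, index = line.rsplit(" ", 1)
        records.append((path, RVL_CDIP_CLASSES[int(index)]))
    return records


# Hypothetical sample lines in the layout of a label file (not real dataset paths).
sample = [
    "images/a/doc_0001.tif 15",
    "images/b/doc_0002.tif 11",
    "images/c/doc_0003.tif 15",
]

records = parse_label_lines(sample)
counts = Counter(name for _, name in records)
print(counts)
```

The same function can be applied to the training and test label files separately, which gives a quick check of class balance before fine-tuning or large-scale inference.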
🪲 Bug Fixes
- Lilt NER training no longer relies on a downloadable metrics script: all metric computation can now be handled offline.
- A bug in data consumption for VisualDocumentNer Lilt models was fixed: it affected data ingestion during fine-tuning and degraded the quality of the resulting models.
- Serialization issues preventing ImageTableDetector and HocrToTextTable from working properly in a pipeline were fixed.
- PositionFinder has improved error reporting logic.
- ImageToText errors on macOS were fixed.