Visual Document Understanding - Live Demos & Notebooks
Visual Document Classification
Classify documents using text and layout data with the new features offered by Spark OCR. (...)
Extract Data from FoundationOne Sequencing Reports
Extract patient, genomic and biomarker information from FoundationOne Sequencing Reports. (...)
Recognize entities in scanned PDFs
End-to-end example of a standard NER pipeline: import scanned images from cloud storage, preprocess them to improve their quality, recognize text using Spark OCR, correct spelling mistakes to improve the OCR results, and finally run NER to extract entities. (...)
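The order of the stages described above can be sketched in plain Python. This is only an illustrative toy under stated assumptions, not the Spark OCR API: the `Document` class, every stage function, the correction table, and the entity vocabulary are hypothetical placeholders, and a noisy string stands in for the pixels of a scanned page.

```python
import re
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Document:
    """Hypothetical container passed from stage to stage."""
    path: str
    raw: str  # stands in for the pixel data of a scanned page
    text: str = ""
    entities: List[Tuple[str, str]] = field(default_factory=list)


def preprocess(doc: Document) -> Document:
    # Placeholder for image cleanup (deskewing, denoising, ...):
    # here we only strip a made-up noise character.
    doc.raw = doc.raw.replace("~", "")
    return doc


def recognize_text(doc: Document) -> Document:
    # Placeholder for OCR: the "image" is already a string.
    doc.text = doc.raw
    return doc


# Hypothetical lookup table of known OCR confusions.
CORRECTIONS = {"pat1ent": "patient", "d1abetes": "diabetes"}


def correct_spelling(doc: Document) -> Document:
    # Replace each whitespace-separated token that has a known correction.
    doc.text = " ".join(CORRECTIONS.get(w, w) for w in doc.text.split())
    return doc


# Hypothetical entity vocabulary; a real NER model is learned, not a lookup.
VOCAB = {"diabetes": "CONDITION", "metformin": "DRUG"}


def extract_entities(doc: Document) -> Document:
    words = re.findall(r"[a-z]+", doc.text.lower())
    doc.entities = [(w, VOCAB[w]) for w in words if w in VOCAB]
    return doc


# Stage order mirrors the description: preprocess, OCR, spell-check, NER.
STAGES = [preprocess, recognize_text, correct_spelling, extract_entities]


def run_pipeline(doc: Document) -> Document:
    for stage in STAGES:
        doc = stage(doc)
    return doc


result = run_pipeline(
    Document(path="scan_001.png", raw="pat1ent has d1a~betes and takes metformin")
)
print(result.text)
print(result.entities)
```

The point of the sketch is the ordering: spelling correction runs on the recognized text before entity extraction, so the toy NER step sees "diabetes" rather than the misread "d1abetes".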
Extract brands from visual documents
This demo shows how brands can be detected in images using Spark OCR. (...)
Visual NER Key-Values v2
This demo extracts the main key points of a document using our pre-trained Spark OCR model. (...)
Visual Question Answering
This demo infers the answer from a given image and a text-based question using our pre-trained Spark OCR models. (...)
Chart to Text
Obtain a description of the charts in an input document image using our Spark OCR model. (...)
Chart to Text powered by LLM
Obtain a deeper interpretation of the charts in an input document image using our Spark OCR model powered by an LLM. (...)
Infographic Visual Question Answering
Infer the answer from a given infographic image and a text-based question using our pre-trained Spark OCR model. (...)