PDF

 

Rate PDF

You can rate PDF in Generative AI Lab. The HyperText tag shows the PDF, when you specify “pdf” as the name parameter and pdf header of the JSON in the value parameter. To rate the article, you also need the Rating tag. The Rating tag adds a rating selection to the labeling interface. A simple example configuration is shown below.

rate-pdf

Image Classification

This task mainly uses the Image and Choices tags. You can optionally provide headers to the choices using the Headers tag.

Image-classification

</div><div class="h3-box" markdown="1">

Visual NER Labeling

To label entities in an image, you need to create rectangular labels spanning across the entity to be labeled. To enable this, you have to use RectangleLabels tag that creates labeled rectangles.They are used to apply labels to bounding box semantic segmentation tasks. The name and toName parameters are mandatory.

Visual-NER

The zoom and zoomControl parameters in the Image tag enable you too zoom in or out the image.

</div>

Identify and Validate Checkboxes with Precision

Version 6.9.0 introduces a new project type called Checkbox Detection. With the new update, users can now use the model offered by Generative AI Lab to identify checkboxes in the tasks, including the checked and unchecked status in the respective tasks.

This project type can be selected from the Content Type page under the Image tab during project setup. The default model associated with Checkbox Detection is automatically downloaded from the Models Hub page and added to the project configuration.

690image

After the project is configured, users can add relevant tasks and leverage the model to detect checkboxes and their respective checked and unchecked statuses.

690image

This new update integrates seamlessly with the existing workflow, ensuring no changes or disruptions to the current application processes.

This model can not currently be combined with other models.

Detect and Validate Handwritten Text and Signatures

This update continues with the Handwritten Text and Signature Detection project type. This new feature enables the automatic identification and annotation of handwritten text and signatures within documents, using John Snow Lab’s Visual NLP Library. The new project type can be selected from the Content Type page under Image tab during project configuration. Upon selection, the default model for Handwritten Text and Signature Detection is automatically downloaded from the Models Hub and integrated into the project configuration.

690image

Users can then add relevant tasks to the project and use the model to identify and annotate handwritten content and signatures in documents efficiently.

690image

This feature doesn’t change the existing application workflow, and can not be combined with other models at this time.

View Text alongside images, and Optimized Image Loading in Visual NER

Generative AI Lab 6.11 brings an Image-Text Side-by-Side Annotation project, allowing users to view the original image or PDF alongside its OCR-extracted text for easier annotation. It also includes optimized Visual NER for faster PDF processing, enhanced zoom functionality, and a new licensing model for on-premises deployments.

Other minor improvements have been made to Generative AI Lab, specifically to model training and de-identification.

Annotate while referencing Orginal Documents

This feature improves visibility for image-based documents by displaying both the image/PDF and its OCR-extracted text side by side. Users can now annotate more efficiently while maintaining a visual reference. While Generative AI Lab has offered annotation on top of Image and PDF formats for quite some time, there was a gap in annotating large amounts of data on top of the original document, as there was not enough space to adequately address more robust annotation projects.

Image on the Left, OCR Text on the Right: By Selecting this project type, users can now view the image/PDF document on the left side of the interface and its corresponding OCR-extracted text on the right side.

Paginated Documents: All document pages are paginated, allowing users to navigate the document effortlessly.

Image and PDF documents now support all text-based annotation features—including NER, assertion, relations, resolvers, and lookup code annotation—and allow OCR text annotation.

Key Features

How to Configure the Project

  1. Select the Project Template:
    • Navigate to the Image tab and choose the Image & Text Side-By-Side Annotation template.
    • Save the configuration to proceed.
  2. Add Models to the Reuse Resource Page:
    • Click Next to move to the Reuse Resource page.
    • Add the required models, such as NER, relation, and assertion models, to the project.
  3. Configure Individual Labels:
    • Click Next again to access the label configuration page.
    • Click on individual labels to add lookup data or resolver models.
  4. Save and Deploy the Preannotation Server:
    • Once all configurations are complete, save the project.
    • Deploy the preannotation server to start using the project.

For a visual guide, refer to the GIF below that demonstrates the configuration process step-by-step:

6110image

How to Use Preannotation

After deploying the preannotation server, follow these steps to preannotate your documents:

  1. Import a PDF or Image:
    • Go to the Task Page and navigate to the Import Page.
    • Import the PDF or image you wish to annotate.
  2. Automatic OCR Processing:
    • The imported document will automatically pass through our OCR process.
    • Once OCR is complete, a task will be generated for the document.
  3. Preannotate the Task:
    • Use the deployed preannotation server to preannotate the task.
    • Alternatively, click on the task to annotate it manually.
  4. Automatic NER Label Projection:
    • As you annotate the text side, NER labels are automatically projected onto the image side for visual reference.
    • This ensures a seamless and intuitive annotation experience.

For a visual guide, refer to the GIF below that demonstrates the pre-annotation/annotation process step-by-step:

6110image

Image with Text Annotation (Performance-Optimized)

The second project template is similar to the first but is optimized for speed and larger multi-page PDFs. It maintains the side-by-side view found in the above project without the NER labels appearing on the image. This is to improve annotation speed and reduce server resources. This will work better for high-volume document processing.

6110image

Support for HCC Coding

This release introduces support for HCC Coding for text and PDF content. The system now maps detected ICD-10 codes to their corresponding HCC codes, streamlining clinical risk adjustment workflows and insurance claim verification.

New project types:

  1. HCC Coding for Text
  2. HCC Coding for PDF and Text (side by side)

These project types enable the association of HCC codes with annotated clinical entities using preconfigured lookup datasets, reducing manual input and improving consistency in medical coding.

700image

Usage Instructions

To enable HCC Coding Support, follow these steps:

To enable HCC Coding Support, follow these steps:

  1. Project Setup
    • Select either of the new project templates during project creation.
    • Choose the HCC Coding for PDF and Text (side by side) option if you need a visual representation of the original document while performing HCC coding.

700image

  1. Label Customization - On the Customize Labels page, users can either:
    • Apply a lookup dataset globally, to all labels in your taxonomy at once.
    • Assign Lookup options to specific labels.

700image

  1. Annotation Process
    • Annotate entities and assign codes using the annotation widget.
    • Edit codes inline or through the Annotation Widget from the right panel.
    • Annotated chunks are listed under their respective labels. Users can expand labels by clicking the down arrow to view all chunks associated with them.
    • Lookup code can be edited or updated directly from labeled tokens or via the labeling section by clicking the edit button.
    • Predictions can be copied to generate a completion, allowing the HCC code to be reviewed using the annotation widget on the right.

700image

  1. Review and Confirmation Once a task is labeled and lookup codes are assigned along with HCC Codes, reviewers have the following options:
    • Accept and confirm the labeled text.
    • Decline and remove the labels.
    • Tag the label as non-billable, if necessary.

700image

Raking Score Integration

Tasks can now include ranking scores to support triaging and prioritization, allowing users to manage large annotation datasets more effectively. When importing tasks, users can associate each task with a ranking score that reflects its clinical significance or urgency. These scores are then displayed in the task list and can be used to sort and filter tasks dynamically. This functionality is particularly beneficial in risk adjustment workflows where prioritizing complex or high-impact cases is critical. Ranking scores also integrate with the HCC coding workflow, enabling annotators and reviewers to systematically focus on the most relevant cases for validation.

700image

Last updated