04-06-2025
Welcome to the John Snow Labs Medical LLMs On-Premise Deployment Documentation and Updates Hub!

We are excited to announce the launch of our on-premise deployment page for the Medical LLM models: a centralized repository for all the latest features, enhancements, and resolutions of known issues. This dedicated space is designed to keep users informed of the most recent developments, enabling seamless testing and facilitating the provision of valuable feedback. Our commitment is to ensure that users have immediate access to the latest information, empowering them to leverage the full capabilities of our Medical LLM models effectively. Stay updated with us as we continue to improve and expand the functionalities of our Medical LLMs to meet and exceed your expectations.
Supported Medical LLM Models
| Model Name | Parameters | Recommended GPU Memory | Max Sequence Length | Model Size | Max KV-Cache | Tensor Parallel Sizes |
|---|---|---|---|---|---|---|
| Medical-LLM-7B | 7B | ~25 GB | 16K | 14 GB | 11 GB | 1, 2, 4 |
| Medical-LLM-10B | 10B | ~35 GB | 32K | 19 GB | 15 GB | 1, 2, 4 |
| Medical-LLM-14B | 14B | ~40 GB | 16K | 27 GB | 13 GB | 1, 2 |
| Medical-LLM-24B | 24B | ~69 GB | 32K | 44 GB | 25 GB | 1, 2, 4, 8 |
| Medical-LLM-Small | 14B | ~58 GB | 32K | 28 GB | 30 GB | 1, 2, 4, 8 |
| Medical-LLM-Medium | 70B | ~452 GB | 128K | 131 GB | 320 GB | 4, 8 |
| Medical-Reasoning-LLM-14B | 14B | ~58 GB | 32K | 28 GB | 30 GB | 1, 2, 4, 8 |
| Medical-Reasoning-LLM-32B | 32B | ~222 GB | 128K | 61 GB | 160 GB | 2, 4, 8 |
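As a rough planning aid, the table's "Model Size" and "Max KV-Cache" columns can be combined to sanity-check whether a model shard fits on your GPUs at a given tensor parallel size. The sketch below is a hypothetical helper (not part of any John Snow Labs SDK); the formula and the fixed overhead value are simplifying assumptions, so treat the "Recommended GPU Memory" column as the authoritative guidance.

```python
# Hypothetical fit check: per-GPU need ≈ (weights + KV-cache) / tensor_parallel_size
# plus a fixed overhead for activations and runtime buffers (assumed, not vendor-specified).

def fits_on_gpus(model_size_gb: float, kv_cache_gb: float,
                 gpu_memory_gb: float, tensor_parallel_size: int,
                 overhead_gb: float = 4.0) -> bool:
    """Return True if each GPU can hold its shard of weights and KV-cache."""
    per_gpu_need = (model_size_gb + kv_cache_gb) / tensor_parallel_size + overhead_gb
    return per_gpu_need <= gpu_memory_gb

# Example: Medical-LLM-24B (44 GB weights, 25 GB max KV-cache) on 2 x 48 GB GPUs
print(fits_on_gpus(44, 25, 48, 2))  # (44+25)/2 + 4 = 38.5 GB per GPU -> True
```

Note that only the tensor parallel sizes listed in the table are supported for each model, so a configuration should be checked against that column as well as raw memory.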