High Performance NLP with Apache Spark

John Snow Labs’ NLP is a text processing library built on top of Apache Spark and its Spark ML library. Its goal is to provide an easy-to-use API for NLP annotations that scales within large distributed environments.

Questions? Join our Slack

2019 Feb 8th - Update! 1.8.2 Released! Performance improvements, OCR autorotation and more

Get Started

Quick start guide to set up spark-nlp and get going


Pretrained models, pipelines and other concepts reference

Commercial Support

Go to production with enterprise-grade reliability, security & scale


Jupyter notebooks and Scala examples


Benchmarks, articles, blog posts, FAQ and troubleshooting


Conference talks, tutorials and podcasts