High Performance NLP with Apache Spark

John Snow Labs’ NLP is a text processing library built on top of Apache Spark and its Spark ML library. Its goal is to provide an easy-to-use API for NLP annotations that scales across large distributed environments.

Questions? Join our Slack

2018 Dec 23rd - Update! 1.8.0 Released! Dependency Parser, new Spell Checker, Spark 2.4.0, performance boosts and more!

Get Started

Quick start guide to set up spark-nlp and get going


Pretrained models, pipelines and other concepts reference

Commercial Support

Go to production with enterprise-grade reliability, security & scale


Jupyter notebooks and Scala examples


Benchmarks, articles, blog posts, FAQ and troubleshooting


Conference talks, tutorials and podcasts