High Performance NLP with Apache Spark

John Snow Labs’ NLP is a text processing library built on top of Apache Spark and its Spark ML library. It's goal is to provide easy API for NLP annotations allowing a scalable approach within a distributed large scale environment.

Questions? Join our Slack

2019 March 23rd - Update! 2.0.1 Released! Bert embeddings, embeddings as annotators, better OCR, new pretrained pipelines and much more!

Get Started

Quick start guide to setup spark-nlp and get going


Pretrained models, pipelines and other concepts reference

Commercial Support

Go production with enterprise-grade reliability, security & scale


Jupyter notebooks and scala examples


Benchmarks, articles, blog-posts, FAQ and Troubleshooting


Conference talks, tutorials and podcasts