High Performance NLP with Apache Spark

John Snow Labs’ NLP is a text processing library built on top of Apache Spark and its Spark ML library. Its goal is to provide an easy-to-use API for NLP annotations that scales across large distributed environments.

Questions? Join our Slack

2018 Dec 23rd - Update! 1.8.0 Released! Dependency Parser, new Spell Checker, Spark 2.4.0, performance boosts and more!

Get Started

Quick start guide to set up spark-nlp and get going

Documentation

Pretrained models, pipelines and other concepts reference

Commercial Support

Go to production with enterprise-grade reliability, security & scale

Examples

Jupyter notebooks and Scala examples

Articles

Benchmarks, articles, blog posts, FAQ and troubleshooting

Videos

Conference talks, tutorials and podcasts