High Performance NLP with Apache Spark

John Snow Labs’ NLP is a text processing library built on top of Apache Spark and its Spark ML library. It's goal is to provide easy API for NLP annotations allowing a scalable approach within a distributed large scale environment.

Questions? Join our Slack

2018 May 19th - Update! 1.5.4 Released! Better Normalizer with slang dictionary, improvements in annotators and fixed python2 support Learn more HERE and check out updated documentation below

Get started

Quick start guide to setup spark-nlp and get going


Pretrained models, pipelines and other concepts reference


Sample Notebooks, guideline to use SparkNLP


Ways to Contribute to spark-nlp repository

Resources & FAQs

Videos, Podcasts, Whitepapers and other questions

License & Credits

Licensing / Acknowledgements