Spark NLP: Natural Language Understanding at Scale
2021-01-26
Veysel Kocaman, David Talby
Code: github.com/JohnSnowLabs/spark-nlp (official, referenced in the paper)
Abstract
Spark NLP is a Natural Language Processing (NLP) library built on top of Apache Spark ML. It provides simple, performant, and accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment. Spark NLP comes with 1100 pretrained pipelines and models in more than 192 languages. It supports nearly all NLP tasks, with modules that can be used seamlessly in a cluster. Downloaded more than 2.7 million times and experiencing ninefold growth since January 2020, Spark NLP is used by 54% of healthcare organizations and is the world's most widely used NLP library in the enterprise.