Abstract
NLP Workbench is a web-based platform for text mining that allows non-expert users to obtain semantic understanding of large-scale corpora using state-of-the-art text mining models. The platform is built upon latest pre-trained models and open source systems from academia that provide semantic analysis functionalities, including but not limited to entity linking, sentiment analysis, semantic parsing, and relation extraction. Its extensible design enables researchers and developers to smoothly replace an existing model or integrate a new one. To improve efficiency, we employ a microservice architecture that facilitates allocation of acceleration hardware and parallelization of computation. This paper presents the architecture of NLP Workbench and discusses the challenges we faced in designing it. We also discuss diverse use cases of NLP Workbench and the benefits of using it over other approaches. The platform is under active development, with its source code released under the MIT license. A website and a short video demonstrating our platform are also available.
Abstract (translated)
NLP Workbench是一个基于Web的文本挖掘平台,它允许非专家用户使用最先进的文本挖掘模型,对大规模语料库进行语义理解。该平台基于学术界最新的预训练模型和开源系统,提供了语义分析功能,包括但不限于实体链接、情感分析、语义解析和关系提取。该平台可扩展的设计使得研究人员和开发人员可以轻松地更换现有模型或集成新的模型。为了提高效率,我们采用了微服务架构,便于分配加速硬件和并行计算。本文介绍了NLP Workbench的架构,并讨论了在设计该平台时所面临的挑战。我们还讨论了NLP Workbench多种使用场景以及使用它相比其他方法的优势。该平台正在积极开发,其源代码已采用MIT许可证发布。一个网站和一个简短的视频演示了我们的平台。
URL
https://arxiv.org/abs/2303.01410