Data Generation, Testing and Evaluation of Chinese Natural Language Processing in the Cloud


연구 분야: Artificial Intelligence



학회: 2022 IEEE 7th International Conference on Smart Cloud (SmartCloud)


초록

With the rapid development of artificial intelligence, natural language processing, as an important branch, has also become a hot research field. A series of super large-scale pre-trained models represented by BERT and GPT have made great progress in natural language understanding and natural language generation, even some of the experimental accuracy exceed the human benchmark. However, these models will also make some mistakes and even fairness problems when they have the language ability equivalent to human beings. In order to verify whether the models can truly understand natural language, the evaluation of these models is particularly important. More methods are needed to evaluate the model. The language model-based evaluation tools often require a lot of computing resources. In this paper, we propose a method for testing and evaluation of Chinese natural language processing in cloud, generate testing data and design tests for Chinese data and test two pre-trained models. The experimental results show that our method can find defects of the model, though it has high performance on specific dataset.


Author Profile
Minjie Ding

Shanghai Key Laboratory of Computer Software Testing and Evaluating Shanghai Development Center of Computer Software Technology Shanghai China

Andorra
Author Profile
Mingang Chen

Shanghai Key Laboratory of Computer Software Testing and Evaluating Shanghai Development Center of Computer Software Technology Shanghai China

Andorra
Author Profile
Wenjie Chen

Shanghai Key Laboratory of Computer Software Testing and Evaluating Shanghai Development Center of Computer Software Technology Shanghai China

Andorra

📄 논문 정보

발행 연도 2022년
인용수 203
출판 국가 Andorra, India
사이트 IEEE
좋아요 수 0

연관 논문 목록 (69건)