Domain Adaptation of Transformer-Based Models Using Unlabeled Data for Relevance and Polarity Classification of German Customer Feedback


연구 분야: Infrastructure



학회: SN Computer Science


초록

Understanding customer feedback is becoming a necessity for companies to identify problems and improve their products and services. Text classification and sentiment analysis can play a major role in analyzing this data by using a variety of machine and deep learning approaches. In this work, different transformer-based models are utilized to explore how efficient these models are when working with a German customer feedback dataset. In addition, these pre-trained models are further analyzed to determine if adapting them to a specific domain using unlabeled data can yield better results than off-the-shelf pre-trained models. To evaluate the models, two downstream tasks from the GermEval 2017 are considered. The experimental results show that transformer-based models can reach significant improvements compared to a fastText baseline and outperform the published scores and previous models. For the subtask Relevance Classification, the best models achieve a micro-averaged F1-Score of 96.1 % on the first test set and 95.9 % on the second one, and a score of 85.1 % and 85.3 % for the subtask Polarity Classification.


Author Profile
Ahmad Idrissi-Yaghir

Department of Computer Science University of Applied Sciences and Arts Dortmund (FHDO) Emil-Figge Str. 42 Dortmund 44227 Germany

Andorra
Author Profile
Henning Schäfer

Institute for Medical Informatics Biometry and Epidemiology (IMIBE) University Hospital Essen Hufelandstraße 55 Essen 45147 Germany

Andorra
Author Profile
Nadja Bauer

Department of Computer Science University of Applied Sciences and Arts Dortmund (FHDO) Emil-Figge Str. 42 Dortmund 44227 Germany

Andorra

📄 논문 정보

발행 연도 2023년
인용수 0
출판 국가 Germany, Andorra
사이트 Springer
좋아요 수 0

연관 논문 목록 (84건)