Improving ethical sensitivity for ethical decision-making in conversational artificial intelligence


연구 분야: Artificial Intelligence



학회: Discover Computing


초록

The development of large language models has significantly advanced the reasoning capabilities of artificial intelligence (AI), often surpassing human-level performance. As AI's cognitive abilities rapidly progress, ethical concerns surrounding its applications have also increased. This indicates a heightened risk of bias, as AI models scale up and train on vast amounts of general data that inherently include social conventions related to gender, race, politics, and religion. This study proposes methods for enhancing ethical sensitivity to social bias. To achieve this, we defined 20 categories of social bias and developed a model that predicts the ethical sensitivity of sentences by leveraging the influence scores of words within these categories. The ethical sensitivity prediction model was validated using a paired-sample t-test to compare the ethical sensitivity evaluations of 25 AI-generated responses assessed by both AI and human evaluators. The test revealed no significant differences between the two groups, thus confirming the validity of the model. The findings of this study suggest that recognizing and predicting the ethical sensitivity of utterances concerning social biases can enhance ethical sensitivity, mitigate the risk of bias, and contribute to more ethical decision-making in AI interactions.


Author Profile
Kyungsun Yoo

Department of Computer Education Sungkyunkwan University Hoam Hall 25-2 Sungkyunkwan-Ro Jongno-Gu Seoul 50803 Republic of Korea

Guam
Author Profile
Seongjin Ahn

Department of Computer Education Sungkyunkwan University Hoam Hall 25-2 Sungkyunkwan-Ro Jongno-Gu Seoul 50803 Republic of Korea

Guam

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Guam
사이트 Springer
좋아요 수 0

연관 논문 목록 (74건)