An End-to-End Continuous Speech Recognition System in Bengali for General and Elderly Domain


연구 분야: Artificial Intelligence



학회: SN Computer Science


초록

Although a substantial amount of research has been carried out on the development of Automatic Speech Recognition (ASR) in Bengali, we did not find any open ASR system that works well when tested with utterances of elderly people. Developing a new system for the elderly domain from scratch demands a huge amount of training resources. However, such domain-specific Bengali ASR resources are not available, and the creation of sufficient resources is costly and time-consuming. In this paper, we investigate the efficiency of transfer learning where we used a small domain-specific in-house dataset along with available general-domain open resources. First, we develop a CNN-BiGRU network using the openSLR data for the generic domain Bengali. Then we use the network in a transfer learning architecture where only 5 h of elderly data is fed. With the existing open general domain data, the system proposed a CER of 7.72%. When the same system was tested using the elderly data, the CER was reduced to 19.46%. Then we used the proposed transfer learning framework, and the CER is improved to 12.37%. In our experiments, we found that the proposed model outperforms the existing systems tested on the elderly domain. This improvement demonstrates the effectiveness of transfer learning for developing ASR systems in languages or domains where sufficient training resources are not available.


Author Profile
Shubhojeet Paul

Department of Computer Science and Engg. Birla Institute of Technology Mesra Ranchi 835215 India

Andorra
Author Profile
Vandana Bhattacharjee

Department of Computer Science and Engg. Birla Institute of Technology Mesra Ranchi 835215 India

Andorra
Author Profile
Sujan Kumar Saha

Department of Computer Science and Engg. National Institute of Technology Durgapur Durgapur 713209 India

Andorra

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Andorra
사이트 Springer
좋아요 수 0

연관 논문 목록 (146건)