A Study of Speech Recognition, Speech Translation, and Speech Summarization of TED English Lectures


연구 분야: Artificial Intelligence



학회: 2023 IEEE 12th Global Conference on Consumer Electronics (GCCE)


초록

Our research focuses on developing an automatic speech recognition system for English lectures, which involves summarizing the content and providing Japanese subtitles. Subtitling the entire audio of an English lecture could hinder comprehension and readability, so a summarization system is desired. By employing the DNN-HMM based speech recognition system, we achieved an 88% word accuracy for recognizing TED lecture speeches. Speech translation results showed a lower BLEU score of approximately 14% compared to text translation. Conversely, speech summarization proved its robustness to speech recognition errors, as the extracted important sentences were almost the same as those in the text summarization process.


Author Profile
Kazumasa Yamamoto

Department of Computer Science Chubu University Kasugai Japan

Japan
Author Profile
Haruhiko Banno

Department of Computer Science Chubu University Kasugai Japan

Japan
Author Profile
Haruki Sakurai

Department of Computer Science Chubu University Kasugai Japan

Japan

📄 논문 정보

발행 연도 2023년
인용수 6
출판 국가 Japan
사이트 IEEE
좋아요 수 0

연관 논문 목록 (35건)