ITRT(IT Research Trends)

SEASR: Speech Enhancement for Automatic Speech Recognition Systems using Convolution Recurrent Neural Network with Residual Connections

연구 분야: Artificial Intelligence

논문 키워드: #algorithm #algorithms #improving #improvements #noisy

학회: 2024 IEEE 5th Women in Technology Conference (WINTECHCON)

초록

Automatic Speech Recognition (ASR) systems have become increasingly popular in recent years with applications like virtual assistants and auto-captions. However, one of the major hurdles is the noise in speech signals which significantly degrades ASR accuracy. A simpler solution is to enhance the speech signal using a noise reduction algorithm prior to ASR. The existing speech enhancement algorithms suffer from a trade-off between noise reduction and speech distortion, and the difficulty in estimating non-stationary noises. In this work, we present SEASR, a lightweight fully causal realtime speech enhancement network for improving ASR accuracy in noisy environments. Our network seamlessly removes non-stationary noises from speech at low SNRs demonstrating a relative improvement of 20-60% of accuracy across various ASR systems. We also showcase the performance and quality improvements of our model compared to other popular noise suppression methods.

📄 논문 정보

발행 연도	2024년
인용수	106
출판 국가	India
사이트	IEEE
좋아요 수	0

SEASR: Speech Enhancement for Automatic Speech Recognition Systems using Convolution Recurrent Neural Network with Residual Connections

SEASR: Speech Enhancement for Automatic Speech Recognition Systems using Convolution Recurrent Neural Network with Residual Connections

Manasi Remane

Revanth Reddy Nalia

Ambrish Dantrey

📄 논문 정보

연관 논문 목록 (299건)

SEASR: Speech Enhancement for Automatic Speech Recognition Systems using Convolution Recurrent Neural Network with Residual Connections

SEASR: Speech Enhancement for Automatic Speech Recognition Systems using Convolution Recurrent Neural Network with Residual Connections

📄 논문 정보

연관 논문 목록 (299건) 내 서재 담기

연관 논문 목록 (299건)