ITRT (IT Research Trends)

diarization

Definition: Speaker diarization is the process of partitioning an audio stream containing human speech into homogeneous segments according to the identity of each speaker. It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker's true identity. It answers the question "who spoke when?" Speaker diarization combines speaker segmentation, which finds speaker change points in an audio stream, with speaker clustering, which groups speech segments on the basis of speaker characteristics.
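The two stages named in the definition can be illustrated with a minimal sketch. It is a toy under stated assumptions, not how any production system works: each frame is a hypothetical 2-D feature tuple standing in for a real speaker embedding, and the function names (`segment`, `cluster`, `diarize`) and both thresholds are invented for illustration. Segmentation places a boundary wherever consecutive frames differ sharply; clustering then greedily assigns each segment to the nearest known speaker centroid, or creates a new speaker if none is close enough.

```python
from math import dist  # Euclidean distance, Python 3.8+


def segment(frames, change_threshold):
    """Speaker segmentation: cut where consecutive frames differ sharply."""
    boundaries = [0]
    for i in range(1, len(frames)):
        if dist(frames[i - 1], frames[i]) > change_threshold:
            boundaries.append(i)
    boundaries.append(len(frames))
    # Pair up boundaries into half-open (start, end) frame ranges.
    return list(zip(boundaries[:-1], boundaries[1:]))


def centroid(vectors):
    """Mean vector of a non-empty list of equal-length tuples."""
    n = len(vectors)
    return tuple(sum(v[d] for v in vectors) / n for d in range(len(vectors[0])))


def cluster(frames, segments, same_speaker_threshold):
    """Speaker clustering: greedy assignment to the nearest speaker centroid."""
    speakers = []  # one centroid per speaker discovered so far
    labels = []
    for start, end in segments:
        c = centroid(frames[start:end])
        best = min(range(len(speakers)),
                   key=lambda k: dist(speakers[k], c),
                   default=None)
        if best is not None and dist(speakers[best], c) <= same_speaker_threshold:
            labels.append(best)
        else:
            speakers.append(c)  # no close match: treat as a new speaker
            labels.append(len(speakers) - 1)
    return labels


def diarize(frames, change_threshold=1.0, same_speaker_threshold=1.0):
    """Answer "who spoke when?" as (start, end, speaker) triples."""
    segments = segment(frames, change_threshold)
    labels = cluster(frames, segments, same_speaker_threshold)
    return [(s, e, f"speaker_{l}") for (s, e), l in zip(segments, labels)]


# Hypothetical toy input: two frames from one voice, two from another,
# then two more from the first voice.
frames = [(0.0, 0.1), (0.1, 0.0), (5.0, 5.1), (5.1, 5.0), (0.0, 0.0), (0.1, 0.1)]
result = diarize(frames)
print(result)
```

On this toy input the first and last segments receive the same label, because the clustering step recognises a returning speaker that segmentation alone would treat as a third, unrelated segment.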

Relation Part

No related fields were found.

Keyword frequency by year

Related keyword network

📄 Keyword details

Core research field	Artificial Intelligence
Main year	2024
Main related keyword	neural

Papers for this keyword (1)

Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
