TUTCRIS - Tampereen teknillinen yliopisto

TUTCRIS

Noise-Robust Detection of Whispering in Telephone Calls Using Deep Neural Networks

Tutkimustuotosvertaisarvioitu

Yksityiskohdat

AlkuperäiskieliEnglanti
Otsikko24th European Signal Processing Conference (EUSIPCO)
JulkaisupaikkaBudapest, Hungary
KustantajaIEEE
ISBN (elektroninen)978-0-9928-6265-7
DOI - pysyväislinkit
TilaJulkaistu - 1 elokuuta 2016
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaEUROPEAN SIGNAL PROCESSING CONFERENCE -
Kesto: 1 tammikuuta 1900 → …

Julkaisusarja

Nimi
ISSN (elektroninen)2076-1465

Conference

ConferenceEUROPEAN SIGNAL PROCESSING CONFERENCE
Ajanjakso1/01/00 → …

Tiivistelmä

Detection of whispered speech in the presence of high levels of background noise has applications in fraudulent behaviour recognition. For instance, it can serve as an indicator of possible insider trading. We propose a deep neural network (DNN)-based whispering detection system, which operates on both magnitude and phase features, including the group delay feature from all-pole models (APGD). We show that the APGD feature outperforms the conventional ones. Trained and evaluated on the collected diverse dataset of whispered and normal speech with emulated phone line distortions and significant amounts of added background noise, the proposed system performs with accuracies as high as 91.8%.

Tutkimusalat

Julkaisufoorumi-taso