Classification and separation techniques based on fundamental frequency for speech enhancement

Cabañas Molero, P.

Classification and separation techniques based on fundamental frequency for speech enhancement

Cabañas Molero, P.

Supervised by:

Nicolás Ruiz Reyes Director
P. Vera-Candeas Director

Defence university: Universidad de Jaén

Fecha de defensa: 11 January 2016

Committee:

Antonio Miguel Peinado Herreros Chair
Damián Martínez Muñoz Secretary
Juan Andrés Morales Cordovilla Committee member

Department:

INGENIERÍA DE TELECOMUNICACIÓN

Type: Thesis

Teseo: 421216 DIALNET RUJA editor

Abstract

This thesis is focused on the development of new classification and speech enhancement algorithms based, explicitly or implicitly, on the fundamental frequency (F0). The F0 of speech has a number of properties that enable speech discrimination from the remaining signals in the acoustic scene, either by defining F0-based signal features (for classification) or F0-based signal models (for separation). Three main contributions are included in this work: 1) an acoustic environment classification algorithm for hearing aids based on F0 to classify the input signal into speech and nonspeech classes; 2) a frame-by-frame basis voiced speech detection algorithm based on the aperiodicity measure, able to work under non-stationary noise and applicable to speech enhancement; 3) a speech denoising algorithm based on a regularized NMF decomposition, in which the background noise is described in a generic way with mathematical constraints.