The signal-to-noise ratio (SNR) at a given microphone can be weak relative to the background noise since the signal energy is inversely proportional to the square of the distance to the sound source . Moreover, room acoustics leads to a reverberated speech signal.
Interferences, such as speech from the codriver, may greatly hamper the speech recognizer performance, which is crucial for human–machine dialog applications. Separation of the target speaker during periods of competing speech from the codriver represent a particular challenge. This is because the characteristics of the interferer signals cannot be directly estimated from the microphone signals during these periods . This problem is of particular importance since spontaneous multiparty speech contains lots of overlaps between the speech flows of the participants .
These issues make the seamless speech input a challenging problem. Before recognizing speech as a sequence of words, an important preprocessing step is to denoise the speech signal from its perturbations. In this book, we address the issue of separating the desired signal from interfering speech, i.e., the point (ii) above.
KeywordsSpeech Signal Blind Source Separation Microphone Array Target Speaker Adaptive Beamformers
Unable to display preview. Download preview PDF.