Facta Univ. Ser.: Elec. Energ., vol. 19, No. 2, August 2006, pp. 287-298

Enhancing Robustness of Speech Recognizers by Bimodal Features

Inge Gavat, Gabriel Costache, and Claudia Iancu

Abstract: In this paper a robust speech recognizer is presented based on features obtained from the speech signal and also from the image of the speaker. The features were combined by simple concatenation, resulting composed feature vectors to train the models corresponding to each class. For recognition, the classification process relies on a very effective algorithm, namely the multiclass SVM. Under additive noise conditions the bimodal system based on combined features acts better than the unimodal system, based only on the speech features, the added information obtained from the image playing an important role in robustness improvement.

Keywords: Robust speech, bimodal system, support vector machines, neural networks.

gavat.pdf