Citation
Abstract
Robustness is a key issue in speech recognition. This paper proposes a speech recognition algorithm for Malay digits from 0 to 9. This paper also proposes an algorithm for noise cancellation by using recursive least squares (RLS). This system consists of speech processing inclusive of digit margin and recognition which uses zero crossing and energy calculations. Mel-Frequency Cepstral Coefficient (MFCC) vectors are used to provide an estimate of the vocal tract filter. Meanwhile dynamic time warping (DTW) is used to detect the nearest recorded voice with appropriate global constraint. The global constraint is used to set a valid search region because the variation of the speech rate of the speaker is considered to be limited in a reasonable range, which means that it can prune the unreasonable search space. The algorithm is tested on speech samples that are recorded as a part of a Malay corpus. The results show that the algorithm can recognize almost 80.5% of the Malay digits for all recorded words. By adding RLS noise canceller in the preprocessing stage it increases the accuracy to 92.3%.
Download File
Full text not available from this repository.
|
Additional Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Divisions: | Faculty of Engineering |
DOI Number: | https://doi.org/10.1109/ITSIM.2008.4631680 |
Keywords: | Robustness; Speech recognition; Algorithm; Digit margin; Recognition; Vocal tract; Filter; Time warping; Voice |
Depositing User: | Fatimah Zahrah @ Aishah Amran |
Date Deposited: | 06 Jan 2014 09:23 |
Last Modified: | 03 Feb 2016 07:05 |
URI: | http://psasir.upm.edu.my/id/eprint/16587 |
Statistic Details: | View Download Statistic |
Actions (login required)
View Item |