Publication Type |
Journal Article |
School or College |
College of Science |
Department |
Computing, School of |
Creator |
Christiansen, Richard Wesley |
Title |
Word recognition in continuous speech using linear prediction analysis |
Date |
1976 |
Description |
A promising method of automatic word recognition in continuous speech, recently designated as word spotting, has been demonstrated. The method uses error residual ratios from LPC (Linear Predictive Coding) vocoder analysis for waveform comparison and a dynamic programming procedure for time registration between the incoming speech and a template of the key word. Using a similarity threshold, the incoming speech is compared with several templates to account for variability is spectral shape. This system can work in real time using a real time vocoder. The multiple templates are used in such a way that a small number of templates, three or four, is made to look like several hundred or more. This is accomplished by dynamically constructing a composite template from parts of each single template as part of the processing of the incoming speech. Thus, a particular composite template is constructed for each word being recognized. An accuracy of 99 percent with no false alarms was achieved using 205 key words, five different speakers, and approximately ten minutes of speech text. Performance in the presence of additive white gaussian noise of approximately 11 dB signal-to-noise ratio was 66 percent. When the speech was processed to account for the noise, results improved to 85 percent to 90 percent accuracy. Finally, a digit recognition experiment was performed using over 1200 digits spoken by ten different people with a resultant accuracy of 97 percent. |
Type |
Text |
Publisher |
University of Utah |
First Page |
1 |
Last Page |
78 |
Subject |
Word recognition; continuous speech; linear prediction analysis; word spotting; error residual ratios; LPC; Linear Predictive Coding; vocoder analysis |
Subject LCSH |
Vocoder; Speech processing systems |
Language |
eng |
Bibliographic Citation |
Christiansen, R. W. (1976). Word recognition in continuous speech using linear prediction analysis. 1-78. UTEC-76-225. |
Series |
University of Utah Computer Science Technical Report |
Relation is Part of |
ARPANET |
Rights Management |
©University of Utah |
Format Medium |
application/pdf |
Format Extent |
8,093,451 bytes |
Identifier |
ir-main,16096 |
ARK |
ark:/87278/s6th953z |
Setname |
ir_uspace |
ID |
705464 |
Reference URL |
https://collections.lib.utah.edu/ark:/87278/s6th953z |