Word recognition in continuous speech using linear prediction analysis

Update Item Information
Publication Type Journal Article
School or College College of Science
Department Computing, School of
Creator Christiansen, Richard Wesley
Title Word recognition in continuous speech using linear prediction analysis
Date 1976
Description A promising method of automatic word recognition in continuous speech, recently designated as word spotting, has been demonstrated. The method uses error residual ratios from LPC (Linear Predictive Coding) vocoder analysis for waveform comparison and a dynamic programming procedure for time registration between the incoming speech and a template of the key word. Using a similarity threshold, the incoming speech is compared with several templates to account for variability is spectral shape. This system can work in real time using a real time vocoder. The multiple templates are used in such a way that a small number of templates, three or four, is made to look like several hundred or more. This is accomplished by dynamically constructing a composite template from parts of each single template as part of the processing of the incoming speech. Thus, a particular composite template is constructed for each word being recognized. An accuracy of 99 percent with no false alarms was achieved using 205 key words, five different speakers, and approximately ten minutes of speech text. Performance in the presence of additive white gaussian noise of approximately 11 dB signal-to-noise ratio was 66 percent. When the speech was processed to account for the noise, results improved to 85 percent to 90 percent accuracy. Finally, a digit recognition experiment was performed using over 1200 digits spoken by ten different people with a resultant accuracy of 97 percent.
Type Text
Publisher University of Utah
First Page 1
Last Page 78
Subject Word recognition; continuous speech; linear prediction analysis; word spotting; error residual ratios; LPC; Linear Predictive Coding; vocoder analysis
Subject LCSH Vocoder; Speech processing systems
Language eng
Bibliographic Citation Christiansen, R. W. (1976). Word recognition in continuous speech using linear prediction analysis. 1-78. UTEC-76-225.
Series University of Utah Computer Science Technical Report
Relation is Part of ARPANET
Rights Management ©University of Utah
Format Medium application/pdf
Format Extent 8,093,451 bytes
Identifier ir-main,16096
ARK ark:/87278/s6th953z
Setname ir_uspace
ID 705464
Reference URL https://collections.lib.utah.edu/ark:/87278/s6th953z
Back to Search Results