| Voice or speech recognition is the ability of a machine
or program to receive and interpret dictation, or to understand and carry
out spoken commands.
For use with computers, analog audio must be converted into digital signals.
This requires analog-to-digital conversion. For a computer to decipher
the signal, it must have a digital database, or vocabulary, of words or
syllables, and a speedy means of comparing this data with signals. The
speech patterns are stored on the hard drive and loaded into memory when
the program is run. A comparator checks these stored patterns against
the output of the A/D converter.
In practice, the size of a voice-recognition program's effective vocabulary
is directly related to the random access memory capacity of the computer
in which it is installed. A voice-recognition program runs many times
faster if the entire vocabulary can be loaded into RAM, as compared with
searching the hard drive for some of the matches. Processing speed is
critical as well, because it affects how fast the computer can search
the RAM for matches.
|