Professional Documents
Culture Documents
Modern speech recognition software enables a single computer user to speak text and/or
commands to the computer, largely, but not entirely, bypassing the use of the keyboard
and mouse interface.
The idea has been portrayed in science fiction for many decades, quite frequently
depicting computers that do not even have keyboards or mice. Such computers are also
typically depicted as being able to keep up no matter how fast a person speaks, and
without regard to who the speaker is, the language spoken, or even how many speakers
there are. In other words, they're depicting a computer that hears in like manner as a
multilingual person.
Attempts to develop usable speech recognition software began in the mid-1900s, and
proved to be far more daunting than anyone had imagined. It also turned out to require so
much computing power that only the most modern computers are now able to perform the
functions required in real time (i.e., as fast as you can speak).
The first commercially practical products became available around 1990, (e.g. the Voice
Navigator, a standalone computer dedicated 100% to speech recognition) and used up all
the available computing power of the machine, which would send its output to a second
computer. They weren't particularly accurate and could only understand a single person at
a time, requiring retraining, not of the operator but of the machine itself, to work for
another person. Despite these limitations, they could type so rapidly that even after taking
time to make corrections, a person with disabilities could easily accomplish more work
with the machine than without it. For persons with physical disabilities, the ability to
simply talk to your computer could be a priceless asset. Consider for instance, an author
with Parkinson's disease who can barely control his hands, yet is conveniently able to
create an article.
There are other scenarios where the deficiencies of the equipment are easily outweighed.
Consider for example, the machines do not have enough intelligence yet to properly
process a child's voice. Obstacles include the fact that most children don't yet fully
understand how language is used (e.g. proper construction of a complete sentence) and as
they are growing their voices are continuously changing.
There are now both proprietary and open source systems on the market, with
development emphasis being placed upon serving the legal and medical markets.
Contents
[hide]
• 1 Free software
o 1.1 Free speech corpus and acoustic model repositories
• 2 Proprietary software
• 3 References
[edit] References
1. ^ [1]
2. ^ Philips SpeechMagic named European Technology Leader by Frost & Sullivan
Views
• Article
• Discussion
• Edit this page
• History
Personal tools
• Try Beta
• Log in / create account
Navigation
• Main page
• Contents
• Featured content
• Current events
• Random article
Search
Go Search
Interaction
• About Wikipedia
• Community portal
• Recent changes
• Contact Wikipedia
• Donate to Wikipedia
• Help
Toolbox