
Speech recognition (also known in many contexts as automatic speech recognition, computer speech recognition, or voice recognition) is the process of converting a speech signal to a set of words by means of an algorithm implemented as a computer program. Speech recognition applications that have emerged in recent years include voice dialing (e.g., "Call home"), call routing (e.g., "I would like to make a collect call"), simple data entry (e.g., entering a credit card number), and preparation of structured documents (e.g., a radiology report).

Defining the Problem


According to "Survey of the State of the Art in Human Language Technology" (1997) by Ron Cole et al., speech recognition is the process of converting an acoustic signal, captured by a microphone or a telephone, to a set of words. The recognized words can be the final results, for such applications as command and control, data entry, and document preparation. They can also serve as the input to further linguistic processing in order to achieve text formatting or speech understanding.

Speech recognition systems can be characterized by many parameters, as shown in the table below.

Parameter        Range
---------------  ---------------------------------------------
Speaking Mode    Isolated words to continuous speech
Speaking Style   Read speech to spontaneous speech
Enrollment       Speaker-dependent to speaker-independent
Vocabulary       Small (< 20 words) to large (> 20,000 words)
Language Model   Finite-state to context-sensitive
Perplexity       Small (< 10) to large (> 100)
SNR              High (> 30 dB) to low (< 10 dB)
Transducer       Voice-cancelling microphone to telephone

An isolated-word speech recognition system requires that the speaker pause briefly between words, whereas a continuous speech recognition system does not. Spontaneous, or extemporaneously generated, speech contains disfluencies and is much more difficult to recognize than speech read from a script. Some systems require speaker enrollment (a user must provide samples of his or her speech before using them), whereas other systems are said to be speaker-independent, in that no enrollment is necessary.

Some of the other parameters depend on the specific task. Recognition is generally more difficult when vocabularies are large or contain many similar-sounding words. When speech is produced as a sequence of words, language models or artificial grammars are used to restrict the combinations of words. The simplest language model can be specified as a finite-state network, in which the permissible words following each word are given explicitly. More general language models approximating natural language are specified in terms of a context-sensitive grammar.

One popular measure of the difficulty of the task, combining the vocabulary size and the language model, is perplexity, loosely defined as the geometric mean of the number of words that can follow a word after the language model has been applied. In addition, some external parameters can affect the performance of a speech recognition system, including the characteristics of the environmental noise and the type and placement of the microphone.
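
To make the finite-state language model and the loose definition of perplexity more concrete, here is a minimal Python sketch. The toy voice-dialing network, the `<start>` and `<end>` markers, and the function names `is_permitted` and `branching_perplexity` are all illustrative assumptions and do not come from the survey. The sketch also assumes that every permitted successor is equally likely, which real language models generally do not.

```python
import math

# Hypothetical finite-state network for a toy voice-dialing task:
# each word maps to the list of words allowed to follow it.
NETWORK = {
    "<start>": ["call", "dial"],
    "call": ["home", "work", "mom", "dad"],
    "dial": ["home", "work"],
    "home": ["<end>"],
    "work": ["<end>"],
    "mom": ["<end>"],
    "dad": ["<end>"],
}


def is_permitted(words, network=NETWORK):
    """Return True if the word sequence follows a path through the network."""
    path = ["<start>", *words, "<end>"]
    return all(nxt in network.get(cur, []) for cur, nxt in zip(path, path[1:]))


def branching_perplexity(words, network=NETWORK):
    """Geometric mean of the number of permissible successors along a path.

    Simplification: all permitted successors are treated as equally likely.
    """
    if not is_permitted(words, network):
        raise ValueError("sequence is not accepted by the network")
    path = ["<start>", *words]
    # Average the log branching factors, then exponentiate (geometric mean).
    log_sum = sum(math.log(len(network[word])) for word in path)
    return math.exp(log_sum / len(path))


if __name__ == "__main__":
    print(is_permitted(["call", "home"]))   # accepted by the network
    print(is_permitted(["dial", "mom"]))    # "mom" may not follow "dial"
    print(branching_perplexity(["call", "home"]))  # approximately 2.0
```

For the sequence "call home", the branching factors along the path are 2, 4, and 1, so the geometric mean is the cube root of 8, or about 2. A larger vocabulary or a less restrictive network raises this figure, which matches the intuition that such tasks are harder to recognize.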

