FOR SPEECH RECOGNITION
By S.V.S. MANEESH
CONTENTS
• Introduction
• What is speech recognition
• How it works
• Components
• ASR Architecture
• Benefits
• Drawbacks
• Conclusion
INTRODUCTION
• An acoustic model in automatic speech recognition (ASR) represents the relationship
between an audio signal and the phonemes that make up speech.
• The model is learned from a set of audio recordings and their text transcriptions.
• Software processes the recordings together with their transcriptions to build
statistical representations of the sounds that make up each word.
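The idea of "statistical representations of sounds" can be illustrated with a toy sketch. It assumes each audio frame has already been reduced to a single number and labelled with its phoneme; the feature values and the helper names (`train_acoustic_model`, `most_likely_phoneme`) are made up for illustration. Production systems typically use many features per frame and far richer models, so this is only a sketch of the principle, not how any particular ASR engine is built.

```python
import math
from collections import defaultdict


def train_acoustic_model(labeled_frames):
    """Estimate a mean and variance of the feature value for each phoneme."""
    by_phoneme = defaultdict(list)
    for phoneme, feature in labeled_frames:
        by_phoneme[phoneme].append(feature)

    model = {}
    for phoneme, values in by_phoneme.items():
        mean = sum(values) / len(values)
        var = sum((v - mean) ** 2 for v in values) / len(values)
        # Floor the variance so a phoneme with identical frames stays usable.
        model[phoneme] = (mean, max(var, 1e-6))
    return model


def log_likelihood(feature, mean, var):
    """Log density of a 1-D Gaussian with the given mean and variance."""
    return -0.5 * (math.log(2 * math.pi * var) + (feature - mean) ** 2 / var)


def most_likely_phoneme(model, feature):
    """Return the phoneme whose Gaussian best explains the feature value."""
    return max(model, key=lambda p: log_likelihood(feature, *model[p]))


# Hypothetical training data: (phoneme label, one-number audio feature).
frames = [("aa", 1.0), ("aa", 1.2), ("aa", 0.8),
          ("iy", 3.0), ("iy", 3.1), ("iy", 2.9)]
model = train_acoustic_model(frames)
```

Calling `most_likely_phoneme(model, 1.1)` picks the phoneme whose learned statistics are closest to the new frame.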
COMPONENTS
• Audio input
• Grammar
• Speech Recognition Engine
• Acoustic model
• Recognized text
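One possible way these components fit together is sketched below. The class and function names are hypothetical, and the acoustic model is replaced by a stand-in that returns a fixed list of candidate words; the sketch only shows the data flow from audio input, through the engine with its acoustic model and grammar, to recognized text.

```python
from dataclasses import dataclass
from typing import Callable, List, Set


@dataclass
class SpeechRecognitionEngine:
    # Maps audio samples to candidate words, best guess first (assumed interface).
    acoustic_model: Callable[[List[float]], List[str]]
    # The set of words the application is prepared to accept.
    grammar: Set[str]

    def recognize(self, audio_input: List[float]) -> str:
        """Return the best candidate allowed by the grammar, or '' if none is."""
        for candidate in self.acoustic_model(audio_input):
            if candidate in self.grammar:
                return candidate
        return ""


def toy_acoustic_model(audio: List[float]) -> List[str]:
    # Stand-in only: a real acoustic model would score the audio itself.
    return ["write", "right", "light"]


engine = SpeechRecognitionEngine(acoustic_model=toy_acoustic_model,
                                 grammar={"left", "right"})
recognized_text = engine.recognize([0.1, 0.3, -0.2])
```

Here the grammar rejects the top candidate "write" because the application only expects "left" or "right", which illustrates why a grammar is listed as a separate component.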
ASR ARCHITECTURE
PHONEMES
• Phonemes are the distinct units of sound in a given language that distinguish one word from another.
• Example:
• Word : probably
• Phoneme : pr aa b iy
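A recognizer usually needs a lookup from words to phoneme sequences. The sketch below assumes a small hand-written lexicon: the entry for "probably" is copied from the slide as written, while the entries for "see" and "bee" and the function names are illustrative additions.

```python
# Toy pronunciation lexicon: word -> list of phoneme symbols.
LEXICON = {
    "probably": ["pr", "aa", "b", "iy"],  # transcription as given on the slide
    "see": ["s", "iy"],                   # illustrative entry (assumption)
    "bee": ["b", "iy"],                   # illustrative entry (assumption)
}


def to_phonemes(word):
    """Look up the phoneme sequence for a word, ignoring letter case."""
    return LEXICON[word.lower()]


def differing_positions(word_a, word_b):
    """Return the positions at which two pronunciations differ (compared up to the shorter length)."""
    pairs = zip(to_phonemes(word_a), to_phonemes(word_b))
    return [i for i, (a, b) in enumerate(pairs) if a != b]
```

For "see" and "bee", `differing_positions` reports only position 0: a single phoneme is what separates the two words.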
BENEFITS
• Security
• Productivity
• Accessibility for users with physical or visual impairments
• Usability of other languages increases
• Personal voice macros can be created
DRAWBACKS
• In noisy environments, background noise may corrupt the original speech signal.
• Words that are pronounced alike (homophones), for example "their" and "there", are
difficult for this technology to distinguish.
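Because homophones sound the same, the audio alone cannot separate them. One common mitigation, which the slides do not describe, is to look at the neighbouring words. The sketch below assumes a tiny table of word-pair counts; the counts and the function name `pick_spelling` are invented purely for illustration.

```python
# Hypothetical counts of how often one word follows another in some text corpus.
BIGRAM_COUNTS = {
    ("over", "there"): 9,
    ("over", "their"): 1,
    ("their", "house"): 8,
    ("there", "house"): 1,
}


def pick_spelling(previous_word, candidates, next_word):
    """Choose the homophone that best fits between its neighbouring words."""
    def score(word):
        return (BIGRAM_COUNTS.get((previous_word, word), 0)
                + BIGRAM_COUNTS.get((word, next_word), 0))

    # sorted() makes the result deterministic when scores are tied.
    return max(sorted(candidates), key=score)
```

With these counts, "over ___" favours "there", while "___ house" favours "their".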
CONCLUSION
• Speech recognition can revolutionize the way people conduct business over the web and
can differentiate world-class e-business.
• VoiceXML ties speech recognition and telephony together.
• Voice-enabled web solutions are available today.