
ARTIFICIAL INTELLIGENCE

FOR
SPEECH RECOGNITION

By
S.V.S.MANEESH
CONTENTS

• Introduction
• What is speech recognition
• How it works
• Components
• ASR Architecture
• Benefits
• Drawbacks
• Conclusion
INTRODUCTION

• Speech recognition is the process of converting an acoustic signal, captured by a
microphone or a telephone, into a sequence of words.
• The recognized words can be an end in themselves, as in applications such as command and
control, data entry, and document preparation.
• They can also serve as the input to further linguistic processing in order to achieve
speech understanding.
• It is also known as automatic speech recognition (ASR), computer speech
recognition (CSR), or speech to text (STT).
WHAT IS SPEECH RECOGNITION?

• Speech recognition (SR) is the ability to translate dictation or spoken words into text.
• Also known as "automatic speech recognition" (ASR), "computer speech recognition" (CSR), or
"speech to text" (STT).
HOW IT WORKS
ACOUSTIC MODEL

• An acoustic model in ASR represents the relationship between an audio signal
and the phonemes of the language.
• This model is learned from a set of audio recordings and their text transcriptions.
• It is created by taking the audio recordings of the speech, and their text transcriptions,
and using software to create statistical representations of sounds that make up each
word.
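The idea of "statistical representations of sounds" can be illustrated with a deliberately tiny sketch. It assumes each audio frame has already been reduced to a single number and labeled with its phoneme; the feature values, the function names, and the one-Gaussian-per-phoneme model are all assumptions made for illustration, not the method of any particular ASR system.

```python
import math
from collections import defaultdict
from statistics import mean, pvariance


def train_acoustic_model(labeled_frames):
    """Estimate a (mean, variance) pair per phoneme from (phoneme, feature) pairs."""
    by_phoneme = defaultdict(list)
    for phoneme, value in labeled_frames:
        by_phoneme[phoneme].append(value)
    model = {}
    for phoneme, values in by_phoneme.items():
        # Floor the variance so a phoneme with identical samples stays usable.
        model[phoneme] = (mean(values), max(pvariance(values), 1e-3))
    return model


def log_likelihood(value, mu, var):
    """Log density of a one-dimensional Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (value - mu) ** 2 / var)


def classify_frame(model, value):
    """Return the phoneme whose Gaussian best explains the feature value."""
    return max(model, key=lambda p: log_likelihood(value, *model[p]))


# Hypothetical training data: made-up one-dimensional features per frame.
training = [
    ("aa", 1.0), ("aa", 1.2), ("aa", 0.8),
    ("iy", 3.0), ("iy", 3.2), ("iy", 2.8),
]
model = train_acoustic_model(training)
print(classify_frame(model, 1.1))  # prints "aa"
```

Real acoustic models work on multi-dimensional features and far richer statistical models, but the training step has the same shape: recordings plus transcriptions in, per-sound statistics out.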
COMPONENTS

• Audio input
• Grammar
• Speech Recognition Engine
• Acoustic model
• Recognized text
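How these components might fit together can be sketched in a few lines. The example assumes the engine has already used the acoustic model to produce scored text candidates from the audio input, and that the grammar is simply a set of allowed phrases; the `recognize` function, the phrases, and the scores are hypothetical.

```python
def recognize(candidates, grammar):
    """Pick the best-scoring candidate that the grammar allows.

    candidates: list of (text, acoustic_score) pairs, higher score is better.
    grammar: set of phrases the application accepts.
    Returns the recognized text, or None if no candidate is allowed.
    """
    allowed = [(text, score) for text, score in candidates if text in grammar]
    if not allowed:
        return None
    best_text, _ = max(allowed, key=lambda pair: pair[1])
    return best_text


# Hypothetical grammar for a small command-and-control application.
grammar = {"open file", "close file", "save file"}

# Hypothetical engine output: "open vile" scores best acoustically,
# but the grammar rules it out.
candidates = [("open vile", -4.0), ("open file", -4.5), ("close file", -9.0)]

print(recognize(candidates, grammar))  # prints "open file"
```

The point of the sketch is the division of labor: the acoustic model scores what was heard, and the grammar restricts the result to what the application can act on.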
ASR ARCHITECTURE
PHONEMES

• Distinct units of sound in a given language that distinguish one word from another.
• Example:
• Word: probably
• Phonemes: p r aa b iy (a reduced, casual pronunciation; the full form is p r aa b ah b l iy)
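A pronunciation dictionary makes the definition concrete. The sketch below stores each word as a list of phoneme symbols and shows how a single differing phoneme separates two words. The dictionary name, the helper function, and the "pat"/"bat" entries are assumptions added for illustration; the entry for "probably" follows the example above, with the initial cluster written as separate p and r symbols.

```python
# Hypothetical pronunciation dictionary: word -> list of phoneme symbols.
PRONUNCIATIONS = {
    "probably": ["p", "r", "aa", "b", "iy"],  # reduced form from the example
    "pat": ["p", "ae", "t"],
    "bat": ["b", "ae", "t"],
}


def differing_positions(phones_a, phones_b):
    """Return the indices where two equal-length phoneme sequences differ.

    Returns None when the sequences have different lengths.
    """
    if len(phones_a) != len(phones_b):
        return None
    return [i for i, (a, b) in enumerate(zip(phones_a, phones_b)) if a != b]


# "pat" and "bat" differ only in their first phoneme, which is exactly
# what makes /p/ and /b/ distinct phonemes.
print(differing_positions(PRONUNCIATIONS["pat"], PRONUNCIATIONS["bat"]))  # prints [0]
```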
BENEFITS

• Security
• Productivity
• Accessibility for users with physical or visual disabilities
• Usability of other languages increases
• Personal voice macros can be created
DRAWBACKS

• If the system has to work under noisy environments, background noise may corrupt the
original data.
• Words that are pronounced alike (for example, "their" and "there") are difficult for the
technology to distinguish.
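The difficulty with similar-sounding words can be shown with a small sketch: when two words map to the same phoneme sequence, sound alone cannot tell them apart. The lexicon entries and the `find_homophones` helper are hypothetical and use simplified phoneme symbols.

```python
from collections import defaultdict

# Hypothetical lexicon with simplified phoneme symbols.
LEXICON = {
    "their": ["dh", "eh", "r"],
    "there": ["dh", "eh", "r"],
    "this": ["dh", "ih", "s"],
}


def find_homophones(lexicon):
    """Group words that share exactly the same phoneme sequence."""
    groups = defaultdict(list)
    for word, phones in lexicon.items():
        groups[tuple(phones)].append(word)
    # Keep only sequences shared by more than one word.
    return [sorted(words) for words in groups.values() if len(words) > 1]


print(find_homophones(LEXICON))  # prints [['their', 'there']]
```

Because the acoustic evidence is identical for both words, a recognizer has to rely on something else, such as the surrounding words or a grammar, to choose between them.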
CONCLUSION

• Speech recognition can revolutionize the way people conduct business over the web and
differentiate world-class e-businesses.
• VoiceXML ties speech recognition and telephony together.
• Voice-enabled web solutions TODAY!
