Beruflich Dokumente
Kultur Dokumente
Agenda
Introduction Applications Working and Types FSM Problems Proposed Solution Results Conclusion References
Application of SR
There are innumerable applications. Some are Military Uses
Remote Command and Control Centers
(plane ,Satellite etc)
Health Care
Automated medical prescriptions WOW!!!
Educational Uses
Helps teachers and students too
Approaches of SR
Basically divided into 3
Acoustic Phonetic Approach (Works on phonemes) Pattern Recognition Approach ( Works on Patterns) Artificial Intelligence Approach ( Advanced Functionality)
Pattern Recognition
Pattern Recognition Works in 2 Phases
Pattern Training Comparison
Pattern Training is modeled by a FSM (Finite State Machines). In simple words Speech Templates are created and stored .
The speakers recognized words and the stored templates are compared and verified If Matched: Accept Not Matched :Reject
Solution :We can attach weights to them to improve recognition (This can work better )
Results
Notice how the results affect the accuracy
Type of Speech Normal Dictionary Speech Accuracy 50-90%
Choices (Customized)
Choices (General ) Individual Letters
90%
80% 30%
Customized Phonetics
70%
Conclusion
Speech is a natural way of Communication. Numerous applications of Speech are present. There are various approaches and they have their own Pros and Cons FSMs are one way to make job easier and better
There are lots of problems Recognition problems Integrity issues So , We need a platform independent framework that can solve these issues and make the life of speech developers easier.
References
[1] Wienstien C.J. Military and government applications of human-machine communication by voice. In Proceedings of the Natl. Acad. Sci. USA. Volume 92 10011 10016. October 1995. [2].Dat Tat Tran, Fuzzy Approaches to Speech and Speaker Recognition, A thesis submitted for the degree of Doctor of Philosophy of the university of Canberra. [3] R.K.Moore, Twenty things we still don t know about speech, Proc.CRIM/ FORWISS Workshop on Progress and Prospects of speech Research an Technology , 1994. [4].Sadaoki Furui, 50 years of Progress in speech and Speaker Recognition Research, ECTI Transactions on Computer and Information Technology, Vol.1. No.2 November 2005. [5]. Willie Walker .etal. Sphinx-4: A Flexible Open Source Framework for Speech Recognition http://cmusphinx.sourceforge.net/sphinx4 [6] M.A.Anusuya, Speech Recognition by Machine: A Review. In (IJCSIS) International Journal of Computer Science and Information Security, Vol. 6, No. 3, 2009 http://arxiv.org/ftp/arxiv/papers/1001/1001.2267.pdf [7] Neann Mathai, A Literature Survey of Speech Recognition and Hidden Markov Models. http://shenzi.cs.uct.ac.za/~honsproj/cgibin/view/2009/katz_mathai_sobey.zip/Speech_Katz_Mathai_Sobey/Downloads/NeannMathaiLiteratureSu rvey.pdf [8] Pavel Stemberk, Speech recognition based on FSM and HTK toolkits http://stembep.wz.cz/!papers/Zilina-dt04/zildt04.pdf [9] Steve Renals, Speech recognition. http://dsp-book.narod.ru/rec-notes.pdf