Beruflich Dokumente
Kultur Dokumente
ABSTRACT
Most natural form of human communication depends on speech. In order to understand human speech by enabling machines,
computers can act as an intermediate for human expert, so that it can respond accurately and reliably to human voices. This can
be achieved by speech recognition system, which permits a data processor to identify the words a person speaks in a
microphone or telephone, and converts them into written text. Speech Recognition (SR) and its application is the thrust area in
research during the past three decades which is carried out on various aspects, especially in the field of Information and
Communication Technology (ICT) for speeding up scientific advancements. Also, people interact with system to utilize the
technology in greater amount without the acquaintance of operating computer keyboard. At present, Automatic Speech
Recognition (ASR) is effectively utilized for communication between human and machines. This paper analysis the accuracy of
feature extraction based on modeling which is implemented using MFCC and HMM for two different type connected and
continuous speech. The recognition result shows that the overall system accuracy for connected word is 69.22 % and continuous
word is 50%.
Keywords Speech Recognition, Analysis, Feature Extraction, Training, Testing, Modeling
TABLE IX
Authors Dataset Techniques
V.Naveen Kumar and et Stuttered speech is recorded Frame length-20ms Pre-Emphasis- FIR filter Hamming Window
al. (2015) [42] Mel Frequency Wrapping Discrete Cosine Transform (DCT) VQ
DTW
Hazrat Ali et al. (2015) Urdu Digits recorded by 10 speakers of both MFCC, Support Vector Machines (SVM) Random Forests Linear
[13] male and female Discriminant Analysis
Kishori R. Ghule Medicinal plants names recorded by 100 speaker Extraction by Discrete Wavelet Transforms(DWT), ANN
et al. (2015) [21] of different age group from 20-50
Thiang and et al. (2011) Robot control words are recorded LPC , ANN Back propagation method is used for training
[39]
Mousmita Sarma Three group of Assamese Language Dardic Recurrent Neural Networks (RNNs) Multi-Layer Perceptron
et al. (2009) [28] (Pisacha), the Indic (Indo-Aryan) and Iranian (MLP) Levenberg-Marquardth Back Propagation Algorithms
Azam Beg and et 0-9 Urdu digits recorded by 15 speakers Multi-Layer Feed-Forward Networks Fast Fourier Transformer
al.(2008) [4] (FFT) Discrete Fourier Transformer (DFT)
DIFFERENT TECHNIQUES FOR RECOGNITION SYSTEM
V. RESULT AND DISCUSSION [5] Bacha Rehmam, Zahid Halim, Ghulam Abbas, Tufail
Muhammad Artificial Neural Network-Based
The dataset consists of 13 connected words and 14
Speech Recognition Using DWT Analysis Applied on
continuous words which are recorded by 7 male speakers
Isolated Words from Oriental Languages, Malaysian
and 7 female speakers and the system is trained with 29
Journal of Computer Science, Vol. 28(3),2015,
words totally. The recognition accuracy is calculated for
pp.242-262.
connected and continuous words, also overall recognition
[6] C.Sivaranjani, B. Bharathi, Syllable Based
rate for connected and continuous speech using MFCC
Continuous Speech Recognition for Tamil Language,
with HMM model is shown in Table XI.
International Journal of Advanced Engineering
VI. CONCLUSION AND FUTURE WORK Technology, Vol. 7, Issue 1, E-ISSN 0976-3945,
March 2016, pp. 01-04.
[7] Cini Kuriana, Kannan Balakrishnan, Development &
Speech Recognition has been in development of
evaluation of different acoustic models for Malayalam
more than 60 years. The various speech recognition
continuous speech recognition, in Proceedings of
methodologies and approach is available to enhance the
International Conference on Communication
recognition system. The fundamentals of SR system,
Technology and System Design 2011 Published by
various approaches existing for developing an ASR
Elsevier Ltd, December 2011, pp.1081-1088.
system are explained and compared in this paper. In recent
[8] Desai Vijayendra and Dr.Vishvjit K. Thakar, Neural
years large vocabulary independent continuous speech has
Network based Gujarati Speech Recognition for
highly enhanced. From the review, it is concluded that
Dataset Collected by in-ear Microphone, in
HMM based MFCC feature is more suitable for speech
Proceedings of 6th International Conference on
recognition requirements and produces more good results
Advances in Computing & Communications ICACC
than other models. In this paper, MFCC feature is
2016, Cochin, ISSN: 1877-0509, 6-8th September
extracted and the speech is trained by HMM model which
2016, pp.668-675.
is implemented for both connected and continuous speech.
[9] Dufour, R., Jousse, V.,Estve, Y., Bchet, F.,Linars,
In order to improve the accuracy, other modeling
G., Spontaneous Speech Characterization and
techniques will be implemented in future.
Detection in Large Audio Database, in Proceedings
of 13th International Conference on Speech and
Computer (SPECOM 2009), St Petersburg(Russia),
REFERENCES
21-25th June 2009.
[1] A.Srinivasan, K.Srinivasa Rao, K.Kannan and [10] Geeta Nijhawan and Dr. M.K Soni, Real Time
D.Narasimhan, Speech Recognition of the letter Speaker Recognition System for Hindi Words,
zha(H) in Tamil Language using HMM, International Journal of Information Engineering and
International Journal of Engineering Science and Electronic Business, Vol. 6, DOI:
Technology, Vol.1(2),2009, pp.67-72. 10.5815/ijieeb.2014.02.04, April 2014, pp. 35-40.
[2] AN. Sigappi and S. Palanivel, Spoken Word [11] Harpreet Kaur and Rekha Bhatia, Speech
Recognition Strategy for Tamil Language, Recognition System for Punjabi Language,
International Journal of Computer Science International Journal of Advanced Research in
Issues( IJCSI), Vol. 9, Issue 1, No 3, ISSN Computer Science and Software Engineering, Vol. 5,
(Online):1694-0814 , January 2012, pp.227-233. Issue 8, ISSN: 2277 128X, August 2015, pp. 566-573.
[3] Annu Choudhary, Mr. R.S. Chauhan, Mr. Gautam [12] Haim Sak, Andrew Senior, Kanishka Rao, Franoise
Gupta, Automatic Speech Recognition System for Beaufays and Johan Schalkwyk (September 2015):
Isolated & Connected Words of Hindi Language By Google voice search: faster and more accurate.
Using Hidden Markov Model Toolkit (HTK), in [13] Hazrat Ali, An Jianwei and Khalid Iqbal, Automatic
Proceedings of International Conference on Speech Recognition of Urdu Digits with Optimal
Emerging Trends in Engineering and Technology, Classification Approach, International Journal of
DOI: 03.AETS.2013.3.234, 22-24th February 2012, Computer Applications, Vol.118, No.9, May 2015, pp.
pp.244 252. 1-5.
[4] Azam Beg and S. K. Hasnain, A Speech Recognition [14] http://www.pcworld.com/article/243060/speech_reco
System for Urdu Language, in Lecture Notes in gnition_through_the_decades_how_we_ended_up_wi
Computer Science (LNCS) Wireless Networks, th_siri.html
Information Processing and Systems: Springer, 2008, [15] I. Mohamed Kalith, David Ashirvatham, Samantha
pp. 118-126. Thelijjagoda, Isolated to Connected Tamil Digit
Speech Recognition System Based on Hidden Markov
Model, International Journal of New Technologies iDravadian , Vol. 15 ISSN: 2229-6913 (Print),
in Science and Engineering, Vol.3, Issue 4, December 2014, pp. 22-26.
ISSN:2231-5381, April 2016, pp.51-60. [25] Md. Akkas Ali, Manwar Hossain, Mohammad
[16] Jitendra Singh Pokhariya and Dr. Sanjay Mathur, Nuruzzaman Bhuiyan, Automatic Speech
Sanskrit Speech Recognition using Hidden Markov Recognition Technique for Bangla Words,
Model Toolkit, International Journal of Engineering International Journal of Advanced Science and
Research & Technology (IJERT),Vol.3, Issue 10, Technology, Vol. 50, January, 2013, pp.51-60.
ISSN: 2278-0181, October-2014, pp.93-98. [26] Megha Agrawal, Tina Raikwar, Speech Recognition
[17] John Butzberger, Hy Murveit, Elizabeth Shriberg, Using Signal Processing Techniques, International
Patti Price, Spontaneous Speech Effects In Large Journal of Engineering and Innovative Technology
Vocabulary Speech Recognition Applications, in (IJEIT), Vol.5, Issue8, ISSN: 2277-3754, February
Proceedings of DARPA Speech and Natural 2016, pp. 65-68.
Language Workshop, Morgan Kaufmann, New York, [27] Mohamed S. Abdo and Ahmed H. Kandil, Semi-
1992, pp.339-343. Automatic Segmentation System for Syllables
[18] K.P.Unnikrishnan, John J. Hopfield and David Extraction from Continuous Arabic Audio Signal,
W.Tank, Connected-Digit Speaker-Dependent International Journal of Advanced Computer Science
Speech Recognition Using a Neural Network with and Applications (IJACSA), Vol.7, No.1, 2016,
Time-Delayed Connections, IEEE Transaction on pp.535-540.
Signal Processing,Vol.39, [28] Mousmita Sarma, Krishna Dutta and Kandarpa
No.3,DOI:10.1109/78.80888, March 1991, pp.698- Kumar Sarma, Assamese Numeral Corpus for
713. Speech Recognition using Cooperative ANN
[19] K.Venkataramana, P.Jayaprakash, P.S. Nagendra Architect, International Journal of Electrical and
Babu Developing Telugu Speech Recognition Electronics Engineering,Vol.3,Issue8,Nov
System using Sphinx-4, in Proceedings of National 2009,pp.456-465.
Conference on Emerging Trends in Computing, [29] Ms.Vimala.C and Dr.V.Radha, Speaker Independent
Communication & Control Engineering, Chennai, Isolated Speech Recognition System for Tamil
ISSN (Online):23198753, 10 September 2015, Language using HMM, in Proceedings International
pp.41-48. Conference on Communication Technology and
[20] Khalil Ahammad, Md. Mahfuzur Rahman, System Design 2011, Procedia Engineering 30 ISSN:
Connected Bangla Speech Recognition using 1877-7058, 13March2012, pp.1097 1102.
Artificial Neural Network, International Journal of [30] Parwinder Kaur, Ranpreet Kaur, Amanpreet Kaur ,
Computer Applications, Vol.149, No.9, ISSN 0975 Detection of Syllables in Continuous Punjabi Speech
8887, September 2016, pp.38- 41. Signal and Extraction of Formant Frequencies of
[21] Kishori R. Ghule and Ratnadeep R. Deshmukh, Vowels, International Journal of Advanced
Automatic Speech Recognition of Marathi isolated Research in Computer Science and Software
words using Neural Network, International Journal Engineering, Vol. 5, Issue 4, ISSN: 2277 128X,
of Computer Science and Information Technologies 2015,pp.1091-1095.
(IJCSIT), Vol. 6(5), ISSN: 0975-9646 2015, pp.4296- [31] Preeti Saini, Parneet Kaur, Mohit Dua, Hindi
4298. Automatic Speech Recognition Using HTK,
[22] Kuldeep Kumar and R.K. Aggarwal, A Hindi speech International Journal of Engineering Trends and
recognition system for connected words using HTK, Technology (IJETT), Vol.4, Issue 6, ISSN:2231-
International Journal Computational Systems 5381, June 2013, pp.2223-2229.
Engineering, Vol.1, No.1, Online ISSN: 2046-3405, [32] Purnima Pandit, Shardav Bhatt Automatic Speech
2012, pp.25-32. Recognition of Gujarati digits using Dynamic Time
[23] Lakshmi A, Hema A Murthy, A Syllable Based Warping, International Journal of Engineering and
Continuous Speech Recognizer For Tamil, in Innovative Technology (IJEIT), Vol.3, Issue 12, ISSN:
Proceedings of the Ninth International Conference on 2277-3754, June 2014, pp.69-73.
Spoken Language Processing, INTERSPEECH 2006 [33] Satyanand Singh and Dr. E.G. Rajan, Vector
(ICSLP), Pittsburgh, Pennsylvania, USA,17-21st Quantization Approach for Speaker Recognition using
September 2006, pp.1878-1881. MFCC and Inverted MFCC, International Journal of
[24] Maya Moneykumar, Elizabeth Sherly, Win Sam Computer Applications, Vol.17, No.1, ISSN: 0975
Varghese, Malayalam Word Identification for 8887, March 2011, pp.1-7.
Speech Recognition System An International [34] Shaik Shafee, B.Anuradha, Isolated Telugu Speech
Journal of Engineering Sciences, Special Issue Recognition using MFCC and Gamma tone features
by Radial Basis Networks in Noisy Environment,