A study on different linear and non-linear filtering techniques of speech and speech recognition

Minajul Haque, Kaustubh Bhattacharyya

Abstract


In any signal noise is an undesired quantity, however most of thetime every signal get mixed with noise at different levels of theirprocessing and application, due to which the information containedby the signal gets distorted and makes the whole signal redundant.A speech signal is very prominent with acoustical noises like bubblenoise, car noise, street noise etc. So for removing the noises researchershave developed various techniques which are called filtering. Basicallyall the filtering techniques are not suitable for every application,hence based on the type of application some techniques are betterthan the others. Broadly, the filtering techniques can be classifiedinto two categories i.e. linear filtering and non-linear filtering.In this paper a study is presented on some of the filtering techniqueswhich are based on linear and nonlinear approaches. These techniquesincludes different adaptive filtering based on algorithm like LMS,NLMS and RLS etc., Kalman filter, ARMA and NARMA time series applicationfor filtering, neural networks combine with fuzzy i.e. ANFIS. Thispaper also includes the application of various features i.e. MFCC,LPC, PLP and gamma for filtering and recognition.

Full Text:

PDF

References


Kalandharan N, “Speech Enhancement by Spectral Subtraction Method”, International Journal of Computer Applications (0975 – 8887) Volume 96– No.13, June 2014

L. Wang, Z. Zhang, and A. Kai, “Hands-free speaker identification based on spectral subtraction using a multi-channel least mean square approach”, IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, 2013, pp. 7224– 7228

R.V.Pawar and R.M.Jalenkar, “Review on speech production model” International jour nal of engineering and innovation technology (IJEIT), Volume 3, Issue 9, March 2014

Tohkura.Y, “Mathematical modeling of soeech production and its application to noise cancellation”, IEEE transaction ASSP, Vol. ASSP-35

Coker, C. H., “A model of articulatory dynamics and control”, Proc. IEEE, pp. 452-460, 1989

Zhe Chen, Dan Shi and Fuliang Yin, “Dynamic Noise Reduction Algorithm Based on Time-variety Filter”, Awareness Science and Technology (iCAST), 2011 3rd Interna-tional Conference, ISBN Information: INSPEC Accession Number: 12577337 DOI: 10.1109/ICAwST.2011.6163085

Kanika Garg and Goonjan Jain, “A Comparative Study of Noise Reduction Techniques for Automatic Speech Recognition Systems”, International Conference on Advances in Computing, Communications and Informatics (ICACCI), Sept. 21-24, 2016, Jaipur, India

G.Smith et al., “Subspace technique in speech enhancement”, Neural Networks for Signal Processing IX, 1999. Proceedings of the 1999 IEEE Signal Processing Society Workshop

Shuping Lv and Cheng Zhang, “Blind Signal Separation for Speech Signals With Noise”, Proceedings of 2014 IEEE International Conference on Mechatronics and Automation August 3 - 6, Tianjin, China

Yang Liu et al., “Speech enhancement in instantaneous amplitude and phase or application in noisy reverberant environments”, Speech communication, doi: 10.10.16/j.specom.2016.08.002

Jacob Benesty et al., “ A brief review of speech enhancement with linear filtering”, EURASIP journal on advance in signal processing, 2014

Vartika Anand et al., “Intelligent Adaptive Filtering For Noise Cancellation”, Inter-national Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, ISSN ONLINE(2278-8875) PRINT (2320-3765)

Monson H.hayes, “Statistical digital signal processing and modeling”, chapter 4 and 9, Wiley edition

Jingdong Chen et al., “New Insights Into the Noise Reduction Wiener Filter”, IEEE transaction on audio, speech and language processing, Vol.14, No.4, July 2006

Lalit P.patil et al., “Efficient algorithm for speech enhancement using adaptive fil-ter”, International journal of electrical, electronics and computer engineering 3(1):98-103(2014)

Sayed A. Hadei, M. lotfizad,“A Family of Adaptive Filter Algorithms in Noise Cancellation for Voice recognition” IJCEE, Vol.2, No.2, April 2010

Jyoti Dhiman et al., “Comparison between adaptive filter algorithm (LMS, NLMS, RLS)”, International journal of science, engineering and technology research (IJSETR), Volume 2, Issue 5, may 2013

Sivaranjan Goswami et al., “A novel approach for design of a speech enhancement system using auto-trained NLMS adaptive filter”, International journal of information and communication technology, Vol.6, Nos.3/4, 2014

Anuja N. Untwale and Kishor S. Degaonkar, “Survey on noise cancellation technique of speech signal by adaptive filtering”, Internation conference on pervasive computing (ICPC), 2015

M.Mathe and S.P.Nadyala, “ Speech enhancement using kalman filter for white, random and color noise”, Signal processing conference, 2000

M.S.Kavalekalam et al., “Kalman filter for speech enhancement in cocktail party sce-narios using a codebook-based approach”, International conference on advance statistical signal processing, 2016

Kishor Odugu and B.M.S.Rao, “ A new speech enhancement using gammatone filters and perceptual wiener filtering based on sub banding”, International conference on signal processing and communication, 2013, IEEE publisher

Simon J. Godsill, “Robust modeling of noisy ARMA signals”, IEEE international con-ference on acoustic, speech and signal processing 1997

Cliston Cole et al. “Increasing Additive Noise Removal in Speech Processing Using Spectral Subtraction”, Fifth International Conference on Information Technology: New Generations, April

, ISBN Information: INSPEC Accession Number: 9926122 DOI: 10.1109/ITNG.2008.86

Rayan Kutty P.P and Sreenivasa Murthy A, “Kalman filter using quantile based noise estimation for audio restoration”, Proceedings of ICETECT 2011

M.Gabrea and D.O’shaughnessy, “Speech signal recovery in white noise using and adaptive kalman filter”, Journal of mechanical science and technology, 23 march 2007, DOI:10.1007/BF02916349

Zhe Chen, Dan Shi and Fuliang Yin, “Dynamic Noise Reduction Algorithm Based on Time-variety Filter”, 3rd International conference on awareness science and tech nology (ICAST), ISBN Information: INSPEC Accession Number: 12577337 DOI: 10.1109/ICAwST.2011.6163085 Publisher: IEEE

Hwa Soo Kim et al., “Speech enhancement via mel-scale wiener filtering with a frequency-wise voice activity detection”, Signal processing conference, 2000, 10th European

Harry Levitt, “Noise reduction in hearing aids: a review”, Journal of Rehabilitation research and Development Vol. 38 No. 1, January/February 2001, Pages 111–121

Premanada B S et al., “Speech Enhancement Algorithm to Reduce the Effect of Background Noise in Mobile Phones”, International Journal of Wireless & Mobile Networks (IJWMN) Vol. 5, No. 1, February 2013

Paulo S.R.Diniz, “Adaptive filtering algorithms and practical implementation”, Third edition, ISBN: 978-0-387-31274-3 e-ISBN:, 978-0-387-68606-6, DOI: 10.1007/978-0-387-68606-6

BinWen Fan et al., “The improvement and realization of speech enhancement algorithm based on wiener filtering”, 8th Internation congress on image and signal processing (CISP 2015)

F.Ykhlef and L.Bendaouia, “Evaluation of time domain features for voiced/non-voiced classification of speech”, ICSES 2012-International conference on signal and electronic systems, WROCLAW @I 2012EEE

V.P.Patil, “Voice signal recognition using LMS and UNANR filtering”, IORD journal of science and technology, Volume 2, issue 2, pp 32-36

Steven F.Boll, “Suppression of Acoustic Noise in Speech Using Spectral Subtraction”, IEEE Transactions on acoustics, speech and signal processing, Vol. ASSP-27, No.2, April 1979

Santos S.Pratapwar, “Reduction of Background Noise in Alaryngeal Speech using Spec-tral Subtraction with Quantile Based Noise Estimation”7th World Multiconference on Systemics, Cybernetics and Informatics (SCI 2003), Orlando, USA, July 27-30, 2003 [37] Orchisama Das, Bhaswati Goswami and Ratna Ghosh, “Application of the Tuned Kalman Filter in Speech Enhancement”, 2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI)

K.K. Paliwal and B.S. Atal, “Frequency related representation of speech”, Proc. EU-ROSPEECH, p.p.65-68 Sep. (2003)

Wei HAN et al., “An efficient MFCC extraction method in speech recognition”, IEEE international symposium on circuits and system, 2006

Vibha Tiwari, “MFCC and its application in speaker recognition”, International journal on emerging technologies, ISSN:0975-8364

Sunil Kumar and M.Laxminarayana, “Choice of mel filter bank in computing MFCC of a resampled speech”, 10th international conference on information science, Signal processing and their applications (ISSPA 2010)

Utpal Bhattacharjee, “A Comparative Study Of LPCC And MFCC Features For The Recognition Of Assamese Phonemes”, International Journal of Engineering Research & Technology (IJERT), Vol. 2 Issue 1, January- 2013Vol. 2 Issue 1, January- 2013, ISSN: 2278-0181

Safdar Tanweer, Abdul Mobin and Afshar Alam, “Analysis of Combined Use of NN and MFCC for Speech Recognition”, World Academy of Science, Engineering and Technology International Journal of Computer, Electrical, Automation, Control and Information Engineering Vol:8, No:9, 2014 1736 International Scholarly and Scientific Research & Innovation 8(9) 2014 scholar.waset.org/1999.4/10000797 Inter-national Science Index, Computer and Information Engineering Vol:8, No:9, 2014 waset.org/Publication/10000797

[44] Neeraj Kaberpanthi and Ashutosh Datar, “Speaker Independent Speech Recognition using MFCC with Cubic-Log Compression and VQ Analysis”, International Journal of Computer Applications (0975 – 8887) Volume 95– No.26, June 2014

Siddhant C. Joshi, Dr. A.N.Cheeran, “MATLAB Based Feature Extraction Using Mel-Frequency Cepstrum Coefficients for Automatic Speech Recognition”, International Journal of Science, Engineering and Technology Research (IJSETR), Volume 3, Is-sue 6, June 2014

Rabiner, L. and Schafer, R., “Digital Processing of Speech Signals”. Prentice Hall, Inc., Englewood Cliffs, New Jersey, 1978

Lyne R.Palomar and Toshio Fukuda, “A comparative analysis of the topological struc-tres of different LPC feature-based speech models”, International conference on Neural Networks, 1999. IJCNN ‘99

Bishnu S. Atal and Joel R. Remde, “A new mpodel of LPC excitation for producing natural sounding speech at low bit rates”, IEEE international conference on acoustic, speech and signal processing, june 2013

R. L. K. Venkateswarlu et al., “Novelapproach for speech recognition by using self-organized maps”, International Journal of Computer Science & Information Technology (IJCSIT) Vol 3, No 4, August 2011

H. Hermansky, “Perceptual linear predictive (PLP) analysis of speech”, Acoustical society of America journal, vol. 87, pp.1738-1752, Apr.1990

Yang Shao et al., “AN auditory-based feature for robust speech recognition”, IEEE international conference on acoustical speecg signal processing, 2009

Jun Qi et al., “Auditory feature based on gammatone filters for robust speech recogni-tion”, IEEE international symposium oncircuits and systems (ISCAS), 2013

Aurelio Uncini, “Audion signal processing by Neural networks”, Elsevier, Volume55, Issue 3-4, pp.593-625

Marina- Anca Cidota, “Choosing the parameter of the NARMA model implemented with recurrent perceptron for speech prediction”, Cidotï¿oe, M. Neural Comput & Applic (2010) 19: 903. doi:10.1007/s00521-010-0375-7

Brockwell PJ, Davis RA (1987) ,“Time series: theory and methods”, Springer, New York

Sh. Oveisgharan and M.B. Shamollahi, “Speech modeling and voiced/unvoiced/mixed/silence speech segmentation with fractionally Gaussian noise based model”, International conference on acoustical speech signal processing, 2004

Danilo P. Mandic et al., “Advance RNN based NARMA predictors”, Journal of VLSI signal processing sytem for signal, image and video technology, Volume 26, pp 105-117, 2010

Anna Esposito et al., “Designing a Fast Neuro-fuzzy System for Speech Noise Cancellation”, Book Title MICAI 2000: Advances in Artificial Intelligence Pages pp 482-492 Copyright 2000 DOI 10.1007/10720076_44 Print ISBN 978-3-540-67354-5 Online ISBN 978-3-540-45562-2

Jay Kumar et al., “environmental noise cancellation by using neuro fuzzy adaptive filtering”, 2015 Fifth International Conference on Communication Systems and Network Technologies, IEEE publisher

Sachin Lakra et al., “Selective noise filtering of speech signals using an adaptive neuro-fuzzy inference system as a frequency pre-classifier”, Journal of Theoretical and Applied Information Technology 30th November 2015. Vol.81. No.3

Mohammed Hussein Miry et al., “Adaptive Noise Cancellation for speech Employing Fuzzy and Neural Network”, Iraq J. Electrical and Electronic Engineering, Vol. 7, No. 2, 2011

Jasmin Thevaril and H. K. Kwan, “Speech enhancement using Adaptive Neuro-Fuzzy Filtering”, Proceedings of 2005 International Symposium on Intelligent Signal Process-ing and Communication Systems, December 13-16, 2005 Hong Kong

Kunjithapatham Meena et al., “Gender Classification in Speech Recognition using Fuzzy Logic and Neural Network”, The International Arab Journal of Information Tech-nology, Vol. 10, No. 5, September 2013

T.Meera Devi et al. “Environmental Noise Classification and Cancellation using Fuzzy Classifier and Fuzzy Adaptive Filters”, IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 2, No 2, March 2012

Radek Martinek et al., “A robust approach for acoustic noise suppression in speech using ANFIS ”, Journal of electrical engineering, Vol. 66, No. 6, 2015, 301–310

J.Benesty et al., “Speech enhancement”, Springer series on signals and communication technology, ISBN 3-540-24039

Yi Hu, “A simulation study of harmonics regeneration in noise reduction for electric and acoustic stimulation”, The Journal of the Acoustical Society of America ï¿oe May 2010 DOI: 10.1121/1.3372718


Refbacks

  • There are currently no refbacks.


------------------------------------------------------------------------------------------------------------------------

The “ADBU Journal of Engineering Technology (AJET)" ISSN:2348-7305

This journal is published under the terms of the Creative Commons Attribution (CC-BY) (http://creativecommons.org/licenses/)

Number of Visitors to this Journal:web counter