A study on different linear and non-linear filtering techniques of speech and speech recognition

Minajul Haque, Kaustubh Bhattacharyya

Abstract


Noise is an undesired component of any signal. In practice, however, signals are contaminated by noise at various stages of processing and application; this distorts the information the signal carries and can render the whole signal unusable. Speech signals are particularly susceptible to acoustic noise such as babble noise, car noise, and street noise. To remove such noise, researchers have developed a variety of techniques collectively known as filtering. No single filtering technique suits every application, so the best choice depends on the application at hand. Broadly, filtering techniques fall into two categories: linear filtering and non-linear filtering. This paper presents a study of several filtering techniques based on linear and non-linear approaches. These techniques include adaptive filtering based on algorithms such as LMS, NLMS, and RLS; the Kalman filter; ARMA and NARMA time-series models applied to filtering; and neural networks combined with fuzzy logic, i.e. ANFIS. The paper also covers the application of various features, namely MFCC, LPC, PLP, and gammatone, to filtering and recognition.
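To make the adaptive-filtering idea concrete, the following is a minimal sketch of an NLMS noise canceller in Python. It is not taken from the paper: the two-input setup (a primary input carrying speech plus noise, and a reference input carrying correlated noise only), the function names `nlms_cancel` and `demo`, the step size, the tap count, and the synthetic two-tap noise path are all illustrative assumptions. A sine wave stands in for speech.

```python
import math
import random


def nlms_cancel(primary, reference, taps=8, mu=0.1, eps=1e-8):
    """Adaptive noise cancellation with a normalized LMS (NLMS) filter.

    primary   : noisy samples (desired signal plus noise)
    reference : noise-only samples correlated with the noise in `primary`
    Returns the error signal e[n] = primary[n] - y[n], which serves as
    the estimate of the clean signal.
    """
    w = [0.0] * taps    # adaptive filter weights
    buf = [0.0] * taps  # most recent reference samples, newest first
    out = []
    for d, x in zip(primary, reference):
        buf = [x] + buf[:-1]
        y = sum(wi * bi for wi, bi in zip(w, buf))  # noise estimate
        e = d - y                                   # cleaned sample
        norm = eps + sum(b * b for b in buf)        # input power
        # NLMS update: LMS step scaled by the input power
        w = [wi + mu * e * bi / norm for wi, bi in zip(w, buf)]
        out.append(e)
    return out


def demo(n=4000, seed=0):
    """Synthetic check: returns (MSE before, MSE after) filtering."""
    rng = random.Random(seed)
    clean = [math.sin(2 * math.pi * 0.01 * k) for k in range(n)]
    ref = [rng.gauss(0.0, 1.0) for _ in range(n)]
    # Assumed (hypothetical) two-tap acoustic path from noise source to primary input
    noise = [0.8 * ref[k] + 0.3 * (ref[k - 1] if k else 0.0) for k in range(n)]
    noisy = [c + v for c, v in zip(clean, noise)]
    cleaned = nlms_cancel(noisy, ref)

    def mse(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b)) / len(a)

    half = n // 2  # skip the initial convergence period
    return mse(noisy[half:], clean[half:]), mse(cleaned[half:], clean[half:])


if __name__ == "__main__":
    before, after = demo()
    print(f"MSE before: {before:.4f}, after: {after:.4f}")
```

Dropping the division by `norm` turns the update into plain LMS, at the cost of making the stable range of `mu` depend on the input power. RLS would replace the scalar normalization with a recursively updated inverse correlation matrix.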





------------------------------------------------------------------------------------------------------------------------

The ADBU Journal of Engineering Technology (AJET), ISSN: 2348-7305

This journal is published under the terms of the Creative Commons Attribution (CC-BY) (http://creativecommons.org/licenses/)
