A Hybrid Model of MFCC/MSFLA for Speaker Recognition
[1]
Majida Ali Abed, College of Computers Sciences & Mathematics, University of Tikrit, Tikrit, Iraq.
[2]
Hamid Ali Abed Alasadi, Computers Sciences Department, Education for Pure Science College, University of Basra, Basra, Iraq.
In this paper, speaker recognition system is optimized based on one of Swarm Intelligence Algorithm called Modified Shuffle Frog Leaping Algorithm (MSFLA) with Cepstral analysis and the Mel Frequency Cepstral Coefficients (MFCC) feature extraction approach. In this algorithm Search has been applied on speaker recognition systems and voice. Thus by applying this algorithm, the process of speaker recognition is optimized by a fitness function by matching of voices being done on only the extracted optimized features produced by the MSFLA. The recognition accuracy for various noise conditions (white Gaussian noises, car-noises and B-noises) with same dataset are 94.02%, 96.78% and 84.33%, respectively, using a Hybrid model of MFCC/MSFLA.
Speaker Recognition, Mel Frequency Cepstral Coefficients (MFCCs), Modified Shuffled Frog Leaping Algorithm (MSFLA)
[1]
D. Ververidis, C. Kotropoulos, “Gaussian mixture modeling by exploiting the mahalanobis distance”, IEEE transactions on signal processing, Vol. 56, No. 7, July 2008.
[2]
K. Sri Rama Murty and B. Yegnanarayana, “Combining evidence from residual phase and MFCC features for speaker recognition”, IEEE Signal Processing Letters, vol 13, no 1, Jan. 2006.
[3]
S.R.M. Prasanna, S.G. Cheedella, B. Yegnanarayana, “Extraction of speaker-specific excitation information from linear prediction residual of speech”, Speech Communication, Vol. 48, Issue 10, October 2006.
[4]
S. Chakroborty, A. Roy, S. Majumdar, G. Saha, “Capturing Complementary Information via Reversed Filter Bank and Parallel Implementation with MFCC for Improved Text-Independent Speaker Identification”, International conference on Computing theory and applications, March 2007.
[5]
Y. Liu, M. Russell, M. Carey,” The Role of Dynamic Features in Text-Dependent and Independent Speaker Verification”, IEEE international conf. on acousto. Speech and signal processing (ICASSP), Vol. 1, May 2006.
[6]
E. Elbeltagi, T. Hegazy, and D. Grierson, “Comparison among five evolutionary based optimization algorithms,” Advanced Engineering Informatics, Vol. 19, Jan. 2005.
[7]
D. A. Reynolds, “Speaker identification and verification using Gaussian mixture models,” Speech Comm., vol. 17, Aug. 1995.
[8]
Chu, W. C., "Speech Coding Algorithms'', John Wiley & Sons, Vol.4, USA. 2003.
[9]
S. P. Kishore and B. Yegnanarayana, “Speaker verification Minimizing the channel effects using auto associative neural network models,” in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, Istanbul, 2000.
[10]
M. Shajith Ikbal, Hemant Misra, and B. Yegnanarayana,“Analysis of auto associative mapping neural networks,” in Int. Joint Conf. on Neural Networks,Washington, USA, 1999.
[11]
B.Wildermoth and K. K. Paliwal. Use of voicing and pitch information for speaker recognition. In Use of Voicing and Pitch Information for Speaker Recognition, 2000.
[12]
Eusuff, M.M. and Lansey, K.E. ‘Optimization of water distribution network design using the shuffled frog leaping algorithm’, Journal of Water Resources Planning andManagement, Vol. 129, No. 3, 2003.
[13]
Taher Niknam, Ehsan Azad Farsani, A hybrid self-adaptive particle swarm optimization and modified shuffled frog leaping algorithm for distribution feeder reconfiguration , Engineering Applications of Artificial Intelligence, 2010.
[14]
B. Amiri, M. Fathian, A. Maroosi, Application of shuffled frog-leaping algorithm on clustering, Journal of International Advanced Manufacturing Technology, Vol.45, 2009.
[15]
X. H. Luo, Y. Yang, and X. Li, “Modified shuffled frog-leaping algorithm to solve traveling salesman problem,” Journal of Communications, Vol. 30, Jul. 2009.
[16]
A. Khorsandi, A. Alimardani, B. Vahidi, and S.H. Hosseinian, “Hybrid shuffled frog leaping algorithm and Nelder–Mead simplexsearch for optimal reactive power dispatch,” IET Genetation Transmission & Distribution, Vol. 5, 2, 2011.
[17]
H.B. Kekre, Vaishali Kulkarni, Prashant Gaikar and Nishant Gupta, ”Speaker Identification using Spectrograms of Varying Frame Sizes”, International Journal of Computer Applications Vol. 50 - No. 20, July 2012.