Open access

Evaluation of localization precision by proposed quasi-spherical nested microphone array in combination with multiresolution adaptive steered response power



[1] X. Sheng and Y.-H. Hu, “Maximum Likelihood Multiple-Source Localization Using Acoustic Energy Measurements with Wireless Sensor Networks”, IEEE Transactions on Signal Processing, vol. 53, pp. 44-53, 2005. doi: 10.1109/TSP.2004.838930

[2] A. Ikeda, H. Mizoguchi, Y. Sasaki, T. Enomoto, and S. Kagami, “2D Sound Source Localization in Azimuth & Elevation from Microphone Array by Using a Directional Pattern of Element”, IEEE SENSORS, Atlanta, GA, pp. 1213-1216, 2007. doi: 10.1109/ICSENS.2007.4388627

[3] M. I. Mandel, R. J. Weiss, and D. P. Ellis, “Model-based expectation maximization source separation and localization”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 2, pp. 382-394, 2010. doi: 10.1109/TASL.2009.2029711

[4] F. Antonacci, M. Matteucci, D. Migliore, D. Riva, A. Sarti, M. Tagliasacchi, and S. Tubaro, “Tracking multiple acoustic sources in reverberant environments using regularized particle filter”, In Proceedings IEEE International Conference on Digital Signal Processing, Cardiff, UK, pp. 99-102, 2017.

[5] Q. Yan, J. Chen, G. Ottoy, and L. D. Strycker, “Robust AOA based acoustic source localization method with unreliable measurements”, Signal Processing, vol. 152, pp. 13-21, 2018. doi: 10.1016/j.sigpro.2018.05.010

[6] K. Na, Y. Kim, and H. Cha, Acoustic sensor network-based parking lot surveillance system, Berlin, Heidelberg: Springer Berlin Heidelberg, pp. 247-262, 2009. doi: 10.1007/978-3-642-00224-3_16

[7] D. Su, K. Nakamura, K. Nakadai, and J. V. Miro, “Robust sound source mapping using three-layered selective audio rays for mobile robots”, In Proceedings IEEE/RSJ International Conference on Intelligent Robots and Systems, Daejeon, Korea, pp. 2771-2777, 2016. doi: 10.1109/IROS.2016.7759430

[8] D. Su, T. V. Calleja, and J. V. Miro, “Towards real-time 3D sound sources mapping with linear microphone arrays”, In Proceedings IEEE International Conference on Robotics and Automation, Singapore, Singapore, pp. 1662-1668, 2017. doi: 10.1109/ICRA.2017.7989196

[9] J. H. DiBiase, H. F. Silverman, and M. S. Brandstein, “Robust Localization in Reverberant Rooms”, Springer, Berlin, Heidelberg, Ch. 8, pp. 157-180, 2001. doi: 10.1007/978-3-662-04619-7_8

[10] M. Cobos, A. Marti, and J. J. Lopez, “A modified SRP-PHAT functional for robust real-time sound source localization with scalable spatial sampling”, IEEE Signal Processing Letters, vol. 18, no. 1, pp. 71-74, 2011. doi: 10.1109/LSP.2010.2091502

[11] S. Tervo and T. Lokki, “Interpolation methods for the SRP-PHAT algorithm”, In 11th International Workshop on Acoustic Echo and Noise Control (IWAENC), 2008.

[12] A. Canclini, F. Antonacci, A. Sarti, and S. Tubaro, “Acoustic source localization with distributed asynchronous microphone networks”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 21, no. 2, pp. 439-443, 2013. doi: 10.1109/TASL.2012.2215601

[13] T. Dvorkind and S. Gannot, “Time difference of arrival estimation of speech source in a noisy and reverberant environment”, Signal Processing, vol. 85, no. 1, pp. 177-204, 2005. doi: 10.1016/j.sigpro.2004.09.014

[14] A. Canclini, P. Bestagini, F. Antonacci, M. Compagnoni, A. Sarti, and S. Tubaro, “A robust and low-complexity source localization algorithm for asynchronous distributed microphone networks”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, no. 10, pp. 1563-1575, 2015. doi: 10.1109/TASLP.2015.2439040

[15] H. Krim and M. Viberg, “Two decades of array signal processing research: The parametric approach”, IEEE SP Magazine, vol. 13, pp. 67-94, 1996. doi: 10.1109/79.526899

[16] P. Stoica and R. Moses, “Introduction to Spectral Analysis”, Prentice-Hall, 1997.

[17] R. Schmidt, “Multiple Emitter Location and Signal Parameter Estimation”, IEEE Transactions on Antennas and Propagation, vol. AP-34, pp. 276-280, 1986. doi: 10.1109/TAP.1986.1143830

[18] R. Roy and T. Kailath, “ESPRIT-Estimation of Signal Parameters via Rotational Invariance Techniques”, IEEE Transactions on ASSP, vol. 37, no. 7, pp. 984-995, 1989. doi: 10.1109/29.32276

[19] B. Kwon, Y. Park, and Y. S. Park, “Multiple sound source localization using the spatially mapped GCC function”, In ICROS-SICE International Conference, Japan, pp. 1773-1776, 2009.

[20] Y. Hioka, M. Matsuo, and N. Hamada, “Multiple-speech-source-localization using advanced histogram mapping method”, Acoustical Science and Technology, vol. 30, no. 2, pp. 143-146, 2009. doi: 10.1250/ast.30.143

[21] M. Farmani, M. S. Pedersen, Z. Tan, and J. Jensen, “Informed Sound Source Localization Using Relative Transfer Functions for Hearing Aid Applications”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 3, pp. 611-623, 2017. doi: 10.1109/TASLP.2017.2651373

[22] N. Stefanakis, D. Pavlidi, and A. Mouchtaris, “Perpendicular Cross-Spectra Fusion for Sound Source Localization With a Planar Microphone Array”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 9, pp. 1821-1835, 2017. doi: 10.1109/TASLP.2017.2718733

[23] N. Ma, J. A. Gonzalez, and G. J. Brown, “Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 11, pp. 2122-2131, 2018. doi: 10.1109/TASLP.2018.2855960

[24] W. Dai and H. Chen, “Multiple Speech Sources Localization in Room Reverberant Environment Using Spherical Harmonic Sparse Bayesian Learning”, IEEE Sensors Letters, vol. 3, no. 2, pp. 1-4, 2019. doi: 10.1109/LSENS.2018.2890129

[25] S. Rickard and F. Dietrich, “DOA estimation of many W-disjoint orthogonal sources from two mixtures using DUET”, Proceedings of the Tenth IEEE Workshop on Statistical Signal and Array Processing (Cat. No. 00TH8496), Pocono Manor, PA, USA, pp. 311-314, 2000.

[26] A. D. Firoozabadi, P. Irarrazaval, P. Adasme, H. Durney, and M. S. Olave, “A Novel Quasi-Spherical Nested Microphone Array and Multiresolution Modified SRP by GammaTone Filter-bank for Multiple Speakers Localization”, In Proceedings Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA), Poznan, Poland, pp. 208-213, 2019. doi: 10.23919/SPA.2019.8936771

[27] Y. R. Zheng, R. A. Goubran, and M. E. Tanany, “Experimental Evaluation of a Nested Microphone Array With Adaptive Noise Cancellers”, IEEE Transactions on Instrumentation and Measurement Journal, vol. 53, no. 3, pp. 777-786, 2004. doi: 10.1109/TIM.2004.827304

[28] P. I. Johannesma, “The pre-response stimulus ensemble of neurons in the cochlear nucleus”, Symposium on Hearing Theory, IPO Eindhoven, Holland, pp. 58-69, 1972.

[29] A. Aertsen, P. Johannesma, and D. Hermes, “Spectro-temporal receptive fields of auditory neurons in the grass frog”, Biological Cybernetics, vol. 38, no. 4, pp. 235-248, 1980. doi: 10.1007/BF00337016

[30] R. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, “An efficient auditory filterbank based on the gammatone function”, In a meeting of the IOC Speech Group on Auditory Modeling at RSRE, vol. 2, no. 7, 1987.

[31] A. D. Firoozabadi and H. R. Abutalebi, “SRP-ML: A Robust SRP-based speech source localization method for Noisy environments”, 18th Iranian Conference on Electrical Engineering (ICEE), Isfahan, Iran, pp. 2950-2955, 2010.

[32] I. Vinals, P. Gimeno, A. Ortega, A. Miguel, and E. Lleida, “Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge”, Proceedings Interspeech, pp. 2803-2807, 2018. doi: 10.21437/Interspeech.2018-1841

[33] J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, N. L. Dahlgren, and V. Zue, “TIMIT Acoustic-Phonetic Continuous Speech Corpus LDC93S1”, Web Download. Philadelphia: Linguistic Data Consortium (1993). Available from: https://catalog.ldc.upenn.edu/LDC93S1. Last accessed May 2019.

[34] O. Cetin and E. Shriberg, “Analysis of overlaps in meetings by dialog factors, hot spots, speakers, and collection site: Insights for automatic speech recognition”, Proceedings Interspeech, pp. 293-296, 2006. doi: 10.21437/Interspeech.2006-91

[35] J. Allen and D. Berkley, “Image method for efficiently simulating small room acoustics”, The Journal of the Acoustical Society of America, vol. 65, pp. 943-950, 1979. doi: 10.1121/1.382599

eISSN:
1339-309X
Language:
English
Publication frequency:
6 times per year
Journal subjects:
Engineering, Introductions and Overviews, other