**********
References
**********

.. # Use BIbTex alpha style

.. [KS10] C. Kim and R. M. Stern. Nonlinear enhancement of onset for robust speech recognition. In Proc. Interspeech 2010

.. [HAY02] S. Haykin, "Adaptive filter theory," Prentice Hall, 2002.

.. [VSR13] T. Virtanen, R. Singh, B Raj, editors, "Techniques for noise robustness in automatic speech recognition," Willey, 2013

.. [TRE02] H. L. V. Trees, "Optimum Array Processing," Wiley Interscience, 2002.

.. [VAD92] P. P. Vaidyanathan, "Multirate systems and filter banks", Prentice Hall, 1992.

.. [KMSK+08] K. Kumatani, J. W. McDonough, S. Schachl, D. Klakow, P. N. Garner and W. Li, "Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming," in Proc. ICASSP, Las Vegas, USA, 2008.

.. [DGCN03] J. M. de Haan, N. Grbic, I. Claesson and S. E. Nordholm, "Filter bank design for subband adaptive microphone arrays," IEEE Transactions on Speech and Audio Processing, pp. 14-23, 2003.

.. [CBH06] J. Chen, J. Benesty, Y. Huang,  Time Delay Estimation in Room Acoustic Environments: An Overview. EURASIP J. Adv. Sig. Proc. 2006 (2006)

.. [CAR81]  G. Carter, "Time delay estimation for passive sonar signal processing," IEEE Transactions on Acoustics, Speech, and Signal Processing, pp. 463-469, 1981.

.. [OS94] M. Omologo, P. Svaizer, "Acoustic event localization using a crosspower-spectrum phase based technique," in Proc. ICASSP 1994

.. [DSB01] J.H. DiBiase, H.F. Silverman, M.S Brandstein (2001) "Robust Localization in Reverberant Rooms," In: Brandstein M., Ward D. (eds) Microphone Arrays. Digital Signal Processing. Springer, Berlin, Heidelberg

.. [AWPA05] X. Anguera, C. Wooters, B. Peskin, M. Aguilo, "Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System," in Proc.  MLMI 2005.

.. [SR87] H. Schau, A. Robinson, "Passive source localization employing intersecting spherical surfaces from time-of-arrival differences," IEEE Transactions on Acoustics, Speech, and Signal Processing, pp. 1223-1225, 1987 

.. [AS87] J. Abel, J. Smith, "The spherical interpolation method for closed-form passive source localization using range difference measurements," In Proc. ICASSP, 1987

.. [BRA95] M. S. Brandstein, "A Framework for Speech Source Localization Using Sensor Arrays," Ph.D., Brown University, 1995.

.. [YKA96] J. Yli-Hietanen, K. Kalliojarvi, J. Astola, "Low-complexity angle of arrival estimation of wideband signals using small arrays," In Proc. IEEE Signal Processing Workshop on Statistical Signal and Array Processing,  1996.

.. [KGM06] U. Klee, T. Gehrig and J. W. McDonough, "Kalman Filters for Time Delay of Arrival-Based Source Localization," EURASIP J. Adv. Sig., 2006.

.. [WM09] M. Woelfel and J. W. McDonough, "Distant Speech Recognition", New York: Wiley, 2009. 

.. [SBM01] K. U. Simmer, J. Bitzer and C. Marro, "Post-Filtering Techniques," in Microphone Arrays, Heidelberg, Germany, Springer Verlag, 2001, pp. 39-60.

.. [SU96] T. M. Sullivan, Multi-Microphone Correlation-Based Processing for Robust Automatic Speech Recognition, Ph.D. thesis, Carnegie Mellon University, Pittsburgh, Pennsylvania, 8 1996.

.. [MB03] I. McCowan and H. Bourlard, "Microphone array post-filter based on noise field coherence," IEEE Transactions on Speech and Audio Processing, pp. 709-716, 2003. 

.. [LM07] S. Lefkimmiatis and P. Maragos, "A generalized estimation approach for linear and nonlinear microphone array post-filters," Speech Communication, vol. 49, pp. 7-8, 2007. 

.. [SKMC12] R. Singh, K. Kumatani, J. McDonough, L. Chen, "A signal-separation-based array postfilter for distant speech recognition," in Proc. Interspeech 2012.

.. [BS01] J. Bitzer and K. U. Simmer, "Superdirective Microphone Arrays," in Microphone Arrays, Heidelberg, Germany, Springer Verlag, 2001, pp. 19-38.

.. [SBA10] M. Souden, J. Benesty and S. Affes, "On optimal frequency-domain multichannel linear filtering for noise reduction," IEEE Trans. Audio, Speech, Language Process, pp. 260-276, 2010. 

.. [KMB12] K. Kumatani, J. McDonough and B. Raj, "Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors," IEEE Signal Processing Magazine, pp. 127-140, 2012. 

.. [KMRK+09] K. Kumatani, J. W. McDonough, B. Rauch, D. Klakow, P. N. Garner and W. Li, "Beamforming With a Maximum Negentropy Criterion," IEEE Transactions on Audio, Speech & Language Processing, pp. 994-1008, 2009. 

.. [KMRG+08] K. Kumatani, J. W. McDonough, B. Rauch, P. N. Garner, W. Li and J. Dines, "Maximum kurtosis beamforming with the generalized sidelobe canceller," in INTERSPEECH, Brisbane, Australia, 2008. 

.. [ME04] J. Meyer and G. W. Elko, "Spherical Microphone Arrays for 3D Sound Recording," in Audio Signal Processing for Next-Generation Multimedia Communication Systems, 2004, pp. 67-89.

.. [MKAY+13] J. W. McDonough, K. Kumatani, T. Arakawa, K. Yamamoto and B. Raj, "Speaker tracking with spherical microphone arrays," in ICASSP, Vancouver, Canada, 2013. 

.. [TA05] I. Tashev and D. Allred, "Reverberation reduction for improved speech recognition," in Proceedings of Hands-Free Communication and Microphone Arrays, Piscataway, USA, 2005. 

.. [KDGH+16] K. Kinoshita and M. Delcroix and S. Gannot and E. Habets and R. Haeb-Umbach and W. Kellermann and V. Leutnant and R. Maas and T. Nakatani and B. Raj and A. Sehr and T. Yoshioka; "A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research" EURASIP Journal on Advances in Signal Processing, 2016

.. [YN12] T. Yoshioka and T. Nakatani, "Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening," IEEE Trans. Audio, Speech, Language Process, pp. 2707-2720, 2012. 

.. [TAS09] I. Tashev, "Sound Capture and Processing: Practical Approaches," Wiley, 2009. 

.. [HS04] E. Haensler and G. Schmidt, "Acoustic Echo and Noise Control - A Practical Approach," Wiley Interscience, 2004. 

.. [KEL97] W. Kellermann, "Strategies for combining acoustic echo cancellation and adaptive beamforming microphone arrays," in Proc. ICASSP, 1997.

.. [HNK05] W. Herbordt, S. Nakamura and W. Kellerman, "Joint optimization of LCMV beamforming and acoustic echo cancellation for automatic speech recognition," in ICASSP, Philadelphia, PA, USA, 2005.
 
.. [MCKR+11] J. McDonough, W. Chu, K. Kumatani, B. Raj and J. Lehman, "An Information Filter for Voice Prompt Suppression," in Asilomar, Pacific Grove, CA, 2011. 

.. [MKR11] J. McDonough, K. Kumatani and B. Raj, "On the Combination of Voice Prompt Suppression with Maximum Kurtosis Beamforming," in Proc. WASPAA, New Paltz, NY, 2011.

.. [EV06] G. Enzner and P. Vary, "Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones," Signal Processing, pp. 1140-1156, 2006.

.. [FF18] J. Franzen, T. Fingscheidt, "An Efficient Residual Echo Suppression for Multi-Channel Acoustic Echo Cancellation Based on the Frequency-Domain Adaptive Kalman Filter", in Proc. ICASSP 2018.

.. [CSVH18] G. Carbajal, R. Serizel, E. Vincent, E. Humbert, "Multiple-input neural network-based residual echo suppression", in Proc. ICASSP 2018.

.. [WM05] M. Wölfel and J. McDonough, "Minimum variance distortionless response spectral estimation, review and refinements," IEEE Signal Processing Magazine, pp. 117-126, 2005. 

.. [MKGS+07] J. McDonough, K. Kumatani, T. Gehrig, E. Stoimenov, U. Mayer, S. Schacht, M. Woelfel and D. Klakow, "To separate speech: A system for recognizing simultaneous speech," in Proceedings of the 4th international conference on Machine learning for multimodal interaction, Brno, Czech Republic, 2007.

.. [WH07] E. Warsitz and R. Haeb-Umbach, "Blind Acoustic Beamforming based on Generalized Eigenvalue Decomposition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, 2007.

.. [HDH16] J. Heymann, L. Drude, R. Haeb-Umbach, "Neural network based spectral mask estimation for acoustic beamforming," in Proc. ICASSP 2016.

.. [KMR11] K Kumatani, J McDonough, B Raj, "Block-wise incremental adaptation algorithm for maximum kurtosis beamforming," in Proc. WASPAA, 2011

.. [HKIK+18] T. Higuchi, K. Kinoshita, N. Ito, S. Karita, and T. Nakatani, "Frame-by-frame closed-form update for mask-based adaptive MVDR beamforming," in Proc. ICASSP, 2018.