You are here: Home / Selected Publications / Refereed journal papers

Refereed journal papers

  1. Cernak, M., Asaei, A. and Bourlard, H. (2016), “On Structured Sparsity of Phonological Posteriors for Linguistic Parsing,” to be published in Speech Communication.
  2. Ferras, M, Madikeri, S., Motlicek, P., Dey, S., and Bourlard, H. (2016), “A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition,” to be published in IEEE Signal Processing Letters, 2016.
  3. Ferràs, M., Madikeri, S., and Bourlard, H. (2016), “Speaker Diarization and Linking of Meeting data,” IEEE Trans. on Audio, Speech and Language Processing, 2016.
  4. Ullmann, R. and Bourlard, H. (2015), “Predicting the Instrusiveness of Noise Through Sparse Coding With Auditory Kernels,” to be published in Speech Communication, Special Issue on Advances in Sparse Modeling and Low-rank Modeling in Speech Processing, 2015.
  5. P. Dighe, A. Asaei, and H. Bourlard (2015), “Sparse Modeling of Neural Network Posterior Probabilities of Exemplar-Based Speech Recognition,” to be published in Speech Communication, Special Issue on Advances in Sprase Modeling and Low-rank Modeling in Speech Processing, 2015.
  6. Asaei, A., Bourlard, H., Taghizadeh, M., and Cevher, V. (2015), “Computational Methods for Underestimated Convolutive Speech Localization and Separation via Model-Based Sparse Component Analysis,” to be published in Speech Communication, 2015.
  7. Sapru, A. and Bourlard, H. (2015), “Automatic Recognition of Emergent Social Roles in Small Group Interactions,” IEEE Trans. on Multimedia, Vol.17, No. 5, pp. 746-760, 2015.
  8. Taghizadeh, M.J., Asaei, A., Haghighatshoar, S., Garner, P., and Bourlard, H. (2015), “Source Localization via Multipath Distance Matrix Recovery with Theoretical Guarantees,” to be published in IEEE Journal of Selected Topics in Signal Processing, 2015.
  9. Taghizadeh, M.J., Parkhizbar, R., Garner, P., Bourlard, H., and Asaei, A. (2015), “Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees", Signal Processing, Elsevier, pp. 123-140, 2015.
  10. Li, W., Wand, L., Zhou, Y., Dines, J., Magimai-Doss, M., Bourlard, H., and Liao, Q., (2014), “Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array,” accepted for publication in IEEE Trans. on Acoustics, Speech and Signal Processing, 2014.
  11. Taghizadeh, M.J., Garner, P., and Bourlard, H. (2014), “Enhanced Diffuse Field Model for Ad Hoc Microphone Array Calibration,” to be published in Signal Processing, Elsevier, 2014.
  12. Yella, S. and Bourlard, H. (2014), “Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations,” accepted for publication in IEEE Trans. on Acoustics, Speech and Language Processing, 2014.
  13. Asaei, A., Golbabaee, M., Bourlard, H., and Cevher, V. (2013), “Structured Sparsity Models for Reverberant Speech Recognition,” IEEE/ACM Trans. on Speech and Audio Processing, 22(3), pp. 620-633, 2014.
  14. Motlicek, P., Duffner, S., Korchagin, D., Bourlard, H., Scheffler, C., Odobez, J-M., Del Galdo, G., Kallinger, M., and Thiergart, O. (2013), “Real-Time Audio-Visual Analysis For Multi-Person Videoconferencing,"  Advances in Multimedia, Hindawi Publishing Corporation, Article ID 175745, padoi:10.1155/2013/175745, Vol. 2013.
  15. Imseng, D., Motlicek, O., Bourlard, H., and Garner, P. (2013), “Using Out-of-Language Data to Improve an Under-Resourced Speech Recognizer,” Speech Communication, Vol. 56, pp. 142-151, 2014.
  16. Li, W. and Bourlard, H. (2013), “Robust Log-Energy Estimation and its Dynamic Change Enhancement for In-car Speech Recognition,” IEEE Trans. on Audio, Speech, and Language Processing, Vol. 21, No. 8, August 2013.
  17. Imseng, D., Bourlard, H., Dines, J., Garner, P., and Magimai.-Doss, M. (2013), “Applying Multi- and Cross-Lingual Stochastic Phoneme Space Transformations to Non-Native Speech Recognition,” to be published in  IEEE Trans. on Audio, Speech, and Language Processing, 2013.
  18. Parthasarathi, S.H.K., Bourlard, H., and Gatica-Perez, D. (2013), “Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations,” IEEE Trans. on Audio, Speech, and Language Processing, Vol. 21, No. 1, pp. 85-98, January 2013.
  19. Popescu-Belis, A., Lalanne, D, and Bourlard, H. (2012), “Finding Information in Multimedia Meeting Records,” IEEE Transactions on Multimedia (IEEE Computer Society), 2012.
  20. Bourlard, H., Dines, J., Magimai-Doss, M., Garner, P.N., Imseng, D., Motlicek, P., Liang, H., Saheer, L., and Valente, F. (2011), “Currents Trends in Multilingual Speech Processing,” invited paper, Sadhana (Indian Academy of Engineering Sciences), Special Issue on Speech Processing, Vol. 36, Part 5, October 2011, pp. 885–915.2011.
  21. Pinto, J., Sivaram, G.S, Magimai.-Doss, M., Hermansky, H., and Bourlard, H. (2011), “Analysis of MLP-Based Hierarchical Posterior Estimation using Volterra Series,’’ IEEE Trans. on Audio, Speech, and Language Processing, Vol. 19, No. 2, pp. 225-241, February 2011.
  22. Vijayasenan, D., Valente, F., and Bourlard, H. (2011), “An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization,” IEEE Trans. on Audio, Speech, and Language Processing, Vol. 19, No. 2, pp. 431-438, 2011.
  23. Parthasarathi, S.H.K., Gatica-Perez, D., Bourlard, H., and Magimai-Doss, M. (2010), “Privacy-Sensitive Audio Features for Speech/Nonspeech Detection,” accepted for publication in IEEE Trans. on Audio, Speech, and Language Processing.
  24. Ketabdar, H. and Bourlard, H. (2010), “Enhanced Phone Posteriors for Improving Speech Recognition Systems,” IEEE Transactions on Speech and Audio Processing, Vol. 18, No. 6, pp. 1094-1106, April 2010.
  25. Pinto, J., Sivaram, G., Magimai-Doss, M., Hermansky, H., and Bourlard, H. (2009), “Volterra Series for Analysing MLP based Phoneme Posterior Probability Estimator,” submitted to IEEE Transactions on Neural Networks.
  26. Vijayasenan, D., Valente, F., and Bourlard, H. (2011), “An Information Theoretic Approach to Speaker Diarization of Meeting Data,” IEEE Trans. on Audio, Speech, and Language Processing, Vol. 17, No. 7, pp. 1382-1393, 2009.
  27. Vinciarelli, A., Pantic, M., and Bourlard, H. (2008), “Social Signal Processing: Survey of an Emerging Domain,” Image and Vision Computing, vol. 27, no. 12, pp. 1743-1759, Elsevier, November 2009.
  28. Aradilla, G., Bourlard, H., and Magimai-Doss, M. (2008), “Kullback-Leibler Divergence Based Acoustic Models for Posterior Features in Speech Recognition,” submitted to IEEE Transactions on Speech and Audio Processing, under revision.
  29. Lathoud, G., Magimai-Doss, M, and Bourlard, H. (2006), “Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels, “ IEEE Transactions on Speech and Audio Processing.
  30. Marc Al-Hames, Thomas Hain, Jan Cernocky, Sascha Schreiber, Mannes Poel, Ronald Müller, Sebastien Marcel, David Van Leeuwen, Jean-marc Odobez, Sileye Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlicek, Stephan Reiter, Steve Renals, Jeroen Van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith, Andrew Thean, Pavel Zemcik (200), “Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers,” Machine Learning for Multimodal Interaction, pp. 24-35, Springer Berlin/Heidelberg, 2006.
  31. Tyagi, V., Bourlard, H., and Wellekens, C.J. (2006). “On variable-scale piecewise stationary spectral analysis of speech for ASR,” Speech Communication, Vol. 48, No. 9, pp. 1182-1191, September 2006.
  32. BenZeghiba, M.F and Bourlard, H. (2006), “User-customized password speaker verification using multiple reference and background models,” Speech Communication, Vol. 48, No. 9, pp. 1200-1213, September 2006.
  33. McCowan, I., Moore, D., Dines, J., Gatica-Perez, D., Flynn, M., Wellner, P., and Bourlard, H. (2005), “On the Use of Information Retrieval Measures for Speech Recognition Evaluation,” submitted for publication.
  34. Morgan, N., Zhu, Q., Stolcke, A., Sommez, K., Sivadas, SA., Shonozaki, T., Ostendorf, M., Jain, P., Hermansky, H., Gelbart, D., Ellis, D., Doddington, G., Chen, B., Cetin, O., Bourlard, H., and Athineos, M. (2005), “Pushing the Envelope – Aside : Beyond the Spectral Envelope as the Fundamental Representation for Speech Recognition,” IEEE Signal Processing Magazine, Vol. 22, No. 5, pp. 81-88, September 2005.
  35. Ikbal, S., Misra, H., Bourlard, H., and Hermansky, H. (2004), “PAC features”, submitted to IEEE Transaction on Speech and Audio Processing.
  36. Pujol, P., Hagen, A., Bourlard, H., and Nadeu, C. (2003), “Comparison and combination of features in hybrid HMM/MLP and HMM/GMM speech recognition,” IEEE Transactions of Speech and Audio Processing, pp. 14-22, Vol. 13, No.1, January 2005.
  37. de Wet, F., Weber, K, Boves, L., Cranen, B., Bengio, S., and Bourlard, H. (2004), “Evaluation of formant-like features for automatic speech recognition,” in Journal of Acoustical Society of America (JASA), Vol. 116, Issue 3, pp. 1781-1792, September 2004.
  38. Ajmera, J., McCowan, I., and Bourlard, H. (2004), “Robust Speaker Change Detection,” IEEE Signal Processing Letters, pp. 649-651, Vol. 11, No. 8, August 2004.
  39. Stephenson, T., Magimai Doss, M., and Bourlard, H. (2003), “Speech Recognition with Auxiliary Information,” IEEE Trans on Speech and Audio Processing, pp. 189-203, Vol. 12, No. 3.
  40. Ajmera, J., McCowan, I., and Bourlard, H. (2003) “Speech/Music Discrimination Using Entropy and Dynamism Features in a HMM Classification Famework,” Speech Communication, vol. 40, pp. 351-363.
  41. Chen, D., Odobez, J.-M., and Bourlard, H. (2003), “Text Detection and Recognition in Images and Videos”, accepted for publication in Intl. Journal of Pattern Recognition and Artificial Intelligence.
  42. Weber, K., Ikbal, S., Bengio, S., and Bourlard, H., “Robust Speech Recognition and Feature Extraction Using HMM2,” Computer, Speech, and Language, Vol. 17, No. 2-3, April-July 2003, pp. 195-212, Academic Press, 2003.
  43. McCowan, I. and Bourlard, H. (2003) “Microphone Array Post-filter Based on Noise Field Coherence,” IEEE Transactions on Speech and Audio Processing, Vol. 11, No. 6, pp. 709-716, November 2003.
  44. Moeller, S. and Bourlard, H. (2002) “Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model,” published in Speech Communication, 2002.
  45. Morris, A., Hagen, A., Glotin, H., Bourlard, H. (2001) “Multi-stream adaptive evidence combination for noise robust ASR,” Speech Communication (Elsevier, North Holland), Vol. 34, Nos. 1-2, pp. 25-40, 2001.
  46. Bourlard, H., Dupont, S., and Ris C. (1998), “Multi-Stream Speech Recognition,” invited paper, CC AI The Journal for the Integrated Study of Artificial Intelligence, Cognitive Science and Applied Epistemology, vol. 15, no. 3, pp. 215-234.
  47. Bourlard, H., Konig, Y., and Morgan, N. (1996), “A Training Algorithm for Statistical Sequence Recognition with Applications to Transition-Based Speech Recognition,” IEEE Signal Processing Letters, vol. 3, no. 7, pp. 203-205.
  48. Bourlard, H., Hermansky, H., and Morgan, N. (1996), “Towards Increasing Speech Recognition Error Rates,” special-interest invited paper, Speech Communication, vol. 18, no. 3, pp. 205-231, June 1996.
  49. Morgan, N. and Bourlard, H. (1995), “Neural Networks for Statistical Recognition of Continuous Speech,” Proceedings of the IEEE, Invited Paper, vol. 83, no. 5, pp. 741-770, May 1995.
  50. de Veth, J. and Bourlard, H. (1995), “Comparison of Hidden Markov Model Techniques for Automatic Speaker Verification in Real-World Conditions,”  Invited Paper, Speech Communication, vol. 17, no. 1-2, pp. 81-90, Aug. 1995, North-Holland.
  51. Morgan, N. and Bourlard, H. (1995), “Continuous Speech Recognition: An Introduction to the Hybrid HMM/Connectionist Approach,” IEEE Signal Processing Magazine, Invited Paper, vol. 12, no. 3, pp. 25-42, May 1995 (IEEE Award paper).
  52. Renals, S., Morgan, N., Bourlard, H., Cohen, M. and Franco, H. (1994), “Connectionist Probability Estimators in HMM Speech Recognition,” IEEE Trans. on Speech and Audio Processing, vol. 2, no. 1, pp. 161-174.
  53. Morgan, N., Bourlard, H., Renals, S., Cohen, M., and Franco. H. (1993), “Hybrid Neural Network/Hidden Markov Model Systems for Continuous Speech Recognition,” Intl. Journal of Pattern Recognition and Artificial Intelligence, vol. 7, no. 4, pp. 899-916.
  54. Bourlard, H. and Morgan, N. (1993), “Continuous Speech Recognition by Connectionist Statistical Methods,” IEEE Trans. on Neural Networks, vol. 4, no. 6, pp. 893-909.
  55. Morgan N. and Bourlard, H. (1992), “Factoring Neural Networks by Statistical Methods,” Neural Computation, 4, pp. 835-838.
  56. Bourlard, H., Morgan, N., and Renals, S. (1992), “Neural Nets and HMMs: Review and Generalizations,” Invited Paper, Speech Communication, pp. 237- 246, vol. 11, no. 2-3, June 1992.
  57. Bourlard, H. and Wellekens, C.J. (1990), “Links Between Markov Models and Multilayer Perceptrons,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 12, no. 12, pp. 1167-1178.
  58. Bourlard, H. and Wellekens, C.J. (1989), “Speech Pattern Discrimination and Multilayer Perceptrons,” Computer, Speech and Language (Academic Press), vol. 3, pp. 1-19.
  59. Bourlard, H. and Kamp, Y. (1988), “Auto-Association by Multilayer Perceptrons and Singular Value Decomposition,” Biological Cybernetics, vol. 59, pp. 291-294.
  60. Aubert, X., Bourlard, H., Kamp, Y., and Wellekens, C.J. (1988), “Improved Hidden Markov Models for Speech Recognition,” in Philips Journal of Research, vol. 43, pp. 224-245.