You are here: Home / Publications / Peer reviewed conference papers

Peer reviewed conference papers

  1. Hannah Muckenhirn, Mathew Magimai.-Doss and Sébastien Marcel. Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification. in Proceedings of International Conference of the Biometrics Special Interest Group (BIOSIG), 2016. (pdf)
  2. Marzieh Razavi and Mathew Magimai.-Doss. Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery. in Proceedings of Interspeech, 2016. (pdf)
  3. Ramya Rasipuram, Milos Cernak and Mathew Magimai.-Doss. HMM-based Non-native Accent Assessment using Posterior Features. in Proceedings of Interspeech, 2016. (pdf)
  4. Marzieh Razavi, Ramya Rasipuram and Mathew Magimai.-Doss. Pronunciation Lexicon Development for Under-Resourced Languages Using Automatically Derived Subword Units: A Case Study on Scottish Gaelic. in Proceedings of 4th Biennial Workshop on Less-Resourced Languages, 2015. (pdf[Received one of the best student paper awards]
  5. Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert. Analysis of CNN-based speech recognition system using raw speech as input. in Proceedings of Interspeech, 2015.
  6. Ramya Rasipuram, Milos Cernak, Alexandre Nanchen and Mathew Magimai.-Doss. Automatic accentedness evaluation of non-native speech using phonetic and sub-phonetic posterior probabilities. in Proceedings of Interspeech, 2015.
  7. Raphael Ullmann, Ramya Rasipuram, Mathew Magimai.-Doss and Hervé Bourlard. Objective intelligibility assessment of text-to-speech systems through utterance verification. in Proceedings of Interspeech, 2015. [Received one of the best student paper award]
  8. Marzieh Razavi and Mathew Magimai.-Doss. An HMM-based Formalism for Automatic Subword Unit Derivation and Pronunciation Generation. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2015.
  9. Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert. Convolutional Neural Networks based Continuous Speech Recognition using Raw Speech Signal. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2015.
  10. Ramya Rasipuram, Marzieh Razavi and Mathew Magimai.-Doss. Integrated Pronunciation Learning for Automatic Speech Recognition using Probabilistic Lexical Modeling. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2015.
  11. Raphael Ullmann, Mathew Magimai.-Doss and Hervé Bourlard. Objective Speech Intelligibility Assessment Through Comparison of Phoneme Class Conditional Probability Sequences. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2015.
  12. Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert. Joint Phoneme Segmentation Inference and Classification using CRFs. in GlobalSIP14-Machine Learning Applications in Speech Processing, December 2014.
  13. Marzieh Razavi and Mathew Magimai.-Doss. On Recognition of Non-Native Speech using Probabilistic Lexical Model. in Proceedings of Interspeech, September2014.
  14. Marzieh Razavi, Ramya Rasipuram and Mathew Magimai.-Doss. On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, May 2014.
  15. Ramya Rasipuram, Marzieh Razavi and Mathew Magimai.-Doss. Probabilistic Lexical Modeling and Unsupervised Training For Zero-Resourced ASR. in Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), December 2013.
  16. Ramya Rasipuram and Mathew Magimai.-Doss. Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach. in Proceedings of Interspeech, August 2013.
  17. Dimitri Palaz, Ronan Collobert and Mathew Magimai.-Doss. Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks. in Proceedings of Interspeech, August 2013.
  18. Ramya Rasipuram, Peter Bell and Mathew Magimai.-Doss. Grapheme and Multilingual Posterior Features for Under-Resourced Speech Recognition: A Study on Scottish Gaelic. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, May 2013.
  19. Youssef Oualil, Mathew Magimai.-Doss, Friedrich Faubel and Dietrich Klakow. A probabilistic framework for multiple speaker localization. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, May 2013.
  20. Anindya Roy, Mathew Magimai.-Doss and Sebastien Marcel. Boosting localized binary features for speech recognition. in Sympoisum on Machine Learning in Speech and Language Processing (MLSLP), 2012.
  21. Ramya Rasipuram and Mathew Magimai.-Doss. Combining Acoustic Data-Driven G2P and Letter-to-Sound Rule Rules for Under Resources Lexicon Generation. in Proceedings of Interspeech, 2012.
  22. Serena Soldo, Mathew Magimai.-Doss and Hervé Bourlard. Synthetic References for Template-based ASR using Posterior Features. in Proceedings of Interspeech, 2012.
  23. Yang Sun, Mathew M. Doss, Jort F. Gemmeke, Bert Cranen, Louis ten Bosch and Lou Boves. Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR. in Proceedings of Interspeech, 2012.
  24. Yang Sun, Bert Cranen, Jort F. Gemmeke, Louis ten Bosch, Lou Boves and Mathew M. Doss. Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR. in Proceedings of Interspeech, 2012.
  25. Serena Soldo, Mathew Magimai.-Doss and Hervé Bourlard. Template-based ASR using Posterior Features and Synthetic References: comparing different TTS Systems. in SAPA-SCALE Conference, 2012.
  26. Youssef Oualil, Mathew Magimai.-Doss, Friedrich Faubel and Dietrich Klakow. Joint Detection and Localization of Multiple Speakers using a Probabilistic Steered Response Power. in SAPA-SCALE Conference, 2012.
  27. Youssef Oualil, Friedrich Faubel, Mathew Magimai.-Doss, and Dietrich Klakow. A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking. in Proceedings of  European Signal Processing Conference (EUSIPCO), 2012.
  28. Ramya Rasipuram and Mathew Magimai.-Doss. Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM. in Proceedings of IEEE International Conference on Acoustic, Speech Signal Processing (ICASSP), 2012. (pdf, bibtex)
  29. David Imseng, Ramya Rasipuram and Mathew Magimai.-Doss. Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2011. (pdf, bibtex) [Short listed for student best paper award]
  30. Anindya Roy, Mathew Magimai.-Doss and Sebastien Marcel. Fast Speaker verification on mobile Phone data using boosted slice classifiers. in IAPR IEEE International Joint Conference on Biometrics, October 2011. (pdf, bibtex)
  31. Mathew Magimai.-Doss, Ramya Rasipuram, Guillermo Aradilla, and Hervé Bourlard. Grapheme-based automatic speech recognition using KL-HMM. in Proceedings of Interspeech, August 2011. (pdf, bibtex)
  32. Joel Pinto, Mathew Magimai.-Doss and Hervé Bourlard. Hierarchical tandem features for ASR in Mandarin. in Proceedings of Interspeech, August 2011. (pdf, bibtex)
  33. David Imseng, Hervé Bourlard, John Dines, Phillip N. Garner and Mathew Magimai.-Doss. Improving non-native ASR through stochastic multilingual phoneme space transformations. in Proceedings of Interspeech, August 2011. (pdf, bibtex)
  34. Fabio Valente, Mathew Magimai.-Doss and Wen Wang. Analysis and comparison of recent MLP features for LVCSR systems. in Proceedings of Interspeech, August 2011. (pdf, bibtex)
  35. Ramya Rasipuram and Mathew Magimai.-Doss. Improving articulatory feature and phoneme recognition using multitask learning. in Proceedings of International Conference on Artificial Neural Networks (ICANN), June 2011. (pdf, bibtex)
  36. Ramya Rasipuram and Mathew Magimai.-Doss. Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2011. (pdf, bibtex)
  37. Serena Soldo, Mathew Magimai.-Doss, Joel Pinto, and Hervé Bourlard. Posterior features for template-based ASR. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2011. (pdf, bibtex)
  38. Anindya Roy, Mathew Magimai.-Doss, and Sebastien Marcel. Phoneme recognition using boosted binary feature. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2011. (pdf, bibtex)
  39. David Imseng, Hervé Bourlard, Mathew Magimai.-Doss, and John Dines. Language dependent universal phoneme posterior estimation for mixed-language speech recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2011. (pdf, bibtex)
  40. David Imseng, Mathew Magimai.-Doss and Hervé Bourlard. Hierarchical multilayer perceptron based language identification. in Proceedings of Interspeech, September 2010. (pdf, bibtex)
  41. Fabio Valente, Mathew Mathew Magimai.-Doss, Christian Plahl, Suman Ravuri, and Wen Wang. A comparative study of MLP front-ends for Mandarin ASR. in Proceedings of Interspeech, September 2010. (pdf, bibtex)
  42. David Imseng, Hervé Bourlard, and Mathew Magimai.-Doss. Towards mixed language recognition. in Proceedings of Interspeech, September 2010. (pdf, bibtex)
  43. Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Hervé Bourlard, and Daniel Gatica-Perez. Evaluating the robustness of privacy-sensitive audio features for speech detection in personal audio log scenarios. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), March 2010. (pdf, bibtex)
  44. Anindya Roy, Mathew Magimai.-Doss, and Sebastien Marcel. Boosted binary features for noise robust speaker verification. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), March 2010. (pdf, bibtex)
  45. Joel Pinto, Mathew Magimai.-Doss and Hervé Bourlard. MLP based hierarchical system for task adaptation in ASR. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2009. (pdf, bibtex)
  46. Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Daniel Gatica-Perez, and Hervé Bourlard. Speaker change detection with privacy-preserving audio cues. in Proceedings of ICMI-MLMI, November 2009. (pdf, bibtex)
  47. Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Hervé Bourlard, and Daniel Gatica-Perez. Investigating privacy-sensitive features for speech detection in multiparty conversations. in Proceedings of Interspeech, September 2009. (pdf, bibtex)
  48. Fabio Valente, Mathew Magimai.-Doss, Christian Plahl, and Suman Ravuri. Hierarchical processing of the modulation spectrum for GALE Mandarin. in Proceedings of Interspeech, September 2009. (pdf, bibtex)
  49. Guillermo Aradilla, Hervé Bourlard, and Mathew Magimai-Doss. Posterior features applied to speech recognition task with user-friendly vocabulary. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2009. (pdf, bibtex)
  50. Joel Pinto, G.S.V.S. Sivaram, H. Hermansky, and M. Magimai.-Doss. Volterra series for analyzing MLP-based phoneme posterior estimator. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2009. (pdf, bibtex)
  51. Weifeng Li, John Dines, Mathew Magimai-Doss, and Hervé Bourlard. Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2009. (pdf, bibtex)
  52. Guillermo Aradilla, Herv\'e Bourlard, and Mathew Magimai-Doss. Using KL-based acoustic models in a large vocabulary recognition task. in Proceedings of Interspeech, September 2008. (pdf, bibtex)
  53. Weifeng Li, John Dines, Mathew Magimai.-Doss, and Hervé Bourlard. Neural network based regeression for robust overlapping speech recognition using microphone arrays. in Proceedings of Interspeech, September 2008. (pdf, bibtex)
  54. Tamara Tosic, Mathew~Magimai Doss, and Hynek Hermansky. Using comparison of parallel phoneme probability streams for OOV word detection. in Proceedings of 16th European Signal Processing Conference (EUSIPCO), August 2008. (pdf, bibtex)
  55. Weifeng Li, John Dines, Mathew Magimai.-Doss, and Hervé Bourlard. MLP-based log spectral energy mapping for robust overlapping speech recognition. in Proceedings of 16th European Signal Processing Conference (EUSIPCO), August 2008. (pdf, bibtex)
  56. Joel Pinto, B. Yegnanarayana, Hynek Hermansky, and Mathew Magimai-Doss. Exploiting contextual information for improved phoneme recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2008. (pdf, bibtex)
  57. O. Cetin, M. Magimai-Doss, K. Livescu, A. Kantor, S. King, C. Bartels, and J. Frankel. Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2007. (pdf, bibtex)
  58. Joe Frankel, Mathew Magimai-Doss, Simon King, Karen Livescu, and Ozgur Cetin. Articulatory feature recognition MLPs trained on 2000 hours of telephone speech. in Proceedings of Interspeech, September 2007. (pdf, bibtex)
  59. E. Matusov, D. Hillard, M. Magimai-Doss, D. Hakkani-Tur, M. Ostendorf, and H. Ney. Improving speech translation with automatic boundary prediction. in Proceedings of Interspeech, September 2007. (pdf, bibtex)
  60. J. Fung, D. Hakkani-Tur, M. Magimai-Doss, E. Shriberg, S. Cuendet, and N. Mirghafori. Prosodic features and feature selection for multi-lingual sentence segmentation. in Proceedings of Interspeech, September 2007. (pdf, bibtex)
  61. M. Magimai.-Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori. New entropy based combination for sentence segmentation. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007. (pdf, bibtex)
  62. Octavian Cheng, John Dines, and Mathew Magimai-Doss. A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007. (pdf, bibtex)
  63. Ozgur Cetin, Arthur Kantor, Simon King, Chris Bartels, Mathew Magimai-Doss, Joe Frankel, and Karen Livescu. An articulatory feature-based tandem approach and factored observation modeling. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007. (pdf, bibtex)
  64. K. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. Lavoie. Manual transcription of conversational speech at the articulatory feature level. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007.
  65. K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko. Articulatory feature-based methods for acoustic and audio-visual speech recognition: summary from the 2006 JHU summer. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007.
  66. Guillaume Lathoud, Mathew Magimai.-Doss, and Hervé Bourlard. Threshold selection for unsupervised detection with an application to microphone arrays. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2006.
  67. Guillaume Lathoud, Mathew Magimai.-Doss, Bertrand Mesot, and Hervé Bourlard. Unsupervised spectral subtraction for noise-robust ASR. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2005. (pdf, bibtex)
  68. Guillaume Lathoud, Mathew Magimai.-Doss, and Bertrand Mesot. A spectrogram model for enhanced source localization and noise-robust ASR. in Proceedings of Interspeech 2005, September 2005. (pdf, bibtex)
  69. Guillaume Lathoud and Mathew Magimai.-Doss. A sector-based frequency-domain approach to detection and localization of multiple speakers. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), March 2005. (pdf, bibtex)
  70. S. Ikbal, H. Bourlard, and M. Magimai.-Doss. HMM/ANN based spectral peak location estimation for noise robust speech. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), March 2005. (pdf, bibtex)
  71. Hervé Bourlard, Samy Bengio, Mathew Magimai Doss, Qifeng Zhu, Bertrand Mesot, and Nelson Morgan. Towards using hierarchical posteriors for flexible automatic speech recognition systems. in Proceedings of RT04, 2004. (pdf, bibtex)
  72. Mathew Magimai.-Doss, Todd A. Stephenson, Shajith Ikbal, and Hervé Bourlard. Modelling auxiliary features in tandem systems. in Proceedings of International Conference on Spoken Language Processing (ICSLP), October 2004. (pdf, bibtex)
  73. Shajith Ikbal, Mathew Magimai.-Doss, Hemant Misra, and Hervé Bourlard. Spectro-temporal activity pattern (STAP) features for noise robust ASR. in Proceedings of International Conference on Spoken Language Processing (ICSLP), October 2004. (pdf, bibtex)
  74. Mathew Magimai-Doss, Samy Bengio, and Hervé Bourlard. Joint decoding for phoneme-grapheme continuous speech recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2004. (pdf, bibtex)
  75. Mathew Magimai-Doss, Todd A. Stephenson, Hervé Bourlard, and Samy Bengio. Phoneme-grapheme based automatic speech recognition system. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2003. (pdf, bibtex)
  76. Mathew Magimai-Doss, Todd A. Stephenson, and Hervé. Bourlard. Using pitch frequency information in speech recognition. in Proceedings of Eurospeech, September 2003. (pdf, bibtex)
  77. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2003. (pdf, bibtex)
  78. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition. in Proceedings of International Conference on Spoken Language Processing (ICSLP), September 2002. (pdf, bibtex)
  79. Todd A. Stephenson, Jaume Escofet, Mathew Magimai-Doss, and Hervé Bourlard. Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables. in Proceedings of the IEEE Signal Processing Society Workshop on Neural Networks for Signal Processing (NNSP), September 2002. (pdf, bibtex)
  80. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Mixed Bayesian networks with auxiliary variables for automatic speech recognition. in Proceedings of International Conference on Pattern Recognition (ICPR), August 2002. (pdf, bibtex)
  81. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Modeling auxiliary information in Bayesian network based ASR. in Proceedings of Eurospeech, September 2001. (pdf, bibtex)
  82. M. Mathew, B. Yegnanarayana, and R. Sundar. A neural network-based speaker verification system using suprasegmental features. in Proceedings of Eurospeech, September 1999.