Publications

Theses

  1. Mathew Magimai Doss. Using Auxiliary Sources of Knowledge for Automatic Speech Recognition. PhD Thesis, Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland, July, 2005. (pdf, bibtex)
  2. M. Mathew. Combining Evidence from Different Classifiers for Text-Dependent Speaker Verification. M.S. Thesis, Indian Institute of Technology Madras (IIT Madras), India, 1999.

Book Chapters

  1. Mathew Magimai Doss. Speech processing. in: Interactive Multimodal Information Management, Hervé Bourlard and Andrei Popescu-Belis (editors), pages 221--245, EPFL Press, 2013.
  2. Weifeng Li, Kenichi Kumatani, John Dines, Mathew Magimai.-Doss, and Hervé Bourlard. A neural network based regression approach for recognizing simultaneous speech. in Machine Learning for Multimodal Interaction (MLMI), Andrei Popescu-Belis and Rainer Stiefelhagen (editors), Lecture Notes in Computer Science No. 5237. Springer-Verlag, 2008. (pdf) [peer reviewed]
  3. John Dines and Mathew Magimai-Doss. A study of phoneme and grapheme based context-dependent ASR systems. in Machine Learning for Multimodal Interaction (MLMI), Andrei Popescu-Belis and Steve Renals (editors), Lecture Notes in Computer Science No. 4892. Springer-Verlag, 2008. (pdf) [peer reviewed]
  4. Andreas Stolcke, Xavier Anguera, Kofi Boakye, Ozgur Cetin, Adam Janin, Mathew Magimai-Doss, Chuck Wooters, and Jing Zheng. The SRI-ICSI spring 2007 meeting and lecture recognition system. in Multimodal Technologies for Perception of Humans: International Evaluation Workshops CLEAR 2007 and RT 2007, Rainer Stiefelhagen, Rachel Bowers, and Jonathan Fiscus (editors), Lecture Notes in Computer Science No. 4625, 2007. (pdf)
  5. Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng, and Thomas Hain. Juicer: A weighted finite-state transducer speech decoder. in Machine Learning for Multimodal Interaction (MLMI), Steve Renals, Samy Bengio, and Jonathan G. Fiscus (editors), Lecture Notes in Computer Science No. 4299, 2006. (pdf) [peer reviewed]
  6. Mathew Magimai.-Doss and Hervé Bourlard. On the adequacy of baseform pronunciations and pronunciation variants. in Machine Learning for Multimodal Interaction (MLMI), Samy Bengio and Hervé Bourlard (editors), Lecture Notes in Computer Science No. 3361, 2005. (pdf) [peer reviewed]

Peer reviewed journal papers

  1. Marzieh Razavi and Mathew Magimai.-Doss. A Posterior-Based Multi-Stream Formulation for G2P Conversion. in IEEE Signal Processing Letters, 2017. (to appear)
  2. Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss. Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework. in Speech Communication, Vol. 80, June 2016, Pages 1–21. (pdf)
  3. Ramya Rasipuram and Mathew Magimai-Doss. Articulatory feature based continuous speech recognition using probabilistic lexical modeling. in Computer Speech and Language, Vol. 36, March 2016, Pages 233–259 (pdf).
  4. Ramya Rasipuram and Mathew Magimai.-Doss. Acoustic and lexical resource constrained ASR using language-independent acoustic model and language-dependent probabilistic lexical model. in Speech Communication, Volume 68, April, Pages 23-40, 2015 (pdf).
  5. Mathew Magimai-Doss and Ramya Rasipuram. On learning grapheme-to-phoneme relationships through the acoustic speech signal. in The Phonetician, Number 109-110, 2014. (pdf)
  6. Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai.-Doss, Hervé Bourlard and Qingmin Liao. Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array. in IEEE/ACM Trans. on Audio, Speech and Language Processing, December 2014.
  7. David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai.-Doss. Applying multi- and cross-lingual stochastic phone space transformations to non-native speech recognition. in IEEE Transactions on Audio, Speech, and Language Processing, 2013.
  8. Sunder Ram Krishnan, Mathew Magimai.-Doss, and Chandra Sekhar Seelamantula, A Savitzky-Golay Filtering Perspective of Dynamic Feature Computation. in IEEE Signal Processing Letters, 2013.
  9. Shajith Ikbal, Hemant Misra, Hynek Hermansky, and Mathew Magimai-Doss. Phase AutoCorrelation (PAC) Features for Noise Robust Speech Recognition. in Speech Communication, Volume 54, Issue 7, September 2012, Pages 867-880.
  10. Anindya Roy, Mathew Magimai.-Doss, and Sebastien Marcel. A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers. in IEEE Transactions on Information Forensics and Security, Volume 7, Number 1, February 2012.
  11. Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard, and Mathew Magimai.-Doss. Privacy-Sensitive Audio Features for Speech/Nonspeech Detection. in IEEE Transactions on Audio, Speech and Language Processing, Volume 19, Issue 8, November 2011.
  12. Fabio Valente, Mathew Magimai.-Doss, Christian Plahl, Suman Ravuri, and Wen Wang. Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features. in IEEE Transactions on Audio, Speech and Language Processing, Volume 19, Issue 8, November 2011.
  13. Hervé Bourlard, John Dines, Mathew Magimai.-Doss, Philip N. Garner, David Imseng, Petr Motlicek, Hui Liang, Lakshmi Saheer, and Fabio Valente. Current Trends in Multilingual Speech Processing. in Sadhana, Volume 36, Number 5, October 2011. (invited paper)
  14. Joel Pinto, G. S. V. S. Sivaram, Mathew Magimai.-Doss, Hynek Hermansky, and Hervé Bourlard. Analysis of MLP based Hierarchical Phoneme Posterior Probability Estimator. in IEEE Transactions on Audio, Speech and Language Processing, Volume 19, Issue 2, February 2011.
  15. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Speech Recognition with Auxiliary Information. in IEEE Transactions on Speech and Audio Processing, Volume 12, Issue 3, May, 2004.

Peer reviewed conference papers

  1. Hannah Muckenhirn, Mathew Magimai.-Doss and Sébastien Marcel. Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification. in Proceedings of International Conference of the Biometrics Special Interest Group (BIOSIG), 2016. (pdf)
  2. Marzieh Razavi and Mathew Magimai.-Doss. Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery. in Proceedings of Interspeech, 2016. (pdf)
  3. Ramya Rasipuram, Milos Cernak and Mathew Magimai.-Doss. HMM-based Non-native Accent Assessment using Posterior Features. in Proceedings of Interspeech, 2016. (pdf)
  4. Marzieh Razavi, Ramya Rasipuram and Mathew Magimai.-Doss. Pronunciation Lexicon Development for Under-Resourced Languages Using Automatically Derived Subword Units: A Case Study on Scottish Gaelic. in Proceedings of 4th Biennial Workshop on Less-Resourced Languages, 2015. (pdf) [Received one of the best student paper awards]
  5. Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert. Analysis of CNN-based speech recognition system using raw speech as input. in Proceedings of Interspeech, 2015.
  6. Ramya Rasipuram, Milos Cernak, Alexandre Nanchen and Mathew Magimai.-Doss. Automatic accentedness evaluation of non-native speech using phonetic and sub-phonetic posterior probabilities. in Proceedings of Interspeech, 2015.
  7. Raphael Ullmann, Ramya Rasipuram, Mathew Magimai.-Doss and Hervé Bourlard. Objective intelligibility assessment of text-to-speech systems through utterance verification. in Proceedings of Interspeech, 2015. [Received one of the best student paper awards]
  8. Marzieh Razavi and Mathew Magimai.-Doss. An HMM-based Formalism for Automatic Subword Unit Derivation and Pronunciation Generation. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2015.
  9. Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert. Convolutional Neural Networks based Continuous Speech Recognition using Raw Speech Signal. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2015.
  10. Ramya Rasipuram, Marzieh Razavi and Mathew Magimai.-Doss. Integrated Pronunciation Learning for Automatic Speech Recognition using Probabilistic Lexical Modeling. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2015.
  11. Raphael Ullmann, Mathew Magimai.-Doss and Hervé Bourlard. Objective Speech Intelligibility Assessment Through Comparison of Phoneme Class Conditional Probability Sequences. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2015.
  12. Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert. Joint Phoneme Segmentation Inference and Classification using CRFs. in GlobalSIP14-Machine Learning Applications in Speech Processing, December 2014.
  13. Marzieh Razavi and Mathew Magimai.-Doss. On Recognition of Non-Native Speech using Probabilistic Lexical Model. in Proceedings of Interspeech, September 2014.
  14. Marzieh Razavi, Ramya Rasipuram and Mathew Magimai.-Doss. On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, May 2014.
  15. Ramya Rasipuram, Marzieh Razavi and Mathew Magimai.-Doss. Probabilistic Lexical Modeling and Unsupervised Training For Zero-Resourced ASR. in Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), December 2013.
  16. Ramya Rasipuram and Mathew Magimai.-Doss. Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach. in Proceedings of Interspeech, August 2013.
  17. Dimitri Palaz, Ronan Collobert and Mathew Magimai.-Doss. Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks. in Proceedings of Interspeech, August 2013.
  18. Ramya Rasipuram, Peter Bell and Mathew Magimai.-Doss. Grapheme and Multilingual Posterior Features for Under-Resourced Speech Recognition: A Study on Scottish Gaelic. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, May 2013.
  19. Youssef Oualil, Mathew Magimai.-Doss, Friedrich Faubel and Dietrich Klakow. A probabilistic framework for multiple speaker localization. in Proceedings of IEEE Int. Conf. Acoust., Speech, and Signal Processing, May 2013.
  20. Anindya Roy, Mathew Magimai.-Doss and Sebastien Marcel. Boosting localized binary features for speech recognition. in Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012.
  21. Ramya Rasipuram and Mathew Magimai.-Doss. Combining Acoustic Data-Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation. in Proceedings of Interspeech, 2012.
  22. Serena Soldo, Mathew Magimai.-Doss and Hervé Bourlard. Synthetic References for Template-based ASR using Posterior Features. in Proceedings of Interspeech, 2012.
  23. Yang Sun, Mathew M. Doss, Jort F. Gemmeke, Bert Cranen, Louis ten Bosch and Lou Boves. Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR. in Proceedings of Interspeech, 2012.
  24. Yang Sun, Bert Cranen, Jort F. Gemmeke, Louis ten Bosch, Lou Boves and Mathew M. Doss. Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR. in Proceedings of Interspeech, 2012.
  25. Serena Soldo, Mathew Magimai.-Doss and Hervé Bourlard. Template-based ASR using Posterior Features and Synthetic References: comparing different TTS Systems. in SAPA-SCALE Conference, 2012.
  26. Youssef Oualil, Mathew Magimai.-Doss, Friedrich Faubel and Dietrich Klakow. Joint Detection and Localization of Multiple Speakers using a Probabilistic Steered Response Power. in SAPA-SCALE Conference, 2012.
  27. Youssef Oualil, Friedrich Faubel, Mathew Magimai.-Doss, and Dietrich Klakow. A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking. in Proceedings of European Signal Processing Conference (EUSIPCO), 2012.
  28. Ramya Rasipuram and Mathew Magimai.-Doss. Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), 2012. (pdf, bibtex)
  29. David Imseng, Ramya Rasipuram and Mathew Magimai.-Doss. Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2011. (pdf, bibtex) [Short listed for student best paper award]
  30. Anindya Roy, Mathew Magimai.-Doss and Sebastien Marcel. Fast Speaker verification on mobile Phone data using boosted slice classifiers. in IAPR IEEE International Joint Conference on Biometrics, October 2011. (pdf, bibtex)
  31. Mathew Magimai.-Doss, Ramya Rasipuram, Guillermo Aradilla, and Hervé Bourlard. Grapheme-based automatic speech recognition using KL-HMM. in Proceedings of Interspeech, August 2011. (pdf, bibtex)
  32. Joel Pinto, Mathew Magimai.-Doss and Hervé Bourlard. Hierarchical tandem features for ASR in Mandarin. in Proceedings of Interspeech, August 2011. (pdf, bibtex)
  33. David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai.-Doss. Improving non-native ASR through stochastic multilingual phoneme space transformations. in Proceedings of Interspeech, August 2011. (pdf, bibtex)
  34. Fabio Valente, Mathew Magimai.-Doss and Wen Wang. Analysis and comparison of recent MLP features for LVCSR systems. in Proceedings of Interspeech, August 2011. (pdf, bibtex)
  35. Ramya Rasipuram and Mathew Magimai.-Doss. Improving articulatory feature and phoneme recognition using multitask learning. in Proceedings of International Conference on Artificial Neural Networks (ICANN), June 2011. (pdf, bibtex)
  36. Ramya Rasipuram and Mathew Magimai.-Doss. Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2011. (pdf, bibtex)
  37. Serena Soldo, Mathew Magimai.-Doss, Joel Pinto, and Hervé Bourlard. Posterior features for template-based ASR. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2011. (pdf, bibtex)
  38. Anindya Roy, Mathew Magimai.-Doss, and Sebastien Marcel. Phoneme recognition using boosted binary feature. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2011. (pdf, bibtex)
  39. David Imseng, Hervé Bourlard, Mathew Magimai.-Doss, and John Dines. Language dependent universal phoneme posterior estimation for mixed-language speech recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2011. (pdf, bibtex)
  40. David Imseng, Mathew Magimai.-Doss and Hervé Bourlard. Hierarchical multilayer perceptron based language identification. in Proceedings of Interspeech, September 2010. (pdf, bibtex)
  41. Fabio Valente, Mathew Magimai.-Doss, Christian Plahl, Suman Ravuri, and Wen Wang. A comparative study of MLP front-ends for Mandarin ASR. in Proceedings of Interspeech, September 2010. (pdf, bibtex)
  42. David Imseng, Hervé Bourlard, and Mathew Magimai.-Doss. Towards mixed language recognition. in Proceedings of Interspeech, September 2010. (pdf, bibtex)
  43. Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Hervé Bourlard, and Daniel Gatica-Perez. Evaluating the robustness of privacy-sensitive audio features for speech detection in personal audio log scenarios. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), March 2010. (pdf, bibtex)
  44. Anindya Roy, Mathew Magimai.-Doss, and Sebastien Marcel. Boosted binary features for noise robust speaker verification. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), March 2010. (pdf, bibtex)
  45. Joel Pinto, Mathew Magimai.-Doss and Hervé Bourlard. MLP based hierarchical system for task adaptation in ASR. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2009. (pdf, bibtex)
  46. Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Daniel Gatica-Perez, and Hervé Bourlard. Speaker change detection with privacy-preserving audio cues. in Proceedings of ICMI-MLMI, November 2009. (pdf, bibtex)
  47. Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Hervé Bourlard, and Daniel Gatica-Perez. Investigating privacy-sensitive features for speech detection in multiparty conversations. in Proceedings of Interspeech, September 2009. (pdf, bibtex)
  48. Fabio Valente, Mathew Magimai.-Doss, Christian Plahl, and Suman Ravuri. Hierarchical processing of the modulation spectrum for GALE Mandarin. in Proceedings of Interspeech, September 2009. (pdf, bibtex)
  49. Guillermo Aradilla, Hervé Bourlard, and Mathew Magimai-Doss. Posterior features applied to speech recognition task with user-friendly vocabulary. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2009. (pdf, bibtex)
  50. Joel Pinto, G.S.V.S. Sivaram, H. Hermansky, and M. Magimai.-Doss. Volterra series for analyzing MLP-based phoneme posterior estimator. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2009. (pdf, bibtex)
  51. Weifeng Li, John Dines, Mathew Magimai-Doss, and Hervé Bourlard. Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2009. (pdf, bibtex)
  52. Guillermo Aradilla, Hervé Bourlard, and Mathew Magimai-Doss. Using KL-based acoustic models in a large vocabulary recognition task. in Proceedings of Interspeech, September 2008. (pdf, bibtex)
  53. Weifeng Li, John Dines, Mathew Magimai.-Doss, and Hervé Bourlard. Neural network based regression for robust overlapping speech recognition using microphone arrays. in Proceedings of Interspeech, September 2008. (pdf, bibtex)
  54. Tamara Tosic, Mathew Magimai Doss, and Hynek Hermansky. Using comparison of parallel phoneme probability streams for OOV word detection. in Proceedings of 16th European Signal Processing Conference (EUSIPCO), August 2008. (pdf, bibtex)
  55. Weifeng Li, John Dines, Mathew Magimai.-Doss, and Hervé Bourlard. MLP-based log spectral energy mapping for robust overlapping speech recognition. in Proceedings of 16th European Signal Processing Conference (EUSIPCO), August 2008. (pdf, bibtex)
  56. Joel Pinto, B. Yegnanarayana, Hynek Hermansky, and Mathew Magimai-Doss. Exploiting contextual information for improved phoneme recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2008. (pdf, bibtex)
  57. O. Cetin, M. Magimai-Doss, K. Livescu, A. Kantor, S. King, C. Bartels, and J. Frankel. Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2007. (pdf, bibtex)
  58. Joe Frankel, Mathew Magimai-Doss, Simon King, Karen Livescu, and Ozgur Cetin. Articulatory feature recognition MLPs trained on 2000 hours of telephone speech. in Proceedings of Interspeech, September 2007. (pdf, bibtex)
  59. E. Matusov, D. Hillard, M. Magimai-Doss, D. Hakkani-Tur, M. Ostendorf, and H. Ney. Improving speech translation with automatic boundary prediction. in Proceedings of Interspeech, September 2007. (pdf, bibtex)
  60. J. Fung, D. Hakkani-Tur, M. Magimai-Doss, E. Shriberg, S. Cuendet, and N. Mirghafori. Prosodic features and feature selection for multi-lingual sentence segmentation. in Proceedings of Interspeech, September 2007. (pdf, bibtex)
  61. M. Magimai.-Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori. New entropy based combination for sentence segmentation. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007. (pdf, bibtex)
  62. Octavian Cheng, John Dines, and Mathew Magimai-Doss. A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007. (pdf, bibtex)
  63. Ozgur Cetin, Arthur Kantor, Simon King, Chris Bartels, Mathew Magimai-Doss, Joe Frankel, and Karen Livescu. An articulatory feature-based tandem approach and factored observation modeling. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007. (pdf, bibtex)
  64. K. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. Lavoie. Manual transcription of conversational speech at the articulatory feature level. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007.
  65. K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko. Articulatory feature-based methods for acoustic and audio-visual speech recognition: summary from the 2006 JHU summer workshop. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2007.
  66. Guillaume Lathoud, Mathew Magimai.-Doss, and Hervé Bourlard. Threshold selection for unsupervised detection with an application to microphone arrays. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2006.
  67. Guillaume Lathoud, Mathew Magimai.-Doss, Bertrand Mesot, and Hervé Bourlard. Unsupervised spectral subtraction for noise-robust ASR. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2005. (pdf, bibtex)
  68. Guillaume Lathoud, Mathew Magimai.-Doss, and Bertrand Mesot. A spectrogram model for enhanced source localization and noise-robust ASR. in Proceedings of Interspeech 2005, September 2005. (pdf, bibtex)
  69. Guillaume Lathoud and Mathew Magimai.-Doss. A sector-based frequency-domain approach to detection and localization of multiple speakers. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), March 2005. (pdf, bibtex)
  70. S. Ikbal, H. Bourlard, and M. Magimai.-Doss. HMM/ANN based spectral peak location estimation for noise robust speech. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), March 2005. (pdf, bibtex)
  71. Hervé Bourlard, Samy Bengio, Mathew Magimai Doss, Qifeng Zhu, Bertrand Mesot, and Nelson Morgan. Towards using hierarchical posteriors for flexible automatic speech recognition systems. in Proceedings of RT04, 2004. (pdf, bibtex)
  72. Mathew Magimai.-Doss, Todd A. Stephenson, Shajith Ikbal, and Hervé Bourlard. Modelling auxiliary features in tandem systems. in Proceedings of International Conference on Spoken Language Processing (ICSLP), October 2004. (pdf, bibtex)
  73. Shajith Ikbal, Mathew Magimai.-Doss, Hemant Misra, and Hervé Bourlard. Spectro-temporal activity pattern (STAP) features for noise robust ASR. in Proceedings of International Conference on Spoken Language Processing (ICSLP), October 2004. (pdf, bibtex)
  74. Mathew Magimai-Doss, Samy Bengio, and Hervé Bourlard. Joint decoding for phoneme-grapheme continuous speech recognition. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), May 2004. (pdf, bibtex)
  75. Mathew Magimai-Doss, Todd A. Stephenson, Hervé Bourlard, and Samy Bengio. Phoneme-grapheme based automatic speech recognition system. in Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), December 2003. (pdf, bibtex)
  76. Mathew Magimai-Doss, Todd A. Stephenson, and Hervé Bourlard. Using pitch frequency information in speech recognition. in Proceedings of Eurospeech, September 2003. (pdf, bibtex)
  77. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks. in Proceedings of IEEE International Conference on Acoustic Speech Signal Processing (ICASSP), April 2003. (pdf, bibtex)
  78. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition. in Proceedings of International Conference on Spoken Language Processing (ICSLP), September 2002. (pdf, bibtex)
  79. Todd A. Stephenson, Jaume Escofet, Mathew Magimai-Doss, and Hervé Bourlard. Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables. in Proceedings of the IEEE Signal Processing Society Workshop on Neural Networks for Signal Processing (NNSP), September 2002. (pdf, bibtex)
  80. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Mixed Bayesian networks with auxiliary variables for automatic speech recognition. in Proceedings of International Conference on Pattern Recognition (ICPR), August 2002. (pdf, bibtex)
  81. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Modeling auxiliary information in Bayesian network based ASR. in Proceedings of Eurospeech, September 2001. (pdf, bibtex)
  82. M. Mathew, B. Yegnanarayana, and R. Sundar. A neural network-based speaker verification system using suprasegmental features. in Proceedings of Eurospeech, September 1999.

Research reports

  1. Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert. End-to-End Acoustic Modeling using Convolutional Neural Networks for Automatic Speech Recognition. Idiap Research Report, Idiap-RR-18-2016, 2016. (pdf)
  2. Marzieh Razavi, Ramya Rasipuram and Mathew Magimai.-Doss. Towards Multiple Pronunciation Generation in Acoustic G2P Conversion Framework. Idiap Research Report, Idiap-RR-34-2015, 2015. (pdf)
  3. Marzieh Razavi and Mathew Magimai.-Doss. Posterior-Based Multi-Stream Formulation To Combine Multiple Grapheme-to-Phoneme Conversion Techniques. Idiap Research Report, Idiap-RR-33-2015, 2015. (pdf)
  4. Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert. Learning linearly separable features for speech recognition using convolutional neural networks. Idiap Research Report, Idiap-RR-24-2015, 2015. (Peer reviewed and presented at ICLR 2015)
  5. Marzieh Razavi, Ramya Rasipuram and Mathew Magimai.-Doss. On the application of automatic subword unit derivation and pronunciation generation for under-resourced language ASR: a study on Scottish Gaelic. Idiap Research Report, Idiap-RR-13-2015, 2015.
  6. Dimitri Palaz, Ronan Collobert and Mathew Magimai.-Doss. End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks. Idiap Research Report, Idiap-RR-40-2013, 2013. (Peer reviewed and presented at NIPS Deep learning Workshop 2013)
  7. Ramya Rasipuram and Mathew Magimai.-Doss. Probabilistic Lexical Modeling and Grapheme-based Automatic Speech Recognition. Idiap Research Report, Idiap-RR-14-2013, 2013.
  8. Ramya Rasipuram and Mathew Magimai.-Doss. KL-HMM and Probabilistic Lexical Modeling. Idiap Research Report, Idiap-RR-04-2013, 2013.
  9. Serena Soldo and Mathew Magimai.-Doss. Integrating Posterior Features and Self-Organizing Maps for Isolated Word Recognition without Dynamic Programming. Idiap Research Report, Idiap-RR-17-2012, 2012.
  10. Anindya Roy, Mathew Magimai.-Doss and Sebastien Marcel. Continuous Speech Recognition using Boosted Binary Features. Idiap Research Report, Idiap-RR-35-2011, 2011.
  11. Ramya Rasipuram and Mathew Magimai.-Doss. Multitask Learning to Improve Articulatory Feature Estimation and Phoneme Recognition. Idiap Research Report Idiap-RR-21-2011, 2011.
  12. Mathew Magimai.-Doss, Fabio Valente, Joel Pinto, Suman Ravuri, and Wen Wang. An investigation on genre-dependent acoustic modeling using MLP features for Mandarin ASR. Idiap Internal Report, Idiap-Internal-RR-179-2010, 2010.
  13. Serena Soldo, Mathew Magimai.-Doss, Joel Pinto, and Hervé Bourlard. On MLP-based posterior features for template-based ASR. Idiap Research Report Idiap-RR-37-2009, 2009.
  14. Mathew Magimai.-Doss, Guillermo Aradilla, and Hervé Bourlard. On joint modelling of grapheme and phoneme information using KL-HMM for ASR. Idiap Research Report Idiap-RR-24-2009, 2009.
  15. Marianna Pronobis and Mathew Magimai.-Doss. Analysis of F0 and cepstral features for robust automatic gender recognition. Idiap Research Report Idiap-RR-30-2009, 2009.
  16. Marianna Pronobis and Mathew Magimai.-Doss. Integrating audio and vision for robust automatic gender recognition. Idiap Research Report Idiap-RR-73-2008, 2008.
  17. Weifeng Li, Mathew Magimai.-Doss, and John Dines. Robust overlapping speech recognition based on neural networks. Idiap Research Report Idiap-RR-55-2007, 2007.
  18. Guillaume Lathoud, Mathew Magimai.-Doss, and Hervé Bourlard. Unsupervised spectral subtraction for noise-robust ASR on unknown transmission channels. Idiap Research Report Idiap-RR-09-2006, 2006.
  19. Mathew Magimai.-Doss, John Dines, Hervé Bourlard, and Hynek Hermansky. Improving continuous speech recognition system performance with grapheme modelling. Idiap Research Report Idiap-RR-16-2005, 2005.
  20. Mathew Magimai.-Doss, John Dines, Hervé Bourlard, and Hynek Hermansky. Phoneme vs grapheme based automatic speech recognition. Idiap Research Report Idiap-RR-48-2004, 2004.
  21. Mathew Magimai-Doss, Todd A. Stephenson, and Hervé Bourlard. Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems. Idiap Research Report Idiap-RR-62-2002, 2002.
  22. Mathew Magimai-Doss and Hervé Bourlard. Pronunciation models and their evaluation using confidence measures. Idiap Research Report Idiap-RR-29-2001, 2001.
  23. Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard. Automatic speech recognition using pitch information in dynamic Bayesian networks. Idiap Research Report Idiap-RR-41-2000, 2000.