Publications

The publications are listed in reverse chronological order; journal articles and patents are shown in red, book chapters in green.

  1. Milos Cernak, Afsaneh Asaei, Alexandre Hyafil
    Cognitive Speech Coding. In: IEEE Signal Processing Magazine, 2017 (accepted)
  2. Orozco-Arroyave, Juan Rafael and Vasquez-Correa, Juan Camilo and Vargas-Bonilla, Jesus Francisco and Arora, Raman and Dehak, Najim and Nidadavolu, Phani Sankar and Christensen, Heidi and Rudzicz, Frank and Yancheva, Maria and Vann, Alyssa and Vogler, Nikolai and Bocklet, Tobias and Cernak, Milos and Hannink, Julius and Elmar Nöth
    NeuroSpeech: An open-source software for Parkinson's speech analysis. In: Digital Signal Processing, 2017 (accepted)
  3. Milos Cernak, Alain Komaty, Amir Mohammadi, André Anjos and Sébastien Marcel
    Bob Speaks Kaldi. In Proc. of Interspeech (Show and tell demonstration), Aug. 2017, Stockholm, Sweden
  4. Afsaneh Asaei, Milos Cernak, Hervé Bourlard
    Perceptual Information Loss due to Impaired Speech Production. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017
  5. Milos Cernak, Juan Rafael Orozco-Arroyave, Frank Rudzicz, Heidi Christensen, Juan Camilo Vasquez and Elmar Nöth
    Characterisation of voice quality of Parkinson's disease using differential phonological posterior features. In: Computer Speech & Language 46, p. 196-208, November 2017
  6. Afsaneh Asaei, Milos Cernak, Hervé Bourlard
    Signal processing method and apparatus based on structured sparsity of phonological features. US 20170069306 A1, US Patent Application US 14/846,036, March 9, 2017
  7. Afsaneh Asaei, Milos Cernak, Hervé Bourlard and Dhananjay Ram
    Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment. Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017
  8. Milos Cernak, Elmar Nöth, Frank Rudzicz, Heidi Christensen, Juan Rafael Orozco-Arroyave, Raman Arora, Tobias Bocklet, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Juan Camilo Vasquez, Maria Yancheva, Alyssa Vann and Nikolai Vogler
    On the Impact of Non-modal Phonation On Phonological Features. Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), March 2017, New Orleans, USA
  9. Juan Camilo Vasquez-Correa, Juan Rafael Orozco-Arroyave, Raman Arora, Elmar Nöth, Najim Dehak, Heidi Christensen, Frank Rudzicz, Tobias Bocklet, Milos Cernak, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Maria Yancheva, Alyssa Vann and Nikolai Vogler
    Multi-view Representation Learning Via GCCA for Multimodal Analysis of Parkinson's Disease. Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), March 2017, New Orleans, USA
  10. Milos Cernak, Štefan Beňuš, Alexandros Lazaridis
    Speech vocoding for laboratory phonology. In: Computer Speech & Language 42, p. 100-121, 2017
  11. Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei, Philip N. Garner
    Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, In: IEEE/ACM Trans. on Audio, Speech and Language Processing, 24(12), December 2016
  12. Milos Cernak, Afsaneh Asaei, Hervé Bourlard
    On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, In: Speech Communication 84, p. 36-45, November 2016
  13. Afsaneh Asaei, Milos Cernak, Marina Laganaro
    PAoS Markers: Trajectory Analysis of Selective Phonological Posteriors for Assessment of Progressive Apraxia of Speech. In Proc. of 7th Workshop on Speech and Language Processing for Assistive Technologies, Sept. 2016, San Francisco, USA
  14. Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet, Philip N. Garner
    Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis. In Proc. of 9th ISCA Speech Synthesis Workshop, Sept. 2016, Sunnyvale, California, USA
  15. Milos Cernak, Philip N. Garner
    PhonVoc: A Phonetic and Phonological Vocoding Toolkit. In: Interspeech 2016, San Francisco, USA
  16. Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner, Hervé Bourlard
    Sound Pattern Matching for Automatic Prosodic Event Detection. In: Interspeech 2016, San Francisco, USA
  17. Ramya Rasipuram, Milos Cernak, Mathew Magimai.-Doss
    HMM-based Non-native Accent Assessment using Posterior Features. In: Interspeech 2016, San Francisco, USA
  18. Afsaneh Asaei, Gil Luyet, Milos Cernak, Hervé Bourlard
    Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures. In: Interspeech 2016, San Francisco, USA
  19. Alexandros Lazaridis, Milos Cernak, Philip N. Garner
    Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody. In: Interspeech 2016, San Francisco, USA
  20. Tamas Csapo, Geza Nemeth, Milos Cernak, Philip N. Garner
    Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder. In: Proc. of EUSIPCO 2016, Budapest, Hungary
  21. Tamás Gábor Csapó, Géza Németh, Milos Cernak
    Residual-based excitation with continuous F0 modeling in HMM-based speech synthesis. In: 3rd International Conference on Statistical Language and Speech Processing, Nov. 2015, Budapest, Hungary
  22. Afsaneh Asaei, Milos Cernak, Hervé Bourlard
    On Compressive Sampling of Neural Network Phonological Features for Speech Coding. In: Interspeech 2015, Dresden, Germany
  23. Milos Cernak, Pierre-Edouard Honnet
    An Empirical Model of Emphatic Word Detection. In: Interspeech 2015, Dresden, Germany
  24. Alexandre Hyafil, Milos Cernak
    Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection. In: Interspeech 2015, Dresden, Germany
  25. Ramya Rasipuram, Milos Cernak, Alexandre Nanchen, Mathew Magimai.-Doss
    Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities. In: Interspeech 2015, Dresden, Germany
  26. Milos Cernak, Philip N. Garner, Alexandros Lazaridis, Petr Motlicek and Xingyu Na
    Incremental Syllable-Context Phonetic Vocoding. In: IEEE/ACM Trans. on Audio, Speech, and Language Processing, 23(6), June 2015
  27. Milos Cernak, Blaise Potard and Philip N. Garner
    Phonological Vocoding Using Artificial Neural Networks. In: Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP). Brisbane, Australia, 2015
  28. Milos Cernak, Alexandros Lazaridis, Philip N. Garner and Petr Motlicek
    Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding. In: INTERSPEECH 2014, pp. 2799–2803, Singapore
  29. Petr Motlicek, David Imseng, Milos Cernak and Namhoon Kim
    Development of Bilingual ASR System for MediaParl Corpus. In: INTERSPEECH 2014, pp. 1391–1394, Singapore
  30. Lakshmi Saheer and Milos Cernak
    Automatic Staging of Audio with Emotions. In: International Conference on Affective Computing and Intelligent Interaction, 2-5 September 2013, Geneva, Switzerland
  31. Milos Cernak, Xingyu Na and Philip N. Garner
    Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture. In: INTERSPEECH 2013, pp. 3449-3452, Lyon, France
  32. Milos Cernak, Petr Motlicek and Philip N. Garner
    On the (Un)importance of the Contextual Factors In HMM-Based Speech Synthesis and Coding. In: Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 8140 - 8143, Vancouver, Canada, 2013
  33. Philip N. Garner, Milos Cernak and Petr Motlicek
    A Simple Continuous Pitch Estimation Algorithm. In: IEEE Signal Processing Letters. Vol 20, no. 1, p. 102 - 105. January 2013
  34. KANTOR Arthur - CERŇAK, Miloš - HAVELKA, Jiři - HUBER, Sean - KLEINDIENST, Jan - GANZALEZ, Doris B.
    Reading Companion: The Technical and Social Design of an Automated Reading Tutor. In Workshop on Child, Computer and Interaction, September 2012, Portland, Oregon, U.S.A.
  35. CERŇAK, Miloš - IMSENG, David - BOURLARD, Hervé
    Robust triphone mapping for acoustic modeling. In INTERSPEECH 2012, Portland, Oregon, U.S.A.
  36. DARJAA, Sakhia - CERŇAK, Miloš - TRNKA, Marián - RUSKO, Milan - SABO, Róbert
    Effective triphone mapping for acoustic modeling in speech recognition. In INTERSPEECH 2011 : Speech Science and Technology for Real Life, p. 1717-1720. ISSN 1990-9772.
  37. BEŇUŠ, Štefan - CERŇAK, Miloš - RUSKO, Milan - TRNKA, Marián - DARJAA, Sakhia
    Adapting Slovak ASR for native Germans speaking Slovak. In Proceedings of EMNLP 2011 : conference on Empirical Methods in Natural Language. - Stroudsburg : Association for Computational Linguistics, 2011, p. 60-64. ISBN 978-1-937284-13-8.
  38. DARJAA, Sakhia - CERŇAK, Miloš - BEŇUŠ, Štefan - RUSKO, Milan - SABO, Róbert - TRNKA, Marián
    Rule-based triphone mapping for acoustic modeling in automatic speech recognition. In Text, speech and dialogue : 14th International Conference, TSD 2011. - Berlin : Springer, 2011, LNAI 6836, p. 268-275. ISBN 978-3-642-23537-5. ISSN 0302-9743.
  39. BEŇUŠ, Štefan - CERŇAK, Miloš - RUSKO, Milan - TRNKA, Marián - DARJAA, Sakhia - SABO, Róbert
    Semi-automatic approach to ASR errors categorization in multi-speaker Corpora. In Natural Language Processing Multilinguality : Sixth International Conference. - Brno : Tribun EU, 2011, p. 9-17. ISBN 978-80-263-0049-6.
  40. RUSKO, Milan - JUHÁR, Jozef - TRNKA, Marián - STAŠ, Ján - DARJAA, Sakhia - HLÁDEK, Daniel - CERŇAK, Miloš - PAPCO, Marek - SABO, Róbert - PLEVA, Matúš - RITOMSKÝ, Marian - LOJKA, Martin
    Slovak automatic transcription and dictation system for the judicial domain. In Human Language Technologies as a Challenge for Computer Science and Linguistics : 5th Language & Technology Conference. - Poznań : Fundacja Uniwersytetu Im. A. Mickiewicza, 2011, p. 365-369. ISBN 978-83-932640-1-8.
  41. DARJAA, Sakhia - TRNKA, Marián - CERŇAK, Miloš - RUSKO, Milan - SABO, Róbert - HLUCHÝ, Ladislav
    HMM speech synthesizer in Slovak. In GCCP 2011 : 7th International Workshop on Grid Computing for Complex Problems. - Bratislava : Institute of Informatics SAS, 2011, p. 212-221. ISBN 978-80-970145-5-1.
  42. CERŇAK, Miloš
    Spracovanie chýb v rozpoznávaní reči: kapitola 8. In JUHÁR, Jozef. Rečové technológie v telekomunikačných a informačných systémoch. - Košice : EQUILIBRIA, s.r.o., 2011, p. 283-300. ISBN 978-80-89284-75-7 (In Slovak)
  43. M. Cernak
    Diagnostics for Debugging Speech Recognition Systems. In: Text, Speech and Dialogue, Lecture Notes in Computer Science, 2010, Volume 6231/2010, 251-258, DOI: 10.1007/978-3-642-15760-8_32
  44. M. Cernak
    A Comparison of Decision Tree Classifiers for Automatic Diagnosis of Speech Recognition Errors. In Computing and Informatics. Vol. 29, no. 3 (2010), p. 489-501. ISSN 1335-9150.
  45. M. Cernak
    Diagnostic Evaluation of Synthetic Speech Using Speech Recognition. In: The 16th International Congress on Sound and Vibration, Krakow, Poland 5-9 July 2009.
  46. M. Cernak
    A Contribution of Intrinsic Speech Variabilities to Errors Done by Speech Recognition. In: Acoustics and Speech Processing, Bratislava 2008. ISBN 978-80-969202-8-0, pp. 149-156.
  47. M. Cernak and S. Darjaa
    Noisy Speech Recognition Failure Diagnosis Using Minimum Message Length Decision Trees. In Proceedings of the 15th International Conference on Systems, Signals and Image Processing (IWSSIP), June 25-28, 2008, Bratislava, Slovak Republic
  48. M. Cernak, M. Benzeghiba and Ch. J. Wellekens
    Diagnostics of Speech Recognition: on Evaluating Feature Set Performance. In: Proceedings of the 12th International Conference on Speech and Computer (SPECOM), vol. I, pp. 188-193. Moscow 2007. ISBN 6-7452-0110-X
  49. M. Cernak and Ch. J. Wellekens
    Diagnostics of Speech Recognition Using Classification Phoneme Diagnostic Trees, CI 2006, special session on NLP, San Francisco, USA
  50. M. Cernak and Ch. J. Wellekens
    Emotional aspects of intrinsic speech variabilities in automatic speech recognition, 11th SPECOM'2006, June 25-29, 2006, Saint-Petersburg, Russia
  51. M. Cernak
    Unit selection speech synthesis in noise, ICASSP 2006, 31st IEEE ICASSP, May 14-19, 2006, Toulouse, France
  52. V. Tyagi, M. Benzeghiba, M. Cernak and Ch. Wellekens
    Comparative Study of Different Features on OLLO Logatome Recognition Task, Speech Recognition and Intrinsic Variation Workshop, May 20, 2006 Toulouse, France
  53. M. Cernak and M. Trnka
    Development of a Real-Time ASR System for Slovak Speechdat Database, 5th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services, June 2005, Smolenice, Slovakia
  54. T. Dutoit and M. Cernak
    TTSBOX: A Matlab Toolbox for Teaching Text-to-Speech Synthesis, 30th IEEE ICASSP, March 2005, Philadelphia, PA, USA
  55. M. Cernak and M. Rusko
    An evaluation of a synthetic speech using the PESQ measure, Proceedings of Forum Acusticum 2005, August 29 - September 2, Budapest, Hungary
  56. M. Cernak
    The Use of Objective Quality Measures in Corpus-based Speech Synthesis, Ph.D. Thesis, September 2004, Slovak University of Technology, Bratislava, Slovakia (in Slovak, with an English abstract)
  57. M. Rusko, M. Trnka, S. Darjaa and M. Cernak
    Slovak Speech Database for Experiments and Application Building in Unit Selection Speech Synthesis. In P. Sojka, I. Kopecek, and K. Pala, editors, Proceedings of TSD 2004. Springer, Brno, 2004
  58. M. Cernak
    Slovko: A Phoneme-based Speech Re-sequencing System in MATLAB. In 3rd International Conference on Emerging Telecommunication Technologies and Applications, pages 95-98, 2004, Kosice, Slovakia
  59. M. Cernak and G. Rozinaj
    Forward masking phenomenon in concatenative speech synthesis. In Proceedings of the 4th EURASIP Conference focused on Video/Image Processing and Multimedia Communications, July 2-5, 2003, pages 691-694 vol. 2, Zagreb, Croatia
  60. M. Cernak, M. Rusko, M. Trnka, and S. Darzagin
    Data-driven Versus Knowledge-based Approaches to Orthoepic Transcription in Slovak. In International Conference on Emerging Telecommunication Technologies and Applications, pages 95-98, 2003, Kosice, Slovakia
  61. M. Cernak
    Speech Synthesis in IVR Systems: A Proposal of the R&D Environment. In ELOSYS 2002, October 15-18, 2002, Trencín, Slovakia (in Slovak)
  62. M. Cernak
    Speech Synthesis Research in Virtual Reality Systems. In Proceedings of the Int. Conference Research in Telecommunication Technology, September 17-19, 2002, Zilina, Slovakia

Technical Reports

  1. Sucheta Ghosh and Milos Cernak
    An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings, Research report Idiap-RR-02-2016, Idiap, Jan 2016, Martigny, Switzerland
  2. Philip N. Garner, Milos Cernak and Blaise Potard
    A simple continuous excitation model for parametric vocoding, Research report Idiap-RR-03-2015, Idiap, Jan 2015, Martigny, Switzerland
  3. G. Szaszak, M. Cernak, P.N. Garner, P. Motlicek, A. Nanchen
    Automatic Speech Indexing System of Bilingual Video Parliament Interventions, Research report Idiap-RR-25-2013, Idiap, July 2013, Martigny, Switzerland
  4. Sandrine Revaz and Milos Cernak
    Baseline System for Automatic Speech Recognition with French GlobalPhone Database, Research report RR-26-2012, Idiap, Aug 2012, Martigny, Switzerland
  5. Milos Cernak, Philip N. Garner and Petr Motlicek
    Progress report of a project in very low bit-rate speech coding, Research report RR-08-2012, Idiap, Feb 2012, Martigny, Switzerland
  6. M. Cernak
    DASR: A diagnostic tool for automatic speech recognition, Rapport de recherche RR-06-182, EURECOM, December 2006, Sophia-Antipolis, France
  7. M. Cernak
    Command Speech Interface to Virtual Reality Applications, Technical Report, Virtual Reality Application Center, Iowa State University, June 2002, USA

Talks

  1. Failure Diagnosis Using Decision Trees: A Case Study of Human and Computer Speech Recognition
    Telecommunications Forum, Telecommunication Research Center Vienna, September 2007, Austria
  2. Diagnostics of Speech Recognition Using Classification Phoneme Diagnostic Trees
    DIVINES meeting, Avignon, June 2006, France
  3. Design, Implementation and Evaluation of the Robust Slovak Speech Synthesis in Noise
    Talk at EURECOM, Sophia-Antipolis, October 2005, France
  4. Should Speech Synthesizers Have Ears?
    Telecommunications Forum, Telecommunication Research Center Vienna, March 2003, Austria

Awards

  • 2012 (team) Prize of the Minister of Education, Science, Research and Sport of the Slovak Republic for the best national scientific-technological team of 2012.
  • 2011 (team) Prize of the Slovak Academy of Sciences for the best applicable research project.
  • Principal investigator of a research grant awarded by the Scientific Grant Agency of the Ministry of Education of the Slovak Republic and the Academy of Sciences (2010-2013). Project: Automatic meeting speech recognition with application to courtroom recordings transcription.
  • Principal investigator of a research grant awarded by the Scientific Grant Agency of the Ministry of Education of the Slovak Republic and the Academy of Sciences (2007-2010). Project: Robust speech technologies for information systems and their diagnostics.
  • Boeing scholarship: visiting scientist at the Virtual Reality Application Center, Iowa State University, 2002.