Publications of Mathias Creutz
2009
- Mathias Creutz, Sami Virpioja, and
Anna Kovaleva (2009).
- Web augmentation of language models for
continuous speech recognition of SMS text messages. In Proc. EACL
2009, 30 March - 3 April, Athens, Greece, pages 157-165.
[ Publisher's site ]
- Mikko Kurimo, Mathias Creutz, and Ville
Turunen (2009).
- Morpho Challenge
evaluation by information retrieval. In Advances in
Multilingual and MultiModal Information Retrieval, 9th
Workshop of the Cross-Language Evaluation Forum, CLEF 2008,
Aarhus, Denmark, September 17-19, 2008, Revised Selected
Papers, Lecture Notes in Computer Science, pages 991-998. Springer.
2008
- Mikko Kurimo, Mathias Creutz, and Matti
Varjokallio (2008).
- Morpho Challenge evaluation using a linguistic
Gold Standard. In Advances
in Multilingual and MultiModal Information Retrieval, 8th Workshop
of the Cross-Language Evaluation Forum, CLEF 2007, Budapest,
Hungary, September 19-21, 2007, Revised Selected Papers, Lecture
Notes in Computer Science , Vol. 5152, pages 864-873. Springer.
- David Ellis, Mathias Creutz, Timo
Honkela, and Mikko Kurimo (2008).
- Speech to speech
machine translation: Biblical chatter from Finnish to
English. In Proceedings of the IJCNLP-08
Workshop on NLP for Less Privileged Languages, pages 123-130,
Hyderabad, India, January 2008. Asian Federation of Natural
Language Processing.
2007
- Mathias Creutz, Teemu Hirsimäki, Mikko
Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio,
Ebru Arisoy, Murat Saraclar, and Andreas Stolcke.
- Morph-Based Speech
Recognition and Modeling of Out-of-Vocabulary Words Across Languages.
ACM Transactions on Speech and Language Processing, Volume 5, Issue 1,
Article No. 3, December 2007.
[ Publisher's site ]
- Mikko Kurimo, Mathias Creutz, Ville
Turunen (2007).
- Overview of Morpho Challenge in CLEF
2007. In Working Notes of the CLEF 2007 Workshop. Edited by Alessandro
Nardi and Carol Peters. 19-21 September, Budapest, Hungary.
[ PDF ]
- Mikko Kurimo, Mathias Creutz, Matti
Varjokallio (2007).
- Unsupervised Morpheme Analysis Evaluation by a
Comparison to a Linguistic Gold Standard @ Morpho Challenge 2007. In
Working Notes of the CLEF 2007 Workshop. Edited by Alessandro Nardi and Carol
Peters. 19-21 September, Budapest, Hungary.
[ PDF ]
- Mikko Kurimo, Mathias Creutz, and Ville
Turunen (2007).
- Unsupervised
Morpheme Analysis Evaluation by IR experiments @ Morpho Challenge
2007. In Working Notes of the CLEF 2007 Workshop. Edited by Alessandro
Nardi and Carol Peters. 19-21 September, Budapest, Hungary.
[ PDF ]
- Sami Virpioja, Jaakko J. Väyrynen,
Mathias Creutz, and Markus Sadeniemi (2007).
- Morphology-Aware
Statistical Machine Translation Based on Morphs Induced in an Unsupervised
Manner. In Proceedings of Machine Translation Summit XI, Copenhagen,
Denmark, 10 - 14 September, pages 491-498.
[ PDF ]
- Vesa Siivola, Mathias Creutz and Mikko
Kurimo (2007).
- Morfessor and VariKN machine learning tools for
speech and language technology. In Interspeech 2007, August.
[ PDF ]
- Mathias Creutz, Teemu
Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen,
Vesa Siivola, Matti Varjokallio, Ebru Arisoy, Murat Saraclar, and
Andreas Stolcke (2007).
- Analysis of Morph-Based Speech
Recognition and the Modeling of Out-of-Vocabulary Words Across
Languages. In Proceedings of Human Language
Technologies / The Annual Conference of the North American Chapter of
the Association for Computational Linguistics (NAACL-HLT 2007),
Rochester, NY, USA, 23-25 April, pages 380-387.
[ PDF ]
- Mathias Creutz and Krista Lagus (2007).
- Unsupervised Models for Morpheme Segmentation and
Morphology Learning. ACM Transactions on Speech and Language
Processing, Volume 4, Issue 1, January 2007.
[ Publisher's site ]
2006
- Teemu Hirsimäki, Mathias Creutz,
Vesa Siivola, Mikko Kurimo, Sami Virpioja, and Janne Pylkkönen (2006).
- Unlimited Vocabulary Speech Recognition with Morph
Language Models Applied to Finnish. Computer Speech and
Language, Volume 20, Issue 4, October 2006, pages 515-541.
[ Publisher's site ]
[ PDF (manuscript) ]
- Mathias Creutz, Krista Lagus, and Sami
Virpioja (2006).
- Unsupervised Morphology Induction Using
Morfessor. In Finite-State Methods and Natural Language
Processing, Lecture Notes in Computer Science, Volume 4002, pages
300-301, Springer Berlin / Heidelberg.
[ Publisher's site ]
- Mikko Kurimo, Mathias Creutz, Matti
Varjokallio, Ebru Arisoy, and Murat Saraclar (2006).
- Unsupervised segmentation of words into morphemes -
Morpho Challenge 2005: Application to Automatic Speech
Recognition. In the Proceedings of the International
Conference on Spoken Language Processing - Interspeech 2006 ICSLP.
Pittsburgh, Pennsylvania, USA, September 17-21.
[ PDF ]
- Mathias Creutz (2006).
- Induction of the Morphology of Natural Language: Unsupervised
Morpheme Segmentation with Application to Automatic Speech
Recognition. Doctoral thesis, Dissertations in Computer and
Information Science, Report D13, Helsinki University of Technology, Espoo,
Finland.
[ Electronic archive
at the TKK library ]
- Mathias Creutz and Krista Lagus (2006).
- Morfessor in the Morpho Challenge. In the Proceedings of
the PASCAL Challenge Workshop on Unsupervised segmentation of words into
morphemes, Venice, Italy, April 12.
[ PDF ]
- Mikko Kurimo, Mathias Creutz, Matti Varjokallio,
Ebru Arisoy, and Murat Saraclar (2006).
- Unsupervised segmentation of
words into morphemes - Challenge 2005, An Introduction and Evaluation
Report. In the Proceedings of the PASCAL Challenge Workshop on
Unsupervised segmentation of words into morphemes, Venice, Italy, April
12.
[ PDF ]
2005
- Mathias Creutz and Krista Lagus (2005).
- Inducing the Morphological Lexicon of a Natural Language
from Unannotated Text.
In Proceedings of the International and Interdisciplinary Conference on
Adaptive Knowledge Representation and Reasoning (AKRR'05), pages
106-113, Espoo, Finland, June.
[ PDF ]
- Teemu Hirsimäki, Mathias Creutz,
Vesa Siivola and Mikko Kurimo (2005).
- Morphologically
Motivated Language Models in Speech Recognition. In
Proceedings of the International and Interdisciplinary Conference on
Adaptive Knowledge Representation and Reasoning (AKRR'05), pages
121-126, Espoo, Finland,
June.
[ PDF ]
- Mathias Creutz, Krista Lagus, Krister
Lindén, and Sami Virpioja (2005).
- Morfessor and Hutmegs: Unsupervised Morpheme
Segmentation for Highly-Inflecting and Compounding Languages.
In Proceedings of the Second Baltic Conference on Human Language
Technologies, pages 107-112, Tallinn, Estonia, 4 - 5 April.
[ PDF ]
[ PS ]
- Mathias Creutz and Krista Lagus
(2005).
- Unsupervised Morpheme Segmentation and Morphology
Induction from Text Corpora Using Morfessor 1.0.
Publications in Computer and Information Science,
Report A81, Helsinki University of Technology, March.
[ PDF ]
[ PS ]
2004
- Mathias Creutz and Krister Lindén
(2004).
- Morpheme Segmentation Gold Standards for Finnish and
English. Publications in Computer and Information Science,
Report A77, Helsinki University of Technology, October.
[ PDF ]
[ PS ]
- Krista Lagus, Mathias Creutz, and Sami
Virpioja (2004).
- Latent Linguistic Codes for Morphemes
using Independent Component Analysis. Ninth Neural
Computation and Psychology Workshop: Modelling Language, Cognition and
Action, Plymouth, England, September 8-10, New Jersey etc. 2005,
World Scientific.
- Mathias Creutz and Krista Lagus
(2004).
- Induction of a Simple Morphology for Highly-Inflecting
Languages. In Proceedings of the 7th Meeting of the ACL
Special Interest Group in Computational Phonology (SIGPHON), pages
43-51, Barcelona, Spain, 26 July.
[ PDF ]
[ PS ]
2003
- Vesa Siivola, Teemu Hirsimäki,
Mathias Creutz, and Mikko Kurimo (2003).
- Unlimited vocabulary
speech recognition based on morphs discovered in an unsupervised
manner. In Proceedings of the 8th European Conference on
Speech Communication and Technology (Eurospeech), pages 2293-2296,
Geneva, Switzerland, 1-4 September.
[ PDF ]
[ PS ]
- Kadri Hacioglu, Bryan Pellom,
Tolga Ciloglu, Ozlem Ozturk, Mikko Kurimo, and Mathias Creutz
(2003).
- On lexicon creation for Turkish LVCSR.
In Proceedings of the 8th European Conference on
Speech Communication and Technology (Eurospeech), pages 1165-1168,
Geneva, Switzerland, 1-4 September.
[ PDF ]
- Kadri Hacioglu, Bryan Pellom, Tolga
Ciloglu, Ozlem Ozturk, Mikko Kurimo, and Mathias Creutz (2003).
-
Word splitting for Turkish LVCSR. In Proceedings
of the Turkish Signal Processing Conference (SIU 2003), Istanbul, Turkey.
- Mathias Creutz (2003).
-
Unsupervised segmentation of words using prior distributions
of morph length and frequency. In Proceedings of ACL-03,
the 41st Annual Meeting of the Association of Computational
Linguistics, pages 280-287, Sapporo, Japan, 7-12 July.
[ PDF ]
[ PS ]
2002
- Krista Lagus, Anu Airola, and Mathias Creutz
(2002).
- Data analysis of conceptual similarities of
Finnish verbs. In Proceedings of CogSci 2002, the 24th
annual meeting of the Cognitive Science Society, Fairfax,
Virginia, USA, August 7-10.
[ PDF ]
[ PS ]
- Mathias Creutz, and Krista Lagus (2002).
- Unsupervised discovery of morphemes.
In Proceedings of the Workshop on Morphological and Phonological Learning
of ACL-02, pages 21-30, Philadelphia, Pennsylvania, USA, July 11.
[ PDF ]
[ PS ]
Page last updated: 24 March 2012