Jefrey Lijffijt
I am a postgraduate student and researcher at Aalto University, School of Science, Department of Information and Computer
Science, under the supervision of Heikki Mannila. I also work with
Panagiotis Papapetrou, Kai Puolamäki and Terttu
Nevalainen.
I am looking for fundamental breakthroughs in modeling sequential data and
natural language, and I have a particular interest in burstiness. My
thesis is tentatively titled "Data mining methods for sequential data and
corpus linguistics".
I have a funded position in the Finnish Doctoral Programme in Computational
Sciences (FICS) and I am also affiliated
with the Academy of Finland centre-of-excellence in Algorithmic Data Analysis
(ALGODAN), the
Helsinki Institute for Information Technology (HIIT) and the EU PASCAL2 Network of Excellence.
I am promoting the Corpus Resource Database (CoRD), because most
researchers in DM/ML are not familiar with this resource. It is
maintained at the University of Helsinki
and contains descriptions of and links to a large set of linguistic corpora,
many of which are freely available.
Contact information
E-mail: jefrey.lijffijt@aalto.fi
Office: Room B333, Konemiehentie 2, FI-02150 Espoo, Finland
Mail: Aalto University School of Science, Department of Information and
Computer Science, PO Box 15400, FI-00076 Aalto, Finland
Refereed Publications
Journal Articles
Nothing to report yet.
Conference Articles
- Turo Vartiainen, Jefrey Lijffijt. Premodifying -ing participles in the
parsed BNC. In Mukherjee, Joybrato and Magnus Huber (eds.) Corpus
Linguistics and Variation in English: Theory and Description, pages
247-258. Rodopi, Amsterdam, 2012. (Presentation)
- Jefrey Lijffijt, Panagiotis Papapetrou, Kai Puolamäki, Heikki
Mannila. Analyzing word frequencies in large text corpora using
inter-arrival times and bootstrapping. In Proceedings of the European
Conference on Machine Learning and Principles and Practice of Knowledge
Discovery in Databases (ECML-PKDD 2011), pages 341-357. Springer-Verlag,
Berlin-Heidelberg, 2011. (Article, Presentation, Poster)
Workshop Articles
- Kai Puolamäki, Panagiotis Papapetrou, Jefrey Lijffijt. Visually
controllable data mining methods. In Proceedings of the 2010 IEEE
International Conference on Data Mining Workshops, pages 409-417. IEEE
Computer Society, Washington, DC, USA, 2010. (Article)
- Jefrey Lijffijt, Panagiotis Papapetrou, Jaakko Hollmén, Vassilis
Athitsos. Benchmarking dynamic time warping for music retrieval. In
Proceedings of the 3rd International Conference on Pervasive Technologies
Related to Assistive Environments (PETRA), article 59. ACM, New York, NY,
USA, 2010. (Article)
- Jefrey Lijffijt, Panagiotis Papapetrou, Jaakko Hollmén. Tracking
your steps on the track: body sensor recordings of a controlled walking
experiment. In Proceedings of the 3rd International Conference on
Pervasive Technologies Related to Assistive Environments (PETRA), article
58. ACM, New York, NY, USA, 2010. (Data, Article)
Non-refereed Publications
Letters to Journals
- Jefrey Lijffijt, Stefan Th. Gries. Correction to Stefan Th. Gries'
"Dispersions and adjusted frequencies in corpora". International Journal
of Corpus Linguistics, 17 (1), 147-149, 2012.
Technical Reports
- Jefrey Lijffijt, Panagiotis Papapetrou, Niko Vuokko, Kai Puolamäki.
The smallest set of constraints that explains the data: a randomization
approach. TKK-ICS-R31, TKK Reports in Information and Computer Science,
Espoo, May 2010. (Report)
- Jefrey Lijffijt, Ingrid C. M. Flinsenberg. Compression-based activity
classification and motif discovery in time series of acceleration data.
TN-2008-00521, Koninklijke Philips Electronics N.V., Eindhoven, September 2008.
Master's Thesis
- Jefrey Lijffijt. Compression-based activity classification and motif
discovery in time series of acceleration data. Master's thesis, Utrecht
University, Sep. 2008.
Presentations
Invited Talks
- Jefrey Lijffijt. Are you talking Bernoulli to me? Significance testing
and burstiness of words in text corpora. Department of Mathematics and
Statistics, University of Jyväskylä, 11 November 2011,
Jyväskylä, Finland. (Presentation)
Conference Presentations and Posters
- Panagiotis Papapetrou, Jefrey Lijffijt, Tanja Säily, Kai
Puolamäki, Terttu Nevalainen, Heikki Mannila. Are you talking Bernoulli
to me? Comparing methods of assessing word frequencies. Helsinki Corpus
Festival, 28 Sep - 2 Oct, Helsinki, Finland, 2011. (Presentation)
- Turo Vartiainen, Jefrey Lijffijt. Can articles predict the word class of
the premodifier? A study of the -ing participle. ICAME 32, 1 - 5 June,
Oslo, Norway, 2011.
- Turo Vartiainen, Jefrey Lijffijt. Premodifying -ing participles in the
parsed BNC. ICAME 31, 26 - 30 May, Giessen, Germany, 2010. (Presentation)
- Jefrey Lijffijt, Harri Siirtola, Tanja Säily, Turo Vartiainen, Terttu
Nevalainen, Heikki Mannila. Towards interactive visual analysis of
corpora. ICAME 31, 26 - 30 May, Giessen, Germany, 2010. (Poster)
- Jefrey Lijffijt. Local and global lexicon: a novel approach to
quantifying persistence. XXXVII Kielitieteen päivät Helsingin
yliopistossa, 20 - 22 May, Helsinki, Finland, 2010. (Presentation)
Other Presentations and Posters
- Jefrey Lijffijt. Analysis of linguistic variation. Poster: Spring
Workshop on Mining and Learning (SML), Bad Neuenahr, Germany, 2012. (Poster)
Summary of current publications resulting from the DAMMOC project.
- Jefrey Lijffijt. Data mining tools for analysis of linguistic
variation. Poster: Lorentz Workshop on Mining Patterns and Subgroups,
Leiden, The Netherlands, 2010. (Poster)
My vision at the start of the DAMMOC project.
Page maintained by lijffijt at cis.hut.fi,
last updated Tuesday, 08-May-2012 19:48:01 EEST