Automatic detection of idiomatic clauses

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

21 Scopus citations


We describe several experiments whose goal is to automatically identify idiomatic expressions in written text. We explore two approaches to the task: 1) idiom recognition as outlier detection; and 2) supervised classification of sentences. For outlier detection, we apply principal component analysis. Detecting idioms as lexical outliers does not exploit class label information, so in the subsequent experiments we use linear discriminant analysis to obtain a discriminant subspace and then apply a three-nearest-neighbor classifier in that subspace to measure accuracy. We discuss the pros and cons of each approach. All the approaches are more general than previous algorithms for idiom detection: they do not rely on target idiom types, lexicons, or large manually annotated corpora, and they do not limit the search space to a particular type of linguistic construction.
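To make the first approach concrete, here is a minimal sketch of outlier detection with principal component analysis. The abstract does not say how outliers are scored, so the rule used here (distance from the leading principal direction, i.e. reconstruction error) is an assumption, as are all function names and the toy numeric vectors standing in for sentence feature vectors. It is an illustration of the general idea, not the authors' implementation.

```python
import math
import random


def mean_center(rows):
    """Subtract the per-dimension mean from every row."""
    n, d = len(rows), len(rows[0])
    mu = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - mu[j] for j in range(d)] for r in rows]


def top_component(centered, iters=200, seed=0):
    """Leading principal direction via power iteration on X^T X."""
    rng = random.Random(seed)
    d = len(centered[0])
    v = [rng.random() + 0.1 for _ in range(d)]
    for _ in range(iters):
        # Compute w = X^T (X v) without forming X^T X explicitly.
        proj = [sum(x[j] * v[j] for j in range(d)) for x in centered]
        w = [sum(p * x[j] for p, x in zip(proj, centered)) for j in range(d)]
        norm = math.sqrt(sum(c * c for c in w))
        if norm == 0.0:
            break
        v = [c / norm for c in w]
    return v


def reconstruction_errors(rows):
    """Distance of each row from the one-dimensional principal subspace."""
    centered = mean_center(rows)
    v = top_component(centered)
    errors = []
    for x in centered:
        t = sum(a * b for a, b in zip(x, v))  # coordinate along v
        errors.append(math.sqrt(sum((a - t * b) ** 2 for a, b in zip(x, v))))
    return errors


# Hypothetical toy data: eight points on the line y = x plus one point off it.
vectors = [[float(i), float(i)] for i in range(1, 9)] + [[4.0, 8.0]]
errors = reconstruction_errors(vectors)
outlier_index = max(range(len(errors)), key=errors.__getitem__)
print(outlier_index)  # the off-line point, at index 8, has the largest error
```

In this sketch no class labels are used: a vector is flagged only because it lies far from the direction that explains most of the variance, which is the property the abstract contrasts with the supervised second approach.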

Original language: English
Title of host publication: Computational Linguistics and Intelligent Text Processing - 14th International Conference, CICLing 2013, Proceedings
Number of pages: 12
Edition: PART 1
State: Published - 2013
Event: 14th Annual Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2013 - Samos, Greece
Duration: 24 Mar 2013 → 30 Mar 2013

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number: PART 1
Volume: 7816 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349




