FEED PETs: Further Experimentation and Expansion on the Disambiguation of Potentially Euphemistic Terms

Patrick Lee, Iyanuoluwa Shode, Alain Chirino Trujillo, Yuan Zhao, Olumide Ebenezer Ojo, Diana Cuevas Plancarte, Anna Feldman, Jing Peng

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Transformers have been shown to work well for the task of English euphemism disambiguation, in which a potentially euphemistic term (PET) is classified as euphemistic or non-euphemistic in a particular context. In this study, we expand on the task in two ways. First, we annotate PETs for vagueness, a linguistic property associated with euphemisms, and find that transformers are generally better at classifying vague PETs, suggesting linguistic differences in the data that impact performance. Second, we present novel euphemism corpora in three different languages: Yoruba, Spanish, and Mandarin Chinese. We perform euphemism disambiguation experiments in each language using multilingual transformer models mBERT and XLM-RoBERTa, establishing preliminary results from which to launch future work.

Original languageEnglish
Title of host publicationStarSEM 2023 - 12th Joint Conference on Lexical and Computational Semantics, Proceedings of the Conference
EditorsAlexis Palmer, Jose Camacho-Collados
PublisherAssociation for Computational Linguistics (ACL)
Pages437-448
Number of pages12
ISBN (Electronic)9781959429760
StatePublished - 2023
Event12th Joint Conference on Lexical and Computational Semantics, StarSEM 2023, co-located with ACL 2023 - Toronto, Canada
Duration: 13 Jul 202314 Jul 2023

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

Conference12th Joint Conference on Lexical and Computational Semantics, StarSEM 2023, co-located with ACL 2023
Country/TerritoryCanada
CityToronto
Period13/07/2314/07/23

Fingerprint

Dive into the research topics of 'FEED PETs: Further Experimentation and Expansion on the Disambiguation of Potentially Euphemistic Terms'. Together they form a unique fingerprint.

Cite this