TY - GEN
T1 - FEED PETs
T2 - 12th Joint Conference on Lexical and Computational Semantics, StarSEM 2023, co-located with ACL 2023
AU - Lee, Patrick
AU - Shode, Iyanuoluwa
AU - Trujillo, Alain Chirino
AU - Zhao, Yuan
AU - Ojo, Olumide Ebenezer
AU - Plancarte, Diana Cuevas
AU - Feldman, Anna
AU - Peng, Jing
N1 - Publisher Copyright: © 2023 Association for Computational Linguistics.
PY - 2023
Y1 - 2023
N2 - Transformers have been shown to work well for the task of English euphemism disambiguation, in which a potentially euphemistic term (PET) is classified as euphemistic or non-euphemistic in a particular context. In this study, we expand on the task in two ways. First, we annotate PETs for vagueness, a linguistic property associated with euphemisms, and find that transformers are generally better at classifying vague PETs, suggesting linguistic differences in the data that impact performance. Second, we present novel euphemism corpora in three different languages: Yoruba, Spanish, and Mandarin Chinese. We perform euphemism disambiguation experiments in each language using multilingual transformer models mBERT and XLM-RoBERTa, establishing preliminary results from which to launch future work.
UR - http://www.scopus.com/inward/record.url?scp=85175400017&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85175400017
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 437
EP - 448
BT - StarSEM 2023 - 12th Joint Conference on Lexical and Computational Semantics, Proceedings of the Conference
A2 - Palmer, Alexis
A2 - Camacho-Collados, Jose
PB - Association for Computational Linguistics (ACL)
Y2 - 13 July 2023 through 14 July 2023
ER -