GoldenWind at SemEval-2021 Task 5: Orthrus – An Ensemble Approach to Identify Toxicity

Research output: Contribution to journalConference proceedings published in a journalpeer-review

32 Downloads (Pure)

Abstract

Many new developments to detect and mitigate toxicity are currently being evaluated. We
are particularly interested in the correlation between toxicity and the emotions expressed in
online posts. While toxicity may be disguised
by amending the wording of posts, emotions
will not. Therefore, we describe here an ensemble method to identify toxicity and classify the emotions expressed on a corpus of
annotated posts published by Task 5 of SemEval 2021—our analysis shows that the majority of such posts express anger, sadness and
fear. Our method to identify toxicity combines
a lexicon-based approach, which on its own
achieves an F1 score of 61.07%, with a supervised learning approach, which on its own
achieves an F1 score of 60%. When both methods are combined, the ensemble achieves an F1
score of 66.37%.

Fingerprint

Dive into the research topics of 'GoldenWind at SemEval-2021 Task 5: Orthrus – An Ensemble Approach to Identify Toxicity'. Together they form a unique fingerprint.

Cite this