ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

23 Citations (Scopus)

Abstract

This paper describes the systems submitted by the Arabic Language Technology group (ALT) at SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media. We focus on sub-task A (Offensive Language Identification) for two languages: Arabic and English. Our efforts for both languages achieved more than 90% macro-averaged F1-score on the official test set. For Arabic, the best results were obtained by a system combination of Support Vector Machine, Deep Neural Network, and fine-tuned Bidirectional Encoder Representations from Transformers (BERT). For English, the best results were obtained by fine-tuning BERT.
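The macro-averaged F1-score cited above is the unweighted mean of the per-class F1 scores, so the minority (offensive) class counts as much as the majority class. The following is a minimal sketch of that metric in plain Python, not the authors' evaluation code; the function name `macro_f1` and the label strings `OFF` and `NOT` in the toy data are illustrative assumptions.

```python
def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 over all labels seen in gold or pred."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    labels = sorted(set(gold) | set(pred))
    scores = []
    for label in labels:
        pairs = list(zip(gold, pred))
        tp = sum(1 for g, p in pairs if g == label and p == label)
        fp = sum(1 for g, p in pairs if g != label and p == label)
        fn = sum(1 for g, p in pairs if g == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the class never occurs
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy data (hypothetical labels, not taken from the task's test set)
gold = ["OFF", "NOT", "NOT", "NOT"]
pred = ["OFF", "NOT", "NOT", "OFF"]
score = macro_f1(gold, pred)  # mean of F1(NOT) = 4/5 and F1(OFF) = 2/3
```

Because each class contributes equally, a system that labels everything as the majority class scores poorly on this metric even when its accuracy is high.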

Original language: English
Title of host publication: COLING 2020 - The International Workshop on Semantic Evaluation, Proceedings of the 14th Workshop
Editors: Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
Publisher: International Committee for Computational Linguistics
Pages: 1891-1897
Number of pages: 7
ISBN (Electronic): 9781952148316
Publication status: Published - 2020
Event: 14th International Workshops on Semantic Evaluation, SemEval 2020, co-located with COLING 2020 - Virtual, Online, Spain
Duration: 12 Dec 2020 to 13 Dec 2020

Publication series

Name: 14th International Workshops on Semantic Evaluation, SemEval 2020 - co-located 28th International Conference on Computational Linguistics, COLING 2020, Proceedings

Conference

Conference: 14th International Workshops on Semantic Evaluation, SemEval 2020, co-located with COLING 2020
Country/Territory: Spain
City: Virtual, Online
Period: 12/12/20 to 13/12/20
