ArabGend: Gender Analysis and Inference on Arabic Twitter

Research output: Contribution to journalConference articlepeer-review

1 Citation (Scopus)

Abstract

Gender analysis of Twitter can reveal important socio-cultural differences between male and female users. There has been a significant effort to analyze and automatically infer gender in the past for most widely spoken languages’ content, however, to our knowledge very limited work has been done for Arabic. In this paper, we perform an extensive analysis of differences between male and female users in the Arabic Twitter-sphere. We study differences in user engagement, topics of interest, and the gender gap in professions. Along with gender analysis, we also propose a method to infer gender by utilizing usernames, profile pictures, tweets, and networks of friends. In order to do so, we manually annotated gender and locations for ∼167K Twitter accounts associated with ∼92K user location, which we make publicly available.1 Our proposed gender inference method achieves an F1 score of 82.1% (47.3% higher than the majority baseline).

Original languageEnglish
Pages (from-to)124-135
Number of pages12
JournalProceedings - International Conference on Computational Linguistics, COLING
Volume29
Issue number4
Publication statusPublished - 16 Oct 2022
Event8th Workshop on Noisy User-Generated Text, W-NUT 2022 - Hybrid, Gyeongju, Korea, Republic of
Duration: 12 Oct 202217 Oct 2022

Fingerprint

Dive into the research topics of 'ArabGend: Gender Analysis and Inference on Arabic Twitter'. Together they form a unique fingerprint.

Cite this