TY - GEN
T1 - Palm
T2 - 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025
AU - Alwajih, Fakhraddin
AU - Mekki, Abdellah El
AU - Magdy, Samar Mohamed
AU - Elmadany, Abdelrahim A.
AU - Nacar, Omer
AU - Nagoudi, El Moatez Billah
AU - Abdel-Salam, Reem
AU - Atwany, Hanin
AU - Nafea, Youssef
AU - Yahya, Abdulfattah Mohammed
AU - Alhamouri, Rahaf
AU - Alsayadi, Hamzah A.
AU - Zayed, Hiba
AU - Shatnawi, Sara
AU - Sibaee, Serry
AU - Ech-Chammakhy, Yasir
AU - Al-Dhabyani, Walid
AU - Ali, Marwa Mohamed
AU - Jarraya, Imen
AU - El-Shangiti, Ahmed Oumar
AU - Alraeesi, Aisha
AU - Al-Ghrawi, Mohammed Anwar
AU - Al-Batati, Abdulrahman S.
AU - Mohamed, Elgizouli
AU - Elgindi, Noha Taha
AU - Saeed, Muhammed
AU - Atou, Houdaifa
AU - Yahia, Issam Ait
AU - Bouayad, Abdelhak
AU - Machrouh, Mohammed
AU - Makouar, Amal
AU - Alkawi, Dania
AU - Mohamed, Mukhtar
AU - Abdelfadil, Safaa Taher
AU - Ounnoughene, Amine Ziad
AU - Anfel, Rouabhia
AU - Assi, Rwaa
AU - Sorkatti, Ahmed
AU - Tourad, Mohamedou Cheikh
AU - Koubaa, Anis
AU - Berrada, Ismail
AU - Jarrar, Mustafa
AU - Shehata, Shady
AU - Abdul-Mageed, Muhammad
N1 - Publisher Copyright:
© 2025 Association for Computational Linguistics.
PY - 2025
Y1 - 2025
N2 - As large language models (LLMs) become increasingly integrated into daily life, ensuring their cultural sensitivity and inclusivity is paramount. We introduce PALM, a year-long community-driven project covering all 22 Arab countries. The dataset includes instructions (input, response pairs) in both Modern Standard Arabic (MSA) and dialectal Arabic (DA), spanning 20 diverse topics. Built by a team of 44 researchers across the Arab world, all of whom are authors of this paper, PALM offers a broad, inclusive perspective. We use PALM to evaluate the cultural and dialectal capabilities of several frontier LLMs, revealing notable limitations. For instance, while closed-source LLMs generally exhibit strong performance, they are not without flaws, and smaller open-source models face greater challenges. Moreover, certain countries (e.g., Egypt, the UAE) appear better represented than others (e.g., Iraq, Mauritania, Yemen). Our annotation guidelines, code, and data for reproducibility are publicly available. More information about PALM is available at our project page: https://github.com/UBC-NLP/palm.
AB - As large language models (LLMs) become increasingly integrated into daily life, ensuring their cultural sensitivity and inclusivity is paramount. We introduce PALM, a year-long community-driven project covering all 22 Arab countries. The dataset includes instructions (input, response pairs) in both Modern Standard Arabic (MSA) and dialectal Arabic (DA), spanning 20 diverse topics. Built by a team of 44 researchers across the Arab world, all of whom are authors of this paper, PALM offers a broad, inclusive perspective. We use PALM to evaluate the cultural and dialectal capabilities of several frontier LLMs, revealing notable limitations. For instance, while closed-source LLMs generally exhibit strong performance, they are not without flaws, and smaller open-source models face greater challenges. Moreover, certain countries (e.g., Egypt, the UAE) appear better represented than others (e.g., Iraq, Mauritania, Yemen). Our annotation guidelines, code, and data for reproducibility are publicly available. More information about PALM is available at our project page: https://github.com/UBC-NLP/palm.
UR - https://www.scopus.com/pages/publications/105021032585
M3 - Conference contribution
AN - SCOPUS:105021032585
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 32871
EP - 32894
BT - Long Papers
A2 - Che, Wanxiang
A2 - Nabende, Joyce
A2 - Shutova, Ekaterina
A2 - Pilehvar, Mohammad Taher
PB - Association for Computational Linguistics (ACL)
Y2 - 27 July 2025 through 1 August 2025
ER -