Ara-Pic: A Framework for Enhancing Arabic Cultural Representation in AI-Generated Images

Wala Elsharif*, Mahmoud Alzubaidi, James She, Marco Agus

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recent advancements in text-to-image (TTI) models have demonstrated impressive generative capabilities but often fail to produce culturally relevant images, particularly for underrepresented cultures such as Arabic culture. In this paper, we introduce Ara-Pic, a novel approach that iteratively enhances image generation prompts to improve cultural relevance. Our method leverages the capabilities of vision-language models (VLMs) to culturally enhance prompts based on feedback from the Cultural Relevance Index (CRI), a metric designed to assess the cultural alignment of AI-generated images. By systematically modifying prompts to increase positive cultural amplification while reducing penalties, Ara-Pic optimizes image generation to better reflect authentic Arabic cultural elements. We evaluate our approach by comparing baseline CRI scores of images generated with initial prompts with the scores of those refined through Ara-Pic, demonstrating its effectiveness in increasing CRI scores. Our findings highlight the potential of iterative prompting as well as the importance of quantified cultural evaluation for scalable and automated solutions for improving cultural representation in AI-generated content.

Original languageEnglish
Title of host publicationIEEE International Conference on Multimedia and Expo Workshops
Subtitle of host publicationJourney to the Center of Machine Imagination, ICMEW 2025 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331587437
DOIs
Publication statusPublished - 2025
Event2025 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2025 - Nantes, France
Duration: 30 Jun 20254 Jul 2025

Publication series

NameIEEE International Conference on Multimedia and Expo Workshops: Journey to the Center of Machine Imagination, ICMEW 2025 - Proceedings

Conference

Conference2025 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2025
Country/TerritoryFrance
CityNantes
Period30/06/254/07/25

Keywords

  • content evaluation
  • culture representation
  • text-to-image
  • vision language models

Fingerprint

Dive into the research topics of 'Ara-Pic: A Framework for Enhancing Arabic Cultural Representation in AI-Generated Images'. Together they form a unique fingerprint.

Cite this