Skip to main navigation Skip to search Skip to main content

MEENA (PersianMMMU): Multimodal-Multilingual Educational Exams for N-level Assessment

  • Omid Ghahroodi
  • , Arshia Hemmat
  • , Marzia Nouri
  • , Seyed Mohammad Hadi Hosseini
  • , Doratossadat Dastgheib
  • , Mohammad Vali Sanian*
  • , Alireza Sahebi*
  • , Reihaneh Zohrabi*
  • , Mohammad Hossein Rohban
  • , Ehsaneddin Asgari
  • , Mahdieh Soleymani Baghshah
  • *Corresponding author for this work
  • Hamad bin Khalifa University
  • University of Oxford
  • Independent Researcher
  • Sharif University of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recent advancements in large vision-language models (VLMs) have primarily focused on English, with limited attention given to other languages. To address this gap, we introduce MEENA (also known as PersianMMMU), the first dataset designed to evaluate Persian VLMs across scientific, reasoning, and human-level understanding tasks. Our dataset comprises approximately 7,500 Persian and 3,000 English questions, covering a wide range of topics such as reasoning, mathematics, physics, diagrams, charts, and Persian art and literature. Key features of MEENA include: (1) diverse subject coverage spanning various educational levels, from primary to upper secondary school, (2) rich metadata, including difficulty levels and descriptive answers, (3) original Persian data that preserves cultural nuances, (4) a bilingual structure to assess cross-linguistic performance, and (5) a series of diverse experiments assessing various capabilities, including overall performance, the model’s ability to attend to images, and its tendency to generate hallucinations. We hope this benchmark contributes to enhancing VLM capabilities beyond English.

Original languageEnglish
Title of host publication19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026
PublisherAssociation for Computational Linguistics (ACL)
Pages6457-6491
Number of pages35
ISBN (Electronic)9798891763869
DOIs
Publication statusPublished - 2026
Event19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026 - Rabat, Morocco
Duration: 24 Mar 202629 Mar 2026

Publication series

Name19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026

Conference

Conference19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026
Country/TerritoryMorocco
CityRabat
Period24/03/2629/03/26

Fingerprint

Dive into the research topics of 'MEENA (PersianMMMU): Multimodal-Multilingual Educational Exams for N-level Assessment'. Together they form a unique fingerprint.

Cite this