CHARTQAPRO: A More Diverse and Challenging Benchmark for Chart Question Answering

  • Ahmed Masry*
  • , Mohammed Saidul Islam
  • , Mahir Ahmed
  • , Aayush Bajaj
  • , Firoz Kabir
  • , Aaryaman Kartha
  • , Md Tahmid Rahman Laskar
  • , Mizanur Rahman
  • , Shadikur Rahman
  • , Mehrad Shahmohammadi
  • , Megh Thakkar
  • , Md Rizwan Parvez
  • , Enamul Hoque
  • , Shafiq Joty
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Charts are ubiquitous, as people often use them to analyze data, answer questions, and discover critical insights. However, performing complex analytical tasks with charts requires significant perceptual and cognitive effort. Chart Question Answering (CQA) systems automate this process by enabling models to interpret and reason with visual representations of data. However, existing benchmarks like ChartQA lack real-world diversity and have recently shown performance saturation with modern large vision-language models (LVLMs). To address these limitations, we introduce CHARTQAPRO, a new benchmark that includes 1,341 charts from 99 diverse sources, spanning various chart types-including info-graphics and dashboards-and featuring 1,948 questions in various types, such as multiple-choice, conversational, hypothetical, and unanswerable questions, to better reflect real-world challenges. Our evaluations with 21 models show a substantial performance drop for LVLMs on CHARTQAPRO; e.g., Claude Sonnet 3.5 scores 90.5% on ChartQA but only 55.81% on CHARTQAPRO, underscoring the complexity of chart reasoning. We complement our findings with detailed error analyses and ablation studies, identifying key challenges and opportunities for advancing LVLMs in chart understanding and reasoning. We release CHARTQAPRO at https://github.com/visnlp/ChartQAPro.

Original languageEnglish
Title of host publicationFindings of the Association for Computational Linguistics
Subtitle of host publicationACL 2025
EditorsWanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
PublisherAssociation for Computational Linguistics (ACL)
Pages19123-19151
Number of pages29
ISBN (Electronic)9798891762565
DOIs
Publication statusPublished - 2025
Event63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 - Vienna, Austria
Duration: 27 Jul 20251 Aug 2025

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

Conference63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025
Country/TerritoryAustria
CityVienna
Period27/07/251/08/25

Fingerprint

Dive into the research topics of 'CHARTQAPRO: A More Diverse and Challenging Benchmark for Chart Question Answering'. Together they form a unique fingerprint.

Cite this