DATANARRATIVE: Automated Data-Driven Storytelling with Visualizations and Texts

Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

4 Citations (Scopus)

Abstract

Data-driven storytelling is a powerful method for conveying insights by combining narrative techniques with visualizations and text. These stories integrate visual aids, such as highlighted bars and lines in charts, along with textual annotations explaining insights. However, creating such stories requires a deep understanding of the data and meticulous narrative planning, often necessitating human intervention, which can be time-consuming and mentally taxing. While Large Language Models (LLMs) excel in various NLP tasks, their ability to generate coherent and comprehensive data stories remains underexplored. In this work, we introduce a novel task for data story generation and a benchmark containing 1,449 stories from diverse sources. To address the challenges of crafting coherent data stories, we propose a multi-agent framework employing two LLM agents designed to replicate the human storytelling process: one for understanding and describing the data (Reflection), generating the outline, and narration, and another for verification at each intermediary step. While our agentic framework generally outperforms non-agentic counterparts in both model-based and human evaluations, the results also reveal unique challenges in data story generation.
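
The two-agent loop described in the abstract can be illustrated with a minimal sketch. It assumes only what the abstract states: a generator agent that produces three staged outputs (reflection, outline, narration) and a second agent that verifies each intermediate step. The function names, the `max_rounds` retry limit, the way feedback is passed back, and the stub agents are all hypothetical and stand in for LLM calls; they are not taken from the paper.

```python
from typing import Callable, Dict, Tuple

# Stage names follow the abstract; everything else here is an assumption.
STAGES = ["reflection", "outline", "narration"]

# (stage, context, feedback) -> draft text
Generator = Callable[[str, str, str], str]
# (stage, draft) -> (accepted?, feedback for the generator)
Verifier = Callable[[str, str], Tuple[bool, str]]


def run_pipeline(data: str, generate: Generator, verify: Verifier,
                 max_rounds: int = 3) -> Dict[str, str]:
    """Run each stage in order, verifying every intermediate draft."""
    outputs: Dict[str, str] = {}
    context = data
    for stage in STAGES:
        feedback = ""
        draft = ""
        # Hypothetical retry policy: regenerate until accepted or out of rounds.
        for _ in range(max_rounds):
            draft = generate(stage, context, feedback)
            accepted, feedback = verify(stage, draft)
            if accepted:
                break
        outputs[stage] = draft
        # Simplification: the next stage sees only the previous stage's output.
        context = draft
    return outputs


# Stub agents standing in for LLM calls (purely illustrative).
def toy_generate(stage: str, context: str, feedback: str) -> str:
    suffix = " (revised)" if feedback else ""
    return f"{stage} of [{context}]{suffix}"


def toy_verify(stage: str, draft: str) -> Tuple[bool, str]:
    # Reject the first outline draft to exercise the revision path.
    if stage == "outline" and "(revised)" not in draft:
        return False, "outline needs revision"
    return True, ""


story = run_pipeline("sales table", toy_generate, toy_verify)
for stage in STAGES:
    print(stage, "->", story[stage])
```

In this toy run the verifier rejects the first outline draft, so the generator is called a second time for that stage before narration begins. A real system would replace the stubs with model calls and would likely pass the original data to every stage.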

Original language: English
Title of host publication: EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
Editors: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Publisher: Association for Computational Linguistics (ACL)
Pages: 19253-19286
Number of pages: 34
ISBN (Electronic): 9798891761643
Publication status: Published - Nov 2024
Event: 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024 - Hybrid, Miami, United States
Duration: 12 Nov 2024 to 16 Nov 2024

Publication series

Name: EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference

Conference

Conference: 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024
Country/Territory: United States
City: Hybrid, Miami
Period: 12/11/24 to 16/11/24
