Skip to main navigation Skip to search Skip to main content

ArtInsight: A Multimodal AI Framework for Interpreting Children's Drawings and Enhancing Emotional Understanding

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recent advancements in multimodal image-to-text models have greatly enhanced the interpretation of children's drawings for emotional understanding purposes. This paper introduces a framework that analyzes these drawings to fully automatically generate detailed reports, covering art descriptions, emotional themes, assessments, and personalized recommendations. Our approach involved annotating 5,000 images by exploiting a Large Language Model (ChatGPT) and by fine-tuning the BLIP (Bootstrapping Language-Image Pre-training) multimodal model. We performed fine-tuning in two steps: 1) we applied Low-Rank Adaptation (LoRA) to the image encoder to preserve its pre-trained features while adapting it to our task, and 2) we refined the text decoder to capture the language patterns needed for comprehensive assessments. The system processes children's artwork as input, using multimodal image-to-text techniques to derive meaningful insights. Although these reports are initial evaluations rather than formal clinical assessments, they provide a valuable starting point for understanding children's emotional and psychological states. This tool can assist art therapists, educators, and parents in gaining a deeper understanding of children's inner worlds. Our research highlights the intersection of artificial intelligence and child psychology, showing how technology can complement human expertise in nurturing children's emotional well-being. By offering a structured, AI-driven analysis of children's drawings, this framework creates new opportunities for early intervention, personalized support, and enhanced communication between children and their caregivers. The impact of this work may extend beyond individual assessments, potentially informing broader strategies in child development, art therapy, and educational practices.

Original languageEnglish
Title of host publicationIntelligent Health Systems - From Technology to Data and Knowledge, Proceedings of MIE 2025
EditorsElisavet Andrikopoulou, Parisis Gallos, Theodoros N. Arvanitis, Rosalynn Austin, Arriel Benis, Ronald Cornet, Panagiotis Chatzistergos, Alexander Dejaco, Linda Dusseljee-Peute, Alaa Mohasseb, Pantelis Natsiavas, Haythem Nakkas, Philip Scott
PublisherIOS Press BV
Pages808-812
Number of pages5
ISBN (Electronic)9781643685960
DOIs
Publication statusPublished - 15 May 2025
Event35th Medical Informatics Europe Conference, MIE 2025 - Glasgow, United Kingdom
Duration: 19 May 202521 May 2025

Publication series

NameStudies in Health Technology and Informatics
Volume327
ISSN (Print)0926-9630
ISSN (Electronic)1879-8365

Conference

Conference35th Medical Informatics Europe Conference, MIE 2025
Country/TerritoryUnited Kingdom
CityGlasgow
Period19/05/2521/05/25

Keywords

  • Art Therapy
  • Children's Drawings
  • Emotional Assessment
  • Image-to-Text Models

Fingerprint

Dive into the research topics of 'ArtInsight: A Multimodal AI Framework for Interpreting Children's Drawings and Enhancing Emotional Understanding'. Together they form a unique fingerprint.

Cite this