Multi-scale Dynamic Network for Document Shadow Removal

  • Jiarui Li*
  • , Jiaqi Ma
  • , Zeyu Xiao
  • , Ziyi Zhuang
  • , Zhihe Lu*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Document clarity is pivotal for reliable information transmission, yet shadows remain a pervasive degradation that harms visual quality and downstream processing efficiency. Existing approaches often falter under extreme illumination, struggle with multi-scale shadow patterns, and exhibit limited generalization across document types and capture conditions. We present an end-to-end multi-modal, multi-scale framework that integrates advanced architectures with dynamic feature fusion to adaptively suppress shadows of varying sizes while preserving fine textual details. Concretely, we introduce a Multi-scale Dynamic Network (MDN) that performs scale-aware gating and cross-branch aggregation, enabling the model to emphasize informative cues and attenuate shadow bias at each resolution level. The pipeline is streamlined to reduce manual pre-/post-processing, thereby improving both efficiency and effectiveness in practical document workflows. Experimental results show consistent gains over traditional and deep learning baselines on the Jung and Kligler datasets, confirming superior accuracy and robustness under challenging lighting.

Original languageEnglish
Title of host publicationWorkshop Proceedings of the 7th ACM International Conference on Multimedia in Asia, MMAsia 2025 Workshops
EditorsTat-Seng Chua, Lai-Kuan Wong, Chee Seng Chan, Jinhui Tang, Chong-Wah Ngo, Klaus Schoeffmann, Jiaying Liu, Yo-Sung Ho
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9798400722479
DOIs
Publication statusPublished - 8 Dec 2025
Event7th ACM International Conference on Multimedia in Asia, MMAsia 2025 Workshops - Kuala Lumpur, Malaysia
Duration: 9 Dec 202512 Dec 2025

Publication series

NameWorkshop Proceedings of the 7th ACM International Conference on Multimedia in Asia, MMAsia 2025 Workshops

Conference

Conference7th ACM International Conference on Multimedia in Asia, MMAsia 2025 Workshops
Country/TerritoryMalaysia
CityKuala Lumpur
Period9/12/2512/12/25

Keywords

  • Deep Learning
  • Document Shadow Removal
  • Image Processing
  • Image Shadow Removal

Fingerprint

Dive into the research topics of 'Multi-scale Dynamic Network for Document Shadow Removal'. Together they form a unique fingerprint.

Cite this