Skip to main navigation Skip to search Skip to main content

Innovative temporal summarization for complex video classification

  • Hamad bin Khalifa University
  • Kharazmi University
  • KTH Royal Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Video classification is an important domain within computer vision. It categorizes video content into meaningful classes such as actions or emotional states. In relation to image classification, it has to deal with the problem of spatiotemporal dimensions as well as a large data volume that is present in a video. In this work we introduce a novel distance metric based video summarization technique which minimizes the size of the dataset while maintaining key temporal information. We performed our experiments using distance metrics such as norm of rows distance other than euclidean distance, norm of columns distance and eigenvalue based distance metrics. Our results show that the norm of rows distance performed well and provides a suitable balance between efficiency and accuracy. Our proposed method achieved significant accuracy of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$81.23\%$$\end{document}, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$92.42\%$$\end{document}, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$98.89\%$$\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$90.27\%$$\end{document} on MMAC, UCF101, UCF11 and HMDB51 benchmark datasets. Our proposed technique continuously tracked temporal information while recalculating the distance from each key frame. Due to less computational demands, our approach performs effectively in real-world application scenarios.
Original languageEnglish
Article number7970
Number of pages16
JournalScientific Reports
Volume16
Issue number1
DOIs
Publication statusPublished - 9 Feb 2026

Fingerprint

Dive into the research topics of 'Innovative temporal summarization for complex video classification'. Together they form a unique fingerprint.

Cite this