Abstract
Video classification is an important task in computer vision: it assigns video content to meaningful classes such as actions or emotional states. Compared with image classification, it must handle the spatiotemporal dimension as well as the large volume of data in a video. In this work, we introduce a novel video summarization technique based on distance metrics that reduces the size of the dataset while preserving key temporal information. In addition to Euclidean distance, we experimented with the norm of rows distance, the norm of columns distance, and eigenvalue-based distance metrics. Our results show that the norm of rows distance performed well and provides a suitable balance between efficiency and accuracy. Our proposed method achieved accuracies of 81.23%, 92.42%, 98.89%, and 90.27% on the MMAC, UCF101, UCF11, and HMDB51 benchmark datasets, respectively.
Our proposed technique continuously tracks temporal information while recalculating the distance from each key frame. Because of its low computational demands, our approach performs effectively in real-world application scenarios.
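The abstract does not define the metrics or the selection rule, so the following is only a minimal sketch of one way distance-based key-frame selection could work, not the authors' implementation. It assumes that the "norm of rows distance" is the mean Euclidean norm of the rows of the difference between two frames, and that a frame becomes a key frame when its distance from the most recent key frame exceeds a threshold. The names `row_norm_distance`, `select_key_frames`, and `threshold` are hypothetical, and frames are plain nested lists to keep the example self-contained.

```python
import math


def row_norm_distance(a, b):
    """Assumed definition: mean Euclidean norm of the rows of (a - b)."""
    row_norms = [
        math.sqrt(sum((x - y) ** 2 for x, y in zip(row_a, row_b)))
        for row_a, row_b in zip(a, b)
    ]
    return sum(row_norms) / len(row_norms)


def select_key_frames(frames, threshold, distance=row_norm_distance):
    """Return the indices of the frames kept as key frames.

    Assumed rule: a frame is kept when its distance from the most
    recent key frame exceeds `threshold`.
    """
    if not frames:
        return []
    key_indices = [0]  # the first frame always starts the summary
    reference = frames[0]
    for i in range(1, len(frames)):
        if distance(frames[i], reference) > threshold:
            key_indices.append(i)
            # Later distances are measured from the new key frame.
            reference = frames[i]
    return key_indices


# Toy 2x2 grayscale "frames" (hypothetical data, not from the paper).
frames = [
    [[0, 0], [0, 0]],
    [[0, 1], [0, 0]],
    [[3, 4], [3, 4]],
    [[3, 4], [3, 5]],
    [[9, 9], [9, 9]],
]
key_frames = select_key_frames(frames, threshold=2.0)
print(key_frames)
```

Passing a different function as `distance` (for example, one based on column norms) would let the same loop compare the other metrics named in the abstract, under the same assumptions.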
| Original language | English |
|---|---|
| Article number | 7970 |
| Number of pages | 16 |
| Journal | Scientific Reports |
| Volume | 16 |
| Issue number | 1 |
| Publication status | Published - 9 Feb 2026 |
Fingerprint
Dive into the research topics of 'Innovative temporal summarization for complex video classification'. Together they form a unique fingerprint.