Abstract
We present a novel real-time framework for the detection, instance segmentation, and tracking of soccer players in video footage. Our method, called ReST, is designed to overcome challenges posed by complex player interactions and occlusions. This is achieved by enhancing video frames by incorporating motion vectors obtained using the Scharr filter and frame differencing. This provides additional shape cues over RGB frames that are not considered in traditional approaches. We use the Generalized Efficient Layer Aggregation Network (GELAN), combining the best qualities of CSPDarknet53 and ELAN as a robust backbone for instance segmentation and tracking. We evaluate our method rigorously on both publicly available and our proprietary (SoccerPro) datasets to validate its performance across diverse soccer video contexts. We train our model concurrently on multiple datasets, thereby improving generalization and reducing dataset bias. Our results demonstrate an impressive 97% accuracy on the DFL Bundesliga Data Shootout, 98% on SoccerNet-Tracking, and 99% on the SoccerPro dataset. These findings underscore the framework’s efficacy and practical relevance for advancing real-time soccer video analysis.
| Original language | English |
|---|---|
| Pages (from-to) | 138-149 |
| Number of pages | 12 |
| Journal | Proceedings of the International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
| Volume | 2 |
| DOIs | |
| Publication status | Published - Feb 2025 |
| Event | 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2025 - Porto, Portugal Duration: 26 Feb 2025 → 28 Feb 2025 |
Keywords
- Computer Vision
- Instance Segmentation
- Motion Vectors
- Soccer Player Tracking
- Sports Analytics