TY - GEN
T1 - MV-Soccer
T2 - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
AU - Majeed, Fahad
AU - Gilal, Nauman Ullah
AU - Al-Thelaya, Khaled
AU - Yang, Yin
AU - Agus, Marco
AU - Schneider, Jens
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024/6/18
Y1 - 2024/6/18
N2 - This work presents a novel real-time detection, instance segmentation, and tracking approach for soccer videos. Unlike conventional methods, we augment video frames by incorporating motion vectors, thus adding valuable shape cues that are not readily present in RGB frames. This facilitates improved foreground/background separation and enhances the ability to distinguish between players, especially in scenarios involving partial occlusion. The proposed framework leverages the Cross-Stage-Partial Network53 (CSPDarknet53) as a backbone, for instance segmentation and integrates motion vectors, coupled with frame differencing. The model is simultaneously trained on two publicly available datasets and a private dataset, SoccerPro, which we created. The reason for simultaneous training is to reduce biases and increase generalization ability. To validate the effectiveness of our approach, we conducted extensive experiments and attained 97% accuracy for the DFL - Bundesliga Data Shootout, 98% on the SoccerNet-Tracking dataset, and an impressive 99% on the SoccerPro (our) dataset.
AB - This work presents a novel real-time detection, instance segmentation, and tracking approach for soccer videos. Unlike conventional methods, we augment video frames by incorporating motion vectors, thus adding valuable shape cues that are not readily present in RGB frames. This facilitates improved foreground/background separation and enhances the ability to distinguish between players, especially in scenarios involving partial occlusion. The proposed framework leverages the Cross-Stage-Partial Network53 (CSPDarknet53) as a backbone, for instance segmentation and integrates motion vectors, coupled with frame differencing. The model is simultaneously trained on two publicly available datasets and a private dataset, SoccerPro, which we created. The reason for simultaneous training is to reduce biases and increase generalization ability. To validate the effectiveness of our approach, we conducted extensive experiments and attained 97% accuracy for the DFL - Bundesliga Data Shootout, 98% on the SoccerNet-Tracking dataset, and an impressive 99% on the SoccerPro (our) dataset.
KW - Artificial Intelligence
KW - Detection
KW - Instance Segmentation
KW - Motion Vectors
KW - Tracking
UR - https://www.scopus.com/pages/publications/85206455708
U2 - 10.1109/CVPRW63382.2024.00330
DO - 10.1109/CVPRW63382.2024.00330
M3 - Conference contribution
AN - SCOPUS:85206455708
T3 - Ieee Computer Society Conference On Computer Vision And Pattern Recognition Workshops
SP - 3245
EP - 3255
BT - 2024 Ieee/cvf Conference On Computer Vision And Pattern Recognition Workshops, Cvprw
PB - IEEE Computer Society
Y2 - 16 June 2024 through 22 June 2024
ER -