Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.16876 (cs)

[Submitted on 31 Aug 2023 (v1), last revised 12 Dec 2023 (this version, v2)]

Title: SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation

Abstract: Human-centric video frame interpolation has great potential to improve people's entertainment experiences and to find commercial applications in the sports analysis industry, e.g., synthesizing slow-motion videos. Although there are multiple benchmark datasets available in the community, none of them is dedicated to human-centric scenarios. To bridge this gap, we introduce SportsSloMo, a benchmark consisting of more than 130K video clips and 1M video frames of high-resolution ($\geq$720p) slow-motion sports videos crawled from YouTube. We re-train several state-of-the-art methods on our benchmark, and the results show a decrease in their accuracy compared to other datasets. This highlights the difficulty of our benchmark and suggests that it poses significant challenges even for the best-performing methods, as human bodies are highly deformable and occlusions are frequent in sports videos. To improve accuracy, we introduce two loss terms that incorporate human-aware priors, adding auxiliary supervision on panoptic segmentation and human keypoint detection, respectively. The loss terms are model-agnostic and can be easily plugged into any video frame interpolation approach. Experimental results validate the effectiveness of our proposed loss terms, which yield consistent performance improvements over 5 existing models and establish strong baseline models on our benchmark. The dataset and code can be found at: this https URL.
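
The abstract describes the two human-aware loss terms only at a high level, so the following is a minimal sketch of how such model-agnostic auxiliary terms could be added to a reconstruction loss, not the authors' formulation. It assumes the auxiliary signals come from applying a fixed segmenter and a fixed keypoint detector to both the interpolated frame and the ground-truth frame. The names `human_aware_loss`, `segment`, `keypoints`, `w_seg`, and `w_kpt`, the L1 distance, and the default weights are all hypothetical.

```python
from typing import Callable, Sequence

# A flattened frame of pixel intensities; a stand-in for an image tensor.
Frame = Sequence[float]
# Hypothetical feature extractor (e.g. a frozen segmenter or keypoint detector).
Extractor = Callable[[Frame], Sequence[float]]


def l1(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute difference between two equally sized sequences."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def human_aware_loss(
    pred: Frame,
    target: Frame,
    segment: Extractor,
    keypoints: Extractor,
    w_seg: float = 0.1,  # assumed weight, not taken from the paper
    w_kpt: float = 0.1,  # assumed weight, not taken from the paper
) -> float:
    """Reconstruction loss plus two weighted auxiliary terms.

    The auxiliary terms compare what the (assumed) segmenter and keypoint
    detector produce on the interpolated frame versus the ground-truth frame.
    The interpolation model itself is never referenced, which is what makes
    the terms pluggable into any model that outputs a frame.
    """
    recon = l1(pred, target)
    seg = l1(segment(pred), segment(target))
    kpt = l1(keypoints(pred), keypoints(target))
    return recon + w_seg * seg + w_kpt * kpt


if __name__ == "__main__":
    # Toy extractors for illustration only: a threshold "mask" and a single
    # "keypoint" given by the sum of intensities.
    toy_segment = lambda f: [1.0 if v > 0.5 else 0.0 for v in f]
    toy_keypoints = lambda f: [sum(f)]
    print(human_aware_loss([0.0, 1.0], [1.0, 1.0], toy_segment, toy_keypoints))
```

Because the function only consumes the predicted and ground-truth frames, the same terms could in principle be attached to any interpolation network's training objective; in a real setup the extractors would be pretrained networks and the distances would be chosen per task.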

Comments:	Project Page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.16876 [cs.CV]
	(or arXiv:2308.16876v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.16876

Submission history

From: Jiaben Chen
[v1] Thu, 31 Aug 2023 17:23:50 UTC (7,146 KB)
[v2] Tue, 12 Dec 2023 18:59:06 UTC (7,375 KB)
