CLaP - Contrast, Label, Predict: a quest for cheaper labeling in 3D human pose estimation

D Cavicchini; A Pivotto; S Lorengo; Andrea Rosani; N Garau

doi:10.1109/WACVW65960.2025.00141

Back

CLaP - Contrast, Label, Predict: a quest for cheaper labeling in 3D human pose estimation

Conference proceeding

Peer reviewed

CLaP - Contrast, Label, Predict: a quest for cheaper labeling in 3D human pose estimation

D Cavicchini, A Pivotto, S Lorengo, Andrea Rosani and N Garau

2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops: Proceedings WACVW 2025, pp.1186-1194

Winter Conference on Applications of Computer Vision (WACV) Workshops (Tucson, 28/02/2025–04/03/2025)

2025

DOI: https://doi.org/10.1109/WACVW65960.2025.00141

Handle:

https://hdl.handle.net/10863/51520

Abstract

contrastive learning

Deep learning

human pose estimation

Human pose estimation (HPE) is a pivotal task in computer vision with applications spanning a wide range of domains, such as sports analytics, rehabilitation, performance capture and many more. However, obtaining labeled datasets for 3D pose estimation remains costly and resource intensive. To address this challenge, we propose a novel pipeline that uses contrastive learning to reduce labeling requirements while maintaining adequate performance. Our method employs unsupervised fine-tuning of pre-trained ResNet backbones on unannotated multiview data acquired in a skiing scenario. The learned repre-sentations are then utilized to strategically select a minimal, yet diverse subset of data for labeling, which is sub-sequently used for supervised training. We demonstrate the effectiveness of this approach using three contrastive paradigms, namely SimCLR, MoCo, and SimSiam, evaluating their impact on data efficiency and model performance on the SkiPose dataset. Our results indicate that contrastive learning can significantly reduce labeling costs while re-taining good pose estimation results, making it a promising solution for resource-constrained applications. Code is available at mmlab-cv.github.ioICLaP.

Files and links (1)

url

https://doi.org/10.1109/WACVW65960.2025.00141View

Details

Title: CLaP - Contrast, Label, Predict: a quest for cheaper labeling in 3D human pose estimation
Creators: D Cavicchini - University of Trento
A Pivotto - University of Trento
S Lorengo - University of Trento
Andrea Rosani - Free University of Bozen-Bolzano
N Garau - University of Trento
Publication Details: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops: Proceedings WACVW 2025, pp.1186-1194
ISBN: 9798331536633
EISBN: 9798331536626
Conference: Winter Conference on Applications of Computer Vision (WACV) Workshops (Tucson, 28/02/2025–04/03/2025)
Publisher: IEEE
Number of pages: 8
Identifiers: 979-8-3315-3663-3
(UNIBZ)93120951
991007292853901241
Scopus ID: 2-s2.0-105005026561
Academic Unit: Faculty of Engineering
Language: English
Resource Type: Conference proceeding
Author Names String: Cavicchini D, Pivotto A, Lorengo S, Rosani A, Garau N

Metrics

1 Record Views