Hybridmatch: Semi-supervised facial landmark detection via hybrid heatmap representations

S Kang, M Lee, M Kim, H Shim - IEEE Access, 2023 - ieeexplore.ieee.org
IEEE Access, 2023ieeexplore.ieee.org
Facial landmark detection is an essential task in face-processing techniques. Traditional
methods, however, require expensive pixel-level labels. Semi-supervised facial landmark
detection has been explored as an alternative, but previous approaches only focus on
training-oriented issues (eg, noisy pseudo-labels in semi-supervised learning), neglecting
task-oriented issues (ie, the quantization error in landmark detection). We argue that semi-
supervised landmark detectors should resolve the two technical issues simultaneously …
Facial landmark detection is an essential task in face-processing techniques. Traditional methods, however, require expensive pixel-level labels. Semi-supervised facial landmark detection has been explored as an alternative, but previous approaches only focus on training-oriented issues (e.g., noisy pseudo-labels in semi-supervised learning), neglecting task-oriented issues (i.e., the quantization error in landmark detection). We argue that semi-supervised landmark detectors should resolve the two technical issues simultaneously. Through a simple experiment, we found that task- and training-oriented solutions may negatively influence each other, thus eliminating their negative interactions is important. To this end, we devise a new heatmap regression framework via hybrid representation, namely HybridMatch. We utilize both 1-D and 2-D heatmap representations. Here, the 1-D and 2-D heatmaps help alleviate the task-oriented and training-oriented issues, respectively. To exploit the advantages of our hybrid representation, we introduce curriculum learning; relying more on the 2-D heatmap at the early training stage and gradually increasing the effects of the 1-D heatmap. By resolving the two issues simultaneously, we can capture more precise landmark points than existing methods with only a few annotated data. Extensive experiments show that HybridMatch achieves state-of-the-art performance on three benchmark datasets, especially showing 26.3% NME improvement over the existing method in the 300-W full set at 5% data ratio. Surprisingly, our method records a comparable performance, 5.04 (challenging set in the 300-W) to the fully-supervised facial landmark detector 5.03. The remarkable performance of HybridMatch shows its potential as a practical alternative to the fully-supervised model.
ieeexplore.ieee.org
Showing the best result for this search. See all results