Approximate the unlabeled data manifold with many small linear pieces, use those local neighborhoods to estimate which points are similar, and train an embedding for zero-shot image retrieval, without any labels.
Unsupervised deep metric learning (UDML) focuses on learning a semantic representation space using only unlabeled data. This challenging problem requires accurately estimating the similarity between data points, which is used to supervise a deep network. For this purpose, we propose to model the high-dimensional data manifold using a piecewise-linear approximation, with each low-dimensional linear piece approximating the data manifold in a small neighborhood of a point. These neighborhoods are used to estimate similarity between data points. We empirically show that this similarity estimate correlates better with the ground truth than the similarity estimates of current state-of-the-art techniques. We also show that proxies, commonly used in supervised metric learning, can be used to model the piecewise-linear manifold in an unsupervised setting, helping improve performance. Our method outperforms existing unsupervised metric learning approaches on standard zero-shot image retrieval benchmarks.
Approximate the unlabeled data manifold with local linear pieces, use those neighborhoods to estimate similarity and supervise an embedding network, and model the pieces with proxies, giving state-of-the-art unsupervised deep metric learning.
We propose a framework for unsupervised metric learning having 3 steps:
The figure below provides an overview of our method, from neighborhood sampling and the piecewise-linear approximation of the manifold to the proxy- and point-based similarities used to train the embedding.
Our method outperforms current unsupervised metric learning approaches on standard zero-shot image retrieval benchmarks (CUB-200-2011, Cars-196 and SOP). Please refer to the paper for more detailed results and ablations.
If you find our work useful, please consider citing:
@InProceedings{pmlr-v234-bhatnagar24a,
title = {Piecewise-Linear Manifolds for Deep Metric Learning},
author = {Bhatnagar, Shubhang and Ahuja, Narendra},
booktitle = {Conference on Parsimony and Learning},
pages = {269--281},
year = {2024},
editor = {Chi, Yuejie and Dziugaite, Gintare Karolina and Qu, Qing
and Wang, Atlas Wang and Zhu, Zhihui},
volume = {234},
series = {Proceedings of Machine Learning Research},
month = {03--06 Jan},
publisher = {PMLR}
}