arXiv Analytics

Sign in

arXiv:2206.07259 [cs.CV]AbstractReferencesReviewsResources

Self-Supervised Learning of Image Scale and Orientation

Jongmin Lee, Yoonwoo Jeong, Minsu Cho

Published 2022-06-15Version 1

We study the problem of learning to assign a characteristic pose, i.e., scale and orientation, for an image region of interest. Despite its apparent simplicity, the problem is non-trivial; it is hard to obtain a large-scale set of image regions with explicit pose annotations that a model directly learns from. To tackle the issue, we propose a self-supervised learning framework with a histogram alignment technique. It generates pairs of image patches by random rescaling/rotating and then train an estimator to predict their scale/orientation values so that their relative difference is consistent with the rescaling/rotating used. The estimator learns to predict a non-parametric histogram distribution of scale/orientation without any supervision. Experiments show that it significantly outperforms previous methods in scale/orientation estimation and also improves image matching and 6 DoF camera pose estimation by incorporating our patch poses into a matching process.

Comments: Presented in BMVC 2021, code is available on https://github.com/bluedream1121/self-sca-ori
Categories: cs.CV
Related articles: Most relevant | Search more
arXiv:1811.05304 [cs.CV] (Published 2018-11-13)
Self-Supervised Learning of Depth and Camera Motion from 360° Videos
Fu-En Wang et al.
arXiv:1912.01991 [cs.CV] (Published 2019-12-04)
Self-Supervised Learning of Pretext-Invariant Representations
arXiv:2104.04015 [cs.CV] (Published 2021-04-08)
CutPaste: Self-Supervised Learning for Anomaly Detection and Localization