arXiv Analytics

Sign in

arXiv:1812.00312 [cs.CV]AbstractReferencesReviewsResources

ECO: Egocentric Cognitive Mapping

Jayant Sharma, Zixing Wang, Alberto Speranzon, Vijay Venkataraman, Hyun Soo Park

Published 2018-12-02Version 1

We present a new method to localize a camera within a previously unseen environment perceived from an egocentric point of view. Although this is, in general, an ill-posed problem, humans can effortlessly and efficiently determine their relative location and orientation and navigate into a previously unseen environments, e.g., finding a specific item in a new grocery store. To enable such a capability, we design a new egocentric representation, which we call ECO (Egocentric COgnitive map). ECO is biologically inspired, by the cognitive map that allows human navigation, and it encodes the surrounding visual semantics with respect to both distance and orientation. ECO possesses three main properties: (1) reconfigurability: complex semantics and geometry is captured via the synthesis of atomic visual representations (e.g., image patch); (2) robustness: the visual semantics are registered in a geometrically consistent way (e.g., aligning with respect to the gravity vector, frontalizing, and rescaling to canonical depth), thus enabling us to learn meaningful atomic representations; (3) adaptability: a domain adaptation framework is designed to generalize the learned representation without manual calibration. As a proof-of-concept, we use ECO to localize a camera within real-world scenes---various grocery stores---and demonstrate performance improvements when compared to existing semantic localization approaches.

Related articles: Most relevant | Search more
arXiv:1905.10622 [cs.CV] (Published 2019-05-25)
Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding
arXiv:2101.10253 [cs.CV] (Published 2021-01-25)
The emergence of visual semantics through communication games
arXiv:1708.05812 [cs.CV] (Published 2017-08-19)
Discovery of Visual Semantics by Unsupervised and Self-Supervised Representation Learning