arXiv:2209.12068 [cs.CV]

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields

Jiankai Sun, Yan Xu, Mingyu Ding, Hongwei Yi, Jingdong Wang, Liangjun Zhang, Mac Schwager

Published 2022-09-24, Version 1

Neural Radiance Fields (NeRFs) have been successfully used for scene representation, and recent works have built robotic navigation and manipulation systems on NeRF-based environment representations. Because object localization is the foundation for many robotic applications, we study object localization within a NeRF scene to further unlock the potential of NeRFs in robotic systems. We propose NeRF-Loc, a transformer-based framework that extracts 3D bounding boxes of objects in NeRF scenes. NeRF-Loc takes a pre-trained NeRF model and a camera view as input and produces labeled 3D bounding boxes of objects as output. Concretely, we design a pair of parallel transformer encoder branches, the coarse stream and the fine stream, to encode both the context and the details of target objects. The encoded features are then fused with attention layers to alleviate ambiguities for accurate object localization. We compare our method with the conventional transformer-based method, and ours achieves better performance. In addition, we present NeRFLocBench, the first benchmark for object localization based on NeRF samples.
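
The fusion step described in the abstract can be illustrated with a toy example. The sketch below is not the authors' implementation: the abstract does not specify how the attention layers are wired, so it assumes, purely for illustration, that fine-stream tokens act as queries over coarse-stream tokens in a single scaled dot-product attention with a residual connection and no learned weights. The names `cross_attention`, `fuse_streams`, `coarse`, and `fine` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention: each query attends over all keys."""
    dim = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(dim).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in keys
        ]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        outputs.append(
            [
                sum(w * v[j] for w, v in zip(weights, values))
                for j in range(len(values[0]))
            ]
        )
    return outputs


def fuse_streams(coarse_tokens, fine_tokens):
    """Assumed fusion: fine tokens query coarse context, then a residual add."""
    context = cross_attention(fine_tokens, coarse_tokens, coarse_tokens)
    return [
        [f + c for f, c in zip(fine_tok, ctx_tok)]
        for fine_tok, ctx_tok in zip(fine_tokens, context)
    ]


# Toy features standing in for encoder outputs (hypothetical values).
coarse = [[1.0, 0.0], [0.0, 1.0]]
fine = [[2.0, 0.0], [0.0, 2.0], [1.0, 1.0]]
fused = fuse_streams(coarse, fine)
```

Each fused token keeps its fine-stream detail and adds a context vector weighted toward the coarse tokens it most resembles. A real model would add learned projections, multiple heads, and a detection head that regresses the 3D boxes, none of which is shown here.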

Related articles:
arXiv:2311.01815 [cs.CV] (Published 2023-11-03)
Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields
arXiv:2309.11966 [cs.CV] (Published 2023-09-21)
NeuralLabeling: A versatile toolset for labeling vision datasets using Neural Radiance Fields
arXiv:2208.11300 [cs.CV] (Published 2022-08-24)
E-NeRF: Neural Radiance Fields from a Moving Event Camera