AR-NERF：无监督的深度和散焦效应的自然图像带有神经辐射场的效果

论文标题

AR-NERF：无监督的深度和散焦效应的自然图像带有神经辐射场的效果

AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields

论文作者

Kaneko, Takuhiro

论文摘要

完全无监督的3D表示学习因其在数据收集中的优势而引起了人们的关注。一种成功的方法涉及一种观点感知方法，该方法基于生成模型（例如生成对抗网络（GAN））学习图像分布，同时基于3D感知模型（例如，神经辐射场（NERFS））生成各种视图图像。但是，他们需要具有各种视图的图像进行培训，因此，它们在很少或有限的观点的数据集中的应用仍然是一个挑战。作为一种补充方法，提出了使用散焦提示的孔渲染gan（AR-GAN）。但是，AR-GAN是一个基于CNN的模型，尽管相关性很高，但它独立于观点变化代表散焦，这是其性能的原因之一。作为AR-GAN的替代方案，我们提出了一个光圈渲染的NERF（AR-NERF），该启示可以通过在常见的射线追踪框架中代表这两个因素来以统一的方式利用观点和散焦提示。此外，要以散布的方式学习散热性和独立的表示，我们提出了孔径随机训练，为此我们学会生成图像，同时独立地将光圈大小和潜在代码随机化。在实验过程中，我们将AR-NERF应用于各种自然图像数据集，包括花，鸟和面部图像，其结果证明了AR-NERF对深度和散焦效应的无监督学习的实用性。

Fully unsupervised 3D representation learning has gained attention owing to its advantages in data collection. A successful approach involves a viewpoint-aware approach that learns an image distribution based on generative models (e.g., generative adversarial networks (GANs)) while generating various view images based on 3D-aware models (e.g., neural radiance fields (NeRFs)). However, they require images with various views for training, and consequently, their application to datasets with few or limited viewpoints remains a challenge. As a complementary approach, an aperture rendering GAN (AR-GAN) that employs a defocus cue was proposed. However, an AR-GAN is a CNN-based model and represents a defocus independently from a viewpoint change despite its high correlation, which is one of the reasons for its performance. As an alternative to an AR-GAN, we propose an aperture rendering NeRF (AR-NeRF), which can utilize viewpoint and defocus cues in a unified manner by representing both factors in a common ray-tracing framework. Moreover, to learn defocus-aware and defocus-independent representations in a disentangled manner, we propose aperture randomized training, for which we learn to generate images while randomizing the aperture size and latent codes independently. During our experiments, we applied AR-NeRF to various natural image datasets, including flower, bird, and face images, the results of which demonstrate the utility of AR-NeRF for unsupervised learning of the depth and defocus effects.

下载PDF全文

下载文献需遵守相关版权规定

论文标题