Real-Time Light Field Reconstruction and Neural Rendering
Abstract
Building on the foundations of spherical light field reconstruction and rendering, this work introduces a series of progressively more capable and efficient view synthesis networks. These methods evolve from full light field reconstruction to a direct, position-aware approach capable of rendering novel views within a large volume from a single input image at interactive frame rates. To enhance these synthesized views for augmented reality (AR) applications, a unified multimodal network is developed that jointly predicts a new view and its corresponding depth map from a shared latent space. This multimodal approach ensures multi-view and geometric consistency between appearance and geometry, which is crucial for preventing visual artifacts like flickering.
PVSNet: Real-Time Position-Aware View Synthesis from Single-View Input
We introduce a lightweight, position-aware network designed for real-time view synthesis from a single input image and a target camera pose. The proposed framework consists of a Position Aware Embedding, which efficiently maps positional information from the target pose to generate high dimensional feature maps. These feature maps, along with the input image, are fed into a Rendering Network that merges features from dual encoder branches to resolve both high and low level details, producing a realistic new view of the scene.
LFSphereNet: Real Time Spherical Light Field Reconstruction from a Single Omnidirectional Image
We propose a fully learning-based method for spherical light field reconstruction from a single omnidirectional image. The proposed LFSphereNet utilizes two different networks: The first network learns to reconstruct the light field in cubemap projection (CMP) format given the six cube faces of an omnidirectional image and the corresponding cube face positions as input. The cubemap format implies a linear re-projection, which is more appropriate for a neural network. The second network refines the reconstructed cubemaps in equirectangular projection (ERP) format by removing cubemap border artifacts.
Comments