Projects
Perceptual Quality Assessment of NeRF and Neural View Synthesis Methods for Front-Facing Views
Abstract
Neural view synthesis (NVS) is one of the most successful techniques for synthesizing free viewpoint videos, capable of achieving high fidelity from only a sparse set of captured images. This success has led to many variants of the techniques, each evaluated on a set of test views typically using image quality metrics such as PSNR, SSIM, or LPIPS. There has been a lack of research on how NVS methods perform with respect to perceived video quality. We present the first study on perceptual evaluation of NVS and NeRF variants. For this study, we collected two datasets of scenes captured in a controlled lab environment as well as in-the-wild. In contrast to existing datasets, these scenes come with reference video sequences, allowing us to test for temporal artifacts and subtle distortions that are easily overlooked when viewing only static images. We measured the quality of videos synthesized by several NVS methods in a wellcontrolled perceptual quality assessment experiment as well as with many existing state-of-the-art image/video quality metrics. We present a detailed analysis of the results and recommendations for dataset and metric selection for NVS evaluation.
Evaluated NVS Methods
We tested ten representative NVS methods (including two generalizable NeRF variants) that encompass a diverse range of models, which feature both explicit and implicit geometric representations, distinct rendering modelings, as well as generalizable and per-scene optimization strategies.
Subjective Evaluation
To attain precise subjective quality scores of the videos synthesized by the aforementioned NVS methods, we conducted a controlled quality assessment experiment with human participants.We employed pairwise comparison experiments for subjective evaluation. We scaled the results of the pairwise comparison and expressed the subjective evaluation score in Just-Objectionable-Difference (JOD) units using the Thurstone Case V observer model.
Assessing Quality Metrics for Neural View Synthesis
Our collected datasets with video references, together with perceptual quality results of reconstructed videos, allow us to test how well the existing image/video quality metrics can measure the perceived quality. To test the reliability of popular metrics, we followed the standard protocol used to evaluate quality metrics, and computed rank-order (Spearman) correlations between metric predictions and perceptual JOD values. For each dataset and each NVS method, we averaged subjective scores and quality metric predictions across all scenes and then computed a single correlation per dataset per metric. This serves two purposes: (a) it mitigates the effects of measurement noise and improves the predictions of quality metrics.
Materials
Acknowledgement
This project has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement N◦ 765911 (RealVision) and from the Royal Society, award IES\R2\202141. This project is also supported by a UKRI Future Leaders Fellowship [grant number G104084]. We greatly thank Prof. Damiano Marchi and Dr. Simone Farina for granting access to collect NeRFs inside the Natural History Museum of the University of Pisa.