Supplementary Material: CGIntrinsics: Better Intrinsic Image Decomposition through Physically-Based Rendering

Zhengqi Li[0000−0003−2929−8149] and Noah Snavely[0000−0002−6921−6833]

Department of Computer Science & Cornell Tech, Cornell University

In this supplementary material, we first provide additional details about the ordinal loss we use when training with our CGINTRINSICS dataset (Equation 2 in the main paper), as well as the detailed hyperparameter settings used in training. Next, we show additional visual comparisons between the images in our CGINTRINSICS dataset and the original SUNCG/PBRS dataset [1]. Finally, we provide additional qualitative prediction results and comparisons to Bell et al. [2] and Zhou et al. [3] on the IIW and SAW test sets.

1 Additional details for training losses

1.1 Ordinal term for CGINTRINSICS

Recall $L_{\text{CGI}}$ (Equation 2 in the main paper), the loss defined for our CGINTRINSICS training data:

$$L_{\text{CGI}} = L_{\text{sup}} + \lambda_{\text{ord}} L_{\text{ord}} + \lambda_{\text{rec}} L_{\text{reconstruct}} \tag{1}$$

We now provide the full formula for $L_{\text{ord}}$, the ordinal loss term. In particular, for a given CGINTRINSICS training image and predicted reflectance $R$, we accumulate losses for each pair of pixels $(i, j)$ generated from a set of pixels $P$, where one pixel is sampled at random from each oversegmented region in that image:

$$L_{\text{ord}}(R) = \sum_{\substack{(i,j) \in P \times P \\ i \neq j}} f_{i,j}(R), \tag{2}$$

where

$$f_{i,j}(R) = \begin{cases}
(\log R_i - \log R_j)^2, & -\tau_1 < P^*_{i,j} < \tau_1 \\
\left(\max(0,\; \tau_2 - \log R_i + \log R_j)\right)^2, & P^*_{i,j} > \tau_2 \\
\left(\max(0,\; \tau_2 - \log R_j + \log R_i)\right)^2, & P^*_{i,j} < -\tau_2 \\
0, & \text{otherwise}
\end{cases} \tag{3}$$

where $R^*$ is the rendered ground-truth reflectance, $P^*_{i,j} = \log R^*_i - \log R^*_j$, $\tau_1 = \log(1.05)$, and $\tau_2 = \log(1.5)$. The intuition is that we categorize pairs of ground-truth reflectances at pixels $(i, j)$ as having an "equal," "greater than," or "less than" relationship, and then add a penalty if the predicted reflectances at those pixels do not satisfy the same relationship.
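To make the piecewise definition above concrete, the following is a minimal PyTorch-style sketch of Equations 2-3. The tensor layout, the way the pixel pairs from $P \times P$ are supplied, and the function name are illustrative assumptions rather than the released implementation:

```python
import torch

# Thresholds quoted in the text: tau_1 = log(1.05), tau_2 = log(1.5).
TAU_1 = torch.log(torch.tensor(1.05))
TAU_2 = torch.log(torch.tensor(1.5))

def ordinal_loss(log_R_pred, log_R_gt, pairs):
    """Ordinal reflectance loss (Eqs. 2-3), sketched under assumed inputs.

    log_R_pred: (N,) log predicted reflectance at the sampled pixels.
    log_R_gt:   (N,) log rendered ground-truth reflectance at the same pixels.
    pairs:      (M, 2) long tensor of index pairs (i, j), i != j, drawn from P x P.
    """
    i, j = pairs[:, 0], pairs[:, 1]
    d_pred = log_R_pred[i] - log_R_pred[j]   # log R_i - log R_j
    d_gt = log_R_gt[i] - log_R_gt[j]         # P*_{i,j}

    equal = d_gt.abs() < TAU_1               # "equal" pairs
    greater = d_gt > TAU_2                   # R_i noticeably brighter than R_j
    less = d_gt < -TAU_2                     # R_j noticeably brighter than R_i

    loss = torch.zeros_like(d_pred)
    loss[equal] = d_pred[equal] ** 2
    loss[greater] = torch.clamp(TAU_2 - d_pred[greater], min=0) ** 2
    loss[less] = torch.clamp(TAU_2 + d_pred[less], min=0) ** 2
    # Pairs whose ground-truth ratio falls between tau_1 and tau_2 contribute zero.
    return loss.sum()
```

In practice the index pairs would be built from the per-image pixel set $P$ described above (one pixel sampled per oversegmented region) before calling this function.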


1.2 Additional hyperparameter settings

In all experiments described in the main paper, we set $\lambda_{\text{IIW}} = \lambda_{\text{SAW}} = 2$, $\lambda_{\text{ord}} = \lambda_{\text{rs}} = \lambda_{\text{S/NS}} = 1$, $\lambda_{\text{rec}} = 2$, and $\lambda_{\text{ss}} = 4$. The number of image scales is $L = 4$, and the margin in Equation 7 of the main paper is $m = 0.425$. For simplicity, the covariance matrix $\Sigma$ defined in $L_{\text{rsmooth}}$ (Equation 10 in the main paper) is a diagonal matrix, defined as:

$$\Sigma = \operatorname{diag}\left(\sigma_p^2,\ \sigma_p^2,\ \sigma_I^2,\ \sigma_c^2,\ \sigma_c^2\right),$$

where $\sigma_p = 0.1$, $\sigma_I = 0.12$, and $\sigma_c = 0.03$.
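As a small illustration, this diagonal $\Sigma$ can be assembled directly from the values above. The 5-D feature ordering (2-D position, intensity, 2-D chromaticity) mirrors the ordering of the diagonal entries, while the Gaussian affinity shown below is only an assumed use of $\Sigma$; the actual smoothness term is Equation 10 in the main paper:

```python
import torch

# Standard deviations quoted above.
SIGMA_P, SIGMA_I, SIGMA_C = 0.1, 0.12, 0.03

# Diagonal covariance over an assumed 5-D feature: (x, y, intensity, chroma_1, chroma_2).
SIGMA = torch.diag(torch.tensor(
    [SIGMA_P**2, SIGMA_P**2, SIGMA_I**2, SIGMA_C**2, SIGMA_C**2]))
SIGMA_INV = torch.inverse(SIGMA)

def pairwise_weight(f_i, f_j):
    """Assumed Gaussian affinity between two 5-D pixel feature vectors,
    weighted by the diagonal covariance above (a sketch, not Eq. 10 itself)."""
    d = (f_i - f_j).unsqueeze(-1)                                  # (5, 1)
    mahalanobis_sq = (d.transpose(-1, -2) @ SIGMA_INV @ d).squeeze()
    return torch.exp(-0.5 * mahalanobis_sq)
```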

2 Visual comparisons between CGI and SUNCG/PBRS renderings

In Figure 1 we provide additional visual comparisons between rendered images from our CGINTRINSICS dataset and the SUNCG/PBRS dataset, illustrating the greater signal-to-noise ratio and realism of our renderings.

3 Additional qualitative results on IIW/SAW

In this section, we provide a large number of additional qualitative comparisons of intrinsic image decompositions from the IIW/SAW test sets. We include decompositions predicted by our network trained solely on the CGINTRINSICS dataset, as well as decompositions from our network trained on CGINTRINSICS plus training data from IIW and SAW. We compare our results to those of two state-of-the-art algorithms, Bell et al. [2] and Zhou et al. [3]. Predictions are shown in Figures 2-13. Note that the images in Figures 12 and 13 are from the NYUv2 dataset [4], and are also included in the SAW dataset.

We observe that our decompositions are consistently better than those of the two state-of-the-art algorithms [2, 3] across a large number of photos of a variety of indoor environments. Note in particular how our method better avoids attributing reflectance texture to the shading image. These results suggest the strong generalization abilities of our models and the surprising effectiveness of our proposed synthetic CGINTRINSICS dataset. However, our predictions still contain mistakes, such as residual textures in the shading channel and residual shadows in the reflectance channel.


Fig. 1. Additional visual comparisons between SUNCG/PBRS renderings and our CGI renderings. Odd columns: images from SUNCG/PBRS. Even columns: images from our CGI dataset. The images in our dataset have higher SNR and are generally more realistic.


Fig. 2. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 3. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 4. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 5. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 6. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 7. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 8. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 9. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 10. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 11. Additional qualitative comparisons on the IIW/SAW test sets. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 12. Additional qualitative comparisons on the NYUv2 dataset. Note that the images in NYUv2 are also included in the SAW test set. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


Fig. 13. Additional qualitative comparisons on the NYUv2 dataset. Note that the images in NYUv2 are also included in the SAW test set. Odd rows: predicted reflectance images. Even rows: predicted shading images. Columns from left to right: input image, results of Bell et al. [2], results of Zhou et al. [3], results of our CGI-trained network, results of our network trained on CGI+IIW+SAW.


References

1. Zhang, Y., Song, S., Yumer, E., Savva, M., Lee, J.Y., Jin, H., Funkhouser, T.: Physically-based rendering for indoor scene understanding using convolutional neural networks. In: Proc. Computer Vision and Pattern Recognition (CVPR). (2017) 5057-5065

2. Bell, S., Bala, K., Snavely, N.: Intrinsic images in the wild. ACM Trans. Graphics 33(4) (2014) 159

3. Zhou, T., Krahenbuhl, P., Efros, A.A.: Learning data-driven reflectance priors for intrinsic image decomposition. In: Proc. Int. Conf. on Computer Vision (ICCV). (2015) 3469-3477

4. Silberman, N., Hoiem, D., Kohli, P., Fergus, R.: Indoor segmentation and support inference from RGBD images. In: Proc. European Conf. on Computer Vision (ECCV). (2012)
