skip to main content
research-article

Learning to reconstruct shape and spatially-varying reflectance from a single image

Published:04 December 2018Publication History
Skip Abstract Section

Abstract

Reconstructing shape and reflectance properties from images is a highly under-constrained problem, and has previously been addressed by using specialized hardware to capture calibrated data or by assuming known (or highly constrained) shape or reflectance. In contrast, we demonstrate that we can recover non-Lambertian, spatially-varying BRDFs and complex geometry belonging to any arbitrary shape class, from a single RGB image captured under a combination of unknown environment illumination and flash lighting. We achieve this by training a deep neural network to regress shape and reflectance from the image. Our network is able to address this problem because of three novel contributions: first, we build a large-scale dataset of procedurally generated shapes and real-world complex SVBRDFs that approximate real world appearance well. Second, single image inverse rendering requires reasoning at multiple scales, and we propose a cascade network structure that allows this in a tractable manner. Finally, we incorporate an in-network rendering layer that aids the reconstruction task by handling global illumination effects that are important for real-world scenes. Together, these contributions allow us to tackle the entire inverse rendering problem in a holistic manner and produce state-of-the-art results on both synthetic and real data.

Skip Supplemental Material Section

Supplemental Material

a269-li.mov

mov

106.5 MB

References

  1. Miika Aittala, Timo Aila, and Jaakko Lehtinen. 2016. Reflectance modeling by neural texture synthesis. ACM Trans. Graphics 35, 4 (2016). Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Miika Aittala, Tim Weyrich, Jaakko Lehtinen, et al. 2015. Two-shot SVBRDF capture for stationary materials. ACM Trans. Graphics 34, 4 (2015). Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Aayush Bansal, Bryan Russell, and Abhinav Gupta. 2016. Marr Revisited: 2D-3D Model Alignment via Surface Normal Prediction. In CVPR.Google ScholarGoogle Scholar
  4. Jonathan T Barron and Jitendra Malik. 2015. Shape, illumination, and reflectance from shading. PAMI 37, 8 (2015).Google ScholarGoogle Scholar
  5. Jonathan T Barron and Ben Poole. 2016. The fast bilateral solver. In European Conference on Computer Vision. Springer, 617--632.Google ScholarGoogle ScholarCross RefCross Ref
  6. Sean Bell, Paul Upchurch, Noah Snavely, and Kavita Bala. 2015. Material Recognition in the Wild with the Materials in Context Database. In CVPR.Google ScholarGoogle Scholar
  7. Volker Blanz and Thomas Vetter. 1999. A morphable model for the synthesis of 3D faces. In Proc. SIGGRAPH. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Manmohan Chandraker. 2014. On shape and material recovery from motion. In ECCV.Google ScholarGoogle Scholar
  9. Manmohan Chandraker, Fredrik Kahl, and David Kriegman. 2005. Reflections on the generalized bas-relief ambiguity. In CVPR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Angel X Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, et al. 2015. Shapenet: An information-rich 3d model repository. arXiv preprint arXiv:1512.03012 (2015).Google ScholarGoogle Scholar
  11. Michael F Cohen and John R Wallace. 1993. Radiosity and realistic image synthesis. Elsevier. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Paul Debevec, Tim Hawkins, Chris Tchou, Haarm-Pieter Duiker, Westley Sarokin, and Mark Sagar. 2000. Acquiring the reflectance field of a human face. In SIGGRAPH. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Valentin Deschaintre, Miika Aittala, Fredo Durand, George Drettakis, and Adrien Bousseau. 2018. Single-image SVBRDF Capture with a Rendering-aware Deep Network. ACM Trans. Graph. 37, 4 (2018). Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. David Eigen and Rob Fergus. 2015. Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture. In ICCV. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Marc-André Gardner, Kalyan Sunkavalli, Ersin Yumer, Xiaohui Shen, Emiliano Gambaretto, Christian Gagné, and Jean-François Lalonde. 2017. Learning to predict indoor illumination from a single image. ACM Trans. Graphics 9, 4 (2017). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Stamatios Georgoulis, Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Tinne Tuytelaars, and Luc Van Gool. 2017. What is around the camera?. In ICCV.Google ScholarGoogle Scholar
  17. Clement Godard, Peter Hedman, Wenbin Li, and Gabriel J Brostow. 2015. Multi-view reconstruction of highly specular surfaces in uncontrolled environments. In 3DV. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Dan B Goldman, Brian Curless, Aaron Hertzmann, and Steven M Seitz. 2010. Shape and spatially-varying brdfs from photometric stereo. PAMI 32, 6 (2010). Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR.Google ScholarGoogle Scholar
  20. Yannick Hold-Geoffroy, Kalyan Sunkavalli, Sunil Hadap, Emiliano Gambaretto, and Jean-François Lalonde. 2017. Deep Outdoor Illumination Estimation. In CVPR.Google ScholarGoogle Scholar
  21. Z. Hui and A. C. Sankaranarayanan. 2017. Shape and Spatially-Varying Reflectance Estimation from Virtual Exemplars. PAMI 39, 10 (2017).Google ScholarGoogle Scholar
  22. Zhuo Hui, Kalyan Sunkavalli, Joon-Young Lee, Sunil Hadap, Jian Wang, and Aswin C. Sankaranarayanan. 2017. Reflectance capture using univariate sampling of BRDFs. In ICCV.Google ScholarGoogle Scholar
  23. Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, and Thomas Brox. 2017. FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks. In CVPR.Google ScholarGoogle Scholar
  24. Carlo Innamorati, Tobias Ritschel, Tim Weyrich, and Niloy J Mitra. 2017. Decomposing single images for layered photo retouching. 36, 4 (2017). Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. K. Johnson and E. H. Adelson. 2011. Shape estimation in natural illumination. In CVPR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Brian Karis and Epic Games. 2013. Real shading in Unreal Engine 4. SIGGRAPH 2013 Courses: Physically Based Shading Theory Practice (2013).Google ScholarGoogle Scholar
  27. Martin Knecht, Georg Tanzmeister, Christoph Traxler, and Michael Wimmer. 2012. Interactive BRDF Estimation for Mixed-Reality Applications. WSCG 20, 1 (2012).Google ScholarGoogle Scholar
  28. Xiao Li, Yue Dong, Pieter Peers, and Xin Tong. 2017a. Modeling surface appearance from a single photograph using self-augmented convolutional neural networks. ACM Trans. Graphics 36, 4 (2017). Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Zhengqin Li, Kalyan Sunkavalli, and Manmohan Chandraker. 2018. Materials for Masses: SVBRDF Acquisition with a Single Mobile Phone Image. In ECCV.Google ScholarGoogle Scholar
  30. Z. Li, Z. Xu, R. Ramamoorthi, and M. Chandraker. 2017b. Robust Energy Minimization for BRDF-Invariant Shape from Light Fields. In CVPR.Google ScholarGoogle Scholar
  31. Guilin Liu, Duygu Ceylan, Ersin Yumer, Jimei Yang, and Jyh-Ming Lien. 2017. Material Editing using a Physically Based Rendering Network. ICCV.Google ScholarGoogle Scholar
  32. Julio Marco, Quercus Hernandez, Adolfo Munoz, Yue Dong, Adrian Jarabo, Min H Kim, Xin Tong, and Diego Gutierrez. 2017. DeepToF: off-the-shelf real-time correction of multipath interference in time-of-flight imaging. ACM Trans. Graphics 36, 6 (2017). Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Stephen R Marschner, Stephen H Westin, Eric PF Lafortune, Kenneth E Torrance, and Donald P Greenberg. 1999. Image-based BRDF measurement including human skin. In Rendering Techniques. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Wojciech Matusik, Hanspeter Pfister, Matt Brand, and Leonard McMillan. 2003. A Data-Driven Reflectance Model. ACM Trans. Graphics 22, 3 (2003). Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Abhimitra Meka, Maxim Maximov, Michael Zollhoefer, Avishek Chatterjee, Hans-Peter Seidel, Christian Richardt, and Christian Theobalt. 2018. LIME: Live Intrinsic Material Estimation. In CVPR.Google ScholarGoogle Scholar
  36. Oliver Nalbach, Elena Arabadzhiyska, Dushyant Mehta, H-P Seidel, and Tobias Ritschel. 2017. Deep shading: convolutional neural networks for screen space shading. Comput. Graph. Forum 36, 4 (2017). Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Shree K. Nayar, Katsushi Ikeuchi, and Takeo Kanade. 1991. Shape from interreflections. IJCV 6, 3 (1991). Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Shree K. Nayar, Gurunandan Krishnan, Michael D. Grossberg, and Ramesh Raskar. 2006. Fast Separation of Direct and Global Components of a Scene Using High Frequency Illumination. ACM Trans. Graphics 25, 3 (2006). Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Diego Nehab, Szymon Rusinkiewicz, James Davis, and Ravi Ramamoorthi. 2005. Efficiently combining positions and normals for precise 3D geometry. In ACM transactions on graphics (TOG), Vol. 24. ACM, 536--543. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Alejandro Newell, Kaiyu Yang, and Jia Deng. 2016. Stacked Hourglass Networks for Human Pose Estimation. In ECCV.Google ScholarGoogle Scholar
  41. Matthew O'Toole and Kiriakos N. Kutulakos. 2010. Optical Computing for Fast Light Transport Analysis. ACM Trans. Graphics 29, 6, Article 164 (2010). Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Geoffrey Oxholm and Ko Nishino. 2016. Shape and reflectance estimation in the wild. PAMI 38, 2 (2016), 376--389. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Ravi Ramamoorthi and Pat Hanrahan. 2001. An efficient representation for irradiance environment maps. In SIGGRAPH. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Efstratios Gavves, and Tinne Tuytelaars. 2016. Deep reflectance maps. In CVPR.Google ScholarGoogle Scholar
  45. Kosta Ristovski, Vladan Radosavljevic, Slobodan Vucetic, and Zoran Obradovic. 2013. Continuous Conditional Random Fields for Efficient Regression in Large Fully Connected Graphs.. In AAAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. J. Riviere, P. Peers, and A. Ghosh. 2016. Mobile Surface Reflectometry. Comput. Graph. Forum 35, 1 (2016). Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. O. Ronneberger, P.Fischer, and T. Brox. 2015. U-Net: Convolutional Networks for Biomedical Image Segmentation. In MICCAI.Google ScholarGoogle Scholar
  48. Soumyadip Sengupta, Angjoo Kanazawa, Carlos D. Castillo, and David W. Jacobs. 2018. SfSNet: Learning Shape, Refectance and Illuminance of Faces in the Wild. In CVPR.Google ScholarGoogle Scholar
  49. Jian Shi, Yue Dong, Hao Su, and Stella X Yu. 2017. Learning Non-Lambertian Object Intrinsics Across ShapeNet Categories. In CVPR.Google ScholarGoogle Scholar
  50. Z. Shu, E. Yumer, S. Hadap, K. Sunkavalli, E. Shechtman, and D. Samaras. 2017. Neural Face Editing with Intrinsic Image Disentangling. In CVPR.Google ScholarGoogle Scholar
  51. A. Tewari, M. Zollhofer, H. Kim, P. Garrido, F. Bernard, P. Perez, and C. Theobalt. 2018. MoFA: Model-Based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction. In ICCV.Google ScholarGoogle Scholar
  52. A. Toshev and C. Szegedy. 2014. DeepPose: Human Pose Estimation via Deep Neural Networks. In CVPR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Ting-Chun Wang, Manmohan Chandraker, Alexei Efros, and Ravi Ramamoorthi. 2017. SVBRDF-Invariant Shape and Reflectance Estimation from Light-Field Cameras. PAMI (2017).Google ScholarGoogle Scholar
  54. S. E. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh. 2016. Convolutional Pose Machines. In CVPR.Google ScholarGoogle Scholar
  55. Robert J. Woodham. 1980. Photometric Method For Determining Surface Orientation From Multiple Images. Optical Engineering 19 (1980).Google ScholarGoogle Scholar
  56. Hongzhi Wu and Kun Zhou. 2015. AppFusion: Interactive Appearance Acquisition Using a Kinect Sensor. Comput. Graph. Forum 34, 6 (2015). Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Zexiang Xu, Kalyan Sunkavalli, Sunil Hadap, and Ravi Ramamoorthi. 2018. Deep image-based relighting from optimal sparse samples. ACM Trans. Graphics 37, 4 (2018). Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. Yizhou Yu, Paul Debevec, Jitendra Malik, and Tim Hawkins. 1999. Inverse Global Illumination: Recovering Reflectance Models of Real Scenes from Photographs. In SIGGRAPH. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Learning to reconstruct shape and spatially-varying reflectance from a single image

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM Transactions on Graphics
              ACM Transactions on Graphics  Volume 37, Issue 6
              December 2018
              1401 pages
              ISSN:0730-0301
              EISSN:1557-7368
              DOI:10.1145/3272127
              Issue’s Table of Contents

              Copyright © 2018 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 4 December 2018
              Published in tog Volume 37, Issue 6

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader