skip to main content
10.1145/3458305.3478443acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article
Open Access
Artifacts Evaluated & Functional / v1.1
Best Paper

AliceVision Meshroom: An open-source 3D reconstruction pipeline

Published:22 September 2021Publication History

ABSTRACT

This paper introduces the Meshroom software and its underlying 3D computer vision framework AliceVision. This solution provides a photogrammetry pipeline to reconstruct 3D scenes from a set of unordered images. It also features other pipelines for fusing multi-bracketing low dynamic range images into high dynamic range, stitching multiple images into a panorama and estimating the motion of a moving camera. Meshroom's node-graph architecture allows the user to customize the different pipelines to adjust them to their domain specific needs. The user can interactively add other processing nodes to modify a pipeline, export intermediate data to analyze the result of the algorithms and easily compare the outputs given by different sets of parameters. The software package is released in open source and relies on open file formats. These features enable researchers to conveniently run the pipelines, access and visualize the data at each step, thus promoting the sharing and the reproducibility of the results.

Skip Supplemental Material Section

Supplemental Material

References

  1. Jong-Beom Jeong, Soonbin Lee, Il-Woong Ryu, Tuan Thanh Le, and Eun-Seok Ryu. Towards Viewport-dependent 6DoF 360 Video Tiled Streaming for Virtual Reality Systems. In Proc. 28th ACM Int. Conf. Multimed., pages 3687--3695, New York, NY, USA, oct 2020. ACM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Savino Dambra, Giuseppe Samela, Lucile Sassatelli, Romaric Pighetti, Ramon Aparicio-Pardo, and Anne-Marie Pinna-Déry. Film editing. In Proc. 9th ACM Multimed. Syst. Conf., pages 27--39, New York, NY, USA, jun 2018. ACM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Mohammad Hosseini and Christian Timmerer. Dynamic Adaptive Point Cloud Streaming. In Proc. 23rd Pack. Video Work., pages 25--30, New York, NY, USA, jun 2018. ACM.Google ScholarGoogle Scholar
  4. Suraj Raghuraman, Kanchan Bahirat, and Balakrishnan Prabhakaran. A Visual Latency Estimator for 3D Tele-Immersion. In Proc. 8th ACM Multimed. Syst. Conf., pages 272--283, New York, NY, USA, jun 2017. ACM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Cédric Portaneri, Pierre Alliez, Michael Hemmer, Lukas Birklein, and Elmar Schoemer. Cost-driven framework for progressive compression of textured meshes. In Proc. 10th ACM Multimed. Syst. Conf., pages 175--188, New York, NY, USA, jun 2019. ACM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Pierre Moulon. Positionnement robuste et précis de réseaux d'images. PhD thesis, Laboratoire d'Informatique Gaspard-Monge, École des Ponts ParisTech, 2014.Google ScholarGoogle Scholar
  7. Michal Jančošek. Large Scale Surface Reconstruction based on Point Visibility. PhD thesis, Czech Technical University in Prague, 2015.Google ScholarGoogle Scholar
  8. Lilian Calvet. Méthodes de reconstruction tridimensionnelle intégrant des points cycliques : application au suivi d'une caméra. PhD thesis, Institut National Polytechnique de Toulouse, 2014.Google ScholarGoogle Scholar
  9. Carsten Griwodz, Lilian Calvet, and Pål Halvorsen. Popsift. In Proc. 9th ACM Multimed. Syst. Conf., pages 415--420, New York, NY, USA, jun 2018. ACM.Google ScholarGoogle Scholar
  10. Toby Collins, Daniel Pizarro, Simone Gasparini, Nicolas Bourdel, Pauline Chauvet, Michel Canis, Lilian Calvet, and Adrien Bartoli. Augmented Reality Guided Laparoscopic Surgery of the Uterus. IEEE Transactions on Medical Imaging, 40(1):371--380, January 2021.Google ScholarGoogle ScholarCross RefCross Ref
  11. Asla Medeiros e Sá, Adolfo Bartolome Ibañez Vila, Karina Rodriguez Echavarria, Ricardo Marroquim, and Vivian Luiz Fonseca. Accessible Digitisation and Visualisation of Open Cultural Heritage Assets. In Selma Rizvic and Karina Rodriguez Echavarria, editors, Eurographics Workshop on Graphics and Cultural Heritage. The Eurographics Association, 2019.Google ScholarGoogle Scholar
  12. J. Milàn, P. Falkingham, and Inken Juliane Mueller-Töwe. Small ornithopod dinosaur tracks and crocodilian remains from the middle jurassic bagå formation, bornholm, denmark: Important additions to the rare middle jurassic vertebrate faunas of northern europe. Bulletin of The Geological Society of Denmark, 68:245--253, 2020.Google ScholarGoogle ScholarCross RefCross Ref
  13. Jens Lallensack, Michael Buchwitz, and Anthony Romilio. Photogrammetry in ichnology: 3d model generation, visualisation, and data extraction. November 2020.Google ScholarGoogle Scholar
  14. Shah Ariful Hoque Chowdhury, Chuong Nguyen, Hengjia Li, and Richard Hartley. Fixed-lens camera setup and calibrated image registration for multifocus multiview 3d reconstruction. Neural Computing and Applications, April 2021.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Marco Wallner, Daniel Steininger, Verena Widhalm, Matthias Schörghuber, and Csaba Beleznai. RGB-d railway platform monitoring and scene understanding for enhanced passenger safety. In Pattern Recognition. ICPR International Workshops and Challenges, pages 656--671. Springer International Publishing, 2021.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Johannes Lutz Schönberger and Jan-Michael Frahm. Structure-from-Motion Revisited. In Conference on Computer Vision and Pattern Recognition (CVPR), 2016.Google ScholarGoogle Scholar
  17. Ewelina Rupnik, Mehdi Daakir, and Marc Pierrot Deseilligny. MicMac -- a free, open-source solution for photogrammetry. Open Geospatial Data, Softw. Stand., 2(1):14, dec 2017.Google ScholarGoogle Scholar
  18. Simon Fuhrmann, Fabian Langguth, and Michael Goesele. MVE - A Multi-View Reconstruction Environment. In Reinhard Klein and Pedro Santos, editors, Eurographics Workshop on Graphics and Cultural Heritage. The Eurographics Association, 2014.Google ScholarGoogle Scholar
  19. OpenDroneMap Authors. Odm - a command line toolkit to generate maps, point clouds, 3d models and dems from drone, balloon or kite images.Google ScholarGoogle Scholar
  20. Regard3D Authors. Regard3d.Google ScholarGoogle Scholar
  21. C. Strecha, W. von Hansen, L. Van Gool, P. Fua, and U. Thoennessen. On benchmarking camera calibration and multi-view stereo for high resolution imagery. In 2008 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, June 2008.Google ScholarGoogle ScholarCross RefCross Ref
  22. David G. Lowe. Distinctive Image Features from Scale-Invariant Keypoints. Int. J. Comput. Vis., 60(2):91--110, nov 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Jingming Dong and Stefano Soatto. Domain-size pooling in local descriptors: DSP-SIFT. In 2015 IEEE Conf. Comput. Vis. Pattern Recognit., pages 5097--5106. IEEE, jun 2015.Google ScholarGoogle ScholarCross RefCross Ref
  24. P. F. Alcantarilla, J. Nuevo, and A. Bartoli. Fast explicit diffusion for accelerated features in nonlinear scale spaces. In British Machine Vision Conf. (BMVC), 2013.Google ScholarGoogle ScholarCross RefCross Ref
  25. Lilian Calvet, Pierre Gurdjos, Carsten Griwodz, and Simone Gasparini. Detection and Accurate Localization of Circular Fiducials under Highly Challenging Conditions. In 2016 IEEE Conf. Comput. Vis. Pattern Recognit., pages 562--570. IEEE, jun 2016.Google ScholarGoogle ScholarCross RefCross Ref
  26. Edwin Olson. AprilTag: A robust and flexible visual fiducial system. In 2011 IEEE Int. Conf. Robot. Autom., pages 3400--3407. IEEE, may 2011.Google ScholarGoogle ScholarCross RefCross Ref
  27. D. Nister and H. Stewenius. Scalable Recognition with a Vocabulary Tree. In 2006 IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. - Vol. 2, volume 2, pages 2161--2168. IEEE.Google ScholarGoogle Scholar
  28. Jian Cheng, Cong Leng, Jiaxiang Wu, Hainan Cui, and Hanqing Lu. Fast and Accurate Image Matching with Cascade Hashing for 3D Reconstruction. In 2014 IEEE Conf. Comput. Vis. Pattern Recognit., pages 1--8. IEEE, jun 2014.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Marius Muja and David G Lowe. Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration. In Alpesh Ranchordas and Helder Araújo, editors, {VISAPP} 2009 - Proc. Fourth Int. Conf. Comput. Vis. Theory Appl. Lisboa, Port. Febr. 5-8, 2009 - Vol. 1, pages 331--340. {INSTICC} Press, 2009.Google ScholarGoogle Scholar
  30. M. Slaney and M. Casey. Locality-Sensitive Hashing for Finding Nearest Neighbors [Lecture Notes]. IEEE Signal Process. Mag., 25(2):128--131, mar 2008.Google ScholarGoogle ScholarCross RefCross Ref
  31. Martin A Fischler and Robert C Bolles. Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography. Commun. Assoc. Comput. Mach., 24(6):381--395, jun 1981.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Changchang Wu. Towards Linear-Time Incremental Structure from Motion. In 2013 Int. Conf. 3D Vis., pages 127--134. IEEE, jun 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Pierre Moulon, Pascal Monasse, and Renaud Marlet. Global Fusion of Relative Motions for Robust, Accurate and Scalable Structure from Motion. In 2013 IEEE Int. Conf. Comput. Vis., pages 3248--3255. IEEE, dec 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Michal Havlena, Akihiko Torii, and Tomáš Pajdla. Efficient Structure from Motion by Graph Optimization. pages 100--113. 2010.Google ScholarGoogle Scholar
  35. Roberto Toldo, Riccardo Gherardi, Michela Farenzena, and Andrea Fusiello. Hierarchical structure-and-motion recovery from uncalibrated images. Comput. Vis. Image Underst., 140:127--143, nov 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Rajvi Shah, Aditya Deshpande, and P. J. Narayanan. Multistage SFM: Revisiting Incremental Structure from Motion. In 2014 2nd Int. Conf. 3D Vis., pages 417--424. IEEE, dec 2014.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. H. Hirschmuller. Stereo Processing by Semiglobal Matching and Mutual Information. IEEE Trans. Pattern Anal. Mach. Intell., 30(2):328--341, feb 2008.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Xing Mei, Xun Sun, Mingcai Zhou, Shaohui Jiao, Haitao Wang, and Xiaopeng Zhang. On building an accurate stereo matching system on graphics hardware. In 2011 IEEE Int. Conf. Comput. Vis. Work. (ICCV Work., pages 467--474. IEEE, nov 2011.Google ScholarGoogle ScholarCross RefCross Ref
  39. Jose Luis Blanco and Pranjal Kumar Rai. nanoflann: a C++ header-only fork of FLANN, a library for nearest neighbor (NN) with kd-trees. https://github.com/jlblancoc/nanoflann, 2014.Google ScholarGoogle Scholar
  40. Bruno Lévy and Alain Filbois. Geogram: a library for geometric algorithms. In Int. Conf. Adapt. Model. Simul., Nantes, France, 2015. CNIME.Google ScholarGoogle Scholar
  41. Michal Jancosek and Tomas Pajdla. Multi-view reconstruction preserving weakly-supported surfaces. In CVPR 2011, pages 3121--3128. IEEE, jun 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Michal Jancosek and Tomas Pajdla. Exploiting Visibility Information in Surface Reconstruction to Preserve Weakly Supported Surfaces. Int. Sch. Res. Not., 2014:1--20, aug 2014.Google ScholarGoogle Scholar
  43. Y. Boykov and V. Kolmogorov. An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision. IEEE Trans. Pattern Anal. Mach. Intell., 26(9):1124--1137, sep 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Bruno Lévy, Sylvain Petitjean, Nicolas Ray, and Jérome Maillot. Least squares conformal maps for automatic texture atlas generation. ACM Trans. Graph., 21(3):362--371, jul 2002.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Peter J. Burt and Edward H. Adelson. A multiresolution spline with application to image mosaics. ACM Trans. Graph., 2(4):217--236, oct 1983.Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Adam Baumberg. Blending Images for Texturing 3D Models. Proc. Br. Mach. Vis. Conf., 2003.Google ScholarGoogle Scholar
  47. Cedric Allene, Jean-Philippe Pons, and Renaud Keriven. Seamless image-based texture atlases using multi-band blending. In 2008 19th Int. Conf. Pattern Recognit., pages 1--4. IEEE, dec 2008.Google ScholarGoogle ScholarCross RefCross Ref
  48. Simone Gasparini, Fabien Castan, and Yann Lanthony. Buddha dataset, January 2017. https://github.com/alicevision/datasetbuddha.Google ScholarGoogle Scholar
  49. Richard dataset. pi3dscan. https://www.pi3dscan.com/index.php/download/item/example-model-richard.Google ScholarGoogle Scholar
  50. Paul E. Debevec and Jitendra Malik. Recovering high dynamic range radiance maps from photographs. In Proceedings of the 24th annual conference on Computer graphics and interactive techniques - SIGGRAPH 97. ACM Press, 1997.Google ScholarGoogle Scholar
  51. M.D. Grossberg and S.K. Nayar. Modeling the space of camera response functions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(10):1272--1282, October 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Carsten Rother, Vladimir Kolmogorov, and Andrew Blake. "GrabCut". ACM Transactions on Graphics, 23(3):309--314, August 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Matthew Brown and David G. Lowe. Automatic panoramic image stitching using invariant features. International Journal of Computer Vision, 74(1):59--73, December 2006.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. AliceVision Meshroom: An open-source 3D reconstruction pipeline

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        MMSys '21: Proceedings of the 12th ACM Multimedia Systems Conference
        June 2021
        254 pages
        ISBN:9781450384346
        DOI:10.1145/3458305

        Copyright © 2021 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 22 September 2021

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        MMSys '21 Paper Acceptance Rate18of55submissions,33%Overall Acceptance Rate176of530submissions,33%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader