Skip to main content

Deep Learning and OcTree-GPU-Based ICP for Efficient 6D Model Registration of Large Objects

  • Conference paper
  • First Online:
Human-Friendly Robotics 2021

Part of the book series: Springer Proceedings in Advanced Robotics ((SPAR,volume 23))

Abstract

In this paper, the shape alignment method that takes into account large objects with very limited visibility and significant occlusions utilizing an octree data structure on the GPU is presented. The proposed algorithm relies on the offline computed 3D model of the object and an initial estimation of it’s pose using a deep learning technique to detect key features of the object, in order to improve the accuracy and the speed of the registration process. The final aligned pose is achieved by computing the iterative closest point algorithm on GPU utilizing octree, starting from the initial estimated pose. To highlight the application of the proposed method, autonomous robotic tasks requiring interaction with washing machine is discussed. Finally, the performance in terms of both speed and accuracy of the different implementations of the algorithm on the CPU and GPU, as well as with and without augmented octree neighbourhood search is provided.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Abadi, A. et al.: TensorFlow: Large-scale machine learning on heterogeneous systems (2015). Software available from tensorflow.org

    Google Scholar 

  2. AraĂºjo, H., Carceroni, R., Brown, C.M.: A Fully Projective Formulation for Lowe’s Tracking Algorithm (1996)

    Google Scholar 

  3. Besl, P.J., McKay, N.D.: A method for registration of 3-d shapes. IEEE Trans. Pattern Anal. Mach. Intell. 14(2), 239–256 (1992)

    Google Scholar 

  4. Chen, Y., Medioni, G.: Object modeling by registration of multiple range images. In: Proceedings. 1991 IEEE International Conference on Robotics and Automation, vol. 3, pp. 2724–2729 (1991)

    Google Scholar 

  5. Elseberg, J., Borrmann, D., Nuchter, A.: Efficient processing of large 3d point clouds, 10 (2011)

    Google Scholar 

  6. Elseberg, J., Magnenat, S., Siegwart, R., Nuchter, A.: Comparison on nearest-neigbour-search strategies and implementations for efficient shape registration. J. Softw. Eng. Robot. (JOSER) 3, 2–12 (2012)

    Google Scholar 

  7. Gao, X.-S., Hou, X.-R., Tang, J., Cheng, H.-F.: Complete solution classification for the perspective-three-point problem. IEEE Trans. Pattern Anal. Mach. Intell. 25(8), 930–943 (2003)

    Google Scholar 

  8. Garrido-Jurado, S., Muñz-Salinas, R., Madrid-Cuevas, F., Marín-Jiménez, M.: Automatic generation and detection of highly reliable fiducial markers under occlusion. Pattern Recogn. 47, 2280–2292 (2014)

    Google Scholar 

  9. Girshick, R.B., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. CoRR, abs/1311.2524 (2013)

  10. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. CoRR, abs/1512.03385 (2015)

  11. Holz, D., Ichim, A.E., Tombari, F., Rusu, R.B., Behnke, S.: Registration with the point cloud library: a modular framework for aligning in 3-d. IEEE Robot. Autom., Mag. 22, 110–124 (2015)

    Google Scholar 

  12. Jian, B., Vemuri, B.C.: Robust point set registration using gaussian mixture models. IEEE Trans. Pattern Anal. Mach. Intell. 33(8), 1633–1645 (2011)

    Google Scholar 

  13. Le, H., Do, T.-T., Hoang, T., Cheung, N.-M.: SDRSAC: semidefinite-based randomized approach for robust point cloud registration without correspondences. CoRR, abs/1904.03483 (2019)

  14. Lepetit, V., Moreno-Noguer, F., Fua, P.: Epnp: an accurate o(n) solution to the pnp problem. Int. J. Comput. Vis. 81, 02 (2009)

    Article  Google Scholar 

  15. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.E., Fu, C.-Y., Berg, A.C.: SSD: single shot multibox detector. CoRR, abs/1512.02325 (2015)

  16. Lowe, D.G.: Fitting parameterized three-dimensional models to images. IEEE Trans. Pattern Anal. Mach. Intell. 13(5), 441–450 (1991)

    Google Scholar 

  17. Lowe, D.G.: Three-dimensional object recognition from single two-dimensional images. Artif. Intell. 31(3), 355–395 (1987)

    Article  Google Scholar 

  18. Lu, C.P., Hager, G., Mjolsness, E.: Fast and globally convergent pose estimation from video images. IEEE Trans. Pattern Anal. Mach. Intell. 22, 610–622 (2000)

    Google Scholar 

  19. Myronenko, A., Song, X.: Point set registration: Coherent point drift. IEEE Trans. Pattern Anal. Mach. Intell. 32(12), 2262–2275 (2010)

    Google Scholar 

  20. Ren, S., He, K., Girshick, R.B., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. CoRR, abs/1506.01497 (2015)

  21. Romero-Ramirez, F., Muñoz-Salinas, R., Medina-Carnicer, R.: Speeded up detection of squared fiducial markers. Image Vis. Comput. 76, 06 (2018)

    Article  Google Scholar 

  22. Rusinkiewicz, S. Levoy, M.: Efficient variants of the icp algorithm. In: Proceedings Third International Conference on 3-D Digital Imaging and Modeling, pp. 145–152 (2001)

    Google Scholar 

  23. Rusu, R.B., Blodow, N., Beetz, M.: Fast point feature histograms (fpfh) for 3d registration. In: 2009 IEEE International Conference on Robotics and Automation, pp. 3212–3217 (2009)

    Google Scholar 

  24. Wehr, D., Radkowski, R.: Parallel kd-tree construction on the gpu with an adaptive split and sort strategy. Int. J. Parallel Program. 46, 12 (2018)

    Article  Google Scholar 

  25. Weng, J., Ahuja, N., Huang, T.S.: Optimal motion and structure estimation. In: Proceedings IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 144–152 (1989)

    Google Scholar 

  26. Yang, J., Li, H., Campbell, D., Jia, Y.: Go-icp: a globally optimal solution to 3d icp point-set registration. IEEE Trans. Pattern Anal. Mach. Intell. 38(11), 2241–2254 (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wendwosen B. Bedada .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ahmadli, I., Bedada, W.B., Palli, G. (2022). Deep Learning and OcTree-GPU-Based ICP for Efficient 6D Model Registration of Large Objects. In: Palli, G., Melchiorri, C., Meattini, R. (eds) Human-Friendly Robotics 2021. Springer Proceedings in Advanced Robotics, vol 23. Springer, Cham. https://doi.org/10.1007/978-3-030-96359-0_3

Download citation

Publish with us

Policies and ethics