Abstract
A number of recent papers have investigated reconstruction under Manhattan world assumption, in which surfaces in the world are assumed to be aligned with one of three dominant directions [1,2,3,4]. In this paper we present a dynamic programming solution to the reconstruction problem for “indoor” Manhattan worlds (a sub–class of Manhattan worlds). Our algorithm deterministically finds the global optimum and exhibits computational complexity linear in both model complexity and image size. This is an important improvement over previous methods that were either approximate [3] or exponential in model complexity [4]. We present results for a new dataset containing several hundred manually annotated images, which are released in conjunction with this paper.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Coughlan, J., Yuille, A.: Manhattan world: compass direction from a single image by bayesian inference. In: CVPR, vol. 2, pp. 941–947 (1999)
Kǒsecká, J., Zhang, W.: Video compass. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002, Part IV. LNCS, vol. 2353, pp. 476–490. Springer, Heidelberg (2002)
Furukawa, Y., Curless, B., Seitz, S., Szeliski, R.: Manhattan-world stereo. In: CVPR, pp. 1422–1429 (2009)
Lee, D.C., Hebert, M., Kanade, T.: Geometric reasoning for single image structure recovery. In: CVPR (2009)
Flint, A., Mei, C., Reid, I., Murray, D.: Growing semantically meaningful models for visual slam. In: CVPR (2010)
Hedau, V., Hoiem, D., Forsyth, D.: Recovering the spatial layout of cluttered rooms. In: ICCV, vol. 2 (2009)
Huffman, D.A.: Impossible objects as nonsense sentences. Machine Intelligence 6, 295–323 (1971)
Waltz, D.L.: Generating semantic descriptions from drawings of scenes with shadows. Technical report, MIT (1972)
Sugihara, K.: Mathematical structures of line drawings of polyhedrons. PAMI 4, 458–469 (1982)
Kanade, T.: A theory of origami world. Artificial Intelligence 13, 279–311 (1980)
Hoiem, D., Efros, A.A., Hébert, M.: Geometric context from a single image. In: ICCV, pp. 654–661 (2005)
Saxena, A., Sun, M., Ng, A.Y.: Make3d: Learning 3d scene structure from a single still image. PAMI 31, 824–840 (2009)
Barinova, O., Konushin, V., Yakubenko, A., Lee, K., Lim, H., Konushin, A.: Fast automatic single-view 3-d reconstruction of urban scenes. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 100–113. Springer, Heidelberg (2008)
Felzenszwalb, D., Veksler, O.: Tiered scene labeling with dynamic programming. In: CVPR (2010)
Criminisi, A.: Accurate visual metrology from single and multiple uncalibrated images. Springer, New York (2001)
Kolmogorov, V., Zabih, R.: What energy functions can be minimized via graph cuts? PAMI (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Flint, A., Mei, C., Murray, D., Reid, I. (2010). A Dynamic Programming Approach to Reconstructing Building Interiors. In: Daniilidis, K., Maragos, P., Paragios, N. (eds) Computer Vision – ECCV 2010. ECCV 2010. Lecture Notes in Computer Science, vol 6315. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15555-0_29
Download citation
DOI: https://doi.org/10.1007/978-3-642-15555-0_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15554-3
Online ISBN: 978-3-642-15555-0
eBook Packages: Computer ScienceComputer Science (R0)