Make3D: Learning 3D Scene Structure from a Single Still Image

30 May 2008

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 31 (5), 824-840
https://doi.org/10.1109/tpami.2008.132

Abstract

We consider the problem of estimating detailed 3D structure from a single still image of an unstructured environment. Our goal is to create 3D models that are both quantitatively accurate as well as visually pleasing. For each small homogeneous patch in the image, we use a Markov random field (MRF) to infer a set of "plane parameters" that capture both the 3D location and 3D orientation of the patch. The MRF, trained via supervised learning, models both image depth cues as well as the relationships between different parts of the image. Other than assuming that the environment is made up of a number of small planes, our model makes no explicit assumptions about the structure of the scene; this enables the algorithm to capture much more detailed 3D structure than does prior art and also give a much richer experience in the 3D flythroughs created using image-based rendering, even for scenes with significant nonvertical structure. Using this approach, we have created qualitatively correct 3D models for 64.9 percent of 588 images downloaded from the Internet. We have also extended our model to produce large-scale 3D models from a few images.
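
The abstract does not spell out how a single set of plane parameters can capture both location and orientation. One common parameterization, assumed here for illustration and not quoted from the abstract, represents a plane by a vector alpha such that every point q on the plane satisfies alpha . q = 1. Under that assumption the direction of alpha is the plane normal, 1/|alpha| is the distance from the camera center to the closest point on the plane, and the depth along a unit viewing ray r is 1/(r . alpha). The function names below are hypothetical, and the sketch covers only this geometry, not the MRF inference.

```python
import numpy as np

def plane_from_alpha(alpha):
    """Split assumed plane parameters alpha into (unit normal, closest distance)."""
    alpha = np.asarray(alpha, dtype=float)
    norm = np.linalg.norm(alpha)
    return alpha / norm, 1.0 / norm

def depth_along_ray(alpha, ray):
    """Depth at which a ray from the camera center meets the plane alpha . q = 1."""
    ray = np.asarray(ray, dtype=float)
    ray = ray / np.linalg.norm(ray)  # normalize so the result is a metric depth
    return 1.0 / float(ray @ np.asarray(alpha, dtype=float))

# Illustrative plane: fronto-parallel, 4 units in front of the camera (z = 4).
alpha = np.array([0.0, 0.0, 0.25])
normal, distance = plane_from_alpha(alpha)

# Depth straight ahead, and along an oblique ray that hits the plane at (3, 0, 4).
d_center = depth_along_ray(alpha, [0.0, 0.0, 1.0])
d_oblique = depth_along_ray(alpha, [3.0, 0.0, 4.0])
```

With one such vector per image patch, a depth value follows for every pixel ray passing through that patch, which is what makes a piecewise-planar 3D model renderable.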

Cited by 1233 articles