|
With the recent rise in popularity of Internet photo sharing sites like Flickr and Google Images, community photo collections (CPCs) have emerged as a powerful new type of image dataset for computer vision and computer graphics research. With billions of such photos now online, these collections should enable huge opportunities in 3D reconstruction, visualization, image-based rendering, recognition, and other research areas. The challenge is that these collections have extreme variability, having been taken by numerous photographers from myriad viewpoints with varying lighting and appearance, and often with significant occlusions and clutter. Our research seeks to develop robust algorithms that operate successfully on such image sets to solve problems in computer vision and computer graphics.
The Google Tech Talk "Navigating the World's Photographs" by Steve Seitz, Noah Snavely, and Michael Goesele gives a good overview over our current work on community photo collections. View a video of the talk on Google Video or download the video in Flash Video (FLV) format (114 MB) or AVI format (141 MB).
The paper Scene Reconstruction and Visualization From Community Photo Collections by Noah Snavely, Ian Simon, Michael Goesele, Rick Szeliski, and Steve Seitz has been published in the August 2010 special issue of Proceedings of the IEEE on Internet Vision. This paper also gives a good overview of our work on community photo collections.
The following projects are based on community photo collections:
Photo Tourism
Photo tourism: Exploring photo collections in 3D (PDF) More information is available at the project page. |
|
Multi-View Stereo for Community Photo Collections
Multi-View Stereo for Community Photo Collections (PDF) More information is available at the project page. |
|
Scene Summarization for Online Image Collections
Scene Summarization for Online Image Collections (PDF) More information is available at the project page. |
|
Finding Paths through the World's Photos
Finding Paths through the World's
Photos (PDF)
(qt video) |
|
Skeletal Graphs for Efficient Structure from Motion
Skeletal Sets for Efficient Structure from Motion (PDF) |
|
Scene Segmentation Using the Wisdom of Crowds
Scene Segmentation Using the Wisdom of Crowds
|
|
Building
Rome in a Day (PDF)
Sameer Agarwal, Noah Snavely, Ian Simon, Steven M. Seitz and Richard Szeliski International Conference on Computer Vision, 2009, Kyoto, Japan. |