Processing 3D Geo-Information for Augmenting Georeferenced and Oriented Photographs with Text Labels
Online photo libraries face the problem of organizing their rapidly growing image collections. Fast and reliable image retrieval requires good qualitative captions added to a photo; however, this is considered by photographers as a time-consuming and annoying task. In order to do it in a fully automated way, the process of augmenting a photo with captions or labels starts by identifying the objects that the photo depicts. Previous attempts for a fully automatic process using computer vision technology only proved not to be optimal due to calibration issues. Existing photo annotation tools from GPS or geo-tagging services can only apply generic location information to add textual descriptions about the context and surroundings of the photo, not actually what the photo shows. To be able to exactly describe what is captured on a digital photo, the view orientation is required to exactly identify the captured scene extent and identify the features from existing spatial datasets that are within the extent. Assumption that camera devices with integrated GPS and digital compass will become available in the near future, our research introduces an approach to identify and localize captured objects on a digital photo using this full spatial metadata. It proposes the use of GIS technology and conventional spatial data sets to place a label next to a pictured object at its best possible location.
Keywordsphoto annotation object identification label placement.
Unable to display preview. Download preview PDF.
- Azuma R (2004) Overview of augmented reality. International Conference on Computer Graphics and Interactive Techniques ACM SIGGRAPH 2004 Course Notes, Los Angelos.Google Scholar
- Bell B, Feiner S, Höllerer T (2001) View Management for Virtual and Augmented Reality. Proceedings of the 14th annual ACM symposium on User interface software and technology 2001 pp 101-110.Google Scholar
- Cartwright W, Gartner G, Peterson MP (2007) Multimedia Cartography Second Edition. Springerlink, Berlin Heidelberg New York.Google Scholar
- Chang EY (2005) EXTENT: Fusing Context, Content, and Semantic Ontology for Photo Annotation. ACM SIGMOD CVDB Workshop, Baltimore.Google Scholar
- Dias E, de Boer A, Fruijtier S, Oddoye JP, Harding J, Matyas C, Minelli S (2007) Requirements and business case study. Project deliverable D1.2. TRIPOD: TRI-Partite multimedia Object Description. EC-IST Project 045335 (www.projecttripod.org).Google Scholar
- Li J, Plaisant C, Schneiderman B (1998) Data object and label placement for information abundant visualizations. Proceedings of the 1998 workshop on New paradigms in information visualization and manipulation, Washington D.C., pp 41-48Google Scholar
- Götzelman T, Hartman K, Strothotte T (2006) Agent-based annotation of Interactive 3D Visualizations. 6th International Symposium on Smart Graphics, Vancouver, pp 24-35.Google Scholar
- Hagedorn B, Maass S, Döllner J (2007) Chaining Geoinformation Services for the Visualisation and Annotation of 3D Geovirtual Environments. 4th International Symposium on LBS and Telecartography, Hong Kong.Google Scholar
- Kolbe TH (2007) Augmented Videos and Panoramas for Pedestrian Navigation. 2th International Symposium on LBS and Telecartography, Vienna.Google Scholar
- Naaman M, Harada S, Wang QY, Garcia-Molina H, Paepcke A (2004) Automatic Organization for Digital Photographs with Geographic Coordinates. Proceedings of the Fourth ACM/IEEE-CS Joint Conference on Digital Libraries, pp 53-62.Google Scholar
- Maass S, Döllner J (2006a) Dynamic Annotation of Interactive Environments using Object-Integrated Billboards. Proceedings 14-th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, WSCG’2006, Plzen, pp 327-334.Google Scholar
- Maass S, Döllner J (2006b) Efficient View Management for Dynamic Annotation Placement in Virtual Landscapes. 6th Int. Symposium on Smart Graphics, Vancouver, pp 1-12.Google Scholar
- Schmalstieg D, Reitmayr G (2007) The World as a User Interface: Augmented Reality for Ubiquitous Computing. Location Based Service and TeleCartography, Springer, pp 369-391.Google Scholar
- Toye E, Sharp R, Madhavapeddy A, Scott D, Upton E, Blackwell A (2006) Interacting with mobile service: an evaluation of camera-phones and visual tags. Personal and Ubiquitous Computing, pp 1 – 10.Google Scholar
- Yamamoto M, Lorena LAN (2005) A Constructive genetic approach to point-feature cartographic label placement. Metaheuristics: Progress as Real Problem Solvers, Springerlink, New York.Google Scholar