Abstract
We live in a dynamic visual world where the appearance of scenes changes dramatically from hour to hour or season to season. In this work we study "transient scene attributes" -- high level properties which affect scene appearance, such as "snow", "autumn", "dusk", "fog". We define 40 transient attributes and use crowdsourcing to annotate thousands of images from 101 webcams. We use this "transient attribute database" to train regressors that can predict the presence of attributes in novel images. We demonstrate a photo organization method based on predicted attributes. Finally we propose a high-level image editing method which allows a user to adjust the attributes of a scene, e.g. change a scene to be "snowy" or "sunset". To support attribute manipulation we introduce a novel appearance transfer technique which is simple and fast yet competitive with the state-of-the-art. We show that we can convincingly modify many transient attributes in outdoor scenes.
Supplemental Material
Available for Download
Supplemental material.
- An, X., and Pellacini, F. 2010. User-controllable color transfer. Comput. Graph. Forum 29, 2. Google ScholarDigital Library
- Bell, S., Upchurch, P., Snavely, N., and Bala, K. 2013. Opensurfaces: A richly annotated catalog of surface appearance. ACM Trans. Graph. (proc. SIGGRAPH) 32, 4. Google ScholarDigital Library
- Berthouzoz, F., Li, W., Dontcheva, M., and Agrawala, M. 2011. A framework for content-adaptive photo manipulation macros. ACM Trans. Graph. 30, 5. Google ScholarDigital Library
- Bonneel, N., Sunkavalli, K., Paris, S., and Pfister, H. 2013. Example-based video color grading. ACM Trans. Graph. (proc. SIGGRAPH) 32. Google ScholarDigital Library
- Bychkovsky, V., Paris, S., Chan, E., and Durand, F. 2011. Learning photographic global tonal adjustment with a database of input / output image pairs. In CVPR. Google ScholarDigital Library
- Caicedo, J. C., Kapoor, A., and Kang, S. B. 2011. Collaborative personalization of image enhancement. In CVPR. Google ScholarDigital Library
- Chen, J., Paris, S., and Durand, F. 2007. Real-time edge-aware image processing with the bilateral grid. ACM Trans. Graph. (proc. SIGGRAPH) 26, 3. Google ScholarDigital Library
- Cheng, M.-M., Zheng, S., Lin, W.-Y., Vineet, V., Sturgess, P., Crook, N., Mitra, N., and Torr, P. 2014. Imagespirit: Verbal guided image parsing. ACM Trans. Graph..Google ScholarDigital Library
- Comaniciu, D., and Meer, P. 2002. Mean shift: a robust approach toward feature space analysis. IEEE Trans. PAMI 24. Google ScholarDigital Library
- Cusano, C., Gasparini, F., and Schettini, R. 2012. Color transfer using semantic image annotation. In SPIE, vol. 8299. Google ScholarDigital Library
- Dale, K., Johnson, M. K., Sunkavalli, K., Matusik, W., and Pfister, H. 2009. Image restoration using online photo collections. In ICCV.Google Scholar
- Dhar, S., Ordonez, V., and Berg, T. L. 2011. High level describable attributes for predicting aesthetics and interestingness. In CVPR. Google ScholarDigital Library
- Eitz, M., Hays, J., and Alexa, M. 2012. How do humans sketch objects? ACM Trans. Graph. (proc. SIGGRAPH) 31, 4. Google ScholarDigital Library
- Farhadi, A., Endres, I., Hoiem, D., and Forsyth, D. 2009. Describing objects by their attributes. In CVPR. Google ScholarDigital Library
- Fattal, R. 2008. Single image dehazing. ACM Trans. Graph. (proc. SIGGRAPH) 27, 3. Google ScholarDigital Library
- Ferrari, V., and Zisserman, A. 2007. Learning visual attributes. In NIPS.Google Scholar
- Garg, R., Du, H., Seitz, S. M., and Snavely, N. 2009. The dimensionality of scene appearance. In ICCV.Google Scholar
- Hertzmann, A., Jacobs, C. E., Oliver, N., Curless, B., and Salesin, D. H. 2001. Image analogies. In SIGGRAPH. Google ScholarDigital Library
- Hoiem, D., Efros, A. A., and Hebert, M. 2007. Recovering surface layout from an image. Int. J. Comput. Vision 75, 1. Google ScholarDigital Library
- Jacobs, N., Roman, N., and Pless, R. 2007. Consistent temporal variations in many outdoor scenes. In CVPR.Google Scholar
- Johnson, M. K., Dale, K., Avidan, S., Pfister, H., Freeman, W. T., and Matusik, W. 2011. Cg2real: Improving the realism of computer generated images using a large collection of photographs. IEEE Trans. Vis. Comput. Graph. 17, 9. Google ScholarDigital Library
- Kang, S. B., Kapoor, A., and Lischinski, D. 2010. Personalization of image enhancement. In CVPR.Google Scholar
- Kopf, J., Cohen, M. F., Lischinski, D., and Uyttendaele, M. 2007. Joint bilateral upsampling. ACM Trans. Graph. (proc. SIGGRAPH) 26, 3. Google ScholarDigital Library
- Kovashka, A., Parikh, D., and Grauman, K. 2012. Whittle-search: Image search with relative attribute feedback. In CVPR. Google ScholarDigital Library
- Kumar, N., Berg, A., Belhumeur, P., and Nayar, S. 2011. Describable visual attributes for face verification and image search. IEEE Trans. PAMI 33, 10. Google ScholarDigital Library
- Laffont, P.-Y., Bousseau, A., Paris, S., Durand, F., and Drettakis, G. 2012. Coherent intrinsic images from photo collections. ACM Trans. Graph. (proc. SIGGRAPH Asia) 31, 6. Google ScholarDigital Library
- Lalonde, J.-F., Hoiem, D., Efros, A. A., Rother, C., Winn, J., and Criminisi, A. 2007. Photo clip art. ACM Trans. Graph. (proc. SIGGRAPH) 26, 3. Google ScholarDigital Library
- Lalonde, J.-F., Efros, A., and Narasimhan, S. 2009. Web-cam clip art: Appearance and illuminant transfer from time-lapse sequences. ACM Trans. Graph. (proc. SIGGRAPH Asia) 28, 5. Google ScholarDigital Library
- Liu, Q., Ihler, A., and Steyvers, M. 2013. Scoring workers in crowdsourcing: how many control questions are enough? In NIPS.Google Scholar
- Matusik, W., Pfister, H., Brand, M., and McMillan, L. 2003. A data-driven reflectance model. ACM Trans. Graph. (proc. SIGGRAPH) 22, 3. Google ScholarDigital Library
- Murphy, K. P. 2012. Machine Learning: A Probabilistic Perspective. The MIT Press. Google ScholarDigital Library
- Narasimhan, S., Wang, C., and Nayar, S. 2002. All the images of an outdoor scene. In ECCV. Google ScholarDigital Library
- Parikh, D., and Grauman, K. 2011. Relative attributes. In ICCV. Google ScholarDigital Library
- Patterson, G., and Hays, J. 2012. Sun attribute database: Discovering, annotating, and recognizing scene attributes. In CVPR. Google ScholarDigital Library
- Perronnin, F., Sánchez, J., and Mensink, T. 2010. Improving the fisher kernel for large-scale image classification. In ECCV. Google ScholarDigital Library
- Pitié, F., Kokaram, A., and Dahyot, R. 2005. N-Dimensional Probability Density Function Transfer and its Application to Colour Transfer. In ICCV. Google ScholarDigital Library
- Pouli, T., and Reinhard, E. 2011. Progressive color transfer for images of arbitrary dynamic range. Computers & Graphics 35. Google ScholarDigital Library
- Reinhard, E., Ashikhmin, M., Gooch, B., and Shirley, P. 2001. Color transfer between images. IEEE Comput. Graph. Appl. 21, 5. Google ScholarDigital Library
- Scholkopf, B., Smola, A., Williamson, R., and Bartlett, P. 2000. New support vector algorithms. Neural Computation 12. Google ScholarDigital Library
- Shih, Y., Paris, S., Durand, F., and Freeman, W. T. 2013. Data-driven hallucination of different times of day from a single outdoor photo. ACM Trans. Graph. (proc. SIGGRAPH Asia) 32, 6. Google ScholarDigital Library
- Snavely, N., Seitz, S. M., and Szeliski, R. 2006. Photo tourism: exploring photo collections in 3D. ACM Trans. Graph. (proc. SIGGRAPH) 25, 3. Google ScholarDigital Library
- Sunkavalli, K., Matusik, W., Pfister, H., and Rusinkiewicz, S. 2007. Factored time-lapse video. ACM Trans. Graph. (proc. SIGGRAPH) 26, 3. Google ScholarDigital Library
- Tao, L., Yuan, L., and Sun, J. 2009. Skyfinder: Attribute-based sky image search. ACM Trans. Graph. (proc. SIGGRAPH) 28, 3. Google ScholarDigital Library
- Wu, F., Dong, W., Kong, Y., Mei, X., Paul, J.-C., and Zhang, X. 2013. Content-Based Colour Transfer. Comput. Graph. Forum 32, 1.Google ScholarDigital Library
- Xiao, J., Hays, J., Ehinger, K. A., Oliva, A., and Torralba, A. 2010. Sun database: Large-scale scene recognition from abbey to zoo. In CVPR.Google Scholar
- Yu, Y., and Malik, J. 1998. Recovering photometric properties of architectural scenes from photographs. In SIGGRAPH. Google ScholarDigital Library
Index Terms
- Transient attributes for high-level understanding and editing of outdoor scenes
Recommendations
Exploring outdoor appearance changes with transient scene attributes
SA '13: SIGGRAPH Asia 2013 PostersThe appearance of outdoor scenes changes dramatically with lighting and weather conditions, time of day, and season. We relate visual changes to scene attributes, which are human-nameable concepts used for high-level description of scenes. They carry ...
High-level attributes modeling for indoor scenes classification
Scene classification is a challenging problem in computer vision. Though conventional methods show good performance in recognizing outdoor scenes, these methods does not work well in indoor scenes recognition. In recent years, high level image ...
3D reconstruction of indoor and outdoor building scenes from a single image
SMVC '10: Proceedings of the 2010 ACM workshop on Surreal media and virtual cloningIn this paper, a novel method is proposed able to automatically generate accurate 3D models of both outdoor buildings and indoor scenes with perspective cues from line segments that are automatically extracted from a single image with an uncalibrated ...
Comments