Abstract
We propose a new model for object detection that is based on set representations of the contextual elements. In this formulation, relative spatial locations and relative scores between pairs of detections are considered as sets of unordered items. Directly training classification models on sets of unordered items, where each set can have varying cardinality, can be difficult. To overcome this problem, we propose SetBoost, a discriminative learning algorithm for building set classifiers. The SetBoost classifiers are trained to rescore detected objects based on object-object and object-scene context. Our method is able to discover composite relationships, as well as intra-class and inter-class spatial relationships between objects. The experimental evidence shows that our set-based formulation performs comparably to or better than existing contextual methods on the SUN and the VOC 2007 benchmark datasets.
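As a rough illustration of the set representation described in the abstract, the following Python sketch builds, for one detection, an unordered collection of pairwise context items relative to every other detection in the image. The `Detection` type, the `context_set` function, and the specific features (normalized centre offsets, log area ratio, score difference) are assumptions made for this example; the abstract does not specify the features used, and this is not the SetBoost algorithm itself.

```python
import math
from typing import List, NamedTuple, Tuple


class Detection(NamedTuple):
    """Hypothetical detection record (not taken from the paper)."""
    label: str
    x: float      # box centre, x coordinate
    y: float      # box centre, y coordinate
    w: float      # box width
    h: float      # box height
    score: float  # detector confidence


# One context item: (other label, dx, dy, log area ratio, score difference).
ContextItem = Tuple[str, float, float, float, float]


def context_set(detections: List[Detection], i: int) -> List[ContextItem]:
    """Describe detection i by one item per other detection in the image.

    The returned list is meant to be treated as an unordered set; its
    size depends on how many other detections the image contains.
    """
    target = detections[i]
    items = []
    for j, other in enumerate(detections):
        if j == i:
            continue
        # Offsets are normalized by the target box size (an assumed choice).
        dx = (other.x - target.x) / target.w
        dy = (other.y - target.y) / target.h
        log_area = math.log((other.w * other.h) / (target.w * target.h))
        items.append((other.label, dx, dy, log_area, other.score - target.score))
    return items


detections = [
    Detection("car", 100.0, 100.0, 50.0, 20.0, 0.9),
    Detection("person", 150.0, 100.0, 50.0, 20.0, 0.4),
    Detection("car", 100.0, 140.0, 50.0, 20.0, 0.7),
]
car_context = context_set(detections, 0)
```

An image with more detections yields a larger set for each target, which is the varying-cardinality property that makes standard fixed-length classifiers awkward to apply directly.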
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cinbis, R.G., Sclaroff, S. (2012). Contextual Object Detection Using Set-Based Classification. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7577. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33783-3_4
DOI: https://doi.org/10.1007/978-3-642-33783-3_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33782-6
Online ISBN: 978-3-642-33783-3
eBook Packages: Computer Science (R0)