Abstract
In this article, we exploit the problem of annotating a large-scale image corpus by label propagation over noisily tagged web images. To annotate the images more accurately, we propose a novel kNN-sparse graph-based semi-supervised learning approach for harnessing the labeled and unlabeled data simultaneously. The sparse graph constructed by datum-wise one-vs-kNN sparse reconstructions of all samples can remove most of the semantically unrelated links among the data, and thus it is more robust and discriminative than the conventional graphs. Meanwhile, we apply the approximate k nearest neighbors to accelerate the sparse graph construction without loosing its effectiveness. More importantly, we propose an effective training label refinement strategy within this graph-based learning framework to handle the noise in the training labels, by bringing in a dual regularization for both the quantity and sparsity of the noise. We conduct extensive experiments on a real-world image database consisting of 55,615 Flickr images and noisily tagged training labels. The results demonstrate both the effectiveness and efficiency of the proposed approach and its capability to deal with the noise in the training labels.
- Belkin, M. and Niyogi, P. 2003. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput.. Google ScholarDigital Library
- Chang, C.-C. and Lin, C.-J. 2001. LIBSVM: A library for support vector machines. http://www.csie.ntu.edu.tw/&ctilde;jlin/libsvm.Google Scholar
- Chapelle, O., Zien, A., and Scholkopf, B. 2006. Semi-Supervised Learning. MIT Press. Google ScholarDigital Library
- Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., and Zheng, Y.-T. 2009. NUS-WIDE: A real-world web image database from national university of singapore. In Proceedings of the ACM Conference on Image and Video Retrieval. Google ScholarDigital Library
- Donoho, D. L. 2006. For most large underdetermined systems of linear equations the minimal ℓ1-norm solution is also the sparsest solution. Comm. Pure Appl. Math. 59, 6, 797--829.Google ScholarCross Ref
- Duda, R., Stork, D., and Hart, P. 2000. Pattern Classification. J. Wiley. Google ScholarDigital Library
- Fergus, R., Fei-Fei, L., Perona, P., and Zisserman, A. 2005. Learning object categories from google's image search. In Proceedings of the IEEE International Conference on Computer Vision. Google ScholarDigital Library
- Goh, K.-S., Chang, E. Y., and Lai, W.-C. 2004. Multimodal concept-dependent active learning for image retrieval. In Proceedings of the 12th Annual ACM International Conference on Multimedia. 564--571. Google ScholarDigital Library
- He, J., Li, M., Zhang, H.-J., Tong, H., and Zhang, C. 2004. Manifold-ranking based image retrieval. In Proceedings of the 12th Annual ACM International Conference on Multimedia. Google ScholarDigital Library
- ℓ1 MAGIC. http://www.acm.caltech.edu/l1magic/.Google Scholar
- Li, X., Chen, L., Zhang, L., Lin, F., and Ma, W.-Y. 2006. Image annotation by large-scale content-based image retrieval. In Proceedings of the 14th Annual ACM International Conference on Multimedia. Google ScholarDigital Library
- Mount, D. and Arya, S. 1997. Ann: A library for approximate nearest neighbor searching. In Proceedings of the CGC 2nd Annual Fall Workship on Computational Geometry.Google Scholar
- Rao, R., Olshausen, B., and Lewicki, M. 2002. Probabilistic Models of the Brain: Perception and Neural Function. MIT Press.Google ScholarCross Ref
- Roweis, S. T. and Saul, L. K. 2000. Nonlinear dimensionality reduction by locally linear embedding. Science 290, 2323--2326.Google ScholarCross Ref
- Saad, Y. 2003. Iterative Methods for Sparse Linear Systems 2nd Ed. Society for Industrial and Applied Mathematics. Google ScholarDigital Library
- Saad, Y. and Schultz, M. 1986. GMRES: A generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM J. Sci. Stat. Comp. 7, 856--869. Google ScholarDigital Library
- Sun, Y., Shimada, S., Taniguchi, Y., and Kojima, A. 2008. A novel region-based approach to visual concept modeling using web images. In Proceedings of the 16th ACM International Conference on Multimedia. Google ScholarDigital Library
- Tang, J., Hua, X.-S., Qi, G.-J., Wang, M., Mei, T., and Wu, X. 2007. Structure-sensitive manifold ranking for video concept detection. In Proceedings of the 15th ACM International Conference on Multimedia. Google ScholarDigital Library
- Tang, J., Hua, X.-S., Song, Y., Qi, G.-J., and Wu, X. 2008. Video annotation based on kernel linear neighborhood propagation. IEEE Trans. Multimedia 10, 4. Google ScholarDigital Library
- Tang, J., Yan, S., Hong, R., Qi, G.-J., and Chua, T.-S. 2009. Inferring semantic concepts from community-contributed images and noisy tags. In Proceedings of the 17th ACM International Conference on Multimedia. 223--232. Google ScholarDigital Library
- Torralba, A., Fergus, R., and Freeman, W. 2008. 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Trans. Patt. Anal. Mach. Intell. 30, 11. Google ScholarDigital Library
- TREC. Trec-10 proceedings appendix on common evaluation measures. http://trec.nist.gov/pubs/trec10/ appendices/measures.pdf.Google Scholar
- Wang, C., Jing, F., Zhang, L., and Zhang, H.-J. 2006a. Image annotation refinement using random walk with restarts. In Proceedings of the 14th ACM International Conference on Multimedia. Google ScholarDigital Library
- Wang, F. and Zhang, C. 2008. Label propagation through linear neighborhoods. IEEE Trans. Knowl. Data Engin. 20, 1, 55--67. Google ScholarDigital Library
- Wang, X.-J., Zhang, L., Jing, F., and Ma, W.-Y. 2006b. Annosearch: Image auto-annotation by search. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarDigital Library
- Wang, X.-J., Zhang, L., Li, X., and Ma, W.-Y. 2008. Annotating images by mining image search results. IEEE Trans. Patt. Anal. Mach. Intell. 30, 11, 1919--1932. Google ScholarDigital Library
- Wright, J., Yang, A., Ganesh, A., Sastry, S., and Ma, Y. 2009. Robust face recognition via sparse representation. IEEE Trans. Patt. Anal. Mach. Intell. 31, 2 (Feb.), 210--227. Google ScholarDigital Library
- Zhou, D., Bousquet, O., Lal, T. N., Weston, J., and Scholkopf, B. 2003. Learning with local and global consistency. In Proceedings of the 17th Annual Conference on Neural Information Processing Systems.Google Scholar
- Zhu, X. 2005. Semi-Supervised Learning with Graphs. Ph.D. dissertation, Carnegie Mellon University. Google ScholarDigital Library
- Zhu, X., Ghahramani, Z., and Lafferty, J. 2003. Semi-supervised learning using gaussian fields and harmonic function. In Proceedings of the 20th International Conference on Machine Learning.Google Scholar
Index Terms
Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images
Recommendations
Inferring semantic concepts from community-contributed images and noisy tags
MM '09: Proceedings of the 17th ACM international conference on MultimediaIn this paper, we exploit the problem of inferring images' semantic concepts from community-contributed images and their associated noisy tags. To infer the concepts more accurately, we propose a novel sparse graph-based semi-supervised learning ...
Semi-supervised partial label learning algorithm via reliable label propagation
AbstractPartial label learning (PLL) is a weakly supervised learning method that is able to predict one label as the correct answer from a given candidate label set. In PLL, when all possible candidate labels are as signed to real-world training examples, ...
Transductive Multilabel Learning via Label Set Propagation
The problem of multilabel classification has attracted great interest in the last decade, where each instance can be assigned with a set of multiple class labels simultaneously. It has a wide variety of real-world applications, e.g., automatic image ...
Comments