skip to main content
10.5555/1687878.1687880dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
research-article
Free Access

Heterogeneous transfer learning for image clustering via the social web

Authors Info & Claims
Published:02 August 2009Publication History

ABSTRACT

In this paper, we present a new learning scenario, heterogeneous transfer learning, which improves learning performance when the data can be in different feature spaces and where no correspondence between data instances in these spaces is provided. In the past, we have classified Chinese text documents using English training data under the heterogeneous transfer learning framework. In this paper, we present image clustering as an example to illustrate how unsupervised learning can be improved by transferring knowledge from auxiliary heterogeneous data obtained from the social Web. Image clustering is useful for image sense disambiguation in query-based image search, but its quality is often low due to imagedata sparsity problem. We extend PLSA to help transfer the knowledge from social Web data, which have mixed feature representations. Experiments on image-object clustering and scene clustering tasks show that our approach in heterogeneous transfer learning based on the auxiliary data is indeed effective and promising.

References

  1. Alina Andreevskaia and Sabine Bergler. 2008. When specialists and generalists work together: Overcoming domain dependence in sentiment tagging. In ACL-08: HLT, pages 290--298, Columbus, Ohio, June.Google ScholarGoogle Scholar
  2. Andrew Arnold, Ramesh Nallapati, and William W. Cohen. 2007. A comparative study of methods for transductive transfer learning. In ICDM 2007 Workshop on Mining and Management of Biological Data, pages 77--82. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Andrew Arnold, Ramesh Nallapati, and William W. Cohen. 2008. Exploiting feature hierarchy for transfer learning in named entity recognition. In ACL-08: HLT.Google ScholarGoogle Scholar
  4. Sugato Basu, Mikhail Bilenko, and Raymond J. Mooney. 2004. A probabilistic framework for semi-supervised clustering. In ACM SIGKDD 2004, pages 59--68. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. John Blitzer, Ryan Mcdonald, and Fernando Pereira. 2006. Domain adaptation with structural correspondence learning. In EMNLP 2006, pages 120--128, Sydney, Australia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. John Blitzer, Mark Dredze, and Fernando Pereira. 2007. Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In ACL 2007, pages 440--447, Prague, Czech Republic.Google ScholarGoogle Scholar
  7. Avrim Blum and Tom Mitchell. 1998. Combining labeled and unlabeled data with co-training. In COLT 1998, pages 92--100, New York, NY, USA. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Rich Caruana. 1997. Multitask learning. Machine Learning, 28(1):41--75. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Yee Seng Chan and Hwee Tou Ng. 2007. Domain adaptation with active learning for word sense disambiguation. In ACL 2007, Prague, Czech Republic.Google ScholarGoogle Scholar
  10. David A. Cohn and Thomas Hofmann. 2000. The missing link - a probabilistic model of document content and hypertext connectivity. In NIPS 2000, pages 430--436.Google ScholarGoogle Scholar
  11. Wenyuan Dai, Yuqiang Chen, Gui-Rong Xue, Qiang Yang, and Yong Yu. 2008a. Translated learning: Transfer learning across different feature spaces. In NIPS 2008, pages 353--360.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Wenyuan Dai, Qiang Yang, Gui-Rong Xue, and Yong Yu. 2008b. Self-taught clustering. In ICML 2008, pages 200--207. Omnipress. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Hal Daume, III. 2007. Frustratingly easy domain adaptation. In ACL 2007, pages 256--263, Prague, Czech Republic.Google ScholarGoogle Scholar
  14. Jesse Davis and Pedro Domingos. 2008. Deep transfer via second-order markov logic. In AAAI 2008 Workshop on Transfer Learning, Chicago, USA.Google ScholarGoogle Scholar
  15. Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. L, and Richard Harshman. 1990. Indexing by latent semantic analysis. Journal of the American Society for Information Science, pages 391--407.Google ScholarGoogle ScholarCross RefCross Ref
  16. A. P. Dempster, N. M. Laird, and D. B. Rubin. 1977. Maximum likelihood from incomplete data via the em algorithm. J. of the Royal Statistical Society, 39:1--38.Google ScholarGoogle Scholar
  17. Thomas Finley and Thorsten Joachims. 2005. Supervised clustering with support vector machines. In ICML 2005, pages 217--224, New York, NY, USA. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. G. Griffin, A. Holub, and P. Perona. 2007. Caltech-256 object category dataset. Technical Report 7694, California Institute of Technology.Google ScholarGoogle Scholar
  19. Thomas Hofmann. 1999 Probabilistic latent semantic analysis. In Proc. of Uncertainty in Artificial Intelligence, UAI99. Pages 289--296 Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Thomas Hofmann. 2001. Unsupervised learning by probabilistic latent semantic analysis. Machine Learning. volume 42, number 1--2, pages 177--196. Kluwer Academic Publishers. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Jing Jiang and Chengxiang Zhai. 2007. Instance weighting for domain adaptation in NLP. In ACL 2007, pages 264--271, Prague, Czech Republic, June.Google ScholarGoogle Scholar
  22. Leonard Kaufman and Peter J. Rousseeuw. 1990. Finding groups in data: an introduction to cluster analysis. John Wiley and Sons, New York.Google ScholarGoogle Scholar
  23. Svetlana Lazebnik, Cordelia Schmid, and Jean Ponce. 2006. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR 2006, pages 2169--2178, Washington, DC, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Fei-Fei Li and Pietro Perona. 2005. A bayesian hierarchical model for learning natural scene categories. In CVPR 2005, pages 524--531, Washington, DC, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Xiao Ling, Gui-Rong Xue, Wenyuan Dai, Yun Jiang, Qiang Yang, and Yong Yu. 2008. Can chinese web pages be classified with english data source? In WWW 2008, pages 969--978, New York, NY, USA. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Nicolas Loeff, Cecilia Ovesdotter Alm, and David A. Forsyth. 2006. Discriminating image senses by clustering with multimodal features. In COLING/ACL 2006 Main conference poster sessions, pages 547--554. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. David G. Lowe. 2004. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision (IJCV) 2004, volume 60, number 2, pages 91--110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. B. MacQueen. 1967. Some methods for classification and analysis of multivariate observations. In Proceedings of Fifth Berkeley Symposium on Mathematical Statistics and Probability, pages 1:281--297, Berkeley, CA, USA.Google ScholarGoogle Scholar
  29. Kamal Nigam and Rayid Ghani. 2000. Analyzing the effectiveness and applicability of co-training. In Proceedings of the Ninth International Conference on Information and Knowledge Management, pages 86--93, New York, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Rajat Raina, Alexis Battle, Honglak Lee, Benjamin Packer, and Andrew Y. Ng. 2007. Self-taught learning: transfer learning from unlabeled data. In ICML 2007, pages 759--766, New York, NY, USA. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Roi Reichart and Ari Rappoport. 2007. Self-training for enhancement and domain adaptation of statistical parsers trained on small datasets. In ACL 2007.Google ScholarGoogle Scholar
  32. Roi Reichart, Katrin Tomanek, Udo Hahn, and Ari Rappoport. 2008. Multi-task active learning for linguistic annotations. In ACL-08: HLT, pages 861--869.Google ScholarGoogle Scholar
  33. C. E. Shannon. 1948. A mathematical theory of communication. Bell system technical journal, 27.Google ScholarGoogle Scholar
  34. J. Sivic, B. C. Russell, A. A. Efros, A. Zisserman, and W. T. Freeman. 2005. Discovering object categories in image collections. In ICCV 2005.Google ScholarGoogle Scholar
  35. Naftali Tishby, Fernando C. Pereira, and William Bialek. The information bottleneck method. 1999. In Proc. of the 37th Annual Allerton Conference on Communication, Control and Computing, pages 368--377.Google ScholarGoogle Scholar
  36. Pengcheng Wu and Thomas G. Dietterich. 2004. Improving svm accuracy by training on auxiliary data sources. In ICML 2004, pages 110--117, New York, NY, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Yejun Wu and Douglas W. Oard. 2008. Bilingual topic aspect classification with a few training examples. In ACM SIGIR 2008, pages 203--210, New York, NY, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Xiaojin Zhu. 2007. Semi-supervised learning literature survey. Technical Report 1530, Computer Sciences, University of Wisconsin-Madison.Google ScholarGoogle Scholar

Index Terms

  1. Heterogeneous transfer learning for image clustering via the social web

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image DL Hosted proceedings
          ACL '09: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
          August 2009
          572 pages
          ISBN:9781932432459
          • General Chair:
          • Keh-Yih Su

          Publisher

          Association for Computational Linguistics

          United States

          Publication History

          • Published: 2 August 2009

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate85of443submissions,19%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader