DOI: 10.1145/1014052.1014093
Article

IDR/QR: an incremental dimension reduction algorithm via QR decomposition

Published: 22 August 2004

ABSTRACT

Dimension reduction is critical for many database and data mining applications, such as efficient storage and retrieval of high-dimensional data. A well-known dimension reduction scheme in the literature is Linear Discriminant Analysis (LDA). Previously proposed LDA-based algorithms share a reliance on Singular Value Decomposition (SVD). Because it is difficult to design an incremental solution for the eigenvalue problem on the product of scatter matrices in LDA, there has been little work on incremental LDA algorithms. In this paper, we propose an LDA-based incremental dimension reduction algorithm, called IDR/QR, which applies QR decomposition rather than SVD. Unlike other LDA-based algorithms, IDR/QR does not require the whole data matrix to reside in main memory, which is desirable for large data sets. More importantly, when new data items are inserted, IDR/QR can constrain the computational cost by applying efficient QR-updating techniques. Finally, we evaluate the effectiveness of IDR/QR in terms of classification accuracy in the reduced-dimensional space. Our experiments on several real-world data sets show that the accuracy achieved by IDR/QR is very close to the best accuracy achieved by other LDA-based algorithms, while its computational cost is much lower, especially when new data items are dynamically inserted.
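
To illustrate what "QR-updating" can mean in practice, the following is a minimal sketch of one generic update: extending a thin QR factorization when a single column is appended, using Gram-Schmidt with one reorthogonalization pass. It is not the IDR/QR algorithm itself, whose details the abstract does not give. The function name `qr_append_column`, the matrix `C`, and the random test data are assumptions made for illustration only.

```python
import numpy as np

def qr_append_column(Q, R, c, tol=1e-12):
    """Update a thin QR factorization A = Q @ R after appending column c to A.

    Q is (n, k) with orthonormal columns, R is (k, k) upper triangular.
    Returns (Q_new, R_new) with shapes (n, k+1) and (k+1, k+1).
    """
    # Coefficients of c in the current orthonormal basis.
    r = Q.T @ c
    q = c - Q @ r
    # One reorthogonalization pass to limit loss of orthogonality.
    s = Q.T @ q
    q = q - Q @ s
    r = r + s
    rho = np.linalg.norm(q)
    if rho <= tol * max(np.linalg.norm(c), 1.0):
        raise ValueError("new column is numerically in the span of Q")
    k = R.shape[0]
    Q_new = np.column_stack([Q, q / rho])
    R_new = np.zeros((k + 1, k + 1))
    R_new[:k, :k] = R      # old triangular factor is reused unchanged
    R_new[:k, k] = r       # projections of the new column
    R_new[k, k] = rho      # norm of the component orthogonal to Q
    return Q_new, R_new

# Hypothetical data: a 50 x 3 matrix and one new column to append.
rng = np.random.default_rng(0)
C = rng.standard_normal((50, 3))
Q, R = np.linalg.qr(C)               # thin ("reduced") QR
c_new = rng.standard_normal(50)
Q2, R2 = qr_append_column(Q, R, c_new)
```

The update touches only matrix-vector products with `Q`, so it costs on the order of n·k operations, whereas refactoring the enlarged matrix from scratch costs on the order of n·k². That gap is the kind of saving an incremental scheme aims for when data arrive one item at a time.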


    • Published in

      KDD '04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2004
      874 pages
      ISBN:1581138881
      DOI:10.1145/1014052

      Copyright © 2004 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate: 1,133 of 8,635 submissions, 13%
