skip to main content
10.1145/1553374.1553463acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
research-article

Online dictionary learning for sparse coding

Published:14 June 2009Publication History

ABSTRACT

Sparse coding---that is, modelling data vectors as sparse linear combinations of basis elements---is widely used in machine learning, neuroscience, signal processing, and statistics. This paper focuses on learning the basis set, also called dictionary, to adapt it to specific data, an approach that has recently proven to be very effective for signal reconstruction and classification in the audio and image processing domains. This paper proposes a new online optimization algorithm for dictionary learning, based on stochastic approximations, which scales up gracefully to large datasets with millions of training samples. A proof of convergence is presented, along with experiments with natural images demonstrating that it leads to faster performance and better dictionaries than classical batch algorithms for both small and large datasets.

References

  1. Aharon, M., & Elad, M. (2008). Sparse and redundant modeling of image content using an image-signature-dictionary. SIAM Imaging Sciences, 1, 228--247. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Aharon, M., Elad, M., & Bruckstein, A. M. (2006). The K-SVD: An algorithm for designing of overcomplete dictionaries for sparse representations. IEEE Transactions Signal Processing, 54, 4311--4322 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Bertsekas, D. (1999). Nonlinear programming. Athena Scientific Belmont, Mass.Google ScholarGoogle Scholar
  4. Bickel, P., Ritov, Y., & Tsybakov, A. (2007). Simultaneous analysis of Lasso and Dantzig selector. preprint.Google ScholarGoogle Scholar
  5. Bonnans, J., & Shapiro, A. (1998). Optimization problems with perturbation: A guided tour. SIAM Review, 40, 202--227. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Borwein, J., & Lewis, A. (2006). Convex analysis and nonlinear optimization: theory and examples. Springer.Google ScholarGoogle Scholar
  7. Bottou, L. (1998). Online algorithms and stochastic approximations. In D. Saad (Ed.), Online learning and neural networks. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Bottou, L., & Bousquet, O. (2008). The tradeoffs of large scale learning. Advances in Neural Information Processing Systems, 20, 161--168.Google ScholarGoogle Scholar
  9. Chen, S., Donoho, D., & Saunders, M. (1999). Atomic decomposition by basis pursuit. SIAM Journal on Scientific Computing, 20, 33--61. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Efron, B., Hastie, T., Johnstone, I., & Tibshirani, R. (2004). Least angle regression. Annals of Statistics, 32, 407--499.Google ScholarGoogle ScholarCross RefCross Ref
  11. Elad, M., & Aharon, M. (2006). Image denoising via sparse and redundant representations over learned dictionaries. IEEE Transactions Image Processing, 54, 3736--3745. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Fisk, D. (1965). Quasi-martingale. Transactions of the American Mathematical Society, 359--388.Google ScholarGoogle ScholarCross RefCross Ref
  13. Friedman, J., Hastie, T., Hölfling, H., & Tibshirani, R. (2007). Pathwise coordinate optimization. Annals of Statistics, 1, 302--332.Google ScholarGoogle ScholarCross RefCross Ref
  14. Fu, W. (1998). Penalized Regressions: The Bridge Versus the Lasso. Journal of computational and graphical statistics, 7, 397--416.Google ScholarGoogle Scholar
  15. Fuchs, J. (2005). Recovery of exact sparse representations in the presence of bounded noise. IEEE Transactions Information Theory, 51, 3601--3608. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Lee, H., Battle, A., Raina, R., & Ng, A. Y. (2007). Efficient sparse coding algorithms. Advances in Neural Information Processing Systems, 19, 801--808.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Mairal, J., Elad, M., & Sapiro, G. (2008). Sparse representation for color image restoration. IEEE Transactions Image Processing, 17, 53--69. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Mairal, J., Bach, F., Ponce, J., Sapiro, G., & Zisserman, A. (2009). Supervised dictionary learning. Advances in Neural Information Processing Systems, 21, 1033--1040.Google ScholarGoogle Scholar
  19. Mallat, S. (1999). A wavelet tour of signal processing, second edition. Academic Press, New York.Google ScholarGoogle Scholar
  20. Olshausen, B. A., & Field, D. J. (1997). Sparse coding with an overcomplete basis set: A strategy employed by V1? Vision Research, 37, 3311--3325.Google ScholarGoogle ScholarCross RefCross Ref
  21. Osborne, M., Presnell, B., & Turlach, B. (2000). A new approach to variable selection in least squares problems. IMA Journal of Numerical Analysis, 20, 389--403.Google ScholarGoogle ScholarCross RefCross Ref
  22. Protter, M., & Elad, M. (2009). Image sequence denoising via sparse and redundant representations. IEEE Transactions Image Processing, 18, 27--36. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Raina, R., Battle, A., Lee, H., Packer, B., & Ng, A. Y. (2007). Self-taught learning: transfer learning from unlabeled data. Proceedings of the 26th International Conference on Machine Learning, 759--766. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Tibshirani, R. (1996). Regression shrinkage and selection via the Lasso. Journal of the Royal Statistical Society Series B, 67, 267--288.Google ScholarGoogle Scholar
  25. Van der Vaart, A. (1998). Asymptotic Statistics. Cambridge University Press.Google ScholarGoogle Scholar
  26. Zou, H., & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society Series B, 67, 301--320.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Online dictionary learning for sparse coding

                    Recommendations

                    Comments

                    Login options

                    Check if you have access through your login credentials or your institution to get full access on this article.

                    Sign in
                    • Published in

                      cover image ACM Other conferences
                      ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning
                      June 2009
                      1331 pages
                      ISBN:9781605585161
                      DOI:10.1145/1553374

                      Copyright © 2009 Copyright 2009 by the author(s)/owner(s).

                      Publisher

                      Association for Computing Machinery

                      New York, NY, United States

                      Publication History

                      • Published: 14 June 2009

                      Permissions

                      Request permissions about this article.

                      Request Permissions

                      Check for updates

                      Qualifiers

                      • research-article

                      Acceptance Rates

                      Overall Acceptance Rate140of548submissions,26%

                    PDF Format

                    View or Download as a PDF file.

                    PDF

                    eReader

                    View online with eReader.

                    eReader