DOI: 10.1145/1273496.1273556
Article

An empirical evaluation of deep architectures on problems with many factors of variation

Published: 20 June 2007

ABSTRACT

Recently, several learning algorithms relying on models with deep architectures have been proposed. Though they have demonstrated impressive performance, to date, they have only been evaluated on relatively simple problems such as digit recognition in a controlled environment, for which many machine learning algorithms already report reasonable results. Here, we present a series of experiments which indicate that these models show promise in solving harder learning problems that exhibit many factors of variation. These models are compared with well-established algorithms such as Support Vector Machines and single hidden-layer feed-forward neural networks.
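The comparison described above can be illustrated with a minimal sketch. This is not the paper's experimental setup (its benchmarks, architectures, and hyperparameters are not given here); it only shows the shape of such an evaluation, using scikit-learn's small digits dataset with an artificially added factor of variation (random rotation), a kernel SVM, and a single hidden-layer feed-forward network:

```python
# Hedged sketch, not the paper's setup: compare a kernel SVM against a
# single hidden-layer network on a digit task with an added factor of
# variation (random in-plane rotation of each image).
import numpy as np
from scipy.ndimage import rotate
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X, y = load_digits(return_X_y=True)

# Introduce a factor of variation: rotate each 8x8 digit by a random angle.
imgs = X.reshape(-1, 8, 8)
angles = rng.uniform(-45, 45, size=len(imgs))
rotated = np.stack([rotate(im, a, reshape=False, order=1)
                    for im, a in zip(imgs, angles)])
Xr = rotated.reshape(len(imgs), -1)

Xtr, Xte, ytr, yte = train_test_split(Xr, y, test_size=0.3, random_state=0)

# Two of the baseline families named in the abstract.
svm = SVC(kernel="rbf", gamma="scale").fit(Xtr, ytr)
mlp = MLPClassifier(hidden_layer_sizes=(100,), max_iter=300,
                    random_state=0).fit(Xtr, ytr)

print(f"SVM accuracy: {svm.score(Xte, yte):.3f}")
print(f"1-hidden-layer net accuracy: {mlp.score(Xte, yte):.3f}")
```

A deep architecture in the paper's sense would add greedy layer-wise unsupervised pretraining before supervised fine-tuning; the sketch above covers only the shallow baselines it is compared against.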


Published in

ICML '07: Proceedings of the 24th International Conference on Machine Learning
June 2007, 1233 pages
ISBN: 9781595937933
DOI: 10.1145/1273496

      Copyright © 2007 ACM


Publisher: Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Overall Acceptance Rate: 140 of 548 submissions, 26%
