Abstract
This chapter describes Lagrange multipliers and some selected subtopics from matrix analysis from a machine learning perspective. The goal is to give a detailed description of a number of mathematical constructions that are widely used in applied machine learning.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Rouse Ball, W.W.: A Short Account of the History of Mathematics, 4th edn. Dover, Mineola (1908)
Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. In: Advances in Neural Information Processing Systems, vol. 14. MIT Press, Cambridge (2002)
Bell, E.T.: Men of Mathematics. Simon and Schuster. Touchstone edition (1986) (first published 1937)
Boyd, S., Vandenberghe, L.: Convex Optimization. Cambridge University Press, Cambridge (2004)
Buck, B., Macaualay, V. (eds.): Maximum Entropy in Action. Clarendon Press, Oxford (1991)
Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2(2), 121–167 (1998)
Burges, C.J.C.: Geometric Methods for Feature Extraction and Dimensional Reduction. In: Rokach, L., Maimon, O. (eds.) Data Mining and Knowledge Discovery Handbook: A Complete Guide for Practitioners and Researchers. Kluwer Academic, Dordrecht (to appear, 2004)
Cox, T.F., Cox, M.A.A.: Multidimensional Scaling. Chapman and Hall, Sydney (2001)
Cressie, N.A.C.: Statistics for spatial data. Wiley, Chichester (1993) (revised edition)
Deerwester, S., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by Latent Semantic Analysis. Journal of the Society for Information Science 41(6), 391–407 (1990)
Golub, G.H., Van Loan, C.F.: Matrix Computations, 3rd edn. Johns Hopkins, Baltimore (1996)
Horn, R.A., Johnson, C.R.: Matrix Analysis. Cambridge University Press, Cambridge (1985)
Jaynes, E.T.: Bayesian methods: General background. In: Justice, J.H. (ed.) Maximum Entropy and Bayesian Methods in Applied Statistics, pp. 1–25. Cambridge University Press, Cambridge (1985)
Kline, M.: Mathematical Thought from Ancient to Modern Times, vol. 1,2,3. Oxford University Press, Oxford (1972)
Mangasarian, O.L.: Nonlinear Programming. McGraw Hill, New York (1969)
Nigam, K., Lafferty, J., McCallum, A.: Using maximum entropy for text classification. In: IJCAI 1999 Workshop on Machine Learning for Information Filtering, pp. 61–67 (1999)
Roweis, S.T., Saul, L.K.: Nonlinear dimensionality reduction by locally linear embedding. Science 290(22), 2323–2326 (2000)
Schoenberg, I.J.: Remarks to maurice frechet’s article sur la d’efinition axiomatique d’une classe d’espace distanci’es vectoriellement applicable sur l’espace de Hilbert. Annals of Mathematics 36, 724–732 (1935)
Tipping, M.E., Bishop, C.M.: Probabilistic principal component analysis. Journal of the Royal Statistical Society 61(3), 611 (1999)
Williams, C.K.I.: Prediction with gaussian processes: from linear regression to linear prediction and beyond. In: Jordan, M.I. (ed.) Learning in Graphical Models, pp. 599–621. MIT Press, Cambridge (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Burges, C.J.C. (2004). Some Notes on Applied Mathematics for Machine Learning. In: Bousquet, O., von Luxburg, U., Rätsch, G. (eds) Advanced Lectures on Machine Learning. ML 2003. Lecture Notes in Computer Science(), vol 3176. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28650-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-28650-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23122-6
Online ISBN: 978-3-540-28650-9
eBook Packages: Springer Book Archive