Statistical analysis of big data: an approach based on support vector machines for classification and regression problems

Kadyrova, N. O.; Pavlova, L. V.

doi:10.1134/S0006350914030105

Statistical analysis of big data: an approach based on support vector machines for classification and regression problems

Molecular Biophysics
Published: 15 August 2014

Volume 59, pages 364–373, (2014)
Cite this article

Biophysics Aims and scope Submit manuscript

N. O. Kadyrova¹ &
L. V. Pavlova¹

151 Accesses
7 Citations
Explore all metrics

Abstract

A new type of learning algorithms with the supervisor for estimating multidimensional functions is considered. These methods based on Support Vector Machines are widely used due to their ability to deal with high-dimensional and large datasets, and their flexibility in modeling diverse sources of data. Support vector machines and related kernel methods are extremely good at solving prediction problems in computational biology. A background about statistical learning theory and kernel feature spaces is given including practical and algorithmic considerations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

Feature dimensionality reduction: a review

Article Open access 21 January 2022

Supervised Classification Algorithms in Machine Learning: A Survey and Review

References

V. Dyuk and A. Samoilenko, Data Mining: Educational Course (Piter, SPb., 2001) [in Russian].
Google Scholar
V. N. Vapnik, The Nature of Statistical Learning Theory (Springer-Verlag, 2000).
Book MATH Google Scholar
V. N. Vapnik, Statistical Learning Theory (John Wiley, 1998).
MATH Google Scholar
Y. Jiang, J. Jiang, and P. Capodieci, in Proceedings of the 2nd International Workshop on Computational Intelligence in Security for Information Systems (CISIS’09) (Springer AISC, 2009), vol. 63, p. 61.
Google Scholar
A. Patcha and J.-M. Park, Computer Networks 51, 3448 (2007).
Article ADS Google Scholar
T. Shon and J. Moon, Information Sciences 177, 3799 (2007).
Article Google Scholar
T. Trafalis, I. Huseyin, and M. Richman, International Conference on Computational Science (2003).
Google Scholar
T. Trafalis and I. Huseyin, IJCNN 6, 348 (2000).
Google Scholar
I. Huseyin and T. Trafalis, J. General Systems 37(6), 677 (2008).
Article MATH Google Scholar
E. P. Kondratovich, N. I. Zhokhova, I. I. Baskin, et al., Izv. RAN Ser. Khim., no. 4, 641 (2009).
Google Scholar
I. Guyon, J. Weston, S. Barnhill, and V. Vapnik, J. Machine Learning 46(1–3), 389 (2002).
Article MATH Google Scholar
S. Mukherjee, P. Tamayo, D. Slonim, et al., AI memo 182. CBCL paper 182. MIT, 2000.
Google Scholar
T. Furey, N. Cristianini, N. Duffy, et al., Bioinformatics 16 (10), 906 (2000).
Google Scholar
M. Brown, W. Grundy, D. Lin, et al., Proc. Natl. Acad. Sci. USA 97(10), 262 (2000).
Article ADS Google Scholar
P. Bradley and O. Mangasarian, in Proc. 13th International Conference on Machine Learning (1998), p. 82.
Google Scholar
G. Lanckriet, T. D. Bie, N. Cristianini, et al., Bioinformatics 20, 2626 (2004).
Article Google Scholar
K. R. Muller, S. Mika, G. Rätsch, et al., IEEE Transactions on Neural Networks 12(2), 181 (2001).
Article Google Scholar
V. Kecman, Learning and Soft Computing: Support Vector Machines, Neural Networks, and Fuzzy Logic Models (MIT Press, 2001).
Google Scholar
N. Aronszajn, Trans. Amer. Math. Soc. 68, 337 (1950).
Article MathSciNet MATH Google Scholar
C. Leslie, E. Eskin, and W. Noble, The Spectrum Kernel: A string kernel for SVM protein classification (2002).
Google Scholar
A. Ben-Hur, C. Soon Ong, S. Sonnenburg, et al., PLoS Computational Biology 4(10), 1 (2008).
Article Google Scholar
B. Schölkopf, A. Smola, R. Williamson, and P. Bartlett, Neural Computation 12, 1207 (2000).
Article Google Scholar
M. Law and J. Kwok, Machine Learning: ECML 2001, Proceedings, Lecture Notes in Artificial Intelligence 2167, 312 (2001).
Google Scholar
C.-C. Chang and C.-J. Lin, Neural Computation 13(9), 2119 (2001).
Article MATH Google Scholar
P.-H. Chen, C.-J. Chih-jen Lin, and B. Schölkopf, OAI-PMH server at cs1.ist.psu.edu (2003).
Google Scholar
A. Chalimourda, B. Schölkopf, and A. Smola, Neural Networks 17(1), 127 (2004).
Article MATH Google Scholar
T. Joachims, in Advanced Kernel Methods — Support Vector Learning (MIT Press, 1998), p. 41.
Google Scholar
R. Collobert and S. Bengio, J. MachineLearning Res. MIT Press 1, 143 (2001).
MathSciNet Google Scholar
J. Platt, Advances in Kernel Methods. Support Vector Learning (MIT Press, 1998), p. 41.
Google Scholar
J. Platt, Advances in Neural Information Processing Systems 11 (MIT Press, 1999), p. 557.
Google Scholar
S. Shevade, S. Keerthi, C. Bhattacharyya, and K. Murthy, IEEE Transactions on Neural Networks 11(5), (2000).
Google Scholar
S. Keerthi and E. Gilbert, Machine Learning 46(1–3), 351 (2002).
Article MATH Google Scholar
G. Flake and S. Lawrence, Machine Learning 46(1–3), 271 (2002).
Article MATH Google Scholar
P.-H. Chen, R.-E. Fan, and C.-J. Lin, Lecture Notes in Artifical Intelligence 3734, 45 (2005).
MathSciNet Google Scholar
H. Zhang, X. Wang, C. Zhang, and X. Xu, ICNC 1, 221 (2005).
MATH Google Scholar
O. Mangasarian and D. Musicant, IEEE Transactions on Neural Networks 10(5), 1032 (1999).
Article Google Scholar
O. Mangasarian and D. Musicant, OAI-PMH server at cs1.ist.psu.edu (1999).
Google Scholar
Y. Quan, J. Yang, L.-X. Yao, and C.-Z. Ye. J. Software 15 (2), 200 (2004).
Google Scholar
G. Cauwenberghs and T. Tomaso Poggio, Advances in Neural Information Processing Systems. MIT Press 13, 409 (2001).
Google Scholar
P. Laskov, C. Gehl, S. Krüger, and K.-R. Müller, OAI-PMH server at eprints.pascal-network.org (2005).
Google Scholar
M. Martin, ECML, p. 282 (2002).
Google Scholar
G. Cawley and N. Talbot, ICANN, p. 681 (2002).
Google Scholar
G. Cawley and N. Talbot, Neurocomputing 48, 1025 (2002).
Article MATH Google Scholar
Y. Engel, S. Mannor, and R. Meir, ECML, p. 84 (2002).
Google Scholar
J. Jyrki Kivinen, S. Smola, and R. Williamson, IEEE Transactions on Signal Processing 52(8), 2165 (2004).
Article MathSciNet ADS Google Scholar
S. Vishwanathan, N. Schraudolph, and A. Smola, J. Machine Learning Res. 6, 1 (2005).
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Applied Mathematics and Mechanics, St. Petersburg State Polytechnical University, St. Petersburg, 195251, Russia
N. O. Kadyrova & L. V. Pavlova

Authors

N. O. Kadyrova
View author publications
You can also search for this author in PubMed Google Scholar
L. V. Pavlova
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to N. O. Kadyrova.

Additional information

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kadyrova, N.O., Pavlova, L.V. Statistical analysis of big data: an approach based on support vector machines for classification and regression problems. BIOPHYSICS 59, 364–373 (2014). https://doi.org/10.1134/S0006350914030105

Download citation

Received: 03 April 2014
Published: 15 August 2014
Issue Date: May 2014
DOI: https://doi.org/10.1134/S0006350914030105

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Statistical analysis of big data: an approach based on support vector machines for classification and regression problems

Abstract

Access this article

Similar content being viewed by others

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

Feature dimensionality reduction: a review

Supervised Classification Algorithms in Machine Learning: A Survey and Review

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Statistical analysis of big data: an approach based on support vector machines for classification and regression problems

Abstract

Access this article

Similar content being viewed by others

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

Feature dimensionality reduction: a review

Supervised Classification Algorithms in Machine Learning: A Survey and Review

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation