Skip to main content
Log in

Support vector machine applied in QSAR modelling

  • Articles
  • Published:
Chinese Science Bulletin

Abstract

Support vector machine (SVM), partial least squares (PLS), and Back-Propagation artificial neural network (ANN) were employed to establish QSAR models of 2 dipeptide datasets. In order to validate predictive capabilities on external dataset of the resulting models, both internal and external validations were performed. The division of dataset into both training and test sets was carried out by D-optimal design. The results showed that support vector machine (SVM) behaved well in both calibration and prediction. For the dataset of 48 bitter tasting dipeptides (BTD), the results obtained by support vector regression (SVR) were superior to that by PLS in both calibration and prediction. When compared with BP artificial neural network, SVR showed less calibration power but more predictive capability. For the dataset of angiotensin-converting enzyme (ACE) inhibitors, the results obtained by support vector machine (SVM) regression were equivalent to those by PLS and BP artificial neural network. In both datasets, SVR using linear kernel function behaved well as that using radial basis kernel function. The results showed that there is wide prospect for the application of support vector machine (SVM) into QSAR modeling.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Cortes, C., Vapnik, V., Support vector networks, Machine Learning, 1995, 20(3): 273–293.

    Google Scholar 

  2. Flach, P. A., On the state of the art in machine learning: A personal review, Artificial Intelligence, 2001, 131(1): 199–222.

    Article  Google Scholar 

  3. Zhang, X. G., Introduction to statistical learning theory and support vector machine, Acta Automatica Sinica, 2000, 26(1): 32–42.

    Google Scholar 

  4. Sanchez, A. V. D., Advanced support vector machines and kernel methods, Neurocomputing, 2003, 55(1): 5–20.

    Article  Google Scholar 

  5. Chaucharda, F., Cogdillb, R., Rousselc, S. et al., Application of LS-SVM to non-linear phenomena in NIR spectroscopy: development of a robust and portable sensor for acidity prediction in grapes, Chemom. Intell. Lab. Syst., 2004, 71(2): 141–150.

    Article  Google Scholar 

  6. Hua, S., Sun, Z., A novel method of protein secondary structure prediction with high segment overlap measure:support vector machine approach, J. Mol. Biol., 2001, 308(2): 397–407.

    Article  Google Scholar 

  7. Burbidge, R., Trotter, M., Buxton, B. et al., Drug design by machine learning: support vector machines for pharmaceutical data analysis, Computers and Chemistry, 2001, 26(1): 5–14.

    Article  Google Scholar 

  8. Hong, S. J., Weiss, S. M., Advances in predictive models for data mining, Pattern Recognition Letters, 2001, 22(1): 55–61.

    Article  Google Scholar 

  9. Belousov, A. I., Verzakov, S. A., Frese, J. V., A flexible classification approach with optimal generalization performance: support vector machines, Chemom. Intell. Lab. Syst., 2002, 64(1): 15–25.

    Article  Google Scholar 

  10. Cai, Y. D., Feng, K. Y., Li, Y. X. et al., Support vector machine for predicting α-turn types, Peptides, 2003, 24(4): 629–630.

    Article  Google Scholar 

  11. Gaoa, J. B., Gunnb, S. R., Harrisb, C. J., SVM regression through variational methods and its sequential implementation, Neurocomputing, 2003, 55(1): 151–167.

    Article  Google Scholar 

  12. Mei, H., Zhou, Y., Li, S. Z. et al., A new set of amino acid descriptors and its application in peptide QSARs, Biopolymers (Pept. Sci.) 2005, DOI 10.1002bip.20296.

  13. Golbraikh, A., Tropsha, A., Beware of q2!, J. Mol. Graphics Mod., 2002, 20(4): 269–276.

    Article  Google Scholar 

  14. Tropsha, A., Gramatica, P., Gombar, V. K., The importance of being earnest: validation is the absolute essential for successful application and interpretation of QSPR models, QSAR Comb. Sci., 2003, 22(1): 69–77.

    Article  Google Scholar 

  15. Gramatica, P., Pilutti, P., Papa, E., Validated QSAR prediction of OH tropospheric degradation of VOCs: splitting into training-test sets and consensus modeling, J. Chem. Inf. Comput. Sci., 2004, 44(5): 1794–1802.

    Google Scholar 

  16. Collantes, E. R., Dunn, W. J., Amino acid side chain descriptors for quantitative structure activity relationship studies of peptide analogues, J. Med. Chem., 1995, 38(14): 2705–2713.

    Article  Google Scholar 

  17. Hellberg, S., Eriksson, L., Jonsson, J. et al., Minimum analogue peptide sets (MAPS) for quantitative structure-activity relationships, Int. J. Pept Protein Res., 1991, 37(5): 414–424.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhiliang Li.

About this article

Cite this article

Mei, H., Zhou, Y., Liang, G. et al. Support vector machine applied in QSAR modelling. Chin.Sci.Bull. 50, 2291–2296 (2005). https://doi.org/10.1007/BF03183737

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF03183737

Keywords

Navigation