Skip to main content
Log in

A methodology for detecting relevant single nucleotide polymorphism in prostate cancer with multivariate adaptive regression splines and backpropagation artificial neural networks

  • S.I. : AI and ML applied to Health Sciences (MLHS)
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

The objective of the present paper is to model the genetic influence on prostate cancer with multivariate adaptive regression splines (MARS) and artificial neural networks (ANNs) techniques for classification. These models will be able to classify subjects that have cancer according to the values of the selected proteins from the genes selected with the models as most relevant. Subjects are selected as cases and controls from the MCC-Spain database and represent a heterogeneous group. Multivariate adaptive regression splines models allow to select a set of the most relevant proteins from the database. These models were trained in nine different degrees and chosen regarding its performance and complexity. Artificial neural networks models were trained on with data restricted to the most significant variables. The performance of both types of models was analyzed in terms of the area under the curve of the receiver operating characteristics curve. The ANN technique resulted in a model with AUC of 0.62006, while for MARS technique, the value was 0.569312 in the best situation. Then, the artificial neural network model obtained can determine whether a patient suffers prostate cancer significantly better than MARS models and with high rate of success. The best model presented was based on support vector machines, reaching values of AUC of 0.65212.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Center MM, Jemal A, Lortet-Tieulent J, Ward E, Ferlay J, Brawley O, Bray F (2012) International variation in prostate cancer incidence and mortality rates. Eur Urol 61:1079–1092

    Article  Google Scholar 

  2. Ferlay J, Soerjomataram I, Ervik M, Dikshit R, Eser S, Mathers C, Rebelo M, Parkin DM, Forman D, Bray F (2013) Cancer incidence and mortality worldwide: IARC Cancer Base No. 11. International Agency for Research on Cancer, Lyon. GLOBOCAN 2012 v1. 0, 2013

  3. Di Sebastiano KM, Mourtzakis M (2014) The role of dietary fat throughout the prostate cancer trajectory. Nutrients 6:6095–6109

    Article  Google Scholar 

  4. Allott EH, Masko EM, Freedland SJ (2013) Obesity and prostate cancer: weighing the evidence. Eur Urol 63:800–809

    Article  Google Scholar 

  5. Huncharek M, Haddock KS, Reid R, Kupelnick B (2010) Smoking as a risk factor for prostate cancer: a meta-analysis of 24 prospective cohort studies. Am J Public Health 100:693–701

    Article  Google Scholar 

  6. Liu Y, Hu F, Li D, Wang F, Zhu L, Chen W, Ge J, An R, Zhao Y (2011) Does physical activity reduce the risk of prostate cancer? A systematic review and meta-analysis. Eur Urol 60:1029–1044

    Article  Google Scholar 

  7. Discacciati A, Wolk A (2014) Lifestyle and dietary factors in prostate cancer prevention. In: Cuzick J, Thorat M (eds) prostate cancer prevention. Springer, Berlin, pp 27–37

    Chapter  Google Scholar 

  8. Gong Z, Neuhouser ML, Goodman PJ, Albanes D, Chi C, Hsing AW, Lippman SM, Platz EA, Pollak MN, Thompson IM et al (2006) Obesity, diabetes, and risk of prostate cancer: results from the prostate cancer prevention trial. Cancer Epidemiol Prev Biomark 15:1977–1983

    Article  Google Scholar 

  9. Bosetti C, Rosato V, Gallus S, Cuzick J, La Vecchia C (2012) Aspirin and cancer risk: a quantitative review to 2011. Ann Oncol 23:1403–1415

    Article  Google Scholar 

  10. Thompson IM, Goodman PJ, Tangen CM, Lucia MS, Miller GJ, Ford LG, Lieber MM, Cespedes RD, Atkins JN, Lippman SM (2003) others: The influence of finasteride on the development of prostate cancer. N Engl J Med 349:215–224

    Article  Google Scholar 

  11. Andriole GL, Bostwick DG, Brawley OW, Gomella LG, Marberger M, Montorsi F, Pettaway CA, Tammela TL, Teloken C, Tindall DJ et al (2010) Effect of dutasteride on the risk of prostate cancer. N Engl J Med 362:1192–1202

    Article  Google Scholar 

  12. Hamilton RJ, Freedland SJ (2008) Review of recent evidence in support of a role for statins in the prevention of prostate cancer. Curr Opin Urol 18:333–339

    Article  Google Scholar 

  13. Jalving M, Gietema JA, Lefrandt JD, de Jong S, Reyners AKL, Gans ROB, de Vries EGE (2010) Metformin: Taking away the candy for cancer? Eur J Cancer 46:2369–2380

    Article  Google Scholar 

  14. De Andrés J, Sánchez-Lasheras F, Lorca P, de Cos Juez FJ (2011) A hybrid device of Self Organizing Maps (SOM) and Multivariate Adaptive Regression Splines (MARS) for the forecasting of firms’ bankruptcy. Account Manag Inf Syst 10:351

    Google Scholar 

  15. Fernández JRA, Muñiz CD, Nieto PJG, de Cos Juez FJ, Lasheras FS, Roqueñí MN (2013) Forecasting the cyanotoxins presence in fresh waters: a new model based on genetic algorithms combined with the MARS technique. Ecol Eng 53:68–78

    Article  Google Scholar 

  16. Friedman JH (1991) Multivariate adaptive regression splines. Ann Stat 19:1–67

    Article  MathSciNet  Google Scholar 

  17. Sekulic S, Kowalski BR (1992) MARS: a tutorial. J Chemom 6:199–216

    Article  Google Scholar 

  18. Breiman L, Friedman JH, Olshen RA, Stone CJ (1993) Classification and Regression Trees, Wadsworth International Group, Belmon, CA, 1984. Case Descr Featur Subset Correct Missed FA Misclass 1:1–3

    Google Scholar 

  19. Antón JCÁ, Nieto PJG, de Cos Juez FJ, Lasheras FS, Viejo CB, Gutiérrez NR (2013) Battery state-of-charge estimator using the MARS technique. IEEE Trans Power Electron 28:3798–3805

    Article  Google Scholar 

  20. Guzmán D, de Cos Juez FJ, Lasheras FS, Myers R, Young L (2010) Deformable mirror model for open-loop adaptive optics using multivariate adaptive regression splines. Opt Express 18:6492–6505

    Article  Google Scholar 

  21. Nieto PJG, Torres JM, de Cos Juez FJ, Lasheras FS (2012) Using multivariate adaptive regression splines and multilayer perceptron networks to evaluate paper manufactured using Eucalyptus globulus. Appl Math Comput 219:755–763

    Google Scholar 

  22. Nieto PJG, Lasheras FS, de Cos Juez FJ, Fernández JRA (2011) Study of cyanotoxins presence from experimental cyanobacteria concentrations using a new data mining methodology based on multivariate adaptive regression splines in Trasona reservoir (Northern Spain). J Hazard Mater 195:414–421

    Article  Google Scholar 

  23. Friedman JH, Roosen CB (1995) An introduction to multivariate adaptive regression splines. Springer, Heidelberg

    Book  Google Scholar 

  24. Suárez Gómez SL, Gutiérrez CG, Rodríguez JDS, Rodríguez MLS, Lasheras FS, de Cos Juez FJ (2016) Analysing the performance of a tomographic reconstructor with different neural networks frameworks. In: International conference on intelligent systems design and applications, pp 1051–1060

    Google Scholar 

  25. Suárez Gómez SL, Santos Rodríguez JD, Iglesias Rodríguez FJ, de Cos Juez FJ (2017) Analysis of the temporal structure evolution of physical systems with the self-organising tree algorithm (SOTA): application for validating neural network systems on adaptive optics data before on-sky implementation. Entropy 19:103

    Article  Google Scholar 

  26. Sánchez AS, Fernández PR, Lasheras FS, de Cos Juez FJ, Nieto PJG (2011) Prediction of work-related accidents according to working conditions using support vector machines. Appl Math Comput 218:3539–3552

    MathSciNet  Google Scholar 

  27. Vilán JAV, Fernández JRA, Nieto PJG, Lasheras FS, de Cos Juez FJ, Muñiz CD (2013) Support vector machines and multilayer perceptron networks used to evaluate the cyanotoxins presence from experimental cyanobacteria concentrations in the Trasona reservoir (Northern Spain). Water Resour Manag 27:3457–3476

    Article  Google Scholar 

  28. Basden AG, Atkinson D, Bharmal NA, Bitenc U, Brangier M, Buey T, Butterley T, Cano D, Chemla F, Clark P (2016) others: experience with wavefront sensor and deformable mirror interfaces for wide-field adaptive optics systems. Mon Not R Astron Soc 459:1350–1359

    Article  Google Scholar 

  29. de Cos Juez FJ, Lasheras FS, Roqueñí N, Osborn J (2012) An ANN-based smart tomographic reconstructor in a dynamic environment. Sensors (Switzerland) 12:8895–8911. https://doi.org/10.3390/s120708895

    Article  Google Scholar 

  30. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105

  31. Gardner MW, Dorling SR (1998) Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences. Atmos Environ 32:2627–2636

    Article  Google Scholar 

  32. Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2:359–366

    Article  Google Scholar 

  33. González-Gutiérrez C, Santos-Rodríguez JD, Díaz RÁF, Rolle JLC, Gutiérrez NR, de Cos Juez FJ (2016) Using GPUs to speed up a tomographic reconstructor based on machine learning. In: International conference on european transnational education, pp 279–289

    Google Scholar 

  34. Haykin S (1999) Neural networks: a comprehensive foundation. Prentice-Hall, Upper Saddle River

    MATH  Google Scholar 

  35. Rumelhart DE, Hinton GE, Williams RJ (1988) Learning representations by back-propagating errors. Cogn Model 5:1

    MATH  Google Scholar 

  36. Lasheras, F.S., Gómez, S.L.S., Garc’\ia, M.V.R., Krzemień, A., Sánchez, A.S.: Time series and artificial intelligence with a genetic algorithm hybrid approach for rare earth price prediction. (2017)

  37. Ríos EMA, Crespo MMS, Sánchez AS, Gómez SLS, Lasheras FS (2017) Genetic algorithm based on support vector machines for computer vision syndrome classification. In: International Joint Conference SOCO’17-CISIS’17-ICEUTE’17 Le{ó}n, Spain, September 6–8, 2017, Proceeding, pp 381–390

  38. Castaño-Vinyals G, Aragonés N, Pérez-Gómez B, Martin V, Llorca J, Moreno V, Altzibar JM, Ardanaz E, De Sanjosé S, Jiménez-Moleón JJ et al (2015) Population-based multicase-control study in common tumors in Spain (MCC-Spain): rationale and study design. Gac. Sanit. 29:308–315

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Francisco Javier de Cos Juez.

Ethics declarations

Conflict of interest

The authors report no conflicts of interest in this work.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sánchez Lasheras, J.E., González Donquiles, C., García Nieto, P.J. et al. A methodology for detecting relevant single nucleotide polymorphism in prostate cancer with multivariate adaptive regression splines and backpropagation artificial neural networks. Neural Comput & Applic 32, 1231–1238 (2020). https://doi.org/10.1007/s00521-018-3503-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-018-3503-4

Keywords

Navigation