Skip to main content

The Joint Role of Batch Size and Query Strategy in Active Learning-Based Prediction - A Case Study in the Heart Attack Domain

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13566))

Abstract

This paper proposes an Active Learning algorithm that could detect heart attacks based on different body measures, which requires much less data than the passive learning counterpart while maintaining similar accuracy. To that end, different parameters were tested, namely the batch size and the query strategy used. The initial tests on batch size consisted of varying its value until 50. From these experiments, the conclusion was that the best results were obtained with lower values, which led to the second set of experiments, varying the batch size between 1 and 5 to understand in which value the accuracy was higher. Four query strategies were tested: random sampling, least confident sampling, margin sampling and entropy sampling. The results of each approach were similar, reducing by 57% to 60% the amount of data required to obtain the same results of the passive learning approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Active Learning is also used in the ML branch of Reinforcement Learning; in this paper, we are confined to the ML branch of Supervised and Semi-Supervised Learning.

References

  1. Balcan, M.F., Long, P.: Active and passive learning of linear separators under log-concave distributions. In: Conference on Learning Theory, pp. 288–316. PMLR (2013)

    Google Scholar 

  2. Bisdas, S., et al.: Artificial intelligence in medicine: a multinational multi-center survey on the medical and dental students’ perception. Front. Public Health 9 (2021). https://doi.org/10.3389/fpubh.2021.795284, https://www.frontiersin.org/article/10.3389/fpubh.2021.795284

  3. Chowdhury, M.E., et al.: Wearable real-time heart attack detection and warning system to reduce road accidents. Sens. (Switz.) 19(12) (2019). https://doi.org/10.3390/s19122780, https://www.mdpi.com/1424-8220/19/12/2780

  4. Danka, T., Horvath, P.: modAL: a modular active learning framework for Python. CoRR (2018). https://github.com/cosmic-cortex/modAL. Available on arXiv at https://arxiv.org/abs/1805.00979

  5. Han, W., et al.: Semi-supervised active learning for sound classification in hybrid learning environments. PLoS One 11(9), e0162075 (2016)

    Article  Google Scholar 

  6. Janosi, A., Steinbrunn, W., Pfisterer, M., Detrano, R.: Heart disease data set (2020). https://archive.ics.uci.edu/ml/datasets/Heart+Disease. Accessed 03 Nov 2021

  7. Mahapatra, D., Schüffler, P.J., Tielbeek, J.A.W., Vos, F.M., Buhmann, J.M.: Semi-supervised and active learning for automatic segmentation of Crohn’s disease. In: Mori, K., Sakuma, I., Sato, Y., Barillot, C., Navab, N. (eds.) MICCAI 2013. LNCS, vol. 8150, pp. 214–221. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40763-5_27

    Chapter  Google Scholar 

  8. Settles, B.: Active learning literature survey. Mach. Learn. 15(2), 201–221 (2010). 10.1.1.167.4245

    Google Scholar 

  9. Settles, B.: From theories to queries. In: Guyon, I., Cawley, G.C., Dror, G., Lemaire, V., Statnikov, A.R. (eds.) Active Learning and Experimental Design workshop, In conjunction with AISTATS 2010, Sardinia, Italy, 16 May 2010. JMLR Proceedings, vol. 16, pp. 1–18. JMLR.org (2011). http://proceedings.mlr.press/v16/settles11a/settles11a.pdf

  10. Smailagic, A., et al.: MedAL: deep active learning sampling method for medical image analysis. arXiv preprint arXiv:1809.09287 (2018)

  11. Srinivas, K., Rani, B.K., Govrdhan, A.: Applications of data mining techniques in healthcare and prediction of heart attacks. Int. J. Comput. Sci. Eng. (IJCSE) 2(02), 250–255 (2010)

    Google Scholar 

  12. Tengnah, M.A.J., Sooklall, R., Nagowah, S.D.: A predictive model for hypertension diagnosis using machine learning techniques. In: Telemedicine Technologies, pp. 139–152. Elsevier (2019)

    Google Scholar 

  13. World Health Organization: Cardiovascular diseases (2021). https://www.who.int/health-topics/cardiovascular-diseases. Accessed 04 Nov 2021

  14. Yakar, D., Ongena, Y.P., Kwee, T.C., Haan, M.: Do people favor artificial intelligence over physicians? A survey among the general population and their view on artificial intelligence in medicine. Value Health 25(3), 374–381 (2022). https://www.sciencedirect.com/science/article/pii/S1098301521017411

Download references

Acknowledgements

This work is funded by the FCT - Foundation for Science and Technology, I.P./MCTES through national funds (PIDDAC), within the scope of CISUC R &D Unit - UIDB/00326/2020 or project code UIDP/00326/2020.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luis Macedo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Faria, B., Perdigão, D., Brás, J., Macedo, L. (2022). The Joint Role of Batch Size and Query Strategy in Active Learning-Based Prediction - A Case Study in the Heart Attack Domain. In: Marreiros, G., Martins, B., Paiva, A., Ribeiro, B., Sardinha, A. (eds) Progress in Artificial Intelligence. EPIA 2022. Lecture Notes in Computer Science(), vol 13566. Springer, Cham. https://doi.org/10.1007/978-3-031-16474-3_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-16474-3_38

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-16473-6

  • Online ISBN: 978-3-031-16474-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics