Skip to main content

Introduction to Big Data and Data Science: Methods and Applications

  • Chapter
  • First Online:
Advances in Data Science: Methodologies and Applications

Part of the book series: Intelligent Systems Reference Library ((ISRL,volume 189))

Abstract

Big data and data science are transforming our world today in ways we could not have imagined at the beginning of the twenty-first century. The accompanying wave of innovation has sparked advances in healthcare, engineering, business, science, and human perception, among others. In this chapter we discuss big data and data science to establish a context for the state-of-the-art technologies and applications in this book. In addition, to provide a starting point for new researchers, we present an overview of big data management and analytics methods. Finally, we suggest opportunities for future research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. ACM SIGMOD Rec. 22, 207–216 (1993)

    Article  Google Scholar 

  2. Chong, A.Y.L., Li, B., Ngai, E.W.T., Ch’ng, E., Lee, F.: Predicting online product sales via online reviews, sentiments, and promotion strategies: a big data architecture and neural network approach. Int. J. Oper. Prod. Manag 36(4), 358–383 (2016)

    Article  Google Scholar 

  3. Cui, B., Mondal, A., Shen, J., Cong, G., Tan, K. L.: On effective e-mail classification via neural networks. In: International Conference on Database and Expert Systems Applications (pp. 85–94). Springer, Berlin, Heidelberg (2005, August)

    Google Scholar 

  4. Dang, T., Stasak, B., Huang, Z., Jayawardena, S., Atcheson, M., Hayat, M., Le, P., Sethu, V., Goecke, R., Epps, J.: Investigating word affect features and fusion of probabilistic predictions incorporating uncertainty in AVEC 2017. In: Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA. 27–35, (2017)

    Google Scholar 

  5. Epasto, A., Lattanzi, S., Mirrokni, V., Sebe, I.O., Taei, A., Verma, S.: Ego-net community mining applied to friend suggestion. Proc. VLDB Endowment 9, 324–335 (2015)

    Article  Google Scholar 

  6. Erlandsson, F., Bródka, P., Borg, A., Johnson, H.: Finding influential users in social media using association rule learning. Entropy 18(164), 1–15 (2016). https://doi.org/10.3390/e1805016

  7. Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. IEEE Trans. Syst. Many, and Cybern. Part C: Appl. Rev. 40(2), 121–144 (2010)

    Google Scholar 

  8. Gandomi, A., Haider, M.: Beyond the hype: big data concepts, methods, and analytics. Int. J. Inf. Manage. 35, 137–144 (2015)

    Article  Google Scholar 

  9. Gong, Y., Poellabauer, C.: Topic modeling based on multi-modal depression detection. In: Proceeding of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, pp. 69–76, (2017)

    Google Scholar 

  10. Güneş, I., Gündüz-Öĝüdücü, Ş., Çataltepe, Z.: Link prediction using time series of neighborhood-based node similarity scores. Data Min. Knowl. Disc. 30, 147–180 (2016)

    Article  MathSciNet  Google Scholar 

  11. Gupta, B., Rawat, A., Jain, A., Arora, A., Dhami, N.: Analysis of various decision tree algorithms for classification in data mining. Int. J. Comput. Appl. 163(8), 15–19 (2017)

    Google Scholar 

  12. Koc, Y., Eyduran, E., Akbulut, O.: Application of regression tree method for different data from animal science. Pakistan J. Zool. 49(2), 599–607 (2017)

    Article  Google Scholar 

  13. Linden, A., Yarnold, P.R.: Modeling time-to-event (survival) data using classification tree analysis. J Eval. Clin. Pract. 23(6), 1299–1308 (2017)

    Article  Google Scholar 

  14. Liu, C., Wang, J., Zhang, H., Yin, M.: Mapping the hierarchical structure of the global shipping network by weighted ego network analysis. Int. J. Shipping Transp. Logistics 10, 63–86 (2018)

    Article  Google Scholar 

  15. Mowlaei, M.F., Abadeh, M.S., Keshavarz, H.: Aspect-based sentiment analysis using adaptive aspect-based lexicons. Expert Syst. Appl. 148, 113234 (2020)

    Google Scholar 

  16. Nisbet R., Elder J., Miner G.: The three most common data mining software tools. In: Handbook of Statistical Analysis and Data Mining Applications, Chapter 10, pp. 197–234, (2009)

    Google Scholar 

  17. Pang-Ning T., Steinbach M., Vipin K.: Association analysis: basic concepts and algorithms. In: Introduction to Data Mining, Chap. 6, Addison-Wesley, pp. 327–414, (2005). ISBN 978-0-321-32136-7

    Google Scholar 

  18. Park, S., Lee, J., Kim, K.: Semi-supervised distributed representations of documents for sentiment analysis. Neural Networks 119, 139–150 (2019)

    Article  Google Scholar 

  19. Phillips-Wren G., Iyer L., Kulkarni U., Ariyachandra T.: Business analytics in the context of big data: a roadmap for research. Commun. Assoc. Inf. Syst. 37, 23 (2015)

    Google Scholar 

  20. Provost, F., Fawcett, T.: Data science and its relationship to big data and data-driven decision making. Big Data 1(1), 51–59 (2013)

    Article  Google Scholar 

  21. Rout, J.K., Choo, K.K.R., Dash, A.K., Bakshi, S., Jena, S.K., Williams, K.L.: A model for sentiment and emotion analysis of unstructured social media text. Electron. Commer. Res. 18(1), 181–199 (2018)

    Article  Google Scholar 

  22. Tiefenbacher K., Olbrich S.: Applying big data-driven business work schemes to increase customer intimacy. In: Proceedings of the International Conference on Information Systems, Transforming Society with Digital Innovation, (2017)

    Google Scholar 

  23. Tsai, C.-F., Eberleb, W., Chua, C.-Y.: Genetic algorithms in feature and instance selection. Knowl. Based Syst. 39, 240–247 (2013)

    Article  Google Scholar 

  24. Yadava, A., Jhaa, C.K., Sharanb, A., Vaishb, V.: Sentiment analysis of financial news using unsupervised approach. Procedia Comput. Sci. 167, 589–598 (2020)

    Article  Google Scholar 

  25. Zheng, L., Hongwei, W., Song, G.: Sentimental feature selection for sentiment analysis of Chinese online reviews. Int. J. Mach. Learn. Cybernet. 9(1), 75–84 (2018)

    Article  Google Scholar 

Download references

Acknowledgements

The research leading to these results has received funding from the EU H2020 research and innovation program under grant agreement N. 769872 (EMPATHIC) and N. 823907 (MENHIR), the project SIROBOTICS that received funding from Italian MIUR, PNR 2015-2020, D. D. 1735, 13/07/2017, and the project ANDROIDS funded by the program V: ALERE 2019 Università della Campania “Luigi Vanvitelli”, D. R. 906 del 4/10/2019, prot. n. 157264, 17/10/2019.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gloria Phillips-Wren .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Phillips-Wren, G., Esposito, A., Jain, L. (2021). Introduction to Big Data and Data Science: Methods and Applications. In: Phillips-Wren, G., Esposito, A., Jain, L.C. (eds) Advances in Data Science: Methodologies and Applications. Intelligent Systems Reference Library, vol 189. Springer, Cham. https://doi.org/10.1007/978-3-030-51870-7_1

Download citation

Publish with us

Policies and ethics