Skip to main content
Log in

An overview of the commercial cloud monitoring tools: research dimensions, design issues, and state-of-the-art

  • Published:
Computing Aims and scope Submit manuscript

Abstract

Cloud monitoring activity involves dynamically tracking the Quality of Service (QoS) parameters related to virtualized resources (e.g., VM, storage, network, appliances, etc.), the physical resources they share, the applications running on them and data hosted on them. Applications and resources configuration in cloud computing environment is quite challenging considering a large number of heterogeneous cloud resources. Further, considering the fact that at given point of time, there may be need to change cloud resource configuration (number of VMs, types of VMs, number of appliance instances, etc.) for meet application QoS requirements under uncertainties (resource failure, resource overload, workload spike, etc.). Hence, cloud monitoring tools can assist a cloud providers or application developers in: (i) keeping their resources and applications operating at peak efficiency, (ii) detecting variations in resource and application performance, (iii) accounting the service level agreement violations of certain QoS parameters, and (iv) tracking the leave and join operations of cloud resources due to failures and other dynamic configuration changes. In this paper, we identify and discuss the major research dimensions and design issues related to engineering cloud monitoring tools. We further discuss how the aforementioned research dimensions and design issues are handled by current academic research as well as by commercial monitoring tools.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. http://www.nist.gov/itl/cloud/.

  2. http://bitnami.org/faq/cloud_amazon_ec2.

References

  1. Mell P, Grance T (2011) The NIST definition of cloud computing (draft). NIST Spec Publ 800:145

  2. Letaifa A, Haji A, Jebalia M, Tabbane S (2010) State of the art and research challenges of new services architecture technologies: virtualization, SOA and cloud computing. Int J Grid Distrib Comput 3

  3. Cong C, Liu J, Zhang Q, Chen H, Cong Z (2010) The characteristics of cloud computing. In: 39th international conference on parallel processing workshops (ICPPW), pp 275–279

  4. Zhang S, Zhang S, Chen X, Huo X (2010) Cloud computing research and development trend. In: 2nd international conference on future networks, ICFN’10, pp 93–97

  5. Ahmed M, Chowdhury ASMR, Ahmed M, Rafee MMH (2012) An advanced survey on cloud computing and state-of-the-art research issues. Int J Comput Sci Issues (IJCSI) 9

  6. Atzori L, Granelli F, Pescapè A (2011) A network-oriented survey and open issues in cloud computing

  7. Shin S, Gu G (2012) CloudWatcher: network security monitoring using openflow in dynamic cloud networks (or: How to provide security monitoring as a service in clouds?). In: 2012 20th IEEE international conference on network protocols (ICNP), pp 1–6

  8. De Chaves SA, Uriarte RB, Westphall CB (2011) Toward an architecture for monitoring private clouds. IEEE Commun Mag 49:130–137

    Google Scholar 

  9. Grobauer B, Walloschek T, Stocker E (2011) Understanding cloud computing vulnerabilities. In: IEEE security and privacy, vol 9, pp 50–57

  10. Moses J, Iyer R, Illikkal R, Srinivasan S, Aisopos K (2011) Shared resource monitoring and throughput optimization in cloud-computing datacenters. In: 2011 IEEE international parallel and distributed processing symposium (IPDPS), pp 1024–1033

  11. Wang L, Kunze M, Tao J, von Laszewski G (2011) Towards building a cloud for scientific applications. Adv Eng Softw 42(9):714–722

    Article  Google Scholar 

  12. Wang L, Chen D, Ma Y, Wang J (2013) Towards enabling cyberinfrastructure as a service in clouds. Comput Electr Eng 39(1):3–14

    Article  Google Scholar 

  13. Wang L, von Laszewski G, Younge AJ, He X, Kunze M, Tao J (2010) Cloud computing: a perspective study. New Gener Comput 28(2):137–146

    Article  MATH  Google Scholar 

  14. Begoli E, Horey J (2012) Design principles for effective knowledge discovery from big data. In: Joint working IEEE/IFIP conference on software architecture (WICSA) and European conference on software architecture (ECSA), pp 215–218

  15. Bryant R, Katz RH, Lazowska ED (2008) Big-data computing: creating revolutionary breakthroughs in commerce, science and society

  16. Labrinidis A, Jagadish H (2012) Challenges and opportunities with big data. In: Proceedings of the VLDB endowment, vol 5, pp 2032–2033

  17. Ma Y, Wang L, Liu D, Yuan T, Liu P, Zhang W (2013) Distributed data structure templates for data-intensive remote sensing applications. Concurr Comput Pract Exp 25(12):1784–1797

    Article  Google Scholar 

  18. Zhang W, Wang L, Liu D, Song W, Ma Y, Liu P, Chen Dan (2013) Towards building a multi-datacenter infrastructure for massive remote sensing image processing. Concurr Comput Pract Exp 25(12):1798–1812

    Article  Google Scholar 

  19. Zhang W, Wang L, Ma Y, Liu D (2013) Design and implementation of task scheduling strategies for massive remote sensing data processing across multiple data centers. Pract Exp Softw. doi:10.1002/spe.2229

  20. Twitter and Natural Disasters (2011) Crisis communication lessons from the Japan tsunami. http://www.sciencedaily.com/releases/2011/04/110415154734.htm. Accessed 22 Feb 2014

  21. Nita M-C, Chilipirea C, Dobre C, Pop F (2013) A SLA-based method for big-data transfers with multi-criteria optimization constraints for IaaS. In: 2013 11th roedunet international conference (RoEduNet), pp 1, 6

  22. Zhao M, Figueiredo RJ (2007) Experimental study of virtual machine migration in support of reservation of cluster resources. In: Proceedings of the 2nd international workshop on virtualization technology in distributed computing, p 5

  23. Wang L, Chen D, Zhao J, Tao J (2012) Resource management of distributed virtual machines. IJAHUC 10(2):96–111

    Article  Google Scholar 

  24. Calheiros RN, Ranjan R, Buyya R (2011) Virtual machine provisioning based on analytical performance and qos in cloud computing environments. In: International conference on parallel processing (ICPP), pp 295–304

  25. Kirschnick J, Calero A, Edwards N (2010) Toward an architecture for the automated provisioning of cloud services. IEEE Commun Mag 48:124–131

    Article  Google Scholar 

  26. Ranjan R, Zhao L, Wu X, Liu A, Quiroz A, Parashar M (2010) Peer-to-peer cloud provisioning: service discovery and load-balancing. In: Cloud computing, Springer, pp 195–217

  27. Liu X, Yang Y, Yuan D, Zhang G, Li W, Cao D (2011) A generic QoS framework for cloud workflow systems. In: 2011 IEEE ninth international conference on dependable, autonomic and secure computing (DASC), pp 713–720

  28. Ranjan R, Benatallah B Programming cloud resource orchestration framework: operations and research challenges. In: Technical report. http://arxiv.org/abs/1204.2204. Accessed 22 Feb 2014

  29. Aceto G, Botta A, de Donato W, Pescapè A (2013) Cloud monitoring: a survey. Comput Netw 57:2093–2115

    Google Scholar 

  30. Shao J, Wei H, Wang Q, Mei H (2010) A runtime model based monitoring approach for cloud. In: 2010 IEEE 3rd international conference on cloud computing (CLOUD), pp 313–320

  31. Caron E, Rodero-Merino L, Desprez F, Muresan A (2012) Auto-scaling, load balancing and monitoring in commercial and open-source clouds

  32. Spring J (2011) Monitoring cloud computing by layer, part 1. IEEE Secur Priv 9:66–68

    Google Scholar 

  33. Anand M (2012) Cloud monitor: monitoring applications in cloud. In: Cloud computing in emerging markets (CCEM), 2012 IEEE international conference on communication, networking and broadcasting, pp 1–4

  34. Kutare M, Eisenhauer G, Wang C, Schwan K, Talwar V, Wolf M (2010) Monalytics: online monitoring and analytics for managing large scale data centers. In: Proceedings of the 7th international conference on autonomic computing, pp 141–150

  35. Sundaresan S, de Donato W, Feamster N, Teixeira R, Crawford S, Pescape A (2011) Broadband internet performance: a view from the gateway. In: ACM SIGCOMM computer communication review, pp 134–145

  36. Massonet P, Naqvi S, Ponsard C, Latanicki J, Rochwerger B, Villari M (2011) A monitoring and audit logging architecture for data location compliance in federated cloud infrastructures. In: IEEE international symposium on parallel and distributed processing workshops and PhD forum (IPDPSW), pp 1510–1517

  37. Davis C, Neville S, Fernandez J, Robert J-M, Mchugh J (2008) Structured peer-to-peer overlay networks: ideal botnets command and control infrastructures? In: Computer security—ESORICS 2008, pp 461–480

  38. Monitis (2014) http://portal.monitis.com/. Accessed 22 Feb 2014

  39. RevealCloud (2014) http://copperegg.com/. Accessed 22 Feb 2014

  40. RevealCloud (2014) http://sandhill.com/article. Accessed 22 Feb 2014

  41. LogicMonitor (2014) http://www.logicmonitor.com/why-logicmonitor. Accessed 22 Feb 2014

  42. Nimsoft (2014) http://www.nimsoft.com/solutions/nimsoft-monitor/cloud. Accessed 22 Feb 2014

  43. Nagios (2014) http://www.nagios.com. Accessed 22 Feb 2014

  44. SPAE (2014) http://shalb.com/en/spae/spae_features/. Accessed 22 Feb 2014

  45. SPAE (2014) http://www.rackaid.com/resources/server-monitoring-cloud. Accessed 22 Feb 2014

  46. CloudWatch (2014) http://awsdocs.s3.amazonaws.com/AmazonCloudWatch/latest/acw-dg.pdf. Accessed 22 Feb 2014

  47. OpenNebula (2014) http://opennebula.org/documentation:rel4.0. Accessed 22 Feb 2014

  48. Cloudharmony (2014) http://cloudharmony.com/. Accessed 22 Feb 2014

  49. Azure FC (2014) http://www.techopedia.com/definition/26433/azure-fabric-controller. Accessed 22 Feb 2014

  50. Azure FC (2014) http://snarfed.org/windows_azure_details#Configuration_and_APIs. Accessed 22 Feb 2014

  51. Nathuji R, Kansal A, Ghaffarkhah A (2010) Q-clouds: managing performance interference effects for QoS-aware clouds. In: Proceedings of the 5th European conference on computer systems, pp 237–250

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rajiv Ranjan.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alhamazani, K., Ranjan, R., Mitra, K. et al. An overview of the commercial cloud monitoring tools: research dimensions, design issues, and state-of-the-art. Computing 97, 357–377 (2015). https://doi.org/10.1007/s00607-014-0398-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00607-014-0398-5

Keywords

Mathematics Subject Classification

Navigation