Abstract
We study the virtual machine live migration (LM) and disaster recovery (DR) from a networking perspective, considering long-distance networks, for example, between data centers. These networks are usually constrained by limited available bandwidth, increased latency and congestion, or high cost of use when dedicated network resources are used, while their exact characteristics cannot be controlled. LM and DR present several challenges due to the large amounts of data that need to be transferred over long-distance networks, which increase with the number of migrated or protected resources. In this context, our work presents the way LM and DR are currently being performed and their operation in long-distance networking environments, discussing related issues and bottlenecks and surveying other works. We also present the way networks are evolving today and the new technologies and protocols (e.g., software-defined networking, or SDN, and flexible optical networks) that can be used to boost the efficiency of LM and DR over long distances. Traffic redirection in a long-distance environment is also an important part of the whole equation, since it directly affects the transparency of LM and DR. Related works and solutions both from academia and the industry are presented.
- R. Ahmad, A. Gani, S. Hamid, M. Shiraz, A. Yousafzai, and F. Xia. 2015. A survey on virtual machine migration and server consolidation frameworks for cloud data centers. Journal of Network and Computer Applications 52, 11--25. Google ScholarDigital Library
- S. Akoush, R. Sohan, B. Roman, A. Rice, and A. Hopper. 2011. Activity based sector synchronisation: Efficient transfer of disk-state for wan live migration. MASCOTS. 22--31. Google ScholarDigital Library
- Alcatel Lucent. 2013. Bell Labs, The Cloud-Optimized MAN and WAN: Leveraging a Multi-Layer SDN Framework to Deliver Scalable and Agile Cloud Services.Google Scholar
- O. Alhazmi and Y. Malaiya. 2013. Evaluating disaster recovery plans using the cloud. IEEE Reliability and Maintainability Symposium.Google Scholar
- S. Al-Kiswany, D. Subhraveti, P. Sarkar, and M. Ripeanu. 2011. VMFlock: Virtual machine co-migration for the cloud. International ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC’11). Google ScholarDigital Library
- Amazon. 2011. Summary of the Amazon EC2 and Amazon rds service disruption in the us east region. http://aws.amazon.com/message/65648/.Google Scholar
- Amazon. 2015. EC2 Instances. https://aws.amazon.com/ec2/instance-types./ Retrieved November 2015.Google Scholar
- Amazon. 2016. Route 53. http://aws.amazon.com/route53/.Google Scholar
- Amazon. 2016. Using Amazon Web Services for Disaster Recovery. http://media.amazonwebservices.com/ AWS_Disaster_Recovery.pdf.Google Scholar
- A. Anand, V. Sekar, and A. Akella. 2009. SmartRE: An architecture for coordinated network-wide redundancy elimination. SIGCOMM. Google ScholarDigital Library
- A. Asensio and L. Velasco. Managing transfer-based datacenter connections. Journal of Optical Communications and Networking 6, 7, 660--669.Google Scholar
- A. Asensio, M. Ruiz, and L. Velasco. 2015. Orchestrating connectivity services to support elastic operations in datacenter federations. Photonic Network Communications. 1--16. Google ScholarDigital Library
- F. Balus, D. Stiliadis, and N. Bitar. 2012. Federated SDN-based Controllers for NVO3. www.ietf.org/proceedings/86/slides/slides-86-nvo3-7.pdf.Google Scholar
- J. Barrera, M. Ruiz, and L. Velasco. 2015. Orchestrating virtual machine migrations in telecom clouds. In Proceedings of IEEE/OSA Optical Fiber Communication Conference (OFC’15).Google Scholar
- A. Bianco, J. Finochietto, L. Giraudo, M. Modesti, and F. Neri. 2008. Network planning for disaster recovery. IEEE Workshop in Local and Metropolitan Area Networks. 43--48.Google Scholar
- S. Bose, S. Brock, R. Skeoch, and S. Rao. 2011. Cloud spider: Combining replication with scheduling for optimizing live migration of virtual machines across wide area networks. IEEE CCGRID. Google ScholarDigital Library
- B. Boughzala, R. Ben Ali, M. Lemay, Y. Lemieux, and O. Cherkaoui. 2011. OpenFlow supporting inter-domain virtual machine migration. International Conference on Wireless and Optical Communications Networks.Google Scholar
- R. Bradford, E. Kotsovinos, A. Feldmann, and H. Schioberg. 2007. Live wide-area migration of virtual machines including local persistent state. International Conference on Virtual Execution Environments (VEE’07). 169--179. Google ScholarDigital Library
- T. C. Bressoud and F. B. Schneider. 1996. Hypervisor based fault tolerance. ACM Transactions on Computer Systems (TOCS). 14, 1, 80--107. Google ScholarDigital Library
- BT. 2015. Hourly Network Summary. http://ippm.bt.net./ Retrieved November 2015.Google Scholar
- M. Casado, T. Koponen, R. Ramanathan, and S. Shenker. 2010. Virtualizing the network forwarding plane. ACM SIGCOMM Workshop on Programmable Routers for Extensible Services of Tomorrow. Google ScholarDigital Library
- W. Cerroni. 2015. Network performance of multiple virtual machine live migration in cloud federations. Journal of Internet Services and Applications. 6, 1, 1--20.Google ScholarCross Ref
- R. Chakravorty, S. Katti, J. Crowcroft, and I. Pratt. 2003. Flow aggregation for enhanced TCP over wide-area wireless. INFOCOM. 1754--1764.Google Scholar
- X. Chen, S. Chen, F. Tseng, L. Chou, and H. Chao. 2013. Minimizing virtual machine migration probability for cloud environments. HPCC.Google Scholar
- K. Christodoulopoulos, I. Tomkos, and E. A. Varvarigos. 2011. Elastic bandwidth allocation in flexible OFDM-based optical networks. Journal of Lightwave Technology. 29, 9, 1354--1366.Google ScholarCross Ref
- Cisco. 2006. InfiniBand SDR, DDR, and QDR Technology GuideGoogle Scholar
- Cisco. 2015. Cisco Visual Networking Index: Forecast and Methodology, 2014-2019. http://www.cisco.com/ c/en/us/solutions/collateral/service-provider/ip-ngn-ip-next-generation-network/white_paper_c11-4813 60.html. Retrieved November 2015.Google Scholar
- Cloudping. 2015. http://www.cloudping.info./ Retrieved November 2015.Google Scholar
- CloudFlare. 2011. A Brief Primer on Anycast. http://blog.cloudflare.com/a-brief-anycast-primer.Google Scholar
- CloudFlare. 2013. Load Balancing Without Load Balancers.Google Scholar
- Gluster. 2015. Managing GlusterFS Geo-replication. http://www.gluster.org/community/documentation/index. php/Gluster_3.2:_Managing_GlusterFS_Geo-replication. Retrieved September 2015.Google Scholar
- C. Clark, K. Fraser, S. Hand, and J. G. Hansen. 2005. Live migration of virtual machines. Network System Design and Implementation. Google ScholarDigital Library
- Contrail EU project. 2014. Overview of the Contrail System, Components and Usage.Google Scholar
- T. Costello. 2012. Business continuity: Beyond disaster recovery. Journal IT Professional. 14, 5. Google ScholarDigital Library
- R. Couto, S. Secci, M. Campista, and L. Costa. 2014. Network design requirements for disaster resilience in IaaS clouds. IEEE Communications Magazine. 52, 10, 52--58.Google ScholarCross Ref
- R. Couto, S. Secci, M. Campista, and L. Costa. 2015. Server placement with shared backups for disaster-resilient clouds. Computer Networks. Google ScholarDigital Library
- B. Cully, G. Lefebvre, D. Meyer, M. Feeley, N. Hutchinson, and A. Warfield. 2008. Remus: High availability via asynchronous virtual machine replication. USENIX Symposium on Networked Systems Design and Implementation. 161--174. Google ScholarDigital Library
- D. Darsena, G. Gelli, A. Manzalini, F. Melito, and F. Verde. 2013. Live migration of virtual machines among edge networks via WAN links. IEEE Future Network and Mobile Summit (FutureNetworkSummit’13).Google Scholar
- B. Davie and J. Gross. 2014 April. Stateless transport tunneling protocol for network virtualization (STT). Draft-Davie-Stt-06 (Work in Progress).Google Scholar
- U. Deshpande, U. Kulkarni, and K. Gopalan. 2012. Inter-rack live migration of multiple virtual machines. International Workshop on Virtualization Technologies in Distributed Computing (VTDC’12). Google ScholarDigital Library
- U. Deshpande, X. Wang, and K. Gopalan. 2011. Live gang migration of virtual machines. International ACM Symposium on High Performance Parallel and Distributed Computing (HPDC’11). Google ScholarDigital Library
- Y. Dong, W. Ye, Y. Jiang, I. Pratt, S. Ma, J. Li, and H. Guan. 2013. COLO: COarse-grained LOck-stepping virtual machines for non-stop service. Symposium on Cloud Computing (SOCC’13). Google ScholarDigital Library
- D. Erickson, G. Gibb, B. Heller, D. Underhill, J. Naous, G. Appenzeller, G. Parulkar, N. McKeown, M. Rosenblum, M. Lam, S. Kumar, V. Alaria, P. Monclus, F. Bonomi, J. Tourrilhes, P. Yalagandula, S. Banerjee, C. Clark, and R. McGeer. 2008. Demo: A demonstration of virtual machine mobility in an openflow network. ACM SIGCOMM.Google Scholar
- Ericsson Review. 2015. IP-optical convergence: A complete solution. https://www.ericsson.com/res/thecompany/docs/publications/ericsson_review/2014/er-ip-optical-convergence.pdf. Retrieved October 2015.Google Scholar
- EVault Cloud Disaster Recovery. 2014. http://www.seagate.com/files/www-content/services-software/cloud-resiliency-services/_shared/masters/docs/wp-cloud-disaster-recovery-ready-for-midmarket-2014-09-0019-w-us.pdf.Google Scholar
- Facebook Hits New Peak Of 1 Billion Users On A Single Day. 2015 http://techcrunch.com/2015/08/27/ facebook-hits-1-billion-users-in-a-single-day/. Retrieved October 2015.Google Scholar
- T. C. Ferreto, M. A. S. Netto, R. N. Calheiros, and C. A. F. De Rose. 2011. Server consolidation with migration control for virtualized data centers. Future Generation Computer Systems. Google ScholarDigital Library
- F5 Network and VMware. 2011. Enabling Long Distance Live Migration with F5 and VMware vMotion.Google Scholar
- Forbes. 2015. The big bang: How the cloud is changing resilience in the expanding universe of digital data. http://www.forbes.com/forbesinsights/ibm_big_bang/index.html. Retrieved October 2015.Google Scholar
- A. Ganguly, A. Agrawal, P. Boykin, and R. Figueiredo. 2006. WOW: self-organizing wide area overlay networks of virtual workstations, International Symposium on High-Performance Distributed Computing.Google Scholar
- Gartner. 2015. Magic Quadrant Disaster Recovery as a Service. https://www.gartner.com/doc/3033519/magic-quadrant-disaster-recovery-service. Retrieved September 2015.Google Scholar
- Gartner. 2015. Magic Quadrant for WAN Optimization. https://www.gartner.com/doc/3008618/magic-quadrant-wan-optimization. Retrieved September 2015.Google Scholar
- Gartner. 2015. Magic Quadrant for x86 Server Virtualization Infrastructure. https://www.gartner.com/doc/ 3093222/magic-quadrant-x-server-virtualization. Retrieved September 2015Google Scholar
- Gartner. 2015. Magic Quadrant for Enterprise Backup Software and Integrated Appliances. https://www.gartner.com/doc/3074822/magic-quadrant-enterprise-backup-software. Retrieved September 2015.Google Scholar
- B. Gerofi and Y. Ishikawa. 2011. Workload adaptive checkpoint scheduling of virtual machine replication. Pacific Rim International Symposium on Dependable Computing (PRDC’11). 204--213. Google ScholarDigital Library
- O. Gerstel, M. Jinno, A. Lord, and S. J. B. Yoo. 2012. Elastic optical networking: A new dawn for the optical layer? IEEE Communication Magazine. 50, 2, 12--20.Google ScholarCross Ref
- S. Ghorbani, C. Schlesinger, M. Monaco, E. Keller, M. Caesar, J. Rexford, and D. Walker. 2014. Transparent, live migration of a software-defined network. ACM Symposium on Cloud Computing. 1--14. Google ScholarDigital Library
- V. Gramoli, G. Jourjon, and O. Mehani. 2014. Can SDN mitigate disasters? arXiv:1410.4296.Google Scholar
- V. Gramoli, G. Jourjon, and O. Mehani. 2015. Disaster-tolerant storage with SDN. International Conference on Networked Systems.Google Scholar
- F. Hao, T. Lakshman, S. Mukherjee, and H. Song. 2009. Enhancing dynamic cloud-based services using network virtualization. ACM Workshop on Virtualized Infrastructure Systems and Architectures. 37--44. Google ScholarDigital Library
- E. Harney, S. Goasguen, J. Martin, M. Murphy, and M. Westall. 2007. The efficacy of live virtual machine migrations over the internet. International Workshop on Virtualization Technology in Distributed Computing (VTDC’07), 1--7. Google ScholarDigital Library
- M. R. Hines, U. Deshpande, and K. Gopalan. 2009. Post-copy live migration of virtual machines. ACM SIGOPS Operating Systems Review. 43, 3. Google ScholarDigital Library
- T. Hirofuchi, H. Nakada, S. Itoh, and S. Sekiguchi. 2012. Kagemusha: A guest-transparent mobile IPv6 mechanism for wide-area live VM migration. IEEE Network Operations and Management Symposium (NOMS’12). 1319--1326.Google Scholar
- T. Hirofuchi, H. Ogawa, H. Nakada, S. Itoh, and S. Sekiguchi. 2009. A live storage migration mechanism over WAN for relocatable virtual machine services over clouds. IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid’09). 460--465. Google ScholarDigital Library
- T. Hirofuchi, M. Tsugawa, H. Nakada, and T. Kudoh. 2012. A wan-optimized live storage migration mechanism toward virtual machine evacuation upon severe disasters. IEICE Transactions on Information and Systems 96, 12, 2663--2674.Google Scholar
- K. Hou, K. G. Shin, Y. Turner, and S. Singhal. 2013. Tradeoffs in compressing virtual machine checkpoints. International Workshop on Virtualization Technologies in Distributed Computing (VTDC’13). 41--48. Google ScholarDigital Library
- W. Huang, Q. Gao, J. Liu, and D. K. Panda. 2007. High performance virtual machine migration with RDMA over modern interconnects. IEEE International Conference on Cluster Computing (CLUSTER’07). Google ScholarDigital Library
- Huawei. 2014. Huawei Grandly Launches Active-Active Data Center Disaster Recovery Solution. http://pr.huawei.com/en/news/hw-371633-recoverysolution.htm#.VHeShDGUcYM. Retrieved November 2016.Google Scholar
- Infinera. 2015. http://www.infinera.com./ Retrieved September 2015.Google Scholar
- Infonetics. 2015. IHS Forecasts Huge Growth for 100 Gigabit Optical Ports as Operators Increase Network Capacity. http://www.infonetics.com/pr/2015/100G-Coherent-Optical-Ports-Highlights.asp. Retrieved September 2015.Google Scholar
- Infonetics. 2015. Carriers on Track to Spend $5.7B on SDN Hardware, Software and Services by 2019. http://www.infonetics.com/pr/2015/Carrier-SDN-Market-Forecast.asp.Google Scholar
- InfiniBand Trade Association. 2016. InfiniBand Architecture Specification. http://www.infinibandta.org/.Google Scholar
- IEEE802, Data Center Bridging. 2013. http://www.ieee802.org/1/pages/dcbridges.html.Google Scholar
- IETF. 2016. Locator/ID Separation Protocol (lisp). http://datatracker.ietf.org/wg/lisp/charter/.Google Scholar
- IBM. 2014. High availability vs. fault tolerance. http://www-01.ibm.com/support/knowledgecenter/SSPHQG_ 6.1.0/com.ibm.hacmp.concepts/ha_concepts_fault.htm. Retrieved November 2014.Google Scholar
- Infonetics. 2015. http://www.infonetics.com/pr/2014/Cloud-Services-IT-Market-Highlights.asp. Retrieved October 2015.Google Scholar
- ISO/IEC 27031:2011. 2011. http://www.iso.org/iso/catalogue_detail?csnumber=44374.Google Scholar
- ISO 22301:2012. 2012. http://www.iso.org/iso/catalogue_detail.htm?csnumber=50038.Google Scholar
- ISO 22313:2012. 2012. http://www.iso.org/iso/catalogue_detail?csnumber=50050.Google Scholar
- A. Izaddoost and S. Heydari. 2014. Enhancing network service survivability in large-scale failure scenarios. Journal of Communications and Networks 16, 5, 534--547.Google ScholarCross Ref
- X. Jiang and D. Xu. 2004. VIOLIN: Virtual internetworking on overlay infrastructure. ISPA. 937--946. Google ScholarDigital Library
- H. Jin, L. Deng, S. Wu, X. Shi, and X. Pan. 2009. Live virtual machine migration with adaptive memory compression. IEEE International Conference on Cluster Computing.Google Scholar
- U. Kalim, M. Gardner, E. Brown, and W. Feng. 2013. Seamless migration of virtual machines across networks. IEEE Computer Communications and Networks (ICCCN’13).Google Scholar
- T. S. Kang, M. Tsugawa, A. Matsunaga, T. Hirofuchi, and J. A. Fortes. 2014. Design and implementation of middleware for cloud disaster recovery via virtual machine migration management. IEEE/ACM 7th International Conference on Utility and Cloud Computing. 166--175. Google ScholarDigital Library
- T. Kang, M. Tsugawa, J. Fortes, and T. Hirofuchi. 2013. Reducing the migration times of multiple VMs on WANs using a feedback controller. IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW’’13). 1480--1489. Google ScholarDigital Library
- D. Kapil, E. Pilli, and R. Joshi. 2013. Live virtual machine migration techniques: Survey and research challenges. IEEE International Advance Computing Conference (IACC’13). 963--969.Google Scholar
- A. Khoshkholghi, A. Abdullah, R. Latip, S. Subramaniam, and M. Othman. 2014. Disaster recovery in cloud computing: A survey. Computer and Information Science. 7, 4.Google ScholarCross Ref
- S. Kihara and S. Moriai. 2008. Kemari: Virtual machine synchronization for fault tolerance. USENIX Annual Technical Conference.Google Scholar
- J. Kim, D. Chae, J. Kim, and J. Kim. 2013. Guide-copy: Fast and silent migration of virtual machine for datacenters. International Conference on High Performance Computing, Networking, Storage and Analysis. Google ScholarDigital Library
- KVM. 2016. http://www.linux-kvm.org/page/Main_Page.Google Scholar
- H. Lai, Y. Wu, and Y. Cheng. 2013. Exploiting neigborhood similarity for virtual machine migration over wide-area network. IEEE International Conference on Software Security and Reliability (SERE’13). 149--158. Google ScholarDigital Library
- A. Lenk and S. Tai. 2014. Cloud standby: disaster recovery of distributed systems in the cloud. In Service-Oriented and Cloud Computing. 32--46.Google Scholar
- L. Lewin-Eytan, K. Barabash, R. Cohen, V. Jain, and A. Levin. 2012. Designing modular overlay solutions for network virtualization. IBM Technical Paper.Google Scholar
- Q. Li, J. Huai, J. Li, Tianyu Wo, and Minxiong Wen. 2008. HyperMIP: Hypervisor controlled mobile IP for virtual machine live migration across networks. 11th IEEE High Assurance Systems Engineering Symposium. 80--88. Google ScholarDigital Library
- H. Liu and B. He. 2015. VMbuddies: Coordinating live migration of multi-tier applications in cloud environments. IEEE Transactions on Parallel and Distributed Systems. 26, 4.Google ScholarDigital Library
- H. Liu, H Jin, X. Liao, C. Yu, and C. Xu. 2011. Live virtual machine migration via asynchronous replication and state synchronization. IEEE Transactions on Parallel and Distributed Systems. 22, 12, 1986--1999. Google ScholarDigital Library
- J. Liu, Y. Li, and D. Jin. 2014. SDN-based live VM migration across datacenters. ACM SIGCOMM. 583--584. Google ScholarDigital Library
- Alcatel Lucent and Bell Labs. 2013. Metro Network Traffic Growth: An Architecture Impact Study.Google Scholar
- D. Malanik and R. Jaek. 2014. The performance of the data-cluster based on the CEPH platform with geographically separated nodes. IEEE International Conference Mathematics and Computers in Sciences and in Industry (MCSI’14). 299--307. Google ScholarDigital Library
- T. Malleswari, D. Malathi, and G. Vadivu. 2015. Deduplication techniques: A technical survey. International Journal for Innovative Research in Science and Technology. 1, 7, 318--325.Google Scholar
- U. Mandal, M. Habib, S. Zhang, P. Chowdhury, M. Tornatore, and B. Mukherjee. 2014. Heterogeneous bandwidth provisioning for virtual machine migration over SDN-enabled optical networks. IEEE Optical Fiber Communications Conference and Exhibition (OFC’14).Google Scholar
- V. Mann et al. 2012. Crossroads: Seamless vm mobility across datacenters through software defined networking. IEEE Network Operations and Management Symposium (NOMS’12). 88--96.Google ScholarCross Ref
- A. J. Mashtizadeh, M. Cai, G. Tarasuk-Levin, R. Koller, T. Garfinkel, and S. Setty. 2014. XvMotion: Unified virtual machine migration over long distance. USENIX Annual Technical Conference. Google ScholarDigital Library
- F. Mattos, D. Menezez, and O. C. Muniz Bandeira Duarte. 2014. XenFlow: Seamless migration primitive and quality of service for virtual networks. IEEE Global Communications Conference (GLOBECOM’14). 2326--2331.Google Scholar
- A. Mayoral, R. Vilalta, R. Munoz, R. Casellas, and R. Martinez. 2015. Experimental seamless virtual machine migration using an integrated SDN IT and network orchestrator. IEEE Optical Fiber Communications Conference and Exhibition (OFC’15).Google Scholar
- N. McKeown, T. Anderson, H. Balakrishnan, G. Parulkar, L. Peterson, J. Rexford, S. Shenker, and J. Turner. 2008. Openflow: Enabling innovation in campus networks. SIGCOMM Computer Communication Review. 38, 2, 69--74. Google ScholarDigital Library
- V. Medina and J. García. 2014. A survey of migration mechanisms of virtual machines. ACM Computing Surveys (CSUR). 46, 3. Google ScholarDigital Library
- Microsoft. 2015. Azure VMs. https://azure.microsoft.com/en-us/pricing/details/virtual-machines. Retrieved November 2015.Google Scholar
- Microsoft. 2016. Hyper-V. http://www.microsoft.com/en-us/server-cloud/solutions/virtualization.aspx.Google Scholar
- U. F. Minhas, S. Rajagopalan, B. Cully, A. Aboulnaga, K. Salem, and A. Warfield. 2013. RemusDB: Transparent high availability for database systems. International Journal on Very Large Data Bases (VLDB). 22, 1, 29--45. Google ScholarDigital Library
- K. Nagin, D. Hadas, Z. Dubitzky, A. Glikson, I. Loy, B. Rochwerger, and L. Schour. 2011. Inter-cloud mobility of virtual machines. Annual International Conference on Systems and Storage (SYSTOR’11). Google ScholarDigital Library
- I. Nakagawa, K. Ichikawa, T. Kondo, Y. Kitaguchi, H. Kashiwazaki, and S. Shimojo. 2014. Transpacific live migration with wide area distributed storage. IEEE Computer Software and Applications Conference (COMPSAC’14). 486--492. Google ScholarDigital Library
- NetApp. 2016. SnapMirror. http://www.netapp.com/us/products/protection-software/snapmirror.aspx.Google Scholar
- Nuage Networks. 2015. http://www.nuagenetworks.net./ Retrieved September 2015.Google Scholar
- C. Oberg, A. Whitt, and R. Mills. 2011. Disasters will happen-are you ready? IEEE Communications Magazine. 1, 49, 36--42. Google ScholarDigital Library
- Kei Ohmura. 2011. Rapid VM Synchronization with I/O Emulation Logging-Replay.Google Scholar
- Open Networking Foundation. 2015. https://www.opennetworking.org. Retrieved November 2015Google Scholar
- Ovirt. 2015. Storage Live Migration. http://www.ovirt.org/Features/Design/StorageLiveMigration. Retrieved November 2015.Google Scholar
- A. Peddemors, R. Spoor, P. Dekkers, and C. den Besten. 2011. Using DRBD over Wide Area Networks.Google Scholar
- P. Pisa, N. Fernandes, H. Carvalho, M. Moreira, M. E. Campista, L. H. Costa, and O. C. Duarte. 2010. OpenFlow and xen-based virtual network migration. Communications: Wireless in Developing Countries and Networks of the Future. 170--181.Google Scholar
- Y. Pu, Y. Deng, and A. Nakao. 2011. Cloud rack: Enhanced virtual topology migration approach with Open vSwitch. International Conference on Information Networking. 160--164.Google Scholar
- P. Raad, G. Colombo, D. Chi, S. Secci, A. Cianfrani, P. Gallard, and G. Pujolle. 2013. Achieving sub-second downtimes in internet-wide virtual machine live migrations in LISP networks. IFIP/IEEE International Symposium on Integrated Network Management. 286--293.Google Scholar
- P. Raad, G. Colombo, D. Phung Chi, S. Secci, A. Cianfrani, P. Gallard, and G. Pujolle. Demonstrating LISP-based virtual machine mobility for cloud networks. IEEE 1st International Conference on Cloud Networking (CLOUDNET’12). 200--202.Google Scholar
- C. Raiciu, D. Niculescu, M. Bagnulo, and M. J. Handley. 2011. Opportunistic mobility with multipath TCP. MobiArch. Google ScholarDigital Library
- S. Rajagopalan, B. Cully, R. O'Connor, and A. Warfield. 2012. Secondsite: Disaster tolerance as a service. ACM SIGPLAN/SIGOPS Conference on Virtual Execution Environments. 97--108. Google ScholarDigital Library
- S. Rajagopalan, D. Williams, and H. Jamjoom. 2013. Pico replication: A high availability framework for middleboxes. Annual Symposium on Cloud Computing (SOCC’13). Google ScholarDigital Library
- K. Ramakrishnan, P. Shenoy, and J. Van der Merwe, Live data center migration across WANs: A robust cooperative context aware approach. ACM SIGCOMM, Workshop on Internet Network Management. 262--267. Google ScholarDigital Library
- RFC 7348. 2014 August. VxLAN: A framework for overlaying virtualized layer 2 networks over layer 3 networks.Google Scholar
- P. Riteau, C. Morin, and T. Priol. 2011. Shrinker: Improving live migration of virtual clusters over WANs with distributed data deduplication and content-based addressing. European Conference on Parallel Processing. Google ScholarDigital Library
- A. Sahoo, K. Kant, and P. Mohapatra. 2009. BGP convergence delay under large-scale failures: Characterization and solutions. Computer Communications. 32, 7, 1207--1218. Google ScholarDigital Library
- P. Samadi, J. Xu, and K. Bergman. 2015. Virtual machine migration over optical circuit switching network in a converged inter/intra data center architecture. Optical Fiber Communication Conference.Google Scholar
- C. P. Sapuntzakis, R. Chandra, B. Pfaff, J. Chow, M. S. Lam, and M. Rosenblum. 2002. Optimizing the migration of virtual computers. USENIX Symposium on Operating Systems Design and Implementation (OSDI’02). Google ScholarDigital Library
- T. Sarker and M. Tang. 2013. Performance-driven live migration of multiple virtual machines in datacenters. IEEE International Conference on Granular Computing (GrC’13).Google Scholar
- D. J. Scales, M. Nelson, and G. Venkitachalam. 2010. The design of a practical system for fault-tolerant virtual machines. ACM SIGOPS Operating Systems Review. 44, 4, 30--39. Google ScholarDigital Library
- Serverdensity. 2014. Network performance at AWS, Google, Rackspace and Softlayer. https://blog. serverdensity.com/network-performance-aws-google-rackspace-softlayer. Retrieved November 2015.Google Scholar
- K. Shima, and N Dang. 2012. Indexes for Distributed File/Storage Systems as a Large Scale Virtual Machine Disk Image Storage in a Wide Area Network.Google Scholar
- V. Shrivastava, P. Zerfos, L. Kang-won, H. Jamjoom, L. Yew-Huey, and S. Banerjee. 2011. Application-aware virtual machine migration in data centers. IEEE INFOCOM. 66--70.Google Scholar
- E. Silvera, G. Sharaby, D. Lorenz, and I. Shapira. 2009. IP mobility to support live migration of virtual machines across subnets. SYSTOR. Google ScholarDigital Library
- Silver-peak. 2015. Silver Peak and VMware vSphere Replication. https://www.silver-peak.com/sites/default/ files/infoctr/silver-peak_ss_vmware-vsphere-replication.pdf. Retrieved September 2015.Google Scholar
- A. Snoeren, D. Andersen, and H. Balakrishnan. 2001. Fine-grained failover using connection migration. Conference on USENIX Symposium on Internet Technologies and Systems (USITS’01). Google ScholarDigital Library
- Solutions-review. 2015. Backup and Disaster Recovery Buyers Guide. http://solutions-review.com/backup-disaster-recovery/get-a-free-backup-and-disaster-recovery-buyers-guide. Retrieved September 2015.Google Scholar
- M. Sridharan, K. Duda, I. Ganga, A. Greenberg, G. Lin, M. Pearson, P. Thaler, C. Tumuluri, N. Venkataramiah, and Y. Wang. 2013. NVGRE: Network virtualization using generic routing encapsulation. Draft-Sridharan-Virtualization-Nvgre-03.Google Scholar
- T. E. Stern and K. Bala. 1999. Multiwavelength Optical Networks: A Layered Approach. Prentice Hall. Google ScholarDigital Library
- A. Strunk. 2012. Costs of virtual machine live migration: A survey. IEEE 8th World Congress on Services. 323--329. Google ScholarDigital Library
- P. Svard, B. Hudzia, J. Tordsson, and E. Elmroth. 2011. Evaluation of delta compression techniques for effcient live migration of large virtual machines. Conference on Virtual Execution Environments. Google ScholarDigital Library
- P. Svärd, B. Hudzia, S. Walsh, J. Tordsson, and E. Elmroth. 2015. Principles and performance characteristics of algorithms for live VM migration. ACM SIGOPS Operating Systems Review. 49, 1, 142--155. Google ScholarDigital Library
- Y. Tan, H. Jiang, D. Feng, L. Tian, and Z. Yan. 2011. CABdedupe: A causality-based deduplication performance booster for cloud backup services. IEEE International Parallel & Distributed Processing Symposium (IPDPS’’11). 1266--1277. Google ScholarDigital Library
- F. Travostino, P. Daspit, L. Gommans, C. Jog, C. de Laat, J. Mambretti, I. Monga, B. van Oudenaarde, S. Raghunath, and P. Yonghui Wang. 2006. Seamless live migration of virtual machines over the long distance. Future Generation Computer Systems. 22, 8, 901--907. Google ScholarDigital Library
- M. Tsugawa, P. Riteau, A. Matsunaga, and J. Fortes. 2010. User-level virtual networking mechanisms to support virtual machine migration over multiple clouds. IEEE GLOBECOM Workshops. 568--572.Google Scholar
- A Vahdat. 2013. Scale and programmability in google's software defined data center WAN. ACM Symposium on Cloud Computing (SoCC’13).Google Scholar
- Velocloud. 2016. from: http://www.velocloud.com./ Retrieved February 2016.Google Scholar
- VirtualBox. 2016. https://www.virtualbox.org/.Google Scholar
- VMWare. 2016. http://www.vmware.com/.Google Scholar
- VMWare. 2015. Long Distance vMotion requirements in VMware vSphere 6.0. http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd==displayKC&externalId==2106949. Retrieved September 2015.Google Scholar
- VMware. 2015. vMotion. https://www.vmware.com/products/vsphere/features/vmotion. Retrieved November 2015.Google Scholar
- VMware. 2015. VMware pushes the envelope with vSphere 6.0 vMotion. https://blogs.vmware.com/ performance/2015/02/vmware-pushes-envelope-vsphere-6-0-vmotion.html. Retrieved September 2015.Google Scholar
- VMware. 2015. vSphere Replication. http://www.vmware.com/products/vsphere/features/replication. Retrieved September 2015.Google Scholar
- VMware. 2015c. vSphere 6.0 Advantages Over Hyper-V. https://www.vmware.com/files/pdf/vSphere-6.0-Advantages-Over-Hyper-V.pdf. Retrieved September 2015.Google Scholar
- VMWare, VMWare vCenter Site Recovery Manager. 2016. https://www.vmware.com/products/site-recovery-manager.Google Scholar
- VMWare and Cisco. 2009. Virtual Machine Mobility with Vmware VMotion and Cisco Data Center Interconnect Technologies.Google Scholar
- G. Wang, D. G. Andersen, M. Kaminsky, K. Papagiannaki, T. S. E. Ng, M. Kozuch, and M. P. Ryan. 2010. c-Through: Part-time optics in data centers. ACM SIGCOMM. 327--338. Google ScholarDigital Library
- Y. Wang, E. Keller, B. Biskeborn, J. van der Merwe, and J. Rexford. 2008. Virtual routers on the move: Live router migration as a network management primitive. ACM SIGCOMM Computer Communication Review. 38, 4, 231--242. Google ScholarDigital Library
- L. Wang, H. Ramasamy, R. Harper, M. Viswanathan, and E. Plattier. 2015. Experiences with building disaster recovery for enterprise-class clouds. Annual IEEE/IFIP International Conference on Dependable Systems and Networks. 231--238. Google ScholarDigital Library
- L. Wang. 2006. Desigh and implementation of TCPHA. Draft Release. http://dragon.linux-vs.org/∼dragonfly/.Google Scholar
- H. Watanabe, T. Ohigashi, T. Kondo, K. Nishimura, and R. Aibara. 2010. A performance improvement method for the global live migration of virtual machine with IP mobility. International Conference on Mobile Computing and Ubiquitous Networking (ICMU’10).Google Scholar
- T. Wood, E. Cecchet, K. Ramakrishnan, P. Shenoy, J. Van Der Merwe, and A. Venkataramani. 2010. Disaster recovery as a cloud service: Economic benefits & deployment challenges. 2nd USENIX Workshop on Hot Topics in Cloud Computing. 1--7. Google ScholarDigital Library
- T. Wood, H. Lagar-Cavilla, K. Ramakrishnan, P. Shenoy, and J. Van der Merwe. 2011. PipeCloud: Using causality to overcome speed-of-light delays in cloud-based disaster recovery. SoCC. Google ScholarDigital Library
- T. Wood, K. Ramakrishnan, P. Shenoy, and J. van der Merwe. 2011. CloudNet: Dynamic pooling of cloud resources by live WAN migration of virtual machines. ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE’11). Google ScholarDigital Library
- Xen. 2016. http://www.xenproject.org/.Google Scholar
- R. Xie, Y. Wen, X. Jia, and H. Xie. 2014. Supporting seamless virtual machine migration via named data networking in cloud data center. IEEE Transactions on Parallel and Distributed Systems. Google ScholarDigital Library
- K. Ye, X Jiang, R Ma, and F Yan. 2012. VC-migration: Live migration of virtual clusters in the cloud. ACM/IEEE International Conference on Grid Computing (GRID’12). Google ScholarDigital Library
- Zerto. 2015. http://www.zerto.com. Retrieved September 2015.Google Scholar
- W. Zhang, K. T. Lam, and C. L. Wang. 2014. Adaptive live VM migration over a WAN: Modeling and implementation. IEEE International Conference Cloud Computing (CLOUD’13). 368--375. Google ScholarDigital Library
- X. Zhang, Z. Huo, J. Ma, and D. Meng. 2010. Exploiting data deduplication to accelerate live virtual machine migration. IEEE International Conference on Cluster Computing. 88--96. Google ScholarDigital Library
- J. Zheng, T. Eugene Ng, K. Sripanidkulchai, and Z. Liu. 2014. COMMA: Coordinating the migration of multi-tier applications. ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE’14). Google ScholarDigital Library
- J. Zheng, T. Sing Eugene Ng, and K. Sripanidkulchai. 2011. Workload-aware live storage migration for clouds. ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE’11). Google ScholarDigital Library
Index Terms
- Survey: Live Migration and Disaster Recovery over Long-Distance Networks
Recommendations
Reducing the Migration Times of Multiple VMs on WANs Using a Feedback Controller
IPDPSW '13: Proceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing Workshops and PhD ForumVirtual machine (VM) migration is affected by network latency and throughput, which are highly fluctuating and unpredictable in wide-area networks (WANs). Hence, it is difficult to statically minimize the time required to transfer a large number of VMs ...
SRVM: Hypervisor Support for Live Migration with Passthrough SR-IOV Network Devices
VEE '16: Proceedings of the12th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution EnvironmentsSingle-Root I/O Virtualization (SR-IOV) is a specification that allows a single PCI Express (PCIe) device (ysical function or PF) to be used as multiple PCIe devices (virtual functions or VF). In a virtualization system, each VF can be directly assigned ...
SRVM: Hypervisor Support for Live Migration with Passthrough SR-IOV Network Devices
VEE '16Single-Root I/O Virtualization (SR-IOV) is a specification that allows a single PCI Express (PCIe) device (ysical function or PF) to be used as multiple PCIe devices (virtual functions or VF). In a virtualization system, each VF can be directly assigned ...
Comments