Skip to main content
Log in

Transport protocols for data center networks: a survey of issues, solutions and challenges

  • Published:
Photonic Network Communications Aims and scope Submit manuscript

Abstract

In recent years, data centers play an important role in academia and industry for supporting various services and applications. Compared with other IP networks, data center networks have some special features such as many-to-one communication pattern with high bandwidth, low latency, auto-scaling, shallow buffered switches and multi-rooted tree topology. Owing to these special features of data center networks, traditional TCP suffers from severe performance degradation. For improving the performance of TCP in data center networks, various solutions have been proposed in recent years. This article presents a comprehensive survey of existing transport layer solutions proposed for mitigating the problems of TCP in data center networks. The objective of this article is threefold: to discuss about the issues of TCP in data center networks; to introduce various transport layer solutions and finally to compare and discuss the challenges of existing solutions proposed for improving the performance of TCP in data center networks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Chen, K., Chengchen, H., Zhang, X., Zheng, K., Chen, Y., Vasilakos, A.V.: Survey on routing in data centers: insights and future directions. IEEE Netw. 25(4), 6–10 (2011)

    Article  Google Scholar 

  2. Hua, Y., Xue, L., Jiang, H.: ANTELOPE: a semantic-aware data cube scheme for cloud data center networks. IEEE Trans. Comput. 63(9), 2146–2159 (2014)

    Article  MathSciNet  Google Scholar 

  3. Kato, M.: Improving Transmission Performance with One-Sided Datacenter TCP. M.S. Thesis, Keio University (2014). http://eggert.org/students/kato-thesis.pdf

  4. Wang, L., Zhang, F., Vasilakos, A.V., Hou, C., Liu, Z.: Joint virtual machine assignment and traffic engineering for green data center networks. SIGMETRICS Perform. Eval. Rev. 41(3), 107–112 (2014)

    Article  Google Scholar 

  5. Zhang, Y., Ansari, N.: On architecture design, congestion notification, TCP Incast and power consumption in data centers. IEEE Commun. Surv. Tutor. 15(1), 39–64 (2013)

    Article  Google Scholar 

  6. Yu, Y.-J., Chuang, C.-C., Lin, H.-P., Pang, A.-C.: Efficient multicast delivery for wireless data center networks. In: 2013 IEEE 38th Conference on Local Computer Networks (LCN), pp. 228–235, 21–24 Oct 2013

  7. Li, D., Wu, J.: On the design and analysis of data center network architectures for interconnecting dual-port servers. In: INFOCOM, 2014 Proceedings IEEE, pp. 1851–1859, 27 April–2 May 2014

  8. Ilyadis, N.: The evolution of next-generation data center networks for high capacity computing. In: 2012 Symposium on VLSI Circuits (VLSIC), pp. 1–5, 13–15 June 2012

  9. Kachris, C., Tomkos, I.: A survey on optical interconnects for data centers. IEEE Commun. Surv. Tutor. 14(4), 1021–1036 (2012)

    Article  Google Scholar 

  10. Li, D., Mingwei, X., Liu, Y., Xie, X., Cui, Y., Wang, J., Chen, G.: Reliable multicast in data center networks. IEEE Trans. Comput. 63(8), 2011–2024 (2014)

    Article  MathSciNet  Google Scholar 

  11. Xu, H., Li, B.: RepFlow: minimizing flow completion times with replicated flows in data centers. In: INFOCOM, 2014 Proceedings IEEE, pp. 1581–1589, 27 April–2 May 2014

  12. Zhang, J., Ren, F., Lin, C.: Modeling and understanding TCP incast in data center networks. In: INFOCOM, 2011 Proceedings IEEE, pp. 1377–1385, 10–15 April 2011

  13. Zhang, J., Ren, F., Yue, X., Shu, R., Lin, C.: Sharing bandwidth by allocating switch buffer in data center networks. IEEE J. Sel. Areas Commun. 32(1), 39–51 (2014)

    Article  Google Scholar 

  14. Nagle, D., Serenyi, D., Matthews, A.: The panasas activescale storage cluster: delivering scalable high bandwidth storage. In: Proceedings of the ACM/IEEE Conference on Supercomputing, pp. 53–62 (2004)

  15. Phanishayee, A., Krevat, E., Vasudevan, V., Andersen, D.G., Ganger, G.R., Gibson, G.A., Seshan, S.: Measurement and analysis of TCP throughput collapse in cluster-based storage systems. In: Baker, M., Riedel, E. (eds.) Proceedings of the 6th USENIX Conference on File and Storage Technologies (FAST’08). USENIX Association, Berkeley, CA, USA, Article 12 (2008)

  16. Chen, Y., Griffith, R., Liu, J., Katz, R.H., Joseph, A.D.: Understanding TCP Incast throughput collapse in datacenter networks. In: Proceedings of the 1st ACM Workshop on Research on Enterprise Networking (WREN ’09), pp. 73–82. ACM, New York, NY, USA (2009)

  17. Vasudevan, V., Phanishayee, A., Shah, H., Krevat, E., Andersen, D.G., Ganger, G.R., Gibson, G.A., Mueller, B.: Safe and effective fine-grained TCP retransmissions for datacenter communication. In: Proceedings of the ACM SIGCOMM 2009 Conference on Data Communication (SIGCOMM ’09), pp. 303–314. ACM, New York, NY, USA (2009)

  18. Zhang, J., Ren, F., Tang, L., Lin, C.: Modeling and solving TCP Incast problem in data center networks. IEEE Trans. Parallel Distrib. Syst. 26(2), 478–491 (2015)

    Article  Google Scholar 

  19. Shukla, S., Chan, S., Tam, A.S.-W., Gupta, A., Xu, Y., Chao, H.J.: TCP PLATO: packet labelling to alleviate time-out. IEEE J. Sel. Areas Commun. 32(1), 65–76 (2014)

    Article  Google Scholar 

  20. Zhang, J., Ren, F., Tang, L., Lin, C.: Taming TCP incast throughput collapse in data center networks. In: 2013 21st IEEE International Conference on Network Protocols (ICNP), pp. 1–10, 7–10 Oct 2013

  21. Prakash, P., Dixit, A., Hu, Y.C., Kompella, R.: The TCP outcast problem: exposing unfairness in data center networks. In: Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation (NSDI’12), pp. 30–30. USENIX Association, Berkeley, CA, USA (2012)

  22. Qin, Y., Shi, Y., Sun, Q., Zhao, L.: Analysis for unfairness of TCP outcast problem in data center networks. In: 2013 25th International on Teletraffic Congress (ITC), pp. 1–4, 10–12 Sept 2013

  23. http://www.cs.cityu.edu.hk/~hxu/dcn.html

  24. Zats, D., Das, T., Mohan, P., Borthakur, D., Katz, R.: DeTail: reducing the flow completion time tail in datacenter networks. In: Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM ’12), pp. 139–150. ACM, New York, NY, USA (2012)

  25. Munir, A., Qazi, I.A., Uzmi, Z.A., Mushtaq, A., Ismail, S.N., Iqbal, M.S., Khan, B.: Minimizing flow completion times in data centers. In: INFOCOM, 2013 Proceedings IEEE, pp. 2157–2165, 14–19 April 2013

  26. https://tools.ietf.org/html/rfc1122

  27. Ming, L., Lukyanenko, A., Tarkoma, S., Yla-Jaaski, A.: MPTCP Incast in data center networks. Commun. China 11(4), 25–37 (2014)

    Article  Google Scholar 

  28. Alizadeh, M., Greenberg, A., Maltz, D.A., Padhye, J., Patel, P., Prabhakar, B., Sengupta, S., Sridharan, M.: Data center TCP (DCTCP). SIGCOMM Comput. Commun. Rev. 40(4), 63–74 (2010)

    Article  Google Scholar 

  29. Chen, W., Cheng, P., Ren, F., Shu, R., Lin, C.: Ease the queue oscillation: analysis and enhancement of DCTCP. In: 2013 IEEE 33rd International Conference on Distributed Computing Systems (ICDCS), pp. 450–459, 8–11 July 2013

  30. Das, T., Sivalingam, K.M.: TCP improvements for data center networks In: 2013 Fifth International Conference on Communication Systems and Networks (COMSNETS), pp. 1–10, 7–10 Jan 2013

  31. Zhang, J., Wen, J., Wang, J., Zhao, W.: TCP-FITDC: an adaptive approach to TCP Incast avoidance for data center applications. In: 2013 International Conference on Computing, Networking and Communications (ICNC), pp. 1048–1052, 28–31 Jan 2013

  32. Hwang, J., Yoo, J., Choi, N.: Deadline and Incast aware TCP for cloud data center networks. Comput. Netw. 68, 20–34 (2014)

    Article  Google Scholar 

  33. Wang, G., Ren, Y., Dou, K., Li, J.: IDTCP: an effective approach to mitigating the TCP Incast problem in data center networks. Inf. Syst. Front. 16(1), 35–44 (2014)

    Article  Google Scholar 

  34. Fang, S., Foh, C.H., Aung, K.M.M.: Prompt congestion reaction scheme for data center network using multiple congestion points. In: 2012 IEEE International Conference on Communications (ICC), pp. 2679–2683, 10–15 June 2012

  35. Haitao, W., Feng, Z., Guo, C., Zhang, Y.: ICTCP: Incast congestion control for TCP in data-center networks. IEEE/ACM Trans. Netw. 21(2), 345–358 (2013)

    Article  Google Scholar 

  36. Hwang, J., Yoo, J., Choi, N.: IA-TCP: a rate based incast-avoidance algorithm for TCP in data center networks. In: 2012 IEEE International Conference on Communications (ICC), pp. 1292–1296, 10–15 June 2012

  37. Zheng, F., Huang, Y., Sun, D.: Designing a new TCP based on FAST TCP for datacenter. In: 2014 IEEE International Conference on Communications (ICC), pp. 3209–3214, 10–14 June 2014

  38. http://onlinelibrary.wiley.com/doi/10.1002/ett.1485/abstract

  39. Jiang, C., Li, D., Mingwei, X.: LTTP: an LT-code based transport protocol for many-to-one communication in data centers. IEEE J. Sel. Areas Commun. 32(1), 52–64 (2014)

    Article  Google Scholar 

  40. Stephens, B., Cox, A.L., Singla, A., Carter, J., Dixon, C., Felter, W.: Practical DCB for improved data center networks. In: INFOCOM, 2014 Proceedings IEEE, pp. 1824–1832, 27 April–2 May 2014

  41. Bai, W., Chen, K., Wu, H., Lan, W., Zhao, Y.: PAC: taming TCP Incast congestion using proactive ACK control. In: 2014 IEEE 22nd International Conference on Network Protocols (ICNP), pp. 385–396, 21–24 Oct 2014

  42. Lee, C., Jang, K., Moon, S.: Reviving delay-based TCP for data centers. SIGCOMM Comput. Commun. Rev. 42(4), 111–112 (2012)

    Article  Google Scholar 

  43. Munir, A., Qazi, I.A., Bin Qaisar, S.: On achieving low latency in data centers. In: 2013 IEEE International Conference on Communications (ICC), pp. 3721–3725, 9–13 June 2013

  44. Alizadeh, M., Yang, S., Sharif, M., Katti, S., McKeown, N., Prabhakar, B., Shenker, S.: pFabric: minimal near-optimal datacenter transport. SIGCOMM Comput. Commun. Rev. 43(4), 435–446 (2013)

    Google Scholar 

  45. Hong, C.-Y., Caesar, M., Godfrey, P.B.: Finishing flows quickly with preemptive scheduling. In: Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM ’12), pp. 127–138. ACM, New York, NY, USA (2012)

  46. Wu, W., Chen, Y., Durairajan, R., Kim, D., Anand, A., Akella, A.: Adaptive data transmission in the cloud. In: 2013 IEEE/ACM 21st International Symposium on Quality of Service (IWQoS), pp. 1–10, 3–4 June 2013

  47. Ding, C., Rojas-Cessa, R.: DAQ: deadline-aware queue scheme for scheduling service flows in data centers. In: 2014 IEEE International Conference on Communications (ICC), pp. 2989–2994, 10–14 June 2014

  48. Vamanan, B., Hasan, J., Vijaykumar, T.N.: Deadline-aware datacenter TCP (D2TCP). In: Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM ’12), pp. 115–126. ACM, New York, NY, USA (2012)

  49. Chen, L., Hu, S., Chen, K., Wu, H., Tsang, D.H.K.: Towards minimal-delay deadline-driven data center TCP. In: Proceedings of the Twelfth ACM Workshop on Hot Topics in Networks (HotNets-XII). ACM, New York, NY, USA, Article 21 (2013)

  50. http://queue.acm.org/detail.cfm?id=2208919

  51. Haitao, W., Jiabo, J., Guohan, L., Guo, C., Xiong, Y., Zhang, Y.: Tuning ECN for data center networks. In: Proceedings of the 8th International Conference on Emerging Networking Experiments and Technologies (CoNEXT ’12), pp. 25–36. ACM, New York, NY, USA (2012)

Download references

Acknowledgments

The authors gratefully acknowledge the anonymous reviewers for their valuable comments and suggestions for improving the manuscript. This research was supported by the Korea Science and Engineering Foundation (KOSEF) grant funded by the Korea government (MOST) (No. 2011-0023177). This research was also supported by the MSIP (Ministry of Science, ICT & Future Planning), Korea, under the “Establishing IT Research Infrastructure Projects” supervised by the NIPA (National IT Industry Promotion Agency) (I2221-14-1005).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jae-il Jung.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sreekumari, P., Jung, Ji. Transport protocols for data center networks: a survey of issues, solutions and challenges. Photon Netw Commun 31, 112–128 (2016). https://doi.org/10.1007/s11107-015-0550-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11107-015-0550-y

Keywords

Navigation