[go: up one dir, main page]

skip to main content
10.1145/3404397.3404454acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article

Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric

Published: 17 August 2020 Publication History

Abstract

Recently, numerous novel transport protocols are proposed for the low latency of applications deployed in data center networks, e.g., web search and retail recommendation system. The state-of-art receiver-driven protocols, e.g., Homa and NDP, show the superior performance for achieving the lowest possible latency. However, Homa assumes that the core layer in data center network has no congestion, which limits its application for the existing over-subscribed networks. NDP requires to modify the switch hardware since it trims packets to headers when the packets cause the switch buffer to overflow, resulting in high deployment cost. In this paper, we present Polo to realize low latency for flows over commodity network fabric relying on Explicit Congestion Notification (ECN) and priority queues. According to packets with ECN marking, the Polo receiver obtains the congestion information such that it dynamically adjusts the number of data packets in network for maintaining the small switch queue. The adjustment is carried out periodically. The time interval is determined by keeping the extra high priority packet always in flight nor by a fine-grained timer. Further, Polo designs the packet recovery mechanisms to retransmit the lost packets as soon as possible. Simulation experiment results show that Polo outperforms the state-of-art receiver-driven protocols in a wide range of scenarios including incast.

References

[1]
[1] M. Alizadeh, A. Greenberg, D. Maltz, J. Padhye, P. Patel, B. Prabhakar, S. Sengupta, and M. Sridharan, “Data Center TCP (DCTCP)”, in Proceedings of the SIGCOMM, pp. 63-74, 2010.
[2]
[2] P. Cheng, F. Ren, R. Shu, and C. Lin, “Catch the Whole Lot in an Action: Rapid Precise Packet Loss Notification in Data Center”, in Proceedings of the NSDI, pp. 17-28, 2014.
[3]
[3] R. Mittal, V. T. Lam, N. Dukkipati, E. Blem, H. Wassel, M. Ghobadi, A. Vahdat, Y. Wang, D. Wetherall, and D. Zats, “TIMELY: RTT-based Congestion Control for the Datacenter”, in Proceedings of the SIGCOMM, pp. 537-550, 2015.
[4]
[4] L. Chen, K. Chen, W. Bai, M. Alizadeh, and D. Zats, “Scheduling Mix-flows in Commodity Datacenters with Karuna”, in Proceedings of the SIGCOMM, pp. 174-187, 2016.
[5]
[5] M. Handley, C. Raiciu, A. Agache, A. Voinescu, A. W. Moore, G. Antichi, and M. Wójcik, “Re-architecting datacenter networks and stacks for low latency and high performance”, in Proceedings of the SIGCOMM, pp. 29-42, 2017.
[6]
[6] B. Montazeri, Y. Li, M. Alizadeh, and J. Ousterhout, “Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities”, in Proceedings of the SIGCOMM, pp. 221-235, 2018.
[7]
[7] D. Nagle, T. Serenyi, and A. Matthews, “The panasas activescale storage cluster: Delivering scalable high bandwidth storage”, in Proceedings of the ACM/IEEE Supercomputing, pp. 53-62, 2004.
[8]
[8] S. Wang, J. Zhang, T. Huang, T. Pan, J. Liu, Y. Liu, J. Li, and F. Li, “Skipping congestion-links for coflow scheduling”, in Proceedings of the IWQoS, pp. 1-10, 2017.
[9]
[9] H. Zhang, J. Zhang, W. Bai, K. Chen, and M. Chowdhury, “Resilient Datacenter Load Balancing in the Wild”, in Proceedings of the SIGCOMM, pp. 253-266, 2017.
[10]
[10] C. Hopps, “Analysis of an Equal-Cost Multi-Path Algorithm”, RFC 3782, 2004.
[11]
[11] C. Raiciu, S. Barre, C. Pluntke, A. Greenhalgh, D. Wischik, and M. Handley, “Improving Datacenter Performance and Robustness with Multipath TCP”, in Proceedings of the ACM SIGCOMM, pp. 266-277, 2011.
[12]
[12] P. X. Gao, A. Narayan, G. Kumar, R. Agarwal, S. Ratnasamy and S. Shenker, “pHost: Distributed Near-Optimal Datacenter Transport Over Commodity Network Fabric”, in Proceedings of the CoNext, pp. 1–12, 2015.
[13]
[13] I. Cho, K. Jang, and D. Han, “Credit-scheduled delay-bounded congestion control for datacenters”, in Proceedings of the ACM SIGCOMM, pp. 239–252, 2017.
[14]
[14] A. Dixit, P. Prakash, Y. C. Hu, and R. Kompella, “On the Impact of Packet Spraying in Data Center Networks”, in Proceedings of the INFOCOM, pp. 2130–2138, 2013.
[15]
[15] M. Alizadeh, S. Yang, M. Sharif, and S. Katti, “pFabric: Minimal Near-Optimal Datacenter Transport”, in Proceedings of the ACM SIGCOMM, pp. 435–446, 2013.
[16]
[16] Y. Zhu, H. Eran, D. Firestone, C. Guo, M. Lipshteyn, Y. Liron, J. Padhye, S. Raindel, M. H. Yahia, and M. Zhang, “Congestion Control for Large-Scale RDMA Deployments”, in Proceedings of the ACM SIGCOMM, pp. 523-536, 2015.
[17]
[17] W. Bai, L. Chen, K. Chen, D. Han, C. Tian, and H. Wang, “PIAS: Practical Information-Agnostic Flow Scheduling for Commodity Data Centers”, IEEE/ACM TRANSACTIONS ON NETWORKING, vol. 25, no. 4, pp. 1954-1967, 2017.
[18]
[18] C. Hong, M. Caesar and P. B. Godfrey, “Finishing Flows Quickly with Preemptive Scheduling”, in Proceedings of the ACM SIGCOMM, pp. 127-138, 2012.
[19]
[19] Y. Li, R. Miao, H. H. Liu, Y. Zhuang, F. Feng, L. Tang, Z. Cao, and M. Zhang, “HPCC: High Precision Congestion Control”, in Proceedings of the ACM SIGCOMM, pp. 44-58, 2019.
[20]
[20] H. Wu, Z. Feng, C. Guo, and Y. Zhang, “ICTCP: Incast Congestion Control for TCP in Data-Center Networks”, IEEE/ACM Transactions on Networking, vol. 21, no. 2, pp. 345-358, 2013.
[21]
[21] J. Huang, Y. Huang, J. Wang, and T. He, “Packet Slicing for Highly Concurrent TCPs in Data Center Networks with COTS Switches”, in Proceedings of the ICNP,pp. 22-31, 2015.
[22]
[22] V. Vasudevan, A. Phanishayee, J. Shah, E. Krevat, D. G. Andersen, G. R. Ganger, G. A. Glibson, and B.  Mueller, “Safe and effective fine-grained tcp retransmissions for datacenter communication”, in Proceedings of the ACM SIGCOMM, pp. 303–314, 2009.
[23]
[23] J. Perry, A. Ousterhout, H. Balakrishnan, D. Shah, and H. Fugal, “Fastpass: a centralized “zero-queue” datacenter network”, in Proceedings of the ACM SIGCOMM, pp. 307-318, 2014.
[24]
[24] L. Chen, J. Lingys, K. Chen, and F. Liu, “AuTO: Scaling Deep Reinforcement Learning for Datacenter-Scale Automatic Traffic Optimization”, in Proceedings of the ACM SIGCOMM, pp. 191-205, 2018.
[25]
[25] B. Vamanan, J. Hasan, and T. Vijaykumar, “Deadline-Aware Datacenter TCP (D2TCP)”, in Proceedings of the ACM SIGCOMM, pp. 115-126, 2012.
[26]
[26] S. Lee, M. Lee, D. Lee, H. Jung and B. Lee, “TCPRand: Randomizing TCP payload size for TCP fairness in data center networks”, in Proceedings of the INFOCOM, pp. 1697-1705, 2015.
[27]
[27] V. Misra, W. Gong, D. Towsley, “Fluid-based analysis of a network of AQM routers supporting TCP flows with an application to RED”, in Proceedings of the ACM SIGCOMM, pp. 151–160, 2000.
[28]
[28] Y. Cheng, J. Chu, S. Radhakrishnan, and A. Jain, “TCP fast open”’ RFC 7413, 2014.
[29]
[29] IEEE DCB, “802.3bd - MAC Control Frame for Priority-based Flow Control Project”, http://www.ieee802.org/3/bd/, 2010.
[30]
[30] A. Romanow and S. Floyd, “Dynamics of TCP traffic over ATM networks”, in Proceedings of the ACM SIGCOMM, pp. 79-88, 1994.
[31]
[31] R. Nishtala, H. Fugal, S. Grimm, M. Kwiatkowski, H. Lee, H. C. Li, R. McElroy, M. Paleczny, D. Peek, P. Saab, D. Stafford, T. Tung, and V. Venkataramani, “Scaling Memcache at Facebook”, in Proceedings of the NSDI, pp. 385-398, 2013.

Cited By

View all
  • (2023)REN: Receiver-Driven Congestion Control Using Explicit Notification for Data CenterIEEE Transactions on Cloud Computing10.1109/TCC.2021.313502711:2(1381-1394)Online publication date: 1-Apr-2023
  • (2022)DC4: Reconstructing Data-Credit-Coupled Congestion Control for Data CentersProceedings of the 51st International Conference on Parallel Processing10.1145/3545008.3545023(1-11)Online publication date: 29-Aug-2022
  • (2022)ERA: Meeting the Fairness between Sender-driven and Receiver-driven Transmission Protocols in Data Center Networks2022 IEEE 42nd International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS54860.2022.00036(291-301)Online publication date: Jul-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICPP '20: Proceedings of the 49th International Conference on Parallel Processing
August 2020
844 pages
ISBN:9781450388160
DOI:10.1145/3404397
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 August 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Data Center Networks
  2. ECN
  3. priority
  4. protocol

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICPP '20

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)REN: Receiver-Driven Congestion Control Using Explicit Notification for Data CenterIEEE Transactions on Cloud Computing10.1109/TCC.2021.313502711:2(1381-1394)Online publication date: 1-Apr-2023
  • (2022)DC4: Reconstructing Data-Credit-Coupled Congestion Control for Data CentersProceedings of the 51st International Conference on Parallel Processing10.1145/3545008.3545023(1-11)Online publication date: 29-Aug-2022
  • (2022)ERA: Meeting the Fairness between Sender-driven and Receiver-driven Transmission Protocols in Data Center Networks2022 IEEE 42nd International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS54860.2022.00036(291-301)Online publication date: Jul-2022
  • (2021)SCDP: Systematic Rateless Coding for Efficient Data Transport in Data CentersIEEE/ACM Transactions on Networking10.1109/TNET.2021.309838629:6(2723-2736)Online publication date: 15-Dec-2021
  • (2021)Analysis and improvement of the latency-based congestion control algorithm DXFuture Generation Computer Systems10.1016/j.future.2021.05.004123(206-218)Online publication date: Oct-2021

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media