
iBalancer: Load-Aware In-Server Flow Scheduling for Sub-Millisecond Tail Latency

  • Qi Zhang
  • Yi Liu*
  • Tao Liu
  • *Corresponding author for this work
  • Beihang University
  • Qilu University of Technology

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Achieving microsecond-scale tail latency poses an extreme challenge to the conventional 'NIC-OS-Application' architecture under highly concurrent requests. Existing kernel-bypass network systems improve this situation significantly. Still, they cannot achieve load-aware in-server request distribution, which not only harms resource efficiency but, more importantly, defeats the goal of squeezing tail latency. This paper proposes iBalancer, an in-server proactive load balancer for kernel-bypass systems, which aggressively handles NIC-side flow scheduling according to the load of threads on the processor side. Furthermore, we propose a novel metric, 'polling time interval (PTI)', to quantify the load of worker threads; it not only indicates the utilization of the core bound to the worker thread but also reflects the differences in the processing time of different flows. By scheduling flows according to PTI, iBalancer tends to even out the queueing latencies of different flows, such as Set and Get operations in an in-memory key-value store. In addition, by decoupling flow scheduling from packet steering, iBalancer achieves tail-latency-aware flow-to-core binding and preserves hardware-based request distribution among cores. The proposed system is evaluated and compared to mTCP and Shenango using two representative microsecond-scale network applications: the Memcached KVS and a real-time deep-learning-based financial fraud identification application. Experimental results show that, on Memcached, iBalancer can process up to 4.75× and 1.55× higher load than mTCP and Shenango, respectively, under a 500 μs 99th-percentile tail latency limit. For the financial fraud identification application, iBalancer is able to process 4.56× and 1.16× higher load than mTCP and Shenango, respectively, at a 900 μs tail latency.
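
To make the PTI idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes that a worker's PTI is the time between two consecutive polls of its receive queue, that a longer PTI means a busier worker, and that a new flow is bound to the worker with the smallest PTI. The `Worker` class, the `record_poll` and `assign_flow` helpers, and the exponential smoothing factor `alpha` are all hypothetical names and choices introduced for illustration; the abstract does not specify any of them.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Worker:
    """Hypothetical worker thread pinned to one core (illustrative only)."""
    core_id: int
    last_poll: Optional[float] = None   # timestamp of the previous poll, seconds
    pti: float = 0.0                    # smoothed polling time interval, seconds
    flows: List[int] = field(default_factory=list)

    def record_poll(self, now: float, alpha: float = 0.5) -> None:
        """Update the smoothed PTI when the worker polls its queue at `now`.

        Assumption: smoothing with an exponential moving average; the first
        measured interval seeds the estimate.
        """
        if self.last_poll is not None:
            interval = now - self.last_poll
            if self.pti == 0.0:
                self.pti = interval
            else:
                self.pti = alpha * interval + (1 - alpha) * self.pti
        self.last_poll = now


def assign_flow(workers: List[Worker], flow_id: int) -> Worker:
    """Bind a new flow to the worker with the smallest PTI (assumed least loaded)."""
    target = min(workers, key=lambda w: w.pti)
    target.flows.append(flow_id)
    return target


# Usage: worker 0 polls every 10 microseconds, worker 1 every 40 microseconds,
# so worker 1 spends longer processing between polls and is treated as busier.
workers = [Worker(core_id=0), Worker(core_id=1)]
for i in range(5):
    workers[0].record_poll(i * 10e-6)
    workers[1].record_poll(i * 40e-6)

chosen = assign_flow(workers, flow_id=7)
print(chosen.core_id)
```

In this sketch a worker that has not yet polled twice keeps a PTI of zero and therefore looks idle; a real scheduler would need an explicit rule for such workers.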

Original language: English
Pages (from-to): 1761-1774
Number of pages: 14
Journal: IEEE Transactions on Parallel and Distributed Systems
Volume: 33
Issue number: 8
DOI
Publication status: Published - 1 Aug 2022

UN Sustainable Development Goals

This output contributes to the following Sustainable Development Goals:

  1. SDG 8 - Decent Work and Economic Growth
  2. SDG 12 - Responsible Consumption and Production
