Aodos: Affinity-aware Orchestration and Deterministic Operator Overlap for Simultaneous DNN Services in the GPU Cluster

Journal
Authors

Weihao Cui, Chunyu Xue, Han Zhao, Quan Chen, Minyi Guo

Published

31 October 2022

Publication details

Submitted to ACM Transactions on Computer Systems (TOCS)

Links