Combining HW/SW Mechanisms to Improve NUMA Performance of Multi-GPU Systems | IEEE Conference Publication | IEEE Xplore