Inter-block GPU communication via fast barrier synchronization | IEEE Conference Publication | IEEE Xplore