ActBERT: Learning Global-Local Video-Text Representations | IEEE Conference Publication | IEEE Xplore

ActBERT: Learning Global-Local Video-Text Representations | IEEE Conference Publication | IEEE Xplore