Robust speech interface based on audio and video information fusion for humanoid HRP-2 | IEEE Conference Publication | IEEE Xplore