Paper
20 October 2022 Visual odometry based on camera motion and bidirectional long short-term memory network
Zhangzhen Zhao, Bin Xing, Tao Song, Yi Liu, Bin Chen
Author Affiliations +
Proceedings Volume 12451, 5th International Conference on Computer Information Science and Application Technology (CISAT 2022); 124514N (2022) https://doi.org/10.1117/12.2656772
Event: 5th International Conference on Computer Information Science and Application Technology (CISAT 2022), 2022, Chongqing, China
Abstract
Most of the deep learning-based visual odometers have defects such as being unable to effectively learn camera motion information directly from RGB images, and low pose accuracy in the face of vehicle turning, high-speed driving and other scenes with dramatic changes in motion. Aiming at the above problems, this paper proposes a monocular visual odometry method that combines the camera's own motion estimation and a bidirectional long-short-term memory network. Firstly, a camera motion based on a gated recurrent convolutional neural network is proposed by introducing a 4D correlation volume of pixel pairs. Feature extraction algorithm to extract the motion features of adjacent frames of the camera; Then, a monocular visual odometry combining motion features and bidirectional long-short-term memory network is proposed. By learning the geometric motion relationship between adjacent frames before and after, more accurate six-degree-of-freedom camera pose information can be obtained. Comparative experiments on the KITTI dataset show that the proposed algorithm can estimate the pose with higher accuracy.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Zhangzhen Zhao, Bin Xing, Tao Song, Yi Liu, and Bin Chen "Visual odometry based on camera motion and bidirectional long short-term memory network", Proc. SPIE 12451, 5th International Conference on Computer Information Science and Application Technology (CISAT 2022), 124514N (20 October 2022); https://doi.org/10.1117/12.2656772
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Cameras

Visualization

Optical flow

Feature extraction

Motion models

Convolutional neural networks

Error analysis

Back to Top