Hostname: page-component-7bb8b95d7b-495rp Total loading time: 0 Render date: 2024-09-12T07:01:24.507Z Has data issue: false hasContentIssue false

Dynamic simultaneous localization and mapping based on object tracking in occluded environment

Published online by Cambridge University Press:  27 May 2024

Weili Ding*
Affiliation:
School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei, China
Ziqi Pei
Affiliation:
School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei, China
Tao Yang
Affiliation:
School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei, China
Taiyu Chen
Affiliation:
School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei, China
*
Corresponding author: Weili Ding; Email: weiye51@ysu.edu.cn

Abstract

In practical applications, many robots equipped with embedded devices have limited computing capabilities. These limitations often hinder the performance of existing dynamic SLAM algorithms, especially when faced with occlusions or processor constraints. Such challenges lead to subpar positioning accuracy and efficiency. This paper introduces a novel lightweight dynamic SLAM algorithm designed primarily to mitigate the interference caused by moving object occlusions. Our proposed approach combines a deep learning object detection algorithm with a Kalman filter. This combination offers prior information about dynamic objects for each SLAM algorithm frame. Leveraging geometric techniques like RANSAC and the epipolar constraint, our method filters out dynamic feature points, focuses on static feature points for pose determination, and enhances the SLAM algorithm’s robustness in dynamic environments. We conducted experimental validations on the TUM public dataset, which demonstrated that our approach elevates positioning accuracy by approximately 54% and boosts the running speed by 75.47% in dynamic scenes.

Type
Research Article
Copyright
© The Author(s), 2024. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Cadena, C., Carlone, L., Carrillo, H., Latif, Y., Scaramuzza, D., Neira, J.é, Reid, I. and Leonard, J. J., “Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age,” IEEE Trans Robot 32(6), 13091332 (2016).CrossRefGoogle Scholar
Engelhard, N., Endres, F., Hess, J., Sturm, J. and Burgard, W., “Real-Time 3D Visual SLAM with a Hand-Held RGB-D Camera,” In: Proc. of the RGB-D Workshop on 3D Perception in Robotics at the European Robotics Forum, (2011) pp. 115.Google Scholar
Campos, C., Elvira, R., Rodriguez, J. J. G., M. Montiel, J. M. and D. Tardos, J., “ORB-SLAM3: An accurate open-source library for visual, visual–inertial, and multimap SLAM,” IEEE Trans Robot 37(6), 18741890 (2021).CrossRefGoogle Scholar
Sharafutdinov, D., Griguletskii, M., Kopanev, P., Kurenkov, M., Ferrer, G., Burkov, A., Gonnochenko, A. and Tsetserukou, D., “Comparison of modern open-source visual SLAM approaches,” J Intell Robot Syst 107(3), 43 (2023).CrossRefGoogle Scholar
Forster, C., Pizzoli, M. and Scaramuzza, D., “Svo: Fast Semi-Direct Monocular Visual Odometry,” In: 2014 IEEE international conference on robotics and automation (ICRA), (2014) pp. 1522.Google Scholar
Engel, J., Schöps, T. and Cremers, D., “LSD-SLAM: Large-Scale Direct Monocular Slam,” In: Computer Vision–ECCV 2014: 13th European Conference Part II, (2014) pp. 834849 Google Scholar
Saputra, M. R. U., Markham, A. and Trigoni, N., “Visual slam and structure from motion in dynamic environments: A survey,” ACM Comput Surv 51(2), 136 (2018).CrossRefGoogle Scholar
Wan Aasim, W. F. A., Okasha, M. and Faris, W. F., “Real-time artificial intelligence based visual simultaneous localization and mapping in dynamic environments–a review,” J Intell Robot Syst 105(1), 15 (2022).CrossRefGoogle Scholar
Barath, D., Cavalli, L. and Pollefeys, M., “Learning to Find Good Models in RANSAC,” In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2022) pp. 1574415753.Google Scholar
Chum, O.řej, Matas, J.ří and Kittler, J., “Locally Optimized RANSAC,” In: Pattern Recognition: 25th DAGM Symposium, (2003) pp. 236243.Google Scholar
Zhao, Y., Xiong, Z., Zhou, S., Peng, Z., Campoy, P. and Zhang, L., “KSF-SLAM: A key segmentation frame based semantic SLAM in dynamic environments,” J Intell Robot Syst 105(1), 3 (2022).CrossRefGoogle Scholar
Bescos, B., Fácil, J. M., Civera, J. and Neira, J., “Dynaslam: Tracking, mapping, and inpainting in dynamic scenes,” IEEE Robot Autom Lett 3(4), 40764083 (2018).CrossRefGoogle Scholar
Yu, C., Liu, Z., Liu, X.-J., Xie, F., Yang, Y., Wei, Q. and Fei, Q., “DS-SLAM: A Semantic Visual SLAM Towards Dynamic Environments,” In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), (2018) pp. 11681174.Google Scholar
Zhong, F., Wang, S., Zhang, Z. and Wang, Y., “Detect-SLAM: Making Object Detection and SLAM Mutually Beneficial.” In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), (2018) pp. 10011010.Google Scholar
Demim, F., Nemra, A., Boucheloukh, A., Louadj, K. and Bazoula, A., “Robust SVSF-SLAM Algorithm for Unmanned Vehicle in Dynamic Environment,” In: 2018 International Conference on Signal, Image, Vision and their Applications (SIVA), (2018) pp. 15.Google Scholar
Demim, F., Boucheloukh, A., Nemra, A., Kobzili, E., Hamerlain, M. and Bazoula, A., “An Adaptive SVSF-SLAM Algorithm in Dynamic Environment for Cooperative Unmanned Vehicles,” In: IFAC Symposium on Mechatronic Systems, (2020).CrossRefGoogle Scholar
Demim, F., Nemra, A., Boucheloukh, A., Kobzili, E., Hamerlain, M. and Bazoula, A., “SLAM based on adaptive SVSF for cooperative unmanned vehicles in dynamic environment,” IFAC-PapersOnLine 52(8), 7380 (2019).CrossRefGoogle Scholar
Demim, F., Nemra, A. and Louadj, K., “Robust SVSF-SLAM for unmanned vehicle in unknown environment,” IFAC-PapersOnLine 49(21), 386394 (2016).CrossRefGoogle Scholar
Thamrin, N. M., Arshad, N. H. M., Adnan, R., Sam, R. and Mahmud, S. F., “Simultaneous Localization and Mapping Based Real-Time Inter-Row Tree Tracking Technique for Unmanned Aerial vehicle,” In: IEEE International Conference on Control System, (2013).CrossRefGoogle Scholar
Kenye, L. and Kala, R., “Improving RGB-D SLAM in dynamic environments using semantic aided segmentation,” Robotica 40(6), 20652090 (2022).CrossRefGoogle Scholar
Scona, R., Jaimez, M., Petillot, Y. R., Fallon, M. and Cremers, D., “Staticfusion: Background Reconstruction for Dense RGB-D SLAM in Dynamic Environments,” In: 2018 IEEE international conference on robotics and automation (ICRA), (2018) pp. 38493856.Google Scholar
Runz, M., Buffier, M. and Agapito, L., “Maskfusion: Real-Time Recognition, Tracking and Reconstruction of Multiple Moving Objects,” In: 2018 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), (2018) pp. 1020.Google Scholar
Zhang, J., Henein, M., Mahony, R. and Ila, V., “Vdo-slam: A visual dynamic object-aware slam system, “ (2020). arXiv preprint arXiv: 2005.11052.Google Scholar
Ferrera, M., Eudes, A., Moras, J., Sanfourche, M. and Besnerais, G. L., “Ov $^{2}$ slam: A fully online and versatile visual slam for real-time applications,” IEEE Robot Autom Lett 6(2), 13991406 (2021).CrossRefGoogle Scholar
Rosten, E. and Drummond, T., “Machine Learning for High-Speed Corner Detection,” In: Computer Vision–ECCV 2006: 9th European Conference on Computer Vision, (2006) pp. 430443.Google Scholar
Lucas, B. D. and Kanade, T., “An Iterative Image Registration Technique with an Application to Stereo Vision,” In: IJCAI’81: 7th international joint conference on Artificial intelligence, (1981) pp. 674679.Google Scholar
Mur-Artal, R. and Tardós, J. D., “ORB-SLAM2: An open-source SLAM system for monocular, stereo, and RGB-D cameras,” IEEE Trans Robot 33(5), 12551262 (2017).CrossRefGoogle Scholar
Garcia-Fidalgo, E. and Ortiz, A., “An appearance-based loop-closure detection approach using incremental bags of binary words,” IEEE Robot Autom Lett 3(4), 30513057 (2018).CrossRefGoogle Scholar
Hartley, R. I., “In defense of the eight-point algorithm,” IEEE Trans Patt Anal Mach Intell 19(6), 580593 (1997).CrossRefGoogle Scholar
Zhao, L., Huang, S., Yan, L. and Dissanayake, G., “A new feature parametrization for monocular SLAM using line features,” Robotica 33(3), 513536 (2015).CrossRefGoogle Scholar