Skip to main content

2022 | OriginalPaper | Buchkapitel

Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven

verfasst von : Shiqi Liu, Jiawei Chen, Bowen Zu, Xuehua Zhou, Zhiguo Zhou

Erschienen in: Methods and Applications for Modeling and Simulation of Complex Systems

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The application of deep reinforcement learning (DRL) for autonomous navigation of unmanned ground vehicle (UGV) has the problem of sparse rewards, which makes the trained algorithm model difficult to converge and cannot be transferred to real vehicles. In this regard, this paper proposes an effective exploratory learning autonomous navigation algorithm Double I-PPO, which designs pre-training behaviors based on imitation learning (IL) to guide UGV to try positive states, and introduces the intrinsic curiosity module (ICM) to generate intrinsic reward signals to encourage exploratory learning strategies. Build the training scene in Unity to evaluate the performance of the algorithm, and integrate the algorithm strategy into the motion planning stack of the ROS vehicle, so as to extend to the actual scene for testing. Experiments show that in the environment of random obstacles, the method does not need to rely on prior map information. Compared with similar DRL algorithms, the convergence speed is faster and the navigation success rate can reach more than 85%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Shi, H., Shi, L., Xu, M., et al.: End-to-end navigation strategy with deep reinforcement learning for mobile robots. IEEE Trans. Industr. Inf. 16(4), 2393–2402 (2019)CrossRef Shi, H., Shi, L., Xu, M., et al.: End-to-end navigation strategy with deep reinforcement learning for mobile robots. IEEE Trans. Industr. Inf. 16(4), 2393–2402 (2019)CrossRef
2.
Zurück zum Zitat Weiwei, Z., Wei, W., Nengcheng, C., et al.: Path planning strategies for UAV based on improved A* algorithm. Geomat. Inf. Sci. Wuhan Univ. 40(3), 315–320 (2015) Weiwei, Z., Wei, W., Nengcheng, C., et al.: Path planning strategies for UAV based on improved A* algorithm. Geomat. Inf. Sci. Wuhan Univ. 40(3), 315–320 (2015)
3.
Zurück zum Zitat Song, P., Huang, J., Mansaray, L.R., et al.: An improved soil moisture retrieval algorithm based on the land parameter retrieval model for water-land mixed pixels using AMSR-E data. IEEE Trans. Geosci. Remote Sens. 57(10), 7643–7657 (2019)CrossRef Song, P., Huang, J., Mansaray, L.R., et al.: An improved soil moisture retrieval algorithm based on the land parameter retrieval model for water-land mixed pixels using AMSR-E data. IEEE Trans. Geosci. Remote Sens. 57(10), 7643–7657 (2019)CrossRef
4.
Zurück zum Zitat Xinyi, Y., Yichen, Z., Liang, L., et al.: Dynamic window with virtual goal (DW-VG): a new reactive obstacle avoidance approach based on motion prediction. Robotica 37(8), 1438–1456 (2019)CrossRef Xinyi, Y., Yichen, Z., Liang, L., et al.: Dynamic window with virtual goal (DW-VG): a new reactive obstacle avoidance approach based on motion prediction. Robotica 37(8), 1438–1456 (2019)CrossRef
5.
Zurück zum Zitat Xu, Y., Liu, X., Hu, X., et al.: A genetic-algorithm-aided fuzzy chance-constrained programming model for municipal solid waste management. Eng. Optim. 24(5), 652–668 (2019)MathSciNetMATH Xu, Y., Liu, X., Hu, X., et al.: A genetic-algorithm-aided fuzzy chance-constrained programming model for municipal solid waste management. Eng. Optim. 24(5), 652–668 (2019)MathSciNetMATH
6.
Zurück zum Zitat Luo, Q., Wang, H., Zheng, Y., et al.: Research on path planning of mobile robot based on improved ant colony algorithm. Neural Comput. Appl. 32(6), 1555–1566 (2020)CrossRef Luo, Q., Wang, H., Zheng, Y., et al.: Research on path planning of mobile robot based on improved ant colony algorithm. Neural Comput. Appl. 32(6), 1555–1566 (2020)CrossRef
7.
Zurück zum Zitat Liu, J., Gao, F., Luo, X.: Survey of deep reinforcement learning based on value function and policy gradient. Chin. J. Comput. 42(6), 1406–1438 (2019) Liu, J., Gao, F., Luo, X.: Survey of deep reinforcement learning based on value function and policy gradient. Chin. J. Comput. 42(6), 1406–1438 (2019)
8.
Zurück zum Zitat Tai, L., Paolo, G., Liu, M.: Virtual-to-real deep reinforcement learning: continuous control of mobile robots for mapless navigation. In: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 31–36. IEEE (2017) Tai, L., Paolo, G., Liu, M.: Virtual-to-real deep reinforcement learning: continuous control of mobile robots for mapless navigation. In: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 31–36. IEEE (2017)
9.
Zurück zum Zitat Josef, S., Degani, A.: Deep reinforcement learning for safe local planning of a ground vehicle in unknown rough terrain. IEEE Robot. Autom. Lett. 5(4), 6748–6755 (2020)CrossRef Josef, S., Degani, A.: Deep reinforcement learning for safe local planning of a ground vehicle in unknown rough terrain. IEEE Robot. Autom. Lett. 5(4), 6748–6755 (2020)CrossRef
10.
Zurück zum Zitat Cèsar-Tondreau, B., Warnell, G., Stump, E., et al.: Improving autonomous robotic navigation using imitation learning. Front. Robot. AI 146 (2021) Cèsar-Tondreau, B., Warnell, G., Stump, E., et al.: Improving autonomous robotic navigation using imitation learning. Front. Robot. AI 146 (2021)
11.
Zurück zum Zitat Tai, L., Zhang, J., Liu, M., et al.: Socially compliant navigation through raw depth inputs with generative adversarial imitation learning. In: 2018 IEEE International Conference on Robotics and Automation (ICRA), pp. 1111–1117. IEEE (2018) Tai, L., Zhang, J., Liu, M., et al.: Socially compliant navigation through raw depth inputs with generative adversarial imitation learning. In: 2018 IEEE International Conference on Robotics and Automation (ICRA), pp. 1111–1117. IEEE (2018)
12.
Zurück zum Zitat Wei, P., Liang, R., Michelmore, A., et al.: Vision-based 2D navigation of unmanned aerial vehicles in riverine environments with imitation learning. J. Intell. Robot. Syst. 104(3), 1–19 (2022)CrossRef Wei, P., Liang, R., Michelmore, A., et al.: Vision-based 2D navigation of unmanned aerial vehicles in riverine environments with imitation learning. J. Intell. Robot. Syst. 104(3), 1–19 (2022)CrossRef
13.
Zurück zum Zitat Wu, Q., Gong, X., Xu, K., et al.: Towards target-driven visual navigation in indoor scenes via generative imitation learning. IEEE Robot. Autom. Lett. 6(1), 175–182 (2020)CrossRef Wu, Q., Gong, X., Xu, K., et al.: Towards target-driven visual navigation in indoor scenes via generative imitation learning. IEEE Robot. Autom. Lett. 6(1), 175–182 (2020)CrossRef
14.
Zurück zum Zitat Pathak, D., Agrawal, P., Efros, A.A., et al.: Curiosity-driven exploration by self-supervised prediction. In: International Conference on Machine Learning, pp. 2778–2787. PMLR (2017) Pathak, D., Agrawal, P., Efros, A.A., et al.: Curiosity-driven exploration by self-supervised prediction. In: International Conference on Machine Learning, pp. 2778–2787. PMLR (2017)
15.
Zurück zum Zitat Juliani, A., Berges, V.P., Teng, E., et al.: Unity: a general platform for intelligent agents. arXiv preprint arXiv:1809.02627 (2018) Juliani, A., Berges, V.P., Teng, E., et al.: Unity: a general platform for intelligent agents. arXiv preprint arXiv:​1809.​02627 (2018)
Metadaten
Titel
Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven
verfasst von
Shiqi Liu
Jiawei Chen
Bowen Zu
Xuehua Zhou
Zhiguo Zhou
Copyright-Jahr
2022
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-19-9198-1_46

Premium Partner