Access Full-Text Recommend to Your Library

Free Access

Open access articles are freely available for download

Add to Personal Library

Share

Share with Librarian Share with Colleague Fair Use Policy

More Information

Access on Platform
Favorite
Cite Article Cite Article

MLA

Yan, Zhongzhen, et al. "Deep Learning-Driven Robot Arm Control Fusing Convolutional Visual Perception and Predictive Modeling for Motion Planning." JOEUC vol.36, no.1 2024: pp.1-29. https://doi.org/10.4018/JOEUC.355191

APA

Yan, Z., Chang, Y., Yuan, L., Wei, F., Wang, X., Dong, X., & Han, H. (2024). Deep Learning-Driven Robot Arm Control Fusing Convolutional Visual Perception and Predictive Modeling for Motion Planning. Journal of Organizational and End User Computing (JOEUC), 36(1), 1-29. https://doi.org/10.4018/JOEUC.355191

Chicago

Yan, Zhongzhen, et al. "Deep Learning-Driven Robot Arm Control Fusing Convolutional Visual Perception and Predictive Modeling for Motion Planning," Journal of Organizational and End User Computing (JOEUC) 36, no.1: 1-29. https://doi.org/10.4018/JOEUC.355191

Export Reference

For Librarians

Deep Learning-Driven Robot Arm Control Fusing Convolutional Visual Perception and Predictive Modeling for Motion Planning

Zhongzhen Yan (Hubei University of Technology, China), Yiming Chang (Hubei University of Technology, China), Lukang Yuan (Hubei University of Technology, China), Feifei Wei (Hubei University of Economics, China), Xianglong Wang (Hubei University of Technology, China), Xinhua Dong (Hubei University of Technology, China), and Hongmu Han (Hubei University of Technology, China)

Source Title: Journal of Organizational and End User Computing (JOEUC) 36(1)

DOI: 10.4018/JOEUC.355191

Abstract

The wide application of robotic technology in various industries from industrial automation to medical assistance is gradually changing our production and lifestyle, and has attracted widespread attention in many fields. However, existing robotic control systems often grapple with limited flexibility and poor adaptability to complex environments, particularly in highly dynamic operational contexts. Against this backdrop, the integration of deep learning technologies offers new possibilities in enhancing robotic perception and decision-making, especially in visual perception and motion planning. To address these challenges, we have introduced a novel robotic arm control network model, MPC-WGAN-Faster R-CNN, which combines Model Predictive Control concepts with Wasserstein Generative Adversarial Networks and Faster R-CNN visual recognition technology. This integration aims to improve the precision and adaptability of robotic arm operations in complex environments.

Article Preview

Top

Introduction

In current society, robots play an increasingly important role in our daily life, and their application scope and research fields are also expanding and developing (Fong et al., 2003). In recent years, deep learning technology has taken an important position in robotics research (Janiesch et al., 2021); indeed, its rise has brought new hope to robotics research. Deep learning is a neural network-based approach that enables machines to better process and analyze data, giving robots stronger perception and decision-making capabilities that enable them to better adapt to different environments and tasks. In the field of robotics research, deep learning has been widely used to improve the performance of robots (Heo et al., 2019). In particular, convolutional visual perception (Ran et al., 2021) and prediction models (Dasari, Erbert et al., 2019) have become one of the focal points of research. These models can effectively use convolutional neural networks (CNNs) to extract key information about the environment from sensor data, including object detection, motion estimation, and environment modeling, which are crucial for robot motion planning and provide valuable information input. The continuous development and innovation in robotics have enabled them to tackle challenges and tasks in a variety of different fields. Whether automating manufacturing on industrial production lines, performing precise operations in medical surgeries, or even performing missions to explore uncharted territories in space exploration (Biswal & Mohanty, 2021), robots play a key role and have become an integral part of modern society.

However, as robots continue to expand in various application areas, a series of challenges and problems have emerged. These problems include, but are not limited to, how to enable robots to better perceive and understand their surroundings (Rubio et al., 2019), how to make intelligent decisions in complex and unknown situations (Herrera-Viedma et al., 2020), and how to achieve more natural human-robot interaction (Andronas et al., 2021). In addressing these challenges, existing studies still have some limitations, although computer models such as deep learning have made some progress in relevant aspects. These limitations include the real-time requirements for motion planning (Castillo-Lopez et al., 2020), the realism requirements for environment sensing data (Martinez-Gonzalez et al., 2020), and the requirements for target detection and tracking accuracy (Wu et al., 2022). Therefore, the field of robotics still requires continuous research and innovation to meet the growing demands and overcome these challenges, to promote a greater role for robots in various fields, and to ensure that they are able to safely and efficiently interact with human society.

To overcome these shortcomings, we have devised the innovative MPC-WGAN-faster R-CNN network model. We strategically chose model predictive control (MPC), Wasserstein generative adversarial networks (WGAN), and faster region CNN (faster R-CNN) for their synergistic potential to significantly improve our models performance across various dimensions. We selected MPC for its exceptional precision in real-time trajectory planning and adaptability, which allows for the efficient forecasting and adjustment of robot actions in dynamic environments. Then, we integrated WGAN to refine the generation of realistic and synthetic visual data, thereby enhancing the model’s visual perception training processes, which enriches the robot’s interpretative capabilities. Finally, we adopted faster R-CNN because it ensures fast and reliable recognition of objects within the environment, providing essential information for nuanced motion planning.

Complete Article List

Search this Journal:

Reset

Volume 38: 1 Issue (2026)

Volume 37: 1 Issue (2025)

Volume 36: 1 Issue (2024)

Volume 35: 3 Issues (2023)

Volume 34: 10 Issues (2022)

Volume 33: 6 Issues (2021)

Volume 32: 4 Issues (2020)

Volume 31: 4 Issues (2019)

Volume 30: 4 Issues (2018)

Volume 29: 4 Issues (2017)

Volume 28: 4 Issues (2016)

Volume 27: 4 Issues (2015)

Volume 26: 4 Issues (2014)

Volume 25: 4 Issues (2013)

Volume 24: 4 Issues (2012)

Volume 23: 4 Issues (2011)

Volume 22: 4 Issues (2010)

Volume 21: 4 Issues (2009)

Volume 20: 4 Issues (2008)

Volume 19: 4 Issues (2007)

Volume 18: 4 Issues (2006)

Volume 17: 4 Issues (2005)

Volume 16: 4 Issues (2004)

Volume 15: 4 Issues (2003)

Volume 14: 4 Issues (2002)

Volume 13: 4 Issues (2001)

Volume 12: 4 Issues (2000)

Volume 11: 4 Issues (1999)

Volume 10: 4 Issues (1998)

Volume 9: 4 Issues (1997)

Volume 8: 4 Issues (1996)

Volume 7: 4 Issues (1995)

Volume 6: 4 Issues (1994)

Volume 5: 4 Issues (1993)

Volume 4: 4 Issues (1992)

Volume 3: 4 Issues (1991)

Volume 2: 4 Issues (1990)

Volume 1: 3 Issues (1989)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Deep Learning-Driven Robot Arm Control Fusing Convolutional Visual Perception and Predictive Modeling for Motion Planning

Abstract

Introduction

Complete Article List