A Dynamic Target Grasping Method for Manipulator Based on Deep Reinforcement Learning


Abstract: Existing methods for grasping dynamic targets with a manipulator suffer from difficult trajectory planning, insufficient real-time performance, and difficulty in achieving six-degree-of-freedom (6-DoF) grasping. To address these problems, a dynamic target grasping method for manipulators based on deep reinforcement learning (DRL) is proposed. The task is modeled as a Markov decision process (MDP), and the state space, action space, and reward function are designed so that the manipulator can perform 6-DoF grasping of dynamic targets. A simulation environment for dynamic target grasping is built in PyBullet and used to train the method. The trained policy is then tested in novel scenes and compared with a dynamic target grasping method based on classical planning and control. The simulation results show that the method achieves 6-DoF grasping of dynamic targets and has advantages in grasping success rate and speed.
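To make the MDP terminology in the abstract concrete, here is a minimal, purely illustrative sketch of how a dynamic grasping task can be cast as states, actions, and rewards. It is not the authors' formulation: the abstract does not give their state, action, or reward definitions, and their environment is built in PyBullet with 6-DoF grasp poses. Everything below (the class `ToyGraspEnv`, the position-only point gripper, the constant-velocity target, the distance-based reward, the numeric values, and the hand-written `greedy_policy` standing in for a trained DRL policy) is an assumption chosen for brevity.

```python
import math
from dataclasses import dataclass, field


@dataclass
class ToyGraspEnv:
    """Hypothetical kinematic MDP: a point gripper chases a moving target."""
    dt: float = 0.05             # control period in seconds (assumed)
    max_step: float = 0.03       # max gripper displacement per step, metres (assumed)
    grasp_radius: float = 0.02   # distance that counts as a successful grasp (assumed)
    horizon: int = 200           # episode length limit in steps (assumed)
    gripper: list = field(default_factory=lambda: [0.0, 0.0, 0.3])
    target: list = field(default_factory=lambda: [0.4, -0.2, 0.05])
    target_vel: tuple = (0.0, 0.1, 0.0)  # constant target velocity, m/s (assumed)
    t: int = 0

    def observe(self):
        # State: target position relative to the gripper, plus target velocity.
        rel = [p - g for p, g in zip(self.target, self.gripper)]
        return rel + list(self.target_vel)

    def step(self, action):
        # Action: desired gripper displacement, scaled down to max_step if too long.
        norm = math.sqrt(sum(a * a for a in action))
        scale = min(1.0, self.max_step / norm) if norm > 0 else 0.0
        self.gripper = [g + a * scale for g, a in zip(self.gripper, action)]
        # The target keeps moving regardless of what the gripper does.
        self.target = [p + v * self.dt for p, v in zip(self.target, self.target_vel)]
        self.t += 1
        dist = math.dist(self.gripper, self.target)
        success = dist < self.grasp_radius
        # Reward: dense distance penalty plus a sparse success bonus.
        reward = -dist + (10.0 if success else 0.0)
        done = success or self.t >= self.horizon
        return self.observe(), reward, done, success


def greedy_policy(obs, dt=0.05):
    # Stand-in for a learned policy: aim at the target's predicted next position.
    rel, vel = obs[:3], obs[3:]
    return [r + v * dt for r, v in zip(rel, vel)]


def run_episode(env, policy):
    # Roll out one episode and report (success flag, steps taken, total reward).
    obs = env.observe()
    total = 0.0
    while True:
        obs, reward, done, success = env.step(policy(obs))
        total += reward
        if done:
            return success, env.t, total
```

Calling `run_episode(ToyGraspEnv(), greedy_policy)` rolls out one episode. In a DRL setting, `greedy_policy` would be replaced by a neural network policy trained against the reward signal, and the observation and action would be extended to cover gripper orientation for 6-DoF grasping.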

Authors: Zhang Xuan; Lu Huimin; Ren Junkai; Mo Xinmin; Xiao Haoran; Zhang Weijie; Yang Xuan (Human Enhancement Technology Innovation Center, Northwest Institute of Mechanical & Electrical Engineering, Xianyang 712099, China; College of Intelligence Science and Technology, National University of Defense Technology, Changsha 410073, China)

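Affiliations: Human Enhancement Technology Innovation Center, Northwest Institute of Mechanical & Electrical Engineering; College of Intelligence Science and Technology, National University of Defense Technology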

Source: Ordnance Industry Automation (《兵工自动化》), Peking University Core Journal, 2024, No. 6, pp. 91-96 (6 pages)

Keywords: dynamic target grasping; Markov; trajectory planning; deep reinforcement learning; six-degree-of-freedom grasping

Classification: TP241 [Automation and Computer Technology: Detection Technology and Automation Devices]

