版权说明 操作指南
首页 > 成果 > 详情

Overview of Deep Reinforcement Learning Improvements and Applications

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
期刊论文
作者:
Zhang, Junjie;Zhang, Cong;Chien, Wei-Che
作者机构:
[Zhang, Junjie; Zhang, Cong] Wuhan Polytech Univ, Sch Math & Comp Sci, Wuhan, Peoples R China.
[Chien, Wei-Che] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Shoufeng Township, Hualien County, Taiwan.
语种:
英文
关键词:
Deep reinforcement learning;Value function;Policy gradient;Sparse reward
期刊:
JOURNAL OF INTERNET TECHNOLOGY
ISSN:
1607-9264
年:
2021
卷:
22
期:
2
页码:
239-255
基金类别:
Hubei Provincial Major Science and Technology Special Projects [2018ABA099]; Natural Science Foundation of Hubei ProvinceNatural Science Foundation of Hubei Province [2018CFB408]; National Natural Science Foundation of ChinaNational Natural Science Foundation of China (NSFC) [61272278]; innovation and education promotion fund of science and technology development center of Ministry of education in 2019 [2018A01038]; Wuhan Polytechnic University Talent Introduction (Training) Scientific Research Project [2019RZ02]
机构署名:
本校为第一机构
院系归属:
数学与计算机学院
摘要:
The deep reinforcement learning value has received a lot of attention from researchers since it was proposed. It combines the data representation capability of deep learning and the self-learning capability of reinforcement learning to give agents the ability to make direct action decisions on raw data. Deep reinforcement learning continuously optimizes the control strategy by using value function approximation and strategy search methods, ultimately resulting in an agent with a higher level of understanding of the target task. This paper provides a systematic description and summary of the co...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com