Overview of Deep Reinforcement Learning Improvements and Applications

首页 > 成果 > 详情

认领

导出

Link by DOI

反馈

作者信息关键词期刊信息基础信息归属信息摘要

成果类型：

期刊论文

作者：

Zhang, Junjie;Zhang, Cong;Chien, Wei-Che

作者机构：

[Zhang, Junjie; Zhang, Cong] Wuhan Polytech Univ, Sch Math & Comp Sci, Wuhan, Peoples R China.

[Chien, Wei-Che] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Shoufeng Township, Hualien County, Taiwan.

语种：

英文

关键词：

Deep reinforcement learning;Value function;Policy gradient;Sparse reward

期刊：

JOURNAL OF INTERNET TECHNOLOGY

ISSN：

1607-9264

年：

2021

卷：

期：

页码：

239-255

DOI：

10.3966/160792642021032202002

基金类别：

Hubei Provincial Major Science and Technology Special Projects [2018ABA099]; Natural Science Foundation of Hubei ProvinceNatural Science Foundation of Hubei Province [2018CFB408]; National Natural Science Foundation of ChinaNational Natural Science Foundation of China (NSFC) [61272278]; innovation and education promotion fund of science and technology development center of Ministry of education in 2019 [2018A01038]; Wuhan Polytechnic University Talent Introduction (Training) Scientific Research Project [2019RZ02]

机构署名：

本校为第一机构

院系归属：

数学与计算机学院

摘要：

The deep reinforcement learning value has received a lot of attention from researchers since it was proposed. It combines the data representation capability of deep learning and the self-learning capability of reinforcement learning to give agents the ability to make direct action decisions on raw data. Deep reinforcement learning continuously optimizes the control strategy by using value function approximation and strategy search methods, ultimately resulting in an agent with a higher level of understanding of the target task. This paper provides a systematic description and summary of the co...

反馈

产权有误：本人成果被他人认领

数据有误：数据基本信息有误

归属有误：成果的院系归属、机构署名归属有误

其他原因：

验证码：

看不清楚，换一个

确定

取消

成果认领

标题：

用户	作者	通讯作者	--
	请选择	请选择	--

确定

取消

Overview of Deep Reinforcement Learning Improvements and Applications

反馈

成果认领

提示

该栏目需要登录且有访问权限才可以访问