Successive Over-Relaxation Q-Learning-MedSci.cn - 梅斯（MedSci）

Successive Over-Relaxation Q-Learning

Kamanchi, C; Diddigi, RB; Bhatnagar, S

Kamanchi, C (corresponding author), Indian Inst Sci, Dept Comp Sci & Automat, Bengaluru 560012, India.

IEEE CONTROL SYSTEMS LETTERS, 2020; 4 (1): 55

Abstract

In a discounted reward Markov decision process (MDP), the objective is to find the optimal value function, i.e., the value function corresponding to a......

Full Text Link

Links

期刊讨论 | 中国SCI论文 | 期刊主页 | 投稿经验 | 杂志官网 | 投稿链接 | 作者需知 | PMC链接 | Pubmed全文检索

科室
- - 订阅+
  - 更多科室
工具
服务