|
|
时效性决策RL |
Jiangcheng Zhu1 , Zhepei Wang1 , Douglas Mcilwraith2 , Chao Wu3 , Chao Xu1 , Yike Guo2 |
1 Institute of Cyber-Systems and Control, Department of Control Science and Engineering, Zhejiang University, Hangzhou, People's Republic of China
2 Data Science Institute, Department of Computing, Imperial College London, London, UK
3 School of Public Affairs, Zhejiang University, Hangzhou, People's Republic of China
|
|
Time-in-action RL |
Jiangcheng Zhu1 , Zhepei Wang1 , Douglas Mcilwraith2 , Chao Wu3 , Chao Xu1 , Yike Guo2 |
1
Institute of Cyber-Systems and Control, Department of Control Science and Engineering, Zhejiang University, Hangzhou, People's Republic of
China
2 Data Science Institute, Department of Computing, Imperial College London, London, UK
3 School of Public Affairs, Zhejiang University, Hangzhou, People's Republic of China
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|