| Time-in-action RL |
| Jiangcheng Zhu1, Zhepei Wang1, Douglas Mcilwraith2, Chao Wu3, Chao Xu1, Yike Guo2 |
| 1 Institute of Cyber-Systems and Control, Department of Control Science and Engineering, Zhejiang University, Hangzhou, People's Republic of China |
| 2 Data Science Institute, Department of Computing, Imperial College London, London, UK |
| 3 School of Public Affairs, Zhejiang University, Hangzhou, People's Republic of China |