Closed
Labels
study (study research papers, etc.)
Description
Verify the convergence of Linear IRL [1, 2] and the conditions it requires.
- Characteristics of the method (e.g., online, model-free, linear)
- Convergence conditions
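
As a starting point for checking convergence numerically, here is a minimal sketch of the policy iteration in [1], assuming "IRL" here means integral reinforcement learning. Policy evaluation fits `P` by least squares from the interval relation x(t)^T P x(t) - x(t+T)^T P x(t+T) = ∫ x^T (Q + K^T R K) x dτ, and policy improvement sets K = R^{-1} B^T P. The function names, the RK4 simulator, and the random state resets (a stand-in for a persistently exciting trajectory) are assumptions made for illustration, not taken from the references. The matrix `A` is used only inside the simulator to generate data; the learning step itself uses `B` but not `A`, and the iteration assumes that `K0` is stabilizing.

```python
import numpy as np

def quad_basis(x):
    # Basis phi(x) such that x^T P x = p^T phi(x) for symmetric P,
    # where p stacks the upper-triangular entries of P (off-diagonals doubled in phi).
    n = len(x)
    return np.array([x[i] * x[j] * (1.0 if i == j else 2.0)
                     for i in range(n) for j in range(i, n)])

def unpack(p, n):
    # Rebuild the symmetric matrix P from its upper-triangular entries.
    P = np.zeros((n, n))
    k = 0
    for i in range(n):
        for j in range(i, n):
            P[i, j] = P[j, i] = p[k]
            k += 1
    return P

def simulate_interval(A, B, K, Q, R, x, T, steps=200):
    # RK4 integration of the closed loop x' = (A - B K) x together with
    # the running cost x^T (Q + K^T R K) x over one interval of length T.
    Ac = A - B @ K
    M = Q + K.T @ R @ K

    def f(z):
        xx = z[:-1]
        return np.append(Ac @ xx, xx @ M @ xx)

    z = np.append(x, 0.0)
    h = T / steps
    for _ in range(steps):
        k1 = f(z)
        k2 = f(z + 0.5 * h * k1)
        k3 = f(z + 0.5 * h * k2)
        k4 = f(z + h * k3)
        z = z + h / 6.0 * (k1 + 2 * k2 + 2 * k3 + k4)
    return z[:-1], z[-1]

def irl_policy_iteration(A, B, Q, R, K0, T=0.1, iters=10, seed=0):
    rng = np.random.default_rng(seed)
    n = A.shape[0]
    m = n * (n + 1) // 2          # number of unknowns in symmetric P
    K = K0.copy()
    for _ in range(iters):
        Phi, y = [], []
        for _ in range(3 * m):    # more samples than unknowns
            x0 = rng.standard_normal(n)   # assumed reset, replaces excitation
            x1, cost = simulate_interval(A, B, K, Q, R, x0, T)
            Phi.append(quad_basis(x0) - quad_basis(x1))
            y.append(cost)
        # Policy evaluation: least-squares solve for the entries of P.
        p, *_ = np.linalg.lstsq(np.array(Phi), np.array(y), rcond=None)
        P = unpack(p, n)
        # Policy improvement: K = R^{-1} B^T P (control law u = -K x).
        K = np.linalg.solve(R, B.T @ P)
    return P, K
```

One way to use the sketch is to run it on a small stabilizable system and check that the residual of the algebraic Riccati equation, A^T P + P A - P B R^{-1} B^T P + Q, shrinks toward zero and that A - B K stays Hurwitz at every iteration. Repeating the run with a non-stabilizing `K0`, or with fewer than n(n+1)/2 samples per iteration, shows how the iteration fails when those conditions are violated.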
Refs
[1] D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, “Adaptive optimal control for continuous-time linear systems based on policy iteration,” Automatica, vol. 45, no. 2, pp. 477–484, Feb. 2009, doi: 10.1016/j.automatica.2008.08.017.
[2] F. L. Lewis, D. Vrabie, and K. G. Vamvoudakis, “Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers,” IEEE Control Syst., vol. 32, no. 6, pp. 76–105, Dec. 2012, doi: 10.1109/MCS.2012.2214134.