Check the proof and assumptions for linear IRL

Linear IRL [1, 2] 의 수렴성 및 그에 필요한 조건을 확인한다

- [x] 기법 특성 (e.g. online, model-free, linear, etc.)
- [x] 수렴 조건


# Refs
[1] D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, “Adaptive optimal control for continuous-time linear systems based on policy iteration,” Automatica, vol. 45, no. 2, pp. 477–484, Feb. 2009, doi: 10.1016/j.automatica.2008.08.017.
[2] “Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers,” IEEE Control Syst., vol. 32, no. 6, pp. 76–105, Dec. 2012, doi: 10.1109/MCS.2012.2214134.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check the proof and assumptions for linear IRL #16

Refs

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Check the proof and assumptions for linear IRL #16

Description

Refs

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions