Skip to content

Check the proof and assumptions for linear IRL #16

@JinraeKim

Description

@JinraeKim

Linear IRL [1, 2] 의 수렴성 및 그에 필요한 조건을 확인한다

  • 기법 특성 (e.g. online, model-free, linear, etc.)
  • 수렴 조건

Refs

[1] D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, “Adaptive optimal control for continuous-time linear systems based on policy iteration,” Automatica, vol. 45, no. 2, pp. 477–484, Feb. 2009, doi: 10.1016/j.automatica.2008.08.017.
[2] “Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers,” IEEE Control Syst., vol. 32, no. 6, pp. 76–105, Dec. 2012, doi: 10.1109/MCS.2012.2214134.

Metadata

Metadata

Assignees

Labels

studystudy research papers, etc.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions