Dear all, I am confused about the difference between Approximate Dynamic Programming and Reinforcement Learning (such as QL) or approximated RL? you know, somebody told me that they are the same but RL/QL is in the context of computer science and ADP is in the context of operational research and engineering.

also, I've heard policy iteration and policy search keywords. are the different?

asked 16 Oct '17, 04:09

Mahdi%20Massahi's gravatar image

Mahdi Massahi
accept rate: 0%

edited 21 Oct '17, 04:29

Be the first one to answer this question!
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here



Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text]( "Title")
  • image?![alt text](/path/img.jpg "Title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported



Asked: 16 Oct '17, 04:09

Seen: 258 times

Last updated: 21 Oct '17, 04:29

OR-Exchange! Your site for questions, answers, and announcements about operations research.