Terminology in RL/AI and DP/Control

RL uses Max/Value, DP uses Min/Cost

  • Reward of a stage = negative (opposite) of the cost of a stage.
  • State value = negative (opposite) of the state cost.
  • Value (or state-value) function = negative (opposite) of the cost function.
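
The correspondence above can be checked numerically. The following is a minimal sketch, assuming a made-up deterministic problem with 3 states and 2 actions. The tables `next_state` and `cost`, the helper `backward_recursion`, and the horizon of 5 are all hypothetical and chosen only for illustration. It runs the same finite-horizon recursion twice, once minimizing costs (DP/Control convention) and once maximizing rewards (RL/AI convention), and compares the results.

```python
# Hypothetical 3-state, 2-action deterministic problem (illustrative numbers only).
# next_state[s][a] is the successor of state s under action a.
# cost[s][a] is the stage cost of taking action a in state s.
next_state = [[1, 2], [2, 0], [2, 2]]
cost = [[1.0, 4.0], [2.0, 0.5], [0.0, 0.0]]

# RL/AI convention: the stage reward is the negative of the stage cost.
reward = [[-c for c in row] for row in cost]


def backward_recursion(stage, pick, horizon):
    """Finite-horizon recursion; pick=min for costs, pick=max for rewards."""
    values = [0.0] * len(stage)  # terminal cost/value assumed to be zero
    for _ in range(horizon):
        values = [
            pick(
                stage[s][a] + values[next_state[s][a]]
                for a in range(len(stage[s]))
            )
            for s in range(len(stage))
        ]
    return values


cost_to_go = backward_recursion(cost, min, horizon=5)      # DP: minimize cost
state_value = backward_recursion(reward, max, horizon=5)   # RL: maximize value

for s, (j, v) in enumerate(zip(cost_to_go, state_value)):
    print(f"state {s}: cost-to-go = {j}, value = {v}")
```

In this sketch each state value is the negative of the corresponding cost-to-go, so the minimizing and maximizing formulations describe the same solution with opposite signs.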

The minimum number of characters for a blog name is 4, so the webpage is https://websites.uta.edu/cost.


Zhenlin Pei | Department of Electrical Engineering

https://websites.uta.edu/zpei


UT Arlington has a top AI research ranking based on publications (2009-2019) at: