Home
Terminology in RL/AI and DP/Control
RL uses MAX/Value, DP uses Min/Cost
- Reward of a stage = (Opposite of) Cost of a stage.
- State value = (Opposite of) State cost.
- Value (or state-value) function = (Opposite of) Cost function.
The min. number of char. for a blog name is 4, so the webpage is https://websites.uta.edu/cost.
Zhenlin Pei | Department of Electrical Engineering
UT Arlington has top AI research ranking based on publications (2009-2019) at:
- Artificial Intelligence (ranks #30 in the USA)
- Robotics (ranks #50 in the USA)
- Machine Learning & Data Mining (ranks #60 in the USA)