Create Markov decision process environment for reinforcement learning
A Markov decision process (MDP) is a discrete-time stochastic control process. It
provides a mathematical framework for modeling decision making in situations where outcomes
are partly random and partly under the control of the decision maker. MDPs are useful for
studying optimization problems solved using reinforcement learning. Use rlMDPEnv
to create a Markov decision process environment for reinforcement learning in MATLAB®.
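
As a minimal sketch of how such an environment might be put together, the example below builds a small MDP model and wraps it with rlMDPEnv. It assumes the Reinforcement Learning Toolbox function createMDP (not mentioned above) is used to build the model, and the three states, two actions, transition probabilities, and rewards are all hypothetical values chosen only for illustration.

```matlab
% Hypothetical 3-state, 2-action MDP; every number here is illustrative.
% createMDP names the states "s1", "s2", "s3" by default.
MDP = createMDP(3, ["left"; "right"]);

% T(s, sNext, a): probability of moving from state s to sNext under action a.
% Each non-terminal state/action pair is given probabilities that sum to 1.
MDP.T(1, [1 2], 1) = [0.9 0.1];   % "left" from s1
MDP.T(1, [2 3], 2) = [0.8 0.2];   % "right" from s1
MDP.T(2, [1 2], 1) = [0.7 0.3];   % "left" from s2
MDP.T(2, 3, 2)     = 1;           % "right" from s2

% R(s, sNext, a): reward received for the corresponding transition.
MDP.R(1, 2, 2) = 1;
MDP.R(1, 3, 2) = 10;
MDP.R(2, 3, 2) = 10;

% Episodes end when the terminal state is reached.
MDP.TerminalStates = "s3";

% Wrap the model in a reinforcement learning environment.
env = rlMDPEnv(MDP);

% Inspect the observation and action specifications.
obsInfo = getObservationInfo(env);
actInfo = getActionInfo(env);

% Optionally check that the environment resets and steps without error.
validateEnvironment(env);
```

The resulting env can then be passed to train or sim together with an agent whose observation and action specifications match obsInfo and actInfo.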
| Function | Description |
| getActionInfo | Obtain action data specifications from reinforcement learning environment or agent |
| getObservationInfo | Obtain observation data specifications from reinforcement learning environment or agent |
| sim | Simulate trained reinforcement learning agents within specified environment |
| train | Train reinforcement learning agents within a specified environment |
| validateEnvironment | Validate custom reinforcement learning environment |