Actor-critic reinforcement learning agent
Actor-critic (AC) agents implement actor-critic algorithms such as A2C and A3C, which are model-free, online, on-policy reinforcement learning methods. The actor-critic agent optimizes the policy (actor) directly and uses a critic to estimate the return or future rewards. The action space can be either discrete or continuous.
For more information, see Actor-Critic Agents. For more information on the different types of reinforcement learning agents, see Reinforcement Learning Agents.
creates an actor-critic agent for an environment with the given observation and action
specifications, using default initialization options. The actor and critic
representations in the agent use default deep neural networks built from the observation
specification agent
= rlACAgent(observationInfo
,actionInfo
)observationInfo
and the action specification
actionInfo
.
creates an actor-critic agent for an environment with the given observation and action
specifications. The agent uses default networks in which each hidden fully connected
layer has the number of units specified in the agent
= rlACAgent(observationInfo
,actionInfo
,initOpts
)initOpts
object.
Actor-critic agents do not support recurrent neural networks. For more information on
the initialization options, see rlAgentInitializationOptions
.
creates an actor-critic agent and sets the AgentOptions
property to the agent
= rlACAgent(___,agentOptions
)agentOptions
input argument. Use this syntax after
any of the input arguments in the previous syntaxes.
train | Train reinforcement learning agents within a specified environment |
sim | Simulate trained reinforcement learning agents within specified environment |
getAction | Obtain action from agent or actor representation given environment observations |
getActor | Get actor representation from reinforcement learning agent |
setActor | Set actor representation of reinforcement learning agent |
getCritic | Get critic representation from reinforcement learning agent |
setCritic | Set critic representation of reinforcement learning agent |
generatePolicyFunction | Create function that evaluates trained policy of reinforcement learning agent |
For continuous action spaces, the rlACAgent
object does not enforce
the constraints set by the action specification, so you must enforce action space
constraints within the environment.
Deep Network Designer | rlACAgentOptions
| rlAgentInitializationOptions
| rlStochasticActorRepresentation
| rlValueRepresentation