Twin-delayed deep deterministic policy gradient reinforcement learning agent
The twin-delayed deep deterministic policy gradient (DDPG) algorithm is an actor-critic, model-free, online, off-policy reinforcement learning method which computes an optimal policy that maximizes the long-term reward. The action space can only be continuous.
Use rlTD3Agent
to create one of the following types of agents.
Twin-delayed deep deterministic policy gradient (TD3) agent with two Q-value functions. This agent prevents overestimation of the value function by learning two Q value functions and using the minimum values for policy updates.
Delayed deep deterministic policy gradient (delayed DDPG) agent with a single Q value function. This agent is a DDPG agent with target policy smoothing and delayed policy and target updates.
For more information, see Twin-Delayed Deep Deterministic Policy Gradient Agents. For more information on the different types of reinforcement learning agents, see Reinforcement Learning Agents.
creates a TD3 agent for an environment with the given observation and action
specifications, using default initialization options. The actor and critics
representations in the agent use default deep neural networks built from the observation
specification agent
= rlTD3Agent(observationInfo
,actionInfo
)observationInfo
and the action specification
actionInfo
.
creates a deep deterministic policy gradient agent for an environment with the given
observation and action specifications. The agent uses default networks configured using
options specified in the agent
= rlTD3Agent(observationInfo
,actionInfo
,initOpts
)initOpts
object. For more information on
the initialization options, see rlAgentInitializationOptions
.
creates a TD3 agent and sets the agent
= rlTD3Agent(___,agentOptions
)AgentOptions
property to the agentOptions
input argument. Use this syntax after
any of the input arguments in the previous syntaxes.
train | Train reinforcement learning agents within a specified environment |
sim | Simulate trained reinforcement learning agents within specified environment |
getAction | Obtain action from agent or actor representation given environment observations |
getActor | Get actor representation from reinforcement learning agent |
setActor | Set actor representation of reinforcement learning agent |
getCritic | Get critic representation from reinforcement learning agent |
setCritic | Set critic representation of reinforcement learning agent |
generatePolicyFunction | Create function that evaluates trained policy of reinforcement learning agent |
Deep Network Designer | rlAgentInitializationOptions
| rlDeterministicActorRepresentation
| rlQValueRepresentation
| rlTD3AgentOptions