rlACAgent

Actor-critic reinforcement learning agent

Description

Actor-critic (AC) agents implement actor-critic algorithms such as A2C and A3C, which are model-free, online, on-policy reinforcement learning methods. The agent optimizes the policy (actor) directly and trains a critic to estimate the expected return (future rewards), which it uses to evaluate the actor's policy updates.
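As a rough sketch (not the toolbox implementation), the quantities involved in a one-step advantage actor-critic update can be illustrated on toy numbers:

% illustrative one-step A2C update (all values assumed)
gamma = 0.99;    % discount factor
r = 1;           % reward observed for the transition
Vs = 0.5;        % critic estimate of the current state value V(s)
VsNext = 0.6;    % critic estimate of the next state value V(s')
piA = 0.4;       % actor probability of the action taken

advantage = r + gamma*VsNext - Vs;    % temporal-difference advantage estimate
actorLoss = -log(piA)*advantage;      % policy-gradient (actor) loss term
criticLoss = advantage^2;             % value-regression (critic) loss term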

For more information, see Actor-Critic Agents.

For more information on the different types of reinforcement learning agents, see Reinforcement Learning Agents.

Creation

Description


agent = rlACAgent(actor,critic,agentOptions) creates an actor-critic agent with the specified actor and critic representations and sets the AgentOptions property.

Input Arguments


actor - Actor network representation for the policy, specified as an rlStochasticActorRepresentation object. For more information on creating actor representations, see Create Policy and Value Function Representations.

critic - Critic network representation for estimating the discounted long-term reward, specified as an rlValueRepresentation object. For more information on creating critic representations, see Create Policy and Value Function Representations.

Properties


AgentOptions - Agent options, specified as an rlACAgentOptions object.

Object Functions

train - Train a reinforcement learning agent within a specified environment
sim - Simulate a trained reinforcement learning agent within a specified environment
getActor - Get actor representation from reinforcement learning agent
setActor - Set actor representation of reinforcement learning agent
getCritic - Get critic representation from reinforcement learning agent
setCritic - Set critic representation of reinforcement learning agent
generatePolicyFunction - Create function that evaluates trained policy of reinforcement learning agent
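For example, a minimal round trip with the actor accessor functions looks like the following (illustrative; agent is an rlACAgent such as the one created in the example below):

% get the actor representation from the agent, then set it back
actorRep = getActor(agent);
agent = setActor(agent,actorRep);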

Examples


Create an environment interface and obtain its observation and action specifications.

env = rlPredefinedEnv("CartPole-Discrete");
obsInfo = getObservationInfo(env);
actInfo = getActionInfo(env);
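Optionally, inspect the specifications. For the discrete cart-pole environment, the observation is a 4-element vector and the action takes one of two force values (this check is illustrative):

% inspect the observation and action specifications
obsInfo.Dimension   % dimensions of the observation space
actInfo.Elements    % discrete set of valid actions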

Create a critic representation.

% create the network to be used as approximator in the critic
criticNetwork = [
    imageInputLayer([4 1 1],'Normalization','none','Name','state')
    fullyConnectedLayer(1,'Name','CriticFC')];

% set some options for the critic
criticOpts = rlRepresentationOptions('LearnRate',8e-3,'GradientThreshold',1);

% create the critic
critic = rlValueRepresentation(criticNetwork,obsInfo,'Observation',{'state'},criticOpts);
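To sanity-check the critic, you can evaluate it for a random observation with getValue (an illustrative check, assuming your release supports getValue on value representations):

% evaluate the critic for a random 4-element observation
getValue(critic,{rand(4,1)})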

Create an actor representation.

% create the network to be used as approximator in the actor
actorNetwork = [
    imageInputLayer([4 1 1],'Normalization','none','Name','state')
    fullyConnectedLayer(2,'Name','action')];

% set some options for the actor
actorOpts = rlRepresentationOptions('LearnRate',8e-3,'GradientThreshold',1);

% create the actor
actor = rlStochasticActorRepresentation(actorNetwork,obsInfo,actInfo,...
    'Observation',{'state'},actorOpts);
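You can also query the actor directly with getAction, which accepts a representation as well as an agent (illustrative check):

% sample an action from the stochastic actor for a random observation
getAction(actor,{rand(4,1)})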

Specify agent options, and create an AC agent using the actor, the critic, and the agent options.

agentOpts = rlACAgentOptions('NumStepsToLookAhead',32,'DiscountFactor',0.99);
agent = rlACAgent(actor,critic,agentOpts)
agent = 
  rlACAgent with properties:

    AgentOptions: [1x1 rl.option.rlACAgentOptions]

To check your agent, use getAction to return the action from a random observation.

getAction(agent,{rand(4,1)})
ans = -10

You can now test and train the agent against the environment.
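For example, training and simulation calls might look like the following (the option values are illustrative, not tuned):

% train the agent, then simulate one episode with the trained agent
trainOpts = rlTrainingOptions('MaxEpisodes',1000,'MaxStepsPerEpisode',500);
trainStats = train(agent,env,trainOpts);

simOpts = rlSimulationOptions('MaxSteps',500);
experience = sim(env,agent,simOpts);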

Introduced in R2019a