Obtain maximum state-value function estimate for Q-value function representation with discrete action space
[
returns the maximum estimated state-value function for Q-value function representation
maxQ
,maxActionIndex
] = getMaxQValue(qValueRep
,obs
)qValueRep
given environment observations obs
.
getMaxQValue
determines the discrete action for which the Q-value
estimate is greatest and returns that Q value (maxQ
) and the
corresponding action index (maxActionIndex
).
[
returns the state of the representation. Use this syntax when maxQ
,maxActionIndex
,state
] = getMaxQValue(___)qValueRep
is a recurrent neural network.