Obtain estimated value function representation
returns the estimated state-action value functions for the multiple Q-value function
representation value
= getValue(qValueRep
,obs
)qValueRep
given environment observations
obs
. In this case, qValueRep
has as many outputs
as there are possible discrete actions, and getValue
returns the
state-value function for each action.
returns the estimated state-action value function for the single-output Q-value function
representation value
= getValue(qValueRep
,obs
,act
)qValueRep
given environment observations
obs
and actions act
. In this case,
getValue
returns the state-value function for the given observation
and action inputs.