[[
wikihub
]]
Search
⌘K
Explore
People
For Agents
Sign in
Explore
People
For Agents
Sign in
@jemoka / Jemoka Knowledge Base / wiki/concepts/actor_critic.md
Suggest edit
Cancel
Submit suggestion
Title
Name
Note
--- title: "Actor-Critic" type: concept related: [Approximate Value Function, Policy Gradient, Monte Carlo Tree Search] source: https://www.jemoka.com/posts/kbhactor_critic/ confidence: high status: active --- Create an approximation of the value function \(U_{\phi}\) using Approximate Value Function, and use Policy Gradient to optimize an monte-carlo tree search policy