@jemoka / Jemoka Knowledge Base / wiki/concepts/blind_lower_bound.md
---
title: "blind lower bound"
type: concept
source: https://www.jemoka.com/posts/kbhblind_lower_bound/
confidence: high
status: active
---

To evaluate the lower bound:

\begin{equation}
\alpha_{a}^{k+1}(s) = R(s,a) + \gamma \sum_{s'} T(s'|s,a) \alpha_{a}^{k}(s')
\end{equation}

We are essentially committing to a single action and performing conditional plan evaluation of the policy that takes that same action at every step into the future.
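The update above can be sketched in plain Python. This is a minimal illustration and not the note's own implementation: the function name `blind_lower_bound`, the nested-list layouts `R[s][a]` and `T[s][a][s']`, the zero initialization of the alpha vectors, and the fixed iteration count are all assumptions made for the example.

```python
def blind_lower_bound(R, T, gamma, iterations=100):
    """Iterate alpha_a^{k+1}(s) = R(s,a) + gamma * sum_s' T(s'|s,a) * alpha_a^k(s').

    Assumed (hypothetical) layout:
      R[s][a]     -- immediate reward for taking action a in state s
      T[s][a][sp] -- probability of reaching state sp from s under action a
    Returns alpha[a][s], one alpha vector per action.
    """
    n_states = len(R)
    n_actions = len(R[0])

    # Assumption: start every alpha vector at zero; the source does not
    # specify an initialization.
    alpha = [[0.0] * n_states for _ in range(n_actions)]

    for _ in range(iterations):
        # Build the new vectors from the previous iterate only, so each
        # sweep is a synchronous backup for every fixed action a.
        alpha = [
            [
                R[s][a]
                + gamma * sum(T[s][a][sp] * alpha[a][sp] for sp in range(n_states))
                for s in range(n_states)
            ]
            for a in range(n_actions)
        ]
    return alpha


# Toy two-state, two-action problem (made-up numbers for illustration):
# action 0 always pays 1, action 1 always pays 0.
R = [[1.0, 0.0], [1.0, 0.0]]
T = [
    [[1.0, 0.0], [0.0, 1.0]],  # from state 0: action 0 stays, action 1 moves
    [[0.0, 1.0], [1.0, 0.0]],  # from state 1: action 0 stays, action 1 moves
]
alpha = blind_lower_bound(R, T, gamma=0.5)
# alpha[0][s] approaches 1 / (1 - 0.5) = 2 for every state s; alpha[1][s] stays 0.
```

Each action gets its own alpha vector because the recursion never mixes actions: the backup for action a only reads the previous iterate for the same a, which is what "sticking with an action" means here.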