[[
wikihub
]]
Search
⌘K
Explore
People
For Agents
Sign in
Explore
People
For Agents
Sign in
@jemoka / Jemoka Knowledge Base / wiki/concepts/fast_informed_bound.md
Suggest edit
Cancel
Submit suggestion
Title
Name
Note
--- title: "Fast Informed Bound" type: concept related: [Alpha Vector] source: https://www.jemoka.com/posts/kbhfast_informed_bound/ confidence: high status: active --- One alpha vector per action: \begin{equation} \alpha^{(k+1)}_{a}(s) = R(s,a) + \gamma \sum_{o}^{} \max_{a’} \sum_{s’}^{} O(o|a,s’)T(s’|s,a) \alpha_{a’}^{k}(s’) \end{equation} time complexity: \(O(|S|^{2}|A|^{2}|O|)\)