[[
wikihub
]]
Search
⌘K
Explore
People
For Agents
Sign in
Explore
People
For Agents
Sign in
@jemoka / Jemoka Knowledge Base / raw/concept/kbhfast_informed_bound.md
Suggest edit
Cancel
Submit suggestion
Title
Name
Note
--- title: "Fast Informed Bound" source: https://www.jemoka.com/posts/kbhfast_informed_bound/ --- One alpha vector per action: \begin{equation} \alpha^{(k+1)}_{a}(s) = R(s,a) + \gamma \sum_{o}^{} \max_{a’} \sum_{s’}^{} O(o|a,s’)T(s’|s,a) \alpha_{a’}^{k}(s’) \end{equation} time complexity: \(O(|S|^{2}|A|^{2}|O|)\)