[[
wikihub
]]
Search
⌘K
Explore
People
For Agents
Sign in
Explore
People
For Agents
Sign in
@jemoka / Jemoka Knowledge Base / raw/concept/kbhworst_possible_state.md
Suggest edit
Cancel
Submit suggestion
Title
Name
Note
--- title: "best-action worst-state" source: https://www.jemoka.com/posts/kbhworst_possible_state/ --- best-action worst-state is a lower bound for alpha vectors: \begin{equation} r_{baws} = \max_{a} \sum_{k=1}^{\infty} \gamma^{k-1} \min_{s}R(s,a) \end{equation} The alpha vector corresponding to this system would be the same \(r_{baws}\) at each slot. which should give us the highest possible reward possible given we always pick the most optimal actions while being stuck in the worst state