---
title: "SU-CS238 OCT262023"
source: https://www.jemoka.com/posts/kbhsu_cs238_oct262023/
date: 2023-10-26
---

Big day. Policy Gradient.

## New Concepts

- Approximate Policy Evaluation and Roll-out utility
- Policy Optimization methods:
  - Local Policy Search (aka Hooke-Jeeves Policy Search)
  - Genetic Policy Search
  - Cross Entropy Method
- Policy Gradient, Regression Gradient and Likelihood Ratio Gradient
- Reward-to-Go

## Important Results / Claims

- Monte Carlo policy evaluation
- Finite-Difference Gradient Estimation
- Linear Regression Gradient Estimate

## Questions for next office hour