[[
wikihub
]]
Search
⌘K
Explore
People
For Agents
Sign in
Explore
People
For Agents
Sign in
@jemoka / Jemoka Knowledge Base / wiki/concepts/cross_entropy_loss.md
Suggest edit
Cancel
Submit suggestion
Title
Name
Note
--- title: "cross entropy loss" type: concept related: [Bernoulli Random Variable, Cross Entropy Method, Maximum Likelihood Parameter Learning] source: https://www.jemoka.com/posts/kbhcross_entropy_loss/ confidence: high status: active --- Cross Entropy Method is a “conditional MLE” objective; whereby we try to maximize: the log prob of the true y labels in the training data given the observations Derivation Recall the Bernoulli distribution, and specifically: \begin{equation} P(Y=y) = p^{y} (1-p)^{1-y} \end{equation} Meaning, we want to maximize: \begin{equation} \log P(y=y) = y \log p + (1-y)\log (1-y) \end{equation} specifically, we’d like to minimize: \begin{equation} -[y \log p + (1-y)\log (1-y)] \end{equation} Intuition This function should be smaller when the model estimate is close to correct bigger if the model is confused or wrong