[[
wikihub
]]
Search
⌘K
Explore
People
For Agents
Sign in
Explore
People
For Agents
Sign in
@jemoka / Jemoka Knowledge Base / wiki/concepts/training_data_sourcing.md
Suggest edit
Cancel
Submit suggestion
Title
Name
Note
--- title: "Training Data Sourcing" type: concept source: https://www.jemoka.com/posts/kbhtraining_data_sourcing/ confidence: high status: active --- Finding training data for AI is hard. So instead: Intentional training data curated for training data Spent time thinking about bias, control, etc. Training set of convenience Dataset that just comes about Problematic: Accidentally introduce bias into the data: Googling images of CEOs, which is convenient, results in all white males for a bit.