---
title: "ICLR2025 Yue: Inference Scaling for Long-Context RAG"
source: https://www.jemoka.com/posts/kbhiclr2025_yue_inference_scaling_for_rag/
---

“RAG performance can scale almost linearly w.r.t. log inference FLOPs”

## Demonstration-Based RAG (DRAG)

### Method

Add demonstrations to the prompt as k in-context examples.

Prompt: documents, input query, final answer.

Parameters: number of documents, number of in-context samples, upper bound on the number of iterations.

## Iterative Demonstration-Based RAG (IterDRAG)

### Method

DRAG as above, after which the model can decide to generate a new sub-query.

Parameters: number of documents, number of in-context samples, upper bound on the number of iterations.
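
The two methods above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the paper's implementation: the `Retriever` and `LM` callables, the `SUBQUERY:` output prefix, the prompt wording, and the forced final call when the iteration budget runs out are all hypothetical choices made for the example. The three knobs (`num_docs`, `num_demos`, `max_iters`) correspond to the parameters listed in the note.

```python
from typing import Callable, List, Tuple

Retriever = Callable[[str, int], List[str]]  # hypothetical: (query, k) -> top-k documents
LM = Callable[[str], str]                    # hypothetical: prompt -> completion
Demo = Tuple[List[str], str, str]            # (documents, query, final answer)

# Assumed convention: the model starts its output with this prefix
# when it wants another retrieval round instead of answering.
SUBQUERY_PREFIX = "SUBQUERY:"


def format_example(docs: List[str], query: str, answer: str = "") -> str:
    """Lay out one example in the note's order: documents, query, answer."""
    doc_lines = [f"Document {i + 1}: {d}" for i, d in enumerate(docs)]
    return "\n".join(doc_lines + [f"Question: {query}", f"Answer: {answer}".rstrip()])


def drag(query: str, retrieve: Retriever, lm: LM, demos: List[Demo],
         num_docs: int, num_demos: int) -> str:
    """DRAG: one retrieval, then one LM call over demonstrations plus the query."""
    docs = retrieve(query, num_docs)
    blocks = [format_example(*d) for d in demos[:num_demos]]
    blocks.append(format_example(docs, query))  # answer left blank for the model
    return lm("\n\n".join(blocks)).strip()


def iter_drag(query: str, retrieve: Retriever, lm: LM, demos: List[Demo],
              num_docs: int, num_demos: int, max_iters: int) -> str:
    """IterDRAG: like DRAG, but the model may ask sub-queries up to max_iters times."""
    docs = retrieve(query, num_docs)
    demo_blocks = [format_example(*d) for d in demos[:num_demos]]
    for _ in range(max_iters):
        prompt = "\n\n".join(demo_blocks + [format_example(docs, query)])
        out = lm(prompt).strip()
        if not out.startswith(SUBQUERY_PREFIX):
            return out  # the model chose to answer
        sub_query = out[len(SUBQUERY_PREFIX):].strip()
        # Extend the context with documents retrieved for the sub-query.
        for d in retrieve(sub_query, num_docs):
            if d not in docs:
                docs.append(d)
    # Budget exhausted: one last call that asks for an answer (assumed behavior).
    final_prompt = "\n\n".join(demo_blocks + [format_example(docs, query)])
    return lm(final_prompt + "\nGive the final answer now.").strip()
```

With `max_iters=0`, `iter_drag` skips the loop and makes only the final call, so it behaves like `drag` apart from the extra instruction line. Each additional iteration adds one LM call and one retrieval, which is where the extra inference compute comes from in this sketch.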