Report Generation 報告生成 Deep Research

> 相關條目 [[Report Generation 報告生成 Deep Search]] * Building Blocks of LLM Report Generation: Beyond Basic RAG (2024/11/5) * https://www.llamaindex.ai/blog/building-blocks-of-llm-report-generation-beyond-basc-rag * https://x.com/llama_index/status/1853882080658424226 (2024/11/6) * LlamaReport Preview: Transform any Documents into Structured Reports (2024/12/17) * https://www.llamaindex.ai/blog/llamareport-preview-transform-any-documents-into-structured-reports * A Practical Guide to Implementing DeepSearch/DeepResearch (2025/2/25) * https://jina.ai/news/a-practical-guide-to-implementing-deepsearch-deepresearch/ * https://jina.ai/news/snippet-selection-and-url-ranking-in-deepsearch-deepresearch/ * DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents * https://x.com/_philschmid/status/1935669863453806808 (2025/6/19) * https://github.com/Ayanami0730/deep_research_bench * 九原客分享 https://x.com/9hills/status/1940925762330329292 (2025/7/4) * paper 綜述 https://arxiv.org/abs/2506.18096 * 專案推薦 https://github.com/bytedance/deer-flow * 專案推薦 https://github.com/foreveryh/mentis/tree/main/super_agents/deep_research * Building Deep Research Systems (2025/) * https://maven.com/p/e1319e/building-deep-research-systems * 架構上我的理解是: 在 RAG 的基礎上，根據用戶輸入的需求 * 拆解或辨識出多個問題，每個問題透過 RAG 產生解答 * 將多個 QA pairs 綜合成一個最終產出報告 * 這個最終報告最好是多個 prompt 輸出組合而成的 * 如果著重在搜尋和資料收集，放在 [[Report Generation 報告生成 Deep Search]] 這個條目 --- * 報告生成 Open Deep Research (2024/12/6) ⭐⭐⭐ * 蠻 general 的設計，跟 LlamaReport 感覺非常類似的理念 * 原名 https://github.com/langchain-ai/report-mAIstro * 改名 https://github.com/langchain-ai/open_deep_research * https://github.com/langchain-ai/report-mAIstro/tree/main/report_examples 有不同類型報告的範例 * https://x.com/LangChainAI/status/1864702019614707904 * https://x.com/RLanceMartin/status/1864723445067751508 * https://x.com/LangChainAI/status/1892645710224622024 (2025/2/21) * https://blog.langchain.com/open-deep-research/ (2025/7/16) * 分成三階段執行 * Phase 1: User Clarification and Brief Generation * Phase 2: Research，分成 Research Supervisor 和 Sub-Agents * 主管會判斷子代理人的發現是否充分涵蓋簡報中的工作範圍。如果主管需要更深入的內容，可以派遣更多子代理人進行進一步研究 * Phase 3: Report Writing * 財務報告生成 (2024/10/23) ⭐⭐ * https://github.com/run-llama/llamacloud-demo/blob/main/examples/report_generation/report_generation.ipynb * 錄影 https://www.youtube.com/watch?v=aqtX-sGbevw * https://x.com/llama_index/status/1858207641732084076 (2024/11/18) * 有兩個檢索工具 chunk 跟 documents * 輸入是一個比較複雜的問題 * 先有一個 Agent A 根據 query 去 call 這兩個工具 * 當 A 不再呼叫工具時，代表資訊收集好了 * 這裡有用到 function calling * 接著呼叫 B 開始寫報告 * 報告要求 text 跟 table 交錯輸出，是個 list * 多模態報告生成代理人 (2004/10/23) * https://github.com/run-llama/llama_parse/blob/main/examples/multimodal/multimodal_report_generation_agent.ipynb * 拿多個論文建立向量資料庫 * 工具有 a chunk-level retriever tool 和 a document-retrieval tool. * 輸出是 text 跟 image 交雜，至少要有一張 image * image 是 file path * 跟上述的財務報告基本結構一模一樣，差別在於 image 而已 > 其實這個跟我在課程中的 RAG Chatbot 前半蠻像的 XD > 差別在於這個後半用不一樣的 agent (prompt) 來產生更長的報告 * RFP Response Generation Workflow (2024/10/22) ⭐⭐⭐ * RFP 是 Request for Proposal 提案請求 * 一個組織向潛在供應商或服務提供者發出的一份文件，詳細說明其需求和目標，並邀請他們提交建議書以競爭承攬某項專案或服務 * 因此這個 report generation 是針對 RFP 寫回覆報告 * https://github.com/run-llama/llama_parse/blob/main/examples/report_generation/rfp_response/generate_rfp.ipynb * https://x.com/llama_index/status/1845853830485082474 (2024/10/14) * https://x.com/jerryjliu0/status/1845907081725096329 (2024/10/15) * https://x.com/llama_index/status/1848759935787803091 (2024/10/23) * https://github.com/run-llama/llamacloud-demo/blob/main/examples/report_generation/rfp_response/generate_rfp.ipynb * 一樣的範例麻 * 首先要先有檢索器 * 使用 LlamaParse 解析 RFP 模板 * 用 prompt 2 提取出要詢問的 query 多個問題 * 每個問題逐個回答(要求一定要檢索工具) 用 prompt 1，這裡是一個 research_agent 因為有很多工具要選，他把每個檔案的檢索都拆開當作一個工具 * 會平行多個 agents 一起執行 * 最後將問題/答案串接成一個字典列表 * collect_events 方法應該會等待總共有 num_to_collect 個事件 * 將 RFP 跟問題答案列表，合在一起最後生成報告 (prompt 3) * RFP Response Generation Workflow (with Human-in-the-Loop) (2024/11/3) ⭐ * https://x.com/llama_index/status/1853111702759657825 (11/4) * 添加了人機互動功能，讓人們在繼續之前驗證每個研究答案 * https://github.com/run-llama/llamacloud-demo/blob/main/examples/report_generation/rfp_response/generate_rfp_hitl.ipynb * 拿到答案後丟出 InputRequiredEvent 事件，問人類 Do you like this answer? * 接著 process_human_response 用一個 LLM REFLECT_ON_QA_PROMPT 去解讀人類是 yes or no 有什麼回饋 * 如果人類沒問題才繼續，不然丟回 agent 修改問題再次檢索回答 * 研究報告生成 (2024/10/19) - 這個其實沒啥 Agent 技術成分，單純 workflow * https://github.com/run-llama/llamacloud-demo/blob/main/examples/report_generation/research_paper_report_generation.ipynb * https://x.com/llama_index/status/1848421745101050358 (2024/10/22) * https://x.com/llama_index/status/1852039190982332480 (2024/11/1) * 先要有個論文的向量檢索工具 * 用戶輸入報告大綱 * 根據大綱產生 queries * parse_outline_and_generate_queries 解析大綱並為每個部分和子部分生成查詢 * extract_title * generate_query_with_llm * classify_query 判斷問題是否需要檢索 * 回傳一堆 queries * 進行檢索 * generate_section_content 參數傳入 queries * 產生報告 * format_report 傳入大綱和檢索的資料 * 各 section * 最後再寫 introduction 和 conclusion * 基本上 outline 輸入是有特定格式的，需要 parse 結構 * 這個案例其實沒有用到 function calling，單純 workflow 麻，為何要叫 Agent .... XD * Form Filling using LlamaCloud and LlamaParse (2024/10/22) * 填寫 excel 表單的案例 * https://github.com/run-llama/llamacloud-demo/blob/main/examples/form_filling/Form_Filling_10K_SEC.ipynb * 前提要有可以回答的知識庫 * 解析 excel 檔案，轉成 list 結構 * 針對上述的 list 結構，跑迴圈每個都用 LLM 去檢索回答，把答案也塞回 list * 把 list 輸出成 csv，搞定 * Multimodal Report Generation (from a Slide Deck) * https://github.com/run-llama/llama_parse/blob/main/examples/multimodal/multimodal_report_generation.ipynb * https://x.com/llama_index/status/1857851103058211215 (2024/11/17) * 簡單的 One-shot report generation * 投影片生成 * https://github.com/run-llama/llama-slides * node.js * 兩階段的 deep research 設計 (2026/4/16) * https://x.com/hxiao/status/2044765001370701981 * https://x.com/dotey/status/2044814877051486210