RethinkCraft · LeeLaugh-jason · Mar 25, 2026 · Mar 28, 2026 · Mar 30, 2026 · Mar 30, 2026
diff --git a/CMakeLists.txt b/CMakeLists.txt
@@ -60,6 +60,8 @@ set(SOURCES
     src/agent_tools.cpp
     src/apply_patch.cpp
     src/agent_loop.cpp
+    src/docgen_llm.cpp
+    src/docgen_pipeline.cpp
 )
 
 # Main executable

diff --git a/README.md b/README.md
@@ -35,10 +35,11 @@ For coding work, a task enters through CLI and config setup, flows through the a
 - Agent loop and tool-calling basics
 - HTTP / LLM integration and SSE streaming
 - Safety-bounded `read`, `write`, and `bash` tools
+- Repository-aware search, diff, staging, and commit packaging tools with policy-gated approvals
 - Test infrastructure and deterministic mdBook validation
 - Change-aware documentation automation for README and book chapters, including scope decisions, reference context, blocking verify, and review/rework evidence
 
-`Phase 1` completes the in-repository coding workflow loop: structured git and patch tools, repository search, policy-gated mutations and approvals, bounded build/test, and `git_add` / `git_commit` for packaging changes.
+`Phase 1` completes the in-repository coding workflow loop: structured git and patch tools, repository search, policy-gated mutations and approvals, bounded build/test, `git_add` / `git_commit` packaging guarded by the same approval model, and recovery guidance for blocked or inspectable tool failures.
 
 ## Documentation Automation
 

diff --git a/README_zh.md b/README_zh.md
@@ -35,10 +35,11 @@ NanoCodeAgent 是一个教学型 C++ code agent 运行时，重点放在确定
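 - agent loop and basic tool-calling
 - HTTP / LLM integration and SSE stream parsing
 - Safety-bounded `read`, `write`, and `bash` tools
+- Repository search, diff, staging, and commit packaging tools with policy-gated approvals
 - Test infrastructure and deterministic mdBook validation
 - Change-aware documentation automation for README and book chapters, including scope decision, reference context, blocking verify, and review/rework evidence
 
-`Phase 1` closes the in-repository coding workflow loop: structured git and patch tools, repository search, policy-gated mutations and approvals, bounded build/test, and `git_add` / `git_commit` for packaging changes.
+`Phase 1` closes the in-repository coding workflow loop: structured git and patch tools, repository search, policy-gated mutations and approvals, bounded build/test, a `git_add` / `git_commit` packaging flow guarded by the same approval model, and recovery guidance for blocked / inspectable tool failures.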
 
 ## Documentation Automation
 

diff --git a/book/src/04-tools-and-safety.md b/book/src/04-tools-and-safety.md
@@ -14,31 +14,33 @@
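 2. **Approval gate**: even when the model requests a tool, does the current run policy allow it to execute?
 3. **Execution and stop gate**: once a tool actually starts running, is it still bound by path boundaries, output caps, timeouts, and fail-fast constraints?
 
-The diagram below shows how the three layers relate. Read it left to right along one control chain: "is the request caught, is it allowed, does the run continue after execution":
+The diagram below shows how the three layers relate. Read it left to right along one control chain: "is the request caught, are the approvals satisfied, and after execution does the loop continue, issue guidance, or stop":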
 
 ```mermaid
 flowchart LR
     Request[Assistant Tool Call]
     Registered{Registered?}
-    Approved{Allowed By Policy?}
+    Approved{Approvals Satisfied?}
     Executor[Bounded Executor]
     Result[Tool Result]
-    Healthy{Result Clean?}
+    Recoverable{Recoverable?}
+    Guidance[Guidance]
     Stop[Fail-Fast Stop]
     Next[Next Turn]
 
     Request --> Registered
     Registered -- no --> Stop
     Registered -- yes --> Approved
-    Approved -- no --> Stop
+    Approved -- no --> Guidance
     Approved -- yes --> Executor
     Executor --> Result
-    Result --> Healthy
-    Healthy -- no --> Stop
-    Healthy -- yes --> Next
+    Result --> Recoverable
+    Recoverable -- yes --> Guidance
+    Guidance --> Next
+    Recoverable -- no --> Stop
 ```
 
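-This diagram shows only the control chain: a request is first checked for "does the system recognize this tool", then for "does the current policy allow it to execute", and only then enters the bounded executor; finally the loop decides from the result whether to continue or stop. It does not expand the read-only / mutating / execution classification details, nor the executor internals; its only purpose is to help readers see first how the boundaries tighten layer by layer.
+This diagram shows only the control chain: a request is first checked for "does the system recognize this tool", then for "are the approvals required by the current policy satisfied", and only then enters the bounded executor; finally the loop decides from the result whether to continue, add a recovery hint, or stop. It does not expand the read-only, mutating, and execution classification details, nor the executor internals; its only purpose is to help readers see first how the boundaries tighten layer by layer and how failures are routed.
 
 ## 3. Main Flow
 When `src/agent_loop.cpp` receives an assistant message carrying `tool_calls`, it takes each call in order, parses `function.arguments`, and hands it to `execute_tool()`. The real unified entry point is `ToolRegistry`.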
@@ -54,15 +56,15 @@ flowchart LR
 - `mutating`
 - `execution`
 
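-Read-only tools are normalized at registration to require no approval; mutating and execution tools are blocked by default and pass only when `allow_mutating_tools` or `allow_execution_tools` is explicitly turned on. Note that approval here is not "a human confirming each call in a pop-up"; it is a policy gate fixed before the run starts.
+Read-only tools are normalized at registration to require no approval; mutating and execution tools are blocked by default and pass only when `allow_mutating_tools` or `allow_execution_tools` is explicitly turned on. More specifically, although `git_add` and `git_commit` are classified as `mutating`, they are also marked as both "changes repository state" and "executes repo-controlled git paths and hook behavior", so by default they need both the mutating and the execution approvals. Note that approval here is not "a human confirming each call in a pop-up"; it is a policy gate fixed before the run starts.
 
 Only after passing the approval gate does a call enter its concrete executor. From here on, the boundaries of different tools start to diverge:
 
 - file tools rely on workspace resolution and a no-follow open policy;
-- read-only repo tools rely on restricted directories, hardened rg/git arguments, and output limits;
-- bash/build/test tools rely on a locked working directory, environment scrubbing, timeouts, and output truncation.
+- repo tools rely on restricted directories, hardened arguments for rg and git, and output limits;
+- bash, build, and test tools rely on a locked working directory, environment scrubbing, timeouts, and output truncation.
 
-Finally, `agent_loop` writes the tool result back into the context as a `role=tool` message. If the result contains a contamination signal such as `blocked`, `failed`, `ok:false`, or `timed_out:true`, the loop stops immediately rather than letting the model carry a bad state forward.
+Finally, `agent_loop` writes the tool result back into the context as a `role=tool` message. The loop no longer treats every failure as the same kind of contamination: `blocked` is classified as a recoverable failure that calls for a different tool or a changed run policy, `no_match` and `multiple_matches` from `apply_patch` are classified as needs-inspection, and build or test timeouts are classified as retryable; the loop truly fail-fast stops only on an unstructured failure, an ordinary executor failure, or the same recoverable failure repeated verbatim past the retry budget.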
 
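 ## 4. In a real task, how do tools get allowed, rejected, and then stopped? (Worked Example)
 Suppose the user gives the system a task: read `src/main.cpp`, modify a little content if needed, and run a command once to verify.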
@@ -73,40 +75,43 @@ flowchart LR
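    This is a read-only tool, so the registry lets it through directly.
 
 2. Next the model wants to request `write_file_safe` or `apply_patch`.  
-   If `allow_mutating_tools` is not enabled for this run, the registry returns `blocked` directly and the executor is never called.
+   If `allow_mutating_tools` is not enabled for this run, the registry returns `blocked` directly and the executor is never called; the loop writes the result back into the context and attaches a recovery hint: "switch to an allowed read-only tool, or adjust approvals outside the run".
 
 3. If the model goes on to request `bash_execute_safe` or `test_project_safe`, the situation is stricter.  
    These are all execution tools; if `allow_execution_tools` is not enabled, they are likewise stopped at the registry layer.
 
-4. If an execution tool is allowed, the real executor still has to pass one more layer of boundaries.  
+4. If the model wants to package already-prepared changes into git, `git_add` and `git_commit` raise the bar one more level.  
+   Both modify repository state, and the `git` command itself may trigger repository-controlled hooks and config paths, so by default both the mutating and the execution approvals must be enabled; if either is missing, the registry returns `blocked` before the executor runs and lists `missing_approvals` explicitly.
+
+5. If an execution tool is allowed, the real executor still has to pass one more layer of boundaries.  
    For example, `bash_execute_safe()` pins cwd to the workspace, scrubs environment variables, reads output through two pipes, and kills the process group on timeout or runaway output.
 
-5. As soon as any step returns a failed, timed-out, or blocked status, `agent_loop` fail-fast stops.  
-   This is why "the tool is visible" and "the tool is actually executed" are two different things.
+6. `agent_loop` fail-fast stops only when a step ends in a fatal failure, or when the model, after receiving recoverable guidance, keeps repeating the same failure verbatim until the retry budget is exhausted.  
+   This is why "the tool is visible" and "the tool is actually executed" are two different things, and why "a tool failed" and "the whole run terminates immediately" are no longer synonymous either.
 
 The point of this example is not that "the system is especially conservative", but that "every layer of the tool chain answers exactly its own question": is it visible, is it allowed, how is it executed, and what happens after failure.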
 
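 ## 5. Read module roles together with the control chain (Module Roles)
 - `src/agent_tools.cpp`
   Defines the default tool set, binding name, description, parameter schema, category, and executor function together. This decides "what the model can request".
 - `src/tool_registry.cpp`
-  Is the policy gate. It decides "whether this request may execute in the current run".
+  Is the policy gate. It decides "whether this request may execute in the current run", and writes back into the structured result, explicitly, whether the `mutating` approval, the `execution` approval, or both are missing.
 - `src/read_file.cpp`, `src/write_file.cpp`, `src/apply_patch.cpp`
   Are the first line of the file boundary. They do not merely check strings; they combine workspace resolution with a safe-open policy to prevent path escapes and symlink traversal.
 - `src/repo_tools.cpp`
-  Is the executor for the read-only observation surface. These tools also need hardening because they still invoke rg or git underneath.
+  Is no longer only a read-only observation surface. It hosts observation tools such as `git_status`, `git_diff`, and `git_show` as well as packaging-style repository mutation tools such as `git_add` and `git_commit`; all of them still need argument hardening, path constraints, and output bounds.
 - `src/bash_tool.cpp` and `src/build_test_tools.cpp`
   Are the execution surface. They do not promise "absolute isolation"; they promise "bounded execution, prompt cleanup, and as little leakage as possible".
 - `src/agent_loop.cpp`
-  Is the last stop gate. It uses tool counts, turn counts, context, and fail-fast rules to keep errors from spreading.
+  Is the last routing and stop gate. It uses tool counts, turn counts, context, recoverable guidance, and fail-fast rules to keep errors from spreading.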
 
 ## 6. As a contributor, how do you usually make sense of the current tool surface? (What You Usually Do)
 If this is your first attempt to see the tool system clearly, the most effective order is usually not to memorize tool names but to start from "which layer answers which question":
 
 1. First read `build_default_tool_registry()` and `get_agent_tools_schema()` to confirm which categories of tools the model can see.
 2. Then read `src/tool_registry.cpp` to confirm how the default policy distinguishes read-only, mutating, and execution.
 3. Then read the concrete executors to understand the real risk differences between tool categories.
-4. Finally return to `src/agent_loop.cpp` to see how the system stops after a tool failure.
+4. Finally return to `src/agent_loop.cpp` to see how the system distinguishes `blocked`, `retryable`, `needs-inspection`, and `fatal`, and decides whether to add guidance or stop.
 
 If you are changing a particular boundary, the most relevant test entry points are usually:
 
@@ -120,16 +125,16 @@ flowchart LR
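 No. The schema exposes "the model knows it exists"; whether it can actually execute depends on `ToolRegistry` and the current configuration policy.
 
 ### Misconception 2: approval means a human approves each call
-Also wrong. In the current implementation, approval is a policy switch fixed before the run, not an interactive confirmation flow.
+Also wrong. In the current implementation, approval is a policy switch fixed before the run, not an interactive confirmation flow; moreover, repository packaging tools such as `git_add` and `git_commit` may be missing both the `mutating` and the `execution` approvals at once.
 
 ### Misconception 3: `bash_execute_safe()` is equivalent to a container or a system-level sandbox
 This is exactly the overstated safety claim that most needs correcting. The current implementation locks the workspace, scrubs the environment, limits output, sets a timeout, and kills the process group, but it is not `chroot`, not seccomp, and not a container.
 
 ### Misconception 4: file tools and the bash tool have roughly the same boundary strength
 No. The strongest path boundary today lies mainly in the file tools, not in bash. File tools are stricter about workspace resolution and safe opening; bash is more of a "bounded executor" than an "absolute isolator".
 
-### Misconception 5: the agent loop intelligently understands that "this round is pointless now"
-The current implementation has no such semantic-level judgment. It relies mainly on fixed thresholds and result status strings to fail fast. The approach is plain, but its advantage is a clear boundary.
+### Misconception 5: as soon as a tool result contains the word failure, the agent loop stops immediately
+That summary no longer holds. The current implementation first classifies structurally: `blocked`, some patch rejections, and build or test timeouts are first turned into guidance, so the model can switch to a different way of observing, fix its arguments, or stop repeating the same bad call; the loop truly stops only on a fatal failure or when a repeated recoverable failure exceeds the budget.
 
 The most common failure modes map onto exactly these misconceptions: a tool that should be blocked by default is let through, shell output runs out of control, background processes leak, the loop keeps running after a tool failure, and read or write paths are exploited to escape the boundary. Because these problems are concrete and dangerous, this chapter has to break "safety" back down into the control chain rather than leave it as an abstract slogan.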
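
Review note: the chapter hunks above describe a four-way routing of tool failures (blocked, needs-inspection, retryable, fatal) plus a retry budget for repeated recoverable failures. The following is a minimal C++ sketch of that routing, not the code in `src/agent_loop.cpp`. Every name in it (`FailureClass`, `ToolOutcome`, `classify`, `RetryBudget`, the `signature` string) is hypothetical, and the way a repeated failure is detected is an assumption made for illustration.

```cpp
#include <string>

// Hypothetical classification buckets, named after the terms used in the chapter.
enum class FailureClass { None, Blocked, NeedsInspection, Retryable, Fatal };

// Hypothetical flattened view of a structured tool result.
struct ToolOutcome {
    bool ok = false;
    std::string tool;           // e.g. "apply_patch"
    std::string status;         // e.g. "blocked" or "failed"
    std::string reject_code;    // e.g. "no_match" or "multiple_matches"
    bool timed_out = false;
    bool build_or_test = false; // true when the tool is a build or test runner
};

// Route one result into a bucket; anything unrecognized falls through to Fatal.
FailureClass classify(const ToolOutcome& r) {
    if (r.ok) return FailureClass::None;
    if (r.status == "blocked") return FailureClass::Blocked;
    if (r.tool == "apply_patch" &&
        (r.reject_code == "no_match" || r.reject_code == "multiple_matches")) {
        return FailureClass::NeedsInspection;
    }
    if (r.timed_out && r.build_or_test) return FailureClass::Retryable;
    return FailureClass::Fatal;
}

// Counts consecutive identical recoverable failures (assumed policy).
class RetryBudget {
public:
    explicit RetryBudget(int budget) : budget_(budget) {}

    // Returns true when the loop should fail-fast stop.
    bool should_stop(FailureClass cls, const std::string& signature) {
        if (cls == FailureClass::None) {
            last_signature_.clear();
            repeats_ = 0;
            return false;
        }
        if (cls == FailureClass::Fatal) return true;
        if (signature == last_signature_) {
            ++repeats_;
        } else {
            last_signature_ = signature;
            repeats_ = 1;
        }
        return repeats_ > budget_;
    }

private:
    int budget_;
    int repeats_ = 0;
    std::string last_signature_;
};
```

With a budget of 2, the same blocked call can be seen twice and still yield guidance; the third identical attempt stops the loop. A fatal result stops immediately regardless of the budget.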
 

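Review note: the chapter states that `git_add` and `git_commit` need both the mutating and the execution approvals, and that the registry reports `missing_approvals` before any executor runs. A minimal sketch of that check, assuming a per-tool requirement record and a per-run approval record, could look like the following. `ToolPolicy`, `RunApprovals`, and the free function `missing_approvals` are hypothetical names chosen for illustration; only the flag names `allow_mutating_tools` and `allow_execution_tools` and the result field name come from the diff.

```cpp
#include <string>
#include <vector>

// Hypothetical per-tool requirements; a read-only tool leaves both false.
struct ToolPolicy {
    bool requires_mutating = false;
    bool requires_execution = false;
};

// Hypothetical per-run switches, mirroring the two config flags named in the chapter.
struct RunApprovals {
    bool allow_mutating_tools = false;
    bool allow_execution_tools = false;
};

// Returns the approvals a call still lacks; an empty list means the call may proceed.
std::vector<std::string> missing_approvals(const ToolPolicy& tool, const RunApprovals& run) {
    std::vector<std::string> missing;
    if (tool.requires_mutating && !run.allow_mutating_tools) {
        missing.push_back("mutating");
    }
    if (tool.requires_execution && !run.allow_execution_tools) {
        missing.push_back("execution");
    }
    return missing;
}
```

Under this sketch a `git_commit`-style policy of `{true, true}` stays blocked when only one switch is on, and the returned list tells the model which approval is absent, which is the information the recovery hint needs.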
diff --git a/include/agent_tools.hpp b/include/agent_tools.hpp
@@ -12,7 +12,7 @@
 std::string execute_tool(const ToolCall& cmd, const AgentConfig& config);
 
 // Returns the JSON schema representing the tools the agent is capable of running
-nlohmann::json get_agent_tools_schema();
+nlohmann::json get_agent_tools_schema(bool include_delegate_subagent = true);
 
 // Returns the default built-in tool registry used by the agent runtime.
 const ToolRegistry& get_default_tool_registry();
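
Review note: the new `include_delegate_subagent` parameter defaults to `true`, so existing call sites of `get_agent_tools_schema()` keep compiling and keep their behavior. The sketch below illustrates that pattern with plain tool names instead of `nlohmann::json`, to stay dependency-free. `visible_tool_names` is a hypothetical helper, and the tool name `"delegate_subagent"` is an assumption inferred from the parameter name, not something the diff confirms.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for schema generation: returns the names that would be exposed.
// Assumption: the delegated tool is registered under the name "delegate_subagent".
std::vector<std::string> visible_tool_names(const std::vector<std::string>& registered,
                                            bool include_delegate_subagent = true) {
    std::vector<std::string> visible;
    for (const auto& name : registered) {
        if (!include_delegate_subagent && name == "delegate_subagent") {
            continue;  // hide the delegation tool from this schema
        }
        visible.push_back(name);
    }
    return visible;
}
```

A caller that passes `false` gets the same list minus the delegation tool, which is one plausible way to keep a subagent from delegating further.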

diff --git a/include/docgen_llm.hpp b/include/docgen_llm.hpp
@@ -0,0 +1,120 @@
+#pragma once
+
+#include "config.hpp"
+#include "docgen_types.hpp"
+#include <string>
+#include <string_view>
+#include <functional>
+#include <nlohmann/json.hpp>
+#include <expected>
+
+namespace docgen {
+
+class LlmClient {
+public:
+    using StreamCallback = std::function<bool(const std::string& delta)>;
+
+    LlmClient(const AgentConfig& cfg, const SubagentContext& ctx);
+
+    std::expected<nlohmann::json, std::string> call(
+        const std::string& system_prompt,
+        const nlohmann::json& user_context,
+        StreamCallback on_delta = nullptr
+    );
+
+    std::expected<nlohmann::json, std::string> call_json(
+        const std::string& system_prompt,
+        const nlohmann::json& user_context,
+        StreamCallback on_delta = nullptr
+    );
+
+private:
+    AgentConfig config_;
+    SubagentContext ctx_;
+
+    std::string extract_json_from_response(const std::string& content);
+};
+
+inline constexpr std::string_view kChangeAnalystPrompt = R"(
+You are a precise code change analyst. Output ONLY valid JSON.
+
+INPUT: git diff text
+
+OUTPUT FORMAT (strict JSON):
+{
+  "affected_files": [
+    {"path": "relative/path", "change_type": "add|modify|delete|rename", "old_path": "optional for rename", "public_symbols": ["optional list"]}
+  ],
+  "intent_summary": "one sentence describing the change purpose"
+}
+
+RULES:
+1. Only include files with actual changes
+2. change_type must be one of: add, modify, delete, rename
+3. For renames, include old_path
+4. public_symbols: only exportable/public symbols affected
+5. NO prose, NO markdown code fences, ONLY the JSON object
+)";
+
+inline constexpr std::string_view kContextRouterPrompt = R"(
+You are a document section locator. Output ONLY valid JSON.
+
+INPUT:
+- doc_outline: JSON with file_path and headings array
+- intent_summary: what change is needed
+
+OUTPUT FORMAT (strict JSON):
+{
+  "locations": [
+    {"target_file": "path/to/doc.md", "start_line": N, "end_line": M, "section_heading": "optional heading"}
+  ]
+}
+
+RULES:
+1. Return only sections that need modification
+2. start_line and end_line must be valid line numbers (1-based)
+3. If no sections need changes, return {"locations": []}
+4. NO prose, NO markdown code fences, ONLY the JSON object
+)";
+
+inline constexpr std::string_view kPatchWriterPrompt = 
+"You are a precise document patch generator. Output ONLY valid JSON.\n\n"
+"INPUT:\n"
+"- original_text: the exact text block to modify (line-numbered)\n"
+"- intent_summary: what change is needed\n\n"
+"OUTPUT FORMAT (strict JSON):\n"
+"{\n"
+"  \"patches\": [\n"
+"    {\n"
+"      \"action\": \"replace|insert_before|insert_after|delete\",\n"
+"      \"old_text\": \"exact text to find (for replace/delete)\",\n"
+"      \"new_text\": \"replacement content (empty for delete)\"\n"
+"    }\n"
+"  ],\n"
+"  \"rationale\": \"one-line explanation\"\n"
+"}\n\n"
+"RULES:\n"
+"1. old_text MUST match EXACTLY (whitespace-sensitive) for replace/delete\n"
+"2. Each patch operates on original_text independently (not sequential)\n"
+"3. Use multiple patches only when truly independent\n"
+"4. NO prose, NO markdown code fences, ONLY the JSON object\n"
+"5. If no change needed, return {\"patches\": [], \"rationale\": \"no change required\"}";
+
+inline constexpr std::string_view kMicroReviewerPrompt = 
+"You are a minimal document patch reviewer. Output ONLY valid JSON.\n\n"
+"INPUT:\n"
+"- before: original text\n"
+"- after: modified text\n"
+"- intent_summary: intended change\n\n"
+"OUTPUT FORMAT (strict JSON):\n"
+"{\n"
+"  \"verdict\": \"approve|reject\",\n"
+"  \"reason\": \"one sentence\",\n"
+"  \"issues\": [\"optional list of problems if rejected\"]\n"
+"}\n\n"
+"RULES:\n"
+"1. approve if: change matches intent, no syntax errors, maintains document structure\n"
+"2. reject if: wrong content, broken formatting, unrelated changes\n"
+"3. NO prose, NO markdown code fences, ONLY the JSON object";
+
+} // namespace docgen
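
Review note: the header declares `extract_json_from_response` but the diff shown here does not include its definition. Since every prompt forbids prose and markdown fences, the helper presumably has to cope with responses that include them anyway. The following is a minimal sketch of one way to do that, assuming the goal is to return the first balanced top-level JSON object in the text. `extract_first_json_object` is a hypothetical free function and may not match what `src/docgen_llm.cpp` actually does.

```cpp
#include <cstddef>
#include <optional>
#include <string>
#include <string_view>

// Hypothetical helper: returns the first balanced {...} span, or nullopt if none closes.
// Braces inside JSON string literals are ignored, including after escaped quotes.
std::optional<std::string> extract_first_json_object(std::string_view content) {
    const std::size_t start = content.find('{');
    if (start == std::string_view::npos) return std::nullopt;

    int depth = 0;
    bool in_string = false;
    bool escaped = false;
    for (std::size_t i = start; i < content.size(); ++i) {
        const char c = content[i];
        if (in_string) {
            if (escaped) {
                escaped = false;          // character after a backslash is literal
            } else if (c == '\\') {
                escaped = true;
            } else if (c == '"') {
                in_string = false;
            }
            continue;
        }
        if (c == '"') {
            in_string = true;
        } else if (c == '{') {
            ++depth;
        } else if (c == '}') {
            if (--depth == 0) {
                return std::string(content.substr(start, i - start + 1));
            }
        }
    }
    return std::nullopt;  // object never closed
}
```

The extracted text would still need to go through a real JSON parser; this scan only trims surrounding fences or prose and does not validate the object.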