Skip to content

feat(chatbi): §5-3 低置信 RAG 方案预览与 plan_execution_token#100

Merged
Cyning12 merged 6 commits into
mainfrom
task/chatbi-v3-lowconf-rag-preview
Jun 1, 2026
Merged

feat(chatbi): §5-3 低置信 RAG 方案预览与 plan_execution_token#100
Cyning12 merged 6 commits into
mainfrom
task/chatbi-v3-lowconf-rag-preview

Conversation

@Cyning12
Copy link
Copy Markdown
Owner

@Cyning12 Cyning12 commented Jun 1, 2026

Summary

  • §5-3 后端:低置信 rag_search 路径输出 agent.plan.previewrewrite_queryplan_execution_token 等),clarify_plan_once 通用化;确认后带 token 跳过 agent.clarify 并执行检索(api/agent.pyapi/chatbi_plan_token.pyapi/tools.py)。
  • 契约与测试_contract_manifest.json 增补 RAG 预览键;pytest test_v3_rag_plan_* / plan token 用例。
  • Harness 关账:task → docs/tasks/done/、50 reinspect、invoke 链;母单 SPEC §5-3 / §6 RAG 项已勾选。
  • 联调留证docs/diary/samples/chatbi-v3-lowconf-rag-preview/ 真机截图、E2E Timeline、NOTES-future-diary-llm-date.md(答案质量 / LLM 缺日期提示词,非闸门缺陷)。
    依赖:Ink FE 已在 main@72f8f0c(确认卡「按预览执行」);本 PR 仅后端仓。

Test plan

  • pytest tests -m "not intent_eval and not intent_benchmark"
  • python tools/tech_graph_contract_check.py(若 CI 要求)
  • 联调(可选):CHATBI_V3_LOW_CONFIDENCE_CLARIFY=1CHATBI_V3_PLAN_PREVIEW_CONFIRM=1INTENT_MIN_CONFIDENCE=1.0 → 问「2026-04-28日记的大致内容」→ 14 条 preview+clarify → 按预览执行 → 无 clarify、执行 rag_search(见 diary 样本 README)

Out of scope / follow-up

  • 端到端 RAG 答案质量(「未来日记」):见 docs/diary/samples/chatbi-v3-lowconf-rag-preview/NOTES-future-diary-llm-date.md,建议单独 task 注入 wall-clock 日期与 KB 边界 prompt。
  • 工作区未提交的 jsonPKmermaid fixture 变更不在本 PR 内。

CI 状态(自动 · pr-post-ci)

更新:2026-06-01 05:49 UTC · tools/pr_post_ci_update_body.py

Required 全绿:是

Check Status
contract_check success
manifest_check success
pytest success

变更范围(自动统计)

文件数:30

顶层前缀 计数
docs/ 24
api/ 3
tests/ 2
scripts/ 1

⚠ automerge 路径闸:含 api/tests/.github/workflows/ 等 — 勿打 automerge 或须人审 merge。

命中路径
  • api/agent.py
  • api/chatbi_plan_token.py
  • api/tools.py
  • tests/test_chatbi_plan_token.py
  • tests/test_unified_chat_backend_v2_agent.py

CyningMM and others added 6 commits May 31, 2026 22:04
明确 Ink 模块、FE 验收、Harness 跨仓节奏与 5-2 差异;更新 RECENT/backlog/母单 5-3 指针。

Co-authored-by: Cursor <cursoragent@cursor.com>
task in_progress;C1 契约与阶段表;invoke_00 供 semi_auto 22→30→40。

Co-authored-by: Cursor <cursoragent@cursor.com>
…eview

放行 22-R1 与 30 开工;HG-REINSPECT 仍 pending 至 50 后。

Co-authored-by: Cursor <cursoragent@cursor.com>
扩展 clarify 至 rag_search:preview_only 检索方案、clarify_plan_once
令牌按工具校验;契约增量 rewrite_query 等键;Harness 22/30/40 invoke
与 R1 审查落盘。

Co-authored-by: Cursor <cursoragent@cursor.com>
KPI 100% pass;母单/SPEC 同步;reinspect v1;diary 样本索引;task→done。

Co-authored-by: Cursor <cursoragent@cursor.com>
浏览器 E2E 截图、round1/round2 Timeline JSON、NOTES 记录 LLM 缺日期提示词;
README 对齐路线 A 联调 env;附 TestClient 导出脚本供 stub 再生。

Co-authored-by: Cursor <cursoragent@cursor.com>
@vercel
Copy link
Copy Markdown

vercel Bot commented Jun 1, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
ai-ink-brain-api-python Ready Ready Preview, Comment Jun 1, 2026 5:48am

@Cyning12 Cyning12 merged commit 858349e into main Jun 1, 2026
9 checks passed
@Cyning12 Cyning12 deleted the task/chatbi-v3-lowconf-rag-preview branch June 1, 2026 05:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants