完成 Agent 执行闭环:任务持久化/确定性 Planner+RAG 权限/脱敏/审计日志/Dashboard 增强,新增隐藏场景测试,…#26
Open
Aapolaris wants to merge 1 commit into
Open
完成 Agent 执行闭环:任务持久化/确定性 Planner+RAG 权限/脱敏/审计日志/Dashboard 增强,新增隐藏场景测试,…#26Aapolaris wants to merge 1 commit into
Aapolaris wants to merge 1 commit into
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
完成能力点
queued/running到completed/failed,并持久化运行结果、错误、事件和 token 成本。answer、citations、filtered_doc_ids,并按用户权限过滤受限文档。vendor_secret、unit_cost_usd、内部合同信息、调试字段和原始异常堆栈进入 API 响应、事件、审计日志或协作日志。COLLABORATION_LOG.md:记录需求理解、环境问题、AGENTS.md 历史备注复核、根因分析、兼容影响、验证结果和剩余风险。核心设计思路和重要取舍
redact()函数统一处理工具输出、run result、run events、audit logs 和 API 响应中的敏感字段。本地验证命令和结果
py scripts/self_check.pypylauncher。python scripts/self_check.pypython为 Python 3.6,不满足项目要求。D:\python3.11\python.exe -m venv .venv.\.venv\Scripts\python.exe -m pip install -e ".[dev]".\.venv\Scripts\python.exe scripts\self_check.py4 passed, 1 warning.\.venv\Scripts\python.exe -m pytest -q4 passed, 6 xpassed, 1 warning审查报告中的新增隐藏场景测试
已知未完成项或风险
BackgroundTasks和现有 SQLite 状态模型。test_acceptance_guidance.py仍保留原始xfail标记,因此完成后表现为 XPASS;这是测试标记问题,不影响功能。API、数据结构或测试兼容影响
sku、warehouse、stock_gap、forecast_units_next_14d、supplier_risk、citations、recommended_action、按条件返回approval_draft_id。seq、type、tool_name、payload、created_at。average_run_seconds、recent_failures、queue_health、permission_denial_count。tasks、runs、run_events、audit_logs、knowledge_chunks。tests/test_hidden_scenarios.py可作为泛化能力验证;若仅作本地审查材料,可不纳入 PR。协作者与 AI 使用披露
Codex/huangwenli。COLLABORATION_LOG.md 关键结论摘要
pylauncher,默认 Python 为 3.6;最终使用 Python 3.11 创建.venv并完成验证。