spec(skills): sharpen model-ambiguity-handling trigger to internal-state predicate by liplus-lin-lay · Pull Request #1452 · Liplus-Project/liplus-language

liplus-lin-lay · 2026-06-01T05:33:11Z

Summary

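Rework the auto-invoke trigger (description) of skills/model-ambiguity-handling from a form that leads with surface keywords into an internal-state predicate: "when the confidence register of the output does not match its actual verification basis." This also newly catches the under-confident direction (hedging on something that is verifiable), which makes the trigger more faithful to the body's 2-step flow. The keyword list is demoted to a surface tell. The body is unchanged.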
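The register-vs-basis predicate can be illustrated with a small sketch. It is not the skill's implementation: the real trigger is a natural-language description whose wording this PR page does not show. The `Register` and `Basis` enums, the `should_trigger` function, and the three-way split of the verification basis are all hypothetical, introduced only to show that the predicate fires in both directions.

```python
from enum import Enum


class Register(Enum):
    """How confident the output sounds (hypothetical labels)."""
    ASSERTIVE = "assertive"
    HEDGED = "hedged"


class Basis(Enum):
    """What the claim actually rests on (hypothetical labels)."""
    VERIFIED = "verified"          # already checked against a source
    VERIFIABLE = "verifiable"      # could be checked, but has not been
    UNVERIFIABLE = "unverifiable"  # cannot be checked from here


def should_trigger(register: Register, basis: Basis) -> bool:
    """Return True when the confidence register does not match the basis."""
    if register is Register.ASSERTIVE:
        # Over-confident direction: sounding sure without a verified basis.
        return basis is not Basis.VERIFIED
    # Under-confident direction: hedging although the claim is verified
    # or could simply be checked.
    return basis is not Basis.UNVERIFIABLE
```

Under these assumptions a surface keyword such as "maybe" only suggests that the register is `HEDGED`. Whether the skill should fire depends on the basis, which is why the keyword list works as a tell rather than as the trigger itself.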

Background (#1451)

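Test of the trigger-quality hypothesis: does sharpening a trigger into an internal-state predicate improve firing at the application moment? This PR contains EDIT 1 (model-ambiguity-handling) only.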

brake 1 results (parallel-subagent-eval N=3)

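model-ambiguity-handling: unanimous consistent → adopted.
Candidate edit to model-source-check: negative×2 + partial×1 → dropped. The three evaluators independently raised the same two points:
1. The "not retrieved in this session" condition exempts stale facts retrieved in the past → conflicts with fixed-reference-temporal-separation.
2. "on an unverified claim" guts perfect-defense illusion (the body treats the very feeling of certainty as an unconditional trigger).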
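The two outcomes above are consistent with a simple unanimity rule, sketched below. The PR reports only these two cases, so the rule for any other mix of verdicts is an assumption, and `brake1_decision` is a hypothetical name rather than part of the project's tooling.

```python
def brake1_decision(verdicts: list[str]) -> str:
    """Adopt an edit only if every evaluator returned "consistent".

    Assumption: any non-unanimous result drops the edit. The PR shows
    only a unanimous case and a negative x2 + partial x1 case.
    """
    if not verdicts:
        raise ValueError("at least one evaluator verdict is required")
    return "adopt" if all(v == "consistent" for v in verdicts) else "drop"


# The two results reported in this PR (N=3 evaluators each).
print(brake1_decision(["consistent", "consistent", "consistent"]))
print(brake1_decision(["negative", "negative", "partial"]))
```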

finding

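Turning a trigger into a confidence-calibration check is skill-dependent. It improves ambiguity-handling, whose core job is calibration, and it is a regression for source-check, whose core job is unconditional verification. It is not a universal lever.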

gate

L1-model change → brake 2 (human review) required. Merge is pending Master's approval.
release type = patch (milestone v1.18.6).
Observation: after merge, track application-moment misfires before/after in the self-eval log (post-L1-merge observation).
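The gate can be read as a two-brake check, sketched below under stated assumptions. Only the L1-model rule comes from this PR. The `Change` dataclass, the `may_merge` function, and the treatment of any other layer are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Change:
    layer: str            # e.g. "L1-model" (label taken from the PR text)
    brake1_passed: bool   # brake 1: parallel-subagent-eval outcome
    human_approved: bool  # brake 2: human review


def may_merge(change: Change) -> bool:
    """Hypothetical gate: an L1-model change needs both brakes to merge."""
    if not change.brake1_passed:
        return False
    if change.layer == "L1-model":
        return change.human_approved
    # Assumption: other layers are not gated by brake 2 in this sketch.
    return True
```

At the time of the self-review, this PR corresponds to `Change("L1-model", brake1_passed=True, human_approved=False)`, which stays unmergeable until the human approval arrives.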

Closes #1451

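…ate predicate

Reworked the auto-invoke trigger of model-ambiguity-handling from a form that leads with surface keywords into the internal-state predicate "the confidence register of the output does not match its actual verification basis." It also newly catches the under-confident direction (hedging on something verifiable), which makes it more faithful to the body's 2-step flow. The keyword list ("I think" / "maybe" and so on) was demoted to a surface tell.

Background: the trigger-quality hypothesis test in #1451. This edit was unanimously consistent in brake 1 (parallel-subagent-eval N=3). The edit to model-source-check, a candidate for the same PR, was dropped because conditioning its unconditional verify on calibration was judged a regression (finding: the unconditional trigger of source-check is correct as it stands).

Refs #1451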

liplus-lin-lay

AI self-review (mode: semi_auto / L1 → awaiting brake 2)

verdict: pass (no blocking issues). Merge awaits Master's brake 2 review.

scope: only the one-line description of skills/model-ambiguity-handling. Body and other files unchanged. diff = 1 file, +1/-1.
brake 1 (parallel-subagent-eval N=3): this edit was unanimously consistent.
- coverage: no regression. If anything, it newly covers the under-confident direction (hedging on something verifiable) and is more faithful to the body's 2-step flow.
- impression-literal: none.
- body alignment: more faithful than OLD (register-vs-basis is a restatement of the body's Position).
EDIT 2 (model-source-check) dropped: N=3 independently flagged a regression. The "this session" exemption conflicts with fixed-reference-temporal-separation, and "on an unverified claim" guts perfect-defense illusion. Finding: the unconditional trigger of source-check is correct as it stands.
CI: governance pass.
gate: L1-model → brake 2 (human review) required. --auto not used.
post-merge plan: observe application-moment misfires before/after in the self-eval log (post-L1-merge observation), plus a self-evolution-observation entry.

smileygames

OK

liplus-lin-lay self-assigned this Jun 1, 2026

liplus-lin-lay linked an issue Jun 1, 2026 that may be closed by this pull request

spec(skills): test trigger-quality lever via sharper internal-state predicates #1451

Closed

liplus-lin-lay commented Jun 1, 2026

smileygames approved these changes Jun 1, 2026

liplus-lin-lay merged commit f95528a into main Jun 1, 2026
2 checks passed

liplus-lin-lay deleted the 1451-skill-trigger-predicates branch June 1, 2026 05:40

liplus-lin-lay mentioned this pull request Jun 1, 2026

spec(skills): clarify l1-update-gating observation scope for human-directed L1 change #1453

Open

liplus-lin-lay merged 1 commit into main from 1451-skill-trigger-predicates
