feat(api): playground messages dispatch + prompts.post accepts template_messages (Plan B) by gaurav0107 · Pull Request #12 · tracebility-ai/tracebility

gaurav0107 · 2026-06-07T16:39:33Z

Summary

Plan B of the playground messages redesign. The api can now accept,
render, persist, and dispatch a list of typed messages. Prompts can be
saved as the structured shape too.

Both legacy fields (raw_template on POST /v1/playground/runs,
template on POST /v1/prompts/{id}/versions) keep working for one
release of back-compat; cleanup PR after Plan C lands.

What changed

Migration 0027 — playground_session.rendered_messages jsonb NOT NULL with non-empty array CHECK; backfill old rows as [{role: human, content: <rendered_prompt>}].
_render_messages — sister to the now-removed _render_template; per-message {{ var }} substitution, missing vars render as "" per spec decision 9, returns a fresh list. Shares _coerce_var_value helper for json/string coercion.
_to_dispatch_messages — bridges prompt-side human to dispatch-side user; _PROMPT_TO_DISPATCH_ROLE is the single source of truth (typed Mapping[Literal, Literal]).
PlaygroundCreate.raw_messages — new structured field with strict xor validator across prompt_version_id / raw_template / raw_messages. Empty list rejected via Field(min_length=1). Zero-vs-many error messages split.
Playground POST handler — reads structured messages end-to-end: resolve → render → persist rendered_messages jsonb AND rendered_prompt (newline-joined view) → dispatch via the role-mapped list. Old _resolve_template and _render_template deleted.
PromptVersionCreate.template_messages — same xor + to_messages() resolver. Handler writes BOTH template (legacy, derived) AND template_messages jsonb so rows satisfy migration 0026's NOT NULL constraint. Latent gap closed as a side effect.
No-op short-circuit — re-saving an identical version returns the existing row at HTTP 200; no duplicate row, no audit entry.
_derive_legacy_template helper — single source of truth for the `template` text derivation rule, called by both create-write and read-hydrate paths so the response shape can never diverge.

Files

schemas/postgres/migrations/0027_playground_session_rendered_messages.sql
services/api/tracebility_api/routers/playground.py — render helpers, role mapping, request model, handler
services/api/tracebility_api/routers/prompts.py — request model, no-op short-circuit, derived-template helper
5 new unit-test files: test_playground_render_messages.py, test_playground_role_mapping.py, test_playground_create_validation.py, test_playground_resolve_messages.py, test_prompt_version_create_validation.py

Test plan

Migration 0027 dry-run + apply on local docker postgres
uv run pytest services/api/tests/unit — 73 passed (60 prior + 13 new)
uv run pytest services/api/tests/integration/test_prompts_template_messages.py — clean
CI green
Merge — Plan A's cluster already has 0026; 0027 is additive
Migrator job runs cleanly on next deploy (skips already-applied migrations)

Deferred

HTTP-level e2e test against the live api stack (test_playground_messages_e2e.py, test_prompts_post_template_messages.py) — depends on container health + dev-login auth, brittle in CI. Comparison-contract pinned by unit tests; can land in a follow-up if the unit coverage proves insufficient.

Companion docs

Spec: docs/superpowers/specs/2026-06-07-playground-messages-redesign-design.md
Plan: docs/superpowers/plans/2026-06-07-playground-B-api.md (gitignored)

🤖 Generated with Claude Code

Adds a structured rendered form alongside the existing rendered_prompt text. Trace replay and re-dispatch read the structured column so the message list round-trips byte-for-byte; the human-readable text stays for the trace UI's existing display path. Backfills old rows as [{role: human, content: <rendered_prompt>}] so the column can be NOT NULL. Companion to docs/superpowers/specs/2026-06-07-playground-messages-redesign-design.md. Signed-off-by: gaurav0107 <gauravdubey0107@gmail.com>

Sister to _render_template - applies {{ var }} substitution per message body. Missing variables render as empty string (spec decision 9) so the user can iterate without the renderer fighting them; non-strings serialize via json.dumps for parity with the legacy single-string path. Returns a fresh list; never mutates input. Used by the next-task playground POST handler when the body carries raw_messages. Signed-off-by: gaurav0107 <gauravdubey0107@gmail.com>

The two surfaces use different role vocabularies (LangSmith: system / human vs. provider: system / user / assistant / tool). One small mapper bridges them; system passes through. Used in the next task by the playground POST handler. The role translation lives in a single dict so adding ai / tool support later (spec decision 2 deferral) is a one-line change. Signed-off-by: gaurav0107 <gauravdubey0107@gmail.com>

Adds the structured request shape; the legacy raw_template field stays for one release of back-compat. A strict xor validator across prompt_version_id / raw_template / raw_messages enforces "exactly one template source per request" - zero or more than one is now a 422 instead of a 400 from the handler. Empty raw_messages is rejected too so the renderer never sees a zero-length list. The handler-level "at least one" check is removed; the model validator is the single source of truth for this contract. Signed-off-by: gaurav0107 <gauravdubey0107@gmail.com>

The handler now reads raw_messages (or template_messages from a saved prompt, or wraps a legacy raw_template), Jinja-renders each turn's content against the variables dict, persists rendered_messages jsonb alongside the human-readable rendered_prompt, and dispatches the mapped (human -> user) message list to the LiteLLM gateway verbatim. The legacy raw_template path is preserved by wrapping it as a single human message before render - no behavior change for old web clients. The xor validator on PlaygroundCreate (Plan B Task 4) guarantees exactly one template source per request, so the resolver is a straight-line three-branch dispatch with no precedence rules to remember. The old _resolve_template / _render_template single-string pipeline is deleted - rendered_messages and rendered_prompt (newline-joined view) are the canonical pair from here on. The integration test deferred from this commit is a wire-level e2e against the local docker stack; the unit tests on _resolve_messages, _render_messages, _to_dispatch_messages, and the xor validator cover the request-path logic without depending on container health. Signed-off-by: gaurav0107 <gauravdubey0107@gmail.com>

The structured request shape (template_messages: list[Message]) is the preferred field; the legacy 'template: str' stays for one release of back-compat and wraps to a single human message internally. A strict xor validator across the two fields enforces "exactly one source per request" - zero or both is now a 422. The handler now writes both columns (template_messages jsonb + template text, derived) so the row satisfies migration 0026's NOT NULL constraint on the new column. The latent gap where create_version omitted template_messages is closed as a side effect. Adds the no-op short-circuit: if the new messages match the most recent version byte-for-byte (compared via model_dump), return that existing row with HTTP 200 instead of creating a duplicate. Saves a row per accidental save and matches the spec's rule. Signed-off-by: gaurav0107 <gauravdubey0107@gmail.com>

gaurav0107 added 6 commits June 7, 2026 20:13

gaurav0107 merged commit 2aa1987 into main Jun 7, 2026
3 checks passed

gaurav0107 mentioned this pull request Jun 7, 2026

feat(web): playground composer for typed messages + Save flow (Plan C) #13

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(api): playground messages dispatch + prompts.post accepts template_messages (Plan B)#12

feat(api): playground messages dispatch + prompts.post accepts template_messages (Plan B)#12
gaurav0107 merged 6 commits into
mainfrom
feat/playground-messages-api

gaurav0107 commented Jun 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

gaurav0107 commented Jun 7, 2026

Summary

What changed

Files

Test plan

Deferred

Companion docs

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant