Fix Python remote eval parameter serialization in dev mode by evanmkeith · Pull Request #210 · braintrustdata/bt

ekeith (evanmkeith) · 2026-05-27T21:03:50Z

Summary

Fixes Python bt eval --dev /list responses for evals that define parameters={...}.

The Python eval runner was emitting parameters as a raw JSON Schema object via parameters_to_json_schema(...). The Braintrust UI expects remote eval parameters to use the serialized parameter container shape, such as braintrust.staticParameters or braintrust.parameters. As a result, /list could return 200 OK while the UI still failed to parse the evaluator manifest and showed the generic connection/listing error.

This updates the Python runner to serialize parameters with the same remote eval parameter container shape used by the Python SDK devserver/push flow.

Test Plan

Add a Python list-mode fixture with a Pydantic parameter model.
Assert BT_EVAL_DEV_MODE=list emits parameters.type = "braintrust.staticParameters".
Run:
- python3 -m py_compile scripts/eval-runner.py tests/evals/py/remote_list_params/eval_remote_list_params.py
- cargo fmt --check
- cargo test eval_python_runner_list_mode_serializes_remote_parameter_container --test eval_fixtures -- --nocapture

## Summary Fixes Python `bt eval --dev` `/list` responses for evals that define `parameters={...}`. The Python eval runner was emitting parameters as a raw JSON Schema object via `parameters_to_json_schema(...)`. The Braintrust UI expects remote eval parameters to use the serialized parameter container shape, such as `braintrust.staticParameters` or `braintrust.parameters`. As a result, `/list` could return `200 OK` while the UI still failed to parse the evaluator manifest and showed the generic connection/listing error. This updates the Python runner to serialize parameters with the same remote eval parameter container shape used by the Python SDK devserver/push flow. ## Test Plan - Add a Python list-mode fixture with a Pydantic parameter model. - Assert `BT_EVAL_DEV_MODE=list` emits `parameters.type = "braintrust.staticParameters"`. - Run: - `python3 -m py_compile scripts/eval-runner.py tests/evals/py/remote_list_params/eval_remote_list_params.py` - `cargo fmt --check` - `cargo test eval_python_runner_list_mode_serializes_remote_parameter_container --test eval_fixtures -- --nocapture`

github-actions · 2026-05-27T21:13:33Z

Latest downloadable build artifacts for this PR commit 1f4618877fd0:

Workflow run: https://github.com/braintrustdata/bt/actions/runs/26538607588
Download all artifacts (GitHub CLI): gh run download 26538607588 --repo braintrustdata/bt
Installers are published from main automatically. To publish one for a PR branch, run release-canary manually via workflow_dispatch.

Available artifact names

``artifacts-build-global
``artifacts-build-local-aarch64-pc-windows-msvc
``artifacts-build-local-x86_64-apple-darwin
``artifacts-build-local-x86_64-pc-windows-msvc
``artifacts-build-local-x86_64-unknown-linux-musl
``artifacts-build-local-aarch64-apple-darwin
``artifacts-build-local-x86_64-unknown-linux-gnu
``artifacts-build-local-aarch64-unknown-linux-gnu
``artifacts-plan-dist-manifest
``cargo-dist-cache

Spencer Seale (spencerseale) · 2026-06-03T19:46:48Z

This is blocking on passing params to the playground from remote evals. Subscribed for when this ships

Abhijeet Prasad (AbhiPrasad) · 2026-06-04T15:33:51Z

please rebase and we can get this merged in and released!

Joshua Wootonn (joshuawootonn)

The rest of it looks good though, and for most recent SDK installers they will not hit this branch.

Joshua Wootonn (joshuawootonn) · 2026-06-04T15:39:49Z

+    static_parameters: dict[str, Any] = {}
+    properties = schema.get("properties", {})
+    if isinstance(properties, dict):
+        for name, property_schema in properties.items():
+            if not isinstance(property_schema, dict):
+                continue
+
+            parameter_type = property_schema.get("x-bt-type")
+            if parameter_type == "prompt":
+                parameter: dict[str, Any] = {"type": "prompt"}
+            elif parameter_type == "model":
+                parameter = {"type": "model"}
+            else:
+                parameter = {"type": "data", "schema": property_schema}
+
+            if "default" in property_schema:
+                parameter["default"] = property_schema["default"]
+            if isinstance(property_schema.get("description"), str):
+                parameter["description"] = property_schema["description"]
+            static_parameters[name] = parameter


I'm not sure why this section is necessary. Looking to understand more.

This is unnecessary.

If you are using python SDK >= v0.10.0 serialize_remote_eval_parameters_container will be defined and the playground can handle the container type created for parameters

If you are using python SDK < v0.10.0 serialize_remote_eval_parameters_container will not be defined but parameters_to_json_schema won't actually create a JSON schema. In that old SDK version it was creating the object of JSON schemas that the playground also support.

So there's not a way of hitting this endpoint in which the new version of parameters_to_json_schema is called and returns an actual JSON schema. Which is the error that will cause an error in the playground?

Joshua Wootonn (joshuawootonn) · 2026-06-04T15:42:25Z

This PR is pretty similar to something I shipped last week: #206

Joshua Wootonn (joshuawootonn)

Ok I'm realizing that this is almost the same PR I shipped last week, which is why rebasing is necessary. #206

I failed to check in on what the releasing policy is for this repo / for bt CLI so I don't think it's shipped yet. Sorry for dropping the ball everyone!

I would like us to avoid shipping this PR. I think we just need to get a new version dropped.

ekeith (evanmkeith) · 2026-06-04T17:33:41Z

already fixed here

ekeith (evanmkeith) requested a review from Ankur Goyal (ankrgyl) May 27, 2026 21:03

ekeith (evanmkeith) requested review from Ken Jiang (knjiang) and Parker Henderson (parkerhendo) and removed request for Ankur Goyal (ankrgyl), Ken Jiang (knjiang) and Parker Henderson (parkerhendo) June 1, 2026 19:48

john (j13huang) requested a review from Joshua Wootonn (joshuawootonn) June 4, 2026 05:03

Abhijeet Prasad (AbhiPrasad) approved these changes Jun 4, 2026

View reviewed changes

Joshua Wootonn (joshuawootonn) approved these changes Jun 4, 2026

View reviewed changes

Joshua Wootonn (joshuawootonn) requested changes Jun 4, 2026

View reviewed changes

ekeith (evanmkeith) closed this Jun 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Python remote eval parameter serialization in dev mode#210

Fix Python remote eval parameter serialization in dev mode#210
ekeith (evanmkeith) wants to merge 1 commit into
mainfrom
05-27-fix-python-remote-eval-parameter-listing

ekeith (evanmkeith) commented May 27, 2026

Uh oh!

github-actions Bot commented May 27, 2026

Uh oh!

Spencer Seale (spencerseale) commented Jun 3, 2026

Uh oh!

Abhijeet Prasad (AbhiPrasad) commented Jun 4, 2026

Uh oh!

Joshua Wootonn (joshuawootonn) left a comment

Uh oh!

Joshua Wootonn (joshuawootonn) Jun 4, 2026

Uh oh!

Joshua Wootonn (joshuawootonn) Jun 4, 2026

Uh oh!

Joshua Wootonn (joshuawootonn) commented Jun 4, 2026

Uh oh!

Joshua Wootonn (joshuawootonn) left a comment

Uh oh!

ekeith (evanmkeith) commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ekeith (evanmkeith) commented May 27, 2026

Summary

Test Plan

Uh oh!

github-actions Bot commented May 27, 2026

Uh oh!

Spencer Seale (spencerseale) commented Jun 3, 2026

Uh oh!

Abhijeet Prasad (AbhiPrasad) commented Jun 4, 2026

Uh oh!

Joshua Wootonn (joshuawootonn) left a comment

Choose a reason for hiding this comment

Uh oh!

Joshua Wootonn (joshuawootonn) Jun 4, 2026

Choose a reason for hiding this comment

Uh oh!

Joshua Wootonn (joshuawootonn) Jun 4, 2026

Choose a reason for hiding this comment

Uh oh!

Joshua Wootonn (joshuawootonn) commented Jun 4, 2026

Uh oh!

Joshua Wootonn (joshuawootonn) left a comment

Choose a reason for hiding this comment

Uh oh!

ekeith (evanmkeith) commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants