fix: polish azd eval output - dataset path, Rows section, Foundry link, terser logs by placerda · Pull Request #297 · Azure/agentops

placerda · 2026-06-10T17:30:19Z

Four user-visible papercuts in execution: azd runs:

report.md showed Dataset: with an empty value. eval.yaml parser only recognized dataset_reference but azd ai agent eval init emits dataset_file. EvalRecipe now accepts dataset_file and _recipe_dataset_path prefers it.
report.md always shipped a header-only ## Rows table (azd does not expose per-row metrics through agentops eval run). The reporter now omits the row sections when rows is empty and emits a ## Per-row breakdown callout linking to the Foundry run.
CLI did not surface the Foundry run URL. normalize_to_results captures report_url from the azd payload; CLI prints a Foundry run: line next to results.json / report.md / latest/.
The Running azd backend log line repeated the full command with absolute Windows paths. Replaced with the short Running azd backend: azd ai agent eval run (full command stays in the failure debug logs added in 0.3.18). The delegating to azd ai agent eval startup line also uses a workspace-relative recipe path.

921 unit tests pass.

…k, terser logs Four user-visible issues with `execution: azd` runs: 1. `report.md` shipped an empty `Dataset:` line because the eval.yaml parser only recognized `dataset_reference:` and `azd ai agent eval init` actually emits `dataset_file:`. EvalRecipe now accepts `dataset_file:` and `_recipe_dataset_path` prefers it. 2. `report.md` shipped a header-only `## Rows` table on every azd run (azd does not expose per-row metrics through `agentops eval run`). The reporter now omits the row sections when `result.rows` is empty and instead emits a `## Per-row breakdown` callout linking to the Foundry run. 3. The CLI did not surface the Foundry run URL. `azd_runner.normalize_to_results` now captures `report_url` from the azd payload and the CLI prints a `Foundry run:` line next to the results paths. 4. `Running azd backend: azd --no-prompt ai agent eval run --config <long absolute path> --output json` was unreadable; replaced with `Running azd backend: azd ai agent eval run` (full command stays in the failure debug logs added in 0.3.18). The `delegating to azd ai agent eval` startup line also uses a workspace-relative recipe path. 921 unit tests pass. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

placerda merged commit 63180c2 into develop Jun 10, 2026
12 checks passed

placerda deleted the feature/azd-report-polish branch June 10, 2026 17:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: polish azd eval output - dataset path, Rows section, Foundry link, terser logs#297

fix: polish azd eval output - dataset path, Rows section, Foundry link, terser logs#297
placerda merged 1 commit into
developfrom
feature/azd-report-polish

placerda commented Jun 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

placerda commented Jun 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant