
examples: add local ART-E email search task #679

Open

Vinzz2303 wants to merge 2 commits into OpenPipe:main from Vinzz2303:add-local-art-e-email-example

Conversation

@Vinzz2303

Summary

  • Adds a lightweight local ART-E email search example under examples/art_e
  • Includes deterministic email scenarios, search/read helpers, answer scoring, a rollout loop, and a training entrypoint (command parsing sketched after this list)
  • Adds unit tests for the local scenario helpers and links the example from the root README
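
To make the command protocol concrete, here is a minimal sketch of how a model reply carrying a <search>, <read>, or <answer> tag with a JSON payload might be parsed. The names (parse_command, COMMAND_RE) are illustrative assumptions, not the exact identifiers used in examples/art_e:

```python
# Hypothetical sketch of tag-plus-JSON command parsing; the real helper
# in examples/art_e/scenarios.py may differ in names and details.
import json
import re
from typing import Optional

COMMAND_RE = re.compile(r"<(search|read|answer)>(.*?)</\1>", re.DOTALL)

def parse_command(reply: str) -> Optional[tuple[str, dict]]:
    """Extract the first <search>/<read>/<answer> tag and its JSON payload."""
    match = COMMAND_RE.search(reply)
    if match is None:
        return None  # no recognized command in this turn
    name, body = match.group(1), match.group(2).strip()
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return None  # payload must be valid JSON
    return name, payload
```

For example, parse_command('<search>{"query": "invoice"}</search>') returns ("search", {"query": "invoice"}), while a reply with no tag or invalid JSON yields None.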

Motivation

The README points users to the ART-E email-search agent, but the main repo did not include a small local example that can be inspected without external email infrastructure. This adds a portable starter task for experimenting with ART-style email search rollouts before scaling to the full ART-E setup.

What changed

  • examples/art_e/scenarios.py: local inbox fixtures, search/read helpers, JSON command parsing, and reward scoring (helper shapes sketched after this list)
  • examples/art_e/rollout.py: multi-turn ART trajectory using <search>, <read>, and <answer> commands
  • examples/art_e/train.py: LocalBackend training loop for the ART-E style task
  • examples/art_e/README.md: quickstart and file overview
  • tests/test_art_e_example.py: helper-level tests for search, read, scoring, and command parsing
  • README.md: adds the local ART-E example to the notebooks/examples table
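
As a rough sketch of the helper surface described above, the following shows shapes these pieces might take; the dataclasses, function names, and exact-match reward are assumptions for illustration rather than the PR's actual code:

```python
# Hypothetical fixtures and helpers mirroring the described scenarios.py
# surface; names and the binary reward are illustrative assumptions.
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class Email:
    id: str
    subject: str
    body: str

@dataclass
class Scenario:
    question: str
    answer: str
    inbox: list[Email] = field(default_factory=list)

def search_inbox(scenario: Scenario, query: str, limit: int = 5) -> list[str]:
    """Return the ids of emails whose subject or body contains the query."""
    q = query.lower()
    return [e.id for e in scenario.inbox
            if q in e.subject.lower() or q in e.body.lower()][:limit]

def read_email(scenario: Scenario, email_id: str) -> Optional[Email]:
    """Look up a single email by id, or return None if it does not exist."""
    return next((e for e in scenario.inbox if e.id == email_id), None)

def score_answer(scenario: Scenario, answer: str) -> float:
    """Binary reward: 1.0 on an exact case-insensitive match, else 0.0."""
    return 1.0 if answer.strip().lower() == scenario.answer.lower() else 0.0
```

Deterministic fixtures plus a binary reward keep the scoring fully checkable in unit tests.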

Testing

The tests were not run locally because this Windows environment does not currently have python or uv on PATH. The added tests are intentionally small and should run with the repo test suite once dependencies are installed.

@Vinzz2303 (Author)

I pushed an update to strengthen this ART-E example PR.

New additions:

  • Offline evaluation harness for the local ART-E task
  • Deterministic baseline over all scenarios
  • Tests confirming the offline evaluator scores the fixtures correctly
  • README instructions for running the no-model evaluation

This should make the example easier to review because the task contract can be checked without API keys or training infrastructure.
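
As a sketch of such a no-model pass, here is a deterministic baseline built on the hypothetical Scenario helpers sketched earlier; evaluate_baseline and its fixed search-then-answer policy are illustrative assumptions, not the evaluator's actual code:

```python
# Illustrative offline pass: a fixed, model-free policy exercises the
# search/read/score contract and reports mean reward over the fixtures.
def evaluate_baseline(scenarios: list[Scenario]) -> float:
    """Run a deterministic search-then-answer policy over every scenario."""
    if not scenarios:
        return 0.0
    total = 0.0
    for scenario in scenarios:
        # Naive policy: search on the question's first word, read the
        # top hit, and answer with that email's body.
        first_word = (scenario.question.split() or [""])[0]
        hits = search_inbox(scenario, first_word)
        email = read_email(scenario, hits[0]) if hits else None
        guess = email.body if email else ""
        total += score_answer(scenario, guess)
    return total / len(scenarios)  # mean reward over the fixtures
```

Because the policy is fixed, the resulting mean reward is stable across runs and can be asserted in tests without API keys or training infrastructure.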

Note: I still could not run Python locally because this Windows environment does not have python/uv in PATH, but git diff --check passes.

