feat(provider): add Yutori provider support for Navigator model#355
Open
dhruvbatra wants to merge 1 commit into
Open
feat(provider): add Yutori provider support for Navigator model#355dhruvbatra wants to merge 1 commit into
dhruvbatra wants to merge 1 commit into
Conversation
Adds Yutori as a model provider via the standard OpenAICompatibleAPI pattern. Navigator's Chat Completions API uses `max_completion_tokens` and rejects `max_tokens`, so the provider silently remaps the openbench `--max-tokens` flag. Navigator-only request fields (`tool_set`, `disable_tools`, `json_schema`) are exposed via `-M` and forwarded as `extra_body`. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
8096d5a to
e4fd32f
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds Yutori as a model provider so openbench can target Yutori's Navigator computer-use model family. The Navigator API is OpenAI-compatible Chat Completions, so the new
YutoriAPIplugs into Inspect AI's standardOpenAICompatibleAPImachinery — same pattern as Baseten, SiliconFlow, etc.; closely modeled on #269.Navigator is a screenshot-driven vision/computer-use model, not a text reasoner. This PR is just the provider plumbing — the natural pairing is a browser-use or vision benchmark (bring your own with
bench eval path/to/eval.py, or wait for a future built-in).What are you adding?
Changes Made
YutoriAPI(OpenAICompatibleAPI)insrc/openbench/model/_providers/yutori.py.@modelapi(name="yutori")registration in_registry.py;ProviderType.YUTORI+ entry inPROVIDER_CONFIGS.README.md; provider row + "Yutori-Specific Configuration Options" subsection indocs/providers.mdx.tests/test_yutori_provider.py.Navigator-specific behavior in the provider
max_completion_tokens(the modern OpenAI Chat Completions field) and rejectsmax_tokens. The provider'scompletion_params()silently remaps openbench's--max-tokensflag ontomax_completion_tokensso callers don't need to know about the rename. An explicit-M max_completion_tokens=Nwins over the remapped value.-Mkwargs and forwarded asextra_body:tool_set,disable_tools,json_schema(matching the public Yutori SDK surface).Testing
pytest tests/test_yutori_provider.py— 21 passedpytest— full non-integration suite passingpre-commit run --all-files— ruff, mypy, registry-imports all greeninspect_ai.get_model("yutori/n1.5-latest")resolves toYutoriAPI, picks upYUTORI_API_KEYandYUTORI_BASE_URLoverrides correctlyChecklist