feat: add computer use (desktop control) for openai, codex, and anthropic #34

Merged
jbj338033 merged 11 commits into main from worktree-computer-use
Jun 9, 2026
@jbj338033

Collaborator

Summary

Adds computer use to goat: the model can take screenshots and drive the mouse/keyboard on the local desktop. Delivered as a built-in Computer function tool (cross-platform via enigo + xcap), gated behind an opt-in config flag. Screenshots flow back as multimodal tool results, which required broadening the tool/content layer to carry images. The exact wire format (function tool + function_call_output with input_image) was validated live against the Codex backend.
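The gating and the screenshot round trip described above can be sketched as follows. This is a minimal illustration, not the code from this PR: `Config`, `computer_use`, `ComputerAction`, `authorize`, and `screenshot_output` are hypothetical names, and the JSON field layout (`call_id`, an `output` array holding one `input_image` with a data-URL `image_url`) is an assumption based on the wire format named in the summary. The real tool would capture the screen with xcap and drive input with enigo; both are left out here so the sketch needs only the standard library.

```rust
/// Hypothetical opt-in flag; the PR does not name the real config key.
#[derive(Debug, Default)]
struct Config {
    /// Defaults to `false`, so computer use stays off until enabled.
    computer_use: bool,
}

/// Hypothetical subset of the actions the Computer tool might accept.
#[derive(Debug, Clone, PartialEq)]
enum ComputerAction {
    Screenshot,
    Click { x: i32, y: i32 },
    TypeText(String),
}

/// Refuse every action unless the user has opted in via the config flag.
fn authorize(cfg: &Config, action: ComputerAction) -> Result<ComputerAction, String> {
    if cfg.computer_use {
        Ok(action)
    } else {
        Err(format!("computer use is disabled; refusing {action:?}"))
    }
}

/// Build a `function_call_output` item whose output is a single `input_image`.
/// Assumes `call_id` contains no characters that need JSON escaping and that
/// `png_base64` is already base64-encoded PNG data.
fn screenshot_output(call_id: &str, png_base64: &str) -> String {
    format!(
        r#"{{"type":"function_call_output","call_id":"{call_id}","output":[{{"type":"input_image","image_url":"data:image/png;base64,{png_base64}"}}]}}"#
    )
}
```

With the flag left at its default, `authorize(&Config::default(), ComputerAction::Screenshot)` returns an error, which is the behavior an opt-in gate should have. A production version would build the JSON with a serializer rather than `format!`, so that arbitrary call IDs are escaped correctly.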

Verification

  • cargo fmt --all -- --check — passed
  • cargo clippy --workspace --all-targets -- -D warnings — passed
  • cargo test --workspace — passed (57 suites)
  • live Codex API probe: model reads the screenshot and emits a correctly-shaped click action

@jbj338033 merged commit 0ab199b into main on Jun 9, 2026
7 of 10 checks passed
@jbj338033 deleted the worktree-computer-use branch on June 9, 2026, 23:43