Skip to content

Point Behavioral tests to self-hosted runners#77

Merged
danielholanda merged 6 commits into
mainfrom
dholanda/behavioral_orchestr
Jun 26, 2026
Merged

Point Behavioral tests to self-hosted runners#77
danielholanda merged 6 commits into
mainfrom
dholanda/behavioral_orchestr

Conversation

@danielholanda

@danielholanda danielholanda commented Jun 25, 2026

Copy link
Copy Markdown
Collaborator

Description

This PR moves our behavioral tests to self-hosted infrastructure.

@danielholanda danielholanda self-assigned this Jun 25, 2026
@danielholanda danielholanda changed the title Point to self-hosted runners Point Behavioral tests to self-hosted runners Jun 26, 2026
@danielholanda danielholanda marked this pull request as ready for review June 26, 2026 00:17
@alirekia

Copy link
Copy Markdown
Collaborator

Nice direction overall. Three quick concerns before merge:

  1. Public repo + pull_request on self-hosted means untrusted fork code can execute on those hosts. Suggestion:
  • Keep discovery/lint on ubuntu-latest for all PRs,
  • Gate self-hosted behavioral jobs to trusted authors or a maintainer label (e.g. safe-to-run-self-hosted).
  1. Removing label-gating runs behavioral on every relevant PR increases the token/runner spend.

  2. Windows behavioral matrix is currently red (llm_judge gave no JSON verdict), so this is not merge-ready yet.

@danielholanda

Copy link
Copy Markdown
Collaborator Author

@alirekia Comments addressed offline :)

@danielholanda danielholanda merged commit acd5fe9 into main Jun 26, 2026
10 of 12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants