Sunscreen Testing Harness

This project uses a tiered validation model so fast CI stays deterministic while real Solana validation remains explicit.

Tiers

| Tier | Purpose | Command |
| --- | --- | --- |
| Offline deterministic | fmt, clippy, feature gates, full Rust tests, binary smoke | `bash scripts/integration-heavy.sh` |
| Generated workspace compile | Cargo-check generated workspaces and scaffolded programs | `SUNSCREEN_COMPILE_TESTS=1 bash scripts/integration-heavy.sh` |
| Real Anchor/Codama | Anchor/Solana/Codama/pnpm/node against ignored integration tests | `SUNSCREEN_REAL_TOOLCHAIN=1 bash scripts/integration-heavy.sh` |
| Pinocchio SBF | Real Solana SBF build for Pinocchio workspaces | `SUNSCREEN_PINOCCHIO_SBF=1 bash scripts/integration-heavy.sh` |
| Serve runtime | Surfpool or `solana-test-validator`, watcher, NDJSON events, teardown | manual tier in `.claude/skills/sunscreen-test-harness/SKILL.md` |
| Plugin runtime | Lifecycle, sandbox, stdio JSON-RPC, gRPC contract, marketplace | `cargo test --locked --test app_lifecycle -- --nocapture` |
| Frontend codegen | Generated React/Solid hooks and typecheck when JS dependencies are installed | `SUNSCREEN_FRONTEND_COMPILE_TESTS=1 bash scripts/integration-heavy.sh` |
| Release distribution | Release binary and `cargo dist plan` | `SUNSCREEN_DIST=1 bash scripts/integration-heavy.sh` |
| Flake/perf | Repeat CLI smoke and run cold-start bench separately | `SUNSCREEN_FLAKE_RUNS=5 bash scripts/integration-heavy.sh` |

False Green Policy

Ignored or gated tests do not count as real coverage when they skip because a tool is missing. A real-toolchain run must prove that `anchor`, `solana`, `solana-test-validator`, `pnpm`, `node`, `codama`, `cargo`, and `rustc` were available before accepting `tests/integration_anchor.rs` as executed.

The fake-toolchain integration tests remain valuable: they prove CLI contracts, JSON/NDJSON shapes, sandbox behavior, plugin runtime boundaries, and command-group flows without depending on external network or local Solana installs. They do not replace the real toolchain gate.
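The real-toolchain precondition above could be enforced with a small preflight check. The following is a minimal sketch, not the harness's actual implementation: the function names `missing_tools` and `require_real_toolchain` are hypothetical, and only the tool list comes from this document.

```bash
#!/usr/bin/env bash
# Hypothetical preflight sketch; not taken from scripts/integration-heavy.sh.

# Print each named tool that is not on PATH, one per line.
missing_tools() {
  local tool
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Return 0 only when every tool named in the False Green Policy is present.
require_real_toolchain() {
  local missing
  missing="$(missing_tools anchor solana solana-test-validator pnpm node codama cargo rustc)"
  if [ -n "$missing" ]; then
    printf 'blocked: missing tools: %s\n' "$(printf '%s' "$missing" | tr '\n' ' ')" >&2
    return 1
  fi
}
```

Under these assumptions, a real-toolchain run would call `require_real_toolchain` first and record the tier as blocked, rather than passed, when it returns non-zero.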

Orchestrator Output

Every `scripts/integration-heavy.sh` run writes:

- `heavy-<timestamp>.log` with the full command stream.
- `heavy-<timestamp>.summary.json` with top-level status, exit code, log path, and per-tier owner, status, command, evidence, and next_action.

The test-harness-orchestrator must read the newest summary before reporting. This keeps the team honest about which tiers passed, skipped, failed, or were blocked by missing tools.
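One way to locate the newest summary is to sort by modification time, which avoids assuming a particular `<timestamp>` format. This is a sketch only: the helper names are hypothetical, and the output directory is passed in as an argument because this document does not say where the files are written.

```bash
#!/usr/bin/env bash
# Hypothetical helpers; the summary directory is an assumption supplied by the caller.

# Print the most recently modified heavy-*.summary.json in a directory.
# Prints nothing when no summary exists.
newest_summary() {
  local dir="${1:-.}"
  # ls -t sorts by modification time, newest first.
  ls -t "$dir"/heavy-*.summary.json 2>/dev/null | head -n 1
}

# Fail loudly when there is no summary, so a report is never
# written without evidence from an actual run.
require_newest_summary() {
  local summary
  summary="$(newest_summary "$1")"
  if [ -z "$summary" ]; then
    echo "no heavy-*.summary.json found in ${1:-.}; nothing to report" >&2
    return 1
  fi
  printf '%s\n' "$summary"
}
```

Refusing to report when no summary is found follows the same reasoning as the False Green Policy: an absent result should read as blocked, not as a pass.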

Harness Team

The durable harness lives in:

.claude/agents/test-strategist.md
.claude/agents/test-harness-orchestrator.md
.claude/agents/offline-ci-owner.md
.claude/agents/real-anchor-codama-owner.md
.claude/agents/pinocchio-sbf-owner.md
.claude/agents/serve-runtime-owner.md
.claude/agents/plugin-runtime-qa.md
.claude/agents/frontend-codegen-owner.md
.claude/agents/release-distribution-qa.md
.claude/agents/flake-perf-auditor.md
.claude/skills/sunscreen-test-harness/SKILL.md
.agents/skills/sunscreen-test-harness/SKILL.md

Use sunscreen-test-harness whenever the task is to validate the app with heavy integration, real toolchain, release QA, stress, or anti-flake coverage.
