chore: clean up internal phase markers and reorganize scripts
The repo had grown two ingestion pipelines (PDF prose corpus and XSD
schema graph) without making the duality obvious. Header comments
referenced internal "Phase N" planning vocabulary that doesn't help
public readers, and a tool-output line shipped a forward reference
to a future phase to every MCP caller.
Reorganization:
  scripts/ingest/ -> scripts/ingest-pdf/ (was ambiguous)
  scripts/ingest-xsd/ stays
  scripts/fetch-xsd.ts -> scripts/ingest-xsd/fetch.ts (sibling layout)
  scripts/sync-sources.ts -> scripts/sources-sync.ts (verb-style name)
  scripts/ingest-pdf/extract-pdf.py -> extract.py
  db/migrations/0003_phase3_metadata.sql -> 0003_xsd_metadata.sql
  scripts/ingest-xsd/smoke.ts removed (debug-only, low value)
Renamed npm scripts to match the new directory layout:
  ingest -> pdf:ingest
  ingest:chunk -> pdf:chunk
  ingest:embed -> pdf:embed
  ingest:upload -> pdf:upload
  ingest:setup -> pdf:setup
  db:sync-sources -> sources:sync
  xsd:smoke removed
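
For anyone updating local tooling or CI that still calls the old script names, the renames above can be captured as a lookup table. This is only a sketch: the old and new names come from the list above, while `RENAMED_SCRIPTS` and `migrateScriptName` are hypothetical helpers that do not exist in the repo.

```typescript
// Old -> new npm script names, copied from the rename list above.
// `null` marks a script that was removed with no replacement.
// (RENAMED_SCRIPTS and migrateScriptName are illustrative names, not repo code.)
const RENAMED_SCRIPTS = new Map<string, string | null>([
  ["ingest", "pdf:ingest"],
  ["ingest:chunk", "pdf:chunk"],
  ["ingest:embed", "pdf:embed"],
  ["ingest:upload", "pdf:upload"],
  ["ingest:setup", "pdf:setup"],
  ["db:sync-sources", "sources:sync"],
  ["xsd:smoke", null],
]);

// Returns the new name, `null` if the script was removed,
// or the input unchanged if it is not part of this rename.
function migrateScriptName(name: string): string | null {
  if (!RENAMED_SCRIPTS.has(name)) {
    return name;
  }
  return RENAMED_SCRIPTS.get(name) ?? null;
}
```

A caller would treat a `null` result as "delete this invocation" rather than substituting another script.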
Strip "Phase N" markers from migration headers, source-file headers,
test-file headers, and inline comments. None of those references were
load-bearing; they were artifacts of the planning doc.
Drop the user-facing "_behavior notes: none yet (Phase 5)._" line that
shipped in every children/attributes/enum tool response. The line gave
no information when notes are absent and exposed an internal phase
label to the public.
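
The intended behavior can be sketched as follows: emit a notes section only when there are notes, and emit nothing otherwise. The function name `appendBehaviorNotes`, its signature, and the "Behavior notes:" heading are assumptions for illustration; the actual tool-response code in the MCP server may be structured differently.

```typescript
// Hypothetical helper: appends a behavior-notes section to a tool response
// only when notes exist. With no notes, the response is returned as-is,
// so no placeholder line reaches the caller.
function appendBehaviorNotes(
  responseLines: readonly string[],
  notes: readonly string[],
): string[] {
  if (notes.length === 0) {
    return [...responseLines];
  }
  return [
    ...responseLines,
    "Behavior notes:",
    ...notes.map((note) => `- ${note}`),
  ];
}
```

Returning a copy in both branches keeps the helper free of side effects on the caller's array.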
Replace the lone PLAN.md reference in scripts/ingest-xsd/ingest.ts
with self-contained context. PLAN.md is gitignored; pointing at it
was a broken link for anyone reading the public repo.
Add scripts/ingest-pdf/README.md and scripts/ingest-xsd/README.md so
each pipeline is documented at the level that contributors land at,
and refresh CLAUDE.md to make the two corpora explicit and surface
both flavors of MCP tools.
Tests: 41 passed / 0 failed across db / ingest / mcp-server.