Skip to content

feat(anthropic-intelligence): bootstrap KB + automation + first HTML-vs-Markdown dossier#1

Open
Klangschalen wants to merge 5 commits into
mainfrom
claude/anthropic-knowledge-base-EXRN2
Open

feat(anthropic-intelligence): bootstrap KB + automation + first HTML-vs-Markdown dossier#1
Klangschalen wants to merge 5 commits into
mainfrom
claude/anthropic-knowledge-base-EXRN2

Conversation

@Klangschalen
Copy link
Copy Markdown
Owner

Summary

Bootstraps a continuously-updated knowledge base on Anthropic, Claude Code, and AI engineering best practices, structured as the anthropic-intelligence/ subtree of this repo.

Three commits:

  1. Structure & sourcesanthropic-intelligence/{README,CADENCE,sources.yaml}, engineer profiles (Boris Cherny, Erik Schluntz, Sholto Douglas, Cat Wu, Thariq Shihipar, Simon Willison), scaffolding for topics/ and monitoring/.
  2. Automation.github/workflows/anthropic-weekly-scan.yml (Mon 06:00 UTC, no secrets) and anthropic-monthly-synthesis.yml (1st 06:00 UTC, needs ANTHROPIC_API_KEY), backed by scripts/fetch_sources.py (RSS aggregation) and scripts/synthesize.py (Claude API topic re-synthesis).
  3. First dossiertopics/output-formats-html-vs-markdown.md synthesizing the May 2026 shift after Thariq Shihipar's "Unreasonable Effectiveness of HTML" (Anthropic, Claude Code team). Token economics, 20 use-case examples, counter-arguments, and concrete recommendations mapped to klangschalen/* repos.

Cadence (recommended)

When What Workflow
Mon 06:00 UTC Weekly source scan anthropic-weekly-scan.yml (no secrets)
1st 06:00 UTC Monthly topic synthesis anthropic-monthly-synthesis.yml (needs ANTHROPIC_API_KEY)
Ad-hoc Manual via workflow_dispatch both workflows

Before merging

  • Add repo secret ANTHROPIC_API_KEY under Settings → Secrets and variables → Actions
  • Optionally set repo variable ANTHROPIC_MODEL (defaults to claude-opus-4-7)
  • Verify the TODO: verify handles in anthropic-intelligence/engineers/*.md
  • Review the dossier status (currently draft — Twitter source was rate-limited during research)

Test plan

  • Manually dispatch anthropic-weekly-scan from the Actions tab; confirm a PR is opened with a non-empty monitoring/logs/2026-19.md
  • After adding ANTHROPIC_API_KEY, manually dispatch anthropic-monthly-synthesis with topic=output-formats-html-vs-markdown; confirm the dossier is updated with last_synthesized: <today> (or unchanged if no log signal)
  • Spot-check that the cron expressions trigger at the expected times in the next 7 days

Follow-ups (not in this PR)

  • /new-repo-bootstrap skill in klangschalen/claude-config to streamline future repo creation + allowlist updates
  • Adopt dogum/html-artifacts skill pattern in klangschalen/claude-config
  • Add the three HTML-artifact prompt templates to klangschalen/agent-templates
  • Codify HTML-vs-Markdown carve-out rules in klangschalen/engineering-principles

Generated by Claude Code

…er profiles

Adds the foundation for a continuously-updated knowledge base on Anthropic
and Claude Code, structured as a subtree of klangschalen/knowledge.

- README + CADENCE rationale (weekly scan, monthly synthesis)
- sources.yaml: prioritized source registry (P0/P1/P2)
- engineers/: profiles of Boris Cherny, Erik Schluntz, Sholto Douglas,
  Cat Wu, Thariq Shihipar (Anthropic) + Simon Willison (external)
- topics/, monitoring/ scaffolding
…kflows

GitHub Actions cron pipeline:
- anthropic-weekly-scan.yml (Mon 06:00 UTC) runs fetch_sources.py over
  sources.yaml, writes monitoring/logs/YYYY-WW.md, opens PR
- anthropic-monthly-synthesis.yml (1st 06:00 UTC) runs synthesize.py over
  last 4 weekly logs, updates topics/*.md via Claude API, opens PR

Both workflows are manually dispatchable. Monthly synthesis requires the
ANTHROPIC_API_KEY repo secret; weekly scan needs no secret.

Python scripts use only feedparser + PyYAML + anthropic SDK to stay small.
… output format

Synthesizes the May 2026 shift after Thariq Shihipar's (Anthropic Claude Code)
"Unreasonable Effectiveness of HTML" post:
- Thesis & token economics (incl. measured ~2-4× output cost)
- 9 use-case categories where HTML wins, with the 20 Thariq examples
- Carve-outs where Markdown still wins (READMEs, Slack, LLM-to-LLM, etc.)
- Counter-arguments (Kurtis Redux, HN, security, reviewability)
- Concrete recommendations for our systems: agent prompts, skill design,
  knowledge base, content engine, Filament dashboards
- Action items mapped to klangschalen/* repos

Marked status: draft — Twitter source (Thariq's original) was rate-limited
during research; pending direct re-fetch on next monthly synthesis.
… W20 probe

First weekly-scan probe run (executed locally with stdlib XML parsing since
the sandbox couldn't build feedparser's sgmllib3k dep). Real findings:

- claude-code releases feed works → verified=true (v2.1.133-2.1.139 in 5 days)
- Simon Willison Atom feed works → verified=true
- Anthropic news /news/rss.xml returns 404 → feed=null, marked scrape-only
- Reddit /.rss returns 403 to default UA → feed=null, needs UA rotation

The fetch_sources.py script in the workflow will need a UA header to make
the Reddit feeds work; tracked as a follow-up.
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b541a795b3

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

PRIORITY: ${{ github.event.inputs.priority || 'P2' }}
run: |
cd anthropic-intelligence
WEEK=$(date -u +%Y-%V)
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1 Badge Use ISO week-year for weekly log filenames

The scanner names logs with date -u +%Y-%V, which mixes calendar year (%Y) with ISO week number (%V). On Mondays at year boundaries (for example, 2024-12-30 is ISO week 2025-01), this produces 2024-01 and can overwrite an earlier 2024-01.md log from January, causing data loss and breaking chronological ordering for monthly synthesis.

Useful? React with 👍 / 👎.


by_source[source["id"]] = new_entries
# Update state — keep last 200 guids per source to bound growth.
merged = list(seen_guids) + [e["guid"] for e in new_entries]
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Keep deterministic GUID history before trimming to 200

The dedup state is converted to a set (seen_guids) and then back to a list before truncation, so merged[-200:] keeps an arbitrary subset of old GUIDs. Once the source has many entries, this non-deterministic retention can drop recently seen items and re-surface them as "new" in later runs, creating noisy duplicate alerts.

Useful? React with 👍 / 👎.

Eigenstaendiges HTML-Konzeptdokument als Session-Review. Geht drei
Schritte zurueck und arbeitet auf:

- Session-Timeline (was wurde getan)
- die vier Konzept-Achsen (Repo-Struktur, Automatisierung, Quellen,
  Output-Form) je mit Vor-/Nachteilen, Bewertungs-Scores und Entscheidung
- Sammeltabelle aller 8 getroffenen Entscheidungen mit Status-Badges
  (geerbt vom Branch vs. in dieser Session entschieden - sauber getrennt)
- Bewertungs-Matrix der Hauptkonzepte (Einfachheit/Korrektheit/Langfristig)
- drei moegliche Wege vorwaerts (Subtree dauerhaft / Subtree-Split /
  Merge mit engineering-principles)
- 12 offene Fragen fuer den Bau in Claude Code, in 4 Bloecke gruppiert,
  als aufklappbare details-Elemente
- Markdown-Copy-Block zur Weiterverwendung

Sound-Spirit-Farben, keine externen Abhaengigkeiten, durchgehend echte
Umlaute. Ablage unter concept/, da es das Architektur-Dokument der KB ist.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant