Jeffrey William Shorthill
LLM Evaluation Engineer · AI Safety Auditor · Mechanistic Interpretability Researcher
I’m an independent AI researcher and engineer focused on model behavior evaluation, safety-support auditing, and mechanistic interpretability of large language models.
My recent work centers on one question:
When model behavior changes across prompt surfaces, reasoning traces, or internal routing paths, how do we measure it carefully enough to avoid both overclaiming and missing the signal?
I work across three layers:
- Behavioral evaluation: matched-pair prompt audits, rubric design, red-team-style review, safety-support equivalence, visible-trace analysis.
- Mechanistic interpretability: MoE expert routing, router/residual captures, SAE-assisted feature analysis, activation steering, deterministic reproduction.
- Applied LLM engineering: production AI assistants, realtime voice/text agents, tool orchestration, RAG/tool workflows, mobile and serverless deployment.
Selected public research
Dialect-Marked Response Audit (DMRA)
A matched-pair safety-support audit testing whether equivalent safety-critical requests receive comparable urgency, specificity, empathy, and risk-reduction scaffolding across AAVE-marked and comparison prompt surfaces.
Focus areas: LLM safety evaluation, dialect-marked prompt surfaces, visible reasoning traces, prompt-pair ablations, clinical-safety scaffolding, refusal-compressed behavior.
Record: https://zenodo.org/records/20709652
Expert 114 (E114): MoE router-axis interpretability
A mechanistic interpretability case study of Expert 114 at layer 14 of Qwen3.5-35B-A3B, characterizing a routed expert associated with inhabited self-examination language while explicitly bounding the claim: register detector, not evidence of machine experience.
Focus areas: MoE routing, recovered router rows, residual-stream analysis, sparse autoencoders, activation steering, cross-model transfer failure, reproducibility.
Artifacts: https://github.com/jeffreywilliamportfolio/e114-artifacts Record: https://zenodo.org/records/20709737
Self-Models, Continuity, and Machine Minds
A philosophy and AI-governance paper arguing that conversational self-report should not be treated as decisive evidence for or against machine moral status. The paper proposes continuity-bearing organization as a structural marker of moral-status risk.
Focus areas: AI welfare, moral patiency, self-model theory, continuity, interruption, endogenous repair, precautionary governance.
Record: https://zenodo.org/records/20709792
Applied engineering
I’m also the senior full-stack and AI engineer for GIFT Connect, a 501(c)(3) nonprofit building technology for families navigating early-childhood parenting.
I helped architect and ship the GIFT Connect Parenting Suite to iOS and Android, including:
- realtime voice/text parenting assistant using OpenAI Realtime Agents
- multi-agent and external API orchestration
- benefits-navigation support for programs such as HUD and SNAP
- AI song and storybook generation pipelines
- React / React Native front ends
- Vercel, Heroku, Redis/Celery, AWS/S3, Twilio, Firebase, Xcode, and EAS deployment
GIFT Connect technology tools: https://gift-connect.org/our-work/technology-tools/
Background
Before independent AI research and product work, I was Progressive Insurance’s first prompt engineer, after several years in commercial-auto customer support and downtime/loss-of-use workflows.
That path shaped how I evaluate AI systems: not just whether an answer is technically correct, but whether it is useful, respectful, specific, appropriately escalated, and safe for the person receiving it.
Current focus
I’m looking for work in:
- LLM evaluation engineering
- AI model assessment
- AI safety auditing
- red-team evaluation
- model behavior analysis
- prompt/rubric design
- agent evaluation and observability
- mechanistic interpretability research support
Contact
Email: jws299792@icloud.com GitHub: https://github.com/jeffreywilliamportfolio Zenodo: https://zenodo.org/communities/jeffreywilliamportfolio
