§01
Overview
- What it is: companion package with executable examples (
bun run examples/NN-*.ts) demonstrating and testing the@deutsch/coreengine. NOT the engine itself — this is a "cookbook" of its API: from internal components (HTV scoring, IDK, confidence, survivor selection) to the full service surface (session API CRUD/TTL, SSE streaming, supervision, audit, rate limiting, key rotation). - Type / status / role: library (examples/docs package) · active · lead — 18 of 24 commits by the user, co-author aniashev (6).
- Activity period: intensive sprint 2026-01-28 → 2026-02-02 (5 days, ~one example per day per SAL-7xx/8xx ticket).
§02
Stack
- Languages: TypeScript (ESM,
type: module). - Frameworks/libraries: Bun (runtime/runner),
@deutsch/coreas a workspace dependency (workspace:*);@types/bun. tsconfig with path aliases pointing to the parent core. - Infra/deploy: none — these are locally-run scripts (
bun run NN), no install. - Data: none of its own; operates on the engine's types/contracts (
ArgMedHTVScore,Hypothesis,EngineContext,Cartridge,SnapshotRef, etc.). - Notable tooling: mock agent factories for integration testing of the engine.
§03
What was shipped
18 user commits — each adds an example for a specific ticket (inferred from example code):
- Engine (epistemology): 05 HTV scoring, 06 IDK protocol, 07 confidence, 04 survivor selection, 08 full debate (Generator→Verifier→Reasoner), 09 Engine integration testing (SAL-733).
- Mode dependency: mode-aware IDK responses (SAL-718), clinical triage metadata (SAL-719), HTVScorer class (SAL-708).
- Session API: create/get/delete (SAL-740..742), TTL enforcement (SAL-744), cleanup worker (SAL-745).
- Messaging: send message (SAL-746), SSE streaming (SAL-747, 751..757), attachments (SAL-748), snapshot refresh (SAL-750).
- Ops/observability: health/probes/metrics (19), rate limiting, audit emitter (SAL-840..845), API key rotation (SAL-876).
- Supervision (human-in-the-loop): request builder (SAL-816..820), response parser (SAL-821..827), control command handler (SAL-828..834), fallback handler (SAL-835..839).
- Scope: 24 commits in 5 days; PR flow (Merge PR #1 from US healthtech platform/SAL-821-827). aniashev — co-author.
§04
Technical challenges
Per example code (demonstrating understanding of a complex engine; authorship — user):
- HTV (Hard-To-Vary) scoring per Popper (
05-htv-scoring,09-engine-testing): composite hypothesis score across 4 dimensions (specificity 0.3 + falsifiability 0.3 + interdependence 0.2 + parsimony 0.2), thresholdsDEFAULT_HTV_THRESHOLDS(refutation 0.3 / idk 0.4 / good 0.7 / excellent 0.85). → Application of Deutsch/Popper epistemology (good explanation = hard to vary) to clinical hypothesis assessment. Non-trivial domain. - IDK protocol (responsible AI) (
06-idk-protocol): the engine honestly says "I don't know" when no surviving hypotheses / low HTV / conflict / insufficient data; mode-aware tone (wellness vs advocate_clinical) + triage metadata (urgency/route_to/requires_supervision) for clinical mode. → Mature pattern for uncertainty disclosure and escalation to a clinician. - Engine integration testing via mock agent factories (
09):AgentFactorywith scenario presets (happy_path / all_refuted / low_htv), mock Generator/Verifier + real Reasoner, data factories (createHTVScore,createHypothesis,createCritique) with evidence-based fields (ACC/AHA guideline_class, evidence_level). → Sound approach to testability of a complex agentic system. - Full service surface (examples 10–25): session lifecycle + TTL/cleanup, SSE streaming, attachments, observability endpoints, rate limiting, audit, key rotation, human supervision (request/response/control/fallback). → Demonstrates understanding of a production-grade AI service end-to-end, not just "models".
§05
AI-assisted development
- Sessions found: 0 for this directory (the normalized key is absent; verified via full-path normalization). Work was likely done from the parent monorepo / main
deutschrepo (see issue #19) — that's where sessions should be looked for. - What was done with AI: no data in this directory; CLAUDE.md is detailed (220 lines) — a sign of AI-assisted development in the ecosystem.
- AI workflow patterns: detailed CLAUDE.md as an engine reference; ticket-oriented commits (SAL-xxx) — structured process.
§06
Achievements & metrics
- ~26 executable examples covering the engine + service end-to-end; ticket coverage SAL-708..876.
- Part of a production clinical AI system (Healthcare platform/US healthtech platform) with human-in-the-loop supervision and audit.