:PROPERTIES:
:ID: 4b44cf43-6106-4498-81a3-b23ebb25dabf
:END:
#+title: trmn/llm
#+filetags: :project: :knowledge: :llm:

** Purpose
NOT a user-facing chat feature. This is infrastructure for *automatic procedural dialogue* between characters in the show.
The chat panel was a test harness only.
Future use: generate spoken lines for characters as part of the deterministic life-simulation loop.

** Architecture
In-browser LLM weights bundled into the site via ~nix build~. No backend, no external inference at runtime.

Model: ~roneneldan/TinyStories-1M~, GPT-Neo architecture, ~model_type: "gpt_neo"~, hidden=64, vocab=50257 (GPT-2 BPE tokenizer).
Output is surreal/nonsensical children's story prose, intentionally acceptable for this project.

Final ONNX INT8 size: *15 MB* (well under Cloudflare Pages 25 MiB per-file limit).

Runtime stack:
- ~onnxruntime-web~ loaded as a global ~