A synthetic order stream is evaluated against the rules parsed from the client IPS in Tab 02. Five independent evaluators — four open-weight LLMs targeting distinct rule classes plus a non-LLM statistical anomaly layer — vote on each transaction. The CCO controls the consensus threshold ($\tau$). Block decisions derive from the parsed JSON, not hardcoded labels.
| ID | ACCOUNT / DESK | ORDER DETAILS | LATENCY | RESOLUTION |
|---|
Paste or edit a Client Investment Policy Statement. The parser extracts structured execution boundaries that the arbitration engine in Tab 01 enforces against every order. The output JSON is the same object the gate evaluates — there is no separate hardcoded rule list.
Target architecture: raw order flow is mirrored off the OMS into an isolated on-prem GPU cluster, where four open-weight LLMs and a non-LLM statistical layer evaluate every proposed trade against the parsed IPS rules. No order or PII ever crosses a commercial API boundary.
OPENAI_API_KEY set on Vercel; if missing, the front-end falls back
to the deterministic parser and labels the state./api/parse-ips; next is multi-shot extraction +
per-rule citation back to the IPS span.Demo readiness, lane lock-in, and the path to a first warm-logo pilot. Premature corporate formalities are deferred — we are optimizing for a defensible artifact and three named lanes by end of the Sunday call.
Standard enterprise compliance applies binary rules to probabilistic markets — that’s why legacy stacks both over-block and leak. Systems thinking circumvents single-agent fragility.
Four open-weight LLMs (Gemma 3 4B, DeepSeek V3, Qwen 2.5 Math 7B, Llama 3.1 8B Instruct) plus a non-LLM statistical anomaly layer. Each evaluates from a distinct angle: ticker/sector, leverage, concentration, strategy, statistical outlier. On-prem only — no order flow crosses a commercial API boundary.
Models vote independently; the CCO controls $\tau$. Balanced (3-of-5) is the institutional default. Ultra-Conservative (5-of-5) minimizes false alarms on high-volume desks. High Recall (any-of-5) is a break-glass for known-fragile regimes. Threshold is a policy lever, not a baked heuristic.
Kernel-density estimation over rolling per-desk features — volume, realized vol, structure (skew, margin ratio, basket composition). Tail-region observations are flagged as drift candidates even when no hard rule is broken.
We sell capital preservation, not “AI.” Every dollar claim has to survive a CCO whiteboard. Each intercepted breach has three cost streams:
Default assumptions (all editable in code, displayed in Tab 01 KPI panel):
Legacy compliance data is degraded and non-standardized. Statisticker is forward-facing: it mirrors live order flow against forward rules, explicitly bypassing historic data cleanup in phase one.
A strategy engineer, a JD in compliance, and JD the marketer, working together to make this happen with equal measure. The phasing below is what is actually shippable in a part-time-Mash + part-time-Persia + JD-on-demand cadence. The original 30-day deck implied a full-time team; this doesn’t.