Accuracy is not just WER.
Seven dimensions of "did the model get it right" that change a buyer's provider choice. Each card maps a real production failure to a measurable test. These probes are defined and wired against speko-bench-cli, but the cards below stay empty until the first run lands. No numbers yet.