When Low Vectara and High "AA-Omniscience" Mapped to Claude Refusal Issues: A Production Case Study
https://dibz.me/blog/choosing-a-model-when-hallucinations-can-cause-harm-a-facts-benchmark-case-study-1067
How a mid-market SaaS found a mismatch between summarization scores and refusal behavior

In January 2025, we were operating a B2B knowledge platform with 120,000 monthly active users and a real-time summarization feature