Is your most capable AI agent also your biggest data leak?
A benchmark from Microsoft and Huazhong University tested GPT-4o, GPT-5, Grok-3, and other models on realistic enterprise data scenarios. Privacy violation rates hit 50.9%. More capable models made the problem worse, and the fix has nothing to do with model selection...