Microsoft’s multi-agent AI system tops Anthropic’s Mythos on cybersecurity benchmark Microsoft's new vulnerability-scanning system, codenamed MDASH, scored 88.45% on the CyberGym benchmark, surpassing single-model systems from Anthropic and OpenAI by using more than 100 specialized AI ... Published: 2026-05-13