AI coding tools have enabled a flood of bad code that threatens to overwhelm many projects. Building new features is easier ...
OpenAI and Paradigm have released EVMbench—a framework for evaluating AI agents' ability to find vulnerabilities in Ethereum smart contracts.
OpenAI and Paradigm unveil EVMbench, a benchmark testing AI agents on smart contract security across 120 high-severity vulnerabilities.
Self-hosted agents execute code with durable credentials and process untrusted input. This creates dual supply chain risk, ...
OpenAI's EVMbench tests AI on smart contract security. Claude Opus 4.6 ranked first, beating GPT-5 and Gemini 3 Pro across 120 real crypto vulnerabilities.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results