Solidity Security Skill Benchmark

6 audit skills + a no-skill baseline on their ability to find known vulnerabilities. Headline: 27 core_subset evals, source-only single pass, ranked by micro-recall. Tooling A/B and multi-pass are separate experiments (see FINDINGS.md).