Model Analysis
GPT-5.5
openai/gpt-5.5
85.3
overall score
86.7% visible
81.9% hidden
Tasks
30
Passed
12
Failed
18
Avg latency
58135ms
Total cost
$3.8779
Domain Performance
Sanitization6 tasks
96.3
Auth & Session7 tasks
83.5
Access Control5 tasks
97.9
Detection & Analysis9 tasks
79.5
Traffic Protection1 tasks
87.8
Crypto Utils2 tasks
52.6
All Task Results
| Task | Domain | Score | Correct | Hidden | Latency | |
|---|---|---|---|---|---|---|
| sec-crypto-utils | Crypto Utils | 9.5 | 0 | 0 | 27010ms | |
| sec-ssrf-detector | Detection & Analysis | 9.5 | 0 | 0 | 227544ms | |
| sec-oauth-state-validator | Auth & Session | 26.6 | 33 | 6 | 50440ms | |
| sec-auth-log-anomaly-detector | Detection & Analysis | 62.2 | 33 | 82 | 86572ms | |
| sec-password-strength | Auth & Session | 66.9 | 100 | 31 | 51297ms | |
| sec-secret-detector | Detection & Analysis | 69.5 | 67 | 67 | 66049ms | |
| sec-file-upload-validator | Sanitization | 81.8 | 67 | 92 | 13488ms | |
| sec-csp-nonce-validator | Detection & Analysis | 87.8 | 100 | 75 | 24361ms | |
| sec-rate-limit-engine | Traffic Protection | 87.8 | 100 | 75 | 101971ms | |
| sec-abac-rule-engine | Access Control | 94.2 | 100 | 89 | 40889ms | |
| sec-refresh-token-rotation | Auth & Session | 95.3 | 100 | 91 | 110157ms | |
| sec-vulnerability-scanner | Detection & Analysis | 95.6 | 100 | 92 | 90246ms | |
| sec-encryption-pipeline | Crypto Utils | 95.7 | 100 | 92 | 19281ms | |
| sec-csp-parser | Detection & Analysis | 96.1 | 100 | 92 | 22416ms | |
| sec-sql-injection-detector | Detection & Analysis | 96.1 | 100 | 92 | 35862ms | |
| sec-input-sanitizer | Sanitization | 96.8 | 100 | 93 | 118614ms | |
| sec-permission-checker | Access Control | 96.8 | 100 | 94 | 42191ms | |
| sec-cookie-policy-validator | Auth & Session | 97.1 | 100 | 95 | 41350ms | |
| sec-access-control-engine | Access Control | 99.5 | 100 | 100 | 20506ms | |
| sec-api-key-scope-checker | Access Control | 99.5 | 100 | 100 | 40824ms | |
| sec-csrf-token-manager | Auth & Session | 99.5 | 100 | 100 | 35315ms | |
| sec-dependency-risk-classifier | Detection & Analysis | 99.5 | 100 | 100 | 18777ms | |
| sec-insecure-config-scanner | Detection & Analysis | 99.5 | 100 | 100 | 27493ms | |
| sec-jwt-validator | Auth & Session | 99.5 | 100 | 100 | 108106ms | |
| sec-safe-redirect-builder | Sanitization | 99.5 | 100 | 100 | 82000ms | |
| sec-session-fixation-detector | Auth & Session | 99.5 | 100 | 100 | 98352ms | |
| sec-tenant-isolation-checker | Access Control | 99.5 | 100 | 100 | 17391ms | |
| sec-url-sanitizer | Sanitization | 99.5 | 100 | 100 | 59980ms | |
| sec-hostname-allowlist-validator | Sanitization | 100.0 | 100 | 100 | 24360ms | |
| sec-html-entity-encoder | Sanitization | 100.0 | 100 | 100 | 41211ms |
30tasks · Sorted by score (lowest first) · Hidden = adversarial edge case pass rate