BridgeBench
Security
Model Analysis

Grok 4.3

openrouter/x-ai/grok-4.3

Overall score: 81.9
Visible: 82.2%
Hidden: 78.5%

Tasks: 30
Passed: 15
Failed: 15
Avg latency: 74,909 ms
Total cost: $0.5988

Domain Performance

Domain                Tasks  Score
Sanitization              6   97.7
Auth & Session            7   68.0
Access Control            5   97.4
Detection & Analysis      9   77.6
Traffic Protection        1   99.5
Crypto Utils              2   54.8

All Task Results

Task                              Domain                Score
sec-oauth-state-validator         Auth & Session          9.5
sec-ssrf-detector                 Detection & Analysis    9.5
sec-crypto-utils                  Crypto Utils           10.0
sec-jwt-validator                 Auth & Session         27.7
sec-password-strength             Auth & Session         49.0
sec-auth-log-anomaly-detector     Detection & Analysis   62.2
sec-secret-detector               Detection & Analysis   69.5
sec-vulnerability-scanner         Detection & Analysis   81.8
sec-input-sanitizer               Sanitization           87.3
sec-csp-nonce-validator           Detection & Analysis   88.3
sec-refresh-token-rotation        Auth & Session         91.0
sec-sql-injection-detector        Detection & Analysis   92.2
sec-abac-rule-engine              Access Control         94.2
sec-permission-checker            Access Control         94.2
sec-csp-parser                    Detection & Analysis   96.1
sec-access-control-engine         Access Control         99.5
sec-api-key-scope-checker         Access Control         99.5
sec-cookie-policy-validator       Auth & Session         99.5
sec-csrf-token-manager            Auth & Session         99.5
sec-dependency-risk-classifier    Detection & Analysis   99.5
sec-encryption-pipeline           Crypto Utils           99.5
sec-insecure-config-scanner       Detection & Analysis   99.5
sec-rate-limit-engine             Traffic Protection     99.5
sec-safe-redirect-builder         Sanitization           99.5
sec-session-fixation-detector     Auth & Session         99.5
sec-tenant-isolation-checker      Access Control         99.5
sec-url-sanitizer                 Sanitization           99.5
sec-file-upload-validator         Sanitization          100.0
sec-hostname-allowlist-validator  Sanitization          100.0
sec-html-entity-encoder           Sanitization          100.0

30 tasks · Sorted by score (lowest first) · Hidden = adversarial edge case pass rate
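
The page does not state how task scores are aggregated. The sketch below checks one plausible reading: each domain score is the unweighted mean of its task scores, and the overall score is the unweighted mean over all 30 tasks. That aggregation rule is an assumption, and `RESULTS`, `domain_means`, and `overall_mean` are illustrative names, not part of BridgeBench. The rows are copied from the task results listed above.

```python
from collections import defaultdict

# (task, domain, score) rows copied from the task results on this page.
RESULTS = [
    ("sec-oauth-state-validator", "Auth & Session", 9.5),
    ("sec-ssrf-detector", "Detection & Analysis", 9.5),
    ("sec-crypto-utils", "Crypto Utils", 10.0),
    ("sec-jwt-validator", "Auth & Session", 27.7),
    ("sec-password-strength", "Auth & Session", 49.0),
    ("sec-auth-log-anomaly-detector", "Detection & Analysis", 62.2),
    ("sec-secret-detector", "Detection & Analysis", 69.5),
    ("sec-vulnerability-scanner", "Detection & Analysis", 81.8),
    ("sec-input-sanitizer", "Sanitization", 87.3),
    ("sec-csp-nonce-validator", "Detection & Analysis", 88.3),
    ("sec-refresh-token-rotation", "Auth & Session", 91.0),
    ("sec-sql-injection-detector", "Detection & Analysis", 92.2),
    ("sec-abac-rule-engine", "Access Control", 94.2),
    ("sec-permission-checker", "Access Control", 94.2),
    ("sec-csp-parser", "Detection & Analysis", 96.1),
    ("sec-access-control-engine", "Access Control", 99.5),
    ("sec-api-key-scope-checker", "Access Control", 99.5),
    ("sec-cookie-policy-validator", "Auth & Session", 99.5),
    ("sec-csrf-token-manager", "Auth & Session", 99.5),
    ("sec-dependency-risk-classifier", "Detection & Analysis", 99.5),
    ("sec-encryption-pipeline", "Crypto Utils", 99.5),
    ("sec-insecure-config-scanner", "Detection & Analysis", 99.5),
    ("sec-rate-limit-engine", "Traffic Protection", 99.5),
    ("sec-safe-redirect-builder", "Sanitization", 99.5),
    ("sec-session-fixation-detector", "Auth & Session", 99.5),
    ("sec-tenant-isolation-checker", "Access Control", 99.5),
    ("sec-url-sanitizer", "Sanitization", 99.5),
    ("sec-file-upload-validator", "Sanitization", 100.0),
    ("sec-hostname-allowlist-validator", "Sanitization", 100.0),
    ("sec-html-entity-encoder", "Sanitization", 100.0),
]


def domain_means(rows):
    """Group scores by domain and return the unweighted mean per domain."""
    buckets = defaultdict(list)
    for _task, domain, score in rows:
        buckets[domain].append(score)
    return {domain: sum(scores) / len(scores) for domain, scores in buckets.items()}


def overall_mean(rows):
    """Unweighted mean over every task (assumed aggregation, see lead-in)."""
    return sum(score for _task, _domain, score in rows) / len(rows)


if __name__ == "__main__":
    for domain, mean in domain_means(RESULTS).items():
        print(f"{domain}: {mean:.1f}")
    print(f"overall: {overall_mean(RESULTS):.1f}")
```

Under this assumption the computed means match the six reported domain scores and the 81.9 overall score to one decimal place. Because a plain mean is used, a single near-zero task weighs heavily in small domains: Crypto Utils has only two tasks, so the 10.0 on sec-crypto-utils pulls the domain down to 54.8 despite a 99.5 on sec-encryption-pipeline.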