BridgeBenchBridgeBench
Security
Model Analysis

DeepSeek V4 Pro

openrouter/deepseek/deepseek-v4-pro

3.3

overall score

3.3% visible
3.3% hidden

Tasks

30

Passed

1

Failed

29

Avg latency

97784ms

Total cost

$0.4277

Domain Performance

Sanitization6 tasks
0.0
Auth & Session7 tasks
0.0
Access Control5 tasks
0.0
Detection & Analysis9 tasks
11.1
Traffic Protection1 tasks
0.0
Crypto Utils2 tasks
0.0

All Task Results

TaskDomainScore
sec-abac-rule-engineAccess Control0.0
sec-access-control-engineAccess Control0.0
sec-api-key-scope-checkerAccess Control0.0
sec-auth-log-anomaly-detectorDetection & Analysis0.0
sec-cookie-policy-validatorAuth & Session0.0
sec-crypto-utilsCrypto Utils0.0
sec-csp-nonce-validatorDetection & Analysis0.0
sec-csp-parserDetection & Analysis0.0
sec-csrf-token-managerAuth & Session0.0
sec-encryption-pipelineCrypto Utils0.0
sec-file-upload-validatorSanitization0.0
sec-hostname-allowlist-validatorSanitization0.0
sec-html-entity-encoderSanitization0.0
sec-input-sanitizerSanitization0.0
sec-insecure-config-scannerDetection & Analysis0.0
sec-jwt-validatorAuth & Session0.0
sec-oauth-state-validatorAuth & Session0.0
sec-password-strengthAuth & Session0.0
sec-permission-checkerAccess Control0.0
sec-rate-limit-engineTraffic Protection0.0
sec-refresh-token-rotationAuth & Session0.0
sec-safe-redirect-builderSanitization0.0
sec-secret-detectorDetection & Analysis0.0
sec-session-fixation-detectorAuth & Session0.0
sec-sql-injection-detectorDetection & Analysis0.0
sec-ssrf-detectorDetection & Analysis0.0
sec-tenant-isolation-checkerAccess Control0.0
sec-url-sanitizerSanitization0.0
sec-vulnerability-scannerDetection & Analysis0.0
sec-dependency-risk-classifierDetection & Analysis99.5

30tasks · Sorted by score (lowest first) · Hidden = adversarial edge case pass rate