BridgeBench
Refactoring
Model Analysis

Grok 4.3

openrouter/x-ai/grok-4.3

Overall score: 69.1

100.0% visible · 99.4% hidden · 82.8% intent

Tasks: 15
Passed: 14
Failed: 1
Avg latency: 32147 ms
Total cost: $0.1326

Cluster Performance

| Cluster | Tasks | Score |
| --- | --- | --- |
| Control Flow | 3 | 79.4 |
| Data Pipelines | 3 | 58.1 |
| State Isolation | 3 | 61.4 |
| Duplication | 3 | 71.2 |
| Modernization | 3 | 75.3 |

All Task Results

| Task | Cluster | Score |
| --- | --- | --- |
| Order Processing Orchestration (refactor-god-function) | Modernization | 42.8 |
| Composable Department Pipeline (refactor-promise-chain) | Data Pipelines | 43.9 |
| Inventory Operations by Handler (refactor-class-to-functions) | State Isolation | 52.4 |
| Guard Clauses for Nested Categorization (refactor-nested-conditionals) | Control Flow | 54.8 |
| ETL Record Transformation (refactor-data-transformer) | Data Pipelines | 58.9 |
| Event Handler Registry (refactor-event-handler) | State Isolation | 59.9 |
| Composable Sales Report Generation (refactor-report-generator) | Duplication | 70.6 |
| Declarative Validation Rules (refactor-validation-rules) | Duplication | 70.6 |
| Student Classification Pipeline (refactor-array-manipulation) | Data Pipelines | 71.4 |
| Transaction Aggregation Without Mutable Accumulators (refactor-global-state) | State Isolation | 71.8 |
| Shared Quarter Ranking Logic (refactor-duplicate-code) | Duplication | 72.3 |
| Named Conversion Constants (refactor-magic-numbers) | Modernization | 86.6 |
| Operation Map Calculator (refactor-calculator) | Control Flow | 86.7 |
| Template Literal Contact Card (refactor-string-concat) | Modernization | 96.6 |
| Lookup Table for Status Text (refactor-switch-to-map) | Control Flow | 96.6 |

15 tasks · Sorted by score (lowest first) · Intent = structural refactor compliance
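
The page does not say how the cluster and overall scores are derived from the per-task scores. The reported figures are consistent with a plain unweighted mean rounded to one decimal place, and the sketch below checks that reading against the numbers listed above. The aggregation rule is an assumption, not a documented BridgeBench formula, and the names `TASK_SCORES`, `cluster_means`, and `overall_score` are hypothetical helpers introduced only for this illustration.

```python
from collections import defaultdict
from statistics import mean

# (cluster, score) pairs copied from the All Task Results listing.
TASK_SCORES = [
    ("Modernization", 42.8),
    ("Data Pipelines", 43.9),
    ("State Isolation", 52.4),
    ("Control Flow", 54.8),
    ("Data Pipelines", 58.9),
    ("State Isolation", 59.9),
    ("Duplication", 70.6),
    ("Duplication", 70.6),
    ("Data Pipelines", 71.4),
    ("State Isolation", 71.8),
    ("Duplication", 72.3),
    ("Modernization", 86.6),
    ("Control Flow", 86.7),
    ("Modernization", 96.6),
    ("Control Flow", 96.6),
]

# Values copied from the Cluster Performance section and the page header.
REPORTED_CLUSTER_SCORES = {
    "Control Flow": 79.4,
    "Data Pipelines": 58.1,
    "State Isolation": 61.4,
    "Duplication": 71.2,
    "Modernization": 75.3,
}
REPORTED_OVERALL = 69.1


def cluster_means(task_scores):
    """Group scores by cluster and return each cluster's mean, rounded to 0.1.

    ASSUMPTION: an unweighted mean per cluster; the page does not document this.
    """
    by_cluster = defaultdict(list)
    for cluster, score in task_scores:
        by_cluster[cluster].append(score)
    return {cluster: round(mean(scores), 1) for cluster, scores in by_cluster.items()}


def overall_score(task_scores):
    """Return the unweighted mean of all task scores, rounded to 0.1.

    ASSUMPTION: every task carries equal weight in the overall score.
    """
    return round(mean(score for _, score in task_scores), 1)


if __name__ == "__main__":
    for cluster, value in cluster_means(TASK_SCORES).items():
        print(f"{cluster}: computed {value}, reported {REPORTED_CLUSTER_SCORES[cluster]}")
    print(f"Overall: computed {overall_score(TASK_SCORES)}, reported {REPORTED_OVERALL}")
```

Because every cluster has exactly three tasks, averaging the five cluster scores and averaging all 15 task scores land on the same rounded value here, so this data alone cannot tell those two definitions of the overall score apart.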