BridgeBench
Refactoring
Model Analysis

GPT-5.5

openai/gpt-5.5

Overall score: 65.9

93.3% visible · 92.7% hidden · 76.6% intent

Tasks: 15
Passed: 13
Failed: 2
Avg latency: 19053 ms
Total cost: $0.6735

Cluster Performance

Control Flow (3 tasks): 90.2
Data Pipelines (3 tasks): 64.7
State Isolation (3 tasks): 32.3
Duplication (3 tasks): 62.2
Modernization (3 tasks): 79.8

All Task Results

| Task | Cluster | Score |
|---|---|---|
| Inventory Operations by Handler (refactor-class-to-functions) | State Isolation | 0.0 |
| Event Handler Registry (refactor-event-handler) | State Isolation | 35.3 |
| ETL Record Transformation (refactor-data-transformer) | Data Pipelines | 61.0 |
| Composable Sales Report Generation (refactor-report-generator) | Duplication | 61.3 |
| Transaction Aggregation Without Mutable Accumulators (refactor-global-state) | State Isolation | 61.6 |
| Composable Department Pipeline (refactor-promise-chain) | Data Pipelines | 61.8 |
| Declarative Validation Rules (refactor-validation-rules) | Duplication | 62.2 |
| Shared Quarter Ranking Logic (refactor-duplicate-code) | Duplication | 63.2 |
| Named Conversion Constants (refactor-magic-numbers) | Modernization | 70.6 |
| Student Classification Pipeline (refactor-array-manipulation) | Data Pipelines | 71.4 |
| Order Processing Orchestration (refactor-god-function) | Modernization | 72.1 |
| Operation Map Calculator (refactor-calculator) | Control Flow | 86.7 |
| Guard Clauses for Nested Categorization (refactor-nested-conditionals) | Control Flow | 87.4 |
| Template Literal Contact Card (refactor-string-concat) | Modernization | 96.6 |
| Lookup Table for Status Text (refactor-switch-to-map) | Control Flow | 96.6 |

15 tasks · Sorted by score (lowest first) · Intent = structural refactor compliance
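
The page does not state how the cluster and overall scores are derived. The figures are consistent with a plain unweighted mean: each cluster score matches the average of its three task scores, and the overall score matches the average of all 15 task scores, rounded to one decimal place. The sketch below checks that reading against the published numbers. The averaging rule is an inference, not a documented BridgeBench formula, and the names `TASK_SCORES`, `cluster_means`, and `overall_mean` are hypothetical.

```python
from collections import defaultdict
from statistics import mean

# (task slug, cluster, score) copied from the task results on this page.
TASK_SCORES = [
    ("refactor-class-to-functions", "State Isolation", 0.0),
    ("refactor-event-handler", "State Isolation", 35.3),
    ("refactor-data-transformer", "Data Pipelines", 61.0),
    ("refactor-report-generator", "Duplication", 61.3),
    ("refactor-global-state", "State Isolation", 61.6),
    ("refactor-promise-chain", "Data Pipelines", 61.8),
    ("refactor-validation-rules", "Duplication", 62.2),
    ("refactor-duplicate-code", "Duplication", 63.2),
    ("refactor-magic-numbers", "Modernization", 70.6),
    ("refactor-array-manipulation", "Data Pipelines", 71.4),
    ("refactor-god-function", "Modernization", 72.1),
    ("refactor-calculator", "Control Flow", 86.7),
    ("refactor-nested-conditionals", "Control Flow", 87.4),
    ("refactor-string-concat", "Modernization", 96.6),
    ("refactor-switch-to-map", "Control Flow", 96.6),
]


def cluster_means(tasks):
    """Group task scores by cluster and return each cluster's mean, rounded to 0.1.

    Assumption: clusters are scored as an unweighted mean of their tasks.
    """
    by_cluster = defaultdict(list)
    for _slug, cluster, score in tasks:
        by_cluster[cluster].append(score)
    return {cluster: round(mean(scores), 1) for cluster, scores in by_cluster.items()}


def overall_mean(tasks):
    """Return the unweighted mean of all task scores, rounded to 0.1 (assumed rule)."""
    return round(mean(score for _slug, _cluster, score in tasks), 1)


if __name__ == "__main__":
    for cluster, value in sorted(cluster_means(TASK_SCORES).items()):
        print(f"{cluster}: {value}")
    print(f"Overall: {overall_mean(TASK_SCORES)}")
```

Under this reading, the single 0.0 on refactor-class-to-functions accounts for most of the gap between State Isolation (32.3) and the other clusters.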