podbench

<- back to dashboard

run_2y6hsbeo3h

2d ago
failedhardMerge duplicate customer records(dedup-customers)
verifier: verifier: one or more checks failed
reward
0.700
failed
model
claude-sonnet-4-6
9 steps
cost
$0.053
42.6k tokens
latency
19.7s
0 rate-limit retries

token accounting

input (uncached)4.0k
cache write4.2k
billed 1.25x input
cache read33.4k
billed 0.1x input
output1.0k
cache hit rate89.3%
cost$0.053

scheduling

podpodbench-worker-e1872
queueredis
started2026-06-14 19:40:57
finished2026-06-14 19:41:17
steps9
retries0
No step-level trajectory recorded for this run. Trajectories are kept for runs executed through the live runner and the worker.