livecodebench/livecodebench
Coding
LiveCodeBench: contamination-free coding benchmark continuously collected from LeetCode, AtCoder, and Codeforces, supporting code generation, self-repair, execution, and test-output prediction.
Run this task
CLI:
inspect eval inspect_harbor/livecodebench --model openai/gpt-5
Python:
from inspect_ai import eval
from inspect_harbor import livecodebench
eval(livecodebench(), model="openai/gpt-5")
Dataset information
| Harbor registry | livecodebench/livecodebench |
| Inspect task | livecodebench |
| Latest digest | sha256:4dbb3336efa78b7c4d98061d4bdb57af31a2cd52a938b44463e8351ad73b160b |
| Samples | 100 |
| Paper | arxiv |
| Source | https://github.com/LiveCodeBench/LiveCodeBench |
See Task Parameters for the parameter set shared across all Harbor tasks.