xlang/ds-1000

Coding

DS-1000: data-science code-generation problems from StackOverflow across NumPy, Pandas, TensorFlow, PyTorch, SciPy, Scikit-learn, and Matplotlib, with execution-based grading.

← Back to Registry

Run this task

CLI:

inspect eval inspect_harbor/xlang_ds_1000 --model openai/gpt-5

Python:

from inspect_ai import eval
from inspect_harbor import xlang_ds_1000

eval(xlang_ds_1000(), model="openai/gpt-5")

Dataset information

Harbor registry xlang/ds-1000
Inspect task xlang_ds_1000
Latest digest sha256:082656a3ab63bbfeb92cf6c2408fba2bfaa78067026369e484855fbcd9805de2
Samples 1000
Paper arxiv
Source https://github.com/xlang-ai/DS-1000

See Task Parameters for the parameter set shared across all Harbor tasks.