xlang/ds-1000
Coding
DS-1000: data-science code-generation problems from StackOverflow across NumPy, Pandas, TensorFlow, PyTorch, SciPy, Scikit-learn, and Matplotlib, with execution-based grading.
Run this task
CLI:
inspect eval inspect_harbor/xlang_ds_1000 --model openai/gpt-5Python:
from inspect_ai import eval
from inspect_harbor import xlang_ds_1000
eval(xlang_ds_1000(), model="openai/gpt-5")Dataset information
| Harbor registry | xlang/ds-1000 |
| Inspect task | xlang_ds_1000 |
| Latest digest | sha256:082656a3ab63bbfeb92cf6c2408fba2bfaa78067026369e484855fbcd9805de2 |
| Samples | 1000 |
| Paper | arxiv |
| Source | https://github.com/xlang-ai/DS-1000 |
See Task Parameters for the parameter set shared across all Harbor tasks.