reasoning-gym-easy@parity
Reasoning
Reasoning Gym benchmark (easy difficulty).
Run this task
CLI:
inspect eval inspect_harbor/reasoning_gym_easy_parity --model openai/gpt-5Python:
from inspect_ai import eval
from inspect_harbor import reasoning_gym_easy_parity
eval(reasoning_gym_easy_parity(), model="openai/gpt-5")Dataset information
| Harbor registry | reasoning-gym-easy@parity |
| Inspect task | reasoning_gym_easy_parity |
| Version | parity |
| Samples | 288 |
| Paper | arxiv |
See Task Parameters for the parameter set shared across all Harbor tasks.