termigen/termigen-environments

Coding

TermiGen-Environments: verified Docker environments with executable terminal-agent tasks across 11 categories, generated by an end-to-end multi-agent synthesis pipeline.

← Back to Registry

Run this task

CLI:

inspect eval inspect_harbor/termigen_environments --model openai/gpt-5

Python:

from inspect_ai import eval
from inspect_harbor import termigen_environments

eval(termigen_environments(), model="openai/gpt-5")

Dataset information

Harbor registry	termigen/termigen-environments
Inspect task	`termigen_environments`
Latest digest	sha256:492c3b4c051b304b3887ca4a94a3081094c177b1227f0a609123da236359d5f0
Samples	3566
Paper	arxiv
Source	https://github.com/ucsb-mlsec/terminal-bench-env

See Task Parameters for the parameter set shared across all Harbor tasks.