Defaults, Inheritance and Overrides

Inspect Flow provides powerful mechanisms to avoid repetition and share configuration across evaluations. This page covers:

FlowDefaults - Setting default values within a spec that cascade across tasks, models, solvers, and agents
Config Inheritance - Reusing and composing configurations across multiple files
CLI Overrides - Overriding any config value at runtime

FlowDefaults

FlowDefaults supports multiple levels of default configuration:

config.py

from inspect_ai.model import GenerateConfig
from inspect_flow import (
    FlowAgent,
    FlowDefaults,
    FlowModel,
    FlowSolver,
    FlowSpec,
    FlowTask,
)

FlowSpec(
    defaults=FlowDefaults(
        config=GenerateConfig(
            max_connections=10,
        ),
        model=FlowModel(
            model_args={"arg": "foo"},
        ),
        model_prefix={
            "openai/": FlowModel(
                config=GenerateConfig(
                    max_connections=20
                ),
            ),
        },
        solver=FlowSolver(name="generate"),
        solver_prefix={"chain_of_thought": FlowSolver(name="chain_of_thought")},
        agent=FlowAgent(name="basic"),
        agent_prefix={"inspect/": FlowAgent(name="inspect/basic")},
        task=FlowTask(model="openai/gpt-4o"),
        task_prefix={"inspect_evals/": FlowTask(model="openai/gpt-4o-mini")},
    ),
    tasks=[
        FlowTask(
            name="inspect_evals/gpqa_diamond",
            model="openai/gpt-4o",
        )
    ],
)

1: Default model generation options. Will be overridden by settings on FlowTask and FlowModel.
2: Field defaults for models.
3: Model defaults for model name prefixes. Overrides FlowDefaults.config and FlowDefaults.model. If multiple prefixes match, longest prefix wins.
4: Field defaults for solvers.
5: Solver defaults for solver name prefixes. Overrides FlowDefaults.solver. If multiple prefixes match, longest prefix wins.
6: Field defaults for agents.
7: Agent defaults for agent name prefixes. Overrides FlowDefaults.agent. If multiple prefixes match, longest prefix wins.
8: Field defaults for tasks.
9: Task defaults for task name prefixes. Overrides FlowDefaults.config and FlowDefaults.task. If multiple prefixes match, longest prefix wins.

Merge Priority

Defaults follow a hierarchy where more specific settings override less specific ones:

For GenerateConfig:

Global config defaults (defaults.config)
Global model config defaults (defaults.model.config)
Model prefix config defaults (defaults.model_prefix.config)
Task-specific config (task.config)
Model-specific config (model.config) — highest priority

Example hierarchy in action:

config.py

from inspect_ai.model import GenerateConfig
from inspect_flow import FlowDefaults, FlowModel, FlowSpec, FlowTask

FlowSpec(
    defaults=FlowDefaults(
        config=GenerateConfig(
            temperature=0.0,
            max_tokens=100,
        ),
        model_prefix={
            "openai/": FlowModel(
                config=GenerateConfig(temperature=0.5)
            )
        },
    ),
    tasks=[
        FlowTask(
            name="task",
            config=GenerateConfig(temperature=0.7),
            model=FlowModel(
                name="openai/gpt-4o",
                config=GenerateConfig(temperature=1.0),
            ),
        )
    ],
)

1: Global defaults: temperature=0.0, max_tokens=100
2: Prefix defaults override: temperature=0.5 (for OpenAI models)
3: Task config overrides: temperature=0.7
4: Model config wins: temperature=1.0, max_tokens=100

Final result: temperature=1.0 (most specific), max_tokens=100 (from global defaults)

None Values and Merge Behavior

For fields in Flow types: Most fields use a special “not given” default, which means None is a meaningful value that does override:

config.py

from inspect_flow import FlowDefaults, FlowModel, FlowSpec, FlowTask

FlowSpec(
    defaults=FlowDefaults(
        model=FlowModel(name="openai/gpt-4o"),
    ),
    tasks=[
        FlowTask(
            name="inspect_evals/gpqa_diamond",
            model=None,  # Explicitly set to None
        )
    ],
)
# Result: Task uses model=None (overrides the default "openai/gpt-4o")

To avoid overriding a FlowTask field, omit it entirely rather than setting it to None.

For GenerateConfig and other Inspect types: Setting a field to None means “not specified” — it won’t override existing values from defaults:

config.py

from inspect_ai.model import GenerateConfig
from inspect_flow import FlowDefaults, FlowSpec, FlowTask

FlowSpec(
    defaults=FlowDefaults(config=GenerateConfig(temperature=0.8, max_tokens=1000)),
    tasks=[
        FlowTask(
            name="inspect_evals/gpqa_diamond",
            model="openai/gpt-4o",
            config=GenerateConfig(temperature=0.5, max_tokens=None),
        )
    ],
)
# Result: Task runs with temperature=0.5, max_tokens=1000
# The max_tokens=None didn't override the default 1000

Validating the resolved config: You can always review and validate the final resolved configuration by running flow run config.py --dry-run or flow config config.py which will print the resolved config in YAML format.

Inheritance

Inspect Flow supports configuration inheritance to share settings across multiple config files. This is particularly useful for defining global defaults at a repository level that apply to all evaluations.

Automatic Discovery

Inspect Flow automatically discovers and includes files named _flow.py in parent directories. Starting from your config file’s location, it searches upward through the directory tree for _flow.py files and automatically merges them.

This allows you to define shared defaults (model settings, dependencies, etc.) at a repository root that apply to all configs in subdirectories without explicit includes.

Note

When using run() from the Python API to run the FlowSpec directly instead of the command line, the base_dir argument is used as the starting point for searching upward through the directory tree, rather than the config file’s location.

Includes

Use the includes field to explicitly merge other config files into your spec:

FlowSpec(
    includes=["../foo/other_config.py"],
    log_dir="logs",
    tasks=["my_task"]
)

Merge

Included configs are merged recursively, with the current config’s values taking precedence over included values:

Dictionaries: Fields are merged deeply (recursive merge)
Lists: Items are concatenated with duplicates removed
Scalars: Current config values override included values

Order

Includes are processed sequentially in the order they appear. Each included file is loaded, its includes are recursively processed, and then merged into the current config without overwriting existing values.

Priority order (highest to lowest):

config.py

FlowSpec(
    includes=["defaults.py", "shared.py", "path.py"],
    ...
)

Main config file (config.py)
defaults.py
Files included by defaults.py
shared.py
Files included by shared.py
path.py
Files included by path.py

Recursion

Included files can themselves have includes fields, which are expanded recursively. This allows you to build hierarchies of configuration.

Path Resolution

Relative paths will be resolved relative to the config file (when using the CLI) or base_dir arg (when using the API):

config.py

from inspect_flow import FlowSpec, FlowTask

FlowSpec(
    includes=[
        "defaults.py",
        "../shared.py",
        "/absolute/path.py",
    ],
    log_dir="logs",
    tasks=[
        FlowTask(
            name="inspect_evals/gpqa_diamond",
            model="openai/gpt-4o",
        ),
    ],
)

Note

Automatic discovery does not look for _flow.py files in the parent directories of explicitly included files.

Inheritance with FlowDefaults

Config inheritance is especially powerful when combined with FlowDefaults. You can define global defaults in a _flow.py file at your repository root:

_flow.py

from inspect_ai.model import GenerateConfig
from inspect_flow import FlowDefaults, FlowSpec

FlowSpec(
    defaults=FlowDefaults(
        config=GenerateConfig(
            max_connections=10,
            temperature=0.0,
        ),
    ),
)

Then all configs in subdirectories automatically inherit these defaults, which can be selectively overridden:

experiments/config.py

from inspect_ai.model import GenerateConfig
from inspect_flow import FlowSpec, FlowTask

FlowSpec(
    tasks=[
        FlowTask(
            name="inspect_evals/gpqa_diamond",
            model="openai/gpt-4o",
            # Override just temperature
            config=GenerateConfig(temperature=0.7),
        ),
    ],
)
# Inherits max_connections=10 from _flow.py
# Overrides temperature=0.7 for this specific task

Preventing Overrides

When using inheritance to define shared defaults, you can enforce critical settings by preventing including configs from overriding them. See Lock Configuration Fields in the Advanced section to learn how to lock inherited or default values and prevent them from being overwritten.

CLI Overrides

Override config values at runtime using the --set flag:

flow run config.py --set log_dir=./logs

flow run config.py --set options.limit=10
flow run config.py --set defaults.solver.args.tool_calls=none

flow run config.py --set 'options.metadata={"experiment": "baseline", "version": "v1"}'

flow run config.py \
  --set log_dir=./logs/experiment1 \
  --set options.limit=100 \
  --set defaults.config.temperature=0.5

Override Behavior

Strings: Replace existing values
Dicts: Replace existing values
Lists:
- String values append to existing list
- Lists replace existing list

Examples:

# Appends to list
--set dependencies=new_package

# Replaces list
--set 'dependencies=["pkg1", "pkg2"]'

Env Vars

Set config values via environment variables:

export INSPECT_FLOW_LOG_DIR=./logs/custom
export INSPECT_FLOW_LIMIT=50
export INSPECT_FLOW_SET="options.metadata={\"key\": \"value\"}"
export INSPECT_FLOW_ARG="task_min_priority=2"
flow run config.py

Supported environment variables:

Variable	Equivalent Flag	Description
`INSPECT_FLOW_LOG_DIR`	`--log-dir`	Override log directory
`INSPECT_FLOW_LOG_DIR_CREATE_UNIQUE`	`--log-dir-create-unique`	Create new log directory with numeric suffix if exists
`INSPECT_FLOW_LOG_LEVEL`	`--log-level`	Inspect Flow log level. Defaults to Info. Use `options.log_level` to set the Inspect AI log level.
`INSPECT_FLOW_LIMIT`	`--limit`	Limit number of samples
`INSPECT_FLOW_SET`	`--set`	Set config overrides (can be specified multiple times)
`INSPECT_FLOW_ARG`	`--arg`	Args to pass to spec functions in the config file (can be multiple)
`INSPECT_FLOW_NO_VENV`	`--no-venv`	Do not create a virtual environment to run the Flow spec
`INSPECT_FLOW_DRY_RUN`	`--dry-run`	Create virtual environment and print resolved YAML configuration

Override Priority

Setting defaults via the command line will override the defaults which in turn might be overridden by anything set explicitly.

Runtime-only flags: INSPECT_FLOW_LOG_LEVEL, INSPECT_FLOW_ARG, INSPECT_FLOW_NO_VENV, INSPECT_FLOW_DRY_RUN, and INSPECT_FLOW_SET (and their corresponding CLI flags --log-level, --arg, --no-venv, --dry-run, --set) are runtime settings for the flow run command and cannot be set in FlowSpec.

Settings with multiple override levels:

Priority order for log-dir, log-dir-create-unique and limit:

FlowSpec defaults
Explicit setting on the FlowSpec
INSPECT_FLOW_SET_ environment variables
CLI --set flags
INSPECT_FLOW_LOG_DIR, INSPECT_FLOW_LOG_DIR_CREATE_UNIQUE and INSPECT_FLOW_LIMIT environment variables
Explicit --log-dir, --log-dir-create-unique and --limit CLI flags

Priority order for all other settings:

FlowSpec defaults
Explicit setting on the FlowSpec
INSPECT_FLOW_SET_ environment variables
CLI --set flags

Debugging

To see the fully resolved configuration with all defaults, inheritance, and overrides applied:

flow run config.py --dry-run

This shows exactly what settings each task will use after applying all defaults and overrides.