0.2.62 (05 June 2026)
- Claude Code: Present the real served model as the agent’s own identity (e.g. its “You are powered by the model …” system prompt) instead of the bridge sentinel. Added a
model_configparameter to override the presented identity. - Claude Code:
opus_model/sonnet_model/haiku_model/subagent_modelare now actually honored — previously the bridge fallback collapsed them onto a single model. - Tracing: Only trace full session output when
debugoption is passed.
0.2.61 (03 June 2026)
- Claude Code: Fix system prompt being re-sent on resumed turns.
- Codex CLI: Align
OpenAIAPI-derived custom providers by their declaredservice_model_name(). - Codex CLI: Fall back to a bundled model catalog when the version-matched
models.jsoncan’t be fetched.
0.2.60 (02 June 2026)
- Update Inspect AI dependency to 0.3.234
0.2.59 (31 May 2026)
- Codex CLI: Detect and apply agent spans automatically from Codex event stream.
0.2.57 (30 May 2026)
- Codex CLI: Align the system prompt and tool set with the real bridged model.
model_confignow defaults toNone, deriving Codex’s--modelslug from the actual model. Pass an explicitmodel_configto override. - Codex CLI: Add options for enabling/disabling
web_searchandgoalstools (both default to enabled). - Codex CLI: Improve MCP tests to check for correct tool names and assert results.
- Gemini CLI: Explicity set auth type to “gemini-api-key” (required for recent versions of the CLI).
0.2.56 (25 May 2026)
- OpenCode: Install ripgrep for access to native skills.
- Claude Code: Improved live event capture (no longer requires injection of spans into transcript).
- Claude Code: Support for importing transcripts with new JSONL sub-agent file layout.
- Claude Code (ACP): Use officially supported ACP client (@agentclientprotocol/claude-agent-acp)
0.2.55 (16 May 2026)
- Workaround for
agent_client_protocolv0.10.0 connection bug.
0.2.54 (13 May 2026)
- Claude Code (ACP): Allocate per-invocation bridge port.
0.2.53 (13 May 2026)
- Pin
agent-client-protocol>=0.9.0,<0.10until init ordering bug is fixed.
0.2.52 (09 May 2026)
- OpenCode: New agent backend wrapping OpenCode.
- Codex CLI: Disable telemetry by default.
- Gemini CLI: Disable telemetry by default.
0.2.51 (07 May 2026)
- Gemini CLI: Fix MCP registration via GEMINI_CLI_TRUST_WORKSPACE.
- Mini SWE Agent: Ensure that pip is available before attempting installation.
0.2.50 (29 April 2026)
- Codex CLI: Run ACP mode with approval_policy: never and sandbox_mode: danger_full_access
- Codex CLI: Fix for MCP tool calling (bump to Inspect v0.3.214 which has the fix).
0.2.48 (26 April 2026)
- Update download location for Claude Code binaries.
0.2.47 (13 April 2026)
- Codex CLI: Set 60 minute timeout for OpenAI streaming requests (default was 5 minutes).
- Gemini CLI: Use
gemini-3.1-pro-previewas model for tests. - Claude Code: Always use
--resumefor re-attaching to sessions (--continuesometimes drops history).
0.2.46 (26 March 2026)
- Update to latest ACP types.
0.2.45 (20 March 2026)
- ACP protocol improvements.
0.2.44 (18 March 2026)
- Claude Code: Retry uncaught errors (unexpected crashes of scaffold) up to 3 times by default.
- Claude Code: Retry refusals up to 3 times by default.
0.2.43 (16 March 2026)
- Improve ACP error reporting for adapter failures.
0.2.42 (09 March 2026)
- Claude Code: Improved detection of final agent span messages.
- Claude Code: Capture stdout and stderr to store in debug mode.
0.2.41 (04 March 2026)
- Claude Code: Annotate event stream with agent spans.
- Support for sub-agents defined using agent teams.
- Attach agent_span_id to tool events for agent spawning tools.
- Pass
sandboxargument tosandbox_agent_bridge().
0.2.40 (02 March 2026)
- Claude Code: Revert streaming events while we refine our approach.
- Use normal heading/body for task tool views.
0.2.39 (01 March 2026)
- Improvements to the structure and content of claude code tool events.
0.2.38 (28 February 2026)
- Mini SWE Agent: New mini_swe_agent() agent implementation.
- Agent Binaries: Make only a single request for the latest agent binary versions (vs. a request per sample).
- Model aliases: Enable passing
model_aliasesto all agents (passed through tosandbox_agent_bridge()).
0.2.37 (24 February 2026)
- Claude Code: Capture
stream-jsonoutput to create agent spans within transcript.
0.2.36 (23 February 2026)
- Gemini CLI: New gemini_cli() agent for Google Gemini.
- Sandbox: Use
exec_remote()interface for impoved robustness of long running processes. - Claude Code: Remove
retry_timeoutsoptions (not longer necessary due to use ofexec_remote()). - Claude Code: Add
debugoption to enable--debugand--verboseCLI flags. - Claude Code: Pre-seed auth token config to fix silent auth failure in sandbox.
0.2.34 (10 February 2026)
- Agent execution: Redirect stdin using
exec 0</dev/null; - Agent execution: Print returncode for failed processes.
0.2.33 (10 February 2026)
- Claude Code: Look for request timeout errors in stdout as well as stderr.
- Claude Code: Restore default “auto” version behavior (download latest stable version).
0.2.32 (31 January 2026)
- Claude Code: Set “auto” version to 2.1.3 (which avoids a not yet fixed compaction bug with gateways).
- Claude Code: Add
retry_timeoutsoption for retrying “Request timed out” errors a configurable number of times. - Claude Code: Add CLAUDE_CODE_DISABLE_EXPERIMENTAL_BETAS environment variable.
0.2.31 (03 January 2026)
- Centaur Mode for running Claude Code and Codex CLI alongside the Inspect Human Agent.
- Update Inspect dependency to 0.3.159 (required for human agent integration).
0.2.30 (24 December 2025)
- Agent Skills: Support for adding skills to Claude Code and Codex CLI agents.
- Update Inspect dependency to 0.3.158 (required for skills implementation).
0.2.29 (22 December 2025)
- Claude Code: Enable explicit specification of
opus_model,sonnet_model,haiku_model, andsubagent_model. - Update Inspect dependency to 0.3.157 (required for changes to codex cli web search).
0.2.28 (05 December 2025)
- Codex CLI and Claude Code: Added
bridged_toolsparameter for exposing host-side Inspect tools to sandboxed agents via MCP protocol. - Codex CLI: Use GPT 5.1 system prompt by default (includes guidance on using the
update_plan()tool which is excluded from gpt-5-codex system prompt). - Codex CLI: Add
config_overridesparameter for overriding arbitrary config values. - Update Inspect dependency to 0.3.152 (required for bridged tools).
0.2.27 (27 November 2025)
- Codex CLI: Use
RUST_LOG=warning(rather thandebug) to reduce amount of output produced byexec(). - Codex CLI: Enable setting a custom
home_dir(override default of ~). - Handle multiple
run()calls to the same agent (resume session and send only new messages in prompt).
0.2.26 (15 November 2025)
- Copy agent binary executables to /var/tmp/ (more liberal default permissions)
- Agent binary chmod+x using “root” user for scenarios where they agent isn’t root.
0.2.25 (07 November 2025)
- Codex CLI: Execute
codexusing theuserpassed to codex_cli().
0.2.24 (01 November 2025)
- Claude Code: Pass
modelthrough to agent bridge to accomodate scenarios where a non-Inspect model is specified. - Claude Code: Remove
small_modelsetting as it conflicts with aforementioned fix to non-Inspect model.
0.2.23 (27 October 2025)
- Codex: Eliminate use of
--include-plan-tooloption (no longer available as plan mode is now always on).
0.2.22 (16 October 2025)
- Codex CLI now uses the latest version of codex, which includes the
apply_patchtool. - Update
inspect_airequirement to >= 0.3.138.
0.2.20 (07 October 2025)
- Codex CLI now uses 0.44.0 as its default version (since later versions include the
apply_patchtool which relies on “custom” tool types not currently supported by Inspect).
0.2.19 (05 October 2025)
- Automatically use a new port for each unique agent bridge invocation within a sample.
- Added cached_agent_binaries() function to list previously downloaded and cached agent binaries.
0.2.18 (23 September 2025)
- Update for Claude Code 2.0 (don’t call
config listafter installation as it has been removed). - Update
inspect_airequirement to >= 0.3.135.
0.2.17 (23 September 2025)
- Update
inspect_airequirement to >=0.3.134.
0.2.16 (23 September 2025)
- Add support for the
update_plan()tool for Codex CLI.
0.2.15 (23 September 2025)
- Use
gpt-5-codexas the default model config for Codex CLI (e.g. results in use of thegpt-5-codexspecific system instructions).
0.2.14 (22 September 2025)
- Support multiple attempts for Codex CLI via
codex exec <...> resume --last(requires Codex v0.36.0 or later). - Add
retry_refusalsoption to set a configurable number of retries for requests refused due to content filtering. - Update
inspect_airequirement to >=0.3.133.
0.2.13 (12 September 2025)
- Update
inspect_airequirement to >=0.3.132.
0.2.12 (08 September 2025)
- Close stdin when running agent binaries (needed for k8s provider to work properly)
0.2.11 (06 September 2025)
- Codex CLI: New codex_cli() agent for OpenAI Codex.
- Added
filterparameter to agents for intercepting model generations.
0.2.10 (03 September 2025)
- Add trace logging for claude code debug/verbose output.
v0.2.8 (02 September 2025)
- Claude Code:
allowed_toolsanddisallowed_toolsoptions.
v0.2.6 (01 September 2025)
- Claude Code: Add support for multiple agent
attempts.
v0.2.5 (01 September 2025)
Initial release.