Inspect SWE

0.2.63 (10 June 2026)

Remove ACP patch for connection initialization order issue (resolved in ACP 0.10.1).
Codex CLI: Resolve “stable”/“latest” via the GitHub releases/latest endpoint — the full releases listing for openai/codex frequently 504s.

0.2.62 (05 June 2026)

Claude Code: Present the real served model as the agent’s own identity (e.g. its “You are powered by the model …” system prompt) instead of the bridge sentinel. Added a model_config parameter to override the presented identity.
Claude Code: opus_model / sonnet_model / haiku_model / subagent_model are now actually honored — previously the bridge fallback collapsed them onto a single model.
Tracing: Only trace full session output when debug option is passed.

0.2.61 (03 June 2026)

Claude Code: Fix system prompt being re-sent on resumed turns.
Codex CLI: Align OpenAIAPI-derived custom providers by their declared service_model_name().
Codex CLI: Fall back to a bundled model catalog when the version-matched models.json can’t be fetched.

0.2.60 (02 June 2026)

Update Inspect AI dependency to 0.3.234

0.2.59 (31 May 2026)

Codex CLI: Detect and apply agent spans automatically from Codex event stream.

0.2.57 (30 May 2026)

Codex CLI: Align the system prompt and tool set with the real bridged model. model_config now defaults to None, deriving Codex’s --model slug from the actual model. Pass an explicit model_config to override.
Codex CLI: Add options for enabling/disabling web_search and goals tools (both default to enabled).
Codex CLI: Improve MCP tests to check for correct tool names and assert results.
Gemini CLI: Explicity set auth type to “gemini-api-key” (required for recent versions of the CLI).

0.2.56 (25 May 2026)

OpenCode: Install ripgrep for access to native skills.
Claude Code: Improved live event capture (no longer requires injection of spans into transcript).
Claude Code: Support for importing transcripts with new JSONL sub-agent file layout.
Claude Code (ACP): Use officially supported ACP client (@agentclientprotocol/claude-agent-acp)

0.2.55 (16 May 2026)

Workaround for agent_client_protocol v0.10.0 connection bug.

0.2.54 (13 May 2026)

Claude Code (ACP): Allocate per-invocation bridge port.

0.2.53 (13 May 2026)

Pin agent-client-protocol>=0.9.0,<0.10 until init ordering bug is fixed.

0.2.52 (09 May 2026)

OpenCode: New agent backend wrapping OpenCode.
Codex CLI: Disable telemetry by default.
Gemini CLI: Disable telemetry by default.

0.2.51 (07 May 2026)

Gemini CLI: Fix MCP registration via GEMINI_CLI_TRUST_WORKSPACE.
Mini SWE Agent: Ensure that pip is available before attempting installation.

0.2.50 (29 April 2026)

Codex CLI: Run ACP mode with approval_policy: never and sandbox_mode: danger_full_access
Codex CLI: Fix for MCP tool calling (bump to Inspect v0.3.214 which has the fix).

0.2.48 (26 April 2026)

Update download location for Claude Code binaries.

0.2.47 (13 April 2026)

Codex CLI: Set 60 minute timeout for OpenAI streaming requests (default was 5 minutes).
Gemini CLI: Use gemini-3.1-pro-preview as model for tests.
Claude Code: Always use --resume for re-attaching to sessions (--continue sometimes drops history).

0.2.46 (26 March 2026)

Update to latest ACP types.

0.2.45 (20 March 2026)

ACP protocol improvements.

0.2.44 (18 March 2026)

Claude Code: Retry uncaught errors (unexpected crashes of scaffold) up to 3 times by default.
Claude Code: Retry refusals up to 3 times by default.

0.2.43 (16 March 2026)

Improve ACP error reporting for adapter failures.

0.2.42 (09 March 2026)

Claude Code: Improved detection of final agent span messages.
Claude Code: Capture stdout and stderr to store in debug mode.

0.2.41 (04 March 2026)

Claude Code: Annotate event stream with agent spans.
Support for sub-agents defined using agent teams.
Attach agent_span_id to tool events for agent spawning tools.
Pass sandbox argument to sandbox_agent_bridge().

0.2.40 (02 March 2026)

Claude Code: Revert streaming events while we refine our approach.
Use normal heading/body for task tool views.

0.2.39 (01 March 2026)

Improvements to the structure and content of claude code tool events.

0.2.38 (28 February 2026)

Mini SWE Agent: New mini_swe_agent() agent implementation.
Agent Binaries: Make only a single request for the latest agent binary versions (vs. a request per sample).
Model aliases: Enable passing model_aliases to all agents (passed through to sandbox_agent_bridge()).

0.2.37 (24 February 2026)

Claude Code: Capture stream-json output to create agent spans within transcript.

0.2.36 (23 February 2026)

Gemini CLI: New gemini_cli() agent for Google Gemini.
Sandbox: Use exec_remote() interface for impoved robustness of long running processes.
Claude Code: Remove retry_timeouts options (not longer necessary due to use of exec_remote()).
Claude Code: Add debug option to enable --debug and --verbose CLI flags.
Claude Code: Pre-seed auth token config to fix silent auth failure in sandbox.

0.2.34 (10 February 2026)

Agent execution: Redirect stdin using exec 0</dev/null;
Agent execution: Print returncode for failed processes.

0.2.33 (10 February 2026)

Claude Code: Look for request timeout errors in stdout as well as stderr.
Claude Code: Restore default “auto” version behavior (download latest stable version).

0.2.32 (31 January 2026)

Claude Code: Set “auto” version to 2.1.3 (which avoids a not yet fixed compaction bug with gateways).
Claude Code: Add retry_timeouts option for retrying “Request timed out” errors a configurable number of times.
Claude Code: Add CLAUDE_CODE_DISABLE_EXPERIMENTAL_BETAS environment variable.

0.2.31 (03 January 2026)

Centaur Mode for running Claude Code and Codex CLI alongside the Inspect Human Agent.
Update Inspect dependency to 0.3.159 (required for human agent integration).

0.2.30 (24 December 2025)

Agent Skills: Support for adding skills to Claude Code and Codex CLI agents.
Update Inspect dependency to 0.3.158 (required for skills implementation).

0.2.29 (22 December 2025)

Claude Code: Enable explicit specification of opus_model, sonnet_model, haiku_model, and subagent_model.
Update Inspect dependency to 0.3.157 (required for changes to codex cli web search).

0.2.28 (05 December 2025)

Codex CLI and Claude Code: Added bridged_tools parameter for exposing host-side Inspect tools to sandboxed agents via MCP protocol.
Codex CLI: Use GPT 5.1 system prompt by default (includes guidance on using the update_plan() tool which is excluded from gpt-5-codex system prompt).
Codex CLI: Add config_overrides parameter for overriding arbitrary config values.
Update Inspect dependency to 0.3.152 (required for bridged tools).

0.2.27 (27 November 2025)

Codex CLI: Use RUST_LOG=warning (rather than debug) to reduce amount of output produced by exec().
Codex CLI: Enable setting a custom home_dir (override default of ~).
Handle multiple run() calls to the same agent (resume session and send only new messages in prompt).

0.2.26 (15 November 2025)

Copy agent binary executables to /var/tmp/ (more liberal default permissions)
Agent binary chmod+x using “root” user for scenarios where they agent isn’t root.

0.2.25 (07 November 2025)

Codex CLI: Execute codex using the user passed to codex_cli().

0.2.24 (01 November 2025)

Claude Code: Pass model through to agent bridge to accomodate scenarios where a non-Inspect model is specified.
Claude Code: Remove small_model setting as it conflicts with aforementioned fix to non-Inspect model.

0.2.23 (27 October 2025)

Codex: Eliminate use of --include-plan-tool option (no longer available as plan mode is now always on).

0.2.22 (16 October 2025)

Codex CLI now uses the latest version of codex, which includes the apply_patch tool.
Update inspect_ai requirement to >= 0.3.138.

0.2.20 (07 October 2025)

Codex CLI now uses 0.44.0 as its default version (since later versions include the apply_patch tool which relies on “custom” tool types not currently supported by Inspect).

0.2.19 (05 October 2025)

Automatically use a new port for each unique agent bridge invocation within a sample.
Added cached_agent_binaries() function to list previously downloaded and cached agent binaries.

0.2.18 (23 September 2025)

Update for Claude Code 2.0 (don’t call config list after installation as it has been removed).
Update inspect_ai requirement to >= 0.3.135.

0.2.17 (23 September 2025)

Update inspect_ai requirement to >=0.3.134.

0.2.16 (23 September 2025)

Add support for the update_plan() tool for Codex CLI.

0.2.15 (23 September 2025)

Use gpt-5-codex as the default model config for Codex CLI (e.g. results in use of the gpt-5-codex specific system instructions).

0.2.14 (22 September 2025)

Support multiple attempts for Codex CLI via codex exec <...> resume --last (requires Codex v0.36.0 or later).
Add retry_refusals option to set a configurable number of retries for requests refused due to content filtering.
Update inspect_ai requirement to >=0.3.133.

0.2.13 (12 September 2025)

Update inspect_ai requirement to >=0.3.132.

0.2.12 (08 September 2025)

Close stdin when running agent binaries (needed for k8s provider to work properly)

0.2.11 (06 September 2025)

Codex CLI: New codex_cli() agent for OpenAI Codex.
Added filter parameter to agents for intercepting model generations.

0.2.10 (03 September 2025)

Add trace logging for claude code debug/verbose output.

v0.2.8 (02 September 2025)

Claude Code: allowed_tools and disallowed_tools options.

v0.2.6 (01 September 2025)

Claude Code: Add support for multiple agent attempts.

v0.2.5 (01 September 2025)

Initial release.