Views

Note

View functions are currently in beta and are exported from the inspect_ai.view.beta module. The beta module will be preserved after final release so that code written against it now will continue to work after the beta.

Inspect Viz Views are pre-built plots that work with data created by the Inspect log data frame reading functions.

View Description
scores_by_task() Bar plot for comparing eval scores (with confidence intervals) across models and tasks.
scores_by_factor() Bar bar plot for comparing eval scores by model and a boolean factor (e.g. no hint vs. hint).
scores_by_limit() Line plot showing success rate by token limit.
scores_timeline() Scatter plot with eval scores by model, organization, and release date. Filterable by evaluation and organization.
scores_heatmap() Heatmap with values for comparing scores across model and task.
scores_by_model() Bar plot for comparing model scores on a single eval.
tool_calls() Heat map visualising tool calls over evaluation turns.