This example illustrates the code behind the scores_by_task() pre-built view function. If you want to include this plot in your notebooks or websites you should start with that function rather than the lower-level code below.
1. Facet the x-axis (i.e. create multiple groups of bars) by task name.
2. Add a channel with links to the Inspect log files (links appear in the tooltip).
3. We don’t need an explicit “model” or “task_name” label as they are obvious from context. We also don’t need ticks because the fill color and legend already identify each model.
4. Ensure that the y-axis shows the full range of scores (by default it ends at the maximum observed score).
Confidence Interval
Here we add a confidence interval for each reported score using a rule_x() mark, which draws a vertical line spanning the interval. The transforms that compute the interval's bounds are derived with the ci_bounds() function.
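To make the arithmetic behind such an interval concrete, here is a minimal sketch in plain Python. It does not call ci_bounds(), and it assumes a two-sided normal-approximation interval computed from a score and its standard error; the library's actual method may differ. The function name normal_ci_bounds and the sample numbers are hypothetical and used only for illustration.

```python
from statistics import NormalDist


def normal_ci_bounds(
    score: float, stderr: float, level: float = 0.95
) -> tuple[float, float]:
    """Return (lower, upper) for a two-sided normal-approximation interval.

    Hypothetical helper for illustration; not part of any library.
    """
    if not 0.0 < level < 1.0:
        raise ValueError("level must be strictly between 0 and 1")
    # Two-sided critical value (roughly 1.96 when level is 0.95).
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    half_width = z * stderr
    return score - half_width, score + half_width


# Illustrative numbers only, not results from any real evaluation.
lower, upper = normal_ci_bounds(score=0.70, stderr=0.05)
print(f"95% interval: [{lower:.3f}, {upper:.3f}]")
```

In the plot, bounds like these would be the two ends of the vertical rule drawn over each bar. A higher level widens the interval, and a standard error of zero collapses it to the score itself.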