lawbench/lawbench

Professional
Law
Knowledge

LawBench: tasks evaluating LLMs on Chinese-law knowledge — legal entity recognition, reading comprehension, criminal-damage calculation, legal consulting — plus an abstention-rate metric.

← Back to Registry

Run this task

CLI:

inspect eval inspect_harbor/lawbench --model openai/gpt-5

Python:

from inspect_ai import eval
from inspect_harbor import lawbench

eval(lawbench(), model="openai/gpt-5")

Dataset information

Harbor registry lawbench/lawbench
Inspect task lawbench
Latest digest sha256:99d2f97d515a3820a657745112aa01a8e2b2e8bf7602d78d3ef53b4bc1c64636
Samples 1000
Paper arxiv
Source https://github.com/open-compass/LawBench

See Task Parameters for the parameter set shared across all Harbor tasks.