vals/financeagent
Professional
Finance
Assistants
Vals AI Finance Agent Benchmark: expert-validated finance questions across nine task categories (retrieval, market research, projections) with EDGAR/SEC search tools for evaluating financial agents.
Run this task
CLI:
inspect eval inspect_harbor/vals_financeagent --model openai/gpt-5Python:
from inspect_ai import eval
from inspect_harbor import vals_financeagent
eval(vals_financeagent(), model="openai/gpt-5")Dataset information
| Harbor registry | vals/financeagent |
| Inspect task | vals_financeagent |
| Latest digest | sha256:d4bcf3a28a28132ff10849a9e8721c20825a12e7f8185edb8420775e86617b7f |
| Samples | 50 |
| Paper | arxiv |
| Source | https://github.com/vals-ai/finance-agent |
See Task Parameters for the parameter set shared across all Harbor tasks.