vals/financeagent

Tags: Professional, Finance, Assistants

Vals AI Finance Agent Benchmark: expert-validated finance questions spanning nine task categories (including retrieval, market research, and projections), with EDGAR/SEC search tools, for evaluating financial agents.

Run this task

CLI:

inspect eval inspect_harbor/vals_financeagent --model openai/gpt-5

Python:

from inspect_ai import eval
from inspect_harbor import vals_financeagent

eval(vals_financeagent(), model="openai/gpt-5")
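Before a full run over every sample, a short smoke test can confirm that the model credentials and the task load correctly. The sketch below is one way to do that, assuming the `limit` argument of `inspect_ai.eval` to cap the number of samples. The helper names `smoke_test_kwargs` and `run_smoke_test` are hypothetical and not part of `inspect_harbor`, and the default of 5 samples is an arbitrary choice.

```python
# Hypothetical helper (not part of inspect_harbor): collects the keyword
# arguments for a short smoke-test run.
def smoke_test_kwargs(model: str = "openai/gpt-5", limit: int = 5) -> dict:
    if limit < 1:
        raise ValueError("limit must be at least 1")
    # `limit` caps how many samples inspect_ai.eval evaluates.
    return {"model": model, "limit": limit}


# Hypothetical wrapper: imports are deferred so the helper above can be
# used and checked without the evaluation packages installed.
def run_smoke_test(model: str = "openai/gpt-5", limit: int = 5):
    from inspect_ai import eval
    from inspect_harbor import vals_financeagent

    return eval(vals_financeagent(), **smoke_test_kwargs(model, limit))
```

Calling `run_smoke_test()` evaluates only the first few samples. The CLI equivalent is to append `--limit 5` to the `inspect eval` command.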

Dataset information

Harbor registry: vals/financeagent
Inspect task: vals_financeagent
Latest digest: sha256:d4bcf3a28a28132ff10849a9e8721c20825a12e7f8185edb8420775e86617b7f
Samples: 50
Paper: arxiv
Source: https://github.com/vals-ai/finance-agent

See Task Parameters for the parameter set shared across all Harbor tasks.