openai/mmmlu

Knowledge
Reasoning

MMMLU (Multilingual MMLU): OpenAI’s professional human translation of the MMLU test set into 14 languages, used to evaluate multilingual knowledge and reasoning.

Run this task

CLI:

inspect eval inspect_harbor/openai_mmmlu --model openai/gpt-5

Python:

from inspect_ai import eval
from inspect_harbor import openai_mmmlu

eval(openai_mmmlu(), model="openai/gpt-5")
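For a quick smoke test it can help to run only a few samples before committing to the full set. The sketch below builds the same CLI invocation as above with Inspect's `--limit` option appended. The helper name `build_eval_command` is hypothetical (not part of `inspect_ai` or `inspect_harbor`), and the limit of 10 is an arbitrary choice for illustration.

```python
import shlex
from typing import List, Optional

# Task path taken from the CLI example above.
TASK = "inspect_harbor/openai_mmmlu"


def build_eval_command(model: str, limit: Optional[int] = None) -> List[str]:
    """Return the argv list for an `inspect eval` run of this task.

    Hypothetical helper for illustration; it only assembles the command
    and does not run anything.
    """
    cmd = ["inspect", "eval", TASK, "--model", model]
    if limit is not None:
        if limit < 1:
            raise ValueError("limit must be a positive integer")
        # --limit restricts the run to the first N samples.
        cmd += ["--limit", str(limit)]
    return cmd


command = build_eval_command("openai/gpt-5", limit=10)
print(shlex.join(command))
```

The resulting list can be passed to `subprocess.run(command, check=True)` in an environment where the `inspect` CLI and `inspect_harbor` are installed.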

Dataset information

Harbor registry: openai/mmmlu
Inspect task: openai_mmmlu
Latest digest: sha256:5db8efae92fcb2df5fb3c76647394410badcd08cec58a3cdfd3c602f7d9b38d1
Samples: 150
Paper: arXiv
Source: https://huggingface.co/datasets/openai/MMMLU

See Task Parameters for the parameter set shared across all Harbor tasks.