About SenseBench

What SenseBench measures and why the leaderboard exists.

Purpose

SenseBench evaluates LLMs on English word sense disambiguation (WSD), using auditable run artifacts.

Each model receives a target word in context and candidate WordNet senses, then returns the chosen sense index.
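
As a rough illustration of that task format, the sketch below builds a prompt for one item and parses a model's reply. It is not SenseBench's actual code: the `WSDItem` fields, the prompt wording, the zero-based indexing, and the example glosses (paraphrases, not quoted WordNet entries) are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class WSDItem:
    """One hypothetical evaluation item (field names are assumptions)."""
    context: str             # sentence containing the target word
    target: str              # the word to disambiguate
    senses: Tuple[str, ...]  # candidate sense glosses
    gold_index: int          # index of the correct sense


def format_prompt(item: WSDItem) -> str:
    """Render the item as a numbered multiple-choice prompt (zero-based, assumed)."""
    lines = [
        f"Sentence: {item.context}",
        f"Target word: {item.target}",
        "Candidate senses:",
    ]
    lines += [f"{i}. {gloss}" for i, gloss in enumerate(item.senses)]
    lines.append("Answer with the index of the correct sense.")
    return "\n".join(lines)


def parse_sense_index(response: str, num_senses: int) -> Optional[int]:
    """Return the chosen index, or None if the reply is not a valid index."""
    text = response.strip()
    if not (text.isascii() and text.isdigit()):
        return None
    index = int(text)
    return index if index < num_senses else None


item = WSDItem(
    context="She sat on the bank of the river.",
    target="bank",
    senses=("sloping land beside a body of water", "a financial institution"),
    gold_index=0,
)
prompt = format_prompt(item)
choice = parse_sense_index(" 0 ", len(item.senses))
is_correct = choice == item.gold_index
```

Rejecting anything that is not a bare in-range integer keeps scoring unambiguous; a real harness might instead choose to extract an index from a longer reply.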

The leaderboard is static, reproducible, and rebuilt from verified submissions.

Scores are recomputed from predictions and checked against the registered dataset and prompt.
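
A minimal sketch of what such a check could look like, under stated assumptions: the submission layout (`dataset_hash`, `prompt_hash`, `predictions`, `reported_accuracy`), the use of SHA-256, and accuracy as the metric are all hypothetical choices for this example, not a description of SenseBench's real pipeline.

```python
import hashlib
from typing import Dict


def sha256_text(text: str) -> str:
    """Hex SHA-256 digest of a UTF-8 string (hash choice is an assumption)."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def verify_submission(
    submission: dict,
    gold: Dict[str, int],
    registered_dataset_hash: str,
    registered_prompt_hash: str,
) -> float:
    """Recompute accuracy from predictions; raise ValueError on any mismatch."""
    if submission["dataset_hash"] != registered_dataset_hash:
        raise ValueError("dataset does not match the registered dataset")
    if submission["prompt_hash"] != registered_prompt_hash:
        raise ValueError("prompt does not match the registered prompt")

    predictions = submission["predictions"]  # item id -> predicted sense index
    if set(predictions) != set(gold):
        raise ValueError("predictions must cover exactly the registered items")

    correct = sum(predictions[item_id] == gold[item_id] for item_id in gold)
    accuracy = correct / len(gold)

    # The reported score is never trusted; it only has to agree with the recomputed one.
    if abs(accuracy - submission["reported_accuracy"]) > 1e-9:
        raise ValueError("reported accuracy differs from the recomputed accuracy")
    return accuracy


# Toy data, invented for this example.
dataset_hash = sha256_text("toy dataset contents")
prompt_hash = sha256_text("toy prompt template")
gold = {"a": 0, "b": 1, "c": 2, "d": 0}
submission = {
    "dataset_hash": dataset_hash,
    "prompt_hash": prompt_hash,
    "predictions": {"a": 0, "b": 1, "c": 0, "d": 0},
    "reported_accuracy": 0.75,
}
accuracy = verify_submission(submission, gold, dataset_hash, prompt_hash)
```

Recomputing the score from raw predictions means a submitted number cannot drift from the data behind it, and comparing hashes ties each run to one fixed dataset and prompt.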