About SenseBench
What SenseBench measures and why the leaderboard exists.
Purpose
SenseBench evaluates LLMs on English word sense disambiguation and keeps every run as auditable artifacts.
Each model receives a target word in context and candidate WordNet senses, then returns the chosen sense index.
Design
The leaderboard is static, reproducible, and rebuilt from verified submissions.
Scores are recomputed from predictions and checked against the registered dataset and prompt.
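The check described above could look roughly like the sketch below. It assumes, purely for illustration, that the registered dataset and prompt are identified by SHA-256 digests, that a submission is a dictionary with `dataset_sha256`, `prompt`, and `predictions` keys, and that the score is plain accuracy. None of these names or choices come from SenseBench itself.

```python
import hashlib


def sha256_text(text: str) -> str:
    """Hex SHA-256 digest of a UTF-8 string."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def verify_and_score(submission: dict, registry: dict, gold: dict) -> float:
    """Validate a submission against registered hashes, then recompute accuracy.

    All keys used here are hypothetical. Any score the submitter reports is
    ignored; the result is derived from the predictions alone.
    """
    if submission["dataset_sha256"] != registry["dataset_sha256"]:
        raise ValueError("submission was run on an unregistered dataset")
    if sha256_text(submission["prompt"]) != registry["prompt_sha256"]:
        raise ValueError("submission used an unregistered prompt")

    predictions = submission["predictions"]
    # Require exactly one prediction per gold item, with no extras.
    if set(predictions) != set(gold):
        raise ValueError("prediction ids do not match the dataset ids")

    correct = sum(predictions[item_id] == gold[item_id] for item_id in gold)
    return correct / len(gold)


prompt = "Answer with the index of the best sense."
registry = {
    "dataset_sha256": sha256_text("example dataset contents"),
    "prompt_sha256": sha256_text(prompt),
}
gold = {"a": 0, "b": 1, "c": 2, "d": 0}
submission = {
    "dataset_sha256": registry["dataset_sha256"],
    "prompt": prompt,
    "predictions": {"a": 0, "b": 1, "c": 2, "d": 1},
}
print(verify_and_score(submission, registry, gold))  # 0.75
```

Because the score depends only on the stored predictions and the gold labels, anyone holding the same artifacts can rerun the check and obtain the same number.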