Skip to content

package: mean_score over full benchmark (not attempted)#48

Merged
john-b-yang merged 1 commit into
mainfrom
align-package-mean
Jun 22, 2026
Merged

package: mean_score over full benchmark (not attempted)#48
john-b-yang merged 1 commit into
mainfrom
align-package-mean

Conversation

@john-b-yang

Copy link
Copy Markdown
Contributor

Aligns the package console summary with the leaderboard: aggregate() now divides mean_score by the full benchmark size (an unattempted task counts as 0), consistent with resolved%/near%. Previously the summary divided by attempted, overstating a partial submission's mean.

Matches the leaderboard, where unattempted tasks count as 0; keeps the console summary
from overstating a partial submission's mean.
@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jun 22, 2026
@john-b-yang john-b-yang merged commit 8b06613 into main Jun 22, 2026
5 checks passed
@john-b-yang john-b-yang deleted the align-package-mean branch June 22, 2026 20:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant