beam
Discover
PulseActivityAnalyticsBest forMapOrgs
Niches
AgentsMCPRAGCoding AssistantsInference & ServingVector DBs
Personal
WatchlistCompare
A
Hi, Adam
—
beam
LIVE──────── · ──:──:── UTCabout
beam
Discover
PulseActivityAnalyticsBest forMapOrgs
Niches
AgentsMCPRAGCoding AssistantsInference & ServingVector DBs
Personal
WatchlistCompare
A
Hi, Adam
—
beam
Back to Pulse
Tool profile

huggingface/evaluation-guidebook

dying
Stale

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

evaluationevaluation-metricsguidebooklarge-language-modelsllmmachine-learningtutorial
Velocity score
0.02/ 10
[STARS]
2.1k
[FORKS]
122
[CONTRIBUTORS]
13
[LAST_COMMIT]
5mo ago
OPEN_ON_GITHUB
Velocity class: dying
30-day stars
0.02/ 10 score
last 36d
[SIGNAL_TRACE / 36_PT]
Score breakdown
610/ 1000
eval-benchmark · huggingface/evaluation-guidebook
Velocity50%
Adoption30%
Maintenance15%
Community5%
[CODE_GROWTH]
824
[INSTALL_VEL]
500
[ACTIVITY]
94
[COMMUNITY_SIGNAL]
671

Terminal score: 0–1000 raw, weighted across 4 dimensions. Public score: 0–10 normalized (shown in the 30-day stars chart above).

LIVE──────── · ──:──:── UTCabout