-
nuprl/MultiPL-E
Viewer • Updated • 12.7k • 56.5k • 67 -
openai/openai_humaneval
Viewer • Updated • 164 • 284k • 390 -
Big Code Models Leaderboard
📈1.51kExplore and compare code model performance on a leaderboard
-
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper • 2402.14261 • Published • 10
Shaun
drgitt
AI & ML interests
None yet
Organizations
None yet
codegen_eval
-
nuprl/MultiPL-E
Viewer • Updated • 12.7k • 56.5k • 67 -
openai/openai_humaneval
Viewer • Updated • 164 • 284k • 390 - RunningAgents1.51k
Big Code Models Leaderboard
📈1.51kExplore and compare code model performance on a leaderboard
-
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper • 2402.14261 • Published • 10
Interesting LLMs