Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs
The LiveCodeBench leaderboard evaluates code LLMs on a holistic set of tasks without contamination, providing a more accurate assessment of their performance. It aims to help builders compare and improve code generation models. The leaderboard is open and accessible on the Hugging Face platform.
- LiveCodeBench evaluates code LLMs on diverse tasks without contamination.
- Leaderboard is open and accessible on Hugging Face.
- Helps builders compare and improve code generation models.