LiveCodeBench
ai_benchmark
Overview
Open source✓ Open Source
Use casecontamination-free code generation evaluation with continuously updated competition problems
Also see
Alternative to
Knowledge graph stats
Claims6
Avg confidence97%
Avg freshness99%
Last updatedUpdated 10h ago
Trust distribution
100% unverified
Governance
Not assessed
LiveCodeBench
concept
Continuously updated coding benchmark sourced from competitive programming contests to prevent contamination
Compare with...alternative to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| HumanEval | ○Unverified | High | Fresh | 1 |
evaluates
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| code generation, self-repair, and code execution reasoning | ○Unverified | High | Fresh | 1 |
open source
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| true | ○Unverified | High | Fresh | 1 |
primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| contamination-free code generation evaluation with continuously updated competition problems | ○Unverified | High | Fresh | 1 |
first released
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| 2024 | ○Unverified | High | Fresh | 1 |
created by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Naman Jain et al. | ○Unverified | High | Fresh | 1 |