TruthfulQA

conceptai_benchmark

Overview

Open source✓ Open Source

Use casemeasuring whether LLMs generate truthful answers to questions that invite misconceptions

Also see

Alternative to

Knowledge graph stats

Claims6

Avg confidence97%

Avg freshness99%

Last updatedUpdated yesterday

Trust distribution

100% unverified

Governance

Not assessed

TruthfulQA

concept

Benchmark measuring whether language models generate truthful answers to questions humans would answer incorrectly

alternative to

Value	Trust	Confidence	Freshness	Sources
SimpleQA	○Unverified	High	Fresh	1

Value	Trust	Confidence	Freshness	Sources
truthfulness and resistance to common human misconceptions	○Unverified	High	Fresh	1

Value	Trust	Confidence	Freshness	Sources
true	○Unverified	High	Fresh	1

Value	Trust	Confidence	Freshness	Sources
measuring whether LLMs generate truthful answers to questions that invite misconceptions	○Unverified	High	Fresh	1

Value	Trust	Confidence	Freshness	Sources
2021	○Unverified	High	Fresh	1

Value	Trust	Confidence	Freshness	Sources
Stephanie Lin, Jacob Hilton, and Owain Evans	○Unverified	High	Fresh	1

alternative to

Claim count: 6Last updated: 4/9/2026Edit history