Validity asks: did the study measure what it claims?
Applicability asks: how much decision weight should I assign this evidence for my context?
S
Setting How real was the environment?
1Toy Task
2Controlled Lab
3Hi-Fi Sim
4Field Work
5In-Situ
N
N-Scale How many humans were observed?
0<10
110–99
2100–1K
31K–10K
410K–100K
5100K+
A
Attribution How confident is the causal claim?
0Descriptive
1Correlational
2Quasi-Exp
3Experimental
4Replicated
5Converged
P
Provenance What kind of data is this?
1Engaged
2Byproduct
3Passive
4Experimental
5Benchmark
6Synthetic
⚠ Provenance note: Higher P ≠ better. This axis measures distance from unmediated reality, not quality. P4 (experimental) is more constructed than P1 (observed), but often produces stronger attribution. These are independent properties.