Skip to main content

← Wiki

Reliability dashboard

Self-applied reproducibility, replicability, and robustness metrics. Computed deterministically from the live catalog + editorial state — no surveys, no fitted models. Snapshot .

Framing follows the Three Rs taxonomy from Nature's April 2026 Reliable research in the social and behavioural sciences collection. See methodology §0.

1 · Reproducible

Same analysis on the same data should produce the same result.

100%

Articles render deterministically from typed catalog constants. No random sampling, no fitted model, no LLM prose in any article body.

Verify yourself: download /wiki/catalog/json or /wiki/catalog/csv — the entire catalog state, machine-readable, CORS-open for any secondary analysis.

2 · Replicable

An independent expert reading the same primary sources should reach the same coverage classification.

Editorial agreement rate: pending first Coverage Games event.

The Coverage Games protocol (3-5 independent editors classify a sample of cells from primary sources; publish the disagreement matrix) is documented in docs/coverage-games-process.md. The first event will sample 20 cells across 5 topics; results posted here when complete.

Editorial board status: 1 founding editor in place; 5 subject-editor slots open for recruitment. See /wiki/editorial-board.

3 · Robust

Alternative analytical assumptions should not flip the conclusion. Per-cell confidence tier surfaces where a stricter rubric (e.g., operative-article-only) would plausibly produce a different label.

9%

of cells classified with explicit confidence tier (42/494). Editorial backfill in progress — first quarterly Coverage Games will prioritise the remaining 452 cells.

29 high confidence
9 medium
4 low (contested)

4 · Honest disclosure

Catalog freshness + the gap between “structurally committed” and “operationally implemented.”

Editorial review staleness (85 catalog rows)

13 fresh (≤90d)
0 aging (90-180d)
0 stale (>180d)
72 never reviewed

Catalog scope

  • 26 published instruments
  • 19 published topics
  • 10 published benchmarks
  • 30 published concepts
  • 0 drafts pending review (/wiki/preview admin only)

Known not-yet-implemented commitments (per methodology page honest-disclosure pattern):

  • ?asOf= version pinning — banner shows but article body renders current state; true snapshot rendering on the roadmap. See methodology §6.
  • DOIs via Zenodo — current persistent identifier is the committed-stable wiki URL. See methodology §7.
  • Editorial-board subject-editor slots (5 of 6) — being recruited. See /wiki/editorial-board.

5 · Coverage matrix at a glance

101

governs

87

implicit

1

conflicts

305

silent (62% of matrix)

Catalog-derived dashboard. Every metric is computed at request time from live catalog state — no caching of stale numbers, no hand-edited stats. Source: src/app/wiki/meta/page.tsx + the typed catalog.

See also: methodology · editorial board · quarterly briefing · changelog.