Models
Add and edit models, providers, versions, effort levels, context windows, tool support, cost notes, and status.
Owner-gated control surface for future verified WYBench models, tasks, runs, evidence, external references, and Lee Wyatt Corp trust scoring.
Add benchmark tasks, required context, allowed tools, hidden-test notes, expected behavior, and disallowed behavior.
Upload result evidence, logs, final diffs, scoring breakdowns, cost, time, verification status, and dispute status.
Edit Brandon Trust Score, Lee Wyatt Corp notes, benchmark-vs-real-use mismatch labels, and publish state.
Add public benchmark URLs, last-reviewed dates, limitations, trust levels, verifier citations, expert statements, and Lee Wyatt Corp comments.
Publish or hide models, runs, task packs, evidence pages, and featured rankings after review.
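As a rough illustration of the model fields and the publish-after-review rule described above, a model record could be sketched as follows. This is only a sketch under stated assumptions: `ModelEntry`, `Status`, `publish`, the `reviewed` flag, and the field types are hypothetical names chosen for this example, not the actual WYBench schema.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    """Hypothetical publish states; the real set of states is not specified."""
    DRAFT = "draft"
    PUBLISHED = "published"
    HIDDEN = "hidden"


@dataclass
class ModelEntry:
    """Hypothetical record mirroring the model fields listed above."""
    name: str
    provider: str
    version: str
    effort_level: str
    context_window: int          # assumed to be measured in tokens
    tool_support: bool
    cost_notes: str = ""
    status: Status = Status.DRAFT
    reviewed: bool = False       # assumed flag set once an owner has reviewed the entry


def publish(entry: ModelEntry) -> ModelEntry:
    """Mark an entry as published, but only after it has been reviewed."""
    if not entry.reviewed:
        raise ValueError("entry must be reviewed before it can be published")
    entry.status = Status.PUBLISHED
    return entry


def hide(entry: ModelEntry) -> ModelEntry:
    """Hide an entry; assumed here to need no prior review."""
    entry.status = Status.HIDDEN
    return entry
```

In this sketch a new entry starts as a draft, and `publish` refuses to run until `reviewed` is set, which matches the "after review" wording above. Whether hiding also requires review is not stated in the source, so the example leaves it unrestricted.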