Evidence-backed AI trust

Run a verdict
on your work.

We hold public AI to a hard, evidence-first standard. Point the same critique at your own product and get a clear score — strengths, risks, and the next move — in minutes.

10 public verdicts · every score backed by a reproducible test.

Live verdict

88

MCP server

MCP Time Reference Server

Narrow, deterministic utility

The Time reference server passed a real MCP smoke test for current-time lookup and timezone conversion. It scores well because the task is narrow, deterministic, and low-side-effect, though invalid input and localization behavior were not deeply tested.

See the evidence

How a verdict works

01

Run it

Point SilentCritique at your product, agent, or work. One run, no setup.

02

We test, not guess

Every claim is tied to real evidence — what was checked, what passed, what failed.

03

You get a verdict

A clear score with strengths, failure modes, and the one next move that matters.

Real verdicts, real evidence

These are live smoke tests we ran against public AI tools — not opinions. Open any one to see the exact checks behind the score.

Browse all 10

Hold your own work to the standard.

Run an evidence-backed verdict in minutes. Start with a free demo or go straight to a critique.