The model learns that hedging is a signal of lower-quality output. This creates a systematic bias toward sounding certain.
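As a rough illustration of how such a bias could arise, the sketch below fits a one-feature Bradley-Terry preference model to a handful of made-up comparisons. Everything here is an assumption for illustration: the `pairs` data, the `fit_hedge_weight` helper, and the idea that hedge count is the only feature. It is not the training setup of any particular model. If raters mostly prefer the less-hedged answer, the fitted weight on hedging comes out negative, so a scorer built on it rewards sounding certain.

```python
import math

# Hypothetical preference data: (hedge count in preferred answer, hedge count in rejected answer).
# Assumption: raters chose the less-hedged answer in 4 of these 5 comparisons.
pairs = [(0, 2), (1, 3), (0, 1), (0, 2), (2, 1)]


def fit_hedge_weight(pairs, lr=0.1, steps=500):
    """Fit w in P(a preferred over b) = sigmoid(w * (hedges_a - hedges_b)) by gradient ascent."""
    w = 0.0
    for _ in range(steps):
        grad = 0.0
        for h_pref, h_rej in pairs:
            diff = h_pref - h_rej
            p = 1.0 / (1.0 + math.exp(-w * diff))  # model's probability of the observed choice
            grad += (1.0 - p) * diff               # gradient of the log-likelihood w.r.t. w
        w += lr * grad / len(pairs)
    return w


def score(hedge_count, w):
    """Toy reward: a linear function of how many hedges the answer contains."""
    return w * hedge_count


w = fit_hedge_weight(pairs)
print(f"learned weight on hedge count: {w:.3f}")
print("unhedged answer scores higher:", score(0, w) > score(3, w))
```

The single comparison where the more-hedged answer won keeps the weight finite, but the majority pattern still pushes it below zero, which is the bias in miniature.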
QED, an AI assistant, evaluates the originality and validity of bioRxiv preprints and assigns each one a QED Score. Researchers report that its rankings often align with expert opinion.