AI-biguities Index
Where the clankers agree, split, and wander off.
We scanned the active question bank across the modes to see which models draw hard boundaries, which categories produce real disagreement, and where the pack becomes oddly synchronized.
Take Quick Start Read model personalities
Definitiveness measures edge-leaning answers. Spread measures a model's range across the full answer space. Contention measures distance from the rest of the models.
Higher means the models cluster together across the bank.
Definitiveness
Hardest edge-leaning answers
Higher means the model lands closer to the ends of each scale.
Spread
Widest stance range
Higher means the model covers more of the answer spectrum.
Contention
Most contentious opinions
Higher means the model sits farther from the others on average.
Questions
Most contentious questions
The questions that pulled the models farthest apart.
Questions
Most settled questions
The questions where the models mostly landed together.