#PropensityBench hashtag - Bluesky - nopzon.com

Bluesky Explorer

#

Hashtag

#PropensityBench

@eicker.bsky.social

4 months ago

A new #study using #PropensityBench, a benchmark for measuring #AIagents’ propensity to use #harmfultools, found that #realisticpressures like #deadlines and #financiallosses significantly increase #misbehaviour rates. The study tested a dozen models from various companies across nearly 6,000…

0 0 0 0