Anthropic hires its first “AI welfare” researcher

The researchers suggest that companies could adapt the “marker method” that some researchers use to assess consciousness in animals: looking for specific indicators that may correlate with consciousness, although these markers are still speculative. The authors emphasize that no single feature would definitively prove consciousness, but they claim that examining multiple indicators may help companies make probabilistic assessments about whether their AI systems might require moral consideration.

The risks of wrongly thinking software is sentient

While the researchers behind “Taking AI Welfare Seriously” worry that companies might create and mistreat conscious AI systems on a massive scale, they also caution that companies could waste resources protecting AI systems that don’t actually need moral consideration.

Incorrectly anthropomorphizing software, or ascribing human traits to it, can present risks in other ways. For example, that belief can enhance the manipulative powers of AI language models by suggesting that AI models have capabilities, such as human-like emotions, that they actually lack. In 2022, Google fired engineer Blake Lemoine after he claimed that the company’s AI model, called “LaMDA,” was sentient and argued for its welfare internally.

And shortly after Microsoft released Bing Chat in February 2023, many people were convinced that Sydney (the chatbot’s code name) was sentient and somehow suffering because of its simulated emotional display. So much so, in fact, that when Microsoft “lobotomized” the chatbot by changing its settings, users convinced of its sentience mourned the loss as if they had lost a human friend. Others endeavored to help the AI model somehow escape its bonds.

Even so, as AI models get more advanced, the idea of potentially safeguarding the welfare of future, more advanced AI systems is seemingly gaining steam, although fairly quietly. As Transformer’s Shakeel Hashim points out, other tech companies have started similar initiatives to Anthropic’s. Google DeepMind recently posted a job listing for research on machine consciousness (since removed), and the authors of the new AI welfare report thank two OpenAI staff members in the acknowledgements.
