LLMs can learn about themselves by introspection — AI Alignment Forum
Are LLMs capable of introspection, i.e. special access to their own inner states? Can they use this access to report facts ...
Azalea: a science-fiction story
“This is simply a question of right and wrong.” “You can’t deny the costs, though. You keep saying that just ...
Sabotage Evaluations for Frontier Models — AI Alignment Forum
This is a linkpost for a new research paper from the Alignment Evaluations team at Anthropic and other researchers, introducing ...
The race to find new materials with AI needs more data. Meta is giving massive amounts away for free
“We’re really firm believers that by contributing to the community and building upon open-source data models, the whole community moves ...
AI could help people find common ground during deliberations
Participants were divided up into six-person groups, with one participant in each randomly assigned to write statements on behalf of ...
Transforming software with generative AI
Where exactly are we on this transformative journey? How are enterprises navigating this new terrain—and what’s still ahead? To investigate ...
Minimal Motivation of Natural Latents — AI Alignment Forum
Suppose two Bayesian agents are presented with the same spreadsheet – IID samples of data in each row, a feature ...
Cloud transformation clears businesses for digital takeoff
Rajeev: Sure. I think these days none of these conversations can be complete without talking about AI and gen AI. ...
An Opinionated Evals Reading List — AI Alignment Forum
While you can make a lot of progress in evals with tinkering and paying little attention to the literature, we ...
The Download: Protecting farmworkers from heat, and AI’s Nobel Prize
On July 21, 2024, temperatures soared in many parts of the world, breaking the record for the hottest day ever ...