LLMs can learn about themselves by introspection — AI Alignment Forum

Are LLMs capable of introspection, i.e. special access to their own inner states? Can they use this access to report facts ...

Azalea: a science-fiction story

“This is simply a question of right and wrong.” “You can’t deny the costs, though. You keep saying that just ...

Sabotage Evaluations for Frontier Models — AI Alignment Forum

This is a linkpost for a new research paper from the Alignment Evaluations team at Anthropic and other researchers, introducing ...

The race to find new materials with AI needs more data. Meta is giving massive amounts away for free

“We’re really firm believers that by contributing to the community and building upon open-source data models, the whole community moves ...

AI could help people find common ground during deliberations

Participants were divided up into six-person groups, with one participant in each randomly assigned to write statements on behalf of ...

Transforming software with generative AI

Where exactly are we on this transformative journey? How are enterprises navigating this new terrain—and what’s still ahead? To investigate ...

Minimal Motivation of Natural Latents — AI Alignment Forum

Suppose two Bayesian agents are presented with the same spreadsheet – IID samples of data in each row, a feature ...

Cloud transformation clears businesses for digital takeoff

Rajeev: Sure. I think these days none of these conversations can be complete without talking about AI and gen AI. ...

An Opinionated Evals Reading List — AI Alignment Forum

While you can make a lot of progress in evals with tinkering and paying little attention to the literature, we ...

The Download: Protecting farmworkers from heat, and AI’s Nobel Prize

On July 21, 2024, temperatures soared in many parts of the world, breaking the record for the hottest day ever ...