Will We Know When to Unplug AI? — AI Alignment Forum

TL;DR: We introduce the first comprehensive theoretical framework for understanding and mitigating secret collusion among advanced AI agents, along with ...

2024 Climate Tech Companies to Watch: Pano AI and its fire-detecting AI

When blazes are confirmed, Pano alerts fire monitoring agencies, providing images and location data that help them respond quickly. As ...

The Obliqueness Thesis — AI Alignment Forum

In my Xenosystems review, I discussed the Orthogonality Thesis, concluding that it was a bad metaphor. It’s a long post, though, and ...

Why bigger is not always better in AI

This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox ...

A basic systems architecture for AI agents that do autonomous research — AI Alignment Forum

A lot of threat models describing how AIs might escape our control (e.g. self-exfiltration, hacking the datacenter) start out with ...

The Download: How to break up with coal, and AI’s false climate promises

September 2022—Anna Louie Sussman: Like me, my eggs were flying economy class. They were ensconced in a cryogenic storage flask ...

Base LLMs refuse too — AI Alignment Forum

Executive Summary: Refusing harmful requests is not a novel behavior learned in chat fine-tuning, as pre-trained base models will also refuse ...

Roundtables: Putting AI’s Climate Impact Into Perspective

The latest iteration of a legacy: Founded at the Massachusetts Institute of Technology in 1899, MIT Technology Review is a ...

Using LLM’s for AI Foundation research and the Simple Solution assumption — AI Alignment Forum

Current LLM based AI systems are getting pretty good at maths by writing formal proofs in Lean or similar. https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/ ...

Sorry, AI won’t “fix” climate change

“As long as we effectively subsidize fossil fuels by allowing them to use the atmosphere as a waste dump, we ...