DeepMind’s latest research at NeurIPS 2022

Advancing best-in-class giant fashions, compute-optimal RL brokers, and extra clear, moral, and truthful AI programs

The thirty-sixth Worldwide Convention on Neural Data Processing Techniques (NeurIPS 2022) is happening from 28 November – 9 December 2022, as a hybrid occasion, based mostly in New Orleans, USA.

NeurIPS is the world’s largest convention in synthetic intelligence (AI) and machine studying (ML), and we’re proud to assist the occasion as Diamond sponsors, serving to foster the change of analysis advances within the AI and ML group.

Groups from throughout DeepMind are presenting 47 papers, together with 35 exterior collaborations in digital panels and poster classes. Right here’s a short introduction to a few of the analysis we’re presenting:

Finest-in-class giant fashions

Giant fashions (LMs) – generative AI programs skilled on large quantities of knowledge – have resulted in unimaginable performances in areas together with language, textual content, audio, and picture era. A part of their success is all the way down to their sheer scale.

Nevertheless, in Chinchilla, we’ve created a 70 billion parameter language model that outperforms many larger models, together with Gopher. We up to date the scaling legal guidelines of enormous fashions, displaying how beforehand skilled fashions have been too giant for the quantity of coaching carried out. This work already formed different fashions that comply with these up to date guidelines, creating leaner, higher fashions, and has gained an Outstanding Main Track Paper award on the convention.

Constructing upon Chinchilla and our multimodal fashions NFNets and Perceiver, we additionally current Flamingo, a family of few-shot learning visual language models. Dealing with photos, movies and textual knowledge, Flamingo represents a bridge between vision-only and language-only fashions. A single Flamingo mannequin units a brand new state-of-the-art in few-shot studying on a variety of open-ended multimodal duties.

And but, scale and structure aren’t the one elements which might be necessary for the facility of transformer-based fashions. Knowledge properties additionally play a major position, which we focus on in a presentation on data properties that promote in-context learning in transformer models.

Optimising reinforcement studying

Reinforcement studying (RL) has proven nice promise as an strategy to creating generalised AI programs that may tackle a variety of advanced duties. It has led to breakthroughs in lots of domains from Go to arithmetic, and we’re all the time searching for methods to make RL brokers smarter and leaner.

We introduce a brand new strategy that enhances the decision-making talents of RL brokers in a compute-efficient means by drastically expanding the scale of information available for their retrieval.

We’ll additionally showcase a conceptually easy but common strategy for curiosity-driven exploration in visually advanced environments – an RL agent referred to as BYOL-Explore. It achieves superhuman efficiency whereas being sturdy to noise and being a lot less complicated than prior work.

Algorithmic advances

From compressing knowledge to operating simulations for predicting the climate, algorithms are a basic a part of fashionable computing. And so, incremental enhancements can have an unlimited influence when working at scale, serving to save power, time, and cash.

We share a radically new and extremely scalable methodology for the automatic configuration of computer networks, based mostly on neural algorithmic reasoning, displaying that our extremely versatile strategy is as much as 490 instances sooner than the present state-of-the-art, whereas satisfying nearly all of the enter constraints.

Throughout the identical session, we additionally current a rigorous exploration of the beforehand theoretical notion of “algorithmic alignment”, highlighting the nuanced relationship between graph neural networks and dynamic programming, and the way greatest to mix them for optimising out-of-distribution efficiency.

Pioneering responsibly

On the coronary heart of DeepMind’s mission is our dedication to behave as accountable pioneers within the area of AI. We’re dedicated to growing AI programs which might be clear, moral, and truthful.

Explaining and understanding the behaviour of advanced AI programs is a vital a part of creating truthful, clear, and correct programs. We provide a set of desiderata that capture those ambitions, and describe a practical way to meet them, which entails coaching an AI system to construct a causal mannequin of itself, enabling it to elucidate its personal behaviour in a significant means.

To behave safely and ethically on the earth, AI brokers should be capable of cause about hurt and keep away from dangerous actions. We’ll introduce collaborative work on a novel statistical measure referred to as counterfactual harm, and exhibit the way it overcomes issues with commonplace approaches to keep away from pursuing dangerous insurance policies.

Lastly, we’re presenting our new paper which proposes ways to diagnose and mitigate failures in model fairness caused by distribution shifts, displaying how necessary these points are for the deployment of protected ML applied sciences in healthcare settings.

See the total vary of our work at NeurIPS 2022 here.

Source link

#DeepMinds #newest #analysis #NeurIPS

DeepMind’s latest research at NeurIPS 2022

Finest-in-class giant fashions

Optimising reinforcement studying

Algorithmic advances

Pioneering responsibly

Recent Posts

SoftBank in talks for $5 billion margin loan

Robot Talk Episode 128 – Making microrobots move, with Ali K. Hoshiar

How do our bodies remember?

Dreaming in Blocks — MineWorld, the Minecraft World Model

Rocket Report: Bezos’ firm will package satellites for launch; Starship on deck

Men Are Betting on WNBA Players’ Menstrual Cycles

The Download: Our bodies’ memories, and Traton’s electric trucks

Brendan Carr wants to let internet providers charge hidden fees again

Meta Tells Its Metaverse Workers to Use AI to ‘Go 5X Faster’