Mapping the misuse of generative AI


Responsibility & Safety

Authors

Nahema Marchal and Rachel Xu


New research analyzes the misuse of multimodal generative AI today, in order to help build safer and more responsible technologies

Generative artificial intelligence (AI) models that can produce images, text, audio, video and more are enabling a new era of creativity and commercial opportunity. Yet, as these capabilities grow, so does the potential for their misuse, including manipulation, fraud, bullying or harassment.

As part of our commitment to develop and use AI responsibly, we published a new paper, in partnership with Jigsaw and Google.org, analyzing how generative AI technologies are being misused today. Teams across Google are using this and other research to develop better safeguards for our generative AI technologies, among other safety initiatives.

Together, we gathered and analyzed nearly 200 media reports capturing public incidents of misuse, published between January 2023 and March 2024. From these reports, we defined and categorized common tactics for misusing generative AI and found novel patterns in how these technologies are being exploited or compromised.

By clarifying the current threats and tactics used across different types of generative AI outputs, our work can help shape AI governance and guide companies like Google and others building AI technologies in developing more comprehensive safety evaluations and mitigation strategies.

Highlighting the main categories of misuse

While generative AI tools represent a unique and compelling means to enhance creativity, the ability to produce bespoke, realistic content has the potential to be used in inappropriate ways by malicious actors.

By analyzing media reports, we identified two main categories of generative AI misuse tactics: the exploitation of generative AI capabilities and the compromise of generative AI systems. Examples of the technologies being exploited included creating realistic depictions of human likenesses to impersonate public figures; instances of the technologies being compromised included 'jailbreaking' to remove model safeguards and using adversarial inputs to cause malfunctions.

Relative frequency of generative AI misuse tactics in our dataset. Any given case of misuse reported in the media may involve multiple tactics.
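To make this two-level grouping concrete, here is a minimal Python sketch of how coded incidents could be tallied by top-level category. The tactic names, the sample records, and the mapping itself are illustrative stand-ins, not the paper's actual coding scheme.

```python
from collections import Counter

# Illustrative two-level taxonomy: each tactic is assigned to one of the
# two top-level categories identified in the analysis (names are hypothetical).
TACTIC_CATEGORY = {
    "impersonation": "exploitation of capabilities",
    "scams_fraud": "exploitation of capabilities",
    "synthetic_persona": "exploitation of capabilities",
    "jailbreaking": "compromise of systems",
    "adversarial_inputs": "compromise of systems",
}

# Hypothetical coded incidents; a real dataset would hold ~200 media reports,
# each annotated with one or more tactics.
incidents = [
    {"id": 1, "tactics": ["impersonation", "scams_fraud"]},
    {"id": 2, "tactics": ["jailbreaking"]},
    {"id": 3, "tactics": ["synthetic_persona", "impersonation"]},
]

# Tally how often each top-level category appears across all incidents.
category_counts = Counter(
    TACTIC_CATEGORY[tactic]
    for incident in incidents
    for tactic in incident["tactics"]
)

for category, count in category_counts.most_common():
    print(f"{category}: {count}")
```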

Cases of exploitation, involving malicious actors exploiting easily accessible, consumer-level generative AI tools, often in ways that didn't require advanced technical skills, were the most prevalent in our dataset. For example, we reviewed a high-profile case from February 2024 where an international firm reportedly lost HK$200 million (approx. US $26 million) after an employee was tricked into making a financial transfer during an online meeting. In this instance, every other "person" in the meeting, including the company's chief financial officer, was in fact a convincing computer-generated imposter.

Some of the most prominent tactics we observed, such as impersonation, scams, and synthetic personas, pre-date the invention of generative AI and have long been used to influence the information ecosystem and manipulate others. But wider access to generative AI tools may alter the costs and incentives behind information manipulation, giving these age-old tactics new potency and potential, especially for those who previously lacked the technical sophistication to deploy them.

Identifying strategies and combinations of misuse

Falsifying evidence and manipulating human likenesses underlie the most prevalent tactics in real-world cases of misuse. In the time period we analyzed, most cases of generative AI misuse were deployed in efforts to influence public opinion, enable scams or fraudulent activities, or generate profit.

By observing how bad actors combine their generative AI misuse tactics in pursuit of their various goals, we identified specific combinations of misuse and labeled these combinations as strategies.
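As a rough illustration of that mapping, the sketch below labels an incident's tactic combination with a higher-level strategy. The combination-to-strategy pairs are invented for illustration and are not the paper's actual strategy taxonomy.

```python
# Hypothetical mapping from order-insensitive combinations of tactics
# to higher-level strategy labels.
STRATEGIES = {
    frozenset({"impersonation", "scams_fraud"}): "defraud via impersonation",
    frozenset({"synthetic_persona", "falsified_evidence"}): "opinion manipulation",
}

def label_strategy(tactics):
    """Return the strategy label for a combination of tactics, if one is defined."""
    return STRATEGIES.get(frozenset(tactics), "uncategorized")

print(label_strategy(["scams_fraud", "impersonation"]))  # defraud via impersonation
print(label_strategy(["jailbreaking"]))                  # uncategorized
```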

Diagram of how the goals of bad actors (left) map onto their strategies of misuse (right).

Emerging forms of generative AI misuse, which are not overtly malicious, nonetheless raise ethical concerns. For example, new forms of political outreach are blurring the lines between authenticity and deception, such as government officials suddenly speaking a variety of voter-friendly languages without clearly disclosing that they're using generative AI, and activists using the AI-generated voices of deceased victims to plead for gun reform.

While the study provides novel insights on emerging forms of misuse, it's worth noting that this dataset is a limited sample of media reports. Media reports may prioritize sensational incidents, which in turn may skew the dataset toward particular types of misuse. Detecting or reporting cases of misuse may also be more challenging for those involved because generative AI systems are so novel. The dataset also doesn't draw a direct comparison between the misuse of generative AI systems and traditional content creation and manipulation tactics, such as image editing or setting up 'content farms' to create large volumes of text, video, GIFs, images and more. So far, anecdotal evidence suggests that traditional content manipulation tactics remain more prevalent.

Staying ahead of potential misuses

Our paper highlights opportunities to design initiatives that protect the public, such as advancing broad generative AI literacy campaigns, developing better interventions to protect the public from bad actors, or forewarning people and equipping them to spot and refute the manipulative strategies used in generative AI misuse.

This research helps our teams better safeguard our products by informing our development of safety initiatives. On YouTube, we now require creators to disclose when their work is meaningfully altered or synthetically generated, and appears realistic. Similarly, we updated our election advertising policies to require advertisers to disclose when their election ads include material that has been digitally altered or generated.

As we continue to broaden our understanding of malicious uses of generative AI and make further technical advances, we know it's more important than ever to make sure our work isn't happening in a silo. We recently joined the Coalition for Content Provenance and Authenticity (C2PA) as a steering committee member to help develop the technical standard and drive adoption of Content Credentials, which are tamper-resistant metadata that show how content was made and edited over time.
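To give a feel for what "tamper-resistant metadata" means in practice, here is a deliberately simplified Python sketch of the underlying idea: binding an edit history to a content hash and signing both together, so any change to either becomes detectable. This illustrates the concept only; the actual C2PA standard defines a richer manifest format and uses certificate-based signatures rather than a shared secret key.

```python
import hashlib
import hmac
import json

SECRET_KEY = b"demo-signing-key"  # stand-in; C2PA uses certificate-based signing

def attach_credentials(content: bytes, history: list) -> dict:
    """Bind an edit history to the content by signing both together."""
    manifest = {
        "content_sha256": hashlib.sha256(content).hexdigest(),
        "history": history,
    }
    payload = json.dumps(manifest, sort_keys=True).encode()
    manifest["signature"] = hmac.new(SECRET_KEY, payload, hashlib.sha256).hexdigest()
    return manifest

def verify_credentials(content: bytes, manifest: dict) -> bool:
    """Check that neither the content nor its recorded history was altered."""
    claimed = {k: v for k, v in manifest.items() if k != "signature"}
    payload = json.dumps(claimed, sort_keys=True).encode()
    expected = hmac.new(SECRET_KEY, payload, hashlib.sha256).hexdigest()
    return (
        hmac.compare_digest(expected, manifest["signature"])
        and claimed["content_sha256"] == hashlib.sha256(content).hexdigest()
    )

image = b"...image bytes..."
creds = attach_credentials(image, ["captured 2024-03-01", "cropped 2024-03-02"])
print(verify_credentials(image, creds))              # True: nothing altered
print(verify_credentials(image + b"tamper", creds))  # False: content changed
```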

In parallel, we're also conducting research that advances existing red-teaming efforts, including improving best practices for testing the safety of large language models (LLMs), and developing pioneering tools to make AI-generated content easier to identify, such as SynthID, which is being integrated into a growing range of products.

In recent years, Jigsaw has conducted research with misinformation creators to understand the tools and tactics they use, developed prebunking videos to forewarn people of attempts to manipulate them, and shown that prebunking campaigns can improve misinformation resilience at scale. This work forms part of Jigsaw's broader portfolio of information interventions to help people protect themselves online.

By proactively addressing potential misuses, we can foster responsible and ethical use of generative AI, while minimizing its risks. We hope these insights on the most common misuse tactics and strategies will help researchers, policymakers, and industry trust and safety teams build safer, more responsible technologies and develop better measures to combat misuse.
