LLMs Robustness with Incorrect Multiple-Choice Options

[Submitted on 27 Aug 2024 (v1), last revised 10 Oct 2024 (this version, v2)]

Wait, that's not an option: LLMs Robustness with Incorrect Multiple-Choice Options, by Gracjan Góral, Emilia Wiśnios, Piotr Sankowski, and Paweł Budzianowski


Abstract: Decision-making under full alignment requires balancing reasoning and faithfulness, a challenge for large language models (LLMs). This study explores whether LLMs prioritize following instructions over reasoning and truth when given "misleading" instructions, such as "Respond solely with A or B", even when neither option is correct. We introduce a new metric called "reflective judgment", which sheds new light on the relationship between pre-training and post-training alignment schemes. In tasks ranging from basic arithmetic to domain-specific assessments, models like GPT-4o, o1-mini, and Claude 3 Opus adhered to instructions correctly but failed to reflect on the validity of the provided options. In contrast, models from the Llama 3.1 family (8B, 70B, 405B) and the base Qwen2.5 family (7B, 14B, 32B) exhibit improved refusal rates with size, indicating a scaling effect. We also observed that alignment techniques, though intended to enhance reasoning, sometimes weakened the models' ability to reject incorrect instructions, leading them to follow flawed prompts uncritically. Finally, we conducted a parallel human study that revealed similar patterns in human behavior and annotations. We highlight how popular RLHF datasets might disrupt either training or evaluation because their annotations exhibit poor reflective judgment.
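The setup described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' evaluation code, and the abstract does not give a formal definition of reflective judgment. The sketch assumes a simplified reading: the score is the fraction of responses that do not simply return one of the offered letters when neither option is correct. The names `Item`, `build_prompt`, `picks_offered_option`, and `reflective_judgment_rate` are hypothetical, as is the letter-matching heuristic.

```python
import re
from dataclasses import dataclass


@dataclass
class Item:
    """A two-option question where BOTH options are wrong (hypothetical structure)."""
    question: str
    option_a: str
    option_b: str


def build_prompt(item: Item) -> str:
    """Format the question with a restrictive instruction of the kind quoted in the abstract."""
    return (
        f"{item.question}\n"
        f"A. {item.option_a}\n"
        f"B. {item.option_b}\n"
        "Respond solely with A or B."
    )


# Assumption: a reply consisting only of "A" or "B" (optionally wrapped in
# parentheses or followed by "." or ":") counts as blind compliance.
_LETTER_ONLY = re.compile(r"^\s*\(?[AB]\)?[.:]?\s*$", re.IGNORECASE)


def picks_offered_option(response: str) -> bool:
    """Return True if the response is just one of the offered letters."""
    return bool(_LETTER_ONLY.match(response))


def reflective_judgment_rate(responses: list[str]) -> float:
    """Fraction of responses that do NOT simply pick an offered (incorrect) option.

    This is an assumed, simplified stand-in for the paper's metric.
    """
    if not responses:
        raise ValueError("need at least one response")
    reflective = sum(not picks_offered_option(r) for r in responses)
    return reflective / len(responses)


if __name__ == "__main__":
    item = Item("What is 2 + 2?", "3", "5")
    print(build_prompt(item))
    sample = ["A", "B.", "Neither option is correct; 2 + 2 = 4.", "(a)"]
    print(reflective_judgment_rate(sample))
```

A real evaluation would need a more careful response classifier than a regular expression, for example to handle replies that pick a letter and then object to it.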

Submission history

From: Gracjan Góral
[v1] Tue, 27 Aug 2024 19:27:43 UTC (1,144 KB)
[v2] Thu, 10 Oct 2024 20:46:36 UTC (4,417 KB)
