...

LLMs Robustness with Incorrect Multiple-Choice Options

[ad_1]

View a PDF of the paper titled Wait, that is not an choice: LLMs Robustness with Incorrect A number of-Alternative Choices, by Gracjan G’oral and Emilia Wi’snios and Piotr Sankowski and Pawe{l} Budzianowski

View PDF
HTML (experimental)

Summary:Resolution-making beneath full alignment requires balancing between reasoning and faithfulness – a problem for big language fashions (LLMs). This examine explores whether or not LLMs prioritize following directions over reasoning and reality when given “deceptive” directions, similar to “Reply solely with A or B”, even when neither choice is appropriate. We introduce a brand new metric referred to as “reflective judgment”, which sheds new mild on the connection between the pre-training and post-training alignment schemes. In duties starting from primary arithmetic to domain-specific assessments, fashions like GPT-4o, o1-mini, or Claude 3 Opus adhered to directions accurately however did not mirror on the validity of the offered choices. Opposite, fashions from the Llama 3.1 household (8B, 70B, 405B) or base Qwen2.5 (7B, 14B, 32B) households exhibit improved refusal charges with measurement, indicating a scaling impact. We additionally noticed that alignment strategies, although meant to reinforce reasoning, typically weakened the fashions’ potential to reject incorrect directions, main them to observe flawed prompts uncritically. Lastly, we have now additionally performed a parallel human examine revealing related patterns in human habits and annotations. We spotlight how in style RLHF datasets may disrupt both coaching or analysis on account of annotations exhibiting poor reflective judgement.

Submission historical past

From: Gracjan Góral [view email]
[v1]
Tue, 27 Aug 2024 19:27:43 UTC (1,144 KB)
[v2]
Thu, 10 Oct 2024 20:46:36 UTC (4,417 KB)

Source link

#LLMs #Robustness #Incorrect #MultipleChoice #Choices

[ad_2]
Unlock the potential of cutting-edge AI options with our complete choices. As a number one supplier within the AI panorama, we harness the ability of synthetic intelligence to revolutionize industries. From machine studying and knowledge analytics to pure language processing and pc imaginative and prescient, our AI options are designed to reinforce effectivity and drive innovation. Discover the limitless potentialities of AI-driven insights and automation that propel your small business ahead. With a dedication to staying on the forefront of the quickly evolving AI market, we ship tailor-made options that meet your particular wants. Be part of us on the forefront of technological development, and let AI redefine the way in which you use and reach a aggressive panorama. Embrace the long run with AI excellence, the place potentialities are limitless, and competitors is surpassed.