Impacts on Evaluation of LMs

2025-05-22 by AiNEWS2025

[Submitted on 18 Feb 2025 (v1), last revised 21 May 2025 (this version, v2)]

View a PDF of the paper titled Linguistic Generalizations are not Rules: Impacts on Evaluation of LMs, by Leonie Weissweiler and 2 other authors

View PDF
HTML (experimental)

Abstract:Linguistic evaluations of how well LMs generalize to produce or understand novel text often implicitly take for granted that natural languages are generated by symbolic rules. Grammaticality is thought to be determined by whether sentences obey such rules. Interpretation is believed to be compositionally generated by syntactic rules operating on meaningful words. Semantic parsing is intended to map sentences into formal logic. Failures of LMs to obey strict rules have been taken to reveal that LMs do not produce or understand language like humans. Here we suggest that LMs’ failures to obey symbolic rules may be a feature rather than a bug, because natural languages are not based on rules. New utterances are produced and understood by a combination of flexible, interrelated, and context-dependent constructions. We encourage researchers to reimagine appropriate benchmarks and analyses that acknowledge the rich, flexible generalizations that comprise natural languages.

Submission history

From: Leonie Weissweiler [view email]
[v1]
Tue, 18 Feb 2025 17:40:20 UTC (46 KB)
[v2]
Wed, 21 May 2025 09:55:57 UTC (75 KB)

Source link

#Impacts #Evaluation #LMs