View a PDF of the paper titled Linguistic Generalizations are not Rules: Impacts on Evaluation of LMs, by Leonie Weissweiler and 2 other authors
Abstract:Linguistic evaluations of how well LMs generalize to produce or understand novel text often implicitly take for granted that natural languages are generated by symbolic rules. Grammaticality is thought to be determined by whether sentences obey such rules. Interpretation is believed to be compositionally generated by syntactic rules operating on meaningful words. Semantic parsing is intended to map sentences into formal logic. Failures of LMs to obey strict rules have been taken to reveal that LMs do not produce or understand language like humans. Here we suggest that LMs’ failures to obey symbolic rules may be a feature rather than a bug, because natural languages are not based on rules. New utterances are produced and understood by a combination of flexible, interrelated, and context-dependent constructions. We encourage researchers to reimagine appropriate benchmarks and analyses that acknowledge the rich, flexible generalizations that comprise natural languages.
Submission history
From: Leonie Weissweiler [view email]
[v1]
Tue, 18 Feb 2025 17:40:20 UTC (46 KB)
[v2]
Wed, 21 May 2025 09:55:57 UTC (75 KB)
Source link
#Impacts #Evaluation #LMs