Impacts on Evaluation of LMs

[Submitted on 18 Feb 2025 (v1), last revised 21 May 2025 (this version, v2)]
[2410.13779] The Mystery of the Pathological Path-star Task for Language Models

[Submitted on 17 Oct 2024 (v1), last revised 19 May 2025 (this version, v2)]
General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs

[Submitted on 18 Feb 2025 (v1), last revised 20 May 2025 (this version, v5)]
Can Embodied Agents Understand Vague Human Instructions in Task Planning?

[Submitted on 16 May 2025 (v1), last revised 19 May 2025 (this version, v2)]
LLM-KG-Bench 3.0: A Compass for Semantic Technology Capabilities in the Ocean of LLMs

arXiv:2505.13098v1 Announce Type: cross Abstract: Current Large Language Models (LLMs) can assist developing program code beside many other things, but ...
LLM Agent as a Shield between User and Recommender Systems

[Submitted on 20 Feb 2025 (v1), last revised 16 May 2025 (this version, v2)]
Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models

arXiv:2505.10543v1 Announce Type: cross Abstract: While large language models demonstrate impressive performance on static benchmarks, the true potential of large ...
Superposition Yields Robust Neural Scaling

arXiv:2505.10465v1 Announce Type: cross Abstract: The success of today’s large language models (LLMs) depends on the observation that larger models ...
Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors

arXiv:2505.09610v1 Announce Type: cross Abstract: The use of Large Language Models (LLMs) in hardware design has taken off in recent ...
Clicking some of the silly options: Exploring Player Motivation in Static and Dynamic Educational Interactive Narratives

arXiv:2505.08891v1 Announce Type: new Abstract: Motivation is an important factor underlying successful learning. Previous research has demonstrated the positive effects ...