Can Embodied Agents Understand Vague Human Instructions in Task Planning?

[Submitted on 16 May 2025 (v1), last revised 19 May 2025 (this version, v2)]
LLM-KG-Bench 3.0: A Compass for Semantic Technology Capabilities in the Ocean of LLMs

arXiv:2505.13098v1 Announce Type: cross Abstract: Current Large Language Models (LLMs) can assist developing program code beside many other things, but ...
LLM Agent as a Shield between User and Recommender Systems

[Submitted on 20 Feb 2025 (v1), last revised 16 May 2025 (this version, v2)]
Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models

arXiv:2505.10543v1 Announce Type: cross Abstract: While large language models demonstrate impressive performance on static benchmarks, the true potential of large ...
Superposition Yields Robust Neural Scaling

arXiv:2505.10465v1 Announce Type: cross Abstract: The success of today’s large language models (LLMs) depends on the observation that larger models ...
Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors

arXiv:2505.09610v1 Announce Type: cross Abstract: The use of Large Language Models (LLMs) in hardware design has taken off in recent ...
Clicking some of the silly options: Exploring Player Motivation in Static and Dynamic Educational Interactive Narratives

arXiv:2505.08891v1 Announce Type: new Abstract: Motivation is an important factor underlying successful learning. Previous research has demonstrated the positive effects ...
A Benchmark for Narrative-Driven Drama Series Understanding

[Submitted on 30 Apr 2025 (v1), last revised 13 May 2025 (this version, v3)]
[2410.12876] In-context KV-Cache Eviction for LLMs via Attention-Gate

[Submitted on 15 Oct 2024 (v1), last revised 17 Apr 2025 (this version, v3)]
[2411.16707] Enhancing LLMs for Power System Simulations: A Feedback-driven Multi-agent Framework

[Submitted on 21 Nov 2024 (v1), last revised 15 Apr 2025 (this version, v2)]