TTSDS2: Resources and Benchmark for Evaluating Human-Quality Text to Speech Systems

arXiv:2506.19441v1 Announce Type: cross Abstract: Evaluation of Text to Speech (TTS) systems is challenging and resource-intensive. Subjective metrics such as ...
Read more
Programming by Backprop: LLMs Acquire Reusable Algorithmic Abstractions During Code Training

arXiv:2506.18777v1 Announce Type: cross Abstract: Training large language models (LLMs) on source code significantly enhances their general-purpose reasoning abilities, but ...
Read more
AI-Generated Song Detection via Lyrics Transcripts

arXiv:2506.18488v1 Announce Type: cross Abstract: The recent rise in capabilities of AI-based music generation tools has created an upheaval in ...
Read more
An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability

[Submitted on 14 Feb 2024 (v1), last revised 20 Jun 2025 (this version, v2)] View a PDF of the paper ...
Read more
[2504.21016] Nested Named-Entity Recognition on Vietnamese COVID-19: Dataset and Experiments

[Submitted on 21 Apr 2025 (v1), last revised 14 Jun 2025 (this version, v2)] View a PDF of the paper ...
Read more
[2502.14718] Entity Framing and Role Portrayal in the News

[Submitted on 20 Feb 2025 (v1), last revised 15 Jun 2025 (this version, v2)] Authors:Tarek Mahmoud, Zhuohan Xie, Dimitar Dimitrov, ...
Read more
On the Performance of LLMs for Real Estate Appraisal

arXiv:2506.11812v1 Announce Type: cross Abstract: The real estate market is vital to global economies but suffers from significant information asymmetry. ...
Read more
[2502.13604] BeamLoRA: Beam-Constraint Low-Rank Adaptation

[Submitted on 19 Feb 2025 (v1), last revised 12 Jun 2025 (this version, v2)] View a PDF of the paper ...
Read more
[2411.18553] Retrofitting Large Language Models with Dynamic Tokenization

[Submitted on 27 Nov 2024 (v1), last revised 11 Jun 2025 (this version, v3)] View a PDF of the paper ...
Read more
[2410.11005] Assessing Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks

[Submitted on 14 Oct 2024 (v1), last revised 9 Jun 2025 (this version, v3)] View a PDF of the paper ...
Read more