Fast Generation from Convolutional Sequence Models

[Submitted on 2 Oct 2024 (v1), last revised 25 Oct 2024 (this version, v2)] View a PDF of the paper ...
Read more
[2311.09335] Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization

[Submitted on 15 Nov 2023 (v1), last revised 24 Oct 2024 (this version, v3)] View a PDF of the paper ...
Read more
[2410.15115] On Designing Effective RL Reward at Training Time for LLM Reasoning

[Submitted on 19 Oct 2024 (v1), last revised 25 Oct 2024 (this version, v2)] View a PDF of the paper ...
Read more
Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

[Submitted on 9 Jun 2024 (v1), last revised 24 Oct 2024 (this version, v3)] View a PDF of the paper ...
Read more
A Study of Compositional Generalization on OOD Prompts

[Submitted on 9 Sep 2024 (v1), last revised 24 Oct 2024 (this version, v2)] View a PDF of the paper ...
Read more
Causal Scan for LLM Misbehavior Detection

[Submitted on 22 Oct 2024 (v1), last revised 23 Oct 2024 (this version, v2)] View a PDF of the paper ...
Read more
[2406.16030] Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages

[Submitted on 23 Jun 2024 (v1), last revised 22 Oct 2024 (this version, v2)] View a PDF of the paper ...
Read more
[2405.10523] Adaptable and Reliable Text Classification using Large Language Models

[Submitted on 17 May 2024 (v1), last revised 22 Oct 2024 (this version, v2)] View a PDF of the paper ...
Read more
Evaluating the Effect of Explanations on Users’ Mental Models of Visual Question Answering Systems

[Submitted on 27 Jun 2024 (v1), last revised 21 Oct 2024 (this version, v2)] View a PDF of the paper ...
Read more
Enabling Early Exit Inference and Self-Speculative Decoding

[Submitted on 25 Apr 2024 (v1), last revised 18 Oct 2024 (this version, v4)] Authors:Mostafa Elhoushi, Akshat Shrivastava, Diana Liskovich, ...
Read more