Causal Scan for LLM Misbehavior Detection

[Submitted on 22 Oct 2024 (v1), last revised 23 Oct 2024 (this version, v2)]
[2406.16030] Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages

[Submitted on 23 Jun 2024 (v1), last revised 22 Oct 2024 (this version, v2)]
[2405.10523] Adaptable and Reliable Text Classification using Large Language Models

[Submitted on 17 May 2024 (v1), last revised 22 Oct 2024 (this version, v2)]
Evaluating the Effect of Explanations on Users’ Mental Models of Visual Question Answering Systems

[Submitted on 27 Jun 2024 (v1), last revised 21 Oct 2024 (this version, v2)]
Enabling Early Exit Inference and Self-Speculative Decoding

[Submitted on 25 Apr 2024 (v1), last revised 18 Oct 2024 (this version, v4)] Authors: Mostafa Elhoushi, Akshat Shrivastava, Diana Liskovich, ...
Benchmarking LLM Proficiency in Scientific Literature Analysis

[Submitted on 4 Mar 2024 (v1), last revised 18 Oct 2024 (this version, v5)] Authors: Hengxing Cai, Xiaochen Cai, Junhan Chang, ...
Inferring Safe Actions for LLM-Based Agents Through Preemptive Evaluation and Human Feedback

[Submitted on 16 Jul 2024 (v1), last revised 17 Oct 2024 (this version, v2)]
Lightweight Passage Retrieval for Open Domain Multi-Document Summarization

[Submitted on 18 Jun 2024 (v1), last revised 17 Oct 2024 (this version, v2)]
[2310.20246] Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations

[Submitted on 31 Oct 2023 (v1), last revised 16 Oct 2024 (this version, v5)]
[2406.11939] From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

[Submitted on 17 Jun 2024 (v1), last revised 14 Oct 2024 (this version, v2)]