Evaluating Large Language Models on Biomedical Syllogistic Reasoning

[ad_1] [Submitted on 18 Oct 2024 (v1), last revised 10 Feb 2025 (this version, v2)] View a PDF of the ...
Read more
[2501.10868] Generating Structured Outputs from Language Models: Benchmark and Studies

[ad_1] [Submitted on 18 Jan 2025 (v1), last revised 10 Feb 2025 (this version, v2)] View a PDF of the ...
Read more
Proposing Several Tasks on Explainable Natural Language Inference

[ad_1] [Submitted on 15 Nov 2023 (v1), last revised 6 Feb 2025 (this version, v2)] View a PDF of the ...
Read more
Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning

[ad_1] [Submitted on 4 Feb 2025 (v1), last revised 6 Feb 2025 (this version, v2)] View a PDF of the ...
Read more
A Fine-grained Metric for Video Question Answering Data Quality Evaluation

[ad_1] [Submitted on 11 Nov 2024 (v1), last revised 6 Feb 2025 (this version, v3)] View a PDF of the ...
Read more
Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation

[ad_1] [Submitted on 3 Feb 2025 (v1), last revised 5 Feb 2025 (this version, v2)] View a PDF of the ...
Read more
Rule-Guided Retrieval-Augmented Generation with Language Models for Question Answering

[ad_1] [Submitted on 15 Oct 2024 (v1), last revised 5 Feb 2025 (this version, v2)] View a PDF of the ...
Read more
An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models

[ad_1] [Submitted on 13 Jun 2024 (v1), last revised 4 Feb 2025 (this version, v2)] View a PDF of the ...
Read more
Exploring the Role of Punctuation in Semantic Processing

[ad_1] [Submitted on 10 Jan 2025 (v1), last revised 2 Feb 2025 (this version, v3)] View a PDF of the ...
Read more
[2501.07927] Gandalf the Red: Adaptive Security for LLMs

[ad_1] [Submitted on 14 Jan 2025 (v1), last revised 2 Feb 2025 (this version, v2)] Authors:Niklas Pfister, Václav Volhejn, Manuel ...
Read more