Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
[Submitted on 4 Nov 2024 (v1), last revised 28 Dec 2024 (this version, v2)] Authors:Fali Wang, Zhiwei Zhang, Xianren Zhang, ...
Read more

[2412.16576] Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
[Submitted on 21 Dec 2024 (v1), last revised 24 Dec 2024 (this version, v2)] View a PDF of the paper ...
Read more

Investigating Temporal Vulnerabilities in LLMs

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
[Submitted on 4 Jul 2024 (v1), last revised 23 Dec 2024 (this version, v3)] View a PDF of the paper ...
Read more

The Task Shield: Enforcing Task Alignment to Defend Against Indirect Prompt Injection in LLM Agents

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
arXiv:2412.16682v1 Announce Type: cross Abstract: Large Language Model (LLM) agents are increasingly being deployed as conversational assistants capable of performing ...
Read more

MORTAR: Metamorphic Multi-turn Testing for LLM-based Dialogue Systems

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
arXiv:2412.15557v1 Announce Type: cross Abstract: With the widespread application of LLM-based dialogue systems in daily life, quality assurance has become ...
Read more

[2403.17196] Text Understanding in GPT-4 vs Humans

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
[Submitted on 25 Mar 2024 (v1), last revised 20 Dec 2024 (this version, v3)] View a PDF of the paper ...
Read more

A Rate-Distortion Framework for Black-Box Language Models

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
[Submitted on 22 Jul 2024 (v1), last revised 11 Dec 2024 (this version, v2)] View a PDF of the paper ...
Read more

Benchmarking and Enhancing Multimodal Models on Visual Illusions

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
[Submitted on 11 Dec 2024] View a PDF of the paper titled Illusory VQA: Benchmarking and Enhancing Multimodal Models on ...
Read more

Simulating Legislative System for Roll Call Votes Prediction with Large Language Models

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals ...
Read more

Extreme Context Compression for Retrieval-augmented Generation with One Token

Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
[Submitted on 22 May 2024 (v1), last revised 9 Dec 2024 (this version, v2)] View a PDF of the paper ...
Read more