MindEval: Benchmarking Language Models on Multi-turn Mental Health Support, by José Pombal and...
Control Illusion: The Failure of Instruction Hierarchies in Large Language Models, by Yilin...
arXiv:2512.03794v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) have achieved remarkable success in visual question answering tasks, but their reliance...
Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented Generation, by Zhan Peng...
arXiv:2512.02973v1 Announce Type: cross Abstract: While Multimodal Large Language Models (MLLMs) show remarkable capabilities, their safety alignments are susceptible to...
Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts, by Thomas...
arXiv:2512.01797v1 Announce Type: cross Abstract: Large language models (LLMs) frequently generate hallucinations (plausible but factually incorrect outputs), undermining...
arXiv:2511.21750v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) are increasingly deployed in real-world, agentic settings where outputs must...
AI-Mediated Communication Reshapes Social Structure in Opinion-Diverse Groups, by Faria Huq and 2...