arXiv:2506.22385v1 Announce Type: cross Abstract: Video Large Multimodal Models (VLMMs) have made impressive strides in understanding video content, but they...
Read moreDetailsView a PDF of the paper titled Comparing Retrieval-Augmentation and Parameter-Efficient Fine-Tuning for Privacy-Preserving Personalization of Large Language Models, by...
Read moreDetailsarXiv:2506.21546v1 Announce Type: cross Abstract: Recent progress in vision-language segmentation has significantly advanced grounded visual understanding. However, these models often...
Read moreDetailsView a PDF of the paper titled Graph Linearization Methods for Reasoning on Graphs with Large Language Models, by Christos...
Read moreDetailsarXiv:2506.19697v1 Announce Type: cross Abstract: Extreme activation outliers in Large Language Models (LLMs) critically degrade quantization performance, hindering efficient on-device...
Read moreDetailsarXiv:2506.19441v1 Announce Type: cross Abstract: Evaluation of Text to Speech (TTS) systems is challenging and resource-intensive. Subjective metrics such as...
Read moreDetailsarXiv:2506.18777v1 Announce Type: cross Abstract: Training large language models (LLMs) on source code significantly enhances their general-purpose reasoning abilities, but...
Read moreDetailsarXiv:2506.18488v1 Announce Type: cross Abstract: The recent rise in capabilities of AI-based music generation tools has created an upheaval in...
Read moreDetailsView a PDF of the paper titled AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability, by Siwei Yang...
Read moreDetailsView a PDF of the paper titled Nested Named-Entity Recognition on Vietnamese COVID-19: Dataset and Experiments, by Ngoc C.L\^e and...
Read moreDetails