View a PDF of the paper titled Towards Threshold-Free KV Cache Pruning, by Xuanfan Ni and 8 other authors View...
Read moreDetailsView a PDF of the paper titled Multimodal Fact-Checking: An Agent-based Approach, by Danni Xu and 3 other authors View...
Read moreDetailsView a PDF of the paper titled KVCrush: Key value cache size-reduction using similarity in head-behaviour, by Gopi Krishna Jha...
Read moreDetailsView a PDF of the paper titled Towards Acyclic Preference Evaluation of Language Models via Multiple Evaluators, by Zhengyu Hu...
Read moreDetailsView a PDF of the paper titled Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs, by Jing Yang Lee...
Read moreDetailsView a PDF of the paper titled Training Language Models to Explain Their Own Computations, by Belinda Z. Li and...
Read moreDetailsView a PDF of the paper titled Chunk Based Speech Pre-training with High Resolution Finite Scalar Quantization, by Yun Tang...
Read moreDetailsView a PDF of the paper titled Breadcrumbs Reasoning: Memory-Efficient Reasoning with Compression Beacons, by Giovanni Monea and 4 other...
Read moreDetailsAuthors:Zihao Cheng, Yuheng Lu, Huaiqian Ye, Zeming Liu, Minqi Wang, Jingjing Liu, Zihan Li, Wei Fan, Yuanfang Guo, Ruiji Fu,...
Read moreDetailsView a PDF of the paper titled Trusted Uncertainty in Large Language Models: A Unified Framework for Confidence Calibration and...
Read moreDetails