[2410.12876] In-context KV-Cache Eviction for LLMs via Attention-Gate
[Submitted on 15 Oct 2024 (v1), last revised 17 Apr 2025 (this version, v3)]
[2411.16707] Enhancing LLMs for Power System Simulations: A Feedback-driven Multi-agent Framework
[Submitted on 21 Nov 2024 (v1), last revised 15 Apr 2025 (this version, v2)]
an open dataset and web-based application for the study of metaphor
[Submitted on 1 Mar 2025 (v1), last revised 15 Apr 2025 (this version, v2)] Authors: Maddalena Bressler, Veronica Mangiaterra, Paolo Canal, ...
Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs
arXiv:2504.11239v1 Announce Type: cross Abstract: Reasoning is the fundamental capability of large language models (LLMs). Due to the rapid progress ...
Long-Context Evaluation Beyond Literal Matching
[Submitted on 7 Feb 2025 (v1), last revised 26 Mar 2025 (this version, v2)]
A Novel Approach for the Automated Evaluation of Open-Ended Question Generation
[Submitted on 16 Oct 2024 (v1), last revised 25 Mar 2025 (this version, v3)]
A Floater-Free Framework for 3D Gaussian Splatting
[Submitted on 24 Mar 2025 (v1), last revised 25 Mar 2025 (this version, v2)]
Exploring Training and Inference Scaling Laws in Generative Retrieval
arXiv:2503.18941v1 Announce Type: cross Abstract: Generative retrieval has emerged as a novel paradigm that leverages large language models (LLMs) to ...
A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving
[Submitted on 30 Dec 2024 (v1), last revised 21 Mar 2025 (this version, v3)]
Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery
[Submitted on 15 Mar 2024 (v1), last revised 21 Mar 2025 (this version, v3)]