Long-Context Evaluation Beyond Literal Matching
[ad_1] [Submitted on 7 Feb 2025 (v1), last revised 26 Mar 2025 (this version, v2)] View a PDF of the ...
Read more A Novel Approach for the Automated Evaluation of Open-Ended Question Generation
[ad_1] [Submitted on 16 Oct 2024 (v1), last revised 25 Mar 2025 (this version, v3)] View a PDF of the ...
Read more A Floater-Free Framework for 3D Gaussian Splatting
[ad_1] [Submitted on 24 Mar 2025 (v1), last revised 25 Mar 2025 (this version, v2)] View a PDF of the ...
Read more Exploring Training and Inference Scaling Laws in Generative Retrieval
[ad_1] arXiv:2503.18941v1 Announce Type: cross Abstract: Generative retrieval has emerged as a novel paradigm that leverages large language models (LLMs) ...
Read more A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving
[ad_1] [Submitted on 30 Dec 2024 (v1), last revised 21 Mar 2025 (this version, v3)] View a PDF of the ...
Read more Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery
[ad_1] [Submitted on 15 Mar 2024 (v1), last revised 21 Mar 2025 (this version, v3)] View a PDF of the ...
Read more Optimizing Data Mixtures by Predicting Language Modeling Performance
[ad_1] [Submitted on 25 Mar 2024 (v1), last revised 20 Mar 2025 (this version, v2)] View a PDF of the ...
Read more Machine Unlearning in Hyperbolic vs. Euclidean Multimodal Contrastive Learning: Adapting Alignment Calibration to MERU
[ad_1] arXiv:2503.15166v1 Announce Type: cross Abstract: Machine unlearning methods have become increasingly important for selective concept removal in large pre-trained ...
Read more [2402.13213] Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&A
[ad_1] [Submitted on 20 Feb 2024 (v1), last revised 19 Mar 2025 (this version, v3)] View a PDF of the ...
Read more MoonCast: High-Quality Zero-Shot Podcast Generation
[ad_1] arXiv:2503.14345v1 Announce Type: cross Abstract: Recent advances in text-to-speech synthesis have achieved notable success in generating high-quality short utterances ...
Read more