...

Long-Context Evaluation Beyond Literal Matching

Long-Context Evaluation Beyond Literal Matching
[ad_1] [Submitted on 7 Feb 2025 (v1), last revised 26 Mar 2025 (this version, v2)] View a PDF of the ...
Read more

A Novel Approach for the Automated Evaluation of Open-Ended Question Generation

Long-Context Evaluation Beyond Literal Matching
[ad_1] [Submitted on 16 Oct 2024 (v1), last revised 25 Mar 2025 (this version, v3)] View a PDF of the ...
Read more

A Floater-Free Framework for 3D Gaussian Splatting

Long-Context Evaluation Beyond Literal Matching
[ad_1] [Submitted on 24 Mar 2025 (v1), last revised 25 Mar 2025 (this version, v2)] View a PDF of the ...
Read more

Exploring Training and Inference Scaling Laws in Generative Retrieval

Long-Context Evaluation Beyond Literal Matching
[ad_1] arXiv:2503.18941v1 Announce Type: cross Abstract: Generative retrieval has emerged as a novel paradigm that leverages large language models (LLMs) ...
Read more

A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving

Long-Context Evaluation Beyond Literal Matching
[ad_1] [Submitted on 30 Dec 2024 (v1), last revised 21 Mar 2025 (this version, v3)] View a PDF of the ...
Read more

Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery

Long-Context Evaluation Beyond Literal Matching
[ad_1] [Submitted on 15 Mar 2024 (v1), last revised 21 Mar 2025 (this version, v3)] View a PDF of the ...
Read more

Optimizing Data Mixtures by Predicting Language Modeling Performance

Long-Context Evaluation Beyond Literal Matching
[ad_1] [Submitted on 25 Mar 2024 (v1), last revised 20 Mar 2025 (this version, v2)] View a PDF of the ...
Read more

Machine Unlearning in Hyperbolic vs. Euclidean Multimodal Contrastive Learning: Adapting Alignment Calibration to MERU

Long-Context Evaluation Beyond Literal Matching
[ad_1] arXiv:2503.15166v1 Announce Type: cross Abstract: Machine unlearning methods have become increasingly important for selective concept removal in large pre-trained ...
Read more

[2402.13213] Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&A

Long-Context Evaluation Beyond Literal Matching
[ad_1] [Submitted on 20 Feb 2024 (v1), last revised 19 Mar 2025 (this version, v3)] View a PDF of the ...
Read more

MoonCast: High-Quality Zero-Shot Podcast Generation

Long-Context Evaluation Beyond Literal Matching
[ad_1] arXiv:2503.14345v1 Announce Type: cross Abstract: Recent advances in text-to-speech synthesis have achieved notable success in generating high-quality short utterances ...
Read more
12316 Next