View a PDF of the paper titled Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?, by Xueru...
Read moreDetailsView a PDF of the paper titled LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM, by Zhi Zhou...
Read moreDetailsView a PDF of the paper titled FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time...
Read moreDetailsView a PDF of the paper titled Music for All: Exploring Multicultural Representations in Music Generation Models, by Atharva Mehta...
Read moreDetailsView a PDF of the paper titled On the Feasibility of In-Context Probing for Data Attribution, by Cathy Jiao and...
Read moreDetailsView a PDF of the paper titled FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure, by...
Read moreDetailsView a PDF of the paper titled SylloBio-NLI: Evaluating Large Language Models on Biomedical Syllogistic Reasoning, by Magdalena Wysocka and...
Read moreDetailsView a PDF of the paper titled Generating Structured Outputs from Language Models: Benchmark and Studies, by Saibo Geng and...
Read moreDetailsView a PDF of the paper titled Formal Proofs as Structured Explanations: Proposing Several Tasks on Explainable Natural Language Inference,...
Read moreDetailsView a PDF of the paper titled Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning, by Chaofan Lin and 8...
Read moreDetails