Arcee's MergeKit: A Toolkit for Merging Large Language Models, by Charles Goddard and...
Authors: Sheng Zhang, Yanbo Xu, Naoto Usuyama, Hanwen Xu, Jaspreet Bagga, Robert Tinn, Sam Preston, Rajesh Rao, Mu Wei, Naveen Valluri,...
Literature Meets Data: A Synergistic Approach to Hypothesis Generation, by Haokun Liu and...
LoRA-LiteE: A Computationally Efficient Framework for Chatbot Preference-Tuning, by Yahe Yang and 2...
Authors: Sudeshna Das, Yao Ge, Yuting Guo, Swati Rajwal, JaMor Hairston, Jeanne Powell, Drew Walker, Snigdha Peddireddy, Sahithi Lakamana, Selen Bozkurt,...
Scaling Efficient LLMs, by B.N. Kausik. Abstract: Trained LLMs are typically sparse...
Revisiting In-Context Learning with Long Context Language Models, by Jinheon Baek and 5...
Prompting Disentangled Embeddings for Knowledge Graph Completion with Pre-trained Language Model, by Yuxia...
MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization, by Haina Zhu...
arXiv:2501.01336v1 (Announce Type: new). Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities in complex reasoning tasks. However, they can...