...

Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model


View a PDF of the paper titled Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model, by Chaochen Gao and 3 other authors


Abstract: Recent advancements in large language models (LLMs) have highlighted the importance of extending context lengths for handling complex tasks. While traditional methods for training on long contexts often use filtered long documents, these approaches lead to domain imbalances, limiting model performance. To address this, techniques like random document concatenation (Standard) and similarity-based methods (KNN, ICLM) have been developed. However, they either sacrifice semantic coherence or diversity. To balance both aspects, we introduce Quest, a query-centric data synthesis method aggregating semantically relevant yet diverse documents. Quest uses a generative model to predict potential queries for each document, grouping documents with similar queries and keywords. Extensive experiments demonstrate Quest’s superior performance on long-context tasks, achieving remarkable results with context lengths of up to 1M tokens and confirming its scalability across various model sizes.
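The grouping step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: `predict_query` is a hypothetical stand-in for the generative query-prediction model (here it simply takes the first sentence of a document), `extract_keyword` is an illustrative heuristic that picks the longest non-stopword token, and token counts are approximated by whitespace-separated words. The sketch only shows the general idea of indexing documents by a keyword from their predicted query and concatenating documents that share a keyword, up to a context budget.

```python
import random
from collections import defaultdict

# Assumption: a tiny illustrative stopword list, not taken from the paper.
STOPWORDS = {"what", "why", "how", "is", "the", "a", "of", "to", "in", "for", "and"}


def predict_query(doc: str) -> str:
    """Hypothetical stand-in for a generative query-prediction model.

    Assumption: the first sentence of the document serves as a pseudo-query.
    """
    return doc.split(".")[0]


def extract_keyword(query: str) -> str:
    """Pick the longest non-stopword token of the query (illustrative heuristic)."""
    tokens = [t.strip("?,.").lower() for t in query.split()]
    candidates = [t for t in tokens if t and t not in STOPWORDS]
    return max(candidates, key=len) if candidates else ""


def build_long_contexts(docs, max_tokens, seed=0):
    """Group documents by query keyword, then pack each group into samples.

    Returns a list of (keyword, [documents]) pairs. Each sample holds at most
    max_tokens whitespace tokens, unless a single document alone exceeds it.
    """
    rng = random.Random(seed)

    # Index documents by the keyword of their predicted query.
    index = defaultdict(list)
    for doc in docs:
        index[extract_keyword(predict_query(doc))].append(doc)

    samples = []
    for keyword, group in index.items():
        rng.shuffle(group)  # vary document order within a keyword group
        current, length = [], 0
        for doc in group:
            n = len(doc.split())
            # Start a new sample when adding this document would exceed the budget.
            if current and length + n > max_tokens:
                samples.append((keyword, current))
                current, length = [], 0
            current.append(doc)
            length += n
        if current:
            samples.append((keyword, current))
    return samples
```

Calling `build_long_contexts(docs, max_tokens=32768)` on a list of strings would return keyword-labelled groups of documents to concatenate into training samples; in a real pipeline the stub functions would be replaced by an actual query-generation model and keyword extractor.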

Submission history

From: Chaochen Gao
[v1]
Thu, 30 May 2024 08:50:55 UTC (4,291 KB)
[v2]
Thu, 20 Jun 2024 02:26:48 UTC (4,596 KB)
[v3]
Sat, 14 Sep 2024 11:57:54 UTC (4,726 KB)
[v4]
Tue, 24 Sep 2024 09:06:21 UTC (4,736 KB)
[v5]
Wed, 9 Oct 2024 12:14:22 UTC (6,225 KB)
