Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level, by Zhaopeng Feng and 4 other authors
Abstract: General-purpose Large Language Models (LLMs) like GPT-4 have achieved remarkable advancements in machine translation (MT) by leveraging extensive web content. On the other hand, translation-specific LLMs are built by pre-training on domain-specific monolingual corpora and fine-tuning with human-annotated translation data. Despite their superior performance, these methods demand either an unprecedented scale of computing and data or substantial human editing and annotation effort. In this paper, we develop MT-Ladder, a novel model-agnostic and cost-effective tool to refine the performance of general LLMs for MT. MT-Ladder is trained on pseudo-refinement triplets, which can be easily obtained from existing LLMs without additional human cost. During training, we propose a hierarchical fine-tuning strategy with an easy-to-hard schema, improving MT-Ladder’s refining performance progressively. The trained MT-Ladder can be seamlessly integrated with any general-purpose LLM to boost its translation performance. Using Gemma-2B/7B as the backbone, MT-Ladder-2B can elevate raw translations to the level of top-tier open-source models (e.g., refining BigTranslate-13B with +6.91 BLEU and +3.52 COMET for XX-En), and MT-Ladder-7B can further enhance model performance to be on par with the state-of-the-art GPT-4. Extensive ablation and analysis corroborate the effectiveness of MT-Ladder in various settings. Our code is available at this https URL
Submission history
From: Zhaopeng Feng
[v1]
Sat, 22 Jun 2024 05:33:35 UTC (1,495 KB)
[v2]
Fri, 9 Aug 2024 08:06:39 UTC (2,842 KB)
[v3]
Tue, 29 Oct 2024 05:15:09 UTC (2,885 KB)