...

[2310.20246] Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations


View a PDF of the paper titled Breaking Language Boundaries in Multilingual Mathematical Reasoning: Insights and Observations, by Nuo Chen and 5 different authors

View PDF
HTML (experimental)

Summary:Present analysis predominantly focuses on growing highly effective language studying fashions (LLMs) for mathematical reasoning inside monolingual languages, with few explorations in preserving efficacy in a multilingual context. To bridge this hole, this paper pioneers exploring and coaching highly effective Multilingual Math Reasoning (xMR) LLMs. Firstly, by using translation, we assemble the primary multilingual math reasoning instruction dataset, MGSM8KInstruct, encompassing ten distinct languages, thus addressing the difficulty of coaching knowledge shortage in xMR duties. Based mostly on the collected dataset, we suggest completely different coaching methods to construct highly effective xMR LLMs, named MathOctopus, notably outperform typical open-source LLMs and exhibit superiority over ChatGPT in few-shot eventualities. Notably, MathOctopus-13B reaches 47.6% accuracy which exceeds ChatGPT 46.3% on MGSM testset. Past exceptional outcomes, we unearth a number of pivotal observations and insights from intensive experiments: (1) When extending the rejection sampling technique to the multilingual context, it proves efficient for mannequin performances, albeit restricted. (2) Using parallel corpora for math Supervised Nice-Tuning (SFT) throughout a number of languages not solely considerably enhances mannequin efficiency multilingually but in addition elevates their monolingual efficiency. This means that crafting multilingual corpora may be thought to be a significant technique for enhancing mannequin efficiency in a particular language, particularly in mathematical reasoning duties. For example, MathOctopus-7B improves its counterparts that skilled on English from 42.2% to 50.8% on GSM8K testset. Codes can be found at this https URL.

Submission historical past

From: Nuo Chen [view email]
[v1]
Tue, 31 Oct 2023 08:09:20 UTC (512 KB)
[v2]
Wed, 1 Nov 2023 06:56:14 UTC (739 KB)
[v3]
Tue, 7 Nov 2023 12:13:02 UTC (739 KB)
[v4]
Tue, 28 Nov 2023 05:25:14 UTC (739 KB)
[v5]
Wed, 16 Oct 2024 04:26:04 UTC (8,560 KB)

Source link

#Breaking #Language #Boundaries #Multilingual #Mathematical #Reasoning #Insights #Observations


Unlock the potential of cutting-edge AI options with our complete choices. As a number one supplier within the AI panorama, we harness the facility of synthetic intelligence to revolutionize industries. From machine studying and knowledge analytics to pure language processing and pc imaginative and prescient, our AI options are designed to reinforce effectivity and drive innovation. Discover the limitless prospects of AI-driven insights and automation that propel what you are promoting ahead. With a dedication to staying on the forefront of the quickly evolving AI market, we ship tailor-made options that meet your particular wants. Be part of us on the forefront of technological development, and let AI redefine the way in which you use and reach a aggressive panorama. Embrace the long run with AI excellence, the place prospects are limitless, and competitors is surpassed.