Monday, January 12, 2026

[2412.17451] Diving into Self-Evolving Training for Multimodal Reasoning

by AiNEWS2025

in AI & Sentiment Analysis


[Submitted on 23 Dec 2024 (v1), last revised 6 Jun 2025 (this version, v3)]

Authors: Wei Liu and 5 other authors

Abstract: Self-evolving training, where models iteratively learn from their own outputs, has emerged as a key approach for complex reasoning tasks, addressing the scarcity of high-quality chain-of-thought data. However, its effectiveness in multimodal reasoning, a domain more intricate than text-only reasoning, remains underexplored, and the understanding of critical factors in this training paradigm remains limited. Furthermore, a central challenge for this training method is performance saturation, which impedes further improvements and scalability. In this paper, we reframe self-evolving training for multimodal reasoning through the lens of reinforcement learning (RL), identifying three pivotal factors: Training Method, Reward Model, and Prompt Variation. Through systematic analysis, we establish relatively optimal design principles that significantly enhance multimodal reasoning capabilities. Moreover, delving deeper into training dynamics, we uncover the roots of saturation and propose a new automatic balancing mechanism to mitigate this limitation. Building on these insights, we propose M-STAR (Multimodal Self-evolving Training for Reasoning), a framework that achieves consistent performance gains across models of varying sizes and diverse benchmarks. All resources are made publicly available at this https URL.
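
To make the idea of "learning from their own outputs" concrete, here is a minimal toy sketch of a generic self-evolving loop: sample responses, score them, keep the ones that pass, and update on the kept samples. It is not the M-STAR implementation and does not reproduce the paper's automatic balancing mechanism. Everything in it is an assumption made for illustration: the function name `self_evolve`, the dictionary-based "policy" standing in for a model, the exact-match check standing in for a reward model, and the interpolation update standing in for a training step.

```python
import random


def self_evolve(policy, dataset, rounds=3, samples_per_prompt=4, lr=0.5, seed=0):
    """Toy self-evolving loop (hypothetical, for illustration only).

    policy:  dict prompt -> dict answer -> probability (stand-in for a model)
    dataset: dict prompt -> reference answer (used only to score samples)
    """
    rng = random.Random(seed)
    for _ in range(rounds):
        kept = []  # (prompt, answer) pairs that pass the reward filter
        for prompt, reference in dataset.items():
            answers = list(policy[prompt])
            weights = [policy[prompt][a] for a in answers]
            # Sample several candidate responses from the current policy.
            for sampled in rng.choices(answers, weights=weights, k=samples_per_prompt):
                # Stand-in reward: exact match against the reference answer.
                reward = 1.0 if sampled == reference else 0.0
                if reward > 0:
                    kept.append((prompt, sampled))
        # Stand-in "training" step: move probability mass toward kept answers.
        for prompt, answer in kept:
            dist = policy[prompt]
            for a in dist:
                target = 1.0 if a == answer else 0.0
                dist[a] += lr * (target - dist[a])
    return policy


if __name__ == "__main__":
    toy_policy = {"2+3": {"5": 0.4, "6": 0.6}, "4*2": {"8": 0.5, "6": 0.5}}
    toy_data = {"2+3": "5", "4*2": "8"}
    print(self_evolve(toy_policy, toy_data))
```

In this toy, the update rule, the scoring function, and the set of prompts are the three places one would vary, which loosely mirrors the abstract's Training Method, Reward Model, and Prompt Variation. Because only rewarded samples are reinforced, a prompt whose correct answer is never sampled contributes nothing to the update.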

Submission history

From: Junlong Li
[v1] Mon, 23 Dec 2024 10:18:41 UTC (1,079 KB)
[v2] Tue, 3 Jun 2025 09:07:50 UTC (281 KB)
[v3] Fri, 6 Jun 2025 10:36:59 UTC (282 KB)