MiniLLM: Knowledge Distillation of Large Language Models, by Yuxian Gu and 3 other authors
Abstract: Knowledge Distillation (KD) is a promising technique for reducing the high computational demand of large language models (LLMs). However, previous KD methods have mainly been applied to white-box classification models or to training small models to imitate black-box model APIs such as ChatGPT. How to effectively distill the knowledge of white-box LLMs into small models remains under-explored, and it has become more important with the prosperity of open-source LLMs. In this work, we propose a KD approach that distills LLMs into smaller language models. We first replace the forward Kullback-Leibler divergence (KLD) objective used in standard KD approaches with reverse KLD, which is more suitable for KD on generative language models, to prevent the student model from overestimating the low-probability regions of the teacher distribution. We then derive an effective on-policy optimization approach to learn this objective. The resulting student models are named MiniLLM. Extensive experiments in the instruction-following setting show that MiniLLM generates more precise responses with higher overall quality, lower exposure bias, better calibration, and better long-text generation performance than the baselines. Our method is scalable across model families with 120M to 13B parameters. Our code, data, and model checkpoints can be found at this https URL.
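As a rough illustration of the distinction the abstract draws, and not the paper's full on-policy optimization procedure, the sketch below contrasts a token-level forward KLD loss (standard KD) with a reverse KLD loss between teacher and student logits. The function names, tensor shapes, and mean reduction are illustrative assumptions, not taken from the paper's released code.

```python
import torch
import torch.nn.functional as F

def forward_kld_loss(student_logits, teacher_logits):
    """Standard KD objective: KL(p_teacher || q_student).

    The expectation is taken under the teacher, so the student is pushed
    to cover all regions where the teacher places probability mass.
    Logits have shape (batch, seq_len, vocab_size).
    """
    student_log_probs = F.log_softmax(student_logits, dim=-1)
    teacher_log_probs = F.log_softmax(teacher_logits, dim=-1)
    teacher_probs = teacher_log_probs.exp()
    kl = (teacher_probs * (teacher_log_probs - student_log_probs)).sum(dim=-1)
    return kl.mean()

def reverse_kld_loss(student_logits, teacher_logits):
    """Reverse KLD objective: KL(q_student || p_teacher).

    The expectation is taken under the student, so the student is penalized
    for placing mass where the teacher assigns low probability, discouraging
    overestimation of the teacher's low-probability regions.
    """
    student_log_probs = F.log_softmax(student_logits, dim=-1)
    teacher_log_probs = F.log_softmax(teacher_logits, dim=-1)
    student_probs = student_log_probs.exp()
    kl = (student_probs * (student_log_probs - teacher_log_probs)).sum(dim=-1)
    return kl.mean()
```

In the paper, the reverse KLD objective is optimized with an on-policy approach over sequences sampled from the student rather than by directly backpropagating through per-token distributions as above; this sketch only shows the direction of the divergence.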
Submission history
From: Yuxian Gu
[v1] Wed, 14 Jun 2023 14:44:03 UTC (143 KB)
[v2] Wed, 28 Feb 2024 14:48:19 UTC (309 KB)
[v3] Tue, 12 Mar 2024 16:15:19 UTC (309 KB)
[v4] Wed, 10 Apr 2024 02:30:19 UTC (318 KB)
[v5] Fri, 21 Nov 2025 10:20:53 UTC (211 KB)