• About
  • Advertise
  • Privacy & Policy
  • Contact
Thursday, January 1, 2026
  • Login
  • Home
    • Home – Layout 1
    • Home – Layout 2
    • Home – Layout 3
    • Home – Layout 4
    • Home – Layout 5
    • Home – Layout 6
  • News
    • All
    • Business
    • Politics
    • Science
    • World
    Hillary Clinton in white pantsuit for Trump inauguration

    Hillary Clinton in white pantsuit for Trump inauguration

    Amazon has 143 billion reasons to keep adding more perks to Prime

    Amazon has 143 billion reasons to keep adding more perks to Prime

    Shooting More than 40 Years of New York’s Halloween Parade

    Shooting More than 40 Years of New York’s Halloween Parade

    These Are the 5 Big Tech Stories to Watch in 2017

    These Are the 5 Big Tech Stories to Watch in 2017

    Why Millennials Need to Save Twice as Much as Boomers Did

    Why Millennials Need to Save Twice as Much as Boomers Did

    Doctors take inspiration from online dating to build organ transplant AI

    Doctors take inspiration from online dating to build organ transplant AI

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Tech
    • All
    • Apps
    • Gadget
    • Mobile
    • Startup
    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    Shadow Tactics: Blades of the Shogun Review

    Shadow Tactics: Blades of the Shogun Review

    macOS Sierra review: Mac users get a modest update this year

    macOS Sierra review: Mac users get a modest update this year

    Hands on: Samsung Galaxy A5 2017 review

    Hands on: Samsung Galaxy A5 2017 review

    The Last Guardian Playstation 4 Game review

    The Last Guardian Playstation 4 Game review

    These Are the 5 Big Tech Stories to Watch in 2017

    These Are the 5 Big Tech Stories to Watch in 2017

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
  • Entertainment
    • All
    • Gaming
    • Movie
    • Music
    • Sports
    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    macOS Sierra review: Mac users get a modest update this year

    macOS Sierra review: Mac users get a modest update this year

    Hands on: Samsung Galaxy A5 2017 review

    Hands on: Samsung Galaxy A5 2017 review

    Heroes of the Storm Global Championship 2017 starts tomorrow, here’s what you need to know

    Heroes of the Storm Global Championship 2017 starts tomorrow, here’s what you need to know

    Harnessing the power of VR with Power Rangers and Snapdragon 835

    Harnessing the power of VR with Power Rangers and Snapdragon 835

    So you want to be a startup investor? Here are things you should know

    So you want to be a startup investor? Here are things you should know

  • Lifestyle
    • All
    • Fashion
    • Food
    • Health
    • Travel
    Shooting More than 40 Years of New York’s Halloween Parade

    Shooting More than 40 Years of New York’s Halloween Parade

    Heroes of the Storm Global Championship 2017 starts tomorrow, here’s what you need to know

    Heroes of the Storm Global Championship 2017 starts tomorrow, here’s what you need to know

    Why Millennials Need to Save Twice as Much as Boomers Did

    Why Millennials Need to Save Twice as Much as Boomers Did

    Doctors take inspiration from online dating to build organ transplant AI

    Doctors take inspiration from online dating to build organ transplant AI

    How couples can solve lighting disagreements for good

    How couples can solve lighting disagreements for good

    Ducati launch: Lorenzo and Dovizioso’s Desmosedici

    Ducati launch: Lorenzo and Dovizioso’s Desmosedici

    Trending Tags

    • Golden Globes
    • Game of Thrones
    • MotoGP 2017
    • eSports
    • Fashion Week
  • Review
    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    Shadow Tactics: Blades of the Shogun Review

    Shadow Tactics: Blades of the Shogun Review

    macOS Sierra review: Mac users get a modest update this year

    macOS Sierra review: Mac users get a modest update this year

    Hands on: Samsung Galaxy A5 2017 review

    Hands on: Samsung Galaxy A5 2017 review

    The Last Guardian Playstation 4 Game review

    The Last Guardian Playstation 4 Game review

    Intel Core i7-7700K ‘Kaby Lake’ review

    Intel Core i7-7700K ‘Kaby Lake’ review

No Result
View All Result
Ai News
Advertisement
  • Home
    • Home – Layout 1
    • Home – Layout 2
    • Home – Layout 3
    • Home – Layout 4
    • Home – Layout 5
    • Home – Layout 6
  • News
    • All
    • Business
    • Politics
    • Science
    • World
    Hillary Clinton in white pantsuit for Trump inauguration

    Hillary Clinton in white pantsuit for Trump inauguration

    Amazon has 143 billion reasons to keep adding more perks to Prime

    Amazon has 143 billion reasons to keep adding more perks to Prime

    Shooting More than 40 Years of New York’s Halloween Parade

    Shooting More than 40 Years of New York’s Halloween Parade

    These Are the 5 Big Tech Stories to Watch in 2017

    These Are the 5 Big Tech Stories to Watch in 2017

    Why Millennials Need to Save Twice as Much as Boomers Did

    Why Millennials Need to Save Twice as Much as Boomers Did

    Doctors take inspiration from online dating to build organ transplant AI

    Doctors take inspiration from online dating to build organ transplant AI

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Tech
    • All
    • Apps
    • Gadget
    • Mobile
    • Startup
    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    Shadow Tactics: Blades of the Shogun Review

    Shadow Tactics: Blades of the Shogun Review

    macOS Sierra review: Mac users get a modest update this year

    macOS Sierra review: Mac users get a modest update this year

    Hands on: Samsung Galaxy A5 2017 review

    Hands on: Samsung Galaxy A5 2017 review

    The Last Guardian Playstation 4 Game review

    The Last Guardian Playstation 4 Game review

    These Are the 5 Big Tech Stories to Watch in 2017

    These Are the 5 Big Tech Stories to Watch in 2017

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
  • Entertainment
    • All
    • Gaming
    • Movie
    • Music
    • Sports
    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    macOS Sierra review: Mac users get a modest update this year

    macOS Sierra review: Mac users get a modest update this year

    Hands on: Samsung Galaxy A5 2017 review

    Hands on: Samsung Galaxy A5 2017 review

    Heroes of the Storm Global Championship 2017 starts tomorrow, here’s what you need to know

    Heroes of the Storm Global Championship 2017 starts tomorrow, here’s what you need to know

    Harnessing the power of VR with Power Rangers and Snapdragon 835

    Harnessing the power of VR with Power Rangers and Snapdragon 835

    So you want to be a startup investor? Here are things you should know

    So you want to be a startup investor? Here are things you should know

  • Lifestyle
    • All
    • Fashion
    • Food
    • Health
    • Travel
    Shooting More than 40 Years of New York’s Halloween Parade

    Shooting More than 40 Years of New York’s Halloween Parade

    Heroes of the Storm Global Championship 2017 starts tomorrow, here’s what you need to know

    Heroes of the Storm Global Championship 2017 starts tomorrow, here’s what you need to know

    Why Millennials Need to Save Twice as Much as Boomers Did

    Why Millennials Need to Save Twice as Much as Boomers Did

    Doctors take inspiration from online dating to build organ transplant AI

    Doctors take inspiration from online dating to build organ transplant AI

    How couples can solve lighting disagreements for good

    How couples can solve lighting disagreements for good

    Ducati launch: Lorenzo and Dovizioso’s Desmosedici

    Ducati launch: Lorenzo and Dovizioso’s Desmosedici

    Trending Tags

    • Golden Globes
    • Game of Thrones
    • MotoGP 2017
    • eSports
    • Fashion Week
  • Review
    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    The Legend of Zelda: Breath of the Wild gameplay on the Nintendo Switch

    Shadow Tactics: Blades of the Shogun Review

    Shadow Tactics: Blades of the Shogun Review

    macOS Sierra review: Mac users get a modest update this year

    macOS Sierra review: Mac users get a modest update this year

    Hands on: Samsung Galaxy A5 2017 review

    Hands on: Samsung Galaxy A5 2017 review

    The Last Guardian Playstation 4 Game review

    The Last Guardian Playstation 4 Game review

    Intel Core i7-7700K ‘Kaby Lake’ review

    Intel Core i7-7700K ‘Kaby Lake’ review

No Result
View All Result
Ai News
No Result
View All Result
Home AI & Cloud Computing

Huawei CloudMatrix AI performance beat Nvidia in internal tests

AiNEWS2025 by AiNEWS2025
2025-06-20
in AI & Cloud Computing
0
Huawei CloudMatrix AI performance beat Nvidia in internal tests
0
SHARES
0
VIEWS
Share on FacebookShare on Twitter


Huawei CloudMatrix AI performance has achieved what the company claims is a significant milestone, with internal testing showing its new data centre architecture outperforming Nvidia’s H800 graphics processing units in running DeepSeek’s advanced R1 artificial intelligence model, according to a comprehensivetechnical paperreleased this week by Huawei researchers.

The research, conducted by Huawei Technologies in collaboration with Chinese AI infrastructure startup SiliconFlow, provides what appears to be the first detailed public disclosure of performance metrics for CloudMatrix384.

However, it’s important to note that the benchmarks were conducted by Huawei on its systems, raising questions about independent verification of the claimed performance advantages over established industry standards.

The paper describes CloudMatrix384 as a “next-generation AI datacentre architecture that embodies Huawei’s vision for reshaping the foundation of AI infrastructure.” While the technical achievements outlined appear impressive, the lack of third-party validation means results should be viewed in the context of Huawei’s continuing efforts to demonstrate technological competitiveness outside of US sanctions.

The CloudMatrix384 architecture

CloudMatrix384 integrates 384 Ascend 910C NPUs and 192 Kunpeng CPUs in a supernode, connected by an ultra-high-bandwidth, low-latency Unified Bus (UB).

Unlike traditional hierarchical designs, a peer-to-peer architecture enables what Huawei calls “direct all-to-all communication,” allowing compute, memory, and network resources to be pooled dynamically and scaled independently.

The system’s design addresses notable challenges in creating modern AI infrastructure, particularly for mixture-of-experts (MoE) architectures and distributed key-value cache access, considered essential for large language model operations.

Performance claims: The numbers in context

The Huawei CloudMatrix AI performance results, while conducted internally, present impressive metrics on the system’s capabilities. To understand the numbers, it’s helpful to think of AI processing like a conversation: the “prefill” phase is when an AI reads and ‘understands’ a question, while the “decode” phase is when it generates its response, word by word.

According to the company’s testing, CloudMatrix-Infer achieves a prefill throughput of 6,688 tokens per second per processing unit, and 1,943 tokens per second when generating a response.

Think of tokens as individual pieces of text – roughly equivalent to words or parts of words that the AI processes. For context, this means the system can process thousands of words per second on each chip.

The “TPOT” measurement (time-per-output-token) of under 50 milliseconds means the system generates each word in its response in less than a twentieth of a second – creating remarkably fast response times.

More significantly, Huawei’s results correspond to what it claims are superior efficiency ratings compared with competing systems. The company measures this through “compute efficiency” – essentially, how much useful work each chip accomplishes relative to its theoretical maximum processing power.

Huawei claims its system achieves 4.45 tokens per second per TFLOPS for reading questions and 1.29 tokens per second per TFLOPS for generating answers. In perspective, TFLOPS (trillion floating-point operations per second) measures raw computational power – akin to the horsepower rating of a car.

Huawei’s efficiency claims suggest its system does more useful AI work per unit of computational horsepower than Nvidia’s competing H100 and H800 processors.

The company reports maintaining 538 tokens per second under the stricter timing requirements of sub-15 milliseconds per word.

However, the impressive numbers lack independent verification from third-parties, standard practice for validating performance claims in the technology industry.

Technical innovations behind the claims

The reported Huawei CloudMatrix AI performance metrics stem from several technical details quoted in the research paper. The system implements what Huawei describes as a “peer-to-peer serving architecture” that disaggregates the inference workflow into three subsystems: prefill, decode, and caching, enabling each component to scale based on workload demands.

The paper posits three innovations: a peer-to-peer serving architecture with disaggregated resource pools, large-scale expert parallelism supporting up to EP320 configuration where each NPU die hosts one expert, and hardware-aware optimisations including optimised operators, microbatch-based pipelining, and INT8 quantisation.

Geopolitical context and strategic implications

The performance claims emerge against the backdrop of intensifying US-China tech tensions. Huawei founder Ren Zhengfei acknowledged recently that the company’s chips still lag behind US competitors “by a generation,” but said clustering methods can achieve comparable performance to the world’s most advanced systems.

Nvidia CEO Jensen Huang appeared to validate this during a recent CNBC interview, stating: “AI is a parallel problem, so if each one of the computers is not capable… just add more computers… in China, [where] they have plenty of energy, they’ll just use more chips.”

Lead researcher Zuo Pengfei, part of Huawei’s “Genius Youth” program, framed the research’s strategic importance, writing that the paper aims “to build confidence in the domestic technology ecosystem in using Chinese-developed NPUs to outperform Nvidia’s GPUs.”

Questions of verification and industry impact

Beyond the performance metrics, Huawei reports that INT8 quantisation maintains model accuracy comparable to the official DeepSeek-R1 API in 16 benchmarks in internal, unverified tests.

The AI and technology industries will likely await independent verification of Huawei’s CloudMatrix AI performance before drawing definitive conclusions.

Nevertheless, the technical approaches described suggest genuine innovation in AI infrastructure design, offering insights for the industry, regardless of the specific performance numbers.

Huawei’s claims – whether validated or not – highlight the intensity of competition in AI hardware and the varying approaches companies take to achieve computational efficiency.

(Photo by Shutterstock )

See also: From cloud to collaboration: Huawei maps out AI future in APAC

Want to learn more about cybersecurity and the cloud from industry leaders? Check out Cyber Security & Cloud Expo taking place in Amsterdam, California, and London.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.

Source link

#Huawei #CloudMatrix #performance #beat #Nvidia #internal #tests

Tags: cloudNVIDIA
Previous Post

X to offer trading services soon, claims CEO

Next Post

Xbox and Windows are no longer at arm’s length | Opinion

AiNEWS2025

AiNEWS2025

Next Post
Xbox and Windows are no longer at arm’s length | Opinion

Xbox and Windows are no longer at arm's length | Opinion

Stay Connected test

  • 23.9k Followers
  • 99 Subscribers
  • Trending
  • Comments
  • Latest
A tiny new open source AI model performs as well as powerful big ones

A tiny new open source AI model performs as well as powerful big ones

0
Water Cooler Small Talk: The Birthday Paradox 🎂🎉 | by Maria Mouschoutzi, PhD | Sep, 2024

Water Cooler Small Talk: The Birthday Paradox 🎂🎉 | by Maria Mouschoutzi, PhD | Sep, 2024

0
Ghost of Yōtei: The acclaimed Ghost of Tsushima is getting a sequel

Ghost of Yōtei: The acclaimed Ghost of Tsushima is getting a sequel

0
Best Headphones for Working Out (2024): Bose, Shokz, JLab

Best Headphones for Working Out (2024): Bose, Shokz, JLab

0
Deep Reinforcement Learning: The Actor-Critic Method

Deep Reinforcement Learning: The Actor-Critic Method

2026-01-01
“Streaming stops feeling infinite”: What subscribers can expect in 2026

“Streaming stops feeling infinite”: What subscribers can expect in 2026

2026-01-01
Net neutrality was back, until it wasn’t

Net neutrality was back, until it wasn’t

2026-01-01
Man Operating Robot Accidentally Makes It Kick Him Directly in the Nutsack

Man Operating Robot Accidentally Makes It Kick Him Directly in the Nutsack

2026-01-01

Recent News

Deep Reinforcement Learning: The Actor-Critic Method

Deep Reinforcement Learning: The Actor-Critic Method

2026-01-01
“Streaming stops feeling infinite”: What subscribers can expect in 2026

“Streaming stops feeling infinite”: What subscribers can expect in 2026

2026-01-01
Net neutrality was back, until it wasn’t

Net neutrality was back, until it wasn’t

2026-01-01
Man Operating Robot Accidentally Makes It Kick Him Directly in the Nutsack

Man Operating Robot Accidentally Makes It Kick Him Directly in the Nutsack

2026-01-01
Footer logo

We bring you the best Premium WordPress Themes that perfect for news, magazine, personal blog, etc. Check our landing page for details.

Follow Us

Browse by Category

  • AI & Cloud Computing
  • AI & Cybersecurity
  • AI & Sentiment Analysis
  • AI Applications
  • AI Ethics
  • AI Future Predictions
  • AI in Education
  • AI in Fintech
  • AI in Gaming
  • AI in Healthcare
  • AI in Startups
  • AI Innovations
  • AI News
  • AI Research
  • AI Tools & Automation
  • Apps
  • AR/VR & AI
  • Business
  • Deep Learning
  • Emerging Technologies
  • Entertainment
  • Fashion
  • Food
  • Gadget
  • Gaming
  • Health
  • Lifestyle
  • Machine Learning
  • Mobile
  • Movie
  • Music
  • News
  • Politics
  • Review
  • Robotics & Smart Systems
  • Science
  • Sports
  • Startup
  • Tech
  • Travel
  • World

Recent News

Deep Reinforcement Learning: The Actor-Critic Method

Deep Reinforcement Learning: The Actor-Critic Method

2026-01-01
“Streaming stops feeling infinite”: What subscribers can expect in 2026

“Streaming stops feeling infinite”: What subscribers can expect in 2026

2026-01-01
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2026 JNews - Premium WordPress news & magazine theme by Jegtheme.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result

© 2026 JNews - Premium WordPress news & magazine theme by Jegtheme.