NVIDIA continues to push the AI envelope with its TensorRT-LLM software suite, driving its H200 GPUs to new heights in the latest MLPerf v4.0 inference results.
Generative AI, or GenAI, is an emerging market, and every hardware manufacturer is trying to grab its slice of the pie. Despite their best efforts, it's NVIDIA that has so far captured the bulk of the share, and there's no stopping the green giant, which has posted some remarkably strong benchmarks and records in the MLPerf v4.0 inference results.
NVIDIA has been fine-tuning TensorRT-LLM ever since the AI software suite was released last year. We saw a major performance increase with the previous MLPerf v3.1 results, and now with MLPerf v4.0, NVIDIA is supercharging Hopper's performance. Inference matters because it accounted for around 40% of data center revenue generated last year. Inference workloads range from LLMs (Large Language Models) to visual content and recommenders, and as these models grow in size, so do their complexity and the need for both strong hardware and software.
That's where TensorRT-LLM comes in: a state-of-the-art inference compiler co-designed with NVIDIA's GPU architectures. Some of TensorRT-LLM's features include:
Using the latest TensorRT-LLM optimizations, NVIDIA has managed to squeeze out up to 2.9x more performance from its Hopper GPUs (such as the H100) in MLPerf v4.0 versus MLPerf v3.1. With today's benchmark results, NVIDIA has set new performance records in MLPerf v4.0.
Read more on wccftech.com