NVIDIA has showcased impressive numbers for its GeForce RTX 40 GPUs, including the flagship RTX 4090, running AI models such as Llama & Mistral.
NVIDIA's TensorRT-LLM acceleration for Windows has brought some spectacular performance uplifts to the Windows PC platform. We have already seen impressive gains and new features added to NVIDIA's RTX "AI PC" feature set, and things are getting even better, with the company now showcasing some huge performance figures for its flagship GeForce RTX 4090 GPU.
In a new AI Decoded blog, NVIDIA has shared how its existing GPU lineup outpaces the entire NPU ecosystem, which has only managed to reach 50 TOPS in 2024. NVIDIA's RTX AI GPUs, by contrast, start at several hundred TOPS and go all the way up to 1,321 TOPS on the GeForce RTX 4090, making it the fastest desktop AI solution for running LLMs and more. It's also the fastest gaming graphics card on the planet.
NVIDIA's GeForce RTX GPUs offer up to 24 GB of VRAM, while its professional NVIDIA RTX GPUs offer up to 48 GB of VRAM, making them quite the beasts when it comes to handling LLMs (Large Language Models), as these workloads love large amounts of video memory. NVIDIA's RTX hardware comes not only with dedicated video memory but also with AI-specific acceleration through Tensor Cores (hardware) and the aforementioned TensorRT-LLM (software).
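To see why VRAM capacity matters so much for LLMs, here is a minimal back-of-the-envelope sketch of how much video memory a model's weights roughly require. The function name, the ~20% overhead factor, and the example model sizes are illustrative assumptions, not figures from NVIDIA:

```python
def estimate_llm_vram_gb(num_params_billion: float,
                         bytes_per_weight: float,
                         overhead: float = 1.2) -> float:
    """Rough VRAM estimate: weight storage plus ~20% (assumed)
    headroom for KV cache and activations."""
    # 1 billion params at 1 byte each is ~1 GB of weights
    weights_gb = num_params_billion * bytes_per_weight
    return weights_gb * overhead

# A 7B-parameter model at FP16 (2 bytes per weight) lands around 16-17 GB,
# which fits in a 24 GB GeForce RTX card; at 4-bit quantization
# (~0.5 bytes per weight) the same model drops to roughly 4 GB.
print(round(estimate_llm_vram_gb(7, 2.0), 1))
print(round(estimate_llm_vram_gb(7, 0.5), 1))
```

This is why larger VRAM pools (24 GB on GeForce RTX, 48 GB on professional RTX) directly translate into being able to run bigger models, or run the same model at higher precision.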
The rate of token generation across all batch sizes on NVIDIA's GeForce RTX 4090 GPUs is already high, and it improves significantly, by over 4x, when TensorRT-LLM acceleration is enabled.
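Throughput comparisons like this are typically expressed in tokens per second. A small sketch of that arithmetic, using made-up numbers purely for illustration (not NVIDIA's measurements):

```python
def tokens_per_second(num_tokens: int, elapsed_s: float) -> float:
    """Standard LLM throughput metric: generated tokens / wall-clock time."""
    return num_tokens / elapsed_s

def speedup(baseline_tps: float, accelerated_tps: float) -> float:
    """How many times faster the accelerated run is."""
    return accelerated_tps / baseline_tps

# Hypothetical example values, assumed for illustration only:
base = tokens_per_second(400, 10.0)   # 40 tok/s without TensorRT-LLM
fast = tokens_per_second(1700, 10.0)  # 170 tok/s with TensorRT-LLM
print(speedup(base, fast))            # 4.25, i.e. an over-4x uplift
```

The "over 4x" figure in the article is this ratio: accelerated tokens/sec divided by baseline tokens/sec, measured at the same batch size.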
NVIDIA is now sharing some new benchmarks using the open-source Jan.ai platform, which has recently integrated TensorRT-LLM into its local chatbot app. The chatbot runs AI models locally on the user's machine.
Read more on wccftech.com