NVIDIA has announced that TensorRT-LLM is coming to Windows soon and will bring a huge AI boost to PCs running RTX GPUs.
Back in September, NVIDIA announced TensorRT-LLM for data centers, where it offered up to an 8x gain on the industry's top AI GPUs such as the Hopper H100 and the Ampere A100. Taking full advantage of the Tensor Core acceleration featured on NVIDIA's GeForce RTX & RTX Pro GPUs, the library will deliver up to 4x faster performance in LLM inference workloads.
As we explained earlier, one of the biggest updates TensorRT-LLM brings is a new scheduler known as in-flight batching, which allows work to enter and exit the GPU independently of other tasks. This lets the GPU dynamically process several smaller queries alongside large, compute-intensive requests. TensorRT-LLM also makes use of optimized open-source models, which allow for higher speedups as batch sizes are increased. Starting today, these optimized open-source models are available for the public to download at developer.nvidia.com.
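To see why in-flight batching helps, consider a toy simulation of the idea (sometimes called continuous batching). The sketch below is a simplified illustration of the scheduling concept only; the names, structure, and step model are assumptions for demonstration and are not the actual TensorRT-LLM API.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Request:
    """A hypothetical LLM request, reduced to a name and a number of
    remaining decode steps (tokens still to generate)."""
    name: str
    tokens_left: int


def run_inflight_batching(pending, max_batch=4):
    """Simulate in-flight (continuous) batching: at every decode step,
    finished requests leave the batch and waiting requests join it,
    instead of the whole batch waiting for its slowest member
    (as in static batching)."""
    queue = deque(pending)
    active = []
    timeline = []  # which requests ran at each decode step
    while queue or active:
        # Admit new work as soon as a slot frees up.
        while queue and len(active) < max_batch:
            active.append(queue.popleft())
        timeline.append([r.name for r in active])
        for r in active:
            r.tokens_left -= 1  # one decode step for every active request
        # Finished requests exit immediately, freeing their slot.
        active = [r for r in active if r.tokens_left > 0]
    return timeline


# Example: one long request (B) and several short ones, batch size 2.
# Short queries slot in and out around B instead of waiting for it.
steps = run_inflight_batching(
    [Request("A", 1), Request("B", 5), Request("C", 1),
     Request("D", 1), Request("E", 2)],
    max_batch=2,
)
print(len(steps))  # total decode steps with in-flight batching
```

In this toy example the work finishes in 5 steps, whereas static batching with the same batch size of 2 would need 8 (each batch runs as long as its slowest request: 5 + 1 + 2). That gap is the kind of utilization gain the new scheduler targets.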
The added AI acceleration from TensorRT-LLM will help drive everyday productivity tasks such as chat, summarizing documents and web content, and drafting emails and blog posts, and it can also be used to analyze data and generate large amounts of content from what is available to the model.
So how will TensorRT-LLM help consumer PCs running Windows? In a demo, NVIDIA compared an open-source pre-trained LLM such as LLaMa 2 against the same model accelerated with TensorRT-LLM. When a query is passed to LLaMa 2 alone, the model draws only on the large, generalized dataset it was trained on (sources such as Wikipedia), so it lacks up-to-date information from after its training cutoff.
Read more on wccftech.com