This latest iteration of NVIDIA Ada Lovelace architecture-based GPUs delivers up to 52 shader TFLOPS, 121 RT TFLOPS and 836 AI TOPS to supercharge gaming and creating — and provide the power to develop new entertainment worlds and experiences. The GeForce RTX 4070 SUPER starts from $599.
PC gamers demand the very best in visual quality, and AI-powered NVIDIA Deep Learning Super Sampling (DLSS) Super Resolution, Frame Generation and Ray Reconstruction combine with ray tracing to offer stunning worlds — just a click away in titles such as Diablo IV, Pax Dei and Horizon Forbidden West. With DLSS, seven out of eight pixels can be AI-generated, accelerating full ray tracing by up to 4x with better image quality.
“For everyone from gaming enthusiasts to creative professionals, GeForce RTX SUPER GPUs are simply awesome upgrades,” said Matt Wuebbling, vice president of global GeForce marketing at NVIDIA. “GeForce RTX SUPER cards support over 500 RTX games and applications and will have users prepared for the wave of generative AI apps coming to PC.”
Other upcoming RTX titles are Enshrouded, Dragon's Dogma II, GrayZone Warfare, Layers Of Fear, Starminer, Tekken 8, Throne Of Liberty, Like A Dragon Gaiden: The Man Who Erased His Name, Like A Dragon: Infinite Wealth, Nakwon: Last Paradise and Half-Life 2 RTX (Remix Project).
Half-Life 2 RTX: An RTX Remix Project is being developed by four of Half-Life 2’s top mod teams, now working together under the banner of Orbifold Studios. Using the latest version of RTX Remix, the modders are rebuilding materials with PBR properties, adding extra geometric detail via Valve’s Hammer editor, and leveraging NVIDIA technologies including full ray tracing, DLSS 3.5, Reflex, and RTX IO to deliver a fantastic experience for GeForce RTX gamers.
As with the Portal projects, almost every asset is being reconstructed in high fidelity, and full ray tracing is being leveraged to bring cutting-edge graphics to Half-Life 2. In Ravenholm, average world textures have 8X the pixels, and are brought to life with Parallax Occlusion Mapping (POM) and PBR.
Monstrous creatures like the zombies feature almost 30X the geometric detail in Half-Life 2 RTX, going from 4,200 triangles to a staggering 75,590 triangles. Father Grigori, similarly, is now composed of 68,341 polygons. Weapons have been updated, with the Gravity Gun featuring 7X the textures, and 70X the polygonal detail. Now the materials of your weapon, the glass, metals and plastics, react to the world around you, catching light, shadows and color as you move.
Reload animations have been updated, and each time you fire your weapon, the muzzle flashes illuminate the darkest rooms. Orbifold Studios has even used Valve’s Hammer editor to rebuild the particles and explosions in Half-Life 2 to modern standards, which combined with full ray tracing, means fire glows and swells, and explosions cause smoke to propagate through light, imbuing clouds with beams of color.
Using RTX Remix, the team has added realistic lighting and shadowing to each part of Ravenholm, greatly enhancing the moody, dimly lit streets, abandoned buildings, and ingenious traps set up by Father Grigori.
To further enhance image quality, Half-Life 2 RTX features NVIDIA DLSS 3.5 with Ray Reconstruction. Ray Reconstruction replaces hand-tuned ray tracing denoisers with a new unified AI model, elevating the image quality of ray-traced effects and full ray tracing to new heights, further enhancing detail and realism.
In the above trailer, DLSS 3.5 allows foliage to look more detailed, and shimmer less. It also makes fire, flashes of light, and shadows render more responsively. With DLSS 3.5, when you move through Ravenholm with your flashlight, every shadow you create will dynamically appear in the world and update in real-time with your movements.
An AI-Powered Leap in PC Computing
The new GeForce RTX SUPER GPUs are the ultimate way to experience AI on PCs. Specialized AI Tensor Cores deliver up to 836 AI TOPS to deliver transformative capabilities for AI in gaming, creating and everyday productivity. The rich software stack built on top of RTX GPUs further accelerates AI.
NVIDIA TensorRT is software for high-performance deep learning inference, which includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications. TensorRT-LLM for Windows is an open-source library that accelerates inference performance for the latest large language models. In AI workloads, the GeForce RTX 4080 SUPER generates video over 1.5x faster and images over 1.7x faster than the RTX 3080 Ti.
For games, AI-powered DLSS provides greater in-game immersion. Meanwhile, generative AI applications like Adobe Photoshop take advantage of Tensor Cores to speed productivity and keep creative workflows moving. And for productivity, NVIDIA Broadcast can remove background noise and provide seamless virtual backgrounds.
With GeForce RTX SUPER GPUs, users can unlock the full potential of AI on Windows PCs.
A 4K Monster: The GeForce RTX 4080 SUPER
The GeForce RTX 4080 SUPER powers fully ray-traced games in 4K resolution. At 1.4x faster than the GeForce RTX 3080 Ti without DLSS Frame Generation, the RTX 4080 SUPER delivers blistering performance with traditional rasterization. With 836 AI TOPS, DLSS Frame Generation delivers an extra performance boost, making the RTX 4080 SUPER twice as fast as the RTX 3080 Ti. The RTX 4080 SUPER features more cores and faster memory for a performance edge. It will be available starting Jan. 31 from $999.
Precision Gaming: The GeForce RTX 4070 Ti SUPER
The RTX 4070 Ti SUPER is the ideal GPU for maxing out games at super-high frame rates at 1440p, and up to 4K. Compared to the RTX 4070 Ti, it has more cores, an increased frame buffer to 16GB, and a 256-bit memory bus, providing a significant memory bandwidth increase to 672 GB/sec. It is 1.6x faster than a RTX 3070 Ti and 2.5x with DLSS 3. The GeForce RTX 4070 Ti SUPER will be available starting Jan. 24 at $799.
Perfectly Balanced: The GeForce RTX 4070 SUPER
The RTX 4070 SUPER arrives with 20% more cores than the RTX 4070, making it faster than an RTX 3090 at a fraction of the power. With DLSS 3, its lead stretches to 1.5x faster. It will be available starting Jan. 17 at $599.
Where to Buy
For the GeForce RTX 4080 SUPER and 4070 SUPER, an NVIDIA Founders Edition Design will be available direct from NVIDIA.com and select retailers. Custom boards, including stock-clocked and factory-overclocked models for all GeForce RTX 40 SUPER Series GPUs, will be available from top add-in card providers such as ASUS, Colorful, Gainward, GALAX, GIGABYTE, INNO3D, KFA2, MSI, Palit, PNY and ZOTAC.
NVIDIA and Developers Pioneer Lifelike Digital Characters for Games and Applications With NVIDIA ACE (Avatar Cloud Engine)
NVIDIA today also introduced production microservices for the NVIDIA Avatar Cloud Engine (ACE) that allow developers of games, tools and middleware to integrate state-of-the-art generative AI models into the digital avatars in their games and applications.
The new ACE microservices let developers build interactive avatars using AI models such as NVIDIA Audio2Face (A2F), which creates expressive facial animations from audio sources, and NVIDIA Riva Automatic Speech Recognition (ASR), for building customizable multilingual speech and translation applications using generative AI .
Developers embracing ACE include Charisma.AI, Convai, Inworld, miHoYo, NetEase Games, Ourpalm, Tencent, Ubisoft and UneeQ.
“Generative AI technologies are transforming virtually everything we do, and that also includes game creation and gameplay,” said Keita Iida, vice president of developer relations at NVIDIA. “NVIDIA ACE opens up new possibilities for game developers by populating their worlds with lifelike digital characters while removing the need for pre-scripted dialogue, delivering greater in-game immersion.”
Top Game and Interactive Avatar Developers Embrace NVIDIA ACE
Top game and interactive avatar developers are pioneering ways ACE and generative AI technologies can be used to transform interactions between players and non-playable characters (NPCs) in games and applications. “This is a milestone moment for AI in games,” said Tencent Games. “NVIDIA ACE and Tencent Games will help lay the foundation that will bring digital avatars with individual, lifelike personalities and interactions to video games.”
NVIDIA ACE Brings Game Characters to Life
NPCs have historically been designed with predetermined responses and facial animations. This limited player interactions, which tended to be transactional, short-lived, and as a result, skipped by a majority of players.
“Generative AI-powered characters in virtual worlds unlock various use cases and experiences that were previously impossible,” said Purnendu Mukherjee, founder and CEO at Convai. “Convai is leveraging Riva ASR and A2F to enable lifelike NPCs with low-latency response times and high-fidelity natural animation.”
To showcase how ACE can transform NPC interactions, NVIDIA worked with Convai to expand the NVIDIA Kairos demo, which debuted at Computex, with a number of new features and inclusion of ACE microservices.
In the latest version of Kairos, Riva ASR and A2F are used extensively, improving NPC interactivity. Convai’s new framework now allows NPCs to converse among themselves and gives them awareness of objects, enabling them to pick up and deliver items to desired areas. Furthermore, NPCs gain the ability to lead players to objectives and traverse worlds.
The Audio2Face and Riva Automatic Speech Recognition microservices are available now. Interactive avatar developers can incorporate the models individually into their development pipelines.
NVIDIA Brings Generative AI to Millions, With Tensor Core GPUs, LLMs, Tools for RTX PCs and Workstations
NVIDIA announced GeForce RTX SUPER desktop GPUs for supercharged generative AI performance, new AI laptops from every top manufacturer, and new NVIDIA RTX-accelerated AI software and tools for both developers and consumers.
Building on decades of PC leadership, with over 100 million of its RTX GPUs driving the AI PC era, NVIDIA is now offering these tools to enhance PC experiences with generative AI: NVIDIA TensorRT acceleration of the popular Stable Diffusion XL model for text-toimage workflows, NVIDIA RTX Remix with generative AI texture tools, NVIDIA ACE microservices and more games that use DLSS 3 technology with Frame Generation.
In addition, NVIDIA TensorRT-LLM (TRT-LLM), an open-source library that accelerates and optimizes inference performance of the latest large language models (LLMs), now supports more pre-optimized models for PCs. Accelerated by TRT-LLM, Chat with RTX, an NVIDIA tech demo also releasing this month, allows AI enthusiasts to interact with their notes, documents and other content.
“Generative AI is the single most significant platform transition in computing history and will transform every industry, including gaming,” said Jensen Huang, founder and CEO of NVIDIA. “With over 100 million RTX AI PCs and workstations, NVIDIA is a massive install base for developers and gamers to enjoy the magic of generative AI.”
Running generative AI locally on a PC is critical for privacy, latency and cost-sensitive applications. It requires a large install base of AI-ready systems, as well as the right developer tools to tune and optimize AI models for the PC platform.
To meet these needs, NVIDIA is delivering innovations across its full technology stack, driving new experiences and building on the 500+ AI-enabled PC applications and games already accelerated by NVIDIA RTX technology.
RTX AI PCs and Workstations
NVIDIA RTX GPUs — capable of running a broad range of applications at the highest performance — unlock the full potential of generative AI on PCs. Tensor Cores in these GPUs dramatically speed AI performance across the most demanding applications for work and play.
The new GeForce RTX 40 SUPER Series graphics cards, also announced today at CES, include the GeForce RTX 4080 SUPER, 4070 Ti SUPER and 4070 SUPER for top AI performance. The GeForce RTX 4080 SUPER generates AI video 1.5x faster — and images 1.7x faster — than the GeForce RTX 3080 Ti GPU. The Tensor Cores in SUPER GPUs deliver up to 836 trillion operations per second, bringing transformative AI capabilities to gaming, creating and everyday productivity.
Leading manufacturers — including Acer, ASUS, Dell, HP, Lenovo, MSI, Razer and Samsung — are releasing a new wave of RTX AI laptops, bringing a full set of generative AI capabilities to users right out of the box. The new systems, which deliver a performance increase ranging from 20x-60x compared with using neural processing units, will start shipping this month.
Mobile workstations with RTX GPUs can run NVIDIA AI Enterprise software, including TensorRT and NVIDIA RAPIDS for simplified, secure generative AI and data science development. A three-year license for NVIDIA AI Enterprise is included with every NVIDIA A800 40GB Active GPU, providing an ideal workstation development platform for AI and data science.
New PC Developer Tools for Building AI Models
To help developers quickly create, test and customize pretrained generative AI models and LLMs using PC-class performance and memory footprint, NVIDIA recently announced NVIDIA AI Workbench, a unified, easy-to-use toolkit.
AI Workbench, which will be released in beta later this month, offers streamlined access to popular repositories like Hugging Face, GitHub and NVIDIA NGC, along with a simplified user interface that enables developers to easily reproduce, collaborate on and migrate projects.
Projects can be scaled out to virtually anywhere — whether the data center, a public cloud or NVIDIA DGX Cloud — and then brought back to local RTX systems on a PC or workstation for inference and light customization.
In collaboration with HP, NVIDIA is also simplifying AI model development by integrating NVIDIA AI Foundation Models and Endpoints, which include RTX-accelerated AI models and software development kits, into the HP AI Studio, a centralized platform for data science. This will allow users to easily search, import and deploy optimized models across PCs and the cloud.
After building AI models for PC use cases, developers can optimize them using NVIDIA TensorRT to take full advantage of RTX GPUs’ Tensor Cores.
NVIDIA recently extended TensorRT to text-based applications with TensorRT-LLM for Windows, an open-source library for accelerating LLMs. The latest update to TensorRT-LLM now available, adds Phi-2 to the growing list of pre-optimized models for PC, which run up to 5x faster compared to other inference backends.
RTX-Accelerated Generative AI Powers New PC Experiences
At CES, NVIDIA and its developer partners are releasing new generative AI-powered applications and services for PCs, including:
- NVIDIA RTX Remix, a platform for creating stunning RTX remasters of classic games. Releasing in open beta later this month (January 22, 2024), it delivers generative AI tools that can transform basic textures from classic games into modern, 4K-resolution, physically based rendering materials.
- NVIDIA ACE microservices, including generative AI-powered speech and animation models, which enable developers to add intelligent, dynamic digital avatars to games.
- TensorRT acceleration for Stable Diffusion XL (SDXL) Turbo and latent consistency models, two of the most popular Stable Diffusion acceleration methods. TensorRT improves performance for both by up to 60% compared with the previous fastest implementation. An updated version of the Stable Diffusion WebUI TensorRT extension is also now available, including acceleration for SDXL, SDXL Turbo, LCM - Low-Rank Adaptation (LoRA) and improved LoRA support.
- NVIDIA DLSS 3 with Frame Generation, which uses AI to increase frame rates up to 4x compared with native rendering, will be featured in a dozen of the 14 new RTX games announced, including Horizon Forbidden West, Pax Dei and Dragon’s Dogma 2.
- Chat with RTX, an NVIDIA tech demo available later this month, allows AI enthusiasts to easily connect PC LLMs to their own data using a popular technique known as retrieval-augmented generation (RAG). The demo, accelerated by TensorRT-LLM, enables users to quickly interact with their notes, documents and other content. It will also be available as an open-source reference project, so developers can easily implement the same capabilities into their own applications.