The world’s first GPUs based on the new NVIDIA Ada Lovelace architecture, the RTX 40 Series delivers massive generational leaps in performance and efficiency, and represents a new era of real-time ray tracing and neural rendering, which uses AI to generate pixels.
“The age of RTX ray tracing and neural rendering is in full steam, and our new Ada Lovelace architecture takes it to the next level,” said Jensen Huang, NVIDIA’s founder and CEO, at the GeForce Beyond: Special Broadcast at GTC.
“Ada provides a quantum leap for gamers and paves the way for creators of fully simulated worlds. With up to 4x the performance of the previous generation, Ada is setting a new standard for the industry,” he said.
DLSS 3 Generates Entire Frames for Faster Game Play
Huang also announced NVIDIA DLSS 3 — the next revolution in the company’s Deep Learning Super Sampling neural-graphics technology for games and creative apps. The AI-powered technology can generate entire frames for massively faster game play. It can overcome CPU performance limitations in games by allowing the GPU to generate entire frames independently.
“DLSS is one of our best inventions and has made real-time ray tracing possible. DLSS 3 is another quantum leap for gamers and creators,” said Jensen Huang, founder and CEO of NVIDIA. “Our pioneering work in RTX neural rendering has opened a new universe of possibilities where AI plays a central role in the creation of virtual worlds.”
The combination of ray tracing and AI technologies has revolutionized video games by simultaneously delivering dramatic improvements in image quality along with massive uplifts in performance — a feat unheard of before GeForce RTX.
Additionally, the RTX 40 Series GPUs feature a range of new technological innovations, including:
- Streaming multiprocessors with up to 83 teraflops of shader power — 2x over the previous generation.
- Third-generation RT Cores with up to 191 effective ray-tracing teraflops — 2.8x over the previous generation.
- Fourth-generation Tensor Cores with up to 1.32 Tensor petaflops — 5x over the previous generation using FP8 acceleration.
- Shader Execution Reordering (SER) that improves execution efficiency by rescheduling shading workloads on the fly to better utilize the GPU’s resources. As significant an innovation as out-of-order execution was for CPUs, SER improves ray-tracing performance up to 3x and in-game frame rates by up to 25%.
- Ada Optical Flow Accelerator with 2x faster performance allows DLSS 3 to predict movement in a scene, enabling the neural network to boost frame rates while maintaining image quality.
- Architectural improvements tightly coupled with custom TSMC 4N process technology results in an up to 2x leap in power efficiency.
- Dual NVIDIA Encoders (NVENC) cut export times by up to half and feature AV1 support. The NVENC AV1 encode is being adopted by OBS, Blackmagic Design DaVinci Resolve, Discord and more.
Powered by new fourth-generation Tensor Cores and a new Optical Flow Accelerator on GeForce RTX 40 Series GPUs, DLSS 3 is the latest iteration of the company’s critically acclaimed Deep Learning Super Sampling technology and introduces a new capability called Optical Multi Frame Generation.
Optical Multi Frame Generation generates entirely new frames, rather than just pixels, delivering astounding performance boosts. The new Optical Flow Accelerator incorporated into the NVIDIA Ada Lovelace architecture analyzes two sequential in-game images and calculates motion vector data for objects and elements that appear in the frame, but are not modeled by traditional game engine motion vectors. This dramatically reduces visual anomalies when AI renders elements such as particles, reflections, shadows and lighting.
Pairs of super-resolution frames from the game, along with both engine and optical flow motion vectors, are then fed into a convolutional neural network that analyzes the data and automatically generates an additional frame for each game-rendered frame — a first for real-time game rendering. Combining the DLSS-generated frames with the DLSS super-resolution frames enables DLSS 3 to reconstruct seven-eighths of the displayed pixels with AI, boosting frame rates by up to 4x compared to without DLSS.
Because DLSS Frame Generation executes as a post-process on the GPU, it can boost frame rates even when the game is bottlenecked by the CPU. For CPU-limited games, such as those that are physics heavy or involve large worlds, DLSS 3 allows the GeForce RTX 40 Series GPUs to render the game at up to twice the frame rate that the CPU is able to compute the game.
DLSS 3 integrations also incorporate NVIDIA Reflex, which synchronizes the GPU and CPU, ensuring optimum responsiveness and low system latency.
Game Developers Embrace DLSS 3, Over 35 Games, Applications Coming Soon
The revolutionary power of DLSS 3 is a boon for game developers who want to express their artistic vision. The technology is coming to the world’s most popular game engines, such as Unity and Unreal Engine.
DLSS 3 has also received support from many of the world’s leading game developers, with more than 35 games and applications announcing support, including:
|
|
Since DLSS 3 builds on top of DLSS 2 integrations, game developers can quickly enable it in existing titles that already support DLSS 2 or NVIDIA Streamline.
DLSS 3 Coming Oct. 12
DLSS 3 is supported in GeForce RTX 40 Series GPUs and will debut on Wednesday, Oct. 12, with the availability of GeForce RTX 4090 GPUs. More details are available on GeForce.com and NVIDIA.com, including details on GeForce RTX 40 Series GPUs and NVIDIA DLSS technology.
New Ray-Tracing Tech for Even More Immersive Games
For decades, rendering ray-traced scenes with physically correct lighting in real time has been considered the holy grail of graphics. At the same time, geometric complexity of environments and objects has continued to increase as 3D games and graphics strive to provide the most accurate representations of the real world.
Achieving physically accurate graphics requires tremendous computational horsepower. Modern ray-traced games like Cyberpunk 2077 run over 600 ray-tracing calculations for each pixel just to determine lighting — a 16x increase from the first ray-traced games introduced four years ago.
The new third-generation RT Cores have been enhanced to deliver 2x faster ray-triangle intersection testing and include two important new hardware units. An Opacity Micromap Engine speeds up ray tracing of alpha-test geometry by a factor of 2x, and a Micro-Mesh Engine generates micro-meshes on the fly to generate additional geometry. The Micro-Mesh Engine provides the benefits of increased geometric complexity without the traditional performance and storage costs of complex geometries.
Creativity Redefined With RTX Remix, New AV1 Encoders
The RTX 40 Series GPUs and DLSS 3 deliver advancements for NVIDIA Studio creators. 3D artists can render fully ray-traced environments with accurate physics and realistic materials, and view the changes in real time, without proxies.
Video editing and live streaming also get a boost from improved GPU performance and the inclusion of new dual, eighth-generation AV1 encoders. The NVIDIA Broadcast software development kit has three updates, now available for partners, including Face Expression Estimation, Eye Contact and quality improvements to Virtual Background.
NVIDIA Omniverse — included in the NVIDIA Studio suite of software — will soon add NVIDIA RTX Remix, a modding platform to create stunning RTX remasters of classic games. RTX Remix allows modders to easily capture game assets, automatically enhance materials with powerful AI tools, and quickly enable RTX with ray tracing and DLSS.
Portal Is RTX ON!
RTX Remix has been used by NVIDIA Lightspeed Studios to reimagine Valve’s iconic video game Portal, regarded as one of the best video games of all time. Advanced graphics features such as full ray tracing and DLSS 3 give the game a striking new look and feel. Portal with RTX will be released as free, official downloadable content for the classic platformer with RTX graphics in November, just in time for Portal’s 15th anniversary.
Step Up To 1440p 360 FPS Competitive Gaming With New GeForce RTX 40 Series Graphics Cards and NVIDIA Reflex
The new GeForce RTX 40 Series GPUs are beyond fast, with the performance to deliver 360 FPS on 1440p 360Hz monitors arriving this fall, with as low as 10ms of system latency in top esports titles. Whether you game on a GeForce PC or laptop, NVIDIA Reflex reduces system latency in over 40 games, including 8 of the top 10 competitive shooters.
Another 8 Reflex games were just announced, including Call of Duty: Modern Warfare II, Cyberpunk 2077, Destroy All Humans! 2 - Reprobed, Loopmancer, Overwatch 2, Scathe, SUPER PEOPLE, Warhammer 40,000: Darktide, with many more in the pipeline.
The GeForce RTX 4090 and 4080: The New Ultimate GPUs
The RTX 4090 is the world’s fastest gaming GPU with astonishing power, acoustics and temperature characteristics. In full ray-traced games, the RTX 4090 with DLSS 3 is up to 4x faster compared to last generation’s RTX 3090 Ti with DLSS 2. It is also up to 2x faster in today’s games while maintaining the same 450W power consumption. It features 76 billion transistors, 16,384 CUDA cores and 24GB of high-speed Micron GDDR6X memory, and consistently delivers over 100 frames per second at 4K-resolution gaming. The RTX 4090 will be available on Wednesday, Oct. 12, starting at $1,599.
The company also announced the RTX 4080, launching in two configurations. The RTX 4080 16GB has 9,728 CUDA cores and 16GB of high-speed Micron GDDR6X memory, and with DLSS 3 is 2x as fast in today’s games as the GeForce RTX 3080 Ti and more powerful than the GeForce RTX 3090 Ti at lower power. The RTX 4080 12GB has 7,680 CUDA cores and 12GB of Micron GDDR6X memory, and with DLSS 3 is faster than the RTX 3090 Ti, the previous-generation flagship GPU.
Both RTX 4080 configurations will be available in November, with prices starting at $1,199 and $899, respectively.
Where to Buy
The GeForce RTX 4090 and 4080 GPUs will be available as custom boards, including stock-clocked and factory-overclocked models, from top add-in card providers such as ASUS, Colorful, Gainward, Galaxy, GIGABYTE, Innovision 3D, MSI, Palit, PNY and Zotac.
The RTX 4090 and RTX 4080 (16GB) are also produced directly by NVIDIA in limited Founders Editions for fans wanting the NVIDIA in-house design.
Look for the GeForce RTX 40 Series GPUs in gaming systems built by Acer, Alienware, ASUS, Dell, HP, Lenovo and MSI, leading system builders worldwide, and many more.
More articles about NVIDIA GeForce RTX 4090