All content on this page is sourced from the Wikipedia articles Nvidia and List of Nvidia GPUs.

Nvidia

Nvidia Corporation is an American technology company headquartered in Santa Clara, California. Founded in 1993 by Jensen Huang, Chris Malachowsky, and Curtis Priem, it develops graphics processing units (GPUs), systems on chips (SoCs), and application programming interfaces (APIs) for data science, high-performance computing, video games, and mobile and automotive applications. Nvidia has been described as a Big Tech company.

Originally focused on GPUs for video gaming, Nvidia broadened their use into other markets, including artificial intelligence (AI), professional visualization, and supercomputing. The company's product lines include GeForce GPUs for gaming and creative workloads, and professional GPUs for edge computing, scientific research, and industrial applications. As of the first quarter of 2025, Nvidia held a 92% share of the discrete desktop and laptop GPU market.

Nvidia has ended support for older generations, meaning they no longer receive driver updates. GPU driver updates are important because they can fix bugs, improve compatibility, and enhance performance, especially in new games.

Generations (microarchitectures) that still receive support:

Blackwell

Blackwell is a graphics processing unit (GPU) microarchitecture developed by Nvidia as the successor to the Hopper and Ada Lovelace microarchitectures. It was officially announced at Nvidia's GTC 2024 keynote on March 18, 2024.

[Image: A GB200 die with Blackwell processors. Photo by 极客湾Geekerwan, CC BY 3.0.]

About the Architecture

Blackwell is fabricated by TSMC on the custom 4NP process node for datacenter products and on the custom 4N process node for consumer products. 4NP is an enhancement of the 4N node used for the Hopper and Ada Lovelace architectures. The Nvidia-specific 4NP process likely adds metal layers to the standard TSMC N4P technology. The GB100 die contains 104 billion transistors, a 30% increase over the 80 billion transistors in the previous-generation Hopper GH100 die. Because Blackwell cannot reap the benefits of a major process-node advancement, it must achieve its power-efficiency and performance gains through underlying architectural changes.

The GB100 die is at the reticle limit of semiconductor fabrication: the maximum die area that a lithography machine can pattern in a single exposure. Previously, Nvidia had nearly hit TSMC's reticle limit with GH100's 814 mm² die. To avoid being constrained by die size, Nvidia's B100 accelerator uses two GB100 dies in a single package, connected by a 10 TB/s link that Nvidia calls the NV-High Bandwidth Interface (NV-HBI). NV-HBI is based on the NVLink 7 protocol. Nvidia CEO Jensen Huang said in an interview with CNBC that Nvidia had spent around $10 billion on research and development for Blackwell's NV-HBI die interconnect. Veteran semiconductor engineer Jim Keller, who had worked on AMD's K7, K12, and Zen architectures, criticized this figure, claiming the same outcome could be achieved for $1 billion by using Ultra Ethernet rather than the proprietary NVLink system. The two connected GB100 dies are able to act like one large monolithic piece of silicon, with full cache coherency between the dies. The dual-die package totals 208 billion transistors. The two GB100 dies sit on top of a silicon interposer produced with TSMC's CoWoS-L 2.5D packaging technique.

On the consumer side, Blackwell's largest die, GB202, measures 750 mm², roughly 20% larger than AD102, Ada Lovelace's largest die. GB202 contains a total of 24,576 CUDA cores, 33% more than the 18,432 CUDA cores in AD102. GB202 is the largest consumer die Nvidia has designed since the 754 mm² TU102 die of 2018, based on the Turing microarchitecture. The gap between GB202 and GB203 has also widened considerably compared to previous generations: GB202 features more than double the CUDA cores of GB203, which was not the case for AD102 relative to AD103.
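The die and core comparisons above can be sanity-checked with a few lines of arithmetic; the constants below are simply the figures quoted in the text.

```python
# Back-of-the-envelope check of the comparisons quoted above
# (all input figures are taken from the text).

GH100_TRANSISTORS = 80e9      # Hopper GH100
GB100_TRANSISTORS = 104e9     # Blackwell GB100 (single die)

# 104 billion is a 30% increase over 80 billion
increase = GB100_TRANSISTORS / GH100_TRANSISTORS - 1
print(f"GB100 vs GH100 transistors: +{increase:.0%}")          # +30%

# Dual-die B100 package: two GB100 dies
print(f"dual-die total: {2 * GB100_TRANSISTORS / 1e9:.0f}B")   # 208B

AD102_CUDA_CORES = 18_432
GB202_CUDA_CORES = 24_576
core_increase = GB202_CUDA_CORES / AD102_CUDA_CORES - 1
print(f"GB202 vs AD102 CUDA cores: +{core_increase:.1%}")      # +33.3%
```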

GPUs released this generation
Model Name Launch
GeForce RTX 5090 Jan 30, 2025
GeForce RTX 5080 Jan 30, 2025
GeForce RTX 5070 Ti Mar 5, 2025
GeForce RTX 5060 Ti (16 GB) April 16, 2025
GeForce RTX 5060 Ti (8 GB) April 16, 2025
GeForce RTX 5060 May 19, 2025
GeForce RTX 5050 July 1, 2025

About RTX 50 series

  • GeForce RTX 50 series desktop GPUs are the first consumer GPUs to utilize a PCIe 5.0 interface and GDDR7 video memory.
  • Supported APIs: Direct3D 12.2, OpenGL 4.6, OpenCL 3.0, Vulkan 1.4 and CUDA 12.x
  • Supported display connections: HDMI 2.1b, DisplayPort 2.1b
  • 9th gen NVENC (3×/2×/1×) / 6th gen NVDEC (2×/1×)
  • NVIDIA DLSS 4 (Multi Frame Generation support)
  • AI Management Processor (AMP)
  • Reflex 2 optimized
  • Tensor core 5th gen (INT4/FP4 capabilities and second-generation FP8 Transformer Engine)
  • RT core 4th gen
  • Shader processors, RT cores and tensor cores optimized for RTX Neural Shaders and new neural workloads
  • Mega Geometry Technology optimized (Shader processors and RT cores)
  • Shader Execution Reordering (SER) 2.0
  • Linear Swept Spheres (LSS)
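The generation sections on this page each list a CUDA compute capability (Turing 7.5, Ampere 8.0/8.6, Ada Lovelace 8.9, consumer Blackwell 12.x). A minimal, hypothetical helper that maps a capability back to the architecture name could look like the sketch below; in a real program the `(major, minor)` pair would come from something like `cudaGetDeviceProperties` in the CUDA runtime or `torch.cuda.get_device_capability()`.

```python
# Hypothetical helper mapping a CUDA compute capability to the
# microarchitecture names used on this page. Only the capabilities
# mentioned here are covered; this is a sketch, not an exhaustive table.

def arch_from_compute_capability(major: int, minor: int) -> str:
    table = {
        (7, 5): "Turing",
        (8, 0): "Ampere",        # A100
        (8, 6): "Ampere",        # GeForce 30 series
        (8, 9): "Ada Lovelace",  # GeForce 40 series
    }
    if major == 12:
        return "Blackwell"       # GeForce 50 series reports CC 12.x
    return table.get((major, minor), "unknown")

print(arch_from_compute_capability(8, 9))   # Ada Lovelace
print(arch_from_compute_capability(12, 0))  # Blackwell
```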
Learn More about Blackwell

Ada Lovelace

Ada Lovelace, also referred to simply as Lovelace, is a graphics processing unit (GPU) microarchitecture developed by Nvidia as the successor to the Ampere architecture, officially announced on September 20, 2022.


Architectural details

Architectural improvements of the Ada Lovelace architecture include the following:

  • CUDA Compute Capability 8.9
  • TSMC 4N process (custom designed for Nvidia) - not to be confused with TSMC's regular N4 node
  • 4th-generation Tensor Cores with FP8, FP16, bfloat16, TensorFloat-32 (TF32) and sparsity acceleration
  • 3rd-generation Ray Tracing Cores, plus concurrent ray tracing and shading and compute
  • Shader Execution Reordering (SER)
  • Nvidia video encoder/decoder (NVENC/NVDEC) with 8K 10-bit 60FPS AV1 fixed function hardware encoding
  • No NVLink support
GPUs released this generation
Model Name Launch
GeForce RTX 4090 Oct 12, 2022
GeForce RTX 4080 Nov 16, 2022
GeForce RTX 4070 Ti Jan 5, 2023
GeForce RTX 4070 Apr 13, 2023
GeForce RTX 4060 Ti (8GB) May 24, 2023
GeForce RTX 4060 Jun 29, 2023
GeForce RTX 4060 Ti (16GB) Jul 18, 2023
GeForce RTX 4070 Super Jan 17, 2024
GeForce RTX 4070 Ti Super Jan 24, 2024
GeForce RTX 4080 Super Jan 31, 2024

About RTX 40 series

  • Supported APIs: Direct3D 12 Ultimate (12_2), OpenGL 4.6, OpenCL 3.0, Vulkan 1.3 and CUDA 8.9
  • Supported display connections: HDMI 2.1, DisplayPort 1.4a
  • Tensor core 4th gen
  • RT core 3rd gen
  • NVIDIA DLSS 3
  • NVIDIA DLSS 3.5
  • Shader Execution Reordering
  • Dual NVENC with 8K 10-bit 60FPS AV1 fixed function hardware encoding
  • Opacity Micro-Maps (OMM)
  • Displacement Micro-Meshes (DMM)
  • No NVLink support; multi-GPU communication over PCIe 4.0
Learn More about Ada Lovelace

Ampere

Ampere is the codename for a graphics processing unit (GPU) microarchitecture developed by Nvidia as the successor to both the Volta and Turing architectures. It was officially announced on May 14, 2020.


Architectural details

Architectural improvements of the Ampere architecture include the following:

  • CUDA Compute Capability 8.0 for A100 and 8.6 for the GeForce 30 series
  • TSMC's 7 nm FinFET process for A100
  • Custom version of Samsung's 8 nm process (8N) for the GeForce 30 series
  • Third-generation Tensor Cores with FP16, bfloat16, TensorFloat-32 (TF32) and FP64 support and sparsity acceleration. Each Tensor Core performs 256 FP16 FMA operations per clock, 4× the processing power of previous Tensor Core generations (GA100 only; 2× on GA10x); the Tensor Core count is reduced to one per SM.
  • Second-generation ray tracing cores; concurrent ray tracing, shading, and compute for the GeForce 30 series
  • High Bandwidth Memory 2 (HBM2) on A100 40 GB and A100 80 GB
  • GDDR6X memory for GeForce RTX 3090, RTX 3080 Ti, RTX 3080, RTX 3070 Ti
  • Double FP32 cores per SM on GA10x GPUs
  • NVLink 3.0 with a 50 Gbit/s per pair throughput
  • PCI Express 4.0 with SR-IOV support (SR-IOV is reserved only for A100)
  • Multi-instance GPU (MIG) virtualization and spatial GPU partitioning feature in A100 supporting up to seven instances
  • PureVideo feature set K hardware video decoding with AV1 hardware decoding for the GeForce 30 series and feature set J for A100
  • 5 NVDEC for A100
  • Adds new hardware-based 5-core JPEG decode (NVJPG) with YUV420, YUV422, YUV444, YUV400, RGBA. Should not be confused with Nvidia NVJPEG (GPU-accelerated library for JPEG encoding/decoding)
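The Tensor Core throughput claim above reduces to simple ratios. The 256 FMA/clock figure for GA100 is from the text; the 64 FMA/clock per-core rate for the previous generation (Volta/Turing) is an assumption, a commonly cited figure that is not stated here.

```python
# Per-Tensor-Core throughput ratios implied by the note above.
PREV_GEN_FMA_PER_CLOCK = 64    # assumed Volta/Turing per-core rate (not in the text)
GA100_FMA_PER_CLOCK = 256      # from the text
GA10X_FMA_PER_CLOCK = 128      # implied by the "2x on GA10x" note

print(GA100_FMA_PER_CLOCK / PREV_GEN_FMA_PER_CLOCK)  # 4.0 -> "4x" on GA100
print(GA10X_FMA_PER_CLOCK / PREV_GEN_FMA_PER_CLOCK)  # 2.0 -> "2x" on GA10x
```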
GPUs released this generation
Model Name Launch
GeForce RTX 3080 Sep 17, 2020
GeForce RTX 3090 Sep 24, 2020
GeForce RTX 3070 Oct 29, 2020
GeForce RTX 3060 Ti Dec 2, 2020
GeForce RTX 3060 Feb 25, 2021
GeForce RTX 3080 Ti Jun 3, 2021
GeForce RTX 3070 Ti Jun 10, 2021
GeForce RTX 3080 (12 GB) Jan 11, 2022
GeForce RTX 3050 Jan 27, 2022
GeForce RTX 3090 Ti Mar 29, 2022

About RTX 30 series

  • Supported APIs: Direct3D 12 Ultimate (12_2), OpenGL 4.6, OpenCL 3.0, Vulkan 1.3 and CUDA 8.6
  • Supported display connections: HDMI 2.1, DisplayPort 1.4a
  • NVENC 7th generation
  • Tensor core 3rd gen
  • RT Core 2nd gen
  • RTX IO
  • Improved NVDEC with AV1 decode
  • NVIDIA DLSS 2.0
Learn More about Ampere

Turing

Turing is the codename for a graphics processing unit (GPU) microarchitecture developed by Nvidia. The architecture was first introduced in August 2018 at SIGGRAPH 2018 in the workstation-oriented Quadro RTX cards, and one week later at Gamescom in consumer GeForce 20 series graphics cards.

[Image: Nvidia GeForce RTX 2080 (TU104, Turing) package close-up. Photo by Fritzchens Fritz.]

Features in Turing:

  • CUDA cores (SM, Streaming Multiprocessor)
  • Compute Capability 7.5
  • Traditional rasterized shaders and compute
  • Concurrent execution of integer and floating point operations (inherited from Volta)
  • Ray-tracing (RT) cores
    • Bounding volume hierarchy acceleration
    • Shadows, ambient occlusion, lighting, reflections
  • Tensor (AI) cores
    • Artificial intelligence
    • Large matrix operations
    • Deep Learning Super Sampling (DLSS)
  • Nvidia Optical Flow Accelerator for video interpolation workloads
  • Memory controller with GDDR6/HBM2 support
  • DisplayPort 1.4a with Display Stream Compression (DSC) 1.2
  • PureVideo Feature Set J hardware video decoding
  • GPU Boost 4
  • NVLink bridge with VRAM stacking (pooling memory from multiple cards)
  • VirtualLink VR
  • NVENC hardware encoding
  • The GDDR6 memory is produced by Samsung Electronics for the Quadro RTX series. The RTX 20 series initially launched with Micron memory chips, before switching to Samsung chips by November 2018.
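The "large matrix operations" the feature list attributes to Tensor Cores are fused matrix multiply-accumulates, D = A·B + C, on small tiles (these cores are commonly described as operating on 4×4 FP16 tiles). The pure-Python loop below only illustrates the operation; the hardware performs it in a single step.

```python
# Sketch of the matrix fused multiply-accumulate D = A @ B + C that
# Tensor Cores execute on small tiles. Illustrative only: a real Tensor
# Core does this in hardware on FP16 inputs, not in a Python loop.

def matrix_fma(A, B, C):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) + C[i][j]
             for j in range(n)] for i in range(n)]

# Identity times identity, plus a zero accumulator, is the identity.
I = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
Z = [[0] * 4 for _ in range(4)]
print(matrix_fma(I, I, Z) == I)  # True
```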
GPUs released this generation
Model Name Launch
GeForce RTX 2080 Sep 20, 2018
GeForce RTX 2080 Ti Sep 27, 2018
GeForce RTX 2070 Oct 17, 2018
Nvidia Titan RTX Dec 18, 2018
GeForce RTX 2060 Jan 15, 2019
GeForce RTX 2060 Super Jul 9, 2019
GeForce RTX 2070 Super Jul 9, 2019
GeForce RTX 2080 Super Jul 23, 2019
GeForce RTX 2060 (12 GB) Dec 7, 2021
GeForce GTX 1660 Ti Feb 21, 2019
GeForce GTX 1660 Mar 14, 2019
GeForce GTX 1650 Apr 23, 2019
GeForce GTX 1650 Super Nov 22, 2019
GeForce GTX 1660 Super Oct 29, 2019
GeForce GTX 1630 Jun 28, 2022

About RTX 20 series

  • Supported APIs: Direct3D 12 Ultimate (12_2), OpenGL 4.6, OpenCL 3.0, Vulkan 1.3 and CUDA 7.5
  • Unlike previous generations, the non-Super Founders Edition cards (RTX 2070, RTX 2080, RTX 2080 Ti) no longer run at reference clocks but are factory-overclocked. The Super Founders Edition cards (RTX 2060 Super, RTX 2070 Super, RTX 2080 Super), however, do run at reference clocks.
  • NVENC 6th generation (B-frame, etc.)

About GTX 16 series

  • Supported APIs: Direct3D 12 (feature level 12_1), OpenGL 4.6, OpenCL 3.0, Vulkan 1.3 and CUDA 7.5
  • NVENC 6th generation (B-frame, etc.)
  • TU117 only supports Volta NVENC (5th generation)
Learn More about Turing

Generations (microarchitectures) that no longer receive support:

Nvidia has ended driver support for these generations because they are old and no longer up to today's standards.

Pascal

Pascal is the codename for a GPU microarchitecture developed by Nvidia, as the successor to the Maxwell architecture. The architecture was first introduced in April 2016 with the release of the Tesla P100 (GP100) on April 5, 2016, and is primarily used in the GeForce 10 series.

Learn more (external link)

Maxwell

Maxwell is the codename for a GPU microarchitecture developed by Nvidia as the successor to the Kepler microarchitecture. The Maxwell architecture was introduced in later models of the GeForce 700 series and is also used in the GeForce 800M series, GeForce 900 series, and Quadro Mxxx series, as well as some Jetson products.

Learn more (external link)

Kepler

Kepler is the codename for a GPU microarchitecture developed by Nvidia, first introduced at retail in April 2012, as the successor to the Fermi microarchitecture. Kepler was Nvidia's first microarchitecture to focus on energy efficiency.

Learn more (external link)

Fermi

Fermi is the codename for a graphics processing unit (GPU) microarchitecture developed by Nvidia, first released to retail in April 2010, as the successor to the Tesla microarchitecture. It was the primary microarchitecture used in the GeForce 400 and 500 series.

Learn more (external link)

Tesla

Tesla is the codename for a GPU microarchitecture developed by Nvidia, released in 2006 as the successor to the Curie microarchitecture. As Nvidia's first microarchitecture to implement unified shaders, it was used with the GeForce 8, 9, 100, 200, and 300 series of GPUs.

Learn more (external link)

Curie

Curie is the codename for a GPU microarchitecture developed by Nvidia, and released in 2004, as the successor to the Rankine microarchitecture. It was named with reference to the Polish physicist Marie Salomea Skłodowska-Curie and used with the GeForce 6 and 7 series.

Learn more (external link)

Rankine

Rankine is the codename for a GPU microarchitecture developed by Nvidia, and released in 2003, as the successor to the Kelvin microarchitecture. It was named with reference to Macquorn Rankine and used with the GeForce FX series.

Learn more (external link)

Kelvin

Kelvin is the codename for a GPU microarchitecture developed by Nvidia, and released in 2001, as the successor to the Celsius microarchitecture. It was named with reference to William Thomson, 1st Baron Kelvin and used with the GeForce 3 and 4 series.

Learn more (external link)

Celsius

Celsius is the codename for a GPU microarchitecture developed by Nvidia and released in 1999. It was named with reference to Anders Celsius and used with the GeForce 256 and GeForce 2 series.

Learn more (external link)