GPU

Definition

Graphics Processing Unit - specialized hardware designed for parallel processing that accelerates AI training and graphics rendering.

Use Cases

Provider Equivalents

Frequently Asked Questions

What's the difference between a GPU and a CPU?
A CPU is designed for general-purpose computing and excels at a few complex tasks at a time (low parallelism). A GPU is designed to run many similar operations in parallel (high parallelism), which makes it much faster for workloads like deep learning training, matrix math, image/video processing, and 3D rendering.
When should I use a GPU in the cloud?
Use a GPU when your workload is highly parallel and spends most of its time on math-heavy operations—common examples are training deep learning models, running high-throughput AI inference, 3D rendering, video transcoding, and scientific simulations. If your workload is mostly web serving, typical databases, or single-threaded logic, a CPU instance is usually more cost-effective.
How much does a cloud GPU cost?
GPU cost depends on the GPU model (e.g., NVIDIA T4 vs A100 vs H100), the number of GPUs, VM size (CPU/RAM), region, and pricing model (on-demand vs reserved/committed use vs spot/preemptible). GPUs are typically priced per hour (or per second/minute depending on provider) and can range from relatively low-cost inference GPUs to very expensive top-end training GPUs. Also factor in storage, data transfer, and managed service fees if you use a platform like a managed ML service.

Category: hardware

Difficulty: intermediate

Related Terms

See Also