Have you ever dreamed of getting top GPU (graphics processing unit) power without spending too much? AWS is cutting prices by up to 45% on On-Demand GPU instances starting June 1, 2025. New Savings Plans also help you manage costs for AI, machine learning, and graphics projects. In this post, we explain how these updates give you strong performance and cost control, making high-performance computing more affordable and predictable.
AWS GPU Pricing Overview and Key Cost Models

Starting June 1, 2025, AWS cuts On-Demand GPU pricing by up to 45%. This means companies running AI, machine learning (ML), high-performance computing (HPC), and generative tasks can see real savings. After June 4, 2025, Savings Plans will offer similar discounts, making it easier to match compute capacity to long-term project needs. This approach keeps costs low without compromising the high-performance computing required for graphic-intensive work.
AWS is also expanding these offerings to more global regions. This expansion brings lower delays and helps you meet local data rules. It gives you a clearer view of your compute expenses, so you can estimate costs based on workload intensity and location. Whether you're rendering detailed scenes or processing complex AI models, the pricing is designed to match your technical and business goals.
| Instance Type | Description |
|---|---|
| On-Demand | No long-term commitment with up to 45% cost reduction |
| Savings Plans | Reserved capacity with annual or multi-year terms offering significant discounts |
| Spot Instances | Very low rates with the possibility of interruptions |
This 45% reduction along with broader regional availability makes budgeting more predictable. It lets you scale GPU resources efficiently while keeping expenses in check. In short, AWS's new pricing structure offers a balanced solution that meets both powerful performance needs and cost control for a variety of demanding workloads.
Detailed AWS GPU Instance Cost Breakdown

AWS provides a variety of GPU instances that are built to meet different workload needs. Each type offers unique benefits based on its GPU model, available VRAM (video memory), and price. For example, the p5.48xlarge uses H100 GPUs and shows the impact of recent price cuts. The new p6-B200 uses NVIDIA Blackwell B200 GPUs, which can deliver up to 2.5x more performance than H100 GPUs. Each GPU comes with 192 GB of high-bandwidth HBM3e memory. Other instances like the g5.xlarge, g4dn.xlarge, and p4d.24xlarge provide a mix of performance and cost options so you can optimize for compute power and budget.
| Instance Type | GPU Model | VRAM | Relative Cost Change |
|---|---|---|---|
| p5.48xlarge | H100 | Varies | Up to 45% reduction |
| p6-B200 | Blackwell B200 | 192 GB HBM3e | Enhanced performance |
| g5.xlarge | A10 | Moderate VRAM | Slight discount |
| g4dn.xlarge | T4 | 16 GB | Cost-effective |
| p4d.24xlarge | A100 | 40 GB | Premium pricing |
VRAM capacity and core design play key roles in determining price and performance. Large, fast VRAM like that of the Blackwell B200 helps process big datasets faster, which is vital for heavy AI training. The mix of CUDA cores (used for general tasks) and Tensor cores (tuned for AI work) also affects overall compute efficiency. This lets you choose the right instance to match your specific needs and budget, making your operations more cost-effective while still delivering high performance for AI and graphics rendering.
Region-Specific AWS GPU Pricing Variations

AWS now offers GPU instances in additional regions. This means you get lower latency, meet local data regulations, and manage workloads more efficiently. Prices for a p5.48xlarge instance can vary by region, providing real-time pricing that reflects local market conditions. Regional differences in infrastructure and operating costs make budgeting more predictable and global operations more flexible.
Price multipliers change due to varying local power costs and support expenses. AWS adjusts its rates to match each market's cloud service trends while maintaining reliable performance and meeting regional policies. This lets you optimize compute expenses without compromising performance.
| Region | Multiplier |
|---|---|
| US East | 1.00× |
| EU West | 1.10× |
| Asia Pacific Tokyo | 1.20× |
| South America São Paulo | 1.30× |
| Middle East Bahrain | 1.25× |
Local factors like power costs, infrastructure investments, and taxes directly influence these differences. By understanding these drivers, you can better predict spending across regions and align your deployment strategy with specific market dynamics.
AWS GPU Pricing Models: On-Demand, Reserved, and Spot Comparison

AWS offers several GPU pricing options that match different work styles. For instance, a recent enterprise case study showed that using the P6-B200 on Reserved Plans can cut costs for steady workloads, while design studios have saved money by running non-critical rendering tasks on Spot Instances during off-peak times.
-
On-Demand – With discounts anywhere from 0% to 45%, this model is great when you need flexibility without a long-term tie.
Example: A project saved 45% on costs by using On-Demand rates while quickly scaling up. -
Savings Plans – Over a 1- to 3-year period, you can get a 30% to 50% discount. This option is ideal for consistent workloads and detailed capacity planning, with choices like the P6-B200 that suit enterprise needs (enterprise nvidia gpu pricing).
Example: An animation studio used Savings Plans to streamline expenses during a multi-year production cycle. -
Spot Instances – Offering up to 90% off, this option is best for work that can handle interruptions, like batch processing or proof-of-concept runs.
Example: A simulation team cut costs by 90% on non-critical testing by using Spot Instances during off-peak hours.
Each pricing model fits different needs, flexible On-Demand for quick bursts, reliable Savings Plans for ongoing work, and cost-effective Spot Instances when occasional interruptions won’t hurt your workflow.
AWS GPU Pricing: Affordable Power & Performance

We control costs by matching infrastructure upgrades to your business needs. We fine-tune GPU VRAM and core settings to fit your workload. You can save by shifting less critical tasks to CPUs or exploring AMD options. AWS tools give you detailed insights into your usage so you can keep your budget in check and scale spending as needed. This approach lets you adjust deployments, prevent overspending, and stay flexible in a changing cloud environment.
- Choose the instance type that fits your workload
- Use Savings Plans for long-term projects with steady usage
- Take advantage of Spot Instances for flexible, interruptible tasks
- Set up AWS Cost Explorer alerts to keep track of spending
- Optimize data transfer and network use
- Automate shutdowns for idle instances
These practices help you align GPU deployments with your exact needs. By matching instance options to actual workload demands and selecting pricing models that reward predictable usage, you reduce waste and boost performance. Tools like Cost Explorer and automated shutdowns further fine-tune your spending, ensuring every dollar supports your goals. This focused strategy not only cuts costs but also makes your infrastructure more scalable and efficient, turning GPU investments into key business assets. For more details, check out our guide comparing cloud GPU cost vs on-prem GPU cost.
Frequently Asked Questions on AWS GPU Pricing and Hidden Fees

-
Are data-egress fees included?
No. Data egress fees are charged separately at about $0.09 per gigabyte in the US East region. -
Is premium support bundled?
No. Premium support is available as a separate plan. -
Are software licenses extra?
Sometimes. Third-party software licenses may incur additional charges. -
How does EBS storage affect cost?
EBS storage is billed separately based on your usage. -
Can hourly rates change mid-instance?
No. Hourly rates are fixed when you launch your instance.
For more billing details and an overview of any hidden charges, please refer to the official AWS billing documentation available through your AWS account.
Final Words
In the action, we walked through AWS GPU pricing models and cost breakdowns that matter for your production workflows. We covered key aspects like On-Demand, Savings Plans, and Spot Instances, and shared how regional differences and instance performance impact your budget.
We then explored actionable strategies to optimize costs while keeping operations reliable and efficient. By understanding these elements, you can make informed decisions and better control your expenses using aws gpu pricing.
FAQ
Frequently Asked Questions
What are AWS GPU instances and instance types?
AWS GPU instances and instance types refer to cloud servers with dedicated graphics processing units (GPUs) like H100 and A100, optimized for AI, machine learning, and graphics rendering workloads.
What is the AWS GPU Pricing Calculator and AWS Pricing Calculator?
The AWS GPU Pricing Calculator and AWS Pricing Calculator help you estimate costs for GPU instances and other AWS services, letting you compare On-Demand, Savings Plans, and Spot Instance models quickly.
How much do GPUs in AWS cost, especially H100 and A100 models?
AWS GPU costs vary by model and region. For instance, H100 and A100 pricing adjust based on configuration and discounts, so checking current pricing details is the best way to get accurate cost estimates.
Does AWS offer GPUs for compute tasks?
AWS offers a range of GPU instances designed for compute-intensive tasks. These instances use GPUs from NVIDIA among others, making them suitable for AI, rendering, and high-performance computing projects.
How much will AWS reduce GPU prices?
AWS has reduced GPU pricing by up to 45% on On-Demand instances and extended similar discounts with Savings Plans, lowering the overall cost for GPU-heavy workloads while expanding global availability.
What should I know about AWS paid plan pricing and monthly costs?
AWS paid plan pricing varies with service selection and usage. Monthly costs depend on instance type, region, and duration, so using the Pricing Calculator provides a tailored cost estimate for your needs.
How do AWS Dedicated server pricing, GPU rent, and VPS pricing differ?
AWS offers dedicated servers for exclusive hardware use, GPU rent for specialized GPU performance, and VPS for flexible virtual servers. Each option caters to specific performance and budgeting requirements.
What defines AWS EC2 pricing for compute instances?
AWS EC2 pricing for compute instances depends on instance types, regions, and usage patterns. These rates integrate with the overall service pricing and can be precisely estimated using the EC2 Pricing Calculator.

