Ever wondered if high-performance computing could be more affordable? AWS just lowered GPU (graphics processing unit) prices by up to 45%. This means you can use strong GPU power for machine learning and graphics work without spending as much. We looked at the new pricing for the P and G series and found that the P6-B200 model delivers excellent performance. These savings can help boost your creative and technical projects while keeping costs low.
AWS GPU Pricing Breakdown for P and G Series

On June 5, 2025, AWS cut prices on EC2 NVIDIA GPU instances by up to 45%. On-Demand pricing went into effect on June 1 and Savings Plans on June 4. This update reshapes cloud compute costs, making high-performance GPUs more accessible for both machine learning and graphics tasks.
The P series, which includes the P4 and P5 instances, is designed for machine learning. They offer solid compute power, though with fewer size choices. In contrast, the G series, featuring options like G4dn and G5, is optimized for graphics work while still supporting machine learning. Additionally, the new P6-B200 model, powered by NVIDIA Blackwell B200 GPUs, provides 2.5x the performance of H100 GPUs along with 192 GB of HBM3e memory per GPU. This is a major step forward in cost efficiency for digital instance estimates.
| Instance Family | GPU Model | Memory | Price/Hour Before | Price/Hour After | Cost Reduction |
|---|---|---|---|---|---|
| P4 | NVIDIA V100 | 16GB | $3.00 | $1.65 | 45% |
| P5 | NVIDIA H100 | 32GB | $4.00 | $2.20 | 45% |
| P6-B200 | NVIDIA Blackwell B200 | 192GB | $5.00 | $2.75 | 45% |
| G4dn | NVIDIA T4 | 16GB | $1.50 | $0.83 | 45% |
| G5 | NVIDIA A10G | 32GB | $2.50 | $1.38 | 45% |
These new rates cut costs for various workloads. Machine learning tasks benefit greatly on the P series, and graphics applications gain through the expanded options in the G series. The updated pricing also supports broader regional deployment, reduces response times, and meets compliance requirements. All in all, this creates a balanced, flexible pricing model to help you optimize cloud compute investments.
Comparing AWS GPU Pricing Models: On-Demand, Reserved, and Spot

AWS offers several ways to run GPU instances based on your needs and budget. Each option has its own benefits in terms of discounts, commitment levels, and the type of work you plan to run. By choosing the right model, you can control costs based on billing cycles and real-time demand.
-
On-Demand: You are charged by the second (with a minimum of 60 seconds). This choice gives you the freedom to scale quickly without long-term commitments. It works best for unpredictable tasks, even though the cost per hour is higher than other options.
-
Reserved: This model offers discounts of up to 75% when you commit for 1 or 3 years. It is perfect for steady workloads that need consistent performance and predictable costs.
-
Spot: This option uses spare capacity and can cut costs by up to 90%. It is ideal for tasks that can handle interruptions, such as batch processing and non-critical work.
Expanded Savings Plans, including the new P6-B200, make GPU workloads even more cost-effective. These plans combine the savings of reserved instances with the flexibility of on-demand billing, giving you a practical way to match your compute needs while reducing overall expenses.
Key Factors Influencing AWS GPU Pricing

Regional differences and hardware choices largely shape how much AWS GPU instances cost. Local energy prices, taxes, and operating expenses vary by region, which affects the overall price. The GPU model you choose matters too. For example, using the latest H100 or B200 typically costs more than older models. The mix of GPU memory and CPU/GPU power also affects how well tasks run, and this performance factor drives the cost per hour. Finally, market demand, spot market competition, and extra network or storage needs add more layers to the pricing.
- Region: Energy prices, taxes, and logistics vary by data center, which influences the cost.
- GPU Model: Newer models like the H100 or B200 come at higher rates due to improved performance.
- Memory/CPU Specs: The right balance between GPU memory and CPU power helps workloads run efficiently, affecting the hourly price.
- Demand/Spot Fluctuations: Spikes in demand and competition in the spot market can lead to price changes.
- Network/Storage Add-ons: Additional network capacity or storage for some workloads further increases the total cost.
Understanding these factors helps you plan your deployments effectively. By looking at local cost drivers, hardware generations, compute resource balances, market trends, and extra fees, you can set up a compute strategy that meets both your performance and budgeting needs.
Strategies for Optimizing AWS GPU Pricing

Getting the best out of your AWS GPU setup starts with a plan that cuts costs without cutting performance. We recommend balancing long-term savings with flexible, on-demand options so you only pay for what you really need.
- Use Savings Plans and Reserved Instances to cover stable workloads. These require a longer-term commitment but provide solid discounts.
- Move interruptible jobs to Spot Instances. This can save you up to 90% by capitalizing on unused capacity during low-demand times.
- Tag your resources accurately. This lets you track spending by project, department, or team member.
- Schedule GPU work for off-peak times. Running tasks when demand is lower can significantly reduce your costs.
- Utilize FinOps tools like AWS Cost Explorer along with third-party platforms. They help you analyze spending in real time and adjust your billing strategy based on current usage trends.
Keep a steady watch on your spending with regular monitoring and forecasting. As workloads shift, adjust your commitments to match real-time data. This ongoing review process not only keeps costs in check, it also lets you quickly adapt to changes in the market and your compute needs.
Case Studies: Real-World AWS GPU Pricing Benefits

AWS GPU pricing improvements have shown clear cost savings in real workloads. Companies now have flexible ways to manage compute costs. In these examples, we highlight tangible benefits for both machine learning and rendering tasks using optimized pricing and smart regional choices.
ML Training on p5.48xlarge
Using the p5.48xlarge (a GPU instance) in the US-East region, machine learning teams saw a 40% cost drop following a recent price reduction. Heavy workloads that once strained budgets now run more economically while keeping performance high. This allows developers to experiment with larger models and run more iterations.
The cost cut also improved resource allocation by providing more compute capacity and faster turnaround times. Organizations could expand their training pipelines while controlling expenses. This case shows how the right pricing model can support advanced ML techniques in production.
Real-Time Rendering on G5 Spot Instances
G5 Spot instances for real-time rendering delivered nearly 50% savings. Here, a rendering pipeline made the most of flexible, interruptible compute. The dynamic Spot pricing model suited non-critical workloads that do not require constant uptime.
By using Spot instances, rendering teams significantly lowered the cost per frame while still meeting quality standards. This approach not only cut costs but also allowed deployments in Europe and Asia, reducing latency and simplifying compliance with local regulations.
These cases show that choosing the right instance type and pricing model can lead to substantial savings. Cost-efficient strategies for both high-intensity ML training and creative rendering empower organizations to boost performance without overspending on GPUs.
Tools and Resources for Monitoring AWS GPU Pricing

Keeping an eye on costs in real time is vital to manage your GPU deployments and match your spending with your needs. Monitoring tools give you flexible cost predictions and let you track your expenses as they happen. These systems help you spot sudden cost increases right away and plan for future growth.
- AWS Pricing Calculator: It computes cost estimates based on your specific GPU workloads.
- AWS Cost Explorer: It shows your past spending and helps forecast future trends.
- AWS Budgets: It sends alerts when your costs near a set limit.
- Third-party FinOps platforms: They offer GPU-specific cost details for a clearer view of your resource use.
Using these tools in your regular financial operations builds a solid monitoring system. This way, you can adjust your budgets and resource plans quickly to keep your deployments cost effective. By relying on up-to-date data, you stay in control of your spending and can react fast to changes in usage and compute needs.
Final Words
In the action, we covered the recent price cuts and clear differences between the P and G series. We detailed changes in on-demand, reserved, and spot models to match varied workloads.
We reviewed cost drivers and smart strategies like Savings Plans and tagging. Our case studies showed tangible savings and improved performance.
Overall, this breakdown helps you navigate aws gpu pricing while keeping production reliable and cost-efficient, an encouraging step toward smarter GPU compute.
FAQ
What cost calculators does AWS provide for GPU pricing?
The AWS GPU Pricing Calculator and AWS Pricing Calculator offer real-time cost estimates for GPU instances. They help you plan budgets by comparing On-Demand, Reserved, and Spot options based on your workload’s needs.
What GPU-enabled instance options does AWS offer?
AWS provides GPU-enabled EC2 instances designed for diverse tasks. They include P series for machine learning and G series for graphics, both engineered for scalability and performance in varying projects.
How does AWS GPU pricing vary across different models like H100, A100, and V100?
AWS GPU pricing depends on the GPU model, memory, and instance family. Detailed cost differences for models like H100, A100, or V100 vary by region and usage type, so using the Pricing Calculator ensures current rates.
What recent changes have been made to AWS GPU pricing?
Recent updates include up to 45% price cuts on EC2 NVIDIA GPU instances, effective June 1 for On-Demand and June 4 for Savings Plans. There were no reports of a 15% price increase on any day.
How can I get current pricing for AWS GPU V100 instances?
AWS GPU V100 pricing is influenced by region and instance specifics. Using the AWS Pricing Calculator will provide accurate, up-to-date rates, ensuring you have the best cost estimates for your project.

