The FinOps Inform Phase: Ensuring Cloud Cost Visibility
The FinOps Inform Phase involves setting up visibility into your organization’s cloud spend in order to inform future optimization decision.
Read
In Phase 1, we showed you how to set up your cloud environment for monitoring. We then explained how to create effective cloud cost intelligence dashboards. Doing this enabled you to:
These one-time manual fixes can lead to significant cloud cost savings. However, ideally, you shouldn't have to rely on accurate forecasting and manual intervention. Instead, your resources should scale up and down automatically based on your business’s changing needs. This article will explain how to achieve that by:
Your target cloud resource utilization rate should be 70-80%. But most companies are not even close to hitting that. On average, 30-40% of instances are over-provisioned, and some studies show even worse. So for every $1 spent on cloud computing services, at least $0.30-$0.40 is wasted on unused capacity.
Rightsizing your cloud infrastructure involves matching the appropriate size and type of resources to actual baseline workload requirements, so that you never pay for more than what you need.
Under-provisioning cloud computing resources for a critical application can be more costly than over-provisioning, as it can result in degraded performance or even downtime. For example, if you're running an e-commerce business, having your site crash on Black Friday because you don't have enough servers is going to cost a lot more than what you'd save on cloud operating costs.
Therefore, after rightsizing your baseline workloads, you should consider setting up auto-scaling for anything variable. This is especially important if you have fluctuating demand, for example if you are a seasonal ecommerce business.
Automated resource provisioning can refer to:
A step further is implementing Kubernetes, which optimizes compute resource allocation by matching workload requirements rather than simply scaling the same resources up and down.
A third way to drive cloud savings is to review and choose more appropriate instance types. Start by profiling workloads to understand their specific resource needs and performance requirements. This will help to select the most suitable cloud services and configurations.
Two instance types you should consider are Reserved Instances (RIs), and Spot Instances (SI).
Reserved instances:
Spot instances:
Use your FinOps dashboards from Phase 1 to review your workloads and instance types with your engineering team, as there are most likely more price efficient combinations to consider. A hybrid approach will work best; reserved instances handle baseline capacity reliably, while spot instances cover fluctuating demands. This combination ensures cost efficiency while maintaining operational flexibility and resilience.
You may want to automatically balance between reserved and spot instances. Unfortunately autoscaling in AWS does not natively support switching between instance types (i.e. reserved to spot instances) within a single Auto Scaling Group (ASG). However, you can achieve this by using multiple ASGs:
***
Cloud cost optimization is an ongoing process that involves selecting appropriate resource types, and leveraging automation to balance performance and cost efficiency. Begin by discussing the following with your engineering team:
Asking these questions to your engineering team should uncover the source and solution to your growing cloud spend. But achieving lasting cost optimization requires a company-wide cultural shift. Subscribe to our newsletter for Phase 3, where we explain how to build a company-wide FinOps mindset and embed cost-conscious practices into your organization.
Cloud Cost Optimization is the continuous practice of minimizing cloud expenses without compromising performance. It involves matching resource capacity to actual workload needs, automating resource scaling, and strategically choosing between on-demand, reserved, and spot instances. This proactive approach prevents overspending, reduces waste from idle resources, and ensures cloud infrastructure is cost-effective and aligned with business growth.
Rightsizing Cloud Resources means adjusting the size and type of compute, storage, and network services to fit actual workload requirements. Many organizations over-provision by default, leading to waste. By profiling applications, using analytics tools, and monitoring usage regularly, teams can eliminate underutilized or oversized resources, typically saving 30-40% of cloud spend associated with idle capacity.
Autoscaling and Kubernetes Autoscaler help manage variable workloads by dynamically adjusting cloud resource capacity in real time. Autoscaling automatically adds or removes instances based on demand, while Kubernetes’ Horizontal Pod Autoscaler and Cluster Autoscaler manage containerized workloads and node scaling. These tools prevent overprovisioning, reduce manual intervention, and optimize costs by scaling resources only when needed.
Reserved Instances offer up to 72% savings over on-demand pricing by committing to long-term usage for predictable workloads. Spot Instances provide up to 90% discounts by leveraging unused cloud capacity but carry the risk of sudden termination. Using a hybrid strategy—running baseline workloads on Reserved Instances and flexible workloads on Spot Instances—maximizes cost efficiency while maintaining operational reliability.
Subscribe to our newsletter. Get exclusive insights delivered straight to your inbox.