In the last post, I described some of the new challenges associated with the Cloud based infrastructure.
Based on the observations, what shall be the right strategy then to tackle those challenges?
1. Have a clear capacity scaling strategy and establish regular architecture/capacity review
When deploy or migrate workloads to Cloud, always define a scaling strategy. Both for scaling up and down, so that this can quickly be implemented when need is there. Cloud suppliers are constantly creating new services and solutions that users can benefit through simplification of deployment, automation of workflows and intelligent monitoring. Thus it could be smart to have an architecture/capacity review for the solution on a yearly or 6-month basis to answer the following questions:
Is the current infra and architecture correctly set for the current workloads and usage patterns? Do we need to modify the capacity scaling strategy?
Is there any new solution or service that could improve stability, performance or cost efficiency of the solution?
How can we improve the scalability by de-coupling of application front- and backend architecture?
Based on the business forecast, what changes we would need to make on architectural level to meet the new demands (whether it is to host more users, more locations, new integrations or reduce costs)
2. Know your workloads
Understanding the workloads means understanding what is driving the cost for the Cloud bill. In the Cloud based infrastructure delivery, there is no more cost leverage on sourcing of infra operation suppliers, HW volume discount or facility sizing. Instead the cost leverage will come from stringent workload management and data archiving disciplines. Knowing your workloads means:
Understanding what is behind the temporal resource utilization variations
Understanding what usage behavior or DB queries or access patterns are resource demanding
Understanding possible resource bottlenecks for the workload patterns
Tuning of workload means:
Avoiding peak concurrent usage through batch jobs or pre-processing of reports
Separation of collision workloads with heavy resource demand
Stringent clean-up and archiving of historical data to reduce search and query time
Create more parallel and multiple processing queues through application tuning and utilizing application native scaling capabilities
Tune buffer space and other temporary caching for database layers
Etc.
3. Monitor, monitor and monitor
Fortunate enough, current Cloud suppliers provide extensive amount of monitoring data for your Cloud resources. Challenge remains for IT to have the right staff to be able to analyze and interpret data and summarize information for application owner and business stakeholder to make right decisions. Those are practices for traditional capacity managers and analysts with established practices and methodologies.
4. Follow-up cost on monthly basis in close dialog with business stakeholders
New challenge for capacity managers in the Cloud era implies their follow-ups is not only about resource utilization, workloads usage pattern but as well monthly costs for the involved resources in the Cloud bill. Establish clear relationship between data and have the capability to carry out right dialog with business stakeholders in answering following questions:
What is costing how much in the application landscape?
Why the cost is increasing?
What can be done to bring the cost down?
In traditional ITIL language, this is what we called “Business Capacity Management forum”.
For a seasoned capacity manager, the Cloud landscape possess new opportunities for automation as well as similar need for traditional workload management. The game-changing parameter is the immediate cost association with the practice. Being able to relate the capacity and workload management directly with cost impact immediately is a scenario that not existed in traditional datacenter model. This is a great opportunity for Capacity Management practices to demonstrate value for business.
Hong Zhu
hong.zhu@cmiconsulting.se