Posts Tagged ‘auto-scale’


How To Create an Auto-Scaling Web Application on GoGrid (Part 1 – Theory)

Tuesday, April 23rd, 2013 by

Creating an auto-scaling web application is an ideal use of cloud computing. Although manually scaling your infrastructure is easy in the GoGrid cloud, programmatically controlling your infrastructure to scale automatically is an even better example of the power of the cloud. This scenario–an application that can increase and decrease its server count, and therefore capacity, based on the load it’s experiencing at any given time–makes IT professionals, sysadmins, and application developers alike extremely happy. And it’s also something you can build using out-of-the-box tools in GoGrid.

We’ve divided this topic into two articles:

Part 1 (this article) – The Theory of Auto-Scaling:

  • Background: traditional vs. cloud hosting
  • Programmatically architecting a solution
  • The underlying Orchestration methodology

Part 2 – A Proof of Concept of Auto-Scaling:

  • Do-it-yourself Orchestration
  • Proof-of-concept examples

(more…) «How To Create an Auto-Scaling Web Application on GoGrid (Part 1 – Theory)»

What is Auto-Scaling, How Does it Work, & Why Should I Use it?

Monday, March 11th, 2013 by

When I think about the phrase “auto-scaling,” for some reason it conjures up the word “Transformers.” For those not familiar with the Transformers genre of cartoons, toys, games, and movies, it is essentially about cars that turn into robots or vise versa, depending on how you look at it. When they need to fight or confront a challenge, Transformers will scale up from a vehicle (a car, truck, airplane, etc.) into a much larger robot. Then, when the challenge subsides, they scale back down to a vehicle.

Transformers 4 Movie

Image source:

Scaling Explained

Scaling – in terms of infrastructure – is a similar concept, but applied to the horizontal or vertical scaling of servers. Horizontal scaling means adding (or removing) servers within an infrastructure environment. Vertical scaling involves adding resources to an existing server (like RAM).

Let’s look at an example. An author of a content creation website may write an article that attracts the attention of the social media community. What starts as a few views of the article per minute, once shared by many in social media, may result in hundreds or thousands of requests for this article per minute. When this spike in demand occurs, the load to the server or servers handling the website’s content may experience extreme load, affecting its ability to respond in a timely manner. The results can vary from long page loads to the server actually crashing under the additional peak load. In the past, this scenario used to be known as the “Digg effect” or “Slashdot effect.”

Although this type of success is great publicity for the author, it’s bad for the brand hosting the content. And, if users encounter slow or inaccessible websites, they’re less likely to return for other content at a later point, which can eventually result in a loss of revenue.

(more…) «What is Auto-Scaling, How Does it Work, & Why Should I Use it?»