Save
40%
Instantly

Use coupon code OCT2011

  • DAYS

  • HOURS

  • MINUTES

  • SECONDS

HURRY UP!
OFFER EXPIRES IN:

The "StableHost" blog

The only time it matters: Handling Downtime.

We’ve worked with a certain provider many times throughout the last few years; they’ve hosted external services for us that we’ve used internally that needed to be outside our network. This post isn’t to slam this provider, in fact we won’t be naming the provider, however the lesson we learned that day was critical to us and we’re making sure our customers never experience the same lesson as we did.

It’s 1AM, we started to get paged regarding a certain internal service that was down, this particular service was hosted on a VPS at another provider’s network. We opened up a ticket with them as many of our own customers would do and the first response we received was “It’s not down, it’s just slow”. While that answer may be perfectly acceptable in certain situations, when we can’t ping or ssh into the server at all, it doesn’t matter if it’s “just slow”, it’s down for us and that’s all that matters. We replied back letting them know not only was it “not slow, but completely down” we didn’t hear back for quite some time.

We jumped on their live chat a few hours later and asked what’s going on, their response was ‘we’re working on it, no eta’. Few hours later, we replied to our ticket again asking for an update, no response.

12 hours later, our server is accessible again.

To this date, we have not yet received a response from them as to what happened, no apology, nothing.

So, what did we learn from this?

  • Downtime will happen regardless! It doesn’t matter if you’re Joe’s Small Hosting Company or Google. You will have outages big and small, computers have a limited time span before they finally fail on you.
  • Communication during an outage is #1. We’ve learned customers are a lot more understanding knowing what’s being done to bring the server back up, even if they don’t exactly understand the “tech lingo”. There should be status updates via twitter and facebook at least hourly and constant updates to support tickets. No communication during an outage is the worst thing you can do, however it’s also the easiest thing to do as you’re busy fixing the server, the last thing you want to do is update people on the status. We believe, in our situation, the entire server had to be restored and in that case 10+ hours is perfectly acceptable as there’s a lot of work to be done, however without any updates from the company you automatically assume the worst.
  • Follow up with customers after the fact, be truthful and learn from your mistakes. If your customers have been down for the past few hours, they deserve some kind of notification letting them know what happened, why it happened and what you’re going to do to prevent it from happening again.

You can run the perfect company in terms of pricing, support, have a great looking website and so on, however if you forget the simple things such as communication when it matters most, you’re not going to succeed in this business.


Unhappy? Speak up!

Several years ago, prior to working here, I was a manager of a technical support organization. Every few months, I’d have an employee come to me and let me know they were leaving the company because they were unhappy with a certain aspect of their job. I always asked them why they didn’t let me know prior and always felt like a failure because as a manager it’s your job to realize when people are unhappy and make them happy again. I could never get a honest answer from them, I just assume employees feel it’s easier and less risky to find another job then to to be honest with their boss.

Similar situations has haunted me here, however it’s not with employees but customers this time. Very few customers will write in and tell us they are unhappy with us but instead change web hosts and we find out months down the road when they write a thread on a web hosting forum letting people know why they don’t recommend us.

As we run a shared platform, we have to use technology and standards that will fit ‘most’ customers, now this doesn’t mean every customer is going to have a positive experience because we know that won’t happen. However, in most cases, we can always make the customer happy by moving them to another server, adjusting a server configuration or upgrading them to another plan/service. I can count on my right hand how many times we’ve exhausted all resources for a single client and end up letting them know they are better off with another service because we simply can’t help them. That feeling really sucks and we’re lucky we don’t have to do it very often.

I encourage everyone if there’s a problem in life, not just web hosting, be honest with the other person/company and in most cases you will see it’s a lot easier to work out the issue then to move on. It’s our job to make sure you’re happy and if we never hear from you, we will always expect there’s nothing we could do to make your experience a better one.


How exactly does the “Internet” work?

As a nerd, Internet routing comes up in every day conversations for us. For most people, paying a monthly fee to their Cable or DSL provider is the extent of how the Internet works, but how does something on your computer travel from one end of the country to the other in less then a second?

First off, you have to understand how it’s all connected together. The Internet isn’t a service that people sign up and you’re connected, it’s far more complex. The major Internet companies out there laid cables, known as fiber cables, it looks like this:

These cables are mostly under ground and we drive over them every day. The providers that own these cables are the main players of communication and are known as Level3, Sprint, QWest (CentryLink), AT&T, Savvis, etc. For the Internet to work, these cables need to connect each of these providers up with another provider. For example, AT&T has cables that connect into Sprint’s networking equipment all over the world. Companies then, such as us, buy a “port” on these major companies routers which allow us to connect into the “Internet”.

When you check your email, you send a packet to your Internet provider which has certain information such as where you want to go. Your ISP then check their database as to how they reach our network and send it on it’s way. It would be very expensive to buy ports on every ISP’s networking equipment so, so we rely on an Internet protocol called “BGP”. The BGP protocol will tell your ISP in order to get to us, you have to go to AT&T first, then Level3, and then Sprint and we have a port on Sprint’s networking equipment. In this example, it only changes locations 3 times, however in the real world, it’s normal for a single packet to change companies 10-15 times depending on how far away you are from where we are (and our servers).

It’s really incredible if you think about it, you are sending thousands and thousands packets to us every day. For example, when you check your email every 5 minutes, you send us atleast 20 packets just to ask us if you have any new email, so every 5 minutes, you are sending 20 packets from one end of the world to Phoenix faster then you can blink.

The Internet is an amazing thing, however since there’s so many moving pieces to the Internet and how it’s all connected together, there can be problems where you can’t reach your servers and it’s not our fault. For example, in our situation above where it went from your Internet company, to AT&T, and then Level3 to Sprint to us. If Level3 had an issue for whatever reason, your may not be able to reach your website.

We hope you enjoyed this, obviously there’s a lot more to the Internet as we describe here but that’s the basics on how things work.


Survey Results

We recently sent out an invitation to rate us on our performance to a few thousand people — we didn’t send it out to new customers because we wanted people to have a chance to experience us a bit more before reviewing.  We only expected 20 or so people to respond because let’s be honest, surveys are a pain.  We all get them constantly and they just annoy us — so seeing a few hundred people answered ours was amazing!

Normally survey results are kept fairly confidential — however, we were stunned by the results so we wanted to share with everyone.  Usually you won’t see companies release survey results because they are always negative, companies send out surveys to figure out “What’s wrong” and “Why sales are horrible”.  We don’t feel that way at all, however customer needs are always changing and we want to make sure we’re supporting our customers as much as possible…… and now to the results:

First off, we’re extremely happy to see the majority of people voted us “Excellent” in all categories — that is outstanding and we couldn’t be more happy.

As we expected, speed was the lowest rating (insert sad face here).  As we pointed out a few months ago, shared web hosting isn’t for every website out there.  If it’s a personal website where you store pictures and use for email, you won’t have speed issues, if it’s a small business website that you run, shouldn’t be a problem either.  Slow speeds come into play when you’re running multiple sites with database-driven applications (Huge blogs, forums, chat servers, etc).

In most cases, when you notice speed issues, it’s usually not due to the server being overloaded (even though that has happened due to various reasons in the past, it’s unlikely), it’s usually due to you running out of your own CPU/memory restrictions we place on each account.  We do this, not to make you mad, however to protect the server and protect the other hundreds of clients on our servers.  However, as we allow for each website to use “up to 10%” (not to be confused with use 10% constantly), a few bad apples in the hundreds of accounts can really slow things down, all it takes is 6 or 7 accounts and the server is running on 30% CPU idle for the rest of the hundreds of accounts, not really ideal.  We do our best to make sure our server’s are always monitored and they page us as soon as they notice CPU issues so if it comes to that point, we fix it before it becomes a major issue.  We recently released VPS plans (both budget — for those who are experienced system administrators who can need a VPS for a low-resource project, and enterprise VPS’s, which are for high end websites (high end blogs, forums, etc) who don’t want to upgrade to a dedicated server but want the benefit and speed of one.

However, our servers are getting a little bit of upgrade in the next few months (starting with the first one tomorrow, actually).  Each of our US servers has 12G of ram, we are upgrading all of our Phoenix-based servers to 24G of ram, doubling the amount of things we can keep in cache.  This will help increase speeds greatly across the board.  Some of our EU servers already have 24gigs, we’ll be rolling it out to all of them after we finish with our Phoenix-based servers.

The next survey question was regarding communication and how often we communicated with you.  Sometimes I feel like we communicate too much with you so I’m glad we asked this question — the results were not expected.   We’ll continue to send updates as much as needed — hopefully in the future, customers can check what kind of communication they want from us (outages, maintenance’s, features, etc) and so they only receive certain emails from us if they feel we’re sending too many.

The next question related to how we notify customers, we’re glad phone wasn’t a favorite.   It would take a long time to call thousands of people anytime we made a change — and we’re nerds, we hate talking on the phone!

The question related to referrals was a huge sign to us we’re doing the right thing — in most cases, you don’t refer other people to services which you aren’t happy with, therefore we’re very happy to see the results of that poll

As far as ‘What could we do better?’ this was a comment box with lots of comments — most of them say ‘Nothing, you’re doing a great job’ which is awesome!    However, I wanted to publicly address on the blog:


Concerns about our terms of service – If you are a newer customer of ours, you might have no idea what we are talking about here, but maybe 8 months ago, we completely re-did our terms with our lawyer and he wrote up about 25 pages of “lawyer-speak”.     We thought it was a good idea due to recent events at the time, however we received a lot of negative feedback about it.    We decided to change our terms a few months back to make it much easier to read — therefore, for those who complained about our terms, we invite you to check out our new terms and let us know any feedback you have!   It’s best to contact us directly regarding terms questions/comments.

Concerns we are getting too big and service will decline – Every big company has this issue, either they grow too fast and can’t support things or they start going cheap on things to make more profitable.     We’re doing our best NOT to become those companies, we’re not a fan of these “mega web hosting companies” that have millions of customers and 1000′s of employees because to them, a customer is nothing.    We’re growing at a steady pace that gives us the ability to not have hundreds of employees and we still try to treat each customer like they are a part of our family.      I hope we never lose that feeling as we continue to grow.     If it wasn’t for the customers, we wouldn’t be here — as people grow big, they forget that very thing.

Coupons for existing customers - The majority of our coupons on for new accounts only — this is pretty standard across most web hosting companies.   However, after reading some of these comments, we’re going to change some things up and start offering limited-time coupons to existing customers to encourage them to bring their additional accounts over to us.  When this happens, we’ll make sure to announce it!


That’s about it — we greatly thank each and every one of you for taking your time and answering our survey.


PHP 5.3 support

Over the past year, we’ve constantly gotten emails about why we run old versions of PHP.      The simple answer was “we can’t”.

When PHP created PHP 5.3.x, they rewrote a lot of the functions that PHP 5.2.x had, causing the majority of scripts out there to not work.       As a web hosting provider, we have to make decisions every day on what’s the best decision for our clients — do we enable PHP 5.3.x knowing that it will not work with older scripts such as WordPress, but allow newer scripts to work with our system such as the new versions of Joomla?   The cons of PHP 5.3.x always out-weighed the pros and therefore we always kept on the PHP 5.2.x branch….. until now.

I’m happy to report all of our Litespeed servers now support both PHP 5.2.x and PHP 5.3.x versions.     By default, our customers will always use PHP 5.2.x, however if you wish to use PHP 5.3,x, it’s a simple line to your .htaccess file.

AddType application/x-httpd-php53 php53 php

Enjoy!


Changes, Changes, and more Changes!

If you’ve been a customer of ours for at least a year, you’ve most likely been a part of one (or more) of the major changes we’ve made to our network.

Due to increased growth the last 2 years, we’ve been changing things non-stop to accommodate the growth.    Any person that has an internet connection can start a web hosting company, however very few people have the technical knowledge to continue to scale it as it continues to grow.

When we had only a few hundred clients, we leased our servers directly from the datacenter.    We started like every other web hosting company out there, with a $99 server.   As we all know — a $99 server can only go so far, a few clients and it’s maxed out.    As months went on, we started to buy our own hardware directly from Dell and placed it in a cabinet in Texas.    Several months later, we continued to add servers and realized we needed them to be locally here in Arizona.    Troubleshooting hardware problems remotely is very hard not something we wanted to do.    However, how do you move all of your clients from Texas to Phoenix?

Well, we could overnight the servers and hope the customers are okay with 24 hours of downtime, not ideal but the cheapest solution by far.   We decided 24 hours of downtime was unacceptable (hey! we have to live up to our name!) so we bought new hardware and duplicated our setup that we had in Texas.    We then migrated everyone from Texas to Phoenix, not a fun task for the customer or us.

When we originally moved to Phoenix, we were still using our datacenter’s network.    When you have your own servers, you have two options on how you configure your network.    You can take the easy and cheap approach which involves buying a switch and having a uplink to your datacenter’s network, they give you IPs, and you’re set!   We did that for a while and realized while we love our servers locally in Phoenix, we needed more control over the network.     We hate relying on other people if we can help it and the more things we have control over, the better.

Months later, we decided that we needed to build out an actual “network” in Phoenix.   By this, it involves buying routers, transit providers, our own IPs,  etc.   This has been a “work in progress” for the past year, constantly upgrading it and adding additional redundancy to it.

If you signed up with us in the past year in our US location, you’re “on our network”, while if you came from our Texas network, or signed up prior to the whole build out, you’re still on the “old network (datacenters network)”.   While there’s nothing wrong with the old network — it doesn’t mean your website is slow, etc — it does mean that we don’t have the control we want when it comes to DOS attacks, routing issues, outages, redundancy, etc.

Therefore, our last (hopefully!) network migration is to move all of the people who are on the old network to our own network.     So, how do you know what network you’re on?  You can determine it based on your IP address.    If your IP address starts with 199.x.x.x, you are on our network.    If you are on a 64.38.x.x IP, or a 184.x.x.x IP, you are on our older network.     We started this IP migration project as of today and starting with the server ‘cp20′.    We will be updating this post with our update to keep everyone informed on the progress of it as it’s a very slow and time consuming process.

Lastly, we greatly appreciate everyone’s patience as we continue to make our network better.    We are very happy with how our US network turned out after several changes and learning what works (and what doesn’t work) and hope you can understand and appreciate why we do it.

IP MIGRATION STATUS

-

cp20: As of 08/01/11 – All accounts have been moved over successfully.

cp22: As of 08/21/11 – All accounts have been moved over successfully.

cp23: As of 08/22/11 – All accounts have been moved over successfully.

cp24: As of 08/04/11 – All accounts have been moved over successfully.


Understanding the limits of shared hosting

One of the toughest jobs we have is making sure the customer understands the limits of shared hosting.

What exactly is shared hosting? Most customers have never heard that word before, they just know they buy web hosting from StableHost.com. Shared hosting is the term companies use where they purchase a server which is shared amongst all of their customers. A shared server can have anywhere from 100 to 2000 clients on it, depending on specs, load, etc. We have many shared servers that we spread our customer base out equally to each of them.

On average, web hosting companies will allow each customer to use a certain amount of CPU, normally this ranges from 10-15%. CPU is the most valuable resource when it comes to a server and therefore there is always a limit of how much each website can use. When web hosting companies state a 10% CPU limit, that means you can burst up to 10% (usually for up to a few minutes) but that doesn’t mean you can constantly use 10%.

We’ll explain why — let’s take the average server cost, $500/month for the server, rack space and bandwidth. If we allowed a customer to constantly use 10% of the server, that means we could only have 10 clients per server. That means each customer would have to pay $50/month and that gives us no overhead, no support costs, no paying employees, no upgrades, nothing. With the average web hosting cost of $5/month, that averages out to the customer paying for roughly 1% of the CPU.

99.9% of websites will fall under the 10% CPU, however the 0.1% of usually our highest submitting ticket customers. We run cloudlinux which basically limits each customer to 10% automatically. Before cloudlinux came around, we would have to watch for it constantly and suspend if we saw someone abusing resources. They make it easy on us now where the server does all the hard work and will limit a website if they are using more then 10%. When your website gets limited, it will either be extremely slow or you will not be able to reach it.

These types of websites are suitable for shared hosting:

- A personal blog or website
- A forum for a few friends or a small community
- A gallery of pictures
- Your small business website

These types of websites are NOT suitable for shared hosting:

- A large forum (over 30 people online at a time)
- A large blog (Over 10,000 visitors a day)
- Any type of web-based game

So what happens when you outgrow shared hosting? The next step is VPS which is your own virtual server. It’s where a web hosting provider will split a dedicated server into smaller pieces and allocate 1 piece to each customer giving them dedicated CPU/Memory resources. VPS’s with cPanel start roughly around $50/month and go up from there. Once you outgrow a VPS, the next step is a dedicated server which start usually around $150-$200/month.


We are IPv6 accessible!

On June 8th, 2011, it was World IPv6 Day!. As you’re most likely aware, IPv4 (our current IP addresses, known as 123.123.123.123) are about to run out. There’s roughly, 12,347,472 IPs left on the Internet which seems like a lot however it will be gone in less then a year.

When the internet was first “designed”, nobody could have imagined the internet would be as big as it is today and how almost every single person in the world is now online, therefore IPv4 IP space had to run out sooner or later and we’re now at that point.

They’ve been developing IPv6 for many years but up until the last few years, nobody really gave it the attention it deserves as of today, there’s still a lot of ISPs who have no interest in re-designing their network for IPv6.

Our control panel, cPanel doesn’t yet support IPv6 for customer facing websites, however they are currently working on it. However in the mean time, we’ve been busy getting IPv6 ready while we wait on cPanel!

As of a few weeks ago, our main website StableHost.com is now IPv6 accessible.

stablehost.com. 120 IN AAAA 2600:3c01::19:d000
stablehost.com. 120 IN AAAA 2600:3c03::1d:2000

As you can see, IPv6 is a lot different then you may be used to — the IP addresses are much longer and use letters as well. If you’re familiar with DNS, they also use AAAA records, instead of ‘A’ records that IPv4 uses. If you’re curious why we have two AAAA records for our main website, it’s due to the load balancing of our corporate website we have. Remember — we take uptime very serious!

I’m happy to also report that our customer-facing nameservers are also IPv6 accessible:

ns1.stablehost.com. 120 IN AAAA 2001:470:1f0e:b89::2
ns2.stablehost.com. 120 IN AAAA 2600:3c01::f03c:91ff:fe93:a830

Over the next few weeks, we will be deploying IPv6 across all of our Phoenix network — what does this mean for you? It means the minute cPanel comes out with a IPv6 version, we’ll be able to deploy IPv6 to our customers.


Failover? What failover?

Recently, our phone company grasshopper.com had a massive outage that lasted over 24 hours. The original outage was due to a raid failure which is bound to happen sooner or later. Hard drives are always the part that fails the most — however, that’s why you make backups, right?

Grasshopper.com decided to fail over to their secondary/failover site while they tried to fix the raid, however their secondary site didn’t work as expected. I’ve personally seen this time and time again — companies put thousands (sometimes millions) into a secondary / redundant setup and in the prime moment where you need it the most, it doesn’t work for you. It’s easy to forget about your secondary site — usually you never log into it, once it’s setup and working — people just let it run itself. However, is that the right thing to do? Of course not.

So how do we prepare for failures? There’s three major failures we’re concerned about, we’ll outline them below:

Data loss – As I mentioned earlier, hard drives are not reliable — even at times, raids aren’t reliable, however we do our best to make sure customer data is safe. We run raid on all of our servers except 3 — those servers are very old and will be getting replaced shortly. By running raid, it gives us the safety net knowing if one drive fails, the server is still going to stay up and continue processing. So, what happens if we lose two drives at once? In the unlikely situation that we do lose multiple drives at once, we have local backups on the server. We do local backups several times a week syncing customer websites to a secondary drive on the server, not related to the raid. Therefore, if the raid dies, we still have local copies of the data. So, what happens if both the raid dies and local backup drive dies? That’s when we pack up and go home for the day. We wish ;-) , however we have yet another method to make sure your data is safe — we sync our local backup drives to a secondary backup server weekly as well. We hope you understand now how serious we take keeping customer data safe. We have developed custom, in-house scripts to monitor that data and make sure it’s always in sync with each server. Lately, we always encourage our customers to take backups of their own data as well. You can download a full backup of your site by logging into cPanel -> Backup -> Generate Full Backup.

System Failure – Although hard drive failures are most common, an entire system failure can happen as well. Yes — we have solutions for these types of failures as well. In Europe, where we lease dedicated servers, the datacenter will replace the faulty parts for us and we’ll be back up and running in no time. In Phoenix, where we buy all of our own hardware, it’s a bit different. We have spare servers with the exact same specs sitting in our cabinets waiting in case of a failure. We buy premium Dell equipment which is brand new, therefore to-date, we’ve never had a system failure, however does that mean we shouldn’t spend the money to have extra machines sitting there doing nothing? No! If a system does fail, we can swap the drives and the customer will be back up in no time — and we can spend the rest of the day troubleshooting with Dell on what went wrong and fix the bad hardware without having the customer suffer.

Networking Failure – The last part failure we are concerned about is a networking failure. While we don’t run our own network in Europe and therefore don’t need to worry about it there — we do in Phoenix. In Phoenix, our entire network was designed by us and all equipment is owned by us. As we prefer to have complete control over our network, it also is prone to failures. To prevent a networking failure, we have redundant switches and routers sitting in our cabinets, similar to our server setup. In the event one dies, we will move everything to a spare and they will be back up. As far as providers, we currently have 3 network providers and have configured our network that if one has an outage, it will automatically switch to the other one within seconds. We do, unlike our phone company, test our failover and make sure it does work as designed.

We hope you have a better understanding of how we stand by our name, Stable Host. It’s not cheap by any means to have redundant everything, but it’s worth it in an outage. Every company will have outages from time to time — it’s a matter of who’s going to be prepared for it or not.


Phone Support

We’ve been going back and forth on the topic of having phone support for technical issues for a few months now.    We stopped billing support over the phone a while back and it’s worked out well for us.   We preferred having all billing issues written down and documented — plus, we didn’t want our techs to be taking credit cards over the phone for security reasons.

We recently revisited the topic of moving technical support to email only so we took a list of the last month of calls and looked at why they called, the results are below:

  • Unable to reach their website
  • Password resets
  • Escalated issues (scripts not working, not receiving email, etc)

We then looked at how to solve each of these issues:

  • The fastest way to find out if there’s a server problem is view our twitter page, which has up to the minute information of all of our servers.
  • Password recoveries were never done over the phone due to security reasons.    We built tools on resetting passwords via the customer portal.
  • Escalated issues required a ticket put in to have our tier-2 email support investigate.

We only had tier-1 support via phone who was there to answer basic questions, logging into cPanel, setting up email, etc.   Any other issue that came in got escalated to email, which delayed the resolution time and I’m sure frustrated the customer.

Therefore, we made the decision at this time to remove our tier-1 phone support, which gave us the ability to bring on additional tier-2 email-only support members.    Therefore, any ticket that comes in will be handled by an experienced tech and now very few issues will end up getting escalated.

If you’ve never emailed our support, we want you to know we have one of the fastest response times in the hosting industry.  99% of tickets are answered within 1 hour, 75% of them within 10 minutes.

 

 


RatePoint Business Reviews