<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Stop Buying Servers &#187; reliability</title>
	<atom:link href="http://stopbuyingservers.com/tag/reliability/feed/" rel="self" type="application/rss+xml" />
	<link>http://stopbuyingservers.com</link>
	<description>Cloud Computing for Small Business</description>
	<lastBuildDate>Tue, 20 Apr 2010 21:31:11 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Google Gapps</title>
		<link>http://stopbuyingservers.com/2009/02/google-gapps/</link>
		<comments>http://stopbuyingservers.com/2009/02/google-gapps/#comments</comments>
		<pubDate>Thu, 26 Feb 2009 14:11:11 +0000</pubDate>
		<dc:creator>David Schrag</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[reliability]]></category>

		<guid isPermaLink="false">http://stopbuyingservers.com/2009/02/google-gapps/</guid>
		<description><![CDATA[One oft-stated concern about moving applications to the cloud is reliability? What happens if the cloud goes down? Google Apps Premier Edition had an outage earlier this week. Here’s how they explained it: Dear Google Apps customer, Between approximately 9AM to 12PM GMT / 1AM to 4AM PST on Tuesday, February 24, 2009, some Google [...]]]></description>
			<content:encoded><![CDATA[<p>One oft-stated concern about moving applications to the cloud is reliability? What happens if the cloud goes down?</p>
<p>Google Apps Premier Edition had an outage earlier this week. Here’s how they explained it:</p>
<blockquote><p>Dear Google Apps customer,</p>
<p>Between approximately 9AM to 12PM GMT / 1AM to 4AM PST on Tuesday, February 24, 2009, some Google Apps mail users were unable to access their accounts. The actual outage period varied by user because the recovery process was executed in stages. No data was lost during this time. The root cause of the problem was a software bug that caused an unexpected service disruption during the course of a routine maintenance event. The root cause of this unexpected service disruption has been found and fixed.</p>
<p>Additional Details</p>
<p>A few months ago, new software was implemented to optimize data center functionality to make more efficient use of Google&#8217;s computing resources, as well as to achieve faster system performance for users. </p>
<p>Google&#8217;s software is designed to allow maintenance work to be done in data centers without affecting users. User traffic that could potentially be impacted by a maintenance event is directed towards another instance of the service. On Tuesday, February 24, 2009, an unexpected service disruption occurred during a routine maintenance event in a data center. In this particular case, users were directed towards an alternate data center in preparation for the maintenance tasks, but the new software that optimizes the location of user data had the unexpected side effect of triggering a latent bug in the Google Mail code. The bug caused the destination data center to become overloaded when users were directed to it, and which in turn caused multiple downstream overload conditions as user traffic was automatically shifted in response to the failures. Google engineers acted quickly to re-balance load across data centers to restore users&#8217; access. This process took some time to complete.</p>
<p>The recently launched Apps Status Dashboard includes greater detail on this February 24th incident, including actions we are taking to continually improve performance. For a direct link to this Incident Report, visit <a href="http://www.google.com/appsstatus/ir/1nsexcr2jnrj1d6.pdf">http://www.google.com/appsstatus/ir/1nsexcr2jnrj1d6.pdf</a> (English only).</p>
<p>For ongoing service performance information, please access the Apps Status Dashboard at <a href="http://www.google.com/appsstatus">http://www.google.com/appsstatus</a> (English only).</p>
<p>We are very sorry for the inconvenience that this incident has caused. We understand that system problems are inconvenient and frustrating for customers who have come to rely on our products to do many different things. One of Google&#8217;s core values is to focus on the user, so we are working very hard to make improvements to our technology and operational processes so as to prevent service disruptions. We are confident that we will achieve continuous improvements quickly and persistently.</p>
<p>We will be extending a full 15-day SLA credit to all affected Google Apps Premier customers for the month of February. This credit will be applied to customer accounts automatically so there&#8217;s no action needed on the part of administrators. </p>
<p>Once again, we apologize for the impact that this incident has caused. Thank you very much for your continued support.</p>
<p>Sincerely,</p>
<p>The Google Apps Team</p>
</blockquote>
<p>Cloud computing detractors will point to outages like this as evidence that businesses should not rely on the cloud for mission-critical applications. Cloud computing supporters will counter that any business, especially a small business, is likely to experience system downtime even if everything is located on premise. There’s no right answer, except to say that if you are going to rely on a host in the cloud to run your business, you’d better make sure that the host knows what it’s doing. What you DON’T want is to have your systems running on equipment and backed by procedures that are no better than what you could have purchased and maintained yourself.</p>
]]></content:encoded>
			<wfw:commentRss>http://stopbuyingservers.com/2009/02/google-gapps/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
