Welcome to the Second Life Forums Archive

These forums are CLOSED. Please visit the new forums HERE

About the outage (geek questions)

Krazzora Zaftig
Do you have my marbles?
Join date: 20 Aug 2005
Posts: 649
12-29-2006 13:39
After reading the blog it's my understanding that the battery repalcement and reactivation of that machine caused a restriping to occur. How does this calculate into the other drives failing or was it some sort of random occurance that happened? Also there was alot of talk about poor Distaster Procedure at LL that lead to this according to many a suppossed network engineer that has an avatar in SL. Is this something that under normal conditions would have been "planned for"? Also will there be any public notice as to how LL will make a DP to prevent this from happening agian? If the harddrive failures were not related to the battery issue was there any sign of it having problems BEFORE the update?

I guess in essence there was alot of screaming about Disaster Procedure and Recovery. Is this and was this due to a poor one, just a small tweak, or a no impact? Also how stumped was the Vendor if I can ask? ^_^
_____________________
Robin Linden
Linden Lifer
Join date: 25 Nov 2002
Posts: 1,224
01-02-2007 10:53
According to our ops team, the battery replacement was unrelated to the drive failures in the asset server cluster, although the increased activity of the cluster repair after the battery replacement could have tipped drives over that were borderline before.

So... do we have a disaster recovery plan? Yes.
Is it a good one? Yes, although obviously things could have gone better.
Are we changing it in response to this event? Yes, we're improving it based on what we've learned.
_____________________