In message < > someone
claiming to be "TheScullster" <> typed:
>We have a HP ML370 G6 Domain controller - 1 year old Windows Server 2008 R2.
>The OS is on a mirror, but recently dropped one of the drives.
>Basically the drive showed the orange failure light, but when the server was
>rebooted, it rebuilt the array and operated fine for approx 1 month (till
>today
).
>
>Again the same drive is showing failure.
Almost like the drive is bad...
>OK so this time I will suggest that our IT support company (I don't do
>server work) replace the failed drive rather than allowing a rebuild.
>But what else can cause this, and how can I build resilience against this
>type of failure?
To be entirely honest, I'd be a little... annoyed... if a drive reported
as failing wasn't immediately removed from service and replaced in the
first place.
Maybe that's just me.
>Fail-over server perhaps?
A fail-over server is certainly an option. RAID-10 or RAID-6 would
isolate you from two simultaneous failures, at the cost of needing at
least 4 different drives. If you're in a situation where
possibly-failing drives might be put back into service, this might be
worth the minor up front cost.
It really depends on how sensitive to downtime you are, the odds of two
simultaneous failures are (IMO) fairly low. However, if the downtime of
restoring from backups or rebuilding will cost you more than the cost of
a couple extra drives, the math is easy.