WEIRD MacPro softRAID problem HELP

MacPro RAID 0 problem I NEED HELP

I am having a weird RAID problem with our new MacPro 2.66, MacOS 10.4.8, 5GB RAM, 4 x 500GB RAID 0, 1.8 TB in internal bays configuration, drives have been formatted individually, and the RAID 0 have been created with Apple Disk Utility - original 250GB drive moved to the optical bay. In service for a week or so.
The RAID mount at start-up, can be open, and show the content. After 30 sec. I got a “bad device removal message“ and the RAID is gone.
We have more then 500GB of photos on it and NO BACK UP.

The problem first show up when I was writing DVD backup set with Toast 8 (yes, I was doing it) and I plugged a LaCie FireWire 800 drive in the front port. That drive is good and mounting on other Mac. Seem unrelated to me, I mention it just in case.

I checked the controler of the 4 x 500GB drive with TechTool Pro, test passed.

Disk utility, first aid, failed with the following report:

Verify and repair disk RAID Mattera
Can't open /dev/rdisk3: ressource busy
1 non HFS volume checked
1 volume could not be repaired because of error.

Disk utility RAID tab is showing that the RAID is offline and that the 465.4 GB - RAID Slice (disk0s2) failed. The 3 others one are offline. As I cannot find a way to insert my snapshot I am reproducing what I can see on the left window.
465.8 GB Maxtor 7H500f0
465.4 GB disk0s2
465.8 GB Maxtor 7H500f0
465.4 GB disk1s2
465.8 GB Maxtor 7H500f0
465.4 GB disk2s2
1.8 TB RAID Mattera
disk3
465.8 GB Maxtor 7H500f0
465.4 GB disk4s2

What is not normal, in the left window, is the disk3 right under 1.8 TB RAID Mattera. Was not there before if I recall correctly.
More, as far as I can recall, my RAID set was made of 465.4 GB - RAID Slice (disk1s2) (disk2s2) (disk3s2) (disk4s2).
The RAID had consecutive number when created, disk1s2 to disk4s2.
There was no Slice by the name of (disk0s2) in it.

The problem seem to be there as Disk utility, first aid report state:
Can't open /dev/rdisk3: ressource busy

Does the RAID setup is in PRAM? Is there a corrupted preference file, visible or invisible that can be removed.
I am lost with that problem. I need an expert advise.

I would appreciate greatly any help on this one. I cannot loose the content of the RAID.

Jacques (I am French speaking, sorry for my English).





MacPro 2.66 Mac OS X (10.4.8)

Posted on Feb 3, 2007 4:02 PM

Reply
17 replies

Feb 4, 2007 1:32 AM in response to jacques.paul

I'm having a nearly identical situation with the same Maxtor MaxLine Pro drives you are using. I hadn't thought of Toast being a problem but the first time this happened to me I was also working in Toast 8 trying to burn additional copies from a VIDEO-TS folder. I was working off the internal raid with no additional firewire drives connected.

At first, the drives would dismount shortly after startup like they are a firewire or USB device. I've received the same error messages you've received.

Disk Utility won't fix it, indicating one drive has failed and the other is offline. In my case, it alternates between the two drives as to which has failed and which is offline, never the same answer twice in a row.

DiskWarrior rebuilt the directory a couple of times, and it's hardware test indicated there were no problems (as did Apple Hardware Test), but they would continue to dismount upon restart into OSX. Now DiskWarrior has stopped working and is indicating damage to the drives.

There have been no clicking or weird sounds coming from the drive so I suspect the directory is just hosed beyond repair. I don't know if the drive is salvagable and I'm uncertain at this point whether any of the data recovery software solutions will work on a RAID-0 striped pair. If not, we may be really screwed.

Feb 4, 2007 5:10 AM in response to beerstine

I beerstine

In some way that a good new.

Our problem is so identical (same drives, same operation) that I have to conclude that we just discover a hardware/software BUG in Apple top of the line MacPro.

That imply Apple responsibility and mean that we will be able to get help from Apple top level engineers. I known some well placed persons at Apple Montréal. I will call them monday to start the process.

Meanwhile I suggest that we establish a personal contact by email to exchange our name, address, phone number, MacPro serial number, and a screen shot of our RAID configuration disk utility/RAID tab. That gone help me with Apple people in Montréal.

Please email me at jacques.paul@videotron.ca

Feb 4, 2007 6:19 AM in response to infinite vortex

I Ned

I will try it monday and let you known. But I don't expect much of it - see beerstine post in the same tread:

“Disk Utility won't fix it, indicating one drive has failed and the other is offline. In my case, it alternates between the two drives as to which has failed and which is offline, never the same answer twice in a row.“

I can even try the 4 disks set in an other MacPro to make sure the SATA bus is not involved.

In order to do it I need to known where is the RAID config pref file in a MacPro; on the RAID disks, on the PRAM, or in the pref file on the start up disk

Feb 4, 2007 3:18 PM in response to beerstine

If I had a 4-drive RAID and 500GB not backed up, I'd be using SoftRAID 3.6
http://www.softraid.com

... which is what - by your title - I thought you were using,

and then use SynchronizePro! for scheduled backups, could be couple times a day to external SATA or Firewire. Maybe a hot-swap two drive enclosure (using 500GB drives like big floppies).

And Disk Warrior 4.

Feb 5, 2007 1:39 PM in response to beerstine

I beerstine

Additional news: seem that we are facing an Apple RAID bug.
The failed drive is never the same in my case - same as you.
The good new is that SoftRaid 3.6 can convert apple RAID and live the data untouched. Their tech support answer me as follow.

here is the trick... If the problem is an Apple RAID bug, we can probably
help. If it is a hardware issue, then not. It is hard to tell from this
message.

If there is another problem like yours, that is a good sign, though.

I don't know what to say. SoftRAID can convert your disks, but I can't
make any promises, of course.

I have not encountered this problem from a user lie yourself before, so
it is new to me.

Mark James
SoftRAID, LLC
mjames@ softraid.com

I am awaiting Apple answer before try it in last resort.
There is a hope

Feb 5, 2007 2:45 PM in response to jacques.paul

I have been using SoftRAID since 2.2.2 and OS 9.1 and it is really been rock solid.

Once and "if" it gets all the volumes and data back online, try and pull and backup the latest changes and such and then rebuild the array.

SR is good at notifying when there is an unrecoverable error so you know immediately when it happens. Or even recoverable errors.

My own experience, if using SoftRAID, is to not install a half dozen other utilities to monitor drives and smart status and just let it do its own thing.

Feb 5, 2007 4:43 PM in response to beerstine

I beerstine

First I would like to thank The hatter to introducing me to SoftRaid 3.6, it's seem to me to be the only way to go as Apple is not willing to recognise their BUG.

Second, I just downloaded my copy of SoftRaid 3.6, $129US, and if you are willing to try it to convert your Apple RAID to SoftRaid 3.6 RAID and see if that settle the problem I will be glad to send it to you even if I will have to buy an other copy. That is an offer!

Just email me to establish the contact: jacques.paul@videotron.ca
I will send back instructions to download it from a site.

Feb 11, 2007 9:06 PM in response to jacques.paul

I have seen the exact same problem happen twice now on my Mac Pro. Here is the configuration:

Mac Pro 3 GHz, 8 GB Apple RAM, nVidia FX 4500
5 internal SATA drives (all WD1500ADFD) one drive (optical bay) is used for system, 4 others are used in Apple RAID 0 (stripe) for data.

Two of the RAID failures happened while my backup software was running a backup script (Personal Backup X4) that clones the RAID to external fw800 drive. These failures reported the following in the system log:

kernel[0]: AppleRAID::completeRAIDRequest - error 0xe0030005 detected for set "Stripe" (0D6F397D-04AB-4D12-A634-73E668D9B6F4), member 8EE005C9-1F88-424E-896D-7600B54A3B05, set byte offset = 120291328.
Jan 29 09:33:59 MacPro kernel[0]: disk3: 0xe0030005 (UNDEFINED).

These failures were unrecoverable (tried using both DU and DW4) and required the RAID volume to be destroyed. Once taking the disks out of the RAID, all drives passed both the Apple Hardware tests, DW tests, and TechTool Pro tests. I then re-created the RAID stripe with the same 4 drives, and restored data to the volume. The RAID volume then performed without error for 34 days, after which time the exact same problem occured.

I've also had two other RAID failures that didn't require the RAID volume to be destroyed; upon re-boot, the journal was able to repair the data. These two failured happend when logging into an Parallels XP VM that was running from the RAID.

The two hard-crash failures that happened during the backup routine (where the RAID had to be destroyed) failed during the backup of the Parallels hdd file (corrupting it). I have posted in the Parallels forums without any helpful responses (and emailed their support team, but we all know how useless that is!). My backup software vendor (Intego) has responded quicky to my support requests, but tell me that it's a hardware problem, and not their software.

Apple doens't provide any technical support for RAID unless you purchase an Advanced Enterprise support plan (or something similarly named).

The S.M.A.R.T. status of these drives are checked daily (DW4), and have never reported an error.

After both hard-failures, all disks have passed all the diagnostics that I have run. What are the chances that this is indeed a hardware failure??? I have been able to recreate the RAID after both failures, and continue to use it without problems for many weeks (actually, exactly 34 days each time) - writing significant amounts of data to the RAID volume (~60% full) without having any kinds of problems.

There seems to something here that is affecting all of us, and it seems that all that we're going to get for support is 'finger-pointing' and "don't use raid" responses.

During one of the DW repair attempts, I saw an error that read something like "invalid sibling link" on the RAID...

Here's hoping that someone can get to the bottom of this - I'm tired of re-creating the RAID and restoring my data.

Mac OS X (10.4.8)

Feb 12, 2007 1:01 PM in response to jacques.paul

Happy ending for me

I am in the middle of a success story.
I buyed SoftRaid 3.6 this morning.
http://www.softraid.com/index.html
Converted all four disk to SoftRaid, (selected each drive individualy, used convert to SoftRaid tool, essentialy it replace the Apple driver by SoftRaid driver in the boot block, living the data untouched) and restarted the Mac.
WARNING: on an Intel Mac you have to be at Mac OS 10.4.8
The RAID was on the desktop and STAY on the desktop.
One of the disk have I/o error but only once in a while.
I have been able to copy to an other disk (firewire in my case). For now I already rescued 250GB out of 750GB and I am continuing. I opened a few files, they are good (RAW photos).
Sometime the Finder stop copying, I receive an i/o error message an after a while it just continue copying, sometime the finder hang and as I am at the finder level, I have to hard restart the Mac. But, once restarted, I can copy against until the new time.
Already order a replacement drive.
SoftRaid will have saved my *** on this one.
Let you known the end of the story but already I will post my results on some forums where people like me was having trouble with an AppleRAID.
If somebody want more info, screen shot, etc. just email me jacques.paul@videotron.ca

Feb 12, 2007 3:03 PM in response to jacques.paul

Happy days are here again! 🙂

When you get through this, and have the new drives, I like to zero the entire drive once. I also like to "benchmark" and put it through a rough set of ZoneBench (Intech Speedtools) tests along with QuickBench. There is even an Integrity and Media test that will help insure you don't have I/O errors (but SoftRAID will let you know) as well as it is able to map out bad sectors on the fly, without a reformat, should the need arise. ZoneBench can help find if there is a "slow" or bad area or zone on the drive, which would affect performance and may point to excessive remapped blocks.

Congrats!

Feb 12, 2007 3:32 PM in response to The hatter

I The hatter

I was so excited today that I almost forget to tank you for your suggestions and remarks about this problem.

I will follow your advices:I like to zero the entire drive once. I also like to "benchmark" and put it through a rough set of ZoneBench (Intech Speedtools) tests along with QuickBench.

Thank very much for your help.

You are the one who guide me to the solution.

I appreciate greatly.

Jacques

This thread has been closed by the system or the community team. You may vote for any posts you find helpful, or search the Community for additional answers.

WEIRD MacPro softRAID problem HELP

Welcome to Apple Support Community
A forum where Apple customers help each other with their products. Get started with your Apple Account.