Same issue here...
how many of you are on Xserve (Early 2008) with an Apple Raid card and SATA drives??
there's an interesting thread that someone just started in hardware>xserve that I'm watching due to the eerie similarities that have made me start to wonder whether it's actually software or hardware.
(see the crosspost in
http://discussions.apple.com/thread.jspa?threadID=2142070&tstart=0)
Here's what I've got:
* Xserve (Early 2008) (OS 10.5.8) 16GB RAM with Apple Raid card using three 1TB SATA Drives as raid 5
* network homes from iMac g5's and iMac intels all running 10.4.11 averaging about 50 users at peak times, normally about 25-30.
This setup worked perfectly for about 4 months (running server 10.5.4) and then for no explicable reason things started going south. we hadn't installed any updates, software, or made hardware or network changes. Since then, once about 20 users log in, we have SWOD everywhere, taking 5 minutes to log in, AFP eating all 8 cores to about 780%, for about 3 minutes at a time with a few minutes of normal processor use in between. There are times that even when the processor is quiet it takes over 3 minutes to do a local admin login ON CONSOLE!.
I backed everything up, did a CLEAN install of 10.5, patched it to 10.5.8, recreated all the users by hand in case there was something hosed with the OD database, and within 3 hours of normal use everything was back to being slower than death.
I've tried just about everything I could find... all the fixes in the Apple Client Management whitepapers, as well as some other possible fixes from the forums, more disk and permission repairs than I can count, raid verifications, etc.etc.
I did notice however that during the high AFP load, disk I/O drops to nearly nothing. less than 200k/sec PEAK. (that raid card can do peak transfer of 300M/sec) even with nearly no I/O activity, all the drive lights are on nearly solid. Raid utility says everything is good, write cache is on, battery is not conditioning... it should be moving like a speed demon, but it's not.
I replaced the RAID card, in case it was the issue, but no luck.
perhaps one of the drives is failing but not in a way that SMART or the controller can pick up?? I just picked up a spare 1TB drive module to test that theory, but that means swapping the drive and waiting all weekend for the array to rebuild itself before we can check performance. for each of the 3 drives...
right now we're in the process of setting up a modified login (like NHR) that just maps documents, pictures, movies, favorites, and safari bookmarks to their AFP home folder to cut down on the overhead, but I'm not sure that even that will help.