Network home folder clients (10.8.2) freezing

Hi all,


I have a Mac OS X Lion Server (10.7.5, all updates, Mac mini server with TB RAID attached) serving network home folders to Mac OS X Mountain Lion 10.8.2 clients.


Some of our users are experiencing freezes that manifest shortly after login. It appears that the shared volume is no longer available, based on the following system log entries. Again, this only happens to some users, but those users have it happen consistently...


Any insight?


Mar 4 13:46:47 BMC-CCT3110-Chloroplast.local KernelEventAgent[47]: tid 00000000 type 'afpfs', mounted on '/Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers', from '//erink@www.ourURL.foo.foo/BMCusers', not responding

Mar 4 13:46:47 BMC-CCT3110-Chloroplast.local KernelEventAgent[47]: tid 00000000 found 1 filesystem(s) with problem(s)

Mar 4 13:46:47 BMC-CCT3110-Chloroplast.local KernelEventAgent[47]: tid 00000000 received event(s) VQ_NOTRESP (1)

Mar 4 13:46:47 --- last message repeated 1 time ---

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: ASP_TCP Disconnect: triggering reconnect by bumping reconnTrigger from curr value 8 on so 0xffffff802d71c370

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: ASP_TCP asp_tcp_usr_control: invalid kernelUseCount 0

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect started /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers prevTrigger 8 currTrigger 9

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: doing reconnect on /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: posting to KEA EINPROGRESS for /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: Max reconnect time: 600 secs, Connect timeout: 15 secs for /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: connect to the server /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: Logging in with uam 10 /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: Restoring session /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: ASP_TCP ReplayPendingReqs: replaying slot 7 with reqID 51198 afpCmd 0x44 on so 0xffffff802d71c370

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: get the reconnect token

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: ASP_TCP Disconnect: triggering reconnect by bumping reconnTrigger from curr value 9 on so 0xffffff802d71c370

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: ASP_TCP asp_tcp_usr_control: invalid kernelUseCount 0

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect started /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers prevTrigger 9 currTrigger 10

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: doing reconnect on /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers

Mar 4 13:46:47 BMC-CCT3110-Chloroplast.local KernelEventAgent[47]: tid 00000000 type 'afpfs', mounted on '/Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers', from '//erink@www.ourURL.foo.foo/BMCusers', not responding

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: posting to KEA EINPROGRESS for /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers

Mar 4 13:46:47 BMC-CCT3110-Chloroplast kernel[0]: AFP_VFS afpfs_DoReconnect: Max reconnect time: 600 secs, Connect timeout: 15 secs for /Network/Servers/www.ourURL.foo.foo/Volumes/User_partition/BMCusers
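To see how often each mount drops out, entries like the ones above can be tallied per mount point. This is only a rough sketch: the function name `count_afp_stalls` is made up for illustration, and it assumes the KernelEventAgent lines keep the exact `mounted on '...'` wording shown in this log.

```shell
#!/bin/bash
# Hypothetical helper: count "not responding" reports per afpfs mount point.
# Assumes the KernelEventAgent line format shown in the log excerpt above.
count_afp_stalls() {
    # $1 is the log file to scan (on a 10.8 client, typically /var/log/system.log)
    grep "type 'afpfs'" "$1" \
        | grep 'not responding' \
        | sed -E "s/.*mounted on '([^']*)'.*/\1/" \
        | sort | uniq -c | sort -rn
}
```

Running `count_afp_stalls /var/log/system.log` on an affected client should list each mount point with the number of times it was reported as not responding, most frequent first.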

Mac OS X (10.6.7), OS X server

Posted on Mar 4, 2013 12:12 PM

49 replies

Aug 3, 2013 6:37 AM in response to cafarom

Thanks, cafarom. I am testing this now and it appears to have done the trick for us as well. The problem no longer reproduces in our testing, and we rolled the fix out to affected users 3 days ago. So far, no instances of the bug have been reported since.


FWIW, I have also tried testing an upgrade to 10.8.4 server. This solved the mds issue, but presented a host of other AFP-related problems and performance issues.

Nov 7, 2013 5:15 AM in response to cafarom

I upgraded to OS X 10.9. The upgrade did not solve the problem for me, so I still use the LoginHook script.


At first I deleted every item within ~/Library/Caches at login (with the LoginHook script, as cafarom recommended).

This helped me in almost every case, but I had problems with FontExplorer X, which then took a very long time to start.


Now I only remove the folder ~/Library/Caches/Metadata/Safari/History which seems to be responsible for the problem.
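For anyone who wants to try the same approach, here is a minimal sketch of such a LoginHook. It is not cafarom's original script: the function name and the `dscl` lookup of the home directory are assumptions for illustration, and it also assumes root can reach the user's home folder at the time the hook runs.

```shell
#!/bin/bash
# Sketch of a LoginHook that clears only the Safari history metadata cache.
# Assumptions (not from this thread): the function name, the dscl lookup,
# and that root can reach the user's home when the hook runs.

clear_safari_history_cache() {
    local home_dir="$1"
    # Refuse an empty or relative path so rm can never resolve somewhere else.
    case "$home_dir" in
        /*) ;;
        *) return 1 ;;
    esac
    local target="$home_dir/Library/Caches/Metadata/Safari/History"
    # Remove only this one folder; the rest of ~/Library/Caches is left alone.
    if [ -d "$target" ]; then
        rm -rf "$target"
    fi
}

# loginwindow runs the hook as root and passes the user's short name as $1.
if [ -n "$1" ]; then
    # Look up the home directory through the directory service search path.
    user_home=$(dscl /Search -read "/Users/$1" NFSHomeDirectory 2>/dev/null | awk '{print $2}')
    # Never let a failed cleanup produce a non-zero hook status.
    clear_safari_history_cache "$user_home" || true
fi
```

A hook like this is usually registered on each client with `sudo defaults write com.apple.loginwindow LoginHook /path/to/script.sh`, where the script path is a placeholder. Note that the `awk` step would truncate a home path containing spaces.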


By the way: when I log in at the Mac as admin (local account) and mount the home folder of the network user account (user name 'cl'), I lose the connection to the volume and get the following errors in the log as soon as I try to open the folder /Volumes/cl/Library/Caches/Metadata/Safari/History/:

  • KernelEventAgent[87]: tid 54485244 type 'afpfs', mounted on '/Volumes/cl', from '//cl@172.16.10.50/cl', not responding
  • KernelEventAgent[87]: tid 54485244 received event(s) VQ_NOTRESP (1)
  • 07.11.13 12:58:33,650 KernelEventAgent[87]: tid 54485244 type 'afpfs', mounted on '/Volumes/cl', from '//cl@172.16.10.50/cl', not responding

Nov 7, 2013 8:11 AM in response to christoph.l

It's too bad to hear that upgrading didn't fix the problem. I was hoping that upgrading both the server and clients to 10.9 would fix this, since they would be using SMB2 and not AFP anymore, but it sounds like AFP isn't the root cause here.


Would you be willing to share your loginhook script? I've been having trouble getting the script from cafarom to work correctly and was hoping to see how you did it.


Thanks a bunch.

Jan 20, 2014 8:35 AM in response to Nicholas Woolridge

background:

I'm currently running OS X Server 10.6.8 on an older Xserve2,1 (2008) with RAID 5 across 3x300GB 15k rpm SAS drives - about as fast as you can get short of SSD and Thunderbolt. The network is the bottleneck, though - I'm using gigabit over cat5e. I only have 2 client iMac 2012/2013 systems running OS X Mavericks, and I'm trying to set up network home accounts for 4 people. I migrated their home folders off one iMac onto the server.


I've seen the same problem as others - shortly after login, when you get around to launching an app (be it System Preferences, Mail, etc.), folks experience app hangs. Clicking the Spotlight icon, I see that it seems to be indexing. Who has patience for this? Looking at the server, its RAID array is busy flashing lights, and when I check network IO, it's pretty heavily hit.


Some thoughts towards a solution:

1. Make sure your client computers are on gigabit connections, that cables are at least cat5e, and that everything in between is a switch (not a hub).

2. Relocate client caches to the local client drive following the tip here: http://lisacherie.com/?p=19

(NOTE: Adobe apps seem to be unhappy about caches being moved - see the other change for Acrobat in the link above).


A corrupt spotlight file will surely cause your users problems, so consider these final tips:

+ As admin, remove any hidden Spotlight index (~/.Spotlight-V100) from your client home folders (Spotlight will rebuild the user's index unless you disable Spotlight)

+ Disable Spotlight altogether if you don't need/want it

+ Drop Spotlight priority significantly: http://thegreyblog.blogspot.com/2012/04/is-mdworker-jeopardizing-your-resources.html

Using an admin account, carefully edit /System/Library/LaunchDaemons/com.apple.metadata.mds.plist

Near the bottom, on OS X Mavericks (client), change the false to true for the following key, then save your change and reboot:


<key>LowPriorityBackgroundIO</key>

<true/>


+ If you allow Spotlight to run, be patient when the user logs in for the first time on a specific machine and let Spotlight do its long, heavy-handed index work (after you clean the index the first time).
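A rough sketch of the Spotlight tips above as shell helpers. `mdutil` ships with OS X, but the function names are invented for illustration, the `mdutil` calls need root, and nothing in this thread confirms how `mdutil` behaves against an AFP-mounted home, so treat this as a starting point only.

```shell
#!/bin/bash
# Sketch for the Spotlight tips. The function names are made up for
# illustration; the mdutil calls must be run as root.

# Print "true" or "false" for LowPriorityBackgroundIO in an XML-format plist.
# Assumes the value tag sits on a line after the key, as in the snippet above.
low_priority_io_value() {
    awk '/<key>LowPriorityBackgroundIO<\/key>/ { found = 1; next }
         found && /<(true|false)\/>/ { gsub(/[^a-z]/, ""); print; exit }' "$1"
}

# Turn indexing off on every volume the client currently knows about.
spotlight_off_all_volumes() {
    mdutil -a -i off
}

# Erase the index on one volume so Spotlight rebuilds it from scratch.
spotlight_rebuild_index() {
    mdutil -E "$1"
}
```

After editing the plist and rebooting, `low_priority_io_value /System/Library/LaunchDaemons/com.apple.metadata.mds.plist` should print `true`, provided the file is stored as XML text. `mdutil -s /` reports the current indexing status of the boot volume.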


Added further details: ziondotcom

Jan 28, 2014 8:36 AM in response to cafarom

Yeah, I would almost consider Network accounts deprecated. Apple doesn't care and doesn't take the bug reports seriously.


Like cafarom said, living without Spotlight is awful. We did it for a whole year, and there are still more issues with network accounts besides just this one. They just don't work anymore. Mobile accounts are unfortunately the only thing left.


The login/sync times aren't too bad with mobile accounts if you enable server-side file tracking (it used to be an option in 10.6, but now it's hidden): http://krypted.com/mac-os-x/enable-server-side-file-tracking-in-os-x-mountain-lion-server/


My account is 55 GB, but with server-side file tracking enabled it only takes about 6-8 minutes to log in or log out. We tried mobile accounts a few years ago and were getting all sorts of sync conflicts continually, especially with iTunes, but lately it's all been working well...too well perhaps.


Anyway, please give mobile accounts a try, it's the only thing left.

Mar 4, 2014 4:17 PM in response to ServerBurninator

Hi bug-friends,


Apple just reopened my long since closed bug report with this question:


Engineering has requested the following:


Are you still seeing this issue with OS X 10.9.2 and Server 3.0.3 or later?


We've long since abandoned Network Accounts for Mobile Accounts, but I wanted to pass this info along and ask if 10.9.2 seems to fix the issue for anyone? With any luck, maybe they're actually trying to fix it?


Let me know, I'll include what you guys say in the report.

Mar 4, 2014 5:42 PM in response to ServerBurninator

Nope. Same problem with 10.9.2. After rolling out 10.9.2 our shared group folder, which has been otherwise stable, now also occasionally drops out for some users.


To share my experience, I'll say that one reliable way to resolve the issue while it is happening is to go to the server and explicitly kick the hung connection. At that point, the client computer (which has the user's home mounted off the server) unfreezes and at least lets me restart the client Mac. (If the connection is not kicked, only a hard reset brings the client machine back to life.)


Should have never moved from 10.6. *sigh*


As a side note, I came across a couple of very special Word files sitting in an AFP share. Trying to open these files (Quick Look or otherwise) hangs the AFP connection instantaneously. Perhaps the underlying cause is the same. If Apple support is interested in looking into this, I would love to collaborate.

Mar 4, 2014 6:10 PM in response to k8n

Thanks for the reply. That's a shame to hear it's still broken.


That's interesting - those Word files do sound really suspicious. Maybe that's why disabling Spotlight from indexing them fixes the problem?


What are your client and server versions? If they're both using 10.9.2, then they should be using SMB2 and not AFP, so I'm wondering if those Word files would still crash their connections or not?


Yeah, I'm sure they would love to see those files. I can attach them in my reply if you want, but it would probably be best to open a new issue here: https://bugreport.apple.com


Let me know, thanks.

Mar 4, 2014 6:23 PM in response to ServerBurninator

I agree about filing the bug reports under https://bugreport.apple.com

Aside from calling AppleCare after you install OS X Mavericks Server and getting help during your 90 days after purchase, filing a bug report is about the ONLY way to notify Apple of bugs. The forums are not the place to report to Apple.


While I agree that Apple introduced a new implementation of SMB2 that is supposed to be more reliable, notice that 10.9.2 just included fixes for SMB2 reliability. How long has AFP been around compared with SMB2? I think that speaks to which is more reliable for now. Going forward, I would expect SMB2 to eventually be a good choice.
