SOCD report detected: (AP watchdog expired)

Mac Mini M1 16GB keeps crashing and then restarting randomly on all applications with following error message:

I was asked by Apple second level support to provide details of the crash reports logged when it happens. 

It is [SOCD report detected: (AP watchdog expired)


SOCD report detected: (AP watchdog expired)


Even when idle. 


Steps taken with 2nd level support:

  1. MAC OS REINSTALL.
  2. MAC DISK ERASE AND REINSTALL FROM ICLOUD.
  3. ISSUE ALSO HAPPENS IN GUEST MODE LOG IN.


Mac mini, macOS 11.2

Posted on Feb 11, 2021 2:23 AM

Reply
Question marked as Top-ranking reply

Posted on Feb 28, 2021 1:01 AM

Apple 2nd level support seemed to think it was due to external hard drive issues. They asked me to test without drive connected.


can confirm now that the SOCD crash would only occur with My external USB A Western Digital Hard drive physically connected. After disconnecting it has not happened again, so far. After 5 days.


do you all have external drives connected?

365 replies

Feb 28, 2021 10:27 AM in response to magic100

Further update:

Just looked in my 'IT box', and found a USB A to USB C adapter. I have since disconnected my Time Machine backup drive that was connected to USB A, and used the adapter to connect it.


Let's see if this error, "SOCD report detected: (AP watchdog expired)" happens again.


PS: Maybe it is just me, but, already feel as if my mouse is moving MUCH better than it was...

Mar 12, 2021 6:55 AM in response to magic100

I just bought a new Mac Mini M1 16G 256 storage two days ago, Big Sur 11.2.3 installed and the same thing happened twice: green screen and reboot. Error log is [SOCD report detected: (AP watchdog expired). I was also using a WD external hard drive (circa 2012-13) to manually transfer documents (only one or two apps, both updated), and I believe the drive was attached during both incidents. Have detached and will update in a week or two.

Apr 28, 2021 5:41 AM in response to magic100

Based on the posts here by users experiencing this annoying and expensive fault, I think the issue is with many types of USB types of devices including Storage drives (powered and non powered), audio dac/amp, other peripherals.


I AM CONCERNED 11.3 BIG SUR HAS NO DETAILED INFORMATION ON ACKNOWLEDGING OR FIXING THIS PROBLEM. I WAS CONTACTED DIRECTLY BY RND AFTER CREATING THIS THREAD AND WAS TOLD 13ENGINEERS ARE VERY CONCERNED ABOUT IT. THIS WAS IN FEBRUARY AND NOW NOTHING FURTHER FROM RND DIRECTLY.


I was considering taking in for service, but it could be simply replaced with new product or board with same issue again.


some one mentioned try using a usb hub. Perhaps that is a solution? I don’t know.

Jun 7, 2021 3:35 AM in response to magic100

anyway i disconnected my Toshiba external disk that was connected to a portable USB connector and waiting to see if it was this the issue but it seems odd that I could not use an external disk.....on a a brand new Mac with such small space. I hope Apple will release some updated ASAP for this issue!!!!! I also have the issue the some software run quite slow such Rhinoceros and Photoshop and I didn't expect this since the marketing was concentred on "HOW FAST IS THE MAC MINI". ...yep, maybe not mine (mAc Mini 16GB, 1 T space).

Jul 3, 2021 10:04 AM in response to sajal101

"Its like a component defect in the logic board"


i'm not entirely sure there's enough evidence to support this theory. you could be right though.


Just because 'Watchdog' has relation to logic board, doesn't necessarily point to 'logic board replacement'. correlation doesn't equal causation.


my theory: there's some x86 legacy driver that's being translated through some sort of abstraction layer like Rosetta, and there's some sort of leak or overflow that builds up over time, and eventually the system has to restart.





Jul 4, 2021 7:12 PM in response to manucartes

nice find.


to further speculate, based in this comment https://gist.github.com/Omar-Ikram/8e6721d8e83a3da69b31d4c2612a68ba#file-endpointsecuritydemo-m-L92


// This will cause discrepancies when converting Mach absolute time values from Endpoint Security Messages.

// The best option would be to compile your client as a universal binary:

// https://developer.apple.com/documentation/xcode/building_a_universal_macos_binary
// Note: Running x86_64 code running under Rosetta 2 will have timebase information for Intel CPUs.


not saying this is exactly the cause. but we do have a release note saying that this API was causing panics due a timeout type issue. it's feasible, not confirmed though, that these could be caused by something like x86_64 code spitting out bad values through Rosetta 2. wild speculation and could be completely wrong.

Jul 7, 2021 5:44 PM in response to magic100

The same "SOCD report detected: (AP watchdog expired)" panic has occurred numerous times on my machine, but usually only once every few months, tops. I've had such a panic a total of maybe 3 or 4 times since purchasing the M1 mini in March. I am aware it is related to external storage.


I have an external Thunderbolt enclosure, a Sabrent EC-T3NS, which is based on an Intel Thunderbolt controller chip. I am using it with a 1TB Samsung 980 Pro.


I have sent a feedback post about this, which contains the panic report and a sysdiagnose captured after rebooting back into Monterey. FB9286550 if anyone wants to know, for reference. And if anyone wants to mention this Feedback in their own reports of this issue.


The Panic dump appears to contain a base64 encoded SOCDContainer object, which may be related to the SoC's microcontroller which locked up. Maybe someone will actually find this useful. Hopefully this isn't some errata that can't be corrected with at most a firmware update.


[Edited by Moderator] 

Aug 23, 2021 2:27 PM in response to d3cbgf

The string "SOCD" is only contained in kernel extensions related to SMC(System Management Controller) and PMU (Power Management Unit):


AppleD2422PMU.kext

AppleD2449PMU.kext

AppleSMC.kext

ApplePMGR.kext

AppleSPMIPMU.kext

AppleT8103PMGR.kext


Maybe the SOCD combines on M1 all the functions that on an Intel Mac the SMC and PMU did?

Since we also saw this base64 data in the panic log it could also have some NVRAM on it.


So maybe the SOCD is really daemon program doing former SMC and PMU tasks and runs on the AP (application processor). And everything is encrypted by the keys in the security enclave.


And on the AP is also a watchdog function that detects if the SOCD daemon is frozen. And since it handles all those important SMC and PMU stuff all they can do is a hard reset of the Mac.


Still a lot of speculation, but since nothing is documented...

Aug 23, 2021 2:58 PM in response to la_manzana

pmu restarting
"SMCFW failed to handle restart request\n"SOCD: BASE 0x%04X, COUNT 0x%04X
"AppleDialogPMU::%s failed to alloc %d bytes"_handleSOCDSOCD: failed to read magic value
SOCD: Magic value %02Xh %02Xh %02Xh %02Xh
SOCDSOCD: failed to read trigger counters
SOCD: Failed to reset magic
SOCD: Failed to read socd container
SOCD: Failed to allocate OSData
SOCD: PMU %llu ns, TOTAL %llu ns (%u, %u)

some interesting strings in AppleSPMIPMU.kext


another possibility / theory on hardware side: bad or corrupted NAND (nonvolatile) flash storage (used by AES Engine) within SEP..... daemon times out and triggers pmu to restart


so many possibilities!


also, i'm not sure what line 109 refers to, but its kinda funny.

PMURTCHelperreadRTCUpcountTicks%s::%**** <rdar://problem/31800609> (retries=%d)

something to do with the pmu reading real time clock tickets... and looks like reference to internal help system

Aug 25, 2021 8:02 AM in response to la_manzana

more errata/interesting information:


bug reported on 11.5.2 that claims that the "amount of user and system CPU time a given process has consumed over its lifetime" for processes run through Rosetta 2 on M1 is an incorrect value.


https://openradar.appspot.com/FB9546856


my guess would be SEP would potentially monitor some of these values. If there is skew over time, that may result in SOCD getting confused, watchdog times out on sepOS, and triggers pmu to hard reset.

Sep 4, 2021 9:04 PM in response to kode54

I believe it is, in my case anyway. I ordered a USB C to DP cable, will see how that goes. I though about upgrading the monitor as well. Mine is the LG 31MU97-B, Has been great over the years but now show signs of screen burn in when left up for some time. Should do HDMI @ 30Hz but Preferences for display show 24Hz, not sure if that has anything to do with it. I have an Ultra High Speed HDMI cable which is supposed to support 8k, so I figure it should be fine. Will see what it works at on that USB C to Mini DisplayPort does.

This thread has been closed by the system or the community team. You may vote for any posts you find helpful, or search the Community for additional answers.

SOCD report detected: (AP watchdog expired)

Welcome to Apple Support Community
A forum where Apple customers help each other with their products. Get started with your Apple Account.