bithead2

Q: Mac Pro 2009 hangs intermittently

I have a Mac Pro (early 2009) that has lived a very cushy life (because I don't use it a lot). On 10.8.5 (Mountain Lion) it started hanging intermittently. So I installed an SSD, did a clean install of Mountain Lion on it, and ran updates. It still hangs intermittently, so the OS installation and the disk drives aren't the problem.

 

Here is the trace, what is the likely issue?

 

Thanks,

CJ

 

Interval Since Last Panic Report:  53 sec

Panics Since Last Report:          1

Anonymous UUID:                    4D4C7BB1-ECDA-A646-91B0-47B042CFF6AD

 

Sun Oct  4 03:22:26 2015

Machine-check capabilities 0x0000000000001c09:

family: 6 model: 26 stepping: 5 microcode: 17

Intel(R) Xeon(R) CPU           E5520  @ 2.27GHz

9 error-reporting banks

threshold-based error status present

extended corrected memory error handling present

Processor 0: no valid machine-check state

Processor 1: no valid machine-check state

Processor 2: no valid machine-check state

Processor 3: no valid machine-check state

Processor 4: no valid machine-check state

Processor 5: no valid machine-check state

Processor 6: no valid machine-check state

Processor 7: no valid machine-check state

Processor 8: machine-check status 0x0000000000000004:

machine-check in progress

MCA error-reporting registers:

IA32_MC0_STATUS(0x401): 0x0000000000000800 invalid

IA32_MC1_STATUS(0x405): 0xbe00000000400e0f valid

  MCA error code:            0x0e0f

  Model specific error code: 0x0040

  Other information:         0x00000000

  Threshold-based status:    Undefined

  Status bits:

   Processor context corrupt

   ADDR register valid

   MISC register valid

   Error enabled

   Uncorrected error

IA32_MC1_ADDR(0x406): 0x000000e1e7e1e000

IA32_MC1_MISC(0x407): 0x0000000001000000

IA32_MC2_STATUS(0x409): 0x0000000000000000 invalid

IA32_MC3_STATUS(0x40d): 0x0000000000000000 invalid

IA32_MC4_STATUS(0x411): 0x0000000000000000 invalid

IA32_MC5_STATUS(0x415): 0x0000000000000000 invalid

IA32_MC6_STATUS(0x419): 0x0000000000000000 invalid

IA32_MC7_STATUS(0x41d): 0x0000000000000000 invalid

IA32_MC8_STATUS(0x421): 0x0000000000000000 invalid

Processor 9: machine-check status 0x0000000000000004:

machine-check in progress

MCA error-reporting registers:

IA32_MC0_STATUS(0x401): 0x0000000000000800 invalid

IA32_MC1_STATUS(0x405): 0xbe00000000400e0f valid

  MCA error code:            0x0e0f

  Model specific error code: 0x0040

  Other information:         0x00000000

  Threshold-based status:    Undefined

  Status bits:

   Processor context corrupt

   ADDR register valid

   MISC register valid

   Error enabled

   Uncorrected error

IA32_MC1_ADDR(0x406): 0x000000e1e7e1e000

IA32_MC1_MISC(0x407): 0x0000000001000000

IA32_MC2_STATUS(0x409): 0x0000000000000000 invalid

IA32_MC3_STATUS(0x40d): 0x0000000000000000 invalid

IA32_MC4_STATUS(0x411): 0x0000000000000000 invalid

IA32_MC5_STATUS(0x415): 0x0000000000000000 invalid

IA32_MC6_STATUS(0x419): 0x0000000000000000 invalid

IA32_MC7_STATUS(0x41d): 0x0000000000000000 invalid

IA32_MC8_STATUS(0x421): 0x0000000000000000 invalid

Processor 10: machine-check status 0x0000000000000004:

machine-check in progress

MCA error-reporting registers:

IA32_MC0_STATUS(0x401): 0x0000000000000800 invalid

IA32_MC1_STATUS(0x405): 0xbe00000000400e0f valid

  MCA error code:            0x0e0f

  Model specific error code: 0x0040

  Other information:         0x00000000

  Threshold-based status:    Undefined

  Status bits:

   Processor context corrupt

   ADDR register valid

   MISC register valid

   Error enabled

   Uncorrected error

IA32_MC1_ADDR(0x406): 0x000000e1e7e1e000

IA32_MC1_MISC(0x407): 0x0000000001000000

IA32_MC2_STATUS(0x409): 0x0000000000000000 invalid

IA32_MC3_STATUS(0x40d): 0x0000000000000000 invalid

IA32_MC4_STATUS(0x411): 0x0000000000000000 invalid

IA32_MC5_STATUS(0x415): 0x0000000000000000 invalid

IA32_MC6_STATUS(0x419): 0x0000000000000000 invalid

IA32_MC7_STATUS(0x41d): 0x0000000000000000 invalid

IA32_MC8_STATUS(0x421): 0x0000000000000000 invalid

Processor 11: machine-check status 0x0000000000000004:

machine-check in progress

MCA error-reporting registers:

IA32_MC0_STATUS(0x401): 0x0000000000000800 invalid

IA32_MC1_STATUS(0x405): 0xbe00000000400e0f valid

  MCA error code:            0x0e0f

  Model specific error code: 0x0040

  Other information:         0x00000000

  Threshold-based status:    Undefined

  Status bits:

   Processor context corrupt

   ADDR register valid

   MISC register valid

   Error enabled

   Uncorrected error

IA32_MC1_ADDR(0x406): 0x000000e1e7e1e000

IA32_MC1_MISC(0x407): 0x0000000001000000

IA32_MC2_STATUS(0x409): 0x0000000000000000 invalid

IA32_MC3_STATUS(0x40d): 0x0000000000000000 invalid

IA32_MC4_STATUS(0x411): 0x0000000000000000 invalid

IA32_MC5_STATUS(0x415): 0x0000000000000000 invalid

IA32_MC6_STATUS(0x419): 0x0000000000000000 invalid

IA32_MC7_STATUS(0x41d): 0x0000000000000000 invalid

IA32_MC8_STATUS(0x421): 0x0000000000000000 invalid

Processor 12: machine-check st

Model: MacPro4,1, BootROM MP41.0081.B07, 8 processors, Quad-Core Intel Xeon, 2.26 GHz, 32 GB, SMC 1.39f5

Graphics: NVIDIA GeForce GT 120, NVIDIA GeForce GT 120, PCIe, 512 MB

Memory Module: DIMM 1, 4 GB, DDR3 ECC, 1066 MHz, 0x802C, 0x33364A445A533531323732505A3147344631

Memory Module: DIMM 2, 4 GB, DDR3 ECC, 1066 MHz, 0x802C, 0x33364A425A53353132373250593147344431

Memory Module: DIMM 3, 4 GB, DDR3 ECC, 1066 MHz, 0x802C, 0x33364A425A53353132373250593147344431

Memory Module: DIMM 4, 4 GB, DDR3 ECC, 1066 MHz, 0x802C, 0x33364A425A53353132373250593147344431

Memory Module: DIMM 5, 4 GB, DDR3 ECC, 1066 MHz, 0x802C, 0x33364A425A53353132373250593147344431

Memory Module: DIMM 6, 4 GB, DDR3 ECC, 1066 MHz, 0x802C, 0x33364A425A53353132373250593147344431

Memory Module: DIMM 7, 4 GB, DDR3 ECC, 1066 MHz, 0x802C, 0x33364A445A533531323732505A3147344631

Memory Module: DIMM 8, 4 GB, DDR3 ECC, 1066 MHz, 0x802C, 0x33364A445A533531323732505A3147344631

AirPort: spairport_wireless_card_type_airport_extreme (0x14E4, 0x8E), Broadcom BCM43xx 1.0 (5.106.98.100.17)

Bluetooth: Version 6.1.7f5 15859, 3 service, 21 devices, 3 incoming serial ports

Network Service: Wi-Fi, AirPort, en2

PCI Card: NVIDIA GeForce GT 120, sppci_displaycontroller, Slot-1

PCI Card: pci1057,3410, sppci_othermultimedia, Slot-2@8,4,0

PCI Card: pci1057,3410, sppci_othermultimedia, Slot-2@8,5,0

PCI Card: pci1057,3410, sppci_othermultimedia, Slot-2@8,6,0

PCI Card: pci1a00,1, sppci_othermultimedia, Slot-4

PCI Card: pci1057,3410, sppci_othermultimedia, Slot-3@4,4,0

PCI Card: pci1057,3410, sppci_othermultimedia, Slot-3@4,5,0

PCI Card: pci1057,3410, sppci_othermultimedia, Slot-3@4,6,0

Serial ATA Device: HL-DT-ST DVD-RW GH41N

Serial ATA Device: Samsung SSD 850 EVO 500GB, 500.11 GB

Serial ATA Device: Hitachi HDS722020ALA330, 2 TB

Serial ATA Device: Hitachi HDS723030ALA640, 3 TB

Serial ATA Device: WDC WD740GD-00FLC0, 74.36 GB

USB Device: Keyboard Hub, apple_vendor_id, 0x1006, 0xfd500000 / 3

USB Device: Kensington Expert Mouse, 0x047d  (Kensington), 0x1020, 0xfd530000 / 7

USB Device: Apple Keyboard, apple_vendor_id, 0x0220, 0xfd520000 / 6

USB Device: hub_device, 0x0409  (NEC Corporation), 0x005a, 0xfd300000 / 2

USB Device: v125w, 0x03f0  (Hewlett Packard), 0x3307, 0xfd310000 / 5

USB Device: hub_device, 0x0409  (NEC Corporation), 0x005a, 0xfd340000 / 4

USB Device: hub_device, apple_vendor_id, 0x9102, 0x1a200000 / 2

USB Device: hub_device, apple_vendor_id, 0x9118, 0x1a210000 / 3

USB Device: iLok, 0x088e, 0x5036, 0x1a211000 / 5

USB Device: Studio Display, apple_vendor_id, 0x9218, 0x1a213000 / 4

USB Device: BRCM2046 Hub, 0x0a5c  (Broadcom Corp.), 0x4500, 0x5a100000 / 2

USB Device: Bluetooth USB Host Controller, apple_vendor_id, 0x8215, 0x5a110000 / 3

FireWire Device: built-in_hub, 800mbit_speed

Mac Pro, OS X Mountain Lion (10.8.5)

Posted on Oct 4, 2015 3:45 AM
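
For readers trying to interpret the trace above: the status value that every reporting processor shows for bank 1, 0xbe00000000400e0f, can be unpacked by hand. The following is a minimal Python sketch, assuming the architectural IA32_MCi_STATUS bit layout described in the Intel Software Developer's Manual (flag bits 63 down to 57, model-specific code in bits 31:16, MCA error code in bits 15:0). The function name and dictionary keys are hypothetical, chosen for illustration; this is not the code the kernel uses to print the report.

```python
# Flag bits of IA32_MCi_STATUS, assuming the architectural layout
# from the Intel SDM (an assumption, not taken from the panic report).
FLAGS = {
    63: "VAL (register valid)",
    62: "OVER (error overflow)",
    61: "UC (uncorrected error)",
    60: "EN (error enabled)",
    59: "MISCV (MISC register valid)",
    58: "ADDRV (ADDR register valid)",
    57: "PCC (processor context corrupt)",
}


def decode_mc_status(status):
    """Split a 64-bit machine-check status value into its main fields."""
    mca_code = status & 0xFFFF
    return {
        "flags": [name for bit, name in FLAGS.items() if (status >> bit) & 1],
        "mca_error_code": mca_code,
        "model_specific_code": (status >> 16) & 0xFFFF,
        # Compound codes of the form 0000 1xxx xxxx xxxx are the
        # "bus and interconnect" class in the SDM's encoding tables.
        "bus_interconnect": (mca_code & 0xF800) == 0x0800,
    }


# Value reported for IA32_MC1_STATUS on processors 8 through 11.
decoded = decode_mc_status(0xBE00000000400E0F)
for flag in decoded["flags"]:
    print(flag)
print(hex(decoded["mca_error_code"]), hex(decoded["model_specific_code"]))
```

The decoded flags line up with the "Status bits" lines in the report, and the fields match the printed MCA error code (0x0e0f) and model specific error code (0x0040). Under the stated assumption, the MCA error code falls in the bus and interconnect class, which hints at something other than a plain DIMM fault but does not by itself identify the failing part.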

  • by Grant Bennet-Alder,

    Grant Bennet-Alder Oct 15, 2015 2:38 PM in response to bithead2
    Level 9 (61,083 points)
    Desktops

    Yeah, but there is another processor in there that we don't usually have to deal with directly. The System Management Controller (SMC) is the processor running the fans.

  • by bithead2,

    bithead2 Oct 19, 2015 10:54 AM in response to Grant Bennet-Alder
    Level 1 (0 points)

    That helps. Not sure what I should do. I don't have high confidence that Apple can really repair this, but I suppose if I dish it to them for $299 and they don't, then I won't have to worry too much about it. Maybe I should send this out to a third party? I found some processor cages on eBay, but looking at that part I have a hard time believing that replacing it is really going to fix the problem ... unless by "processor tray" Apple Geniuses mean that they are going to replace the processor board, not the processor cage with the fans, etc.

  • by Grant Bennet-Alder,

    Grant Bennet-Alder Oct 19, 2015 12:34 PM in response to bithead2
    Level 9 (61,083 points)
    Desktops
    Quoting bithead2: by "processor tray" Apple Geniuses mean that they are going to replace the processor board, not the processor cage with the fans, etc.

    In the 2009 through 2012 models, the entire bottom of the cabinet is a slide-out shelf, the "processor tray". It includes the processors, their heatsinks, the memory slots, and all the stuff around them.

     

    I suggest you power your Mac off, remove the tray, and examine it closely with a very bright light before you pay to have it fixed. Otherwise, that may be a very expensive busted wire. Also, be certain that the Northbridge chip heatsink is firmly attached, not sloppy. (It sits between the two processors, and its heatsink is attached with springs captured on plastic posts. The posts occasionally break, and weird things start to happen when that chip gets really hot.)

  • by Grant Bennet-Alder,

    Grant Bennet-Alder Oct 19, 2015 12:53 PM in response to Grant Bennet-Alder
    Level 9 (61,083 points)
    Desktops

    The heatsink needs to be tight and not "wiggly".

     

    You can check that by powering off, removing the AC power cord, and sliding out the processor/memory shelf in the bottom of the cabinet. The Northbridge chip sits in the middle of the board, and its heatsink can be checked for "wiggly" without any disassembly.

     

    If you have a wiggly Northbridge heatsink, as others have experienced, read the thread referenced below (it applies to 2009 through 2012 models). The Original Poster thought he just had random crashing, but as he probed, the Northbridge heatsink and its retaining pins became the focus. Richard Schletty provided some good photographs (toward the end) as well.

     

    Re: Kernel panic in Yosemite 10.10.3


    [Image: MacPro 2009 northbridge.png]

  • by bithead2,

    bithead2 Oct 20, 2015 2:01 PM in response to Grant Bennet-Alder
    Level 1 (0 points)

    Thanks, Grant. I'll check this out tonight.

     

    CJ

  • by bithead2, Solved answer

    bithead2 Oct 20, 2015 5:36 PM in response to bithead2
    Level 1 (0 points)

    Grant, you are a wizard. I just touched the heatsink and it moved!

     

    CJ

  • by bithead2,

    bithead2 Oct 31, 2015 5:30 PM in response to bithead2
    Level 1 (0 points)

    OK, I got all the tools together, reworked the heatsink on the Northbridge chip, and put everything back together. The heatsink no longer moves, but the hang is still occurring. What would be the next thing to try? I have already tried different memory.

     

    Thanks!
    CJ

  • by Grant Bennet-Alder,

    Grant Bennet-Alder Oct 31, 2015 6:04 PM in response to bithead2
    Level 9 (61,083 points)
    Desktops

    Is it the same kernel panic (machine check, multiple processors), or something else?
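
One way to answer that question without reading two long reports side by side is to pull out only the banks marked "valid" and compare them. This is a rough Python sketch, assuming the report lines look like the ones pasted in the original post ("Processor N: ..." followed by "IA32_MCn_STATUS(0x...): 0x... valid"). The function names are made up for illustration, and a report in a different layout would need different patterns.

```python
import re

# Patterns modeled on the lines in the pasted panic report (an assumption
# about the format; other OS versions may print it differently).
PROC_RE = re.compile(r"^Processor (\d+):")
BANK_RE = re.compile(r"^IA32_MC(\d+)_STATUS\(0x[0-9a-f]+\): (0x[0-9a-f]{16}) valid\b")


def valid_banks(report_text):
    """Return a set of (processor, bank, status) for banks marked valid."""
    found = set()
    cpu = None
    for raw in report_text.splitlines():
        line = raw.strip()
        m = PROC_RE.match(line)
        if m:
            cpu = int(m.group(1))
            continue
        m = BANK_RE.match(line)
        if m:
            found.add((cpu, int(m.group(1)), int(m.group(2), 16)))
    return found


def signature(report_text):
    """Drop the processor number so two reports can be compared directly."""
    return {(bank, status) for _cpu, bank, status in valid_banks(report_text)}
```

If `signature(old_report) == signature(new_report)`, the new panic is reporting the same bank and status value as before; if the sets differ, it is worth posting the new report.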

  • by bithead2,

    bithead2 Feb 15, 2016 3:36 PM in response to Grant Bennet-Alder
    Level 1 (0 points)

    I ended up getting Apple to do an out-of-warranty repair for $299 on the entire processor cage assembly (sans processors). That fixed it. So it wasn't memory; it was something on the board itself. I believe you correctly pointed that out.

     

    Regards,

    CJ
