ntennies wrote:
What I'm leading up to here is the possibility that what we're seeing is being caused by two issues. There may be a Yosemite-specific software issue related to how the GPU is being reset that is causing this process to fail in Yosemite; as a trivial example, these hardware reset procedures often have timing elements, and it could be one of those timing elements got modified in the GPU driver used in Yosemite. BUT, the reason that we're only seeing this problem on a small number of MacBook Pros is due to a problem in hardware that is causing the GPU to glitch out in the first place. So it works like this:
- If you have a MacBook Pro in which the GPU doesn't glitch out, you don't get repeated reset attempts on Yosemite, so you never see the problem
- If you have a MacBook Pro in which the GPU does glitch, attempts to reset the GPU fail on Yosemite and you see the problem
- If you have a MacBook Pro in which the GPU does glitch, attempts to reset the GPU succeed in Mavericks, so you don't see the problem (except maybe a brief screen freeze)
What still doesn't make sense here is why swapping logic boards doesn't fix the problem by pushing your MacBook into the first bullet. Or why you'd see variations in behavior between user logins. Options are:
- Apple says they swapped the logic board but really didn't.
- Apple swapped with a "refurbed" logic board that came back from another user that had the problem
- That some batches of GPUs or logic boards have this glitchy problem, and it's just bad luck that you got another one with it (seems too coincidental)
- That whatever is glitching your GPU is not on the logic board, so it survives the logic board swap. One example is nrj45's GPU shorting theory, but there could be others.
- Something else
I'm just tossing this out as a theory. Interestingly, I also ran across a parallel discussion thread (fequent mid 2012 macbook pro retina kernel panic reboots possibly GPU related) that has been going on since late 2013 and still has activity. Most of these folks were reporting problems pre-Yosemite; it's not clear when some of these people report their computer "freezes" exactly what that means.
This seems like the most likely theory for everything going on.
Like others, I was also seeing several GPU resets (sometimes as many as 9-10, sometimes as few as 2-3) that were clearly correlated with the screen freezing (except for the cursor) before the machine restarted itself. All of them reported that the Intel GPU had hung and was being restarted. This explanation makes sense, but if there is a hardware issue, I think think it's with the integrated Intel GPU, not the nVidia one. If the issue was with the nVidia GPU, then disabling automatic graphics switching should exacerbate the issue, not solve it (since the nVidia GPU is being used all the time). Either that, or the issue really does lie in the core of the switching process/hardware, and whatever the problem is ends up getting blamed on the Intel GPU hanging.
I'm going to try ripping the back off this week and see if the logic board looks brand new or if it has any sort of revision number printed on it.