Question
44 views

Random STOP errors

Surly Jason asked on

Randomly get stop errors. Generally within just 5-10 minutes, and even if no one is using the system. Sometimes on boot-up.

Chkdsk no errors.

Traded video cards with another computer and same errors

Boot to Ubuntu live CD and memtest and stress can both tun >5 hours without error

You can get the minidump here:

https://drive.google.com/file/d/0B3Hqpl22OzzBWWQ3MU5iREwyVmM/edit?usp=sharing

1 person had this question

Abuse history


The answered status icon Answer
Patrick Barker replied on

Hi,

The attached DMP file is of the WHEA_UNCORRECTABLE_ERROR (124) bug check.

A fatal hardware error has occurred. This fatal error displays data from the Windows Hardware Error Architecture (WHEA).

If we run an !errrec on the 2nd parameter of the bug check (address of the WER structure) we get the following:

===============================================================================
Section 2     : x86/x64 MCA
-------------------------------------------------------------------------------
Descriptor    @ fffffa80038b7148
Section       @ fffffa80038b72d0
Offset        : 664
Length        : 264
Flags         : 0x00000000
Severity      : Fatal

Error         : BUSL0_SRC_ERR_M_NOTIMEOUT_ERR (Proc 0 Bank 0)
  Status      : 0xb200004000000800

This error implies that the processor sent a read request to the L0 Cache, and there was a delay in return (invalid data, or cache miss).

-------------------------------

There is only so much you can do with a bugcheck like this until it comes down to a faulty processor that will need to be replaced. Start from 1 and work downward:

1. Ensure your temperatures are within standard and nothing's overheating. You can use a program such as Speccy if you'd like to monitor temps - http://www.piriform.com/speccy

2. Clear your CMOS (or load optimized BIOS defaults) to ensure there's no improper BIOS setting - http://pcsupport.about.com/od/fixtheproblem/tp/clearcmos.htm

3. Ensure your BIOS is up to date.

4. Software conflicts that can usually cause *124 bugchecks are OS to BIOS utilities from manufacturer's like Asus' AI Suite. If you have something like this software-wise, remove it ASAP.

5. If all of the above fail, the only left to do is replace your processor as it is faulty.

Regards,

Patrick

Debugger/Reverse Engineer.
Be the first person to mark this helpful

Abuse history


progress