question

Zoram-0989 avatar image
0 Votes"
Zoram-0989 asked Zoram-0989 commented

BSOD MACHINE_CHECK_EXCEPTION

The computer is just a few months old, and it gets a BSOD once every few days. It nearly always happens at night.
The error is sometimes MACHINE_CHECK_EXCEPTION, sometimes WHEA_UNCORRECTABLE_ERROR.

I ran memtest86, Intel Processor Diagnostics, Prime95, OCCT - none of them have any issues.

I replaced my PSU with an old one that I know is working fine, but it didn't help.

There is no overclocking, and no graphics card.

I have minidumps, but unsure how to attach to this question.

Specs: Windows 10 Pro, i7-9700K, B365M DS3H (motherboard).

Here is a summary of the error, copied from WinDbg:

MACHINE_CHECK_EXCEPTION (9c)
A fatal Machine Check Exception has occurred.

Debugging Details:
------------------

KEY_VALUES_STRING: 1
Key : Analysis.CPU.mSec
Value: 2858
Key : Analysis.DebugAnalysisManager
Value: Create
Key : Analysis.Elapsed.mSec
Value: 19064
Key : Analysis.Init.CPU.mSec
Value: 312
Key : Analysis.Init.Elapsed.mSec
Value: 4779
Key : Analysis.Memory.CommitPeak.Mb
Value: 82
Key : WER.OS.Branch
Value: vb_release
Key : WER.OS.Timestamp
Value: 2019-12-06T14:06:00Z
Key : WER.OS.Version
Value: 10.0.19041.1

BUGCHECK_CODE: 9c
BUGCHECK_P1: 80000001
BUGCHECK_P2: ffffaa000b7e6b30
BUGCHECK_P3: 0
BUGCHECK_P4: 0
BLACKBOXBSD: 1 (!blackboxbsd)
BLACKBOXNTFS: 1 (!blackboxntfs)
BLACKBOXPNP: 1 (!blackboxpnp)
BLACKBOXWINLOGON: 1
CUSTOMER_CRASH_COUNT: 1
PROCESS_NAME: System
BAD_STACK_POINTER: ffffaa000b7e6ae8

STACK_TEXT:
ffffaa00`0b7e6ae8 fffff806`7a0b5619 : 00000000`0000009c 00000000`80000001 ffffaa00`0b7e6b30 00000000`00000000 : nt!KeBugCheckEx
ffffaa00`0b7e6af0 fffff806`7a0b5a74 : 00000000`00000008 ffffaa00`0b7e6e70 00000000`00000000 00000000`00000008 : nt!HalpMcaReportError+0x149
ffffaa00`0b7e6c60 fffff806`7a0b4c8b : 00000000`00000000 00000000`80000001 ffffaa00`0b7e6ef0 ef3bfb64`96659275 : nt!HalpMceHandlerWithRendezvous+0x11c
ffffaa00`0b7e6c90 fffff806`7a0b74d5 : ffffbf8c`d76cad40 03c21cac`60a5675d e0f07086`da81f8bf ee39f330`8ccae531 : nt!HalpHandleMachineCheck+0x5f
ffffaa00`0b7e6cc0 fffff806`7a10cc79 : ba05590e`ce0d35e7 bc4d5ff8`48e84b99 5bfcae32`78a45edb 7c05fe3c`150bdc2d : nt!HalHandleMcheck+0x35
ffffaa00`0b7e6cf0 fffff806`7a004efa : 96096e3a`3d39a983 045bfe84`09ffbc15 52b9211e`5e39b337 a3a5e888`98213369 : nt!KiHandleMcheck+0x9
ffffaa00`0b7e6d20 fffff806`7a004bb7 : 00000000`00000000 fffff806`7a004aec 00000000`00000002 00000000`00000000 : nt!KxMcheckAbort+0x7a
ffffaa00`0b7e6e60 fffff806`8880138f : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KiMcheckAbort+0x277
ffffa28c`9f63f5b8 00000000`00000000 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : intelppm!MWaitIdle+0x1f

SYMBOL_NAME: intelppm!MWaitIdle+1f
MODULE_NAME: intelppm
IMAGE_NAME: intelppm.sys
IMAGE_VERSION: 10.0.19041.1134
STACK_COMMAND: .thread ; .cxr ; kb
BUCKET_ID_FUNC_OFFSET: 1f
FAILURE_BUCKET_ID: 0x9C_SPURIOUS_GenuineIntel_STACKPTR_ERROR_intelppm!MWaitIdle
OS_VERSION: 10.0.19041.1
BUILDLAB_STR: vb_release
OSPLATFORM_TYPE: x64
OSNAME: Windows 10
FAILURE_ID_HASH: {1e3f83d3-dcec-5c94-4c1c-54ecb2a7abc1}

windows-10-general
· 2
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

@Zoram-0989
Hi,

Just checking in to see if the information provided was helpful.

If the reply helped you, please remember to accept as answer.
If no, please reply and tell us the current situation in order to provide further help.

0 Votes 0 ·

Just replied to your answer below, thanks.

0 Votes 0 ·
Zoram-0989 avatar image
0 Votes"
Zoram-0989 answered Zoram-0989 commented

Since it only happened at night, it made sense that the issue happened when the CPU entered some deep sleep state (C-state).
I disabled the C-states in the BIOS, and haven't had any BSOD since (for 7 days).

What I wonder is, do these C-state crashes mean that the CPU is faulty, or the motherboard, or something else?

I will enable the C-states one by one, waiting a week after each, to see which one it was that was causing the issue.

· 2
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.


Disabling C states makes the processor work at 100%.

It costs money.

At some time you should swap test or replace the CPU.

For a computer that is a fee months old > RMA (Return Merchandise Authorization)

0 Votes 0 ·

Thanks for your replies. I cannot replace the CPU due to the expense. See my comment to JennyFeng-MSFT's answer why it's not worth me returning it to the manufacturer.

0 Votes 0 ·
SSengupta-4080 avatar image
0 Votes"
SSengupta-4080 answered Zoram-0989 edited

You will find attachment button [ the clip image in toolbar above> click on it > select either Upload or Drop the minidump file from computer.


· 1
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

It says "Allowed Upload Types: xml, gif, png, jpg, jpeg, pdf, txt, log", and a minidump is dmp.

0 Votes 0 ·
JennyFeng-MSFT avatar image
0 Votes"
JennyFeng-MSFT answered JennyFeng-MSFT commented

@Zoram-0989
Hi,
MACHINE_CHECK_EXCEPTION is pretty much always a hardware issue. Often CPU, but sometimes RAM, cache or even defective PSU.
This is another MACHINE_CHECK _EXCEPTION STOP Error, this points to a hardware defect.
https://docs.microsoft.com/en-us/windows-hardware/drivers/debugger/bug-check-0x124---whea-uncorrectable-error
In view of the greater possibility of hardware errors, I think you should consult the manufacturer
Please understand due to security policy and from our professional level, we do not provide dump/log analysis. In addition, if this problem is more urgent for you I still recommend that you open a case to Microsoft for further professional help.
https://support.microsoft.com/en-us/help/4341255/support-for-business

Hope above information can help you.

============================================
If the Answer is helpful, please click "Accept Answer" and upvote it.
Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.

· 6
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

I contacted the manufacturer, but they said I must bring them the computer, and they have to reproduce the issue, which can take a week or more, which I can't afford.
If the problem is a hardware issue, will Support for Business walk me through the process of finding out which hardware is the culprit, or will they stop once they rule out Windows as the problem?

0 Votes 0 ·

Hi,
If the hardware is the culprit, I’m afraid you still need to contact the manufacturer for repair.

0 Votes 0 ·

I can't take it back to the manufacturer, since it means I will be without my PC for a week while they perform tests, and since the error happens only occasionally, it may not happen with them. Maybe the error is due to my wireless mouse, or some other hardware that they won't be using in their tests. Plus they insisted that I bring the PC to them, and they're in a different town...

0 Votes 0 ·
Show more comments
Docs-4663 avatar image
0 Votes"
Docs-4663 answered Docs-4663 edited

For bugchecks 9C and WHEA 124 you can perform hardware stress tests using software to potentially find the malfunctioning hardware.
(you ran commonly used tests)
(additional common tests are available for drive and GPU)



These tests do not directly test the motherboard.

If the tests, when properly performed, did not display the malfunctioning hardware then options are:

a) swap testing

b) contacting the component manufacturer or vendor for Return Merchandise Authorization (RMA)






5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.