{"id":324,"date":"2008-09-09T19:46:13","date_gmt":"2008-09-09T23:46:13","guid":{"rendered":"http:\/\/nuxx.net\/blog\/?p=324"},"modified":"2008-09-12T20:26:49","modified_gmt":"2008-09-13T00:26:49","slug":"cpu-2-machine-check-exception-4-bank-4-f61c2001ba080813","status":"publish","type":"post","link":"https:\/\/nuxx.net\/blog\/2008\/09\/09\/cpu-2-machine-check-exception-4-bank-4-f61c2001ba080813\/","title":{"rendered":"CPU 2: Machine Check Exception: 4 Bank 4: f61c2001ba080813"},"content":{"rendered":"<p><center><\/p>\n<table cellpadding=1>\n<tr>\n<td bgcolor=\"black\"><a href=\"https:\/\/nuxx.net\/gallery\/v\/computers\/banstyle_nuxx_net\/IMG_2418.jpg.html?g2_imageViewsIndex=2\"><img decoding=\"async\" src=\"https:\/\/nuxx.net\/gallery\/d\/76493-2\/IMG_2418.jpg\" height=427 width=640 border=0 alt=\"A real, honest, good failure while running Breakin on banstyle.nuxx.net. It points to something being wrong with the second CPU or bank of memory.\"><\/a><\/td>\n<\/tr>\n<\/table>\n<p><\/center><\/p>\n<p>In testing my server <a href=\"https:\/\/nuxx.net\/gallery\/v\/computers\/banstyle_nuxx_net\/\">banstyle.nuxx.net<\/a> has had its first real set of errors \/ failures. This is a good thing.<\/p>\n<p>First, last night I started getting SMART warnings about bad blocks on <tt>ad6<\/tt>, which is the second hard drive. 
So today I went ahead and ordered a pair of <a href=\"http:\/\/www.seagate.com\/ww\/v\/index.jsp?vgnextoid=c89ef141e7f43110VgnVCM100000f5ee0a0aRCRD&#038;locale=en-US\">ST3500320AS<\/a> 500GB disks and a <a href=\"http:\/\/www.3ware.com\/products\/serial_ata8000.asp\">3ware 8006-2LP<\/a>, the same controller used in my current server.<\/p>\n<p>Note the <tt>sdb<\/tt> errors, which are consistent with the other errors I&#8217;d been seeing that indicate a bad block on the second hard disk.<\/p>\n<p>Second, I came home today and found my server hung while running <a href=\"http:\/\/www.advancedclustering.com\/software\/breakin.html\">Breakin<\/a>, displaying the error <tt>CPU 2: Machine Check Exception: 4 Bank 4: f61c2001ba080813 TSC 2561d00c4ef7 ADDR ce19fd00<\/tt>. So, at least I&#8217;ve got some place to look for what else might be the issue.<\/p>\n<p><!--more-->This error was decoded with <a href=\"http:\/\/www.amd.com\/us-en\/assets\/content_type\/utilities\/MCAT.zip\">AMD&#8217;s Machine Check Analysis Tool (MCAT)<\/a>. 
Since the machine contains <a href=\"http:\/\/en.wikipedia.org\/wiki\/List_of_AMD_Opteron_microprocessors#Opteron_800-series_.22Egypt.22_.28E1_.26_E6.2C_90_nm.29\">Opteron 885<\/a> CPUs which are in the <a href=\"http:\/\/en.wikipedia.org\/wiki\/AMD_K10\">AMD K10<\/a> family, the <tt>\/gh<\/tt> flag is used:<\/p>\n<p><tt>C:\\Program&nbsp;Files\\AMD\\MCat&gt;mcat&nbsp;\/gh&nbsp;\/cmd&nbsp;4&nbsp;0xf41c2000ba080a13&nbsp;0xce19fd00&nbsp;0x2561d00c4ef7<\/tt><br \/>\n<tt>Processor&nbsp;Number&nbsp;&nbsp;:&nbsp;0<\/tt><br \/>\n<tt>Bank&nbsp;Number&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;:&nbsp;4<\/tt><br \/>\n<tt>Time&nbsp;Stamp&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;00000000&nbsp;00000000<\/tt><br \/>\n<tt>Error&nbsp;Status&nbsp;&nbsp;(0x):&nbsp;F41C2000&nbsp;BA080A13<\/tt><br \/>\n<tt>Error&nbsp;Address&nbsp;(0x):&nbsp;00000000&nbsp;CE19FD00<\/tt><br \/>\n<tt>Error&nbsp;Misc&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;00002561&nbsp;D00C4EF7<\/tt><br \/>\n<tt>Status&nbsp;Bit&nbsp;Decode&nbsp;:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Uncorrectable&nbsp;ECC&nbsp;error<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;valid<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;enable<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;uncorrected<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;overflow<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;valid<\/tt><br \/>\n<tt>Error&nbsp;Code&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;0A13<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;Type&nbsp;-&nbsp;Bus<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Participation&nbsp;Processor&nbsp;(PP)&nbsp;-&nbsp;Local&nbsp;node&nbsp;responded&nbsp;to&nbsp;the&nbsp;request&nbsp;(RES)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Timeout&nbsp;(T)&nbsp;-&nbsp;Request&nbsp;did&nbsp;not&nbsp;time&nbsp;out<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;Transaction&nbsp;Type&nbsp;(RRRR)&nbsp;-&nbsp;Generic&nbsp;read&nbsp;(RD)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;or&nbsp;IO&nbsp;(II)&nbsp;-&nbsp;Memory&nbsp;Access&nbsp;(MEM)<\/tt><br 
\/>\n<tt>&nbsp;&nbsp;&nbsp;Cache&nbsp;Level&nbsp;(LL)&nbsp;-&nbsp;Generic,&nbsp;includes&nbsp;L3&nbsp;cache&nbsp;(LG)<\/tt><br \/>\n<tt>Bank&nbsp;4&nbsp;North&nbsp;Bridge&nbsp;Errors:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;ECC&nbsp;Error&nbsp;-&nbsp;DRAM&nbsp;ECC&nbsp;error&nbsp;detected&nbsp;in&nbsp;the&nbsp;NB.<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;at&nbsp;3297&nbsp;MB&nbsp;rage<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Syndrome&nbsp;&nbsp;(0x):&nbsp;BA38<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Address&nbsp;decode:&nbsp;00000000CE19FD00<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Node&nbsp;ID:&nbsp;5<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Channel&nbsp;Select:&nbsp;0<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Chip&nbsp;Select:&nbsp;0<\/tt><\/p>\n<p>I then swapped the first and fourth DIMMs which are connected to the first CPU, then received this error after a few more hours of running Breakin:<\/p>\n<p><tt>C:\\Program&nbsp;Files\\AMD\\MCat&gt;mcat&nbsp;\/gh&nbsp;\/cmd&nbsp;4&nbsp;0xf41c2000ba080a13&nbsp;0x8589bd00&nbsp;0x147c1f963903<\/tt><br \/>\n<tt>Processor&nbsp;Number&nbsp;&nbsp;:&nbsp;0<\/tt><br \/>\n<tt>Bank&nbsp;Number&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;:&nbsp;4<\/tt><br \/>\n<tt>Time&nbsp;Stamp&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;00000000&nbsp;00000000<\/tt><br \/>\n<tt>Error&nbsp;Status&nbsp;&nbsp;(0x):&nbsp;F41C2000&nbsp;BA080A13<\/tt><br \/>\n<tt>Error&nbsp;Address&nbsp;(0x):&nbsp;00000000&nbsp;8589BD00<\/tt><br \/>\n<tt>Error&nbsp;Misc&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;0000147C&nbsp;1F963903<\/tt><br \/>\n<tt>Status&nbsp;Bit&nbsp;Decode&nbsp;:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Uncorrectable&nbsp;ECC&nbsp;error<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;valid<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;enable<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;uncorrected<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;overflow<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;valid<\/tt><br 
\/>\n<tt>Error&nbsp;Code&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;0A13<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;Type&nbsp;-&nbsp;Bus<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Participation&nbsp;Processor&nbsp;(PP)&nbsp;-&nbsp;Local&nbsp;node&nbsp;responded&nbsp;to&nbsp;the&nbsp;request&nbsp;(RES)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Timeout&nbsp;(T)&nbsp;-&nbsp;Request&nbsp;did&nbsp;not&nbsp;time&nbsp;out<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;Transaction&nbsp;Type&nbsp;(RRRR)&nbsp;-&nbsp;Generic&nbsp;read&nbsp;(RD)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;or&nbsp;IO&nbsp;(II)&nbsp;-&nbsp;Memory&nbsp;Access&nbsp;(MEM)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Cache&nbsp;Level&nbsp;(LL)&nbsp;-&nbsp;Generic,&nbsp;includes&nbsp;L3&nbsp;cache&nbsp;(LG)<\/tt><br \/>\n<tt>Bank&nbsp;4&nbsp;North&nbsp;Bridge&nbsp;Errors:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;ECC&nbsp;Error&nbsp;-&nbsp;DRAM&nbsp;ECC&nbsp;error&nbsp;detected&nbsp;in&nbsp;the&nbsp;NB.<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;at&nbsp;2136&nbsp;MB&nbsp;rage<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Syndrome&nbsp;&nbsp;(0x):&nbsp;BA38<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Address&nbsp;decode:&nbsp;000000008589BD00<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Node&nbsp;ID:&nbsp;5<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Channel&nbsp;Select:&nbsp;0<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Chip&nbsp;Select:&nbsp;0<\/tt><\/p>\n<p>I&#8217;ve got interleaving enabled and I suspect that with this they are being used in some sort of balanced fashion, round-robin or something like that. With this, I would have suspected that the error would then have moved from the upper 2048 MB of that CPU&#8217;s RAM to the lower 2048 MB, but it didn&#8217;t. It&#8217;s very likely that I&#8217;m wrong in this thinking, though.<\/p>\n<p>So, tomorrow morning before work I&#8217;ll turn off interleaving, run the test again, and try to narrow it down to a single DIMM. 
If I can do that I&#8217;ll then move it over to the other CPU and see if it moves. Hopefully the error will appear in a manner which I can more easily narrow down to a single DIMM.<\/p>\n<p><a href=\"https:\/\/nuxx.net\/gallery\/v\/computers\/banstyle_nuxx_net\/IMG_2435.jpg.html\">This third MCE, with Bank Interleaving set to Auto and Node Interleaving set to Disabled<\/a> has resulted in the following:<\/p>\n<p><tt>C:\\Program&nbsp;Files\\AMD\\MCat&gt;mcat&nbsp;\/gh&nbsp;\/cmd&nbsp;4&nbsp;0xf61c2001ba080813&nbsp;0x1b7ae9d00&nbsp;0x4e7b5766b77b<\/tt><br \/>\n<tt>Processor&nbsp;Number&nbsp;&nbsp;:&nbsp;0<\/tt><br \/>\n<tt>Bank&nbsp;Number&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;:&nbsp;4<\/tt><br \/>\n<tt>Time&nbsp;Stamp&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;00000000&nbsp;00000000<\/tt><br \/>\n<tt>Error&nbsp;Status&nbsp;&nbsp;(0x):&nbsp;F61C2001&nbsp;BA080813<\/tt><br \/>\n<tt>Error&nbsp;Address&nbsp;(0x):&nbsp;00000001&nbsp;B7AE9D00<\/tt><br \/>\n<tt>Error&nbsp;Misc&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;00004E7B&nbsp;5766B77B<\/tt><br \/>\n<tt>Status&nbsp;Bit&nbsp;Decode&nbsp;:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;associated&nbsp;with&nbsp;CPU&nbsp;core&nbsp;0<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Uncorrectable&nbsp;ECC&nbsp;error<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Processor&nbsp;context&nbsp;corrupt<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;valid<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;enable<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;uncorrected<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;overflow<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;valid<\/tt><br \/>\n<tt>Error&nbsp;Code&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;0813<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;Type&nbsp;-&nbsp;Bus<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Participation&nbsp;Processor&nbsp;(PP)&nbsp;-&nbsp;Local&nbsp;node&nbsp;originated&nbsp;the&nbsp;request&nbsp;(SRC)<\/tt><br 
\/>\n<tt>&nbsp;&nbsp;&nbsp;Timeout&nbsp;(T)&nbsp;-&nbsp;Request&nbsp;did&nbsp;not&nbsp;time&nbsp;out<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;Transaction&nbsp;Type&nbsp;(RRRR)&nbsp;-&nbsp;Generic&nbsp;read&nbsp;(RD)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;or&nbsp;IO&nbsp;(II)&nbsp;-&nbsp;Memory&nbsp;Access&nbsp;(MEM)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Cache&nbsp;Level&nbsp;(LL)&nbsp;-&nbsp;Generic,&nbsp;includes&nbsp;L3&nbsp;cache&nbsp;(LG)<\/tt><br \/>\n<tt>Bank&nbsp;4&nbsp;North&nbsp;Bridge&nbsp;Errors:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;ECC&nbsp;Error&nbsp;-&nbsp;DRAM&nbsp;ECC&nbsp;error&nbsp;detected&nbsp;in&nbsp;the&nbsp;NB.<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;at&nbsp;7034&nbsp;MB&nbsp;rage<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Syndrome&nbsp;&nbsp;(0x):&nbsp;BA38<\/tt><\/p>\n<p>Tomorrow I shall try to duplicate this result and memory location. If I can, I&#8217;ll begin shuffling parts around to see if the failure address moves. If I can&#8217;t, I&#8217;ll investigate the CPU.<\/p>\n<p><a href=\"https:\/\/nuxx.net\/gallery\/v\/computers\/banstyle_nuxx_net\/IMG_2440.jpg.html\">This fourth MCE<\/a> was generated with the same settings as the third, except with different disks (Seagate 500GB) and a new RAID controller (3ware 8006-2LP in JBOD mode):<\/p>\n<p><tt>C:\\Program&nbsp;Files\\AMD\\MCat&gt;mcat&nbsp;\/gh&nbsp;\/cmd&nbsp;4&nbsp;0xf41c2000ba080a13&nbsp;0x1250f2d00&nbsp;0xe0eb9e12d8e<\/tt><br \/>\n<tt>Processor&nbsp;Number&nbsp;&nbsp;:&nbsp;0<\/tt><br \/>\n<tt>Bank&nbsp;Number&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;:&nbsp;4<\/tt><br \/>\n<tt>Time&nbsp;Stamp&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;00000000&nbsp;00000000<\/tt><br \/>\n<tt>Error&nbsp;Status&nbsp;&nbsp;(0x):&nbsp;F41C2000&nbsp;BA080A13<\/tt><br \/>\n<tt>Error&nbsp;Address&nbsp;(0x):&nbsp;00000001&nbsp;250F2D00<\/tt><br \/>\n<tt>Error&nbsp;Misc&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;00000E0E&nbsp;B9E12D8E<\/tt><br \/>\n<tt>Status&nbsp;Bit&nbsp;Decode&nbsp;:<\/tt><br 
\/>\n<tt>&nbsp;&nbsp;&nbsp;Uncorrectable&nbsp;ECC&nbsp;error<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;valid<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;enable<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;uncorrected<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;overflow<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;valid<\/tt><br \/>\n<tt>Error&nbsp;Code&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;0A13<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;Type&nbsp;-&nbsp;Bus<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Participation&nbsp;Processor&nbsp;(PP)&nbsp;-&nbsp;Local&nbsp;node&nbsp;responded&nbsp;to&nbsp;the&nbsp;request&nbsp;(RES)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Timeout&nbsp;(T)&nbsp;-&nbsp;Request&nbsp;did&nbsp;not&nbsp;time&nbsp;out<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;Transaction&nbsp;Type&nbsp;(RRRR)&nbsp;-&nbsp;Generic&nbsp;read&nbsp;(RD)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;or&nbsp;IO&nbsp;(II)&nbsp;-&nbsp;Memory&nbsp;Access&nbsp;(MEM)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Cache&nbsp;Level&nbsp;(LL)&nbsp;-&nbsp;Generic,&nbsp;includes&nbsp;L3&nbsp;cache&nbsp;(LG)<\/tt><br \/>\n<tt>Bank&nbsp;4&nbsp;North&nbsp;Bridge&nbsp;Errors:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;ECC&nbsp;Error&nbsp;-&nbsp;DRAM&nbsp;ECC&nbsp;error&nbsp;detected&nbsp;in&nbsp;the&nbsp;NB.<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;at&nbsp;4688&nbsp;MB&nbsp;rage<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Syndrome&nbsp;&nbsp;(0x):&nbsp;BA38<\/tt><\/p>\n<p><a href=\"https:\/\/nuxx.net\/gallery\/v\/computers\/banstyle_nuxx_net\/IMG_2441.jpg.html\">This fifth MCE<\/a> was generated with Bank and Node Interleaving both disabled:<\/p>\n<p><tt>C:\\Program&nbsp;Files\\AMD\\MCat&gt;mcat&nbsp;\/gh&nbsp;\/cmd&nbsp;4&nbsp;0xf41c2000ba080a13&nbsp;0x1e224fd00&nbsp;0x48e38a5ecb7<\/tt><br \/>\n<tt>Processor&nbsp;Number&nbsp;&nbsp;:&nbsp;0<\/tt><br \/>\n<tt>Bank&nbsp;Number&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;:&nbsp;4<\/tt><br 
\/>\n<tt>Time&nbsp;Stamp&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;00000000&nbsp;00000000<\/tt><br \/>\n<tt>Error&nbsp;Status&nbsp;&nbsp;(0x):&nbsp;F41C2000&nbsp;BA080A13<\/tt><br \/>\n<tt>Error&nbsp;Address&nbsp;(0x):&nbsp;00000001&nbsp;E224FD00<\/tt><br \/>\n<tt>Error&nbsp;Misc&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;0000048E&nbsp;38A5ECB7<\/tt><br \/>\n<tt>Status&nbsp;Bit&nbsp;Decode&nbsp;:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Uncorrectable&nbsp;ECC&nbsp;error<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;valid<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;enable<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;uncorrected<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;overflow<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;valid<\/tt><br \/>\n<tt>Error&nbsp;Code&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;0A13<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;Type&nbsp;-&nbsp;Bus<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Participation&nbsp;Processor&nbsp;(PP)&nbsp;-&nbsp;Local&nbsp;node&nbsp;responded&nbsp;to&nbsp;the&nbsp;request&nbsp;(RES)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Timeout&nbsp;(T)&nbsp;-&nbsp;Request&nbsp;did&nbsp;not&nbsp;time&nbsp;out<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;Transaction&nbsp;Type&nbsp;(RRRR)&nbsp;-&nbsp;Generic&nbsp;read&nbsp;(RD)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;or&nbsp;IO&nbsp;(II)&nbsp;-&nbsp;Memory&nbsp;Access&nbsp;(MEM)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Cache&nbsp;Level&nbsp;(LL)&nbsp;-&nbsp;Generic,&nbsp;includes&nbsp;L3&nbsp;cache&nbsp;(LG)<\/tt><br \/>\n<tt>Bank&nbsp;4&nbsp;North&nbsp;Bridge&nbsp;Errors:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;ECC&nbsp;Error&nbsp;-&nbsp;DRAM&nbsp;ECC&nbsp;error&nbsp;detected&nbsp;in&nbsp;the&nbsp;NB.<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;at&nbsp;7714&nbsp;MB&nbsp;rage<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Syndrome&nbsp;&nbsp;(0x):&nbsp;BA38<\/tt><\/p>\n<p><a href=\"https:\/\/nuxx.net\/gallery\/v\/computers\/banstyle_nuxx_net\/IMG_2442.jpg.html\">The sixth MCE<\/a> was 
generated under the same conditions as #5, with both Node and Bank Interleaving disabled:<br \/>\n<tt>C:\\Program&nbsp;Files\\AMD\\MCat&gt;mcat&nbsp;\/gh&nbsp;\/cmd&nbsp;4&nbsp;0xf61c2001ba080813&nbsp;0x1a78d9d00&nbsp;0x42f9116690d7<\/tt><br \/>\n<tt>Processor&nbsp;Number&nbsp;&nbsp;:&nbsp;0<\/tt><br \/>\n<tt>Bank&nbsp;Number&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;:&nbsp;4<\/tt><br \/>\n<tt>Time&nbsp;Stamp&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;00000000&nbsp;00000000<\/tt><br \/>\n<tt>Error&nbsp;Status&nbsp;&nbsp;(0x):&nbsp;F61C2001&nbsp;BA080813<\/tt><br \/>\n<tt>Error&nbsp;Address&nbsp;(0x):&nbsp;00000001&nbsp;A78D9D00<\/tt><br \/>\n<tt>Error&nbsp;Misc&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;000042F9&nbsp;116690D7<\/tt><br \/>\n<tt>Status&nbsp;Bit&nbsp;Decode&nbsp;:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;associated&nbsp;with&nbsp;CPU&nbsp;core&nbsp;0<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Uncorrectable&nbsp;ECC&nbsp;error<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Processor&nbsp;context&nbsp;corrupt<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;valid<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;enable<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;uncorrected<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;overflow<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;valid<\/tt><br \/>\n<tt>Error&nbsp;Code&nbsp;&nbsp;&nbsp;&nbsp;(0x):&nbsp;0813<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;Type&nbsp;-&nbsp;Bus<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Participation&nbsp;Processor&nbsp;(PP)&nbsp;-&nbsp;Local&nbsp;node&nbsp;originated&nbsp;the&nbsp;request&nbsp;(SRC)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Timeout&nbsp;(T)&nbsp;-&nbsp;Request&nbsp;did&nbsp;not&nbsp;time&nbsp;out<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;Transaction&nbsp;Type&nbsp;(RRRR)&nbsp;-&nbsp;Generic&nbsp;read&nbsp;(RD)<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Memory&nbsp;or&nbsp;IO&nbsp;(II)&nbsp;-&nbsp;Memory&nbsp;Access&nbsp;(MEM)<\/tt><br 
\/>\n<tt>&nbsp;&nbsp;&nbsp;Cache&nbsp;Level&nbsp;(LL)&nbsp;-&nbsp;Generic,&nbsp;includes&nbsp;L3&nbsp;cache&nbsp;(LG)<\/tt><br \/>\n<tt>Bank&nbsp;4&nbsp;North&nbsp;Bridge&nbsp;Errors:<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;ECC&nbsp;Error&nbsp;-&nbsp;DRAM&nbsp;ECC&nbsp;error&nbsp;detected&nbsp;in&nbsp;the&nbsp;NB.<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Error&nbsp;address&nbsp;at&nbsp;6776&nbsp;MB&nbsp;rage<\/tt><br \/>\n<tt>&nbsp;&nbsp;&nbsp;Syndrome&nbsp;&nbsp;(0x):&nbsp;BA38<\/tt><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In testing my server banstyle.nuxx.net has had its first real set of errors \/ failures. This is a good thing. First, last night I started<\/p>\n<div class=\"more-link-wrapper\"><a class=\"more-link\" href=\"https:\/\/nuxx.net\/blog\/2008\/09\/09\/cpu-2-machine-check-exception-4-bank-4-f61c2001ba080813\/\">Continue reading<span class=\"screen-reader-text\">CPU 2: Machine Check Exception: 4 Bank 4: f61c2001ba080813<\/span><\/a><\/div>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[13],"tags":[],"class_list":["post-324","post","type-post","status-publish","format-standard","hentry","category-computers","entry"],"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/posts\/324","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/comments?post=324"}],"version-history":[{"count":19,"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/posts\/324\/revisions"}],"predecessor-version":[{"id":327,"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/posts\/324\/revisions\/327"}],"wp:attachm
ent":[{"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/media?parent=324"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/categories?post=324"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nuxx.net\/blog\/wp-json\/wp\/v2\/tags?post=324"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}