Message from syslogd@jack at Mon Mar 11 11:11:32 2002 ...
jack kernel: CPU 1: Machine Check Exception: 0000000000000004

Message from syslogd@jack at Mon Mar 11 11:11:32 2002 ...
jack kernel: Bank 4: b200000000040151

Message from syslogd@jack at Mon Mar 11 11:11:32 2002 ...
jack kernel: Kernel panic: CPU context corrupt

It happens about once a day. I've been searching the web every few months, trying to find a solution. The general consensus is that it's a bad CPU, or that it's overclocked. That seems to make sense, but over time I noticed that this error always seems to be associated with a dual-CPU system, and always with the second CPU.
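One way to check the "always the second CPU" observation, rather than relying on memory, is to count the reports per CPU in the saved logs. This is only a sketch: it assumes the log lines look like the ones above, and the name count_mce_by_cpu is made up for this example.

```python
import re
from collections import Counter

# Matches the kernel line shown above, for example:
#   jack kernel: CPU 1: Machine Check Exception: 0000000000000004
# Assumption: other kernels or syslog setups may format this line differently.
MCE_LINE = re.compile(r"kernel: CPU (\d+): Machine Check Exception: ([0-9a-fA-F]+)")


def count_mce_by_cpu(log_text):
    """Count Machine Check Exception reports per CPU number."""
    return Counter(int(m.group(1)) for m in MCE_LINE.finditer(log_text))


if __name__ == "__main__":
    sample = (
        "Message from syslogd@jack at Mon Mar 11 11:11:32 2002 ...\n"
        "jack kernel: CPU 1: Machine Check Exception: 0000000000000004\n"
        "Message from syslogd@jack at Mon Mar 11 11:11:32 2002 ...\n"
        "jack kernel: Bank 4: b200000000040151\n"
    )
    for cpu, count in sorted(count_mce_by_cpu(sample).items()):
        print("CPU %d: %d machine check(s)" % (cpu, count))
```

Feeding it the contents of the saved log file would show whether CPU 0 ever appears at all.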
In this thread, Alan Cox decodes the string as:
    > CPU 1: Machine Check Exception: 0000000000000004
    > Bank 4: b200000000040151<0>Kernel panic: CPU context corrupt

    Ok that decodes as:

    Status valid
    Uncorrect Error
    Error Enabled
    Processor Context Corrupt
    Memory Heirarchy Error
    Instruction Fetch
    L1 cache

    More than that I can't really say. Power and heat problems can certainly trigger MCE's. I don't know if I/O devices can influence them.

Which got me thinking that it's really not the CPU, but a problem with the core Linux kernel. Definitely for me, the problem occurs more often if I increase the load on the machine. But that doesn't say much, since this would be true for almost any problem.
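The decoding quoted above can be roughly reproduced by picking the bank status word apart bit by bit. The sketch below is my own reading of the Intel machine check architecture layout (flag bits 63 to 57, and a 16-bit error code of the form 0000 0001 RRRR TTLL for memory hierarchy errors). It is not the code Alan Cox used, and the function and table names are made up for this example.

```python
# Sketch of decoding an IA-32 machine-check bank status word.
# Assumption: bit positions and sub-field encodings follow the Intel
# architecture manuals as I understand them; all names here are hypothetical.

FLAGS = [
    (63, "Status valid"),
    (62, "Error overflow"),
    (61, "Uncorrected error"),
    (60, "Error enabled"),
    (59, "MISC register valid"),
    (58, "ADDR register valid"),
    (57, "Processor context corrupt"),
]

REQUEST = {
    0b0000: "Generic error",
    0b0001: "Generic read",
    0b0010: "Generic write",
    0b0011: "Data read",
    0b0100: "Data write",
    0b0101: "Instruction fetch",
    0b0110: "Prefetch",
    0b0111: "Eviction",
    0b1000: "Snoop",
}
TRANSACTION = {0b00: "Instruction", 0b01: "Data", 0b10: "Generic"}
LEVEL = {0b00: "L0", 0b01: "L1", 0b10: "L2", 0b11: "Generic"}


def decode_bank_status(status):
    """Return a list of human-readable strings for one bank status value."""
    out = [name for bit, name in FLAGS if (status >> bit) & 1]
    code = status & 0xFFFF  # low 16 bits: architectural error code
    if (code >> 8) == 0x01:
        # Memory hierarchy error: 0000 0001 RRRR TTLL
        out.append("Memory hierarchy error")
        out.append(REQUEST.get((code >> 4) & 0xF, "Unknown request"))
        out.append(TRANSACTION.get((code >> 2) & 0x3, "Unknown transaction"))
        out.append(LEVEL[code & 0x3] + " cache")
    else:
        out.append("Other error code 0x%04x" % code)
    return out


if __name__ == "__main__":
    for line in decode_bank_status(0xB200000000040151):
        print(line)
```

For b200000000040151 this gives the same items as the quoted decoding, plus the transaction type ("Instruction"), which the quote folds into "Instruction Fetch".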
A simple solution would be to replace the CPU and see what happens. But it's not so easy: I'm 5000 miles from the machine. Here is the output from 'dmesg'. Email me at cameron@bloke.com if you have a setup similar to mine (dual-CPU Red Hat 7.2 machine, SCSI).
Linux version 2.4.9-21smp (bhcompile@stripples.devel.redhat.com) (gcc version 2.96 20000731 (Red Hat Linux 7.1 2.96-98)) #1 SMP Thu Jan 17 14:01:48 EST 2002
BIOS-provided physical RAM map:
BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 0000000040000000 (usable)
BIOS-e820: 00000000fec00000 - 00000000fec01000 (reserved)
BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
BIOS-e820: 00000000fffc0000 - 0000000100000000 (reserved)
Scanning bios EBDA for MXT signature
128MB HIGHMEM available.
found SMP MP-table at 000fb4f0
hm, page 000fb000 reserved twice.
hm, page 000fc000 reserved twice.
hm, page 000f2000 reserved twice.
hm, page 000f3000 reserved twice.
On node 0 totalpages: 262144
zone(0): 4096 pages.
zone(1): 225280 pages.
zone(2): 32768 pages.
Intel MultiProcessor Specification v1.4
Virtual Wire compatibility mode.
OEM ID: INTEL Product ID: 440BX APIC at: 0xFEE00000
Processor #0 Pentium(tm) Pro APIC version 17
Processor #1 Pentium(tm) Pro APIC version 17
I/O APIC #2 Version 17 at 0xFEC00000.
Processors: 2
Kernel command line: ro root=/dev/hda8
Initializing CPU#0
Detected 701.604 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1399.19 BogoMIPS
Memory: 1026280k/1048576k available (1887k kernel code, 20884k reserved, 103k data, 244k init, 131072k highmem)
Dentry-cache hash table entries: 131072 (order: 8, 1048576 bytes)
Inode-cache hash table entries: 65536 (order: 7, 524288 bytes)
Mount-cache hash table entries: 16384 (order: 5, 131072 bytes)
Buffer-cache hash table entries: 65536 (order: 6, 262144 bytes)
Page-cache hash table entries: 262144 (order: 9, 2097152 bytes)
CPU: Before vendor init, caps: 0383fbff 00000000 00000000, vendor = 0
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU: After vendor init, caps: 0383fbff 00000000 00000000 00000000
CPU: After generic, caps: 0383fbff 00000000 00000000 00000000
CPU: Common caps: 0383fbff 00000000 00000000 00000000
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.40 (20010327) Richard Gooch (rgooch@atnf.csiro.au)
mtrr: detected mtrr type: Intel
CPU: Before vendor init, caps: 0383fbff 00000000 00000000, vendor = 0
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check reporting enabled on CPU#0.
CPU: After vendor init, caps: 0383fbff 00000000 00000000 00000000
CPU: After generic, caps: 0383fbff 00000000 00000000 00000000
CPU: Common caps: 0383fbff 00000000 00000000 00000000
CPU0: Intel Pentium III (Coppermine) stepping 03
per-CPU timeslice cutoff: 731.88 usecs.
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000004
ESR value after enabling vector: 00000000
Booting processor 1/1 eip 2000
Initializing CPU#1
masked ExtINT on CPU#1
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Calibrating delay loop... 1402.47 BogoMIPS
CPU: Before vendor init, caps: 0383fbff 00000000 00000000, vendor = 0
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check reporting enabled on CPU#1.
CPU: After vendor init, caps: 0383fbff 00000000 00000000 00000000
CPU: After generic, caps: 0383fbff 00000000 00000000 00000000
CPU: Common caps: 0383fbff 00000000 00000000 00000000
CPU1: Intel Pentium III (Coppermine) stepping 03
Total of 2 processors activated (2801.66 BogoMIPS).
ENABLING IO-APIC IRQs
...changing IO-APIC physical APIC ID to 2 ... ok.
init IO_APIC IRQs
IO-APIC (apicid-pin) 2-0, 2-10, 2-11, 2-18, 2-19, 2-20, 2-21, 2-22, 2-23 not connected.
..TIMER: vector=0x31 pin1=2 pin2=0
number of MP IRQ sources: 17.
number of IO-APIC #2 registers: 24.
testing the IO APIC.......................
IO APIC #2......
.... register #00: 02000000
....... : physical APIC id: 02
.... register #01: 00170011
....... : max redirection entries: 0017
....... : IO APIC version: 0011
.... register #02: 00000000
....... : arbitration: 00
.... IRQ redirection table:
NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:
00 000 00  1    0    0   0   0    0    0    00
01 003 03  0    0    0   0   0    1    1    39
02 003 03  0    0    0   0   0    1    1    31
03 003 03  0    0    0   0   0    1    1    41
04 003 03  0    0    0   0   0    1    1    49
05 003 03  0    0    0   0   0    1    1    51
06 003 03  0    0    0   0   0    1    1    59
07 003 03  0    0    0   0   0    1    1    61
08 003 03  0    0    0   0   0    1    1    69
09 003 03  0    0    0   0   0    1    1    71
0a 000 00  1    0    0   0   0    0    0    00
0b 000 00  1    0    0   0   0    0    0    00
0c 003 03  0    0    0   0   0    1    1    79
0d 003 03  0    0    0   0   0    1    1    81
0e 003 03  0    0    0   0   0    1    1    89
0f 003 03  0    0    0   0   0    1    1    91
10 003 03  1    1    0   1   0    1    1    99
11 003 03  1    1    0   1   0    1    1    A1
12 000 00  1    0    0   0   0    0    0    00
13 000 00  1    0    0   0   0    0    0    00
14 000 00  1    0    0   0   0    0    0    00
15 000 00  1    0    0   0   0    0    0    00
16 000 00  1    0    0   0   0    0    0    00
17 000 00  1    0    0   0   0    0    0    00
IRQ to pin mappings:
IRQ0 -> 0:2
IRQ1 -> 0:1
IRQ3 -> 0:3
IRQ4 -> 0:4
IRQ5 -> 0:5
IRQ6 -> 0:6
IRQ7 -> 0:7
IRQ8 -> 0:8
IRQ9 -> 0:9
IRQ10 -> 0:16
IRQ11 -> 0:17
IRQ12 -> 0:12
IRQ13 -> 0:13
IRQ14 -> 0:14
IRQ15 -> 0:15
.................................... done.
Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 701.6359 MHz.
..... host bus clock speed is 100.2331 MHz.
cpu: 0, clocks: 1002331, slice: 334110
CPU0<T0:1002320,T1:668208,D:2,S:334110,C:1002331>
cpu: 1, clocks: 1002331, slice: 334110
CPU1<T0:1002320,T1:334096,D:4,S:334110,C:1002331>
checking TSC synchronization across CPUs: passed.
PCI: PCI BIOS revision 2.10 entry at 0xfdb81, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
isapnp: Scanning for PnP cards...
isapnp: No Plug & Play device found
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
apm: BIOS version 1.2 Flags 0x03 (Driver version 1.14)
apm: disabled - APM is not SMP safe.
mxt_scan_bios: enter
Starting kswapd v1.8
allocated 64 pages and 64 bhs reserved for the highmem bounces
VFS: Diskquotas version dquot_6.5.0 initialized
Detected PS/2 Mouse Port.
pty: 2048 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS MULTIPORT SHARE_IRQ SERIAL_PCI ISAPNP enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
Real Time Clock Driver v1.10e
block: queued sectors max/low 681178kB/550106kB, 2048 slots per queue
RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz PCI bus speed for PIO modes; override with idebus=xx
PIIX4: IDE controller on PCI bus 00 dev 39
PIIX4: chipset revision 1
PIIX4: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0xffa0-0xffa7, BIOS settings: hda:DMA, hdb:DMA
ide1: BM-DMA at 0xffa8-0xffaf, BIOS settings: hdc:pio, hdd:pio
hda: IC35L040AVER07-0, ATA DISK drive
hdb: IC35L040AVER07-0, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
blk: queue c03bdba0, I/O limit 4095Mb (mask 0xffffffff)
blk: queue c03bdba0, I/O limit 4095Mb (mask 0xffffffff)
hda: 80418240 sectors (41174 MB) w/1916KiB Cache, CHS=5005/255/63, UDMA(33)
blk: queue c03bdcdc, I/O limit 4095Mb (mask 0xffffffff)
blk: queue c03bdcdc, I/O limit 4095Mb (mask 0xffffffff)
hdb: 80418240 sectors (41174 MB) w/1916KiB Cache, CHS=5005/255/63, UDMA(33)
Partition check:
hda: hda1 hda2 < hda5 hda6 hda7 hda8 hda9 >
hdb: hdb1 < hdb5 hdb6 hdb7 >
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27
md: Autodetecting RAID arrays.
md: autorun ...
md: ... autorun DONE.
pci_hotplug: PCI Hot Plug PCI Core version: 0.3
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
IP: routing cache hash table of 8192 buckets, 64Kbytes
TCP: Hash tables configured (established 262144 bind 65536)
Linux IP multicast router 0.06 plus PIM-SM
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
RAMDISK: Compressed image found at block 0
Freeing initrd memory: 450k freed
VFS: Mounted root (ext2 filesystem).
SCSI subsystem driver Revision: 1.00
(scsi0) <Adaptec AIC-7895 Ultra SCSI host adapter> found at PCI 0/14/0
(scsi0) Wide Channel A, SCSI ID=7, 32/255 SCBs
(scsi0) Downloading sequencer code... 383 instructions downloaded
(scsi1) <Adaptec AIC-7895 Ultra SCSI host adapter> found at PCI 0/14/1
(scsi1) Wide Channel B, SCSI ID=7, 32/255 SCBs
(scsi1) Downloading sequencer code... 383 instructions downloaded
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.2.4/5.2.0
<Adaptec AIC-7895 Ultra SCSI host adapter>
scsi1 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.2.4/5.2.0
<Adaptec AIC-7895 Ultra SCSI host adapter>
blk: queue c2357e18, I/O limit 4095Mb (mask 0xffffffff)
blk: queue c2357e18, I/O limit 4095Mb (mask 0xffffffff)
Journalled Block Device driver loaded
EXT3-fs: INFO: recovery required on readonly filesystem.
EXT3-fs: write access will be enabled during recovery.
kjournald starting. Commit interval 5 seconds
EXT3-fs: ide0(3,8): orphan cleanup on readonly fs
ext3_orphan_cleanup: deleting unreferenced inode 22090
ext3_orphan_cleanup: deleting unreferenced inode 22188
EXT3-fs: ide0(3,8): 2 orphan inodes deleted
EXT3-fs: recovery complete.
EXT3-fs: mounted filesystem with ordered data mode.
Freeing unused kernel memory: 244k freed
Adding Swap: 265032k swap-space (priority -1)
Adding Swap: 499896k swap-space (priority -2)
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on ide0(3,8), internal journal
kjournald starting. Commit interval 5 seconds
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on ide0(3,1), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting. Commit interval 5 seconds
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on ide0(3,6), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting. Commit interval 5 seconds
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on ide0(3,5), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting. Commit interval 5 seconds
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on ide0(3,7), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting. Commit interval 5 seconds
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on ide0(3,70), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting. Commit interval 5 seconds
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on ide0(3,71), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
parport0: PC-style at 0x378 [PCSPP,TRISTATE]
parport0: cpp_daisy: aa5500ff(38)
parport0: assign_addrs: aa5500ff(38)
parport0: cpp_daisy: aa5500ff(38)
parport0: assign_addrs: aa5500ff(38)
ip_conntrack (8192 buckets, 65536 max)
3c59x: Donald Becker and others. www.scyld.com/network/vortex.html
00:0f.0: 3Com PCI 3c905B Cyclone 100baseTx at 0xec00. Vers LK1.1.16
send_arp uses obsolete (PF_INET,SOCK_PACKET)
Okay I'm back on it again with this little gem from
Last Change: Sunday, 06-Dec-2009 20:07:29 EST
Disclaimer
The information provided within these pages is provided AS IS, and without any
warranty. Following these directions may, among other things,
crash your computer, delete all the information on your hard disk, open up security holes or cause your house to burn down.
I made these pages to provide some information about the setup that I have done,
but I did not proofread them for correctness, and in most cases did not test them.
There are commands in these pages that would definitely delete or corrupt all
the data on your computer (especially the dualboot section). In fact it happened
to me....
So you are on your own!