System crash [message #28303]
Thu, 13 March 2008 13:08
Hurlyburly
Hello,
after about 4 months of uptime, my system crashed this morning for reasons I cannot explain. Can anyone tell from the attached excerpt of /var/log/syslog where the problem comes from and how I can avoid it in the future?
I am running kernel 2.6.18-1-openvz under Debian Etch on a system with:
Xeon Quad Core 3220, 2.40 GHz, 8 MB, FSB 1066
Supermicro PDSMI+ mainboard
4096 MB ECC DDR2-667 RAM from ATP (4 x 1024 MB)
3ware 9650-2LP RAID controller with 2 x 320 GB SATA II Western Digital drives
A RAM check with Memtester reported no errors.
------- /var/log/syslog -------
Mar 13 05:00:08 s1 kernel: BUG: unable to handle kernel paging request at virtual address 00003f82
Mar 13 05:00:08 s1 kernel: printing eip:
Mar 13 05:00:08 s1 kernel: 00000037
Mar 13 05:00:08 s1 kernel: *pde = 17cf2067
Mar 13 05:00:08 s1 kernel: *pte = 00000000
Mar 13 05:00:08 s1 kernel: Oops: 0002 [#1]
Mar 13 05:00:08 s1 kernel: SMP
Mar 13 05:00:08 s1 kernel: Modules linked in: vzethdev vznetdev simfs vzrst ip_nat vzcpt ip_conntrack nfnetlink vzmon xt_tcpudp xt_length ipt_ttl xt_tcpmss ipt_TCPMSS iptable_mangle iptable_filter xt_multiport xt_limit ipt_tos ipt_REJECT ip_tables x_tables vzdquota vzdev ipv6 button ac battery dm_snapshot dm_mirror dm_mod loop floppy evdev i2c_i801 parport_pc parport shpchp pci_hotplug psmouse i2c_core serio_raw pcspkr rtc ata_piix libata sd_mod generic piix ide_core ehci_hcd e1000 uhci_hcd usbcore 3w_9xxx scsi_mod thermal processor fan
Mar 13 05:00:08 s1 kernel: CPU: 0, VCPU: 501.1
Mar 13 05:00:08 s1 kernel: EIP: 0060:[<00000037>] Not tainted VLI
Mar 13 05:00:08 s1 kernel: EFLAGS: 00010246 (2.6.18-1-openvz #1)
Mar 13 05:00:08 s1 kernel: EIP is at 0x37
Mar 13 05:00:08 s1 kernel: eax: 00004000 ebx: 00000004 ecx: 00000286 edx: 00004000
Mar 13 05:00:08 s1 kernel: esi: e133bf90 edi: ffffffe0 ebp: 00000001 esp: e133be94
Mar 13 05:00:08 s1 kernel: ds: 007b es: 007b ss: 0068
Mar 13 05:00:08 s1 kernel: Process vms (pid: 19303, veid: 501, ti=e133a000 task=dfcd5910 task.ti=e133a000)
Mar 13 05:00:08 s1 kernel: Stack: c014d0dc c0182a97 00000000 00000000 00000000 00000000 00000030 00000030
Mar 13 05:00:08 s1 kernel: c0183627 e133c010 f4c7d540 cb771600 00000030 00000000 00000001 00000000
Mar 13 05:00:08 s1 kernel: 00001000 00000000 00001000 00000000 00001000 00000000 00001000 00000000
Mar 13 05:00:08 s1 kernel: Call Trace:
Mar 13 05:00:08 s1 kernel: [<c014d0dc>] put_compound_page+0x25/0x26
Mar 13 05:00:08 s1 kernel: [<c0182a97>] splice_to_pipe+0x1df/0x1ed
Mar 13 05:00:08 s1 kernel: [<c0183627>] sys_vmsplice+0x236/0x25f
Mar 13 05:00:08 s1 kernel: Code: Bad EIP value.
Mar 13 05:00:08 s1 kernel: EIP: [<00000037>] 0x37 SS:ESP 0068:e133be94
Mar 13 05:00:13 s1 kernel: BUG: unable to handle kernel paging request at virtual address 00003f82
Mar 13 05:00:13 s1 kernel: printing eip:
Mar 13 05:00:13 s1 kernel: 00000037
Mar 13 05:00:13 s1 kernel: *pde = 17cf2067
Mar 13 05:00:13 s1 kernel: *pte = 00000000
Mar 13 05:00:13 s1 kernel: Oops: 0002 [#2]
Mar 13 05:00:13 s1 kernel: SMP
Mar 13 05:00:13 s1 kernel: Modules linked in: vzethdev vznetdev simfs vzrst ip_nat vzcpt ip_conntrack nfnetlink vzmon xt_tcpudp xt_length ipt_ttl xt_tcpmss ipt_TCPMSS iptable_mangle iptable_filter xt_multiport xt_limit ipt_tos ipt_REJECT ip_tables x_tables vzdquota vzdev ipv6 button ac battery dm_snapshot dm_mirror dm_mod loop floppy evdev i2c_i801 parport_pc parport shpchp pci_hotplug psmouse i2c_core serio_raw pcspkr rtc ata_piix libata sd_mod generic piix ide_core ehci_hcd e1000 uhci_hcd usbcore 3w_9xxx scsi_mod thermal processor fan
Mar 13 05:00:13 s1 kernel: CPU: 0, VCPU: 501.2
Mar 13 05:00:13 s1 kernel: EIP: 0060:[<00000037>] Not tainted VLI
Mar 13 05:00:13 s1 kernel: EFLAGS: 00010246 (2.6.18-1-openvz #1)
Mar 13 05:00:13 s1 kernel: EIP is at 0x37
Mar 13 05:00:13 s1 kernel: eax: 00004000 ebx: 00000004 ecx: 00000286 edx: 00004000
Mar 13 05:00:13 s1 kernel: esi: e7c51f90 edi: ffffffe0 ebp: 00000001 esp: e7c51e94
Mar 13 05:00:13 s1 kernel: ds: 007b es: 007b ss: 0068
Mar 13 05:00:13 s1 kernel: Process vms (pid: 19318, veid: 501, ti=e7c50000 task=f43ea7c0 task.ti=e7c50000)
Mar 13 05:00:13 s1 kernel: Stack: c014d0dc c0182a97 00000000 00000000 00000000 00000000 00000030 00000030
Mar 13 05:00:13 s1 kernel: c0183627 e7c52010 dfa21500 cb771600 00000030 00000000 00000001 00000000
Mar 13 05:00:13 s1 kernel: 00001000 00000000 00001000 00000000 00001000 00000000 00001000 00000000
Mar 13 05:00:13 s1 kernel: Call Trace:
Mar 13 05:00:13 s1 kernel: [<c014d0dc>] put_compound_page+0x25/0x26
Mar 13 05:00:13 s1 kernel: [<c0182a97>] splice_to_pipe+0x1df/0x1ed
Mar 13 05:00:13 s1 kernel: [<c0183627>] sys_vmsplice+0x236/0x25f
Mar 13 05:00:13 s1 kernel: Code: Bad EIP value.
Mar 13 05:00:13 s1 kernel: EIP: [<00000037>] 0x37 SS:ESP 0068:e7c51e94
Mar 13 05:00:19 s1 kernel: BUG: unable to handle kernel paging request at virtual address 00003f82
Mar 13 05:00:19 s1 kernel: printing eip:
Mar 13 05:00:19 s1 kernel: 00000037
Mar 13 05:00:19 s1 kernel: *pde = 1c7d1067
Mar 13 05:00:19 s1 kernel: *pte = 00000000
Mar 13 05:00:19 s1 kernel: Oops: 0002 [#3]
Mar 13 05:00:19 s1 kernel: SMP
Mar 13 05:00:19 s1 kernel: Modules linked in: vzethdev vznetdev simfs vzrst ip_nat vzcpt ip_conntrack nfnetlink vzmon xt_tcpudp xt_length ipt_ttl xt_tcpmss ipt_TCPMSS iptable_mangle iptable_filter xt_multiport xt_limit ipt_tos ipt_REJECT ip_tables x_tables vzdquota vzdev ipv6 button ac battery dm_snapshot dm_mirror dm_mod loop floppy evdev i2c_i801 parport_pc parport shpchp pci_hotplug psmouse i2c_core serio_raw pcspkr rtc ata_piix libata sd_mod generic piix ide_core ehci_hcd e1000 uhci_hcd usbcore 3w_9xxx scsi_mod thermal processor fan
Mar 13 05:00:19 s1 kernel: CPU: 3, VCPU: 501.3
Mar 13 05:00:19 s1 kernel: EIP: 0060:[<00000037>] Not tainted VLI
Mar 13 05:00:19 s1 kernel: EFLAGS: 00010246 (2.6.18-1-openvz #1)
Mar 13 05:00:19 s1 kernel: EIP is at 0x37
Mar 13 05:00:19 s1 kernel: eax: 00004000 ebx: 00000004 ecx: 00000286 edx: 00004000
Mar 13 05:00:19 s1 kernel: esi: e7c51f90 edi: ffffffe0 ebp: 00000001 esp: e7c51e94
Mar 13 05:00:19 s1 kernel: ds: 007b es: 007b ss: 0068
Mar 13 05:00:19 s1 kernel: Process vms (pid: 19337, veid: 501, ti=e7c50000 task=f43ea7c0 task.ti=e7c50000)
Mar 13 05:00:19 s1 kernel: Stack: c014d0dc c0182a97 00000000 00000000 00000000 00000000 00000030 00000030
Mar 13 05:00:19 s1 kernel: c0183627 e7c52010 df9ef380 c4700e00 00000030 00000000 00000001 00000000
Mar 13 05:00:19 s1 kernel: 00001000 00000000 00001000 00000000 00001000 00000000 00001000 00000000
Mar 13 05:00:19 s1 kernel: Call Trace:
Mar 13 05:00:19 s1 kernel: [<c014d0dc>] put_compound_page+0x25/0x26
Mar 13 05:00:19 s1 kernel: [<c0182a97>] splice_to_pipe+0x1df/0x1ed
Mar 13 05:00:19 s1 kernel: [<c0183627>] sys_vmsplice+0x236/0x25f
Mar 13 05:00:19 s1 kernel: Code: Bad EIP value.
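Because the excerpt contains several oopses in a row, it can help to check whether they all come from the same process and follow the same call path. The following is only a minimal sketch, assuming the log is available as text in the syslog format shown above. The function name `summarize_oopses`, the regular expressions, and the shortened `SAMPLE` string are illustrative assumptions and not part of any OpenVZ or kernel tool.

```python
import re
from collections import Counter

# Oops header, e.g. "kernel: Oops: 0002 [#1]".
OOPS_RE = re.compile(r"kernel: Oops: \S+ \[#(\d+)\]")
# Process line, e.g. "kernel: Process vms (pid: 19303, veid: 501, ...".
PROC_RE = re.compile(r"kernel: Process (\S+) \(pid: (\d+), veid: (\d+)")
# Call-trace frame, e.g. "kernel: [<c0182a97>] splice_to_pipe+0x1df/0x1ed".
FRAME_RE = re.compile(r"kernel:\s+\[<[0-9a-f]+>\]\s+(\w+)\+0x[0-9a-f]+/0x[0-9a-f]+")

# Shortened sample taken from the excerpt above (assumption: same line format).
SAMPLE = """\
Mar 13 05:00:08 s1 kernel: Oops: 0002 [#1]
Mar 13 05:00:08 s1 kernel: Process vms (pid: 19303, veid: 501, ti=e133a000 task=dfcd5910 task.ti=e133a000)
Mar 13 05:00:08 s1 kernel: Call Trace:
Mar 13 05:00:08 s1 kernel: [<c014d0dc>] put_compound_page+0x25/0x26
Mar 13 05:00:08 s1 kernel: [<c0182a97>] splice_to_pipe+0x1df/0x1ed
Mar 13 05:00:08 s1 kernel: [<c0183627>] sys_vmsplice+0x236/0x25f
Mar 13 05:00:13 s1 kernel: Oops: 0002 [#2]
Mar 13 05:00:13 s1 kernel: Process vms (pid: 19318, veid: 501, ti=e7c50000 task=f43ea7c0 task.ti=e7c50000)
Mar 13 05:00:13 s1 kernel: Call Trace:
Mar 13 05:00:13 s1 kernel: [<c014d0dc>] put_compound_page+0x25/0x26
Mar 13 05:00:13 s1 kernel: [<c0182a97>] splice_to_pipe+0x1df/0x1ed
Mar 13 05:00:13 s1 kernel: [<c0183627>] sys_vmsplice+0x236/0x25f
"""


def summarize_oopses(log_text):
    """Return one dict per oops with its number, process, pid, veid and trace symbols."""
    oopses = []
    current = None
    for line in log_text.splitlines():
        match = OOPS_RE.search(line)
        if match:
            # A new oops header starts a new record.
            current = {"number": int(match.group(1)), "process": None,
                       "pid": None, "veid": None, "trace": []}
            oopses.append(current)
            continue
        if current is None:
            continue  # Ignore lines before the first oops header.
        match = PROC_RE.search(line)
        if match:
            current["process"] = match.group(1)
            current["pid"] = int(match.group(2))
            current["veid"] = int(match.group(3))
            continue
        match = FRAME_RE.search(line)
        if match:
            current["trace"].append(match.group(1))
    return oopses


if __name__ == "__main__":
    records = summarize_oopses(SAMPLE)
    # Count how often each distinct call path occurs.
    paths = Counter(" <- ".join(r["trace"]) for r in records)
    for path, count in paths.items():
        print(count, path)
    for r in records:
        print(r["number"], r["process"], r["pid"], r["veid"])
```

Run against the full syslog instead of `SAMPLE`, this would show at a glance whether every oops belongs to the same VE and ends in the same system call.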
Re: System crash, continued [message #28374 is a reply to message #28354]
Sat, 15 March 2008 12:49
Hurlyburly
I compiled the kernel myself, following this guide: http://wiki.openvz.org/Compiling_the_OpenVZ_Kernel_%28the_debian_way%29
# apt-get install kernel-package linux-source-2.6.18 kernel-patch-openvz libncurses5-dev
As I said: everything ran fine for months. Then came the system crash and a hard reset. One day later a VE froze: vzlist still reports the status "running", but the VE is unreachable from outside, it cannot be entered with vzctl enter, and it cannot be stopped with vzctl stop. In /var/log/messages it looks like this:
Mar 14 03:16:16 s1 -- MARK --
Mar 14 03:34:55 s1 kernel: c02a1328
Mar 14 03:34:55 s1 kernel: Modules linked in: vzethdev vznetdev simfs vzrst ip_nat vzcpt ip_conntrack nfnetlink vzdquota vzmon vzdev xt_tcpudp xt_length ipt_ttl xt_tcpmss ipt_TCPMSS iptable_mangle iptable_filter xt_multiport xt_limit ipt_tos ipt_REJECT ip_tables x_tables button ac battery ipv6 dm_snapshot dm_mirror dm_mod loop parport_pc i2c_i801 floppy parport psmouse i2c_core rtc serio_raw pcspkr shpchp pci_hotplug evdev ata_piix libata sd_mod generic ehci_hcd piix ide_core e1000 3w_9xxx scsi_mod uhci_hcd usbcore thermal processor fan
Mar 14 03:34:55 s1 kernel: EIP: 0060:[<c02a1328>] Not tainted VLI
Mar 14 03:34:55 s1 kernel: EFLAGS: 00010002 (2.6.18-1-openvz #1)
Mar 14 03:34:55 s1 kernel: [<f8f40b7d>] cpt_resume+0x42/0x1ab [vzcpt]
Mar 14 03:34:55 s1 kernel: [<f8f3f89c>] cpt_ioctl+0x510/0xb09 [vzcpt]
Mar 14 03:34:55 s1 kernel: [<c0170ae3>] do_ioctl+0x47/0x5d
Mar 14 03:34:55 s1 kernel: [<c0170d43>] vfs_ioctl+0x24a/0x25c
Mar 14 03:34:55 s1 kernel: [<c0170d81>] sys_ioctl+0x2c/0x43
Mar 14 03:34:55 s1 kernel: [<c010295f>] syscall_call+0x7/0xb
Mar 14 03:56:17 s1 -- MARK --
Edit: I just noticed that, after a VE is restarted, the entry "Disabled Privacy Extensions" now appears in the log file. What does it mean? (This entry does not appear on my other system.)
Mar 13 10:36:16 s1 kernel: VE: 101: started
Mar 13 10:36:19 s1 kernel: lo: Disabled Privacy Extensions
[Updated on: Sat, 15 March 2008 13:01]
Re: System crash [message #28659 is a reply to message #28620]
Wed, 26 March 2008 07:45
curx
@Xandrian:
> [...] CentOS and have a custom kernel based on the OpenVZ RHEL 5 patch [...]
Which OpenVZ patch version did you use here?
[Updated on: Wed, 26 March 2008 07:45]