OpenVZ Forum


Home » General » Support » Kernel oops caused crash twice now on centos 6 kernel
Kernel oops caused crash twice now on centos 6 kernel [message #43312] Tue, 23 August 2011 13:50 Go to next message
bjdea1 is currently offline  bjdea1
Messages: 39
Registered: February 2009
Member
Hi,
my server has crashed twice now on the RHEL6 042stab033.1 kernel. I'm running OpenVZ on a Dell 1950. I get the following in /var/log/messages

Aug 23 15:53:46 aunode3 kernel: [610041.706043] NOHZ: local_softirq_pending 100
Aug 23 18:47:40 aunode3 kernel: [620475.613884] Core dump to |/usr/libexec/abrt-hook-ccpp /var/spool/abrt 12447 11 516 0 pipe failed
Aug 23 22:24:36 aunode3 kernel: [633492.036001] BUG: soft lockup - CPU#0 stuck for 67s! [migration/0:3]
Aug 23 22:24:36 aunode3 kernel: [633492.036001] Modules linked in: vzethdev(U) simfs(U) vzrst(U) nf_nat(U) nf_conntrack_ipv4(U) nf_defrag_ipv4(U) vzcpt(U) nfs(U) lockd(U) fscache($
Aug 23 22:24:36 aunode3 kernel: ed: speedstep_lib]
Aug 23 22:24:36 aunode3 kernel: [633492.036001] CPU 0:
Aug 23 22:24:36 aunode3 kernel: [633492.036001] Modules linked in: vzethdev(U) simfs(U) vzrst(U) nf_nat(U) nf_conntrack_ipv4(U) nf_defrag_ipv4(U) vzcpt(U) nfs(U) lockd(U) fscache($
Aug 23 22:24:36 aunode3 kernel: ed: speedstep_lib]
Aug 23 22:24:36 aunode3 kernel: [633492.036001] Pid: 3, comm: migration/0 Not tainted 2.6.32_042stab031_deasoft #1 042stab031 PowerEdge 1950
Aug 23 22:24:36 aunode3 kernel: [633492.036001] RIP: 0010:[<ffffffff81060b60>] [<ffffffff81060b60>] migration_thread+0x170/0x2e0
Aug 23 22:24:36 aunode3 kernel: [633492.036001] RSP: 0000:ffff88043e53be80 EFLAGS: 00000207
Aug 23 22:24:36 aunode3 kernel: [633492.036001] RAX: 0000000000000001 RBX: ffff88043e53bee0 RCX: 0000000000000000
Aug 23 22:24:36 aunode3 kernel: [633492.036001] RDX: ffff88002829e440 RSI: ffff88002829e440 RDI: ffff88002829e440
Aug 23 22:24:36 aunode3 kernel: [633492.036001] RBP: ffffffff8100bd8e R08: 0000000000000000 R09: 0000000000000001
Aug 23 22:24:36 aunode3 kernel: [633492.036001] R10: 00000000ffffffff R11: 0000000000000000 R12: 0000000000000000
Aug 23 22:24:36 aunode3 kernel: [633492.036001] R13: ffff88002828f7b8 R14: 0000000000000000 R15: 0000000000000001
Aug 23 22:24:36 aunode3 kernel: [633492.036001] FS: 0000000000000000(0000) GS:ffff880028200000(0000) knlGS:0000000000000000
Aug 23 22:24:36 aunode3 kernel: [633492.036001] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
Aug 23 22:24:36 aunode3 kernel: [633492.036001] CR2: 00007ff3f802a218 CR3: 0000000001a25000 CR4: 00000000000006f0
Aug 23 22:24:36 aunode3 kernel: [633492.036001] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug 23 22:24:36 aunode3 kernel: [633492.036001] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Aug 23 22:24:36 aunode3 kernel: [633492.036001] Call Trace:
Aug 23 22:24:36 aunode3 kernel: [633492.036001] [<ffffffff810609f0>] ? migration_thread+0x0/0x2e0
Aug 23 22:24:36 aunode3 kernel: [633492.036001] [<ffffffff81091b06>] ? kthread+0x96/0xa0
Aug 23 22:24:36 aunode3 kernel: [633492.036001] [<ffffffff8100c2ca>] ? child_rip+0xa/0x20
Aug 23 22:24:36 aunode3 kernel: [633492.036001] [<ffffffff81091a70>] ? kthread+0x0/0xa0
Aug 23 22:24:36 aunode3 kernel: [633492.036001] [<ffffffff8100c2c0>] ? child_rip+0x0/0x20
Aug 23 22:24:40 aunode3 abrt: Kerneloops: Reported 1 kernel oopses to Abrt
Aug 23 22:24:40 aunode3 abrtd: Directory 'kerneloops-1314102280-1920-1' creation detected
Aug 23 23:23:29 aunode3 kernel: imklog 4.6.2, log source = /proc/kmsg started.


Note I have added a small patch to this kernel myself however I don't believe my patch would have any significant effect, its a tiny simple patch that adds function of putting a process to sleep. I'm not using this feature at all atm, it needs to be activated before it does anything and its off by default.

After rebooting the oops comes right back again but the server keeps running ok until it crashes at random time, usually about a week or 2 later.

Any ideas? How to get this fixed?


Deasoft.com Hosting/Software
AutoBillMe.com Billing Automation
Re: Kernel oops caused crash twice now on centos 6 kernel [message #43321 is a reply to message #43312] Wed, 24 August 2011 06:31 Go to previous messageGo to next message
cyrolancer is currently offline  cyrolancer
Messages: 15
Registered: July 2011
Junior Member
I am not really sure about the problem, but maybe you can try the unmodified kernel for a while, and check it crashes while using.
Re: Kernel oops caused crash twice now on centos 6 kernel [message #43323 is a reply to message #43321] Wed, 24 August 2011 12:34 Go to previous message
bjdea1 is currently offline  bjdea1
Messages: 39
Registered: February 2009
Member
This is a bug and the OpenVZ developers have written a patch. Bug number is 1954. Note their patch is still being tested at present.

Good job OpenVZ Developers! I was glad to see how quickly the OpenVZ development team investigated this bug and wrote a patch. Keep up the good work!

Hopefully this will be solved with patch once and for all.


Deasoft.com Hosting/Software
AutoBillMe.com Billing Automation

[Updated on: Wed, 24 August 2011 12:38]

Report message to a moderator

Previous Topic: No package ovzkernel.x86_64 available.
Next Topic: Show ve id and name in "ps" output
Goto Forum:
  


Current Time: Fri Oct 24 09:08:22 GMT 2025

Total time taken to generate the page: 0.12729 seconds