Dedicated server crashes [message #52055]
Tue, 19 May 2015 17:31 |
koka3000
Hello. I have a server running about 20 OpenVZ containers. It normally runs fine, with roughly half of the RAM and half of the swap in use.
Then, over the course of two hours, the following happens:
RAM:
s008.radikal.ru/i304/1505/c4/1e8494f75e32.png
SWAP:
s019.radikal.ru/i636/1505/0f/b9921047fc93.png
CPU:
s019.radikal.ru/i602/1505/32/eb6ded3def70.png
As you can see, RAM usage does not change. Swap usage starts growing quickly, and in the end the server crashes.
Here is what I found in syslog:
May 16 4:40:05 ns376306 kernel: [1617341.346380] INFO: task kswapd0:101 blocked for more than 120 seconds.
May 16 4:40:05 ns376306 kernel: [1617341.346602] Tainted: P --------------- 2.6.32-openvz-amd64 #1
May 16 4:40:05 ns376306 kernel: [1617341.347034] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
May 16 4:40:05 ns376306 kernel: [1617341.347469] kswapd0 D ffff881076c343c0 0 101 2 0 0x00000000
May 16 4:40:05 ns376306 kernel: [1617341.347474] ffff881076c39700 0000000000000046 0000000000000000 ffffffff8126f8a0
May 16 4:40:05 ns376306 kernel: [1617341.347477] ffff881076c39750 ffffffff8129262e 0000000000000008 0000000000800010
May 16 4:40:05 ns376306 kernel: [1617341.347479] 0000000000000000 000000016048def5 ffff881076c34988 000000000001ec80
May 16 4:40:05 ns376306 kernel: [1617341.347481] Call Trace:
May 16 4:40:05 ns376306 kernel: [1617341.347486] [<ffffffff8126f8a0>] ? generic_make_request+0x240/0x5a0
May 16 4:40:05 ns376306 kernel: [1617341.347489] [<ffffffff8129262e>] ? radix_tree_tag_clear+0x1e/0x200
May 16 4:40:05 ns376306 kernel: [1617341.347492] [<ffffffff811818f0>] ? wait_for_discard+0x0/0x20
May 16 4:40:05 ns376306 kernel: [1617341.347494] [<ffffffff811818fe>] wait_for_discard+0xe/0x20
May 16 4:40:05 ns376306 kernel: [1617341.347498] [<ffffffff81530fff>] __wait_on_bit+0x5f/0x90
May 16 4:40:05 ns376306 kernel: [1617341.347500] [<ffffffff81175f61>] ? page_check_address+0x141/0x1c0
May 16 4:40:05 ns376306 kernel: [1617341.347502] [<ffffffff811818f0>] ? wait_for_discard+0x0/0x20
May 16 4:40:05 ns376306 kernel: [1617341.347504] [<ffffffff815310a8>] out_of_line_wait_on_bit+0x78/0x90
May 16 4:40:05 ns376306 kernel: [1617341.347508] [<ffffffff810a2470>] ? wake_bit_function+0x0/0x40
May 16 4:40:05 ns376306 kernel: [1617341.347510] [<ffffffff81181d11>] scan_swap_map+0x401/0x640
May 16 4:40:05 ns376306 kernel: [1617341.347512] [<ffffffff8118208d>] get_swap_page+0x9d/0x140
May 16 4:40:05 ns376306 kernel: [1617341.347514] [<ffffffff8117f227>] add_to_swap+0x17/0x90
May 16 4:40:05 ns376306 kernel: [1617341.347517] [<ffffffff81152dc7>] T.1175+0x297/0xa80
May 16 4:40:05 ns376306 kernel: [1617341.347519] [<ffffffff81153927>] shrink_inactive_list+0x377/0x9f0
May 16 4:40:05 ns376306 kernel: [1617341.347521] [<ffffffff8114f946>] ? __pagevec_release+0x26/0x40
May 16 4:40:05 ns376306 kernel: [1617341.347523] [<ffffffff81151738>] ? move_active_pages_to_lru+0x1a8/0x1f0
May 16 4:40:05 ns376306 kernel: [1617341.347525] [<ffffffff81151ed3>] ? shrink_active_list+0x2c3/0x390
May 16 4:40:05 ns376306 kernel: [1617341.347528] [<ffffffff8114b5ca>] ? determine_dirtyable_memory+0x1a/0x30
May 16 4:40:05 ns376306 kernel: [1617341.347530] [<ffffffff8114b687>] ? get_dirty_limits+0x27/0x320
May 16 4:40:05 ns376306 kernel: [1617341.347532] [<ffffffff811543c0>] shrink_lruvec+0x420/0x600
May 16 4:40:05 ns376306 kernel: [1617341.347536] [<ffffffff81015019>] ? read_tsc+0x9/0x20
May 16 4:40:05 ns376306 kernel: [1617341.347546] [<ffffffffa039a19e>] ? nfs_access_cache_shrinker+0x1ce/0x210 [nfs]
May 16 4:40:05 ns376306 kernel: [1617341.347548] [<ffffffff81154777>] shrink_zone+0x1d7/0x400
May 16 4:40:05 ns376306 kernel: [1617341.347550] [<ffffffff81155983>] balance_pgdat+0x9d3/0xb50
May 16 4:40:05 ns376306 kernel: [1617341.347552] [<ffffffff81160d00>] ? refresh_zone_stat_thresholds+0x0/0xc0
May 16 4:40:05 ns376306 kernel: [1617341.347554] [<ffffffff81155c7f>] kswapd+0x17f/0x3f0
May 16 4:40:05 ns376306 kernel: [1617341.347556] [<ffffffff810a23f0>] ? autoremove_wake_function+0x0/0x40
May 16 4:40:05 ns376306 kernel: [1617341.347558] [<ffffffff81155b00>] ? kswapd+0x0/0x3f0
May 16 4:40:05 ns376306 kernel: [1617341.347560] [<ffffffff810a1dd6>] kthread+0x96/0xa0
May 16 4:40:05 ns376306 kernel: [1617341.347562] [<ffffffff8100c34a>] child_rip+0xa/0x20
May 16 4:40:05 ns376306 kernel: [1617341.347564] [<ffffffff810a1d40>] ? kthread+0x0/0xa0
May 16 4:40:05 ns376306 kernel: [1617341.347566] [<ffffffff8100c340>] ? child_rip+0x0/0x20
There are many similar messages in the log.
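To see how many of these hung-task reports there are and which tasks they name, something like the following Python sketch could be used. It only assumes the "INFO: task NAME:PID blocked for more than N seconds." line format shown in the excerpt above; the function name `count_hung_tasks` is made up for illustration.

```python
import re
from collections import Counter

# Matches the kernel hung-task report line as it appears in the excerpt, e.g.
# "INFO: task kswapd0:101 blocked for more than 120 seconds."
HUNG_RE = re.compile(
    r"INFO: task (?P<name>.+?):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def count_hung_tasks(lines):
    """Count hung-task reports per task name in an iterable of log lines."""
    counts = Counter()
    for line in lines:
        match = HUNG_RE.search(line)
        if match:
            counts[match.group("name")] += 1
    return counts


# Small demo on two lines copied from the log above; only the first is a report header.
sample = [
    "May 16 4:40:05 ns376306 kernel: [1617341.346380] "
    "INFO: task kswapd0:101 blocked for more than 120 seconds.",
    "May 16 4:40:05 ns376306 kernel: [1617341.347481] Call Trace:",
]
for name, n in count_hung_tasks(sample).most_common():
    print(f"{name}: {n}")
```

To run it against the real log, pass an open file object instead of `sample`, for example `count_hung_tasks(open("/var/log/syslog", errors="replace"))`. The path is an assumption and depends on the distribution.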