Nightly Snap Backup Fails - Server Crashes
Posted: Wed Nov 21, 2007 9:47 am
We seem to be having a problem with the nightly snap and backup process.
This is not a consistent error, it shows up every 3-4 days. The issue begins with the Snap process, then the server loads up to about 15 processor load and maxes out the available RAM, then Scalix crashes out and the server has to be restarted.
Occasionally we get this error when we try to turn the Snap volume off:
This is the error we are trying to troubleshoot:
Any assistance on this would be very helpful, I'm new to this job and new to Scalix and all this experience has taught me is that I'm much happier administering Exchange. I would really appreciate it if someone could help turn my opinion around, I really would prefer to use open source software whenever possible.
Thanks.
This is not a consistent error, it shows up every 3-4 days. The issue begins with the Snap process, then the server loads up to about 15 processor load and maxes out the available RAM, then Scalix crashes out and the server has to be restarted.
Occasionally we get this error when we try to turn the Snap volume off:
Code: Select all
Umounting Scalix Backup Volume
umount: /mnt/sxbackup: not mounted
Removing the snapshot logical volume
device-mapper ioctl cmd 9 failed: Cannot allocate memory
Couldn't load device 'vgscalix-sxbackup'.
Unable to deactivate logical volume "sxbackup"
[root@mail1 ~]# kill -9 12157
[root@mail1 ~]# sxsnapoff
Umounting Scalix Backup Volume
umount: /mnt/sxbackup: not mounted
Removing the snapshot logical volume
Unable to deactivate logical volume "sxbackup"
This is the error we are trying to troubleshoot:
Code: Select all
Nov 20 21:30:14 mail1 kernel: lvcreate: page allocation failure. order:0, mode:0xd0
Nov 20 21:30:14 mail1 kernel: [<c013f1bf>] __alloc_pages+0x28b/0x298
Nov 20 21:30:14 mail1 kernel: [<f88d65e7>] alloc_pl+0x27/0x3d [dm_mod]
Nov 20 21:30:14 mail1 kernel: [<f88d66c2>] client_alloc_pages+0x15/0x47 [dm_mod]
Nov 20 21:30:14 mail1 kernel: [<f88d7070>] kcopyd_client_create+0x64/0x9f [dm_mod]
Nov 20 21:30:14 mail1 kernel: [<f8ac2697>] snapshot_ctr+0x231/0x2b8 [dm_snapshot]
Nov 20 21:30:14 mail1 kernel: [<f88d3088>] dm_table_add_target+0xfc/0x169 [dm_mod]
Nov 20 21:30:14 mail1 kernel: [<f88d509b>] populate_table+0x8a/0xaf [dm_mod]
Nov 20 21:30:14 mail1 kernel: [<f88d50f7>] table_load+0x37/0xf9 [dm_mod]
Nov 20 21:30:14 mail1 kernel: [<f88d58af>] ctl_ioctl+0xd1/0x144 [dm_mod]
Nov 20 21:30:14 mail1 kernel: [<f88d50c0>] table_load+0x0/0xf9 [dm_mod]
Nov 20 21:30:14 mail1 kernel: [<c0164faa>] sys_ioctl+0x227/0x269
Nov 20 21:30:14 mail1 kernel: [<c02c62a3>] syscall_call+0x7/0xb
Nov 20 21:30:14 mail1 kernel: device-mapper: : Could not create kcopyd client
Nov 20 21:30:14 mail1 kernel:
Nov 20 21:30:15 mail1 kernel: device-mapper: error adding target to table
Nov 20 21:40:44 mail1 sshd(pam_unix)[12325]: session opened for user root by root(uid=0)
Nov 20 21:43:02 mail1 kernel: lvremove: page allocation failure. order:0, mode:0xd2
Nov 20 21:43:02 mail1 kernel: [<c013f1bf>] __alloc_pages+0x28b/0x298
Nov 20 21:43:02 mail1 kernel: [<c014e21a>] __vmalloc+0xaf/0xee
Nov 20 21:43:02 mail1 kernel: [<c014e26f>] vmalloc+0x16/0x19
Nov 20 21:43:02 mail1 kernel: [<f88d26b1>] dm_vcalloc+0x1d/0x42 [dm_mod]
Nov 20 21:43:02 mail1 kernel: [<f8ac2253>] init_exception_table+0x18/0x4f [dm_snapshot]
Nov 20 21:43:02 mail1 kernel: [<f8ac2416>] init_hash_tables+0x88/0xd8 [dm_snapshot]
Nov 20 21:43:02 mail1 kernel: [<f8ac2641>] snapshot_ctr+0x1db/0x2b8 [dm_snapshot]
Nov 20 21:43:02 mail1 kernel: [<f88d3088>] dm_table_add_target+0xfc/0x169 [dm_mod]
Nov 20 21:43:02 mail1 kernel: [<f88d509b>] populate_table+0x8a/0xaf [dm_mod]
Nov 20 21:43:02 mail1 kernel: [<f88d50f7>] table_load+0x37/0xf9 [dm_mod]
Nov 20 21:43:02 mail1 kernel: [<f88d58af>] ctl_ioctl+0xd1/0x144 [dm_mod]
Nov 20 21:43:02 mail1 kernel: [<f88d50c0>] table_load+0x0/0xf9 [dm_mod]
Nov 20 21:43:02 mail1 kernel: [<c0164faa>] sys_ioctl+0x227/0x269
Nov 20 21:43:02 mail1 kernel: [<c02c62a3>] syscall_call+0x7/0xb
Nov 20 21:43:02 mail1 kernel: device-mapper: : Unable to allocate hash table space
Nov 20 21:43:02 mail1 kernel:
Nov 20 21:43:02 mail1 kernel: device-mapper: error adding target to table
Any assistance on this would be very helpful, I'm new to this job and new to Scalix and all this experience has taught me is that I'm much happier administering Exchange. I would really appreciate it if someone could help turn my opinion around, I really would prefer to use open source software whenever possible.
Thanks.