Cluster deadlock issue


Hi,

I have a 3-node Exchange cluster, and I am repeatedly getting a cluster deadlock issue. I created the cluster resources using the DAG model in Exchange 2010.

Please find the cluster log below.

0000553c.00004500::2012/05/22-09:03:27.012 info  [rhs] enabling rhs termination watchdog timeout 1200000 , recovery action 3.
0000553c.00004500::2012/05/22-09:03:27.012 err   [rhs] resource cluster ip address handling deadlock. cleaning current operation , terminating rhs process.
0000553c.00004500::2012/05/22-09:03:27.012 err   [rhs] send wer report.
00001b1c.000013c8::2012/05/22-09:03:27.043 warn  [rcm] resourcecontrol(get_private_properties) cluster ip address returned 5038.
00001b1c.00003b1c::2012/05/22-09:03:27.059 err   [rcm] rcm::rcmresource::control: error_resource_call_timed_out(5910)' because of 'pending controls list has been cleared.'
00001b1c.00003b1c::2012/05/22-09:03:27.059 err   [rcm] rcm::rcmrescontrol::doresourcecontrol: error_resource_call_timed_out(5910)' because of 'resourcecontrol( get_ro_private_properties ) failed resource 'cluster ip address'.'
00001b1c.00003b1c::2012/05/22-09:03:27.059 warn  [rcm] resourcecontrol(get_ro_private_properties) cluster ip address returned 5910.
00001b1c.00003b1c::2012/05/22-09:03:27.059 info  [rcm] handlemonitorreply: openresource 'cluster name', gen(1) result 0.
00002e84.00003928::2012/05/22-09:03:31.162 info  [res] ip address <cluster ip address>: nbt interface \device\netbt_if1 (instance 0x94c0e32e) no longer valid, status 2.
00001b1c.00004700::2012/05/22-09:05:00.816 info  [nm] received request client address sebro1038.
00001b1c.000013c8::2012/05/22-09:05:01.206 info  [nm] received request client address sebro1038.
00001b1c.00004700::2012/05/22-09:06:08.848 info  [nm] received request client address 169.254.2.136.
00001b1c.00003b1c::2012/05/22-09:07:01.623 info  [nm] received request client address sebro1038.
00001b1c.00003578::2012/05/22-09:08:31.167 err   [rcm] rcm::rcmrescontrol::doresourcecontrol: error_resource_call_timed_out(5910)' because of 'control(get_private_properties) resource 'cluster ip address' timed out.'
00001b1c.00003578::2012/05/22-09:08:31.167 warn  [rcm] resourcecontrol(get_private_properties) cluster ip address returned 5910.
00002e84.00004c14::2012/05/22-09:08:32.010 err   [rhs] rhscall::deadlockmonitor: call openresource timed out resource 'cluster ip address'.
00002e84.00004c14::2012/05/22-09:08:32.010 info  [rhs] enabling rhs termination watchdog timeout 1200000 , recovery action 3.
00002e84.00004c14::2012/05/22-09:08:32.010 err   [rhs] resource cluster ip address handling deadlock. cleaning current operation , terminating rhs process.
00001b1c.00003578::2012/05/22-09:08:32.010 warn  [rcm] handlemonitorreply: failurenotification 'cluster ip address', gen(1) result 4.
00002e84.00004c14::2012/05/22-09:08:32.010 err   [rhs] send wer report.
00001b1c.00003578::2012/05/22-09:08:32.010 warn  [rcm] rcm::rcmresource::handlemonitorreply: resource 'cluster ip address' consecutive failure count 13. moving resource poisoned state.
00002e84.00004c14::2012/05/22-09:08:32.025 err   [rhs] wer report submitted. result : werdisabled.
00001b1c.00003578::2012/05/22-09:08:32.025 err   [rcm] rcm::rcmmonitor::recoverprocess: recovering monitor process 11908 / 0x2e84
00001b1c.00003578::2012/05/22-09:08:32.025 info  [rcm] created monitor process 21492 / 0x53f4
000053f4.0000543c::2012/05/22-09:08:32.041 info  [rhs] initializing.
00001b1c.00003578::2012/05/22-09:08:32.041 info  [rcm] rcm::rcmresource::reattachtomonitorprocess: (cluster name, waitingtocomeonline)
00001b1c.00003578::2012/05/22-09:08:32.041 info  [rcm] leaving state waitingtocomeonline of cluster name as-is after monitor restart.
00001b1c.00003578::2012/05/22-09:08:32.041 info  [rcm] rcm::rcmresource::reattachtomonitorprocess: (cluster ip address, poisoned)
00001b1c.00003578::2012/05/22-09:08:32.041 warn  [rcm] canceling pending control get_private_properties resource 'cluster ip address' due monitor crash.
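
To make a log like the one above easier to scan, here is a minimal Python sketch that splits each line into its fields and picks out the deadlock errors. It assumes every line follows the `pid.tid::timestamp level [component] message` layout seen in this excerpt; the function names `parse_line` and `summarize` are made up for illustration and are not part of any cluster tooling.

```python
import re
from collections import Counter
from datetime import datetime

# Layout assumed from the excerpt: hexpid.hextid::YYYY/MM/DD-HH:MM:SS.mmm level [component] message
LINE_RE = re.compile(
    r"^(?P<pid>[0-9a-fA-F]{8})\.(?P<tid>[0-9a-fA-F]{8})::"
    r"(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s+"
    r"(?P<level>\w+)\s+\[(?P<component>\w+)\]\s+(?P<message>.*)$"
)


def parse_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    entry = m.groupdict()
    # Process and thread IDs are hexadecimal in the log prefix.
    entry["pid"] = int(entry["pid"], 16)
    entry["tid"] = int(entry["tid"], 16)
    entry["ts"] = datetime.strptime(entry["ts"], "%Y/%m/%d-%H:%M:%S.%f")
    entry["level"] = entry["level"].lower()
    return entry


def summarize(lines):
    """Count entries per level and collect error lines that mention a deadlock."""
    entries = [e for e in (parse_line(l) for l in lines) if e is not None]
    by_level = Counter(e["level"] for e in entries)
    deadlocks = [
        e for e in entries
        if e["level"] == "err" and "deadlock" in e["message"].lower()
    ]
    return by_level, deadlocks


# Three lines copied from the log above.
sample = [
    "0000553c.00004500::2012/05/22-09:03:27.012 err   [rhs] resource cluster ip address handling deadlock. cleaning current operation , terminating rhs process.",
    "00002e84.00004c14::2012/05/22-09:08:32.010 err   [rhs] send wer report.",
    "00001b1c.00003578::2012/05/22-09:08:32.025 info  [rcm] created monitor process 21492 / 0x53f4",
]

levels, deadlocks = summarize(sample)
print(dict(levels))
for e in deadlocks:
    print(e["ts"], "pid", e["pid"], e["message"])
```

Converting the prefix to decimal also lets you match it against the PIDs in the messages: `0x2e84` is 11908, the monitor process that the `recoverprocess` line says is being recovered.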

System event log:

Cluster resource 'Cluster Name' (resource type '', DLL 'clusres.dll') either crashed or deadlocked. The Resource Hosting Subsystem (RHS) process will now attempt to terminate, and the resource will be marked to run in a separate monitor.

I need to fix this issue.


Regards, Sridharan. M

Maybe another host on the network is using the same IP address as the cluster? Try changing the IP address in the cluster's properties, then bring the resource online.
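
One rough way to test the duplicate-address theory is to ping the cluster IP while the cluster IP resource is offline: if something still answers, another host probably holds that address. The Python sketch below assumes it runs on a Windows machine (it uses the Windows `ping` switches `-n` and `-w`); the helper names and the address `192.0.2.10` are placeholders, not values from this cluster.

```python
import subprocess


def ping_command(address):
    """Build a Windows ping command: one echo request, 1000 ms timeout."""
    return ["ping", "-n", "1", "-w", "1000", address]


def address_responds(address, run=subprocess.run):
    """Return True if the address appears to answer a single ping.

    `run` is injectable so the check can be exercised without a network.
    """
    result = run(ping_command(address), capture_output=True, text=True)
    # Windows ping may exit 0 on "Destination host unreachable",
    # so also require a TTL field, which only real echo replies contain.
    return result.returncode == 0 and "ttl=" in result.stdout.lower()


if __name__ == "__main__":
    cluster_ip = "192.0.2.10"  # placeholder: substitute the real cluster IP
    if address_responds(cluster_ip):
        print(cluster_ip, "answers while the resource is offline: possible conflict")
    else:
        print(cluster_ip, "did not answer")
```

A silent address does not prove there is no conflict, since the other host may block ICMP, so treat this only as a first check.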

