cluster Fail when network switch fail

cluster Fail when network switch fail

Post by c2FtaX » Thu, 12 Oct 2006 19:18:01


hi,
I have a 2 node cluster Windows 2003 server EE.
all is OK but when we reboot the network switch, the cluster fails and i
have to reboot all the nodes to restart the cluster.
Is there a way to let the cluster in a waiting state if the public NIC are
off and it will be on when the NIC are ON.
I heared about disabling the media sensing but I am not sure that will
resolve my problem.
 
 
 

cluster Fail when network switch fail

Post by Edwin vMie » Thu, 12 Oct 2006 19:39:44

Samir

1) disable Media sense for all network adapters on all nodes :
http://www.yqcomputer.com/
2) ensure all your NIC's has the latest drives
3) ensure your heartbeat network is using a cross-over cable (not affected
by your network-switch reboot)

and try again, cluster should stay up

HTH,
Edwin.

 
 
 

cluster Fail when network switch fail

Post by c2FtaX » Thu, 12 Oct 2006 20:35:03

thanks Edwin but I cannot find the DisableClusSvcMediaSense key.
 
 
 

cluster Fail when network switch fail

Post by c2FtaX » Thu, 12 Oct 2006 20:36:01

thanks Edwin but I cannot find the DisableClusSvcMediaSense key.
 
 
 

cluster Fail when network switch fail

Post by Edwin vMie » Thu, 12 Oct 2006 21:02:22

Samir,

As per KB article :

Add the following registry value:
Value Name: DisableDHCPMediaSense
Data Type: REG_DWORD -Boolean
Value Data Range: 0, 1 (False, True) Default: 0 (False)

(you need to add the value)

rgds,
Edwin.




affected


i
are
 
 
 

cluster Fail when network switch fail

Post by Chuck Timo » Thu, 12 Oct 2006 21:24:40

In Windows 2003 it is enabled by default and therefore will not show up in
the registry.

--
Chuck Timon, Jr.
Microsoft Corporation
Longhorn Readiness Team
This posting is provided "AS IS" with no
warranties, and confers no rights.
 
 
 

cluster Fail when network switch fail

Post by c2FtaX » Thu, 12 Oct 2006 21:26:02

Edwin,

the DisableDHCPMediaSense exist and set to 1, but I cannot find the
DisableClusSvcMediaSense key on the HKEY_LOCAL_MACHINE\Cluster\Parameters.
do I have to create it ?
 
 
 

cluster Fail when network switch fail

Post by Edwin vMie » Thu, 12 Oct 2006 21:30:58

In W2K3 SP1 you do not have to create it










and
NIC
will
 
 
 

cluster Fail when network switch fail

Post by c2FtaX » Thu, 12 Oct 2006 21:58:01

thanks Chuck

is there another solution?
 
 
 

cluster Fail when network switch fail

Post by c2FtaX » Thu, 12 Oct 2006 22:41:01

Edwin,

I confirm that the Media Sensing is disabled, I just made a test with one
node, if I unplug the Wire, the NIC stay on.
Is there another solution ?
 
 
 

cluster Fail when network switch fail

Post by c2FtaX » Thu, 12 Oct 2006 22:43:02

Edwin,

I confirm that the Media Sensing is disabled, I just made a test with one
node, if I unplug the Wire, the NIC stay on.
Is there another solution ?
 
 
 

cluster Fail when network switch fail

Post by Edwin vMie » Thu, 12 Oct 2006 22:47:26

id you ensure you are on the latest drivers ?
did you ensure your heartbeat is a cross over cable ?




"samir" < XXXX@XXXXX.COM > wrote in message
news: XXXX@XXXXX.COM ...
HKEY_LOCAL_MACHINE\Cluster\Parameters.
(not
fails
public
that


 
 
 

cluster Fail when network switch fail

Post by Chuck Timo » Thu, 12 Oct 2006 23:43:19

f the private NICs are supported using a crossover cable or even a separate
dumb hub, they will be fine...however, if the switch losses power, the IP
addresses will go offline and this will cause a 'regroup' and an attempt to
restart the resources on the same node...failing that, a failover will occur
and online attempts will be made....several 'ping-pong' attempts later, the
groups will eventually fail because of loss of the public networks. That is
the way cluster works. If you like you can administratively take the groups
offline, do your work on the switch and then bring the groups back online.
Disabling Media sense, in effect by default, only prevents the TCP/IP stack
from becoming 'unbound' from the NIC. It does nothing else for you in this
scenario.


--
Chuck Timon, Jr.
Microsoft Corporation
Longhorn Readiness Team
This posting is provided "AS IS" with no
warranties, and confers no rights.

"samir" < XXXX@XXXXX.COM > wrote in message
news: XXXX@XXXXX.COM ...


 
 
 

cluster Fail when network switch fail

Post by c2FtaX » Fri, 13 Oct 2006 00:04:02

he cluster is an HP MSA500 packaged cluster and the heartbeat is a cross
over cable.
the reboot of the switch was not programmed, it was a power failure and it
happens 2 times this year.
on this cluster we have fileshares ressources and a DHCP ressource and the
admins are not on site.

"Chuck Timon [Microsoft]" wrote:

 
 
 

cluster Fail when network switch fail

Post by Chuck Timo » Fri, 13 Oct 2006 08:43:52

e can only do so much....cannot save you from that - sorry.

--
Chuck Timon, Jr.
Microsoft Corporation
Longhorn Readiness Team
This posting is provided "AS IS" with no
warranties, and confers no rights.

"samir" < XXXX@XXXXX.COM > wrote in message
news: XXXX@XXXXX.COM ...