|
View:
New views
14 Messages
—
Rating Filter:
Alert me
|
|
|
VIF weirdnessI am having a great
deal of difficulty getting my filer’s 10g interfaces connected to a pair
of 3750E switches I am trying to use for my storage network’s backbone. I
can inconsistently ping other objects on the network (there really is nothing
there yet, we are just getting started, but I do have a host and an RLM
card just for troubleshooting purposes), but they cannot ping each other.
From a remote host (172.1.2.5) I can ping everything but array1. If I
bring down either of the two stacked switches (which brings down one port of
each VIF member pairs) everything works. array01> ping 172.1.0.1 172.1.0.1 is alive array01> ping 172.1.0.2 array01> ping 172.1.0.4 ping: wrote 172.1.0.4 64 chars,
error=Host is down ping: wrote 172.1.0.4 64 chars,
error=Host is down array01> Thu May 15 13:28:33 EDT
[gvr-array01: nis_worker_0:info]: Local NIS group update successful. array01> ping 172.1.2.5 172.1.2.5 is alive array02> ping
172.1.0.1 array02> ping
172.1.0.4 172.1.0.4 is alive array02> ping
172.1.0.4 172.1.0.4 is alive array02> ping
172.1.0.250 172.1.0.250 is alive array02> ping
172.1.2.5 no answer from
172.1.2.5 array01> ifconfig -a e0a:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full
trunked lan0 e0b:
flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 00:a0:98:08:22:b6 (auto-unknown-cfg_down) flowcontrol full e0c: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM>
mtu 1500
ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full
trunked lan0 e0d:
flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 00:a0:98:08:22:b4 (auto-unknown-cfg_down) flowcontrol full e2a:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full
trunked tgif e2b: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM>
mtu 1500
ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full
trunked tgif lo:
flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160
inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1
ether 00:00:00:00:00:00 (VIA Provider) lan0:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
inet 10.28.17.213 netmask 0xffffff00 broadcast 10.28.17.255
partner 10.28.17.214 (not in use)
ether 02:a0:98:08:22:b7 (Enabled virtual interface) tgif:
flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> mtu 1500
inet 172.1.0.1 netmask 0xffff0000 broadcast 172.1.255.255
partner 172.1.0.2 (not in use)
ether 02:a0:98:08:22:b6 (Enabled virtual interface)
nfo enabled array02>ifconfig –a e0a:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full
trunked lan0 e0b:
flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 00:a0:98:0c:1a:36 (auto-unknown-cfg_down) flowcontrol full e0c:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full
trunked lan0 e0d:
flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 00:a0:98:0c:1a:34 (auto-unknown-cfg_down) flowcontrol full e2a:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full
trunked tgif e2b:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full
trunked tgif lo:
flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160
inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1
ether 00:00:00:00:00:00 (VIA Provider) lan0:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
inet 10.28.17.214 netmask 0xffffff00 broadcast 10.28.17.255
partner 10.28.17.213 (not in use)
ether 02:a0:98:0c:1a:37 (Enabled virtual interface) tgif:
flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> mtu 1500
inet 172.1.0.2 netmask 0xffff0000 broadcast 172.1.255.255
partner 172.1.0.1 (not in use)
ether 02:a0:98:0c:1a:36 (Enabled virtual interface)
nfo enabled
This message (including any attachments) contains confidential |
|
|
|
|
|
Re: VIF weirdnessHow about a "sysconfig -v 2" from the filers
and a "show running-config " that shows at a minimum: the port config of the 10-gig stuff and the port-channel definitions. --tmac On Thu, May 15, 2008 at 2:19 PM, Page, Jeremy <jeremy.page@...> wrote: > They are 10 gig cards, they are only in a VIF with themselves (no other > cards). It's a 3070 with 2 copper 100/1000 2 FC gig and 2 FC 10g > connections. The 2 1000 are on our production network as lan0 VIF for > CIFS/NFS, the fibre gig is not used and the two 10g FC are VIF tgif on the > storage only network (no routing etc, private for storage devices and the > hosts using them only). > > > > They are plugged into stacked 3750e boxes, I tried putting them on the same > switch just to make sure it was not a stacking thing with no real change. > > > > The oddness is that occasionally a single packet will get through, and if I > bring the interfaces up and down it may change who can get to what. > > > > ________________________________ > > From: Webster, Stetson [mailto:Stetson.Webster@...] > Sent: Thursday, May 15, 2008 2:13 PM > To: Page, Jeremy; toasters@... > Subject: RE: VIF weirdness > > > > Are these 10g TOE cards? Are they in a VIF with other 'non-TOE' cards? > > > > Stetson M. Webster > Onsite Professional Services Engineer > PS - North Amer. - East > > NetApp > 919.250.0052 Mobile > Stetson.Webster@... > www.netapp.com > > > > > > ________________________________ > > From: Page, Jeremy [mailto:jeremy.page@...] > Sent: Thursday, May 15, 2008 1:32 PM > To: toasters@... > Subject: VIF weirdness > > I am having a great deal of difficulty getting my filer's 10g interfaces > connected to a pair of 3750E switches I am trying to use for my storage > network's backbone. I can inconsistently ping other objects on the network > (there really is nothing there yet, we are just getting started, but I do > have a host and an RLM card just for troubleshooting purposes), but they > cannot ping each other. >From a remote host (172.1.2.5) I can ping > everything but array1. If I bring down either of the two stacked switches > (which brings down one port of each VIF member pairs) everything works. > > > > array01> ping 172.1.0.1 > > 172.1.0.1 is alive > > array01> ping 172.1.0.2 > > array01> ping 172.1.0.4 > > ping: wrote 172.1.0.4 64 chars, error=Host is down > > ping: wrote 172.1.0.4 64 chars, error=Host is down > > array01> Thu May 15 13:28:33 EDT [gvr-array01: nis_worker_0:info]: Local NIS > group update successful. > > array01> ping 172.1.2.5 > > 172.1.2.5 is alive > > > > array02> ping 172.1.0.1 > > array02> ping 172.1.0.4 > > 172.1.0.4 is alive > > array02> ping 172.1.0.4 > > 172.1.0.4 is alive > > array02> ping 172.1.0.250 > > 172.1.0.250 is alive > > array02> ping 172.1.2.5 > > no answer from 172.1.2.5 > > > > > > array01> ifconfig -a > > e0a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full > > trunked lan0 > > e0b: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 00:a0:98:08:22:b6 (auto-unknown-cfg_down) flowcontrol full > > e0c: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full > > trunked lan0 > > e0d: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 00:a0:98:08:22:b4 (auto-unknown-cfg_down) flowcontrol full > > e2a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full > > trunked tgif > > e2b: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full > > trunked tgif > > lo: flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160 > > inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1 > > ether 00:00:00:00:00:00 (VIA Provider) > > lan0: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > inet 10.28.17.213 netmask 0xffffff00 broadcast 10.28.17.255 > > partner 10.28.17.214 (not in use) > > ether 02:a0:98:08:22:b7 (Enabled virtual interface) > > tgif: flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> mtu 1500 > > inet 172.1.0.1 netmask 0xffff0000 broadcast 172.1.255.255 > > partner 172.1.0.2 (not in use) > > ether 02:a0:98:08:22:b6 (Enabled virtual interface) > > nfo enabled > > array02>ifconfig –a > > e0a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full > > trunked lan0 > > e0b: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 00:a0:98:0c:1a:36 (auto-unknown-cfg_down) flowcontrol full > > e0c: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full > > trunked lan0 > > e0d: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 00:a0:98:0c:1a:34 (auto-unknown-cfg_down) flowcontrol full > > e2a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full > > trunked tgif > > e2b: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full > > trunked tgif > > lo: flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160 > > inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1 > > ether 00:00:00:00:00:00 (VIA Provider) > > lan0: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > inet 10.28.17.214 netmask 0xffffff00 broadcast 10.28.17.255 > > partner 10.28.17.213 (not in use) > > ether 02:a0:98:0c:1a:37 (Enabled virtual interface) > > tgif: flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> mtu 1500 > > inet 172.1.0.2 netmask 0xffff0000 broadcast 172.1.255.255 > > partner 172.1.0.1 (not in use) > > ether 02:a0:98:0c:1a:36 (Enabled virtual interface) > > nfo enabled > > This message (including any attachments) contains confidential > and/or proprietary information intended only for the addressee. > Any unauthorized disclosure, copying, distribution or reliance on > the contents of this information is strictly prohibited and may > constitute a violation of law. If you are not the intended > recipient, please notify the sender immediately by responding to > this e-mail, and delete the message from your system. If you > have any questions about this e-mail please notify the sender > immediately. > > This message (including any attachments) contains confidential > and/or proprietary information intended only for the addressee. > Any unauthorized disclosure, copying, distribution or reliance on > the contents of this information is strictly prohibited and may > constitute a violation of law. If you are not the intended > recipient, please notify the sender immediately by responding to > this e-mail, and delete the message from your system. If you > have any questions about this e-mail please notify the sender > immediately. > -- --tmac RedHat Certified Engineer #804006984323821 (RHEL4) RedHat Certified Engineer #805007643429572 (RHEL5) Principal Consultant |
|
|
RE: VIF weirdnessI *think* the problem may be isolated to array1. I've tried setting them
to single mode instead of multi (LACP is not supposed to work on a stacked switch). array01> sysconfig -v 2 slot 2: Dual TOE-10G Ethernet Controller (T320E-XFP) Device Type: CT-31-1 Version Number: T3-SRAM1.1.0-BR1040-20-C0-FW4.6.0-DR03 Serial Number: PT4107009 e2a MAC Address: 00:07:43:05:16:98 (auto-10g_sr-fd-up) e2b MAC Address: 00:07:43:05:16:99 (auto-10g_sr-fd-up) **switch config (short form)** version 12.2 no aaa new-model switch 1 provision ws-c3750e-24td switch 2 provision ws-c3750e-24td interface TenGigabitEthernet1/0/1 switchport mode access channel-group 1 mode on ! interface TenGigabitEthernet1/0/2 switchport mode access channel-group 2 mode on interface TenGigabitEthernet2/0/1 switchport mode access channel-group 1 mode on ! interface TenGigabitEthernet2/0/2 switchport mode access channel-group 2 mode on ! interface Vlan1 ip address 172.1.0.250 255.255.0.0 -----Original Message----- From: tmac [mailto:tmacmd@...] Sent: Thursday, May 15, 2008 3:46 PM To: Page, Jeremy Cc: toasters@... Subject: Re: VIF weirdness How about a "sysconfig -v 2" from the filers and a "show running-config " that shows at a minimum: the port config of the 10-gig stuff and the port-channel definitions. --tmac On Thu, May 15, 2008 at 2:19 PM, Page, Jeremy <jeremy.page@...> wrote: > They are 10 gig cards, they are only in a VIF with themselves (no other > cards). It's a 3070 with 2 copper 100/1000 2 FC gig and 2 FC 10g > connections. The 2 1000 are on our production network as lan0 VIF for > CIFS/NFS, the fibre gig is not used and the two 10g FC are VIF tgif on the > storage only network (no routing etc, private for storage devices and the > hosts using them only). > > > > They are plugged into stacked 3750e boxes, I tried putting them on the same > switch just to make sure it was not a stacking thing with no real change. > > > > The oddness is that occasionally a single packet will get through, and if I > bring the interfaces up and down it may change who can get to what. > > > > ________________________________ > > From: Webster, Stetson [mailto:Stetson.Webster@...] > Sent: Thursday, May 15, 2008 2:13 PM > To: Page, Jeremy; toasters@... > Subject: RE: VIF weirdness > > > > Are these 10g TOE cards? Are they in a VIF with other 'non-TOE' > > > > Stetson M. Webster > Onsite Professional Services Engineer > PS - North Amer. - East > > NetApp > 919.250.0052 Mobile > Stetson.Webster@... > www.netapp.com > > > > > > ________________________________ > > From: Page, Jeremy [mailto:jeremy.page@...] > Sent: Thursday, May 15, 2008 1:32 PM > To: toasters@... > Subject: VIF weirdness > > I am having a great deal of difficulty getting my filer's 10g > connected to a pair of 3750E switches I am trying to use for my storage > network's backbone. I can inconsistently ping other objects on the network > (there really is nothing there yet, we are just getting started, but I do > have a host and an RLM card just for troubleshooting purposes), but they > cannot ping each other. >From a remote host (172.1.2.5) I can ping > everything but array1. If I bring down either of the two stacked switches > (which brings down one port of each VIF member pairs) everything works. > > > > array01> ping 172.1.0.1 > > 172.1.0.1 is alive > > array01> ping 172.1.0.2 > > array01> ping 172.1.0.4 > > ping: wrote 172.1.0.4 64 chars, error=Host is down > > ping: wrote 172.1.0.4 64 chars, error=Host is down > > array01> Thu May 15 13:28:33 EDT [gvr-array01: nis_worker_0:info]: > group update successful. > > array01> ping 172.1.2.5 > > 172.1.2.5 is alive > > > > array02> ping 172.1.0.1 > > array02> ping 172.1.0.4 > > 172.1.0.4 is alive > > array02> ping 172.1.0.4 > > 172.1.0.4 is alive > > array02> ping 172.1.0.250 > > 172.1.0.250 is alive > > array02> ping 172.1.2.5 > > no answer from 172.1.2.5 > > > > > > array01> ifconfig -a > > e0a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full > > trunked lan0 > > e0b: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 00:a0:98:08:22:b6 (auto-unknown-cfg_down) flowcontrol > > e0c: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full > > trunked lan0 > > e0d: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 00:a0:98:08:22:b4 (auto-unknown-cfg_down) flowcontrol > > e2a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full > > trunked tgif > > e2b: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full > > trunked tgif > > lo: flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160 > > inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1 > > ether 00:00:00:00:00:00 (VIA Provider) > > lan0: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > inet 10.28.17.213 netmask 0xffffff00 broadcast 10.28.17.255 > > partner 10.28.17.214 (not in use) > > ether 02:a0:98:08:22:b7 (Enabled virtual interface) > > tgif: flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> > > inet 172.1.0.1 netmask 0xffff0000 broadcast 172.1.255.255 > > partner 172.1.0.2 (not in use) > > ether 02:a0:98:08:22:b6 (Enabled virtual interface) > > nfo enabled > > array02>ifconfig -a > > e0a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full > > trunked lan0 > > e0b: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 00:a0:98:0c:1a:36 (auto-unknown-cfg_down) flowcontrol > > e0c: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full > > trunked lan0 > > e0d: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 00:a0:98:0c:1a:34 (auto-unknown-cfg_down) flowcontrol > > e2a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full > > trunked tgif > > e2b: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full > > trunked tgif > > lo: flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160 > > inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1 > > ether 00:00:00:00:00:00 (VIA Provider) > > lan0: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 > > inet 10.28.17.214 netmask 0xffffff00 broadcast 10.28.17.255 > > partner 10.28.17.213 (not in use) > > ether 02:a0:98:0c:1a:37 (Enabled virtual interface) > > tgif: flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> > > inet 172.1.0.2 netmask 0xffff0000 broadcast 172.1.255.255 > > partner 172.1.0.1 (not in use) > > ether 02:a0:98:0c:1a:36 (Enabled virtual interface) > > nfo enabled > > This message (including any attachments) contains confidential > and/or proprietary information intended only for the addressee. > Any unauthorized disclosure, copying, distribution or reliance on > the contents of this information is strictly prohibited and may > constitute a violation of law. If you are not the intended > recipient, please notify the sender immediately by responding to > this e-mail, and delete the message from your system. If you > have any questions about this e-mail please notify the sender > immediately. > > This message (including any attachments) contains confidential > and/or proprietary information intended only for the addressee. > Any unauthorized disclosure, copying, distribution or reliance on > the contents of this information is strictly prohibited and may > constitute a violation of law. If you are not the intended > recipient, please notify the sender immediately by responding to > this e-mail, and delete the message from your system. If you > have any questions about this e-mail please notify the sender > immediately. > -- --tmac RedHat Certified Engineer #804006984323821 (RHEL4) RedHat Certified Engineer #805007643429572 (RHEL5) Principal Consultant This message (including any attachments) contains confidential and/or proprietary information intended only for the addressee. Any unauthorized disclosure, copying, distribution or reliance on the contents of this information is strictly prohibited and may constitute a violation of law. If you are not the intended recipient, please notify the sender immediately by responding to this e-mail, and delete the message from your system. If you have any questions about this e-mail please notify the sender immediately. |
|
|
Re: VIF weirdnessNotes aboutof 10GbE TOE NIC limitations : The 10GbE TOE NIC cards
have a number of limitations. They include: ◆ Multimode vif limited to two (2) 10GbE TOE NICs ◆ LACP not supported with 10GbE TOE NICs ◆ TOE functionality disabled on 10GbE NIC in vif Do you need a switchport access vlan ### in the interface lines? Here is the config I use for vifs on my switch: interface Port-Channel7 description netapp-vif switchport switchport access vlan 239 switchport mode access no ip address flowcontrol receive on flowcontrol send on spanning-tree portfast ! interface GigabitEthernet7/42 description filer-e9 switchport switchport access vlan 239 switchport mode access no ip address flowcontrol receive on flowcontrol send on spanning-tree portfast channel-group 7 mode active ! In your case, with 10-gig, the only change should be: channel-group 7 mode on (which you are already doing) Also, make sure your filer is setup correctly (not using LACP for 10Gig) vif create multi tgif -b mac e2a r3b On Thu, May 15, 2008 at 3:55 PM, Page, Jeremy <jeremy.page@...> wrote: > I *think* the problem may be isolated to array1. I've tried setting them > to single mode instead of multi (LACP is not supposed to work on a > stacked switch). > > > array01> sysconfig -v 2 > slot 2: Dual TOE-10G Ethernet Controller (T320E-XFP) > Device Type: CT-31-1 > Version Number: > T3-SRAM1.1.0-BR1040-20-C0-FW4.6.0-DR03 > Serial Number: PT4107009 > e2a MAC Address: 00:07:43:05:16:98 > (auto-10g_sr-fd-up) > e2b MAC Address: 00:07:43:05:16:99 > (auto-10g_sr-fd-up) > > > **switch config (short form)** > version 12.2 > no aaa new-model > switch 1 provision ws-c3750e-24td > switch 2 provision ws-c3750e-24td > interface TenGigabitEthernet1/0/1 > switchport mode access > channel-group 1 mode on > ! > interface TenGigabitEthernet1/0/2 > switchport mode access > channel-group 2 mode on > interface TenGigabitEthernet2/0/1 > switchport mode access > channel-group 1 mode on > ! > interface TenGigabitEthernet2/0/2 > switchport mode access > channel-group 2 mode on > ! > interface Vlan1 > ip address 172.1.0.250 255.255.0.0 > > -----Original Message----- > From: tmac [mailto:tmacmd@...] > Sent: Thursday, May 15, 2008 3:46 PM > To: Page, Jeremy > Cc: toasters@... > Subject: Re: VIF weirdness > > How about a "sysconfig -v 2" from the filers > and a "show running-config " that shows at a minimum: > the port config of the 10-gig stuff and the port-channel definitions. > > --tmac > > On Thu, May 15, 2008 at 2:19 PM, Page, Jeremy <jeremy.page@...> > wrote: >> They are 10 gig cards, they are only in a VIF with themselves (no > other >> cards). It's a 3070 with 2 copper 100/1000 2 FC gig and 2 FC 10g >> connections. The 2 1000 are on our production network as lan0 VIF for >> CIFS/NFS, the fibre gig is not used and the two 10g FC are VIF tgif on > the >> storage only network (no routing etc, private for storage devices and > the >> hosts using them only). >> >> >> >> They are plugged into stacked 3750e boxes, I tried putting them on the > same >> switch just to make sure it was not a stacking thing with no real > change. >> >> >> >> The oddness is that occasionally a single packet will get through, and > if I >> bring the interfaces up and down it may change who can get to what. >> >> >> >> ________________________________ >> >> From: Webster, Stetson [mailto:Stetson.Webster@...] >> Sent: Thursday, May 15, 2008 2:13 PM >> To: Page, Jeremy; toasters@... >> Subject: RE: VIF weirdness >> >> >> >> Are these 10g TOE cards? Are they in a VIF with other 'non-TOE' > cards? >> >> >> >> Stetson M. Webster >> Onsite Professional Services Engineer >> PS - North Amer. - East >> >> NetApp >> 919.250.0052 Mobile >> Stetson.Webster@... >> www.netapp.com >> >> >> >> >> >> ________________________________ >> >> From: Page, Jeremy [mailto:jeremy.page@...] >> Sent: Thursday, May 15, 2008 1:32 PM >> To: toasters@... >> Subject: VIF weirdness >> >> I am having a great deal of difficulty getting my filer's 10g > interfaces >> connected to a pair of 3750E switches I am trying to use for my > storage >> network's backbone. I can inconsistently ping other objects on the > network >> (there really is nothing there yet, we are just getting started, but > I do >> have a host and an RLM card just for troubleshooting purposes), but > they >> cannot ping each other. >From a remote host (172.1.2.5) I can ping >> everything but array1. If I bring down either of the two stacked > switches >> (which brings down one port of each VIF member pairs) everything > works. >> >> >> >> array01> ping 172.1.0.1 >> >> 172.1.0.1 is alive >> >> array01> ping 172.1.0.2 >> >> array01> ping 172.1.0.4 >> >> ping: wrote 172.1.0.4 64 chars, error=Host is down >> >> ping: wrote 172.1.0.4 64 chars, error=Host is down >> >> array01> Thu May 15 13:28:33 EDT [gvr-array01: nis_worker_0:info]: > Local NIS >> group update successful. >> >> array01> ping 172.1.2.5 >> >> 172.1.2.5 is alive >> >> >> >> array02> ping 172.1.0.1 >> >> array02> ping 172.1.0.4 >> >> 172.1.0.4 is alive >> >> array02> ping 172.1.0.4 >> >> 172.1.0.4 is alive >> >> array02> ping 172.1.0.250 >> >> 172.1.0.250 is alive >> >> array02> ping 172.1.2.5 >> >> no answer from 172.1.2.5 >> >> >> >> >> >> array01> ifconfig -a >> >> e0a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full >> >> trunked lan0 >> >> e0b: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 00:a0:98:08:22:b6 (auto-unknown-cfg_down) flowcontrol > full >> >> e0c: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full >> >> trunked lan0 >> >> e0d: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 00:a0:98:08:22:b4 (auto-unknown-cfg_down) flowcontrol > full >> >> e2a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full >> >> trunked tgif >> >> e2b: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full >> >> trunked tgif >> >> lo: flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160 >> >> inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1 >> >> ether 00:00:00:00:00:00 (VIA Provider) >> >> lan0: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> inet 10.28.17.213 netmask 0xffffff00 broadcast 10.28.17.255 >> >> partner 10.28.17.214 (not in use) >> >> ether 02:a0:98:08:22:b7 (Enabled virtual interface) >> >> tgif: flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> > mtu 1500 >> >> inet 172.1.0.1 netmask 0xffff0000 broadcast 172.1.255.255 >> >> partner 172.1.0.2 (not in use) >> >> ether 02:a0:98:08:22:b6 (Enabled virtual interface) >> >> nfo enabled >> >> array02>ifconfig -a >> >> e0a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full >> >> trunked lan0 >> >> e0b: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 00:a0:98:0c:1a:36 (auto-unknown-cfg_down) flowcontrol > full >> >> e0c: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full >> >> trunked lan0 >> >> e0d: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 00:a0:98:0c:1a:34 (auto-unknown-cfg_down) flowcontrol > full >> >> e2a: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full >> >> trunked tgif >> >> e2b: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full >> >> trunked tgif >> >> lo: flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160 >> >> inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1 >> >> ether 00:00:00:00:00:00 (VIA Provider) >> >> lan0: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500 >> >> inet 10.28.17.214 netmask 0xffffff00 broadcast 10.28.17.255 >> >> partner 10.28.17.213 (not in use) >> >> ether 02:a0:98:0c:1a:37 (Enabled virtual interface) >> >> tgif: flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> > mtu 1500 >> >> inet 172.1.0.2 netmask 0xffff0000 broadcast 172.1.255.255 >> >> partner 172.1.0.1 (not in use) >> >> ether 02:a0:98:0c:1a:36 (Enabled virtual interface) >> >> nfo enabled >> >> This message (including any attachments) contains confidential >> and/or proprietary information intended only for the addressee. >> Any unauthorized disclosure, copying, distribution or reliance on >> the contents of this information is strictly prohibited and may >> constitute a violation of law. If you are not the intended >> recipient, please notify the sender immediately by responding to >> this e-mail, and delete the message from your system. If you >> have any questions about this e-mail please notify the sender >> immediately. >> >> This message (including any attachments) contains confidential >> and/or proprietary information intended only for the addressee. >> Any unauthorized disclosure, copying, distribution or reliance on >> the contents of this information is strictly prohibited and may >> constitute a violation of law. If you are not the intended >> recipient, please notify the sender immediately by responding to >> this e-mail, and delete the message from your system. If you >> have any questions about this e-mail please notify the sender >> immediately. >> > > > > -- > --tmac > > RedHat Certified Engineer #804006984323821 (RHEL4) > RedHat Certified Engineer #805007643429572 (RHEL5) > > Principal Consultant > > > This message (including any attachments) contains confidential > and/or proprietary information intended only for the addressee. > Any unauthorized disclosure, copying, distribution or reliance on > the contents of this information is strictly prohibited and may > constitute a violation of law. If you are not the intended > recipient, please notify the sender immediately by responding to > this e-mail, and delete the message from your system. If you > have any questions about this e-mail please notify the sender > immediately. > -- --tmac RedHat Certified Engineer #804006984323821 (RHEL4) RedHat Certified Engineer #805007643429572 (RHEL5) Principal Consultant |
|
|
RE: VIF weirdnessOnly time I have seen stuff like that is
when it’s in multimode on the filer, and etherchannel is not properly
enabled on the switch. From:
owner-toasters@... [mailto:owner-toasters@...] On Behalf Of Page, Jeremy I am having a great
deal of difficulty getting my filer’s 10g interfaces connected to a pair
of 3750E switches I am trying to use for my storage network’s backbone. I
can inconsistently ping other objects on the network (there really is nothing
there yet, we are just getting started, but I do have a host and an RLM
card just for troubleshooting purposes), but they cannot ping each other.
From a remote host (172.1.2.5) I can ping everything but array1. If I
bring down either of the two stacked switches (which brings down one port of
each VIF member pairs) everything works. array01> ping 172.1.0.1 172.1.0.1 is alive array01> ping 172.1.0.2 array01> ping 172.1.0.4 ping: wrote 172.1.0.4 64 chars,
error=Host is down ping: wrote 172.1.0.4 64 chars,
error=Host is down array01> Thu May 15 13:28:33 EDT
[gvr-array01: nis_worker_0:info]: Local NIS group update successful. array01> ping 172.1.2.5 172.1.2.5 is alive array02> ping
172.1.0.1 array02> ping
172.1.0.4 172.1.0.4 is alive array02> ping
172.1.0.4 172.1.0.4 is alive array02> ping
172.1.0.250 172.1.0.250 is alive array02> ping
172.1.2.5 no answer from
172.1.2.5 array01> ifconfig -a e0a:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full
trunked lan0 e0b:
flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 00:a0:98:08:22:b6 (auto-unknown-cfg_down) flowcontrol full e0c:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:08:22:b7 (auto-1000t-fd-up) flowcontrol full
trunked lan0 e0d:
flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 00:a0:98:08:22:b4 (auto-unknown-cfg_down) flowcontrol full e2a:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full
trunked tgif e2b: flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM>
mtu 1500
ether 02:a0:98:08:22:b6 (auto-10g_sr-fd-up) flowcontrol full
trunked tgif lo:
flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160
inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1
ether 00:00:00:00:00:00 (VIA Provider) lan0:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
inet 10.28.17.213 netmask 0xffffff00 broadcast 10.28.17.255
partner 10.28.17.214 (not in use)
ether 02:a0:98:08:22:b7 (Enabled virtual interface) tgif:
flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> mtu 1500
inet 172.1.0.1 netmask 0xffff0000 broadcast 172.1.255.255
partner 172.1.0.2 (not in use)
ether 02:a0:98:08:22:b6 (Enabled virtual interface)
nfo enabled array02>ifconfig
–a e0a:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full
trunked lan0 e0b: flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM>
mtu 1500
ether 00:a0:98:0c:1a:36 (auto-unknown-cfg_down) flowcontrol full e0c:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:0c:1a:37 (auto-1000t-fd-up) flowcontrol full
trunked lan0 e0d:
flags=108042<BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 00:a0:98:0c:1a:34 (auto-unknown-cfg_down) flowcontrol full e2a:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full
trunked tgif e2b:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
ether 02:a0:98:0c:1a:36 (auto-10g_sr-fd-up) flowcontrol full
trunked tgif lo:
flags=1948049<UP,LOOPBACK,RUNNING,MULTICAST,TCPCKSUM> mtu 8160
inet 127.0.0.1 netmask 0xff000000 broadcast 127.0.0.1
ether 00:00:00:00:00:00 (VIA Provider) lan0:
flags=948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM> mtu 1500
inet 10.28.17.214 netmask 0xffffff00 broadcast 10.28.17.255
partner 10.28.17.213 (not in use)
ether 02:a0:98:0c:1a:37 (Enabled virtual interface) tgif:
flags=4948043<UP,BROADCAST,RUNNING,MULTICAST,TCPCKSUM,NOWINS> mtu 1500
inet 172.1.0.2 netmask 0xffff0000 broadcast 172.1.255.255
partner 172.1.0.1 (not in use)
ether 02:a0:98:0c:1a:36 (Enabled virtual interface)
nfo enabled
This message (including any attachments) contains confidential |
|
|
MSCS and snapmirror questionI am migrating from an aggregate to another aggregate in the same filer (6030). The LUNS are attached to print clusters, 2 node and 3 node clusters (WIndows 2003 sp2 x86 etc).
I have snapmirrored the volumes. If I stop the cluster service on the windows boxes, do a snapmirror update, the disconnected the quorum and other resource disks, do a snapmirror update, release the snapmirror, remove and replace the disks with snapdrive with the same disk letters. fire up the cluster service.. Will that work? has anyone done that before? Any gotchas? Any help is greatly appreciate! |
|
|
R: MSCS and snapmirror questionYou will not have issues. And you don't need to update your mirror.
If you simply power off you clusters no i/o activity will be performer on LUNs. Then you can simply copy (vol copy) the volume(s) containing your LUNs, rename the new volumes as the old (in case of iSCSI LUNs and if you don't want to recreate the connections) and restart you clusters. In case your nodes (but it will not happen) will loose the LUNs with Snapdrive you can safely reconnect them starting with the quorum one, on first node then restarting the cluster service, then on the second node and so on with all other cluster shared LUNs Regards -----Messaggio originale----- Da: owner-toasters@... [mailto:owner-toasters@...] Per conto di Klise, Steve Inviato: venerdì 16 maggio 2008 15.50 A: toasters@... Oggetto: MSCS and snapmirror question I am migrating from an aggregate to another aggregate in the same filer (6030). The LUNS are attached to print clusters, 2 node and 3 node clusters (WIndows 2003 sp2 x86 etc). I have snapmirrored the volumes. If I stop the cluster service on the windows boxes, do a snapmirror update, the disconnected the quorum and other resource disks, do a snapmirror update, release the snapmirror, remove and replace the disks with snapdrive with the same disk letters. fire up the cluster service.. Will that work? has anyone done that before? Any gotchas? Any help is greatly appreciate! |
|
|
|
|
|
|
|
|
RE: R: MSCS and snapmirror questionSteve,
As long as you keep the resources offline and bring them back up with the same drive letter, you are generally OK. It's when you try changing drive letters and assume setting the disk dependency will work is how you run into trouble. I like doing the procedure you described but we avoid snapmirring the Quorum (it will work, but over here we just don't like to offline the cluster resource) With the Quorum / MSDTC LUN, it is pretty painless to do a non-intrusive migration of the data - we usually end up doing it twice because we want to leave Q: as the quorum drive letter: Add temp quorum as W: Open MSDTC component services panel, offline msdtc Change quorum drive from q: to w: (right click on cluster, go to quorum tab, choose W: drive) Change msdtc log path to w:\ in MSDTC component services control panel Online msdtc, test failover. If failover is successful, disconnect legacy quorum Add new final quorum as Q: Repeat process to move to Q:\ Test cluster failover Patrick, In regards to the LUN serials changing, I've never seen this to be an issue with MSCS, but I've heard it can be a big gotcha when snapmirroring VMWare LUNs. Opinions? - Hadrian -----Original Message----- From: owner-toasters@... [mailto:owner-toasters@...] On Behalf Of Klise, Steve Sent: Friday, May 16, 2008 10:50 AM To: G.Milazzo@...; pvh@...; toasters@... Subject: Re: R: MSCS and snapmirror question Correct. Same filer. Thanks ----- Original Message ----- From: Milazzo Giacomo <G.Milazzo@...> To: Patrick van Helden <pvh@...>; Klise, Steve; toasters@... <toasters@...> Sent: Fri May 16 10:29:55 2008 Subject: R: MSCS and snapmirror question Of course but that guy said he's migrating on the same filer to another aggregate. ________________________________ Da: Patrick van Helden [mailto:pvh@...] Inviato: ven 16/05/2008 19.12 A: Milazzo Giacomo; Klise, Steve; toasters@... Oggetto: RE: MSCS and snapmirror question Guys, Please be aware that MSCS will keep track of the lun serials. So you would probably need to set the lun serial numbers back to the original ones when your snapmirroring to another filer. Regards, Patrick van Helden Databasement BV pvh@... -----Oorspronkelijk bericht----- Van: owner-toasters@... namens Milazzo Giacomo Verzonden: vr 5/16/2008 18:01 Aan: Klise, Steve; toasters@... Onderwerp: R: MSCS and snapmirror question You will not have issues. And you don't need to update your mirror. If you simply power off you clusters no i/o activity will be performer on LUNs. Then you can simply copy (vol copy) the volume(s) containing your LUNs, rename the new volumes as the old (in case of iSCSI LUNs and if you don't want to recreate the connections) and restart you clusters. In case your nodes (but it will not happen) will loose the LUNs with Snapdrive you can safely reconnect them starting with the quorum one, on first node then restarting the cluster service, then on the second node and so on with all other cluster shared LUNs Regards -----Messaggio originale----- Da: owner-toasters@... [mailto:owner-toasters@...] Per conto di Klise, Steve Inviato: venerdì 16 maggio 2008 15.50 A: toasters@... Oggetto: MSCS and snapmirror question I am migrating from an aggregate to another aggregate in the same filer (6030). The LUNS are attached to print clusters, 2 node and 3 node clusters (WIndows 2003 sp2 x86 etc). I have snapmirrored the volumes. If I stop the cluster service on the windows boxes, do a snapmirror update, the disconnected the quorum and other resource disks, do a snapmirror update, release the snapmirror, remove and replace the disks with snapdrive with the same disk letters. fire up the cluster service.. Will that work? has anyone done that before? Any gotchas? Any help is greatly appreciate! |
|
|
RE: R: MSCS and snapmirror questionThis worked like a charm over the weekend!
Interesting note. I had a 3 node cluster, and when I would try and change the disk letter on a node that did not own the resource, it failed. Note to self. On a multi-node cluster, have all the resources from the machine you are changing the drive letters. Turns out you can change the quorum resource pretty easy. You right click on the cluster, properties, cluster tab, change it from drive letter to drive letter, done. I just deleted and recreated the msdtc resource. Done Another question: I have to migrate a bunch of sql clusters from a 940/960's to 3020's 6030's. We use Snap Drive for SQL. Should I just use SMS to migrate? One database is pretty big; Its like 200GB, and I prefer the least amount of downtime if possible. Yes, I would imagine I would have to set the LUN serial number, not sure how to do that. Thanks everyone. -----Original Message----- From: Hadrian Baron [mailto:Hadrian.Baron@...] Sent: Monday, May 19, 2008 8:58 AM To: Klise, Steve; G.Milazzo@...; pvh@...; toasters@... Subject: RE: R: MSCS and snapmirror question Steve, As long as you keep the resources offline and bring them back up with the same drive letter, you are generally OK. It's when you try changing drive letters and assume setting the disk dependency will work is how you run into trouble. I like doing the procedure you described but we avoid snapmirring the Quorum (it will work, but over here we just don't like to offline the cluster resource) With the Quorum / MSDTC LUN, it is pretty painless to do a non-intrusive migration of the data - we usually end up doing it twice because we want to leave Q: as the quorum drive letter: Add temp quorum as W: Open MSDTC component services panel, offline msdtc Change quorum drive from q: to w: (right click on cluster, go to quorum tab, choose W: drive) Change msdtc log path to w:\ in MSDTC component services control panel Online msdtc, test failover. If failover is successful, disconnect legacy quorum Add new final quorum as Q: Repeat process to move to Q:\ Test cluster failover Patrick, In regards to the LUN serials changing, I've never seen this to be an issue with MSCS, but I've heard it can be a big gotcha when snapmirroring VMWare LUNs. Opinions? - Hadrian -----Original Message----- From: owner-toasters@... [mailto:owner-toasters@...] On Behalf Of Klise, Steve Sent: Friday, May 16, 2008 10:50 AM To: G.Milazzo@...; pvh@...; toasters@... Subject: Re: R: MSCS and snapmirror question Correct. Same filer. Thanks ----- Original Message ----- From: Milazzo Giacomo <G.Milazzo@...> To: Patrick van Helden <pvh@...>; Klise, Steve; toasters@... <toasters@...> Sent: Fri May 16 10:29:55 2008 Subject: R: MSCS and snapmirror question Of course but that guy said he's migrating on the same filer to another aggregate. ________________________________ Da: Patrick van Helden [mailto:pvh@...] Inviato: ven 16/05/2008 19.12 A: Milazzo Giacomo; Klise, Steve; toasters@... Oggetto: RE: MSCS and snapmirror question Guys, Please be aware that MSCS will keep track of the lun serials. So you would probably need to set the lun serial numbers back to the original ones when your snapmirroring to another filer. Regards, Patrick van Helden Databasement BV pvh@... -----Oorspronkelijk bericht----- Van: owner-toasters@... namens Milazzo Giacomo Verzonden: vr 5/16/2008 18:01 Aan: Klise, Steve; toasters@... Onderwerp: R: MSCS and snapmirror question You will not have issues. And you don't need to update your mirror. If you simply power off you clusters no i/o activity will be performer on LUNs. Then you can simply copy (vol copy) the volume(s) containing your LUNs, rename the new volumes as the old (in case of iSCSI LUNs and if you don't want to recreate the connections) and restart you clusters. In case your nodes (but it will not happen) will loose the LUNs with Snapdrive you can safely reconnect them starting with the quorum one, on first node then restarting the cluster service, then on the second node and so on with all other cluster shared LUNs Regards -----Messaggio originale----- Da: owner-toasters@... [mailto:owner-toasters@...] Per conto di Klise, Steve Inviato: venerdì 16 maggio 2008 15.50 A: toasters@... Oggetto: MSCS and snapmirror question I am migrating from an aggregate to another aggregate in the same filer (6030). The LUNS are attached to print clusters, 2 node and 3 node clusters (WIndows 2003 sp2 x86 etc). I have snapmirrored the volumes. If I stop the cluster service on the windows boxes, do a snapmirror update, the disconnected the quorum and other resource disks, do a snapmirror update, release the snapmirror, remove and replace the disks with snapdrive with the same disk letters. fire up the cluster service.. Will that work? has anyone done that before? Any gotchas? Any help is greatly appreciate! |
|
|
RE: R: MSCS and snapmirror questionSteve,
SMSQL will take longer to migrate the data than Snapmirror. SMSQL is really useful when migrating the system databases, but if you use the same general approach as last time - Offline the SQL resources, remove disk dependancies, snapmirror update / quiesce / break, detach old luns, attach new luns with same drive letter, online SQL, you can be back up in minutes once you get the process down pat. SMSQL will do a manual copy of the data from one LUN to another. I don't believe SMSQL is Snapmirror-aware enough to make this faster than traditional Snapmirror. - Hadrian -----Original Message----- From: Klise, Steve [mailto:klises@...] Sent: Monday, May 19, 2008 9:26 AM To: Hadrian Baron; G.Milazzo@...; pvh@...; toasters@... Subject: RE: R: MSCS and snapmirror question This worked like a charm over the weekend! Interesting note. I had a 3 node cluster, and when I would try and change the disk letter on a node that did not own the resource, it failed. Note to self. On a multi-node cluster, have all the resources from the machine you are changing the drive letters. Turns out you can change the quorum resource pretty easy. You right click on the cluster, properties, cluster tab, change it from drive letter to drive letter, done. I just deleted and recreated the msdtc resource. Done Another question: I have to migrate a bunch of sql clusters from a 940/960's to 3020's 6030's. We use Snap Drive for SQL. Should I just use SMS to migrate? One database is pretty big; Its like 200GB, and I prefer the least amount of downtime if possible. Yes, I would imagine I would have to set the LUN serial number, not sure how to do that. Thanks everyone. -----Original Message----- From: Hadrian Baron [mailto:Hadrian.Baron@...] Sent: Monday, May 19, 2008 8:58 AM To: Klise, Steve; G.Milazzo@...; pvh@...; toasters@... Subject: RE: R: MSCS and snapmirror question Steve, As long as you keep the resources offline and bring them back up with the same drive letter, you are generally OK. It's when you try changing drive letters and assume setting the disk dependency will work is how you run into trouble. I like doing the procedure you described but we avoid snapmirring the Quorum (it will work, but over here we just don't like to offline the cluster resource) With the Quorum / MSDTC LUN, it is pretty painless to do a non-intrusive migration of the data - we usually end up doing it twice because we want to leave Q: as the quorum drive letter: Add temp quorum as W: Open MSDTC component services panel, offline msdtc Change quorum drive from q: to w: (right click on cluster, go to quorum tab, choose W: drive) Change msdtc log path to w:\ in MSDTC component services control panel Online msdtc, test failover. If failover is successful, disconnect legacy quorum Add new final quorum as Q: Repeat process to move to Q:\ Test cluster failover Patrick, In regards to the LUN serials changing, I've never seen this to be an issue with MSCS, but I've heard it can be a big gotcha when snapmirroring VMWare LUNs. Opinions? - Hadrian -----Original Message----- From: owner-toasters@... [mailto:owner-toasters@...] On Behalf Of Klise, Steve Sent: Friday, May 16, 2008 10:50 AM To: G.Milazzo@...; pvh@...; toasters@... Subject: Re: R: MSCS and snapmirror question Correct. Same filer. Thanks ----- Original Message ----- From: Milazzo Giacomo <G.Milazzo@...> To: Patrick van Helden <pvh@...>; Klise, Steve; toasters@... <toasters@...> Sent: Fri May 16 10:29:55 2008 Subject: R: MSCS and snapmirror question Of course but that guy said he's migrating on the same filer to another aggregate. ________________________________ Da: Patrick van Helden [mailto:pvh@...] Inviato: ven 16/05/2008 19.12 A: Milazzo Giacomo; Klise, Steve; toasters@... Oggetto: RE: MSCS and snapmirror question Guys, Please be aware that MSCS will keep track of the lun serials. So you would probably need to set the lun serial numbers back to the original ones when your snapmirroring to another filer. Regards, Patrick van Helden Databasement BV pvh@... -----Oorspronkelijk bericht----- Van: owner-toasters@... namens Milazzo Giacomo Verzonden: vr 5/16/2008 18:01 Aan: Klise, Steve; toasters@... Onderwerp: R: MSCS and snapmirror question You will not have issues. And you don't need to update your mirror. If you simply power off you clusters no i/o activity will be performer on LUNs. Then you can simply copy (vol copy) the volume(s) containing your LUNs, rename the new volumes as the old (in case of iSCSI LUNs and if you don't want to recreate the connections) and restart you clusters. In case your nodes (but it will not happen) will loose the LUNs with Snapdrive you can safely reconnect them starting with the quorum one, on first node then restarting the cluster service, then on the second node and so on with all other cluster shared LUNs Regards -----Messaggio originale----- Da: owner-toasters@... [mailto:owner-toasters@...] Per conto di Klise, Steve Inviato: venerdì 16 maggio 2008 15.50 A: toasters@... Oggetto: MSCS and snapmirror question I am migrating from an aggregate to another aggregate in the same filer (6030). The LUNS are attached to print clusters, 2 node and 3 node clusters (WIndows 2003 sp2 x86 etc). I have snapmirrored the volumes. If I stop the cluster service on the windows boxes, do a snapmirror update, the disconnected the quorum and other resource disks, do a snapmirror update, release the snapmirror, remove and replace the disks with snapdrive with the same disk letters. fire up the cluster service.. Will that work? has anyone done that before? Any gotchas? Any help is greatly appreciate! |
|
|
R: R: MSCS and snapmirror questionEheheh :-)
There's somebody needs to study better and deeper MSCS...That's the way it works as from design :-) Don't forget that also if Snapdrive make very easy to create from quorum to other disk resource MSCS is always Microsoft software :-) For which concerns SQL migration I completely agree with Hadrian's instructions. -----Messaggio originale----- Da: Klise, Steve [mailto:klises@...] Inviato: lunedì 19 maggio 2008 18.26 A: Hadrian Baron; Milazzo Giacomo; pvh@...; toasters@... Oggetto: RE: R: MSCS and snapmirror question This worked like a charm over the weekend! Interesting note. I had a 3 node cluster, and when I would try and change the disk letter on a node that did not own the resource, it failed. Note to self. On a multi-node cluster, have all the resources from the machine you are changing the drive letters. Turns out you can change the quorum resource pretty easy. You right click on the cluster, properties, cluster tab, change it from drive letter to drive letter, done. I just deleted and recreated the msdtc resource. Done Another question: I have to migrate a bunch of sql clusters from a 940/960's to 3020's 6030's. We use Snap Drive for SQL. Should I just use SMS to migrate? One database is pretty big; Its like 200GB, and I prefer the least amount of downtime if possible. Yes, I would imagine I would have to set the LUN serial number, not sure how to do that. Thanks everyone. -----Original Message----- From: Hadrian Baron [mailto:Hadrian.Baron@...] Sent: Monday, May 19, 2008 8:58 AM To: Klise, Steve; G.Milazzo@...; pvh@...; toasters@... Subject: RE: R: MSCS and snapmirror question Steve, As long as you keep the resources offline and bring them back up with the same drive letter, you are generally OK. It's when you try changing drive letters and assume setting the disk dependency will work is how you run into trouble. I like doing the procedure you described but we avoid snapmirring the Quorum (it will work, but over here we just don't like to offline the cluster resource) With the Quorum / MSDTC LUN, it is pretty painless to do a non-intrusive migration of the data - we usually end up doing it twice because we want to leave Q: as the quorum drive letter: Add temp quorum as W: Open MSDTC component services panel, offline msdtc Change quorum drive from q: to w: (right click on cluster, go to quorum tab, choose W: drive) Change msdtc log path to w:\ in MSDTC component services control panel Online msdtc, test failover. If failover is successful, disconnect legacy quorum Add new final quorum as Q: Repeat process to move to Q:\ Test cluster failover Patrick, In regards to the LUN serials changing, I've never seen this to be an issue with MSCS, but I've heard it can be a big gotcha when snapmirroring VMWare LUNs. Opinions? - Hadrian -----Original Message----- From: owner-toasters@... [mailto:owner-toasters@...] On Behalf Of Klise, Steve Sent: Friday, May 16, 2008 10:50 AM To: G.Milazzo@...; pvh@...; toasters@... Subject: Re: R: MSCS and snapmirror question Correct. Same filer. Thanks ----- Original Message ----- From: Milazzo Giacomo <G.Milazzo@...> To: Patrick van Helden <pvh@...>; Klise, Steve; toasters@... <toasters@...> Sent: Fri May 16 10:29:55 2008 Subject: R: MSCS and snapmirror question Of course but that guy said he's migrating on the same filer to another aggregate. ________________________________ Da: Patrick van Helden [mailto:pvh@...] Inviato: ven 16/05/2008 19.12 A: Milazzo Giacomo; Klise, Steve; toasters@... Oggetto: RE: MSCS and snapmirror question Guys, Please be aware that MSCS will keep track of the lun serials. So you would probably need to set the lun serial numbers back to the original ones when your snapmirroring to another filer. Regards, Patrick van Helden Databasement BV pvh@... -----Oorspronkelijk bericht----- Van: owner-toasters@... namens Milazzo Giacomo Verzonden: vr 5/16/2008 18:01 Aan: Klise, Steve; toasters@... Onderwerp: R: MSCS and snapmirror question You will not have issues. And you don't need to update your mirror. If you simply power off you clusters no i/o activity will be performer on LUNs. Then you can simply copy (vol copy) the volume(s) containing your LUNs, rename the new volumes as the old (in case of iSCSI LUNs and if you don't want to recreate the connections) and restart you clusters. In case your nodes (but it will not happen) will loose the LUNs with Snapdrive you can safely reconnect them starting with the quorum one, on first node then restarting the cluster service, then on the second node and so on with all other cluster shared LUNs Regards -----Messaggio originale----- Da: owner-toasters@... [mailto:owner-toasters@...] Per conto di Klise, Steve Inviato: venerdì 16 maggio 2008 15.50 A: toasters@... Oggetto: MSCS and snapmirror question I am migrating from an aggregate to another aggregate in the same filer (6030). The LUNS are attached to print clusters, 2 node and 3 node clusters (WIndows 2003 sp2 x86 etc). I have snapmirrored the volumes. If I stop the cluster service on the windows boxes, do a snapmirror update, the disconnected the quorum and other resource disks, do a snapmirror update, release the snapmirror, remove and replace the disks with snapdrive with the same disk letters. fire up the cluster service.. Will that work? has anyone done that before? Any gotchas? Any help is greatly appreciate! |
| Free embeddable forum powered by Nabble | Forum Help |