by mmistrettammistretta on Tue Mar 23, 2010 6:30 pm
Kevin,
Thanks for this response, it is a huge help.
So the bigger problem is this, we are using an F5 device, which in its simplest form, load balances traffic across server farms, and/or proxies connections to the servers. One function of the F5 is to use a virual IP address to access a server behind it.
The topology consists of 2 F5 devices, 2 6500 switches, fully meshed
The F5 devices work similarly to HSRP, using a virtual MAC and IP
They support 802.1q trunks. 2 VLANs are defined, 146 in front, and 111 behind the F5.
In the switches, there is an interface VLAN 146 to route traffic to/from the F5 to the devices in front of it.
VLAN 111 is used for the server farm behind it and is switched through the 6500's, no routing occurs. Only the F5 routes the traffic.
The F5 use STP passthrough, essentially allowing the BPDUs to pass through the interfaces, allowing the BPDU from switch1 to reach switch2.
Now onto the problem:
TCP traffic to the servers seems to work fine, although there have been some complaints of traffic problem.
ICMP fails 75% of the time to devices on the 111 network. Devices that ping, for monitoring, the 111 network, are sitting on the .5 network. We have confirmed routing is not an isssue. Any other device on the .5 network pings just fine to the 111.
The devices sending the echo requests are Nagios monitoring devices running on linux.
We have confirmed the request makes it to the host server, the reply is sent from the server, back through the F5, into the distribution switch1, and then just stops, seemingly discarded.
The behavior that we have seen is the request going to the F5 from the switch, is using source MAC of the interface, which you have confirmed it should be, but the return for some reason is using a different MAC, even though the ARP tables of the F5 have the correct MAC, so why is the MAC changing???
Additionally, the interface MAC of the active HSRP switch ends in c800, the standy is 8400. The HSRP MAC is ac05
So what we are seeing is the F5 sending echo replies to 840a, not a typo. Where is this coming from. All traffic to the F5 is using source of 8400.
Oddly, 8400 is the MAC of switch1, but the traffic is coming from switch2, why is switch2 using the MAC of switch1? Is this an HSRP thing? switch2 is the active.
when replies do succeed, they go to 8400, and interestingly, the seconf F5 still send to 840a, however, the responses go to switch1 which is STP blocking, so they go nowhere.
So the questions remaining are:
Where is the 840a coming from?
And why are the packets using the source MAC of a different switch thats not the active?
Thanks,
Mike