Huawei S9700 SRU CSS Feature


Contents

Chapter 1 CSS Overview

Chapter 2 CSS Setup

Chapter 3 CSS Division and Combination

Chapter 4 CSS Packet Forwarding

Chapter 5 CSS Troubleshooting

Chapter 6 CSS Application Scenarios

Why CSS

Increases device switching capacity and extends the number of device ports.

Simplifies configuration and management: multiple physical switches form one logical switch.

Improves system availability through redundancy and backup across multiple switches.

Two switches = one logical switch featuring higher reliability and higher bandwidth.

A Cluster Switch System (CSS) is also called a clustering system.

In a CSS, multiple switches are connected using dedicated clustering cables to form a logical switch.

Huawei CSS is comparable to:

Cisco Virtual Switching System (VSS)

H3C Intelligent Resilient Framework (IRF)

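[Figure: maintenance personnel (operator interface) and the NMS (NMS interface) manage the CSS as one device. The Active and Standby switches are connected through stacking interfaces bound into a trunk, with links labeled 8*10GE.]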

CSS Hardware Features


Supported products: S9706 and S9712. The S9703 is not supported.

Two S9706s, two S9712s, or one S9706 and one S9712 can form a CSS. A CSS contains exactly two switches.

Bandwidth of a CSS: 320 Gbit/s, provided by 16 clustering cables at 10 Gbit/s per cable (16 x 10 Gbit/s is 160 Gbit/s per direction, so the 320 Gbit/s figure presumably counts both directions).

Supported SRU boards: SRUC

Note: The SRUC will support CSS in V2R3 (GA in 2013 Q3).

S9700 CSS Hardware

Clustering card (VS08)

The VS08 is not hot swappable.

Only the Link indicator works properly. The CSS ID can be set to only 1 or 2.

Clustering cable

SFP+ high-speed cable

SRU on S9700

SRUC

CSS Software Features

CSS enable state

By default, CSS is not enabled. Commands must be used to enable CSS.

CSS status (within one CSS)

Chassis: master chassis, slave chassis, or single-chassis CSS

SRU: master SRU (one board), standby SRU (one board), and slave SRU (two boards)

CSS ID

The CSS ID determines the chassis ID in each interface name and currently can be set to only 1 or 2.

Setting the device priority in the CSS

When chassis compete with one another or are combined, the chassis with the highest priority is selected as the master switch.

Specifying the master switch

Once the CSS is stable, the specified switch is always selected as the master switch.

Major Differences: Before and After Enabling CSS on Devices

Interface name

Before CSS is enabled, interface names have three dimensions, for example GigabitEthernet 1/0/0.

After CSS is enabled, interface names have four dimensions, with the chassis ID first, for example GigabitEthernet 1/1/0/0.
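The renaming rule can be illustrated with a short Python sketch. It assumes that the extra leading dimension is the chassis (CSS) ID, that the other three dimensions are unchanged, and that a space separates the interface type from its numbering. The function name `to_css_interface_name` is hypothetical and is not a Huawei command or API.

```python
def to_css_interface_name(name: str, css_id: int) -> str:
    """Prefix a three-dimensional interface name with the chassis (CSS) ID.

    Assumption: the name looks like "<type> <slot>/<card>/<port>".
    """
    if css_id not in (1, 2):
        raise ValueError("the CSS ID can be set to only 1 or 2")
    # Split the interface type from the numbering at the last space.
    if_type, _, numbering = name.rpartition(" ")
    parts = numbering.split("/")
    if len(parts) != 3:
        raise ValueError(f"expected a three-dimensional name, got {name!r}")
    new_numbering = "/".join([str(css_id), *parts])
    return f"{if_type} {new_numbering}".strip()


print(to_css_interface_name("GigabitEthernet 1/0/0", css_id=1))
```

Any script or NMS template that stores three-dimensional interface names would need a mapping of this kind after CSS is enabled.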

Upgrade and backup of configuration files

Before CSS is enabled, the configuration can be backed up and saved as configuration files.

After CSS is enabled, the backup configuration file should be specified as the startup configuration file.

In a CSS, only the configuration file of the master switch is executed. The configuration of the backup switch is lost.


CSS Setup: Roles of Chassis and SRU

Chassis: Master/Slave

SRU: Master/Slave/Standby

[Figure: two S9700 chassis, each with LPUs in slots 1-6, SRUs in slots 7-8, power modules PWR1-PWR4, and MCU1-MCU2. The master chassis holds the Master and Standby SRUs; the slave chassis holds the Slave and Standby SRUs.]

CSS Setup: Rules for Clustering Connection

The clustering connections are fixed.

Port-Group1: Port1, Port2, Port3, Port4

Port-Group2: Port5, Port6, Port7, Port8

[Figure: two S9700 chassis (LPUs in slots 1-6, SRUs in slots 7-8, PWR1-PWR4, MCU1-MCU2). The master chassis holds the Master SRU and Standby SRU; the slave chassis holds the Slave SRU and Standby SRU. Clustering ports 1-4 and 5-8 on each SRU are cabled to the opposite chassis.]

On one SRU, Group 1 and Group 2 must be connected to different SRUs.

Interfaces in group 1 must be connected to interfaces in group 1 on the opposite chassis, and likewise for group 2. There are no special requirements for the cable connection sequence inside a group.
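The cabling rules above can be checked mechanically. The following Python sketch is one possible way to do so, not a Huawei tool: each cable end is modeled as a hypothetical `(chassis_id, sru_slot, port)` tuple, and the slot numbers 7 and 8 in the example are taken from the figure labels as an assumption.

```python
from collections import defaultdict


def port_group(port: int) -> int:
    """Ports 1-4 form group 1 and ports 5-8 form group 2."""
    if 1 <= port <= 4:
        return 1
    if 5 <= port <= 8:
        return 2
    raise ValueError(f"clustering port must be 1-8, got {port}")


def check_cabling(cables):
    """Return a list of rule violations for (end_a, end_b) cables."""
    errors = []
    # (chassis, slot, group) -> set of peer (chassis, slot) SRUs
    peers = defaultdict(set)
    for a, b in cables:
        (ch_a, slot_a, port_a), (ch_b, slot_b, port_b) = a, b
        if ch_a == ch_b:
            errors.append(f"{a} and {b} are on the same chassis")
            continue
        if port_group(port_a) != port_group(port_b):
            errors.append(f"{a} and {b} are in different port groups")
            continue
        group = port_group(port_a)
        peers[(ch_a, slot_a, group)].add((ch_b, slot_b))
        peers[(ch_b, slot_b, group)].add((ch_a, slot_a))
    # On one SRU, group 1 and group 2 must reach different peer SRUs.
    for (chassis, slot, group), peer_set in list(peers.items()):
        if group != 1:
            continue
        overlap = peer_set & peers.get((chassis, slot, 2), set())
        for peer in sorted(overlap):
            errors.append(
                f"SRU {(chassis, slot)}: groups 1 and 2 both connect to SRU {peer}"
            )
    return errors


good = [
    ((1, 7, 1), (2, 7, 1)), ((1, 7, 5), (2, 8, 5)),
    ((1, 8, 1), (2, 8, 1)), ((1, 8, 5), (2, 7, 5)),
]
print(check_cabling(good))                      # no violations expected
print(check_cabling([((1, 7, 1), (2, 7, 5))]))  # group mismatch
```

Port order inside a group is deliberately not checked, because the rules place no requirement on it.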

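CSS Setup: Clustering Preemption Rules

Which chassis becomes the master chassis? The following criteria are compared in order:

Running status: the switch that is already running becomes the master first. A switch begins preempting only after it has started.

Clustering priority: the chassis with the higher clustering priority wins.

MAC address: if the priorities are equal, the chassis with the smaller MAC address wins.

Chassis ID: if the MAC addresses are equal, the chassis with the smaller chassis ID wins.

* The two chassis IDs must be different; otherwise packets may be dropped between the two chassis.

[Figure: preemption process. PRI: higher -> master chassis, lower -> standby chassis. If equal, MAC: smaller -> master chassis, bigger -> standby chassis. If equal, chassis ID: smaller -> master chassis, bigger -> standby chassis. The chassis IDs are always different.]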
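The comparison order above (running status, then priority, then MAC address, then chassis ID) can be sketched in Python as follows. This is an illustration of the stated rules only; the `Chassis` class, its field names, and the sample MAC addresses are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chassis:
    chassis_id: int
    priority: int
    mac: str               # for example "00e0-fc00-0001"
    running: bool = False  # True if the chassis was already running


def mac_value(mac: str) -> int:
    """Convert a MAC address string to an integer for comparison."""
    digits = mac.replace("-", "").replace(":", "").replace(".", "")
    return int(digits, 16)


def elect_master(a: Chassis, b: Chassis) -> Chassis:
    """Apply the preemption rules in order and return the master chassis."""
    if a.chassis_id == b.chassis_id:
        raise ValueError("the two chassis IDs must be different")
    # 1. A chassis that is already running wins.
    if a.running != b.running:
        return a if a.running else b

    # 2-4. Higher priority, then smaller MAC, then smaller chassis ID.
    def key(c: Chassis):
        return (-c.priority, mac_value(c.mac), c.chassis_id)

    return min(a, b, key=key)


a = Chassis(chassis_id=1, priority=100, mac="00e0-fc00-0002")
b = Chassis(chassis_id=2, priority=200, mac="00e0-fc00-0001")
print(elect_master(a, b).chassis_id)
```

Because the chassis IDs must differ, the final comparison always produces a winner.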


CSS Division and Combination: What

What is division? One logical switch splits into two switches.

What is combination? Two switches merge into one logical switch.

CSS Division and Combination: When

When does division occur?

All 16 clustering cables are obstructed for up to 20s.

All four HiGig channels are shut down for up to 3s.

All 16 cables are faulty.

At least two CSS cards are faulty.

At least two SRUs are faulty.

CSS is enabled on both chassis and is then disabled on one of them.

Both SRUs of the backup chassis are reset from a management device.

When does combination occur?

CSS is enabled on both chassis, but each is in the single-chassis CSS state.

The possible causes are as follows:

The switches were not started, or were not connected using CSS cables.

The CSS was divided.

CSS Division and Combination: How

How is division implemented?

In the master chassis:
Step 1: Send clustering division alarm packets.
Step 2: Switch the standby SRU to the slave SRU, and then perform a batch backup.

In the slave chassis:
Step 1: Send clustering division alarm packets.
Step 2: Switch the slave SRU to the master SRU and the standby SRU to the slave SRU.

How is combination implemented?

Step 1: The two chassis compete with each other to determine the master chassis and the slave chassis.

Step 2: The slave chassis restarts.
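The SRU role changes during division can be written as a small lookup table. The Python sketch below only restates the steps above. The assumption that the master SRU keeps its role is not stated explicitly in the source, and the names `DIVISION_TRANSITIONS` and `roles_after_division` are hypothetical.

```python
# SRU role before division -> SRU role after division, per chassis.
DIVISION_TRANSITIONS = {
    "master chassis": {
        "master": "master",   # assumption: the master SRU keeps its role
        "standby": "slave",
    },
    "slave chassis": {
        "slave": "master",
        "standby": "slave",
    },
}


def roles_after_division(chassis_role, sru_roles):
    """Map the SRU roles of one chassis to their roles after division."""
    mapping = DIVISION_TRANSITIONS[chassis_role]
    return [mapping[role] for role in sru_roles]


print(roles_after_division("master chassis", ["master", "standby"]))
print(roles_after_division("slave chassis", ["slave", "standby"]))
```

After division, each chassis therefore ends up with one master SRU and one slave SRU, which is why the two halves then behave as independent switches.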


CSS Division and Combination: Impact

Impact of CSS division

Users cannot log in to the CSS through the service interface.

Protocol packets may be processed incorrectly.

Why? After division, the two chassis have the same management IP address and the same system MAC address.

Impact of combination

Services on the slave chassis might be lost, because the slave chassis needs to be reset.

Why?

When the slave chassis is reset, its chassis MAC address changes to the system MAC address. The system MAC address should be changed to the master chassis MAC address.

The slave chassis's own configuration becomes invalid. After the slave chassis is reset, the system uses the master chassis's configuration.


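CSS Packet Forwarding: Data Packets Through the Clustering Link

Layer 2 and Layer 3 packets that are transmitted from an LPU in one chassis to an LPU in the other chassis cross the clustering link.

[Figure: master and slave chassis. Each LPU's PP connects to the fabrics on the SRUs of its chassis (Master SRU and Standby SRU; Slave SRU and Standby SRU), and the fabrics of the two chassis are joined by the clustering link.]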

CSS Packet Forwarding: Protocol Packets Through the Clustering Link

Protocol packets received on an LPU are reported to the active CPU.

The protocol packets include ARP, ICMP, MSTP, and OSPF packets, among others.

[Figure: master and slave chassis with LPUs (PP), fabrics, the Master SRU, the Slave SRU, and two Standby SRUs, showing the path of protocol packets across the clustering link.]

CSS Packet Forwarding: Control Packets Through the Clustering Link

Control packets between the two chassis travel from an LPU's CPU to an SRU's CPU.

The control packets include SFP loss, shutdown, and temperature alarm packets.

[Figure: master and slave chassis with LPUs (PP), fabrics, the Master SRU, the Slave SRU, and two Standby SRUs, showing the path of control packets across the clustering link.]

CSS Packet Forwarding: Basic Rules for Forwarding Layer 2 and Layer 3 Packets

Forwarding follows the packet forwarding rules of the LSW (fabric or PP) chips.

Ingress modules search the forwarding tables, including the Layer 2 and Layer 3 tables, obtain the DST_MODID and DST_PID, write them to the HiGig header, and forward the packets.

Fabrics forward packets using only the DST_MODID and DST_PID in the HiGig header. They do not look up the packet again.

Egress modules ignore the HiGig header. They search only the Layer 2 and Layer 3 tables using the packet information and then determine the final forwarding route.

The tables include the VLAN table, trunk table, ARP table, and port table.
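The three forwarding stages can be modeled in a few lines of Python. This is a simplified sketch under stated assumptions, not the chip's actual data structures: the table contents, MAC addresses, link names, and the `Frame` and `HiGigHeader` classes are all hypothetical, and only a Layer 2 lookup is shown.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class HiGigHeader:
    dst_modid: int
    dst_pid: int


@dataclass
class Frame:
    dst_mac: str
    payload: bytes
    higig: Optional[HiGigHeader] = None


# Hypothetical ingress Layer 2 table: MAC -> (DST_MODID, DST_PID)
INGRESS_L2_TABLE = {"00e0-fc00-0a01": (3, 12), "00e0-fc00-0b02": (5, 7)}

# Hypothetical fabric table: DST_MODID -> outgoing fabric link
FABRIC_LINKS = {3: "local-lpu-link", 5: "clustering-trunk"}


def ingress(frame: Frame) -> Frame:
    """Look up the destination and write it to the HiGig header."""
    modid, pid = INGRESS_L2_TABLE[frame.dst_mac]
    frame.higig = HiGigHeader(dst_modid=modid, dst_pid=pid)
    return frame


def fabric(frame: Frame) -> str:
    """Forward using only the HiGig header; the packet is not inspected."""
    return FABRIC_LINKS[frame.higig.dst_modid]


def egress(frame: Frame, local_l2_table: dict) -> str:
    """Ignore the HiGig header and look the packet up again locally."""
    return local_l2_table[frame.dst_mac]


frame = ingress(Frame(dst_mac="00e0-fc00-0b02", payload=b"data"))
print(fabric(frame))
print(egress(frame, {"00e0-fc00-0b02": "GigabitEthernet 2/1/0/7"}))
```

The split matters in a CSS because the fabric stage is the one that decides whether a packet stays in the chassis or crosses the clustering link.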

CSS Packet Forwarding: Basic Rules for Forwarding Protocol and Control Packets

Based on the "CPU to CPU" flag in the HiGig header:

Control packets between the CPUs of the master SRU and the slave SRU

Control packets between an LPU and the active SRU of its chassis, which carry software upgrades and internal communication

Based on the MOD_ID:

Control packets between the CPUs of the master SRU and the slave SRU

Protocol packets between an LPU and the CPU of the master SRU

CSS Packet Forwarding: Rule Processing in Forwarding the Backup Table

When you download the configuration using command lines:

The module is backed up to the slave SRU using HA.

The module is backed up to the standby SRU using IPC.

When you log in to the slave SRU:

The module is backed up in batch mode to the slave SRU using HA.

When you log in to the standby SRU:

The module is backed up to the standby SRU using IPC.

CSS Packet Forwarding: Concept of MODPORT

MOD overview

Packets are forwarded based on the MOD, much as packets in an IP network are forwarded based on IP addresses.

The MOD management module assigns MOD IDs in a unified manner.

MODPORT table

The MODPORT table is similar to the routing table of an IP network.

The CSS management module dynamically detects and updates the MODPORT entries between different MPUs, because the connections in a CSS are not fixed.

The MODPORT entry destined for a line card is updated by the MOD management module when it assigns MODs to that line card.

CSS Packet Forwarding: Rules Processing in a Trunk

Why are all the clustering HiGig links bound into one trunk?

A trunk ensures high performance for cross-chassis packet forwarding.

The trunk interface is connected to multiple destination MODs.

How is the MOD selected?

Hash: The MOD of service packets is determined using a hash algorithm. Based on the MOD, the service packets are forwarded through the fabric. In addition, the two MPUs share the load.

Override: Protocol and control packets must be sent to the master SRU and processed by the MPU's CPU. Therefore, these packets cannot use the hash.

What is override?

For protocol and control packets: in a trunk, packets with a specified MOD are sent over one given route.

If the HiGig link is down, the software selects a new route.
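The difference between hash selection and override can be sketched in Python. The sketch models trunk members as named HiGig links rather than MODs, uses CRC32 as a stand-in for the unspecified hardware hash, and falls back to the first available link when the override link is down. All three choices, and the link names, are assumptions made for illustration.

```python
import zlib


def select_link(kind: str, flow_key: bytes, links: dict, override: str) -> str:
    """Choose a trunk member for a packet.

    kind:     "service", "protocol", or "control"
    links:    member name -> True if the HiGig link is up
    override: member pinned for protocol and control packets
    """
    up = sorted(name for name, is_up in links.items() if is_up)
    if not up:
        raise RuntimeError("no HiGig link is up")
    if kind == "service":
        # Hash: spread service flows across the members that are up.
        return up[zlib.crc32(flow_key) % len(up)]
    # Override: protocol and control packets follow one given route.
    if links.get(override):
        return override
    # The pinned link is down, so the software selects a new route.
    return up[0]


links = {"higig0": True, "higig1": True, "higig2": True, "higig3": True}
print(select_link("protocol", b"ospf-hello", links, override="higig1"))
links["higig1"] = False
print(select_link("protocol", b"ospf-hello", links, override="higig1"))
```

Keeping protocol and control packets on a pinned route keeps them on a known path to the master SRU, while service traffic still benefits from load balancing.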


CSS Troubleshooting: Failure Class

Clustering cables connected to the wrong interfaces

Clustering link-layer errors (only a single link error can be detected)

Clustering card (VS08) hardware failure

Active SRU restart

Removal of one SRU

Active/standby switchover

CSS Troubleshooting: Cable Failure Check

Check which interface is connected incorrectly.

Check how the interfaces should be linked using cables.

Check:

2/13/0/5 belongs to group 2 of slot 2/13 and should be connected to a port in group 2 of slot 1/13.

1/13/0/4 belongs to group 1 of slot 1/13 and should be connected to a port in group 1 of slot 2/13. (Group 1: ports 1-4; group 2: ports 5-8)

CSS Troubleshooting: Cable Fault Check

Link faults can be detected.

The check determines whether a clustering port is shut down.

Each clustering port that goes down reduces the CSS bandwidth by 6.25%.

Check:

Mar 31 2010 10:53:43 SYS-136 CSSM/4/CSSLINKDOWN: OID 1.3.6.1.4.1.2011.5.25.183.3.3.2.1 1/13 CSS port 3 down.

Mar 31 2010 10:53:43 SYS-136 CSSM/4/CSSLINKDOWN: OID 1.3.6.1.4.1.2011.5.25.183.3.3.2.1 1/14 CSS port 2 down.
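Log lines of this form can be parsed to find the affected port. The Python sketch below is written against the two sample lines only, so the regular expression is an assumption about the format rather than a documented log grammar. It reads `1/13` as chassis 1, slot 13, following the slot notation used in the cable check.

```python
import re

# Pattern derived from the sample CSSLINKDOWN lines (an assumption).
LINKDOWN = re.compile(
    r"CSSM/(?P<severity>\d+)/CSSLINKDOWN: OID (?P<oid>[\d.]+) "
    r"(?P<chassis>\d+)/(?P<slot>\d+) CSS port (?P<port>\d+) down\."
)


def parse_linkdown(line: str):
    """Return the chassis, slot, and port of a CSSLINKDOWN line, or None."""
    match = LINKDOWN.search(line)
    if match is None:
        return None
    return {
        "oid": match["oid"],
        "chassis": int(match["chassis"]),
        "slot": int(match["slot"]),
        "port": int(match["port"]),
    }


sample = (
    "Mar 31 2010 10:53:43 SYS-136 CSSM/4/CSSLINKDOWN: OID "
    "1.3.6.1.4.1.2011.5.25.183.3.3.2.1 1/13 CSS port 3 down."
)
print(parse_linkdown(sample))
```

Counting the distinct ports reported this way gives the number of failed links, each of which accounts for a 6.25% bandwidth loss.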

* After a link failure occurs, the link check may be disabled in the following situations:

An active/standby switchover occurs.

SRUs are restarted.

Another link failure occurs.

The clustering card hardware is faulty.

CSS Troubleshooting: Data Plane Link Map

One link failure results in the loss of 6.25% of the bandwidth.

One SRU or clustering card failure results in the loss of 50% of the bandwidth.
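Both percentages follow from the 16 clustering cables. The short Python calculation below assumes that every cable carries an equal share of the bandwidth and that one clustering card terminates 8 of the 16 cables (its ports 1-8). The constant and function names are hypothetical.

```python
TOTAL_CABLES = 16     # clustering cables in one CSS
CABLES_PER_CARD = 8   # assumption: one card terminates ports 1-8


def bandwidth_loss_pct(cables_down: int) -> float:
    """Share of CSS bandwidth lost when the given number of cables is down."""
    if not 0 <= cables_down <= TOTAL_CABLES:
        raise ValueError("cables_down must be between 0 and 16")
    return 100.0 * cables_down / TOTAL_CABLES


print(bandwidth_loss_pct(1))                # one link failure
print(bandwidth_loss_pct(CABLES_PER_CARD))  # one SRU or card failure
```

The sketch covers a single card failure only. When two cards fail, the loss depends on which cables they share, and the CSS may divide.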

CSS Troubleshooting: CSS Switchover

What causes the SRUs to perform an active/standby switchover?

The active/standby switchover is triggered from the command line.

The master SRU is reset.

How is the switchover performed?

The backup SRU of the original chassis updates the MODPORT of MOD 0 and the override.

The original slave SRU deletes the MODPORT table of MOD 0 and makes MOD 0 point to the CPU.

The original master SRU is isolated (a reset occurs).


CSS Application Scenario 1

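[Figure: an OSPF network, a CSS made of S9700-1 (master) and S9700-2 (standby) joined by a stacking cable, trunks Trunk 1 and Trunk 2, and switches S1 and S2.]

CSS Application Scenario 2

[Figure: OSPF Network1 and OSPF Network2, a CSS made of S9700-1 (master) and S9700-2 (standby) joined by a stacking cable, Trunk1, switches S1 and S2, and interfaces VLANIF 10 and VLANIF 20.]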

Copyright © 2012 Huawei Technologies Co., Ltd. All Rights Reserved. The information in this document may contain predictive statements including, without limitation, statements regarding the future financial and operating results, future product portfolio, new technology, etc. There are a number of factors that could cause actual results and developments to differ materially from those expressed or implied in the predictive statements. Therefore, such information is provided for reference purpose only and constitutes neither an offer nor an acceptance. Huawei may change the information at any time without notice.

HUAWEI ENTERPRISE ICT SOLUTIONS A BETTER WAY