CN114223182B - Method and system for isolating leaf switches in a network - Google Patents

Info

Publication number
CN114223182B
Authority
CN
China
Prior art keywords
server
leaf switch
switch
leaf
notification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201980099305.3A
Other languages
Chinese (zh)
Other versions
CN114223182A (en)
Inventor
郑海洋
喻湘宁
王永灿
刘永锋
王国辉
王海勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd
Publication of CN114223182A
Application granted
Publication of CN114223182B
Current legal status: Active
Anticipated expiration

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00: Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/12: Discovery or management of network topologies

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The present disclosure provides methods and systems for isolating a leaf switch that is connected to a server in a network. The method may include: in response to receiving a request to isolate the leaf switch in the network, sending a notification to the server through the leaf switch, wherein the notification instructs the server to stop sending egress traffic to the leaf switch; determining whether an acknowledgment of the notification is received from the server; and stopping ingress traffic toward the server in response to a determination that the acknowledgment has been received.

Description

Method and system for isolating leaf switches in a network

Background

Server centers, such as massively scalable data centers (MSDCs), can include multiple network servers and switches and provide zero-downtime services, including remote storage, cloud processing, and large-scale data distribution. Because of the zero-downtime requirement, high availability (HA) is critical in an MSDC.

To provide high-bandwidth, low-latency, non-blocking connectivity, MSDCs widely adopt the Clos network topology. A Clos network can be based on a spine-leaf topology that includes multiple spine switches and multiple leaf switches. In a spine-leaf topology, a leaf switch can connect to all spine switches, for resiliency and scalability, as well as to multiple servers. A server can also connect to multiple leaf switches. Because of a hardware failure or a software upgrade, a leaf switch may have to be isolated from the Clos network for maintenance or upgrading. However, isolating a leaf switch can cause live traffic to be dropped, resulting in undesirable service interruptions.

Summary of the invention

Embodiments of the present invention provide a method for isolating a first leaf switch in a network, the first leaf switch being connected to a server in the network. The method may include: in response to receiving a request to isolate the first leaf switch in the network, sending a notification to the server via the first leaf switch, the notification instructing the server to stop sending egress traffic to the first leaf switch; determining whether an acknowledgment of the notification is received from the server; and, in response to a determination that the acknowledgment has been received, stopping ingress traffic toward the server.

The disclosed embodiments further provide a first leaf switch connected to a server in a network. The first leaf switch may include: a memory storing a set of instructions; and at least one processor coupled to the memory and configured to execute the set of instructions to cause the first leaf switch to perform: in response to receiving a request to isolate the switch in the network, sending a notification to the server, the notification instructing the server to stop sending egress traffic to the first leaf switch; determining whether an acknowledgment of the notification is received from the server; and, in response to a determination that the acknowledgment has been received, stopping ingress traffic toward the server.

Embodiments of the present invention also provide a non-volatile computer-readable medium storing a set of instructions that is executable by at least one processor of a leaf switch to cause the leaf switch to perform a method of isolating the leaf switch in a network. The leaf switch may be connected to a server in the network. The method may include: in response to receiving a request to isolate the first leaf switch in the network, sending a notification to the server through the first leaf switch, the notification instructing the server to stop sending egress traffic to the first leaf switch; determining whether an acknowledgment of the notification is received from the server; and, in response to a determination that the acknowledgment has been received, stopping ingress traffic toward the server.

Brief description of the drawings

Embodiments and various aspects of the present disclosure are illustrated in the following detailed description and the accompanying drawings. The various features shown in the drawings are not drawn to scale.

Figure 1 shows a schematic diagram of a Clos network.

Figure 2 shows a schematic diagram of an example network according to some embodiments of the present disclosure.

Figure 3 shows a schematic diagram of the network after the isolation, according to some embodiments of the present disclosure.

Figure 4 is a flowchart of a method of isolating a first leaf switch in a network, according to some embodiments of the present disclosure.

Figure 5 shows a block diagram of an example leaf switch according to some embodiments of the present disclosure.

Detailed description

Reference will now be made in detail to exemplary embodiments, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers are used throughout the drawings to refer to the same or similar parts.

As described above, in conventional systems, isolating a leaf switch from a Clos network can disrupt the network, for example by causing live traffic to be lost. The techniques described in this disclosure can minimize these types of disruption.

As used herein, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, composition, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, composition, article, or apparatus. The word "exemplary" is used in the sense of "example" rather than "ideal".

Figure 1 shows a schematic diagram of a network 100. Although network 100 is described as a Clos network, any network with an architecture of at least three layers may be used.

As shown in Figure 1, Clos network 100 has a three-layer architecture that includes a spine layer 110, a leaf layer 120, and a server layer 130. Spine layer 110 is the backbone of Clos network 100 and is responsible for interconnecting all leaf switches in leaf layer 120; it may include multiple spine switches (e.g., spine switches 112, 114, 116, and 118). Leaf layer 120 may provide access to devices such as servers and includes multiple leaf switches (e.g., leaf switches 122, 124, 126, and 128). Server layer 130 may include multiple servers (e.g., servers 132, 134, 136, and 138).

In this three-layer architecture, the leaf switches can be connected to the spine switches in a full-mesh topology. In other words, each leaf switch (e.g., leaf switch 122) is connected to every spine switch in spine layer 110 (e.g., spine switches 112, 114, 116, and 118), producing multiple links. The link between a leaf switch (e.g., leaf switch 122) and a spine switch can be selected at random from these links, so that the traffic load between leaf layer 120 and spine layer 110 is evenly distributed. The connections between leaf switches and spine switches may also be referred to as L3 connections.
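The full-mesh wiring and random link selection described above can be sketched as follows. This is only an illustration: the function names are hypothetical, the device numbers are the reference numerals of Figure 1, and the fixed random seed is an assumption made so that the sketch is repeatable.

```python
import random
from collections import Counter

SPINES = [112, 114, 116, 118]   # spine switches of Figure 1
LEAVES = [122, 124, 126, 128]   # leaf switches of Figure 1


def build_full_mesh(leaves, spines):
    """Return one (leaf, spine) link for every leaf/spine pair."""
    return [(leaf, spine) for leaf in leaves for spine in spines]


def pick_uplink(leaf, links, rng):
    """Randomly pick one of the leaf's uplinks, as the text describes."""
    candidates = [link for link in links if link[0] == leaf]
    return rng.choice(candidates)


links = build_full_mesh(LEAVES, SPINES)
rng = random.Random(0)  # fixed seed only so the sketch is repeatable
usage = Counter(pick_uplink(122, links, rng)[1] for _ in range(10_000))
print(len(links))       # 16 links: 4 leaves x 4 spines
print(usage)            # roughly equal counts per spine switch
```

Over many selections, each spine switch carries a similar share of leaf switch 122's traffic, which is the even distribution the passage refers to.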

Each leaf switch (e.g., leaf switch 122) may also be connected to at least one server in server layer 130 (e.g., server 132 and server 134). Conversely, each server (e.g., server 132) may be connected to at least two leaf switches (e.g., leaf switches 122 and 124) to ensure connectivity. For example, server 132 may establish a first link with leaf switch 122 and a second link with leaf switch 124. The first and second links between the server and the leaf switches may be referred to as L2 links.

With this three-layer architecture, expanding the capacity of Clos network 100 when it becomes overloaded is straightforward. For example, an additional spine switch can be added and connected to every leaf switch, providing additional inter-layer bandwidth between spine layer 110 and leaf layer 120 to relieve the overload.

Similarly, a new leaf switch can be added simply by connecting it to every spine switch. However, when an existing leaf switch is isolated from the Clos network, the isolation can cause unwanted service interruptions. For example, in the prior art, the L2 link between a leaf switch and a server can be shut down from the leaf switch side without the server being aware of the shutdown. The server may therefore keep sending traffic to the leaf switch until it detects the shutdown and switches the traffic to another L2 link. Traffic sent to the leaf switch before the switchover is never processed and has to be dropped, causing an undesirable service interruption.
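The size of the loss described above depends on how long the server takes to notice the shutdown. A back-of-the-envelope sketch, in which the function name and all figures are hypothetical:

```python
def dropped_packets(shutdown_tick, detect_tick, packets_per_tick):
    """Packets the server sends into a link the leaf has already shut down."""
    if detect_tick < shutdown_tick:
        raise ValueError("detection cannot precede the shutdown")
    return (detect_tick - shutdown_tick) * packets_per_tick


# Hypothetical figures: link shut at tick 10, server notices at tick 13,
# 1000 packets sent per tick.
print(dropped_packets(10, 13, 1000))  # 3000
```

If the server stops sending before the link goes down, the detection gap is zero and nothing is dropped, which is the situation the disclosed method aims for.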

Embodiments of the present disclosure provide methods and systems for isolating a leaf switch in a network while minimizing traffic disruption.

Figure 2 shows a schematic diagram of an example network 200 according to some embodiments of the invention. As shown in Figure 2, network 200 may include spine switches 212 and 214, leaf switches 222 and 224, and a server 232. Each of leaf switches 222 and 224 is connected to spine switches 212 and 214. Server 232 is connected to leaf switches 222 and 224. The connection between server 232 and leaf switch 222 may be referred to as a first L2 link, and the connection between server 232 and leaf switch 224 may be referred to as a second L2 link.

In some embodiments, leaf switch 222 may receive a request to isolate leaf switch 222 from network 200. For example, an administrator of network 200 may issue the request for maintenance, a software upgrade, or a similar purpose. The request can also be issued by network 200 itself. For example, when network 200 detects that leaf switch 222 has failed, network 200 can automatically request the isolation of leaf switch 222 to avoid further service interruption.

In some embodiments, when leaf switch 222 receives the isolation request, leaf switch 222 can determine whether the second L2 link, associated with leaf switch 224, has enough bandwidth to handle the traffic associated with leaf switch 222.

If the second L2 link associated with leaf switch 224 cannot handle the additional traffic associated with leaf switch 222, leaf switch 222 may indicate that the isolation cannot be performed. For example, leaf switch 222 may generate a message informing the administrator of network 200 that the isolation cannot be performed for the time being. Note that bandwidth on the second L2 link can be freed as its traffic decreases. For example, the second L2 link may not have enough bandwidth to handle the traffic associated with leaf switch 222 at a first moment but may have enough at a second moment. Therefore, in some embodiments, the message generated by leaf switch 222 may further indicate another time at which to perform the isolation.
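The bandwidth check above can be sketched as a comparison of headroom against load. The text does not say how bandwidth is measured, so the `L2Link` type, its fields, and the Gbps figures are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class L2Link:
    capacity_gbps: float   # assumed link capacity
    load_gbps: float       # assumed current load

    @property
    def headroom_gbps(self) -> float:
        return self.capacity_gbps - self.load_gbps


def can_isolate(first: L2Link, second: L2Link) -> bool:
    """True if the second link can absorb the first link's current traffic."""
    return second.headroom_gbps >= first.load_gbps


first_link = L2Link(capacity_gbps=25.0, load_gbps=8.0)
busy_second = L2Link(capacity_gbps=25.0, load_gbps=20.0)
idle_second = L2Link(capacity_gbps=25.0, load_gbps=10.0)

print(can_isolate(first_link, busy_second))  # False: only 5 Gbps of headroom
print(can_isolate(first_link, idle_second))  # True: 15 Gbps of headroom
```

The two calls model the "first moment" and "second moment" of the passage: the same request is refused while the second link is busy and accepted once its load has dropped.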

If the second L2 link associated with leaf switch 224 can handle the additional traffic associated with leaf switch 222, leaf switch 222 may proceed with the isolation request. For example, in response to the request to isolate leaf switch 222, leaf switch 222 may send a notification 202 to server 232. Notification 202 informs server 232 that egress traffic to leaf switch 222 should stop. In some embodiments, notification 202 may include an identifier of leaf switch 222 (e.g., a media access control (MAC) address). After receiving notification 202, server 232 may stop sending egress traffic to leaf switch 222. Egress traffic that would have been sent to leaf switch 222 can now be sent to another leaf switch (e.g., leaf switch 224) and thus still reach the spine layer.

Note that after egress traffic to leaf switch 222 has stopped, leaf switch 222 may continue to send ingress traffic toward server 232.

Server 232 may then send leaf switch 222 an acknowledgment 204 of notification 202. Acknowledgment 204 informs leaf switch 222 that server 232 is aware of the request. In some embodiments, acknowledgment 204 may further inform leaf switch 222 that egress traffic is now being sent to another leaf switch (e.g., leaf switch 224).

The Link Aggregation Control Protocol (LACP) can be used to manage the L2 links and the communication between leaf switches and servers. LACP allows a network device (e.g., leaf switch 222) to negotiate automatic bundling of links by sending LACP data units (LACPDUs) to a peer device (e.g., server 232) that also runs LACP. In some embodiments, notification 202 and acknowledgment 204 may be transmitted between leaf switch 222 and server 232 using LACPDUs.

Leaf switch 222 can therefore determine whether acknowledgment 204 has been received from server 232. In some embodiments, leaf switch 222 may further determine whether acknowledgment 204 is received from server 232 within a given time (e.g., 3 seconds). Leaf switch 222 may fail to receive acknowledgment 204 within the given time for various reasons, including at least one of the following: leaf switch 222 failing to send notification 202, server 232 failing to receive notification 202, server 232 failing to send acknowledgment 204, or leaf switch 222 failing to receive acknowledgment 204.

In response to a determination that acknowledgment 204 has not been received, leaf switch 222 may resend notification 202 to server 232 to inform server 232 of the isolation request again.
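The wait-and-resend behavior can be sketched with a queue that stands in for the channel carrying acknowledgments. The function name, the queue-based channel, and the `max_attempts` cap are assumptions for illustration; the 3-second default comes from the example in the text.

```python
import queue

ACK_TIMEOUT_S = 3.0  # the text gives 3 seconds as an example


def notify_until_acked(send_notification, ack_queue, timeout_s=ACK_TIMEOUT_S,
                       max_attempts=3):
    """Send the notification, resending whenever no ack arrives in time.

    Returns the number of notifications sent, or None if every attempt
    timed out. max_attempts is an assumption made to keep the sketch finite.
    """
    for attempt in range(1, max_attempts + 1):
        send_notification()
        try:
            ack_queue.get(timeout=timeout_s)
            return attempt
        except queue.Empty:
            continue
    return None


acks = queue.Queue()
sent = []


def send():
    sent.append("notification 202")
    if len(sent) == 2:          # the first notification is lost
        acks.put("acknowledgment 204")


print(notify_until_acked(send, acks, timeout_s=0.01))  # 2
```

In the usage example the first notification goes unanswered, so the acknowledgment only arrives after the resend.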

In response to a determination that acknowledgment 204 has been received, leaf switch 222 may stop ingress traffic toward server 232. Thus, server 232 has already stopped egress traffic to leaf switch 222 before the receipt of acknowledgment 204 is determined, and leaf switch 222 stops ingress traffic toward server 232 after the receipt is determined. In other words, once acknowledgment 204 has been received, traffic between leaf switch 222 and server 232 (both the egress traffic and the ingress traffic of server 232) can stop completely.

Note that before acknowledgment 204 is received, some ingress traffic toward server 232 may still have been sent and may still be in flight as live traffic. Leaf switch 222 therefore preferably does not disconnect from server 232 until the live traffic toward server 232 has been processed. If leaf switch 222 disconnected from server 232 immediately after receiving acknowledgment 204, the live traffic might not be fully processed.

Because processing the live traffic may take only a short time, leaf switch 222 may disconnect from server 232 once a period of time has elapsed after ingress traffic toward the server is stopped at leaf switch 222. The period is configurable and can be set to a few milliseconds. Note that although leaf switch 222 is then isolated from server 232, traffic between the spine switches of the spine layer and server 232 can still be carried through leaf switch 224 after leaf switch 222 is disconnected.

In some embodiments, leaf switch 222 may also use the last packet of the traffic toward server 232 as a confirmation. In response to receiving that confirmation, server 232 may further confirm that all traffic has been processed and that it is safe to disconnect leaf switch 222.

It will be appreciated that when leaf switch 222 comes back online, the L2 link between leaf switch 222 and server 232 can be re-established.

As can be seen from the above, during the isolation of leaf switch 222, notification 202 and acknowledgment 204 can be used to coordinate the ordered termination of the egress traffic and then the ingress traffic of server 232. In this way, the live traffic between leaf switch 222 and server 232 can be fully processed before the isolation, avoiding traffic disruption.

As described above, LACP can be used to manage the links and the communication between leaf switches and servers. In some embodiments, LACP may further be used to carry out the ordered termination of the server's egress traffic and ingress traffic.

In some embodiments, the LACP port state field of an LACPDU can be used as a synchronization field to coordinate the leaf switch and the server during the isolation. The LACP port state field contains at least three bits, each of which is a flag indicating a particular state of the sender's port. Table 1 below shows example meanings of three bits of the LACP port state field: "Synchronization", "Collecting", and "Distributing".

Table 1

The "Synchronization" bit indicates whether the sending device and the receiving device are in sync. As shown in Table 1, if the "Synchronization" bit is "0", the receiving end and the sending end are not synchronized, and the receiving device can resynchronize the physical ports of the receiving and sending ends. Resynchronization may also be called "flapping". Once synchronized, the physical ports can be aggregated into a single high-bandwidth data path that provides better connectivity. The aggregated physical ports may also be called a link aggregation group (LAG).

If the "Synchronization" bit is "1", the receiving end and the sending end are synchronized, and at least one of collecting and distributing can be performed. As shown in Table 1, when the three bits of the port state field are "101", the sending device is sending traffic to the receiving end and expects the receiving end to stop sending traffic. More specifically, in embodiments of the present disclosure, port state bits of "101" at a leaf switch indicate that the leaf switch is still sending ingress traffic to the servers connected to it and expects each server to stop sending egress traffic to the leaf switch. For example, when leaf switch 222 sends notification 202 to server 232 in an LACPDU, leaf switch 222 may set the LACP port state field to "101". After receiving notification 202, server 232 can read the port state of the leaf switch, stop its egress traffic to the leaf switch, and send an acknowledgment (e.g., acknowledgment 204 in Figure 2).

When server 232 sends acknowledgment 204 in response to notification 202, the three bits of server 232's port state field may be set to "110", indicating that server 232 is still receiving traffic from leaf switch 222 and expects a confirmation from leaf switch 222 that no more traffic is being transmitted. Because server 232 continues to process live traffic on the link between leaf switch 222 and server 232, the live traffic can be drained from the link, so that traffic disruption is avoided or minimized when leaf switch 222 is isolated.

After leaf switch 222 receives acknowledgment 204, leaf switch 222 may further confirm that no traffic is being transmitted on the link between leaf switch 222 and server 232. For example, the last packet of live traffic sent by leaf switch 222 may serve as this confirmation.

Leaf switch 222 is then ready to disconnect from the server. Figure 3 shows a schematic diagram of network 300 after the isolation, according to some embodiments of the invention. As shown in Figure 3, after receiving the acknowledgment, leaf switch 222 has disconnected from server 232. Traffic sent by server 232 can still reach spine switches 212 and 214 through leaf switch 224, so users of network 300 do not notice the isolation of leaf switch 222.

Referring again to Figure 2 and Table 1, when the three bits of the synchronization field of both leaf switch 222 and server 232 are "111", leaf switch 222 and server 232 are communicating in both directions.
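The three states discussed above ("101", "110", and "111") can be decoded with a small helper. The bit order (Synchronization, Collecting, Distributing) is inferred from how the text reads "101" and "110", and the class name and description strings are hypothetical. A real LACPDU port state field holds more flags than these three, which matches the text's "at least three bits".

```python
from typing import NamedTuple


class PortState(NamedTuple):
    synchronization: bool
    collecting: bool     # port is receiving traffic
    distributing: bool   # port is sending traffic

    @classmethod
    def parse(cls, bits: str) -> "PortState":
        if len(bits) != 3 or set(bits) - {"0", "1"}:
            raise ValueError(f"expected three bits, got {bits!r}")
        return cls(*(b == "1" for b in bits))

    def describe(self) -> str:
        if not self.synchronization:
            return "not synchronized"
        if self.collecting and self.distributing:
            return "bidirectional traffic"
        if self.distributing:
            return "sending only; peer should stop sending"
        if self.collecting:
            return "receiving only; awaiting confirmation of no traffic"
        return "synchronized, no traffic"


for bits in ("101", "110", "111"):
    print(bits, "->", PortState.parse(bits).describe())
```

Under this reading, "101" is what the leaf switch advertises in the notification and "110" is what the server advertises in the acknowledgment.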

It will be appreciated that this reuse of the LACP port state field of the LACPDU can be activated when a leaf switch receives a request to isolate it from the network. It will also be appreciated that the synchronization field may be transmitted between the leaf switch and the server using three different bits of the LACP port state field of the LACPDU. In some embodiments, the synchronization field may be transmitted using a data unit other than an LACPDU.

Figure 4 is a flowchart of a method 400 of isolating a first leaf switch in a network, according to some embodiments of the invention. In addition to the first leaf switch, the network may include a second leaf switch and a spine switch. Both the first and second leaf switches may be connected to a server (e.g., server 134 of server layer 130), and the spine switch is connected to the first and second leaf switches. Method 400 may be performed by an electronic device. The electronic device may include a memory storing a set of instructions and at least one processor that executes the set of instructions to cause the electronic device to perform method 400. For example, the electronic device may be a leaf switch (e.g., leaf switch 222 in Figures 2 and 3) of a leaf layer (e.g., leaf layer 120). Referring to Figure 4, method 400 may include the following steps.

In step 402, in response to receiving a request to isolate the first leaf switch in the network, the first leaf switch may send a notification to the server. The notification may instruct the server to stop sending egress traffic to the first leaf switch. In some embodiments, the notification may also include a first port state of the first leaf switch. For example, the notification may be carried by a first LACPDU (Link Aggregation Control Protocol Data Unit), and the first port state may be represented by the LACP port state field of the first LACPDU. In some embodiments, the LACP port state field may include three bits that indicate the port state of the sender (e.g., the first leaf switch). In step 402, the LACP port state field of the first LACPDU may be "101", indicating that the port of the first leaf switch is still distributing traffic but not receiving traffic, and that the first leaf switch expects the server to stop egress traffic toward the first leaf switch. After sending the notification, the first leaf switch can therefore continue to send ingress traffic to the server for processing.

In step 404, the first leaf switch may determine whether an acknowledgment of the notification is received from the server. When the server receives the notification, the server can stop sending egress traffic to the first leaf switch and send an acknowledgment back to the first leaf switch. The acknowledgment may be carried by a second LACPDU. Likewise, a second port state of the server may be represented by the LACP port state field of the second LACPDU. In step 404, the LACP port state field of the second LACPDU may be "110", indicating that the server's port is still receiving traffic from the first leaf switch and expects a confirmation from the first leaf switch that no more traffic is being transmitted.

As described above, the server is also connected to a second leaf switch. In some embodiments, the notification may further cause egress traffic from the server to be sent to the spine switch through the second leaf switch.
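The server-side effect of the notification can be sketched as follows. This is a minimal model under stated assumptions, not the patented implementation: the `ServerBond` class, its method names, the `"ack"` return value, and the hash-based choice among remaining uplinks are all hypothetical and serve only to show egress traffic shifting to the second leaf switch.

```python
class ServerBond:
    """Toy model of a server with uplinks to several leaf switches."""

    def __init__(self, leaf_names):
        # Every uplink starts out eligible to carry egress traffic.
        self.egress_enabled = {name: True for name in leaf_names}

    def handle_isolation_notice(self, leaf_name: str) -> str:
        """Stop sending egress traffic to the notifying leaf and acknowledge."""
        self.egress_enabled[leaf_name] = False
        return "ack"  # hypothetical stand-in for the second LACPDU

    def pick_egress_leaf(self, flow_hash: int) -> str:
        """Choose an uplink for a flow among the leaves still eligible."""
        candidates = sorted(n for n, ok in self.egress_enabled.items() if ok)
        if not candidates:
            raise RuntimeError("no uplink available for egress traffic")
        return candidates[flow_hash % len(candidates)]


bond = ServerBond(["leaf1", "leaf2"])
reply = bond.handle_isolation_notice("leaf1")
```

After the notice is handled, every call to `bond.pick_egress_leaf` returns `"leaf2"`, so egress traffic reaches the spine switch only through the second leaf switch.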

At step 406, in response to a determination that the acknowledgment has been received, the first leaf switch may stop ingress traffic toward the server.

In some embodiments, in response to a determination at step 404 that the acknowledgment has not been received within a first time period, the first leaf switch may return to step 402 and send another notification to the server. Notably, if the number of notifications sent reaches a given limit, the first leaf switch may generate an error code indicating that the isolation has failed.
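The retry behavior between steps 402 and 404 can be sketched as a bounded loop. The function and enum names, the default limit of three attempts, and the one-second timeout are assumptions chosen for illustration; the text specifies neither the values nor the form of the error code.

```python
from enum import Enum
from typing import Callable


class IsolationResult(Enum):
    ACKNOWLEDGED = 0
    ISOLATION_FAILED = 1  # hypothetical error code; the text defines no value


def notify_until_acknowledged(
    send_notification: Callable[[], None],
    wait_for_ack: Callable[[float], bool],
    max_attempts: int = 3,       # assumed "given number" of notifications
    ack_timeout_s: float = 1.0,  # assumed "first time period"
) -> IsolationResult:
    """Send the notification (step 402) until acknowledged (step 404)."""
    for _ in range(max_attempts):
        send_notification()
        # wait_for_ack blocks for at most ack_timeout_s and reports
        # whether the acknowledgment arrived within that window.
        if wait_for_ack(ack_timeout_s):
            return IsolationResult.ACKNOWLEDGED
    return IsolationResult.ISOLATION_FAILED
```

The two callables are injected so that the loop can be exercised without a real network; on a switch they would wrap the LACPDU transmit and receive paths.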

At step 408, the first leaf switch may disconnect itself from the server. After the first leaf switch is disconnected, traffic between the spine switch and the server is communicated through the second leaf switch. In some embodiments, the first leaf switch may disconnect from the server within a second time period after the ingress traffic toward the server is stopped at the first leaf switch. In some embodiments, when the server receives the last packet of the ingress traffic, the server may further confirm that the last packet has been processed. In response to that confirmation, the first leaf switch may disconnect from the server.
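Steps 406 and 408 can be sketched together as a drain-then-disconnect routine. This sketch combines the two embodiments described above by treating the second time period as an upper bound and the last-packet confirmation as an early exit; that combination, the function name, the polling interval, and the returned reason strings are assumptions for illustration only.

```python
import time
from typing import Callable, Optional


def drain_and_disconnect(
    stop_ingress: Callable[[], None],
    disconnect: Callable[[], None],
    drain_period_s: float,  # assumed length of the "second time period"
    last_packet_confirmed: Optional[Callable[[], bool]] = None,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
    poll_interval_s: float = 0.01,
) -> str:
    """Stop ingress traffic (step 406), then disconnect (step 408)."""
    stop_ingress()
    deadline = clock() + drain_period_s
    reason = "drain period elapsed"
    while clock() < deadline:
        # Optional early exit: the server confirmed the last packet.
        if last_packet_confirmed is not None and last_packet_confirmed():
            reason = "last packet confirmed"
            break
        sleep(poll_interval_s)
    disconnect()
    return reason
```

The clock and sleep functions are parameters so that the timing logic can be tested with a fake clock instead of real delays.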

FIG. 5 shows a block diagram of an example leaf switch 500 according to some embodiments of the present disclosure. Leaf switch 500 may be connected to a server in a network and configured to perform method 400. The network may also include a spine switch.

Leaf switch 500 may include a plurality of network ports 502a-502n, a memory 504, and a processor 506 coupled to the plurality of network ports 502a-502n and the memory 504.

Network ports 502a-502n may be used to send and receive traffic to and from the spine switch and the server. Memory 504 may store a set of instructions for performing method 400. In addition, memory 504 may store an address lookup table that includes the addresses of devices in the network and the addresses of the corresponding ports. Processor 506 may execute the set of instructions to cause leaf switch 500 to perform method 400.

Embodiments of the present disclosure also provide a computer program product. The computer program product may include a non-volatile computer-readable storage medium having computer-readable program instructions for causing a processor to perform the above method.

The computer-readable storage medium may be a tangible device that can store instructions for use by an instruction execution device. For example, the computer-readable storage medium may be, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of computer-readable storage media includes: a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM), static random access memory (SRAM), flash memory, a portable compact disc read-only memory (CD-ROM), a digital versatile disc (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing.

The computer-readable program instructions for performing the above method may be assembler instructions, instruction set architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state-setting data, or source code or object code written in any combination of one or more programming languages, including object-oriented programming languages and conventional procedural programming languages. The computer-readable program instructions may execute entirely on a computer system as a stand-alone software package, or partly on a first computer and partly on a second computer remote from the first computer. In the latter scenario, the second, remote computer may be connected to the first computer through any type of network, including a local area network (LAN) or a wide area network (WAN).

The computer-readable program instructions may be provided to one or more processors of a computer or other programmable data processing apparatus to produce a machine, such that the instructions, when executed by the one or more processors of the computer or other programmable data processing apparatus, create means for implementing the above method.

The flowcharts and diagrams in the figures illustrate exemplary architectures, functionality, and operations of possible implementations of devices, methods, and computer program products according to various embodiments of the present specification. In this regard, a block in a flowchart or diagram may represent a software program, segment, or portion of code, which includes one or more executable instructions for implementing a specified function. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may in fact be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending on the functionality involved. It will also be noted that each block of the diagrams and/or flowcharts, and combinations of blocks in the diagrams and flowcharts, can be implemented by special-purpose hardware-based systems that perform the specified functions or acts, or by combinations of special-purpose hardware and computer instructions.

It should be understood that certain features of the specification that are, for clarity, described in the context of separate embodiments may also be provided in combination in a single embodiment. Conversely, various features of the specification that are, for brevity, described in the context of a single embodiment may also be provided separately, in any suitable subcombination, or in any other described embodiment of the specification. Certain features described in the context of various embodiments are not to be considered essential features of those embodiments, unless the embodiment is inoperative without those elements.

Although the specification has been described in conjunction with specific embodiments, it is evident that many alternatives, modifications, and variations will be apparent to those skilled in the art. Accordingly, the present disclosure embraces all such alternatives, modifications, and variations that fall within the spirit and broad scope of the appended claims.

Claims (16)

1. A method for isolating a first leaf switch in a network, the first leaf switch being connected to a server in the network, comprising:
in response to receiving a request to isolate the first leaf switch in the network, sending, by the first leaf switch, a notification to the server, the notification instructing the server to stop sending egress traffic to the first leaf switch;
determining whether an acknowledgment of the notification is received from the server;
in response to a determination that the acknowledgment has been received, stopping ingress traffic toward the server; and
disconnecting the first leaf switch from the server, including: disconnecting the first leaf switch from the server within a second time period after the ingress traffic toward the server is stopped at the first leaf switch.

2. The method of claim 1, wherein the network further comprises a second leaf switch connected to the server and a spine switch connected to the first leaf switch and the second leaf switch.

3. The method of claim 2, wherein the notification further causes egress traffic from the server to be sent to the spine switch through the second leaf switch.

4. The method of any one of claims 1-3, wherein, after the notification is sent, the first leaf switch sends the ingress traffic to the server for processing.

5. The method of any one of claims 1-4, further comprising:
in response to a determination that the acknowledgment has not been received within a first time period, sending another notification to the server.

6. The method of claim 3, wherein disconnecting the first leaf switch from the server causes traffic between the spine switch and the server to be communicated through the second leaf switch.

7. The method of any one of claims 1-6, wherein the notification is carried by a first Link Aggregation Control Protocol Data Unit (LACPDU) and the acknowledgment is carried by a second LACPDU.

8. The method of any one of claims 1-7, wherein the notification includes a first port state of the first leaf switch and the acknowledgment includes a second port state of the server.

9. A first leaf switch connected to a server in a network, comprising:
a memory storing a set of instructions; and
at least one processor coupled to the memory and configured to execute the set of instructions to cause the first leaf switch to perform:
in response to receiving a request to isolate the switch in the network, sending a notification to the server, the notification instructing the server to stop sending egress traffic to the first leaf switch;
determining whether an acknowledgment of the notification is received from the server;
in response to a determination that the acknowledgment has been received, stopping ingress traffic toward the server; and
disconnecting the first leaf switch from the server, including: disconnecting the first leaf switch from the server within a second time period after the ingress traffic toward the server is stopped at the first leaf switch.

10. The first leaf switch of claim 9, wherein the network further comprises a second leaf switch connected to the server and a spine switch connected to the first leaf switch and the second leaf switch.

11. The first leaf switch of claim 10, wherein the notification further causes egress traffic from the server to be sent to the spine switch through the second leaf switch.

12. The first leaf switch of any one of claims 9-11, wherein the at least one processor is further configured to execute the set of instructions to cause the first leaf switch to further perform:
after sending the notification, sending the ingress traffic to the server for processing.

13. The first leaf switch of any one of claims 9-12, wherein the at least one processor is further configured to execute the set of instructions to cause the first leaf switch to further perform:
in response to a determination that the acknowledgment has not been received within a first time period, sending another notification to the server.

14. The first leaf switch of claim 10, wherein disconnecting the first leaf switch from the server further comprises:
after the first leaf switch is disconnected, communicating traffic between the spine switch and the server through the second leaf switch.

15. The first leaf switch of any one of claims 9-14, wherein the notification is carried by a first Link Aggregation Control Protocol Data Unit (LACPDU) and the acknowledgment is carried by a second LACPDU.

16. A non-volatile computer-readable medium storing a set of instructions executable by at least one processor of a leaf switch to cause the leaf switch to perform a method for isolating the leaf switch in a network, the leaf switch being connected to a server in the network, the method comprising:
in response to receiving a request to isolate a first leaf switch in the network, sending a notification to the server via the first leaf switch, wherein the notification instructs the server to stop sending egress traffic to the first leaf switch;
determining whether an acknowledgment of the notification is received from the server;
in response to a determination that the acknowledgment has been received, stopping ingress traffic toward the server; and
disconnecting the first leaf switch from the server, including: disconnecting the first leaf switch from the server within a second time period after the ingress traffic toward the server is stopped at the first leaf switch.

CN201980099305.3A 2019-08-19 2019-08-19 Method and system for isolating leaf switches in a network Active CN114223182B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/101379 WO2021031070A1 (en) 2019-08-19 2019-08-19 Method and system for isolating a leaf switch in a network

Publications (2)

Publication Number Publication Date
CN114223182A CN114223182A (en) 2022-03-22
CN114223182B true CN114223182B (en) 2024-01-05

Family

ID=74660161

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201980099305.3A Active CN114223182B (en) 2019-08-19 2019-08-19 Method and system for isolating leaf switches in a network

Country Status (2)

Country Link
CN (1) CN114223182B (en)
WO (1) WO2021031070A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102710486A (en) * 2012-05-17 2012-10-03 杭州华三通信技术有限公司 S-channel status notification method and equipment
CN103067291A (en) * 2012-12-24 2013-04-24 杭州华三通信技术有限公司 Method and device of up-down link correlation

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7606167B1 (en) * 2002-04-05 2009-10-20 Cisco Technology, Inc. Apparatus and method for defining a static fibre channel fabric
US8392637B2 (en) * 2008-07-25 2013-03-05 Broadcom Corporation System and method for enabling legacy medium access control to do energy efficent ethernet
US9787573B2 (en) * 2014-12-31 2017-10-10 Juniper Networks, Inc. Fast convergence on link failure in multi-homed Ethernet virtual private networks

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102710486A (en) * 2012-05-17 2012-10-03 杭州华三通信技术有限公司 S-channel status notification method and equipment
CN103067291A (en) * 2012-12-24 2013-04-24 杭州华三通信技术有限公司 Method and device of up-down link correlation

Also Published As

Publication number Publication date
WO2021031070A1 (en) 2021-02-25
CN114223182A (en) 2022-03-22

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant