EP1535168B1 - Multi-port high-speed serial fabric interconnect chip in a meshed configuration - Google Patents
Multi-port high-speed serial fabric interconnect chip in a meshed configuration Download PDFInfo
- Publication number
- EP1535168B1 EP1535168B1 EP03749266A EP03749266A EP1535168B1 EP 1535168 B1 EP1535168 B1 EP 1535168B1 EP 03749266 A EP03749266 A EP 03749266A EP 03749266 A EP03749266 A EP 03749266A EP 1535168 B1 EP1535168 B1 EP 1535168B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- fabric
- destination specific
- port
- local
- queues
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 239000004744 fabric Substances 0.000 title claims abstract description 72
- 239000013307 optical fiber Substances 0.000 claims abstract description 13
- 230000006870 function Effects 0.000 claims abstract description 9
- 239000004020 conductor Substances 0.000 claims abstract 13
- 230000003287 optical effect Effects 0.000 claims description 30
- 230000009977 dual effect Effects 0.000 claims description 5
- 239000000835 fiber Substances 0.000 abstract description 15
- 238000004891 communication Methods 0.000 abstract description 13
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 abstract description 10
- 229910052802 copper Inorganic materials 0.000 abstract description 10
- 239000010949 copper Substances 0.000 abstract description 10
- 230000003139 buffering effect Effects 0.000 abstract description 2
- 238000005516 engineering process Methods 0.000 description 7
- 230000000903 blocking effect Effects 0.000 description 6
- 229930186949 TCA Natural products 0.000 description 5
- 229940123445 Tricyclic antidepressant Drugs 0.000 description 5
- 239000000872 buffer Substances 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 238000000034 method Methods 0.000 description 3
- 230000006855 networking Effects 0.000 description 3
- 230000002093 peripheral effect Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 235000008694 Humulus lupulus Nutrition 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000007417 hierarchical cluster analysis Methods 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000010409 thin film Substances 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H05—ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
- H05K—PRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
- H05K7/00—Constructional details common to different types of electric apparatus
- H05K7/14—Mounting supporting structure in casing or on frame or rack
- H05K7/1438—Back panels or connecting means therefor; Terminals; Coding means to avoid wrong insertion
- H05K7/1459—Circuit configuration, e.g. routing signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic control in data switching networks
- H04L47/10—Flow control; Congestion control
- H04L47/12—Avoiding congestion; Recovering from congestion
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic control in data switching networks
- H04L47/10—Flow control; Congestion control
- H04L47/24—Traffic characterised by specific attributes, e.g. priority or QoS
- H04L47/2408—Traffic characterised by specific attributes, e.g. priority or QoS for supporting different services, e.g. a differentiated services [DiffServ] type of service
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L49/00—Packet switching elements
- H04L49/10—Packet switching elements characterised by the switching fabric construction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L49/00—Packet switching elements
- H04L49/35—Switches specially adapted for specific applications
- H04L49/356—Switches specially adapted for specific applications for storage area networks
- H04L49/358—Infiniband Switches
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L49/00—Packet switching elements
- H04L49/40—Constructional details, e.g. power supply, mechanical construction or backplane
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic control in data switching networks
- H04L47/10—Flow control; Congestion control
- H04L47/20—Traffic policing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic control in data switching networks
- H04L47/10—Flow control; Congestion control
- H04L47/24—Traffic characterised by specific attributes, e.g. priority or QoS
- H04L47/2416—Real-time traffic
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic control in data switching networks
- H04L47/10—Flow control; Congestion control
- H04L47/24—Traffic characterised by specific attributes, e.g. priority or QoS
- H04L47/2441—Traffic characterised by specific attributes, e.g. priority or QoS relying on flow classification, e.g. using integrated services [IntServ]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic control in data switching networks
- H04L47/10—Flow control; Congestion control
- H04L47/32—Flow control; Congestion control by discarding or delaying data units, e.g. packets or frames
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic control in data switching networks
- H04L47/10—Flow control; Congestion control
- H04L47/34—Flow control; Congestion control ensuring sequence integrity, e.g. using sequence numbers
Definitions
- the present invention relates to computing systems, and specifically to systems that use high speed serial packet-switching fabrics, such as the InfiniBand architecture and PCI-Express.
- the central processing unit In current-generation computers, the central processing unit (CPU) is connected to the system memory and to peripheral devices by a shared parallel bus, such as the Peripheral Component Interface (PCI) bus or the Industry Standard Architecture (ISA) bus.
- PCI Peripheral Component Interface
- ISA Industry Standard Architecture
- a bus is the channel or path between components in a computer.
- server to server connections and links to other server-related systems depend on parallel bus technology.
- Server design dependent on a shared bus input/output (I/O) architecture may deliver for example 512 megabytes per second (MB/sec) of potential bandwidth that is shared among devices connected to the bus.
- next-generation bus standards Many of the proposed new standards have something in common. They propose doing away with the shared-bus technology used in PCI and moving to a point-to-point switching connection. Thus, the computer industry is moving towards fast, packetized, serial input/output bus architectures, in which computing hosts and peripherals are linked by a switching network, commonly referred to as a switching fabric. A number of architectures of this type have been proposed, and the first next-generation standard is already in place. InfiniBand has been advanced by a consortium headed by a group of industry leaders and PCI-Express is soon to follow.
- the InfiniBand architecture is an I/O infrastructure technology that simplifies and speeds server-to-server connections and links to other server related systems, such as remote storage and network devices.
- the Infiniband fabric or mesh is the central network of connections between nodes, i.e., servers, and remote networking and storage devices within a data center.
- Infiniband architecture also constructs highly available data centers through multiple levels of redundancy. By connecting nodes via multiple links, Infiniband systems continue to perform even if one link fails. For increased reliability, multiple switches in a fabric provide redundant paths that allow for seamless re-routing of data through the fabric should a link between switches fail. Completely redundant fabrics may be configured for the highest level of reliability and may continue to perform even if an entire fabric fails.
- HCA Host Channel Adapters
- TCA Target Channel Adapters
- HCAs are present in servers or even desktop machines and provide an interface that is used to integrate the InfiniBand with the operating system.
- TCAs are present on I/O devices such as a Redundant Array of Independent Disks (RAID) subsystem or a Just a Bunch of Disks (JBOD) subsystem.
- RAID Redundant Array of Independent Disks
- JBOD Just a Bunch of Disks
- Each channel adapter may have one or more ports. A channel adapter with more than one port, may be connected to multiple switch ports. This allows for multiple paths between a source and a destination, resulting in performance and reliability benefits.
- Infiniband channel adapters essentially have their own intelligence, therefore, they offload some of the communication processing from the operating system and CPU.
- Host and Target Channel adapters present an interface to the Open Systems Interconnection (OSI) reference model layers above them that allow those layers to generate and consume packets.
- OSI Open Systems Interconnection
- the host In the case of a server writing a file to a storage device, the host is generating the packets that are then consumed by the storage device.
- switches simply forward packets between two of their ports based on an established routing table and the addressing information stored on the packets.
- Infiniband uses Internet Protocol Version 6 (IPv6) headers natively, and can connect to Local Area Network (LAN) and Wide Area Network (WAN) switches and routers with the TCA providing a seamless transition between the system area network and external networks.
- IPv6 Internet Protocol Version 6
- LAN Local Area Network
- WAN Wide Area Network
- InfiniBand defines network layers up to the transport layer and offers all four combinations of reliable/unreliable and connection datagram service.
- the Infiniband transport protocol is used within the system area network, but other transport protocols can be accessed by sending raw packets via a TCA.
- TCAs provide connections to storage, fibre channel networks, and other I/O nodes, and include an I/O controller specific to the device's protocol, be it Small Computer Systems Interface (SCSI), Ethernet, etc.
- SCSI Small Computer Systems Interface
- a TCA includes an Infiniband protocol engine that dramatically accelerates the performance of critical Infiniband transport functions in the TCA's hardware, achieving aggregate internal transaction switching throughput speeds of 150 gigabits per second.
- TCAs are highly optimized for Infiniband target applications such as bridging from Infiniband devices (switches) to local busses i.e., Gigabit Ethernet, Fibre Channel, and Transport Control Protocol/Intennet Protocol TCP/IP devices, as well as next-generation I/O protocols.
- EP 0844763 discusses a type of switching fabric for exchanging variable-size frames of digital information between frame processors coupled directly or indirectly to one or more digital communication lines. It comprises one or more multi-line serial communication controllers (MSCCs), and a backplane providing a full mesh of serial point-to-point bi-directional links between each MSCC, and, in a loopback, from each MSCC back to itself.
- MSCCs multi-line serial communication controllers
- backplane providing a full mesh of serial point-to-point bi-directional links between each MSCC, and, in a loopback, from each MSCC back to itself.
- the Infiniband architecture is similar to the Virtual Interface (VI) Architecture (VIA) for operation at the OSI transport level.
- VIA Virtual Interface
- the VIA is a server messaging protocol whose focus is to provide a very low latency link between the communicating servers.
- the specification defines four basic components: virtual interfaces, completion queues, VI Providers, and VI Consumers.
- latency arises in the form of overhead and delays that are added to the time needed to transfer the actual data. If latency is broken down into components, the major contributors are: a) the overhead of executing network protocol code within the operating system, b) context switches to move in and out of kernel mode to receive and send out the data, and c) excessive copying of data between the user level buffers and the Network Interface Chip (NIC) memory.
- NIC Network Interface Chip
- VIA Since VIA was only intended to be used for communication across the physical servers of a cluster (in other words across high-bandwidth links with very high reliability), the specification eliminates much of the standard network protocol code that deals with special cases. Also, because of the well-defined environment of operation, the message exchange protocol was defined to avoid kernel mode interaction and allow for access to the MC from user mode. Finally, because of the direct access to the NIC, unnecessary copying of the data into kernel buffers was also eliminated since the user is able to directly transfer data from user-space to the NIC.
- the VIA provides Remote Direct Memory Access (RDMA) operations where the initiator of the operation specifies both the source and destination of a data transfer, resulting in zero-copy data transfers with minimum involvement of the CPUs.
- RDMA Remote Direct Memory Access
- the InfiniBand Architecture uses basically the VIA primitives for its operation at the transport layer.
- a work queue that consists of a queue pair (QP).
- WQE work queue element
- Each process may create one or more QPs for communications purposes with another application.
- each queue pair has an associated context. Since both the protocol and the structures are all very clearly defined, queue pairs can be implemented in hardware, thereby off-loading most of the work from the CPU.
- a completion queue element CQE is created and placed in the completion queue.
- High speed serial fabrics such as the InfiniBand architecture may also be used for connecting multiple components (boards) communicating to one another within a single chassis over a common fabric or switched fabric.
- AdvancedTCA TM Advanced Telecommunications Compute Architecture
- CompactPCI TM is a variation of the Eurocard form factor that uses PCI buses in a back plane.
- the Eurocard form factor is a physical form factor of line cards that have a front panel.
- a line card has a solid face plate with handles, slides into the front of a chassis by the use of guides, and makes contact with a backplane or midplane within the chassis.
- AdvancedTCA TM defines star and meshed topologies for a packet switched fabric in the back plane of a chassis.
- the proposed meshed fabric solution utilizes Infiniband technology and a meshed fabric solution.
- the design methodology being proposed potentially puts a switch on every single board in the chassis. For example, suppose there are 16 boards or slots in a chassis, if a 16 port switch exists on every single board, 16 switches are required in the chassis. Each board communicates with all other boards to form a meshed or switched fabric.
- TCA Target Channel Adaptor
- Embodiments of the present invention exists within the context of connecting multiple entities within a system (specifically multiple boards within a chassis), using multiple TCAs.
- the components consist of multi-port TCAs and a meshed backplane that is equipped to mount boards via connectors on the backplane.
- FIG. 1 depicts the connection of a FIC 100 that interconnects between local components on a board (local busses) 120 and a backplane mesh fabric 110.
- FIG. 2 depicts connection of multiple interconnected FICs.
- Each board 211-218 on the mesh 220 contains one FIC 201-208 .
- Each FIC may provide an interconnection to every other board in the shelf (chassis).
- FIG. 2 shows an 8-way (8 board) mesh, but any size mesh is realizable, hence the 8 th board 218 is labeled "N".
- the lines drawn show how the traces from each FIC 'port' travels across the passive backplane interconnect fabric 220 to its corresponding 'port' on another FIC, (located on another board).
- the FIC provides data queuing first in first out (FIFOs) on both its inbound and outbound sides for each of its multiple ports.
- FIFOs first in first out
- the boards can inter-communicate with each other in a "meshed" topology. Therefore, every board has a point to point connection with every other board in the backplane, making a complete cross-connect without the need for a separate switch device.
- FIG. 3 illustrates the same 8-way mesh depicted in FIG. 2, highlighting the logic switch which is created by distributing the switch function across FICs 201-208 on all the boards 211-218.
- Each multi-port FIC 201-208 is shown having inbound and outbound queues 221-228 for each port.
- FIG. 3 illustrates the same 8-way logical distributed switch, where each local bus (board 211-218) represents an access point on the logical switch.
- each FIC (201-208) has queuing (221-228) on both the inbound and outbound sides, coupled with the inherent cross-connect in the backplane 220, an extremely efficient logically distributed switch can be built at much lower cost, greater reliability, higher efficiency, lower power and lower latency than a typical, centrally switched topology, or any architecture that places a switch on every board of a meshed topology.
- Links are the connections between boards (or node to node, or switch to switch, in general). They are capable of transmitting no more than a certain amount of information at any one time, known as bandwidth. Bandwidth may be measured, for example, in bits/second. In general, there is no restriction on the number of boards (N) except that it be greater than 1.
- a flow is a set of packets that all share a set of characteristics.
- the characteristics include the source and destination address of the packet, as well as its protocol type and possibly its priority or classification. It is important that all the packets in a flow maintain a certain sequence in which they were sent, preferably arriving at their destination in that same sequence. If they do arrive out of sequence they can be re-sequenced, or put back in order. However, it is not desireable to re-sequence packets at the end. Therefore, a good design attempts to keep all the packets in a flow in sequence all through the network so that they arrive at the far end in sequence and do not require re-sequencing.
- QoS Quality of Service
- the network traffic must be differentiated, i.e., classified according to service levels. Some traffic should be handled in one way, other traffic another.
- an entity may set up a Service Level Agreement (SLA) with a network service provider (the entity that is providing the network), that specifies that the entity's traffic will always have available a certain bandwidth (e.g., 10 Megabits per second, or 10 Mbps) and latency (e.g., less than 1 millisecond, or ms). Then, whenever a packet is detected that comes from or is going to that entity, the packet should receive special handling. If the overall flow to the entity is currently less than 10 Mbps, then the packet should get through without being dropped and with a latency of less than 1 ms. This type of flow is said to be handled by Assured Forwarding (AF). Packets arriving when the current flows are greater than 10 Mbps will be handled differently, perhaps as Best Effort (BE) traffic (see below).
- SLA Service Level Agreement
- BE Best Effort
- a router may be set up to recognize certain types of flows as real-time flows.
- Real-time flows are characterized by the idea that if the packet doesn't arrive in time it might as well not arrive at all. For example, a packet of voice data in a telephone conversation has to be available at the receiver when it is needed, or it is useless. Too late, and it cannot be used and will just be discarded. So real-time traffic (a stream of packets) belonging to a voice conversation should be handled by a class of behavior known as Expedited Forwarding (EF). A packet handled this way will be forwarded very quickly (with low latency). Ultimately, the variation in latency (known as jitter) will also be low.
- EF Expedited Forwarding
- packets in such a stream may be simply dropped if their aggregate bandwidth exceeds a certain threshold.
- a SLA covering such packets may be expensive to the buyer because providing this kind of service requires that a router have features that make it expensive to build.
- a third example is for traffic that is not covered by any SLA, which is called Best Effort (BE) traffic.
- BE Best Effort
- This type of traffic is typically found now across the Internet. Best effort packets may be dropped for any reason, and have no particular guarantees about latency or jitter.
- FIG. 4 Processing of packets from ingress to fabric input is shown in FIG. 4. Depicted is the flow of packets through a single FIC 400 located on a board going to 16 destination boards 401-416.
- a network processor 450 on the FIC 400 classifies all outgoing traffic to a particular flow.
- the flow defines a set of parameters including the 16 destination boards 401-416, the path by which it will get to the destination board, i.e., one of the 16 fabric ports 421-436, and classification (AF, EF or BE).
- the individual packets end up in virtual output (send) queues 460 according to the classification.
- the fabric ports 421-436 and N local ports 490 have both send and receive terminals.
- Multiple virtual output (send) queues 460 are maintained per FIC for two reasons. The primary reason is that multiple virtual output queues 460 provide a steering function. After classification, the destination board 401-416 has been identified. If the packets were put back into the same (common) queue, the destination board 401-416 information would have to be carried along with the packet. This scheme is in fact done in some implementations. However, separating the packets into separate queues is another way to carry the information.
- the second reason is to prevent head-of-line blocking. This occurrence should be a rare condition (the boards should not back up very often, but it can happen, and separate queues prevent this condition). Head-of-line blocking would occur if there were only a single outbound queue, and if, for example, a packet slated to go board 2 were in front of a packet slated to go to board 3, and board 2 was heavily loaded but board 3 was not. The packet slated to go to board 3 would be unable to be taken off the queue because board 2 would be unavailable, and it would block the packet slated for board 3, even though the packet slated for board 3 could go if it could get off the queue.
- InfiniBand offers link layer Virtual Lanes (VLs) to support multiple logical channels (i.e. multiplexing) on the same physical link. Infiniband offers up to 16 virtual lanes per link. VLs provide a mechanism to avoid head-of-line blocking and the ability to support Quality of Service (QoS).
- QoS Quality of Service
- the difference between a Virtual Lane and a Service Level (SL) is that a Virtual Lane is the actual logical lane (multiplexed) used on a given point-to-point link.
- the Service Level stays constant as a packet traverses the fabric, and specifies the desired service level within a subnet.
- the SL (AF, EF or BE) is included in the link header, and each switch maps the SL to a VL supported by the destination link.
- a switch supporting a limited number of virtual lanes will map the SL field to a VL it supports. Without preserving the SL, the desired SL (AF, EF or BE) would be lost in this mapping, and later in the path, a switch supporting more VLs would be unable to recover finer granularity of SLs between two packets mapped to the same VL.
- the virtual lanes are established by classifying, sorting, and placing packets, according to service level and destination, into the virtual output queues 460.
- the packets are then multiplexed onto a link via a fabric port 421-436 to be transmitted to a destination board 401-416.
- the classifying, sorting, placing and multiplexing are carried out by the network processor 450.
- FIG. 4 also illustrates simple buffers (receive queues) 470 on the egress fabric side.
- the 16 fabric ports 421-436 have integrated serializer/de-serializers (SERDES). By allocating buffers (queues) to each port on both the receive and transmit sides, traffic can be managed without head of line blocking or congestion. This enables each board to have a private communications channel with every other board in the system.
- SERDES serializer/de-serializers
- Optical fibers may be employed to provide increased bandwidth while using only a few fibers in a small space. Copper traces, on the other hand, often have to contain multiple (parallel) interconnects in order to achieve a limited bandwidth.
- optical fibers may consist of either a mesh of individual optical fibers or a redundant pair of fibers configured in a counter-rotating ring topology. Either way, fibers represent a much wider bandwidth potential than copper, although a ring requires less fibers than a mesh.
- each board When using a mesh, each board will have a direct connection to every other board in the chassis using an optical fiber.
- Each port on a board may have a transmitter/receiver pair connected to a single fiber.
- WDM Wavelength Division Multiplexing
- optical interconnects in a backplane environment greatly increases chassis bandwidth as well as reducing electrical radiation that often accompanies copper interconnects.
- the components involved include an optical backplane in a physical ring topology, and the necessary transmitters and receivers for the size of the installation (i.e., number of slots in the chassis).
- optical add/drop multiplexer devices are required.
- FIG. 5 shows a high-level diagram of a system utilizing a dual-counter-rotating ring topology as an optical backplane according to an embodiment of the present invention.
- the concept is not limited to any particular number of boards in a chassis or shelf, but 3 boards is chosen to simplify the illustration.
- Each board 501-503 has transmitters 511-513 for each wavelength needed. The number of wavelengths needed is equal to the number of boards minus one, in the system.
- a counter-rotating ring topology there are two physical rings, each carrying the same traffic flows but moving in opposite directions.
- Each board 501-503 transmits in both directions across the dual counter-rotating ring topology 520 such that that loss of any one board (due to removal or failure) will not disrupt communications with the other boards.
- Each board 501-503 reserves one wavelength for each other board in the system. Therefore, in the 3 board diagram shown in FIG. 5, each board provides 2 wavelengths, one for each board in the system, less itself. Correspondingly, each board has a pair of receivers 531-533 to transduce the optical signals sent from the transmitters 511-513 on each of the other boards 501-503. To implement two counter-rotating rings, for each individual board, there is one transmitter and one receiver for each other board in the system, per ring.
- the first board 501 transmits ⁇ 1 , 2 , to board 2 502 and ⁇ 1,3 to board 3 503; and receives: ⁇ 2,1 from board 2 502 and ⁇ 3,1 from board 3 503.
- each wavelength Since each wavelength is transmitted by only one board and consumed by only one board, each wavelength uniquely identifies both a source and destination across the optical backplane 520. This provides a substantial improvement in efficiency over current day solutions from the standpoint of a reduction in protocol overhead.
- the mesh topology is preserved through the meshing of wavelengths; each containing dedicated bandwidth between any two end points.
- OADM optical add/drop multiplexer
- Thin-film filters have emerged as the technology of choice for OADMs because of their low cost and stability.
- other technologies such as tunable gratings and circulators, may come into prominence.
- FIG. 6 illustrates a ring adaptor utilizing an OADM according to an embodiment of the present invention.
- the blocks show how the ring adapter 541 shown previously in FIG. 5 may consist of a splitter 541A to route traffic towards the receiver 531, a filter 541B to eliminate wavelengths generated by the transmitters and a multiplexer 541C to add the new wavelengths received from the transmitters 511 back into the optical stream.
- the basic components of the dual counter-rotating ring optical backplane include a set of fibers 521, arranged in a dual-counter-rotating ring 520, housed in a overlay module 710 that allows fiber cavity mechanical connectors 720-725 to mount to the overlay module 710 front surface.
- Each mechanical connector 720-725 mates up to a mated connector 730-735 that mounts to the individual boards 740-745 in a chassis.
- Mechanical connectors 720-725 may also contain an OADM 541 ring adaptor previously depicted in FIG. 5 and FIG. 6.
- FIG. 8 illustrates an overlay module according to an embodiment of the present invention.
- the overlay module 710 includes a series of connectors 720-N mounted to the front of the enclosure.
- the overlay module 710 contains the optical fibers 521 that form the dual counter-rotating ring 520.
- each connector 720-725 has transmit 521A and receive 521B optical fibers for each direction on the counter-rotation ring 520.
- Each connector 720-725 (720-N) has transmit 521 A and receive fiber 521 B in each direction to each of its nearest neighbors. Fibers 521 pass between the two end connectors 720, 735 (720, N) to complete the rings 520.
- FIG. 9 illustrates a mechanical drawing of the position of the optical overlay module 710 in the AdvancedTCA Eurocard chassis 910 according to an embodiment of the present invention.
- the boards 740-N used in this approach are a "Euro-card" style front insertion type of board.
- the optical connectors 730 on the board's back edge align with and mate into the connector 720 provided by the overlay module 710.
- the overlay module 710 sits at the top in the rear panel access area.
- the board's optical transceivers (not shown) interconnect to the optical fiber ring 520 that is contained in the overlay module 710.
- All other boards 741-N inserted to this chassis 910 will also connect to the overlay module 710 and become peer members of the optical ring.
- the optical overlay module 710 will mount in any Eurocard chassis 910 and can function in addition to a traditional copper backplane that would provide power and management to the individual boards.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Microelectronics & Electronic Packaging (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
- Light Guides In General And Applications Therefor (AREA)
- Small-Scale Networks (AREA)
- Treatment Of Fiber Materials (AREA)
- Connector Housings Or Holding Contact Members (AREA)
- Infusion, Injection, And Reservoir Apparatuses (AREA)
Abstract
Description
- The present invention relates to computing systems, and specifically to systems that use high speed serial packet-switching fabrics, such as the InfiniBand architecture and PCI-Express.
- In current-generation computers, the central processing unit (CPU) is connected to the system memory and to peripheral devices by a shared parallel bus, such as the Peripheral Component Interface (PCI) bus or the Industry Standard Architecture (ISA) bus. Essentially, a bus is the channel or path between components in a computer. Likewise, current server to server connections and links to other server-related systems, such as remote storage and networking devices, depend on parallel bus technology. Server design dependent on a shared bus input/output (I/O) architecture may deliver for example 512 megabytes per second (MB/sec) of potential bandwidth that is shared among devices connected to the bus.
- As data path-widths grow, and clock speeds become faster, the shared parallel bus becomes too costly and complex to keep up with system demands. In response, the computer industry is working to develop next-generation bus standards. Many of the proposed new standards have something in common. They propose doing away with the shared-bus technology used in PCI and moving to a point-to-point switching connection. Thus, the computer industry is moving towards fast, packetized, serial input/output bus architectures, in which computing hosts and peripherals are linked by a switching network, commonly referred to as a switching fabric. A number of architectures of this type have been proposed, and the first next-generation standard is already in place. InfiniBand has been advanced by a consortium headed by a group of industry leaders and PCI-Express is soon to follow.
- The InfiniBand architecture is an I/O infrastructure technology that simplifies and speeds server-to-server connections and links to other server related systems, such as remote storage and network devices. The Infiniband fabric or mesh is the central network of connections between nodes, i.e., servers, and remote networking and storage devices within a data center. Infiniband architecture also constructs highly available data centers through multiple levels of redundancy. By connecting nodes via multiple links, Infiniband systems continue to perform even if one link fails. For increased reliability, multiple switches in a fabric provide redundant paths that allow for seamless re-routing of data through the fabric should a link between switches fail. Completely redundant fabrics may be configured for the highest level of reliability and may continue to perform even if an entire fabric fails.
- Each node connects to the fabric through a channel adapter. Infiniband classifies the channel adapter into two categories: Host Channel Adapters (HCA) and Target Channel Adapters (TCA). HCAs are present in servers or even desktop machines and provide an interface that is used to integrate the InfiniBand with the operating system. TCAs are present on I/O devices such as a Redundant Array of Independent Disks (RAID) subsystem or a Just a Bunch of Disks (JBOD) subsystem. Each channel adapter may have one or more ports. A channel adapter with more than one port, may be connected to multiple switch ports. This allows for multiple paths between a source and a destination, resulting in performance and reliability benefits.
- Infiniband channel adapters essentially have their own intelligence, therefore, they offload some of the communication processing from the operating system and CPU. Host and Target Channel adapters present an interface to the Open Systems Interconnection (OSI) reference model layers above them that allow those layers to generate and consume packets. In the case of a server writing a file to a storage device, the host is generating the packets that are then consumed by the storage device. In contrast to the channel adapter, switches simply forward packets between two of their ports based on an established routing table and the addressing information stored on the packets.
- Infiniband uses Internet Protocol Version 6 (IPv6) headers natively, and can connect to Local Area Network (LAN) and Wide Area Network (WAN) switches and routers with the TCA providing a seamless transition between the system area network and external networks. InfiniBand defines network layers up to the transport layer and offers all four combinations of reliable/unreliable and connection datagram service. The Infiniband transport protocol is used within the system area network, but other transport protocols can be accessed by sending raw packets via a TCA. TCAs provide connections to storage, fibre channel networks, and other I/O nodes, and include an I/O controller specific to the device's protocol, be it Small Computer Systems Interface (SCSI), Ethernet, etc. A TCA includes an Infiniband protocol engine that dramatically accelerates the performance of critical Infiniband transport functions in the TCA's hardware, achieving aggregate internal transaction switching throughput speeds of 150 gigabits per second. TCAs are highly optimized for Infiniband target applications such as bridging from Infiniband devices (switches) to local busses i.e., Gigabit Ethernet, Fibre Channel, and Transport Control Protocol/Intennet Protocol TCP/IP devices, as well as next-generation I/O protocols.
- The invention is as set out in the attached
independent claims -
- FIG. 1 depicts the connection of a Fabric Interconnect Chip according to an embodiment of the present invention;
- FIG. 2 depicts connection of multiple interconnected Fabric Interconnect Chips according to an embodiment of the present invention;
- FIG. 3 illustrates a logical switch distributed across many boards according to an embodiment of the present invention;
- Fig. 4 illustrates processing of packets from ingress to fabric input according to an embodiment of the present invention;
- FIG. 5 shows a high-level diagram of a system utilizing a dual-counter-rotating ring topology according to an embodiment of the present invention;
- FIG. 6 illustrates an optical add/drop multiplexer (OADM) according to an embodiment of the present invention;
- FIG. 7 illustrates circuit boards mating to connectors according to an embodiment of the present invention;
- FIG. 8 illustrates an Optical Backplane Overlay Module according to an embodiment of the present invention; and
- FIG. 9 illustrates a mechanical view of a chassis containing the Optical Backplane Overlay Module according to an embodiment of the present invention.
- EP 0844763 discusses a type of switching fabric for exchanging variable-size frames of digital information between frame processors coupled directly or indirectly to one or more digital communication lines. It comprises one or more multi-line serial communication controllers (MSCCs), and a backplane providing a full mesh of serial point-to-point bi-directional links between each MSCC, and, in a loopback, from each MSCC back to itself.
- The Infiniband architecture is similar to the Virtual Interface (VI) Architecture (VIA) for operation at the OSI transport level. The VIA is a server messaging protocol whose focus is to provide a very low latency link between the communicating servers. The specification defines four basic components: virtual interfaces, completion queues, VI Providers, and VI Consumers.
- In transferring a block of data from one server to another, latency arises in the form of overhead and delays that are added to the time needed to transfer the actual data. If latency is broken down into components, the major contributors are: a) the overhead of executing network protocol code within the operating system, b) context switches to move in and out of kernel mode to receive and send out the data, and c) excessive copying of data between the user level buffers and the Network Interface Chip (NIC) memory.
- Since VIA was only intended to be used for communication across the physical servers of a cluster (in other words across high-bandwidth links with very high reliability), the specification eliminates much of the standard network protocol code that deals with special cases. Also, because of the well-defined environment of operation, the message exchange protocol was defined to avoid kernel mode interaction and allow for access to the MC from user mode. Finally, because of the direct access to the NIC, unnecessary copying of the data into kernel buffers was also eliminated since the user is able to directly transfer data from user-space to the NIC. In addition to the standard send/receive operations that are typically available in a networking library, the VIA provides Remote Direct Memory Access (RDMA) operations where the initiator of the operation specifies both the source and destination of a data transfer, resulting in zero-copy data transfers with minimum involvement of the CPUs.
- The InfiniBand Architecture uses basically the VIA primitives for its operation at the transport layer. In order for an application to communicate with another application over the InfiniBand it must first create a work queue that consists of a queue pair (QP). In order for the application to execute an operation, it must place a work queue element (WQE) in the work queue. From there, the operation is picked-up for execution by the channel adapter. Therefore, the Work Queue forms the communications medium between applications and the channel adapter, relieving the operating system from having to deal with this responsibility.
- Each process may create one or more QPs for communications purposes with another application. Instead of having to arbitrate for the use of the single queue for the NIC card, as in a typical operating system, each queue pair has an associated context. Since both the protocol and the structures are all very clearly defined, queue pairs can be implemented in hardware, thereby off-loading most of the work from the CPU. Once a WQE has been processed properly, a completion queue element (CQE) is created and placed in the completion queue. The advantage of using the completion queue for notifying the caller of completed WQEs is that it reduces the interrupts that would be otherwise generated.
- High speed serial fabrics such as the InfiniBand architecture may also be used for connecting multiple components (boards) communicating to one another within a single chassis over a common fabric or switched fabric. For instance, a new standard, AdvancedTCA™ (Advanced Telecommunications Compute Architecture), is currently being developed out of the industry forum PCI Industrial Computer Manufacturers Group, which previously formulated the CompactPCI™ standard. CompactPCI™ is a variation of the Eurocard form factor that uses PCI buses in a back plane. The Eurocard form factor is a physical form factor of line cards that have a front panel. A line card has a solid face plate with handles, slides into the front of a chassis by the use of guides, and makes contact with a backplane or midplane within the chassis. This allows direct access to the cards unlike a PC chassis or server where a lid is opened to expose the PCI cards sitting inside the chassis. AdvancedTCA™ defines star and meshed topologies for a packet switched fabric in the back plane of a chassis. The proposed meshed fabric solution utilizes Infiniband technology and a meshed fabric solution. However, the design methodology being proposed potentially puts a switch on every single board in the chassis. For example, suppose there are 16 boards or slots in a chassis, if a 16 port switch exists on every single board, 16 switches are required in the chassis. Each board communicates with all other boards to form a meshed or switched fabric.
- This potentially is an inefficient methodology to transport data across a mesh. For example, this may require a Target Channel Adaptor (TCA) between the CPU and a switch on a board, followed by a switch communicating to another switch across a meshed back plane, followed by another TCA to the CPU on a receiving board, creating numerous hops and points of potential congestion.
- All switch devices have a given amount of inefficiency based upon their queuing logic. Most often, this leads to head-of-line blocking in the switch because multiple contenders are vying for a single bottle neck. However, by placing multiple queues both inbound and outbound, for each port on the switch, these head-of-line blocking latencies can be largely removed. A single device that incorporates such a queuing model is both costly and large, almost too large and expensive to put into a single piece of usable silicon. However, by distributing this functionality across multiple components, using the inherent cross-connect of the meshed backplane, an efficient logical distributed switch may be built at much lower cost and higher efficiency.
- Therefore, a simpler solution is to transport data from a TCA on a board directly to the TCA on another board through a meshed backplane. Communication from TCA to TCA requires putting more ports on each individual TCA (16 for example) along with the appropriate input and output buffering on each TCA. This proposed solution essentially creates one large logical distributed switch that may be more efficient in many ways, i.e., less costly, lower power, etc.
- Embodiments of the present invention exists within the context of connecting multiple entities within a system (specifically multiple boards within a chassis), using multiple TCAs. The components consist of multi-port TCAs and a meshed backplane that is equipped to mount boards via connectors on the backplane. A multi-port TCA capable of performing multiple bridging functions simultaneously i.e., bridging from an Infiniband meshed backplane to multiple local busses, i.e., Gigabit Ethernet, Fibre Channel, and TCP/IP devices, is referred to as a Fabric Interconnect Chip (FIC). FIG. 1 depicts the connection of a
FIC 100 that interconnects between local components on a board (local busses) 120 and abackplane mesh fabric 110. - FIG. 2 depicts connection of multiple interconnected FICs. Each board 211-218 on the mesh 220 contains one FIC 201-208 . Each FIC may provide an interconnection to every other board in the shelf (chassis). FIG. 2 shows an 8-way (8 board) mesh, but any size mesh is realizable, hence the 8th
board 218 is labeled "N". The lines drawn show how the traces from each FIC 'port' travels across the passive backplane interconnect fabric 220 to its corresponding 'port' on another FIC, (located on another board). - The FIC provides data queuing first in first out (FIFOs) on both its inbound and outbound sides for each of its multiple ports. By placing FICs on every board within a chassis that is connected to the meshed backplane, the boards can inter-communicate with each other in a "meshed" topology. Therefore, every board has a point to point connection with every other board in the backplane, making a complete cross-connect without the need for a separate switch device.
- In addition, by incorporating the queuing logic on both the inbound and outbound sides of the FIC, coupled with the inherent cross-connect of the mesh, a logical, distributed switch is created, without really having a switch device. FIG. 3 illustrates the same 8-way mesh depicted in FIG. 2, highlighting the logic switch which is created by distributing the switch function across FICs 201-208 on all the boards 211-218. Each multi-port FIC 201-208 is shown having inbound and outbound queues 221-228 for each port. In total, what is depicted is a 8-way logical distributed switch, where each local bus (board 211-218) represents an access point on the logical switch.
- Since each FIC (201-208) has queuing (221-228) on both the inbound and outbound sides, coupled with the inherent cross-connect in the backplane 220, an extremely efficient logically distributed switch can be built at much lower cost, greater reliability, higher efficiency, lower power and lower latency than a typical, centrally switched topology, or any architecture that places a switch on every board of a meshed topology.
- To better appreciate the efficiency improvements created by the FIC queuing strategy, a discussion on the issues of flow control, congestion control and Quality of Service may be helpful.
- Links are the connections between boards (or node to node, or switch to switch, in general). They are capable of transmitting no more than a certain amount of information at any one time, known as bandwidth. Bandwidth may be measured, for example, in bits/second. In general, there is no restriction on the number of boards (N) except that it be greater than 1.
- A flow is a set of packets that all share a set of characteristics. Typically, the characteristics include the source and destination address of the packet, as well as its protocol type and possibly its priority or classification. It is important that all the packets in a flow maintain a certain sequence in which they were sent, preferably arriving at their destination in that same sequence. If they do arrive out of sequence they can be re-sequenced, or put back in order. However, it is not desireable to re-sequence packets at the end. Therefore, a good design attempts to keep all the packets in a flow in sequence all through the network so that they arrive at the far end in sequence and do not require re-sequencing.
- A common problem for any network is how to provide Quality of Service (QoS) through a network. To provide QoS, the network traffic must be differentiated, i.e., classified according to service levels. Some traffic should be handled in one way, other traffic another.
- For example, an entity may set up a Service Level Agreement (SLA) with a network service provider (the entity that is providing the network), that specifies that the entity's traffic will always have available a certain bandwidth (e.g., 10 Megabits per second, or 10 Mbps) and latency (e.g., less than 1 millisecond, or ms). Then, whenever a packet is detected that comes from or is going to that entity, the packet should receive special handling. If the overall flow to the entity is currently less than 10 Mbps, then the packet should get through without being dropped and with a latency of less than 1 ms. This type of flow is said to be handled by Assured Forwarding (AF). Packets arriving when the current flows are greater than 10 Mbps will be handled differently, perhaps as Best Effort (BE) traffic (see below).
- As another example, a router may be set up to recognize certain types of flows as real-time flows. Real-time flows are characterized by the idea that if the packet doesn't arrive in time it might as well not arrive at all. For example, a packet of voice data in a telephone conversation has to be available at the receiver when it is needed, or it is useless. Too late, and it cannot be used and will just be discarded. So real-time traffic (a stream of packets) belonging to a voice conversation should be handled by a class of behavior known as Expedited Forwarding (EF). A packet handled this way will be forwarded very quickly (with low latency). Hopefully, the variation in latency (known as jitter) will also be low. As a tradeoff, packets in such a stream may be simply dropped if their aggregate bandwidth exceeds a certain threshold. Also, a SLA covering such packets may be expensive to the buyer because providing this kind of service requires that a router have features that make it expensive to build.
- A third example is for traffic that is not covered by any SLA, which is called Best Effort (BE) traffic. This type of traffic is typically found now across the Internet. Best effort packets may be dropped for any reason, and have no particular guarantees about latency or jitter.
- Processing of packets from ingress to fabric input is shown in FIG. 4. Depicted is the flow of packets through a
single FIC 400 located on a board going to 16 destination boards 401-416. Anetwork processor 450 on theFIC 400 classifies all outgoing traffic to a particular flow. The flow defines a set of parameters including the 16 destination boards 401-416, the path by which it will get to the destination board, i.e., one of the 16 fabric ports 421-436, and classification (AF, EF or BE). The individual packets end up in virtual output (send)queues 460 according to the classification. There is a separate set of virtual output (send) queues 460 (AF, EF or BE) for every destination point, i.e., fabric port 421-436, per FIC. The fabric ports 421-436 and Nlocal ports 490 have both send and receive terminals. - Multiple virtual output (send)
queues 460 are maintained per FIC for two reasons. The primary reason is that multiplevirtual output queues 460 provide a steering function. After classification, the destination board 401-416 has been identified. If the packets were put back into the same (common) queue, the destination board 401-416 information would have to be carried along with the packet. This scheme is in fact done in some implementations. However, separating the packets into separate queues is another way to carry the information. - The second reason is to prevent head-of-line blocking. This occurrence should be a rare condition (the boards should not back up very often, but it can happen, and separate queues prevent this condition). Head-of-line blocking would occur if there were only a single outbound queue, and if, for example, a packet slated to go
board 2 were in front of a packet slated to go toboard 3, andboard 2 was heavily loaded butboard 3 was not. The packet slated to go toboard 3 would be unable to be taken off the queue becauseboard 2 would be unavailable, and it would block the packet slated forboard 3, even though the packet slated forboard 3 could go if it could get off the queue. - InfiniBand offers link layer Virtual Lanes (VLs) to support multiple logical channels (i.e. multiplexing) on the same physical link. Infiniband offers up to 16 virtual lanes per link. VLs provide a mechanism to avoid head-of-line blocking and the ability to support Quality of Service (QoS). The difference between a Virtual Lane and a Service Level (SL) is that a Virtual Lane is the actual logical lane (multiplexed) used on a given point-to-point link. The Service Level stays constant as a packet traverses the fabric, and specifies the desired service level within a subnet. The SL (AF, EF or BE) is included in the link header, and each switch maps the SL to a VL supported by the destination link. A switch supporting a limited number of virtual lanes will map the SL field to a VL it supports. Without preserving the SL, the desired SL (AF, EF or BE) would be lost in this mapping, and later in the path, a switch supporting more VLs would be unable to recover finer granularity of SLs between two packets mapped to the same VL.
- In the case of the
FIC 400 shown in FIG. 4, the virtual lanes are established by classifying, sorting, and placing packets, according to service level and destination, into thevirtual output queues 460. The packets are then multiplexed onto a link via a fabric port 421-436 to be transmitted to a destination board 401-416. The classifying, sorting, placing and multiplexing are carried out by thenetwork processor 450. - FIG. 4 also illustrates simple buffers (receive queues) 470 on the egress fabric side. The 16 fabric ports 421-436 have integrated serializer/de-serializers (SERDES). By allocating buffers (queues) to each port on both the receive and transmit sides, traffic can be managed without head of line blocking or congestion. This enables each board to have a private communications channel with every other board in the system.
- Today, boards in a common chassis interconnect to each other over copper traces embedded in the chassis backplane. Existing backplanes use copper traces within a PCB substrate to establish the communications paths between boards. Topologies such as common bus PCI and switched star (i.e. some Ethernet approaches) are popular today. The copper traces and the electrical drivers that communicate over them are limited in terms of their capacity to carry data bandwidth with respect to the Local Area Network (LAN) and Wide Area Network (WAN) optical signals that these boards are intended to terminate. Fabric backplanes are used for data communications between two or more boards located within a common chassis. Mesh topologies have certain performance and availability advantages over bus and star configurations. However, in a copper backplane, meshed topologies are generally more expensive and require far more pins to implement.
- As communications rates grow, the need for faster back plane technologies also grows. The limits of performance on copper traces within a PCB substrate will soon be reached giving rise to the need to investigate optical interconnects for the backplane. Optical fibers may be employed to provide increased bandwidth while using only a few fibers in a small space. Copper traces, on the other hand, often have to contain multiple (parallel) interconnects in order to achieve a limited bandwidth.
- One alternative is to use optical fibers to create an optical backplane fabric. The optical backplane may consist of either a mesh of individual optical fibers or a redundant pair of fibers configured in a counter-rotating ring topology. Either way, fibers represent a much wider bandwidth potential than copper, although a ring requires less fibers than a mesh. When using a mesh, each board will have a direct connection to every other board in the chassis using an optical fiber. Each port on a board may have a transmitter/receiver pair connected to a single fiber.
- The issue with using a ring, however, is how to map the addressing of multiple boards across these fibers. One solution is to employ Wavelength Division Multiplexing (WDM). A WDM optical mesh defines a meshed-topology in the wavelength space as opposed to the physical fiber space. By utilizing multiple discrete lambda-waves as optical carriers such that by meshing dedicated optical wavelengths between every two boards,
layer 2 protocols are eliminated, thereby creating a dramatic improvement in the efficiency of the transport. Today, every packet transport requires a protocol that allows the end point (and intermediate points) to decipher the intended path (or consumer) of the packet. This protocol increases the amount of overhead required in the packet bus, allowing less room for actual data to be sent. By moving the protocol into the wavelength of the actual optical signal, the destination is implied by the wavelength and no additional bandwidth needs to be surrendered on the signal to provide this information. This makes the efficiency of the transport better and also speeds the routing of the packet through the network. In addition, the use of optical interconnects in a backplane environment greatly increases chassis bandwidth as well as reducing electrical radiation that often accompanies copper interconnects. The components involved include an optical backplane in a physical ring topology, and the necessary transmitters and receivers for the size of the installation (i.e., number of slots in the chassis). In addition, optical add/drop multiplexer devices are required. - FIG. 5 shows a high-level diagram of a system utilizing a dual-counter-rotating ring topology as an optical backplane according to an embodiment of the present invention. The concept is not limited to any particular number of boards in a chassis or shelf, but 3 boards is chosen to simplify the illustration. Each board 501-503 has transmitters 511-513 for each wavelength needed. The number of wavelengths needed is equal to the number of boards minus one, in the system. In a counter-rotating ring topology, there are two physical rings, each carrying the same traffic flows but moving in opposite directions. Each board 501-503 transmits in both directions across the dual
counter-rotating ring topology 520 such that that loss of any one board (due to removal or failure) will not disrupt communications with the other boards. - Each board 501-503 reserves one wavelength for each other board in the system. Therefore, in the 3 board diagram shown in FIG. 5, each board provides 2 wavelengths, one for each board in the system, less itself. Correspondingly, each board has a pair of receivers 531-533 to transduce the optical signals sent from the transmitters 511-513 on each of the other boards 501-503. To implement two counter-rotating rings, for each individual board, there is one transmitter and one receiver for each other board in the system, per ring.
- In the 3 board example, there are 6 wavelengths: λ1,2, λ1,3, λ2,1, λ2,3, λ3,1, λ3,2. The
first board 501 transmits λ1,2, to board 2 502 and λ1,3 toboard 3 503; and receives: λ2,1 fromboard 2 502 and λ3,1 fromboard 3 503. - Since each wavelength is transmitted by only one board and consumed by only one board, each wavelength uniquely identifies both a source and destination across the
optical backplane 520. This provides a substantial improvement in efficiency over current day solutions from the standpoint of a reduction in protocol overhead. The mesh topology is preserved through the meshing of wavelengths; each containing dedicated bandwidth between any two end points. - Between multiplexing and demultiplexing points in a WDM system, there is an area in which multiple wavelengths exist (i.e., the dual counter-rotating rings). It is necessary to remove or insert one or more wavelengths at points along the dual-counter-rotating rings. Demultiplexing must be done before the optical signal is detected, because photo detectors are inherently broadband devices that cannot selectively detect a single wavelength. An optical add/drop multiplexer (OADM) performs this function. The OADM can add, pass, and remove wavelengths. There are two general types of OADMs. The first is a fixed device that is physically configured to drop specific predetermined wavelengths, while adding others. The second type is reconfigurable and capable of dynamically selecting which wavelengths are added and dropped.
- Thin-film filters have emerged as the technology of choice for OADMs because of their low cost and stability. For the emerging second generation of OADMs, other technologies, such as tunable gratings and circulators, may come into prominence.
- FIG. 6 illustrates a ring adaptor utilizing an OADM according to an embodiment of the present invention. The blocks show how the ring adapter 541 shown previously in FIG. 5 may consist of a
splitter 541A to route traffic towards thereceiver 531, a filter 541B to eliminate wavelengths generated by the transmitters and a multiplexer 541C to add the new wavelengths received from thetransmitters 511 back into the optical stream. - Referring to FIG. 7, the basic components of the dual counter-rotating ring optical backplane include a set of
fibers 521, arranged in a dual-counter-rotating ring 520, housed in aoverlay module 710 that allows fiber cavity mechanical connectors 720-725 to mount to theoverlay module 710 front surface. Each mechanical connector 720-725 mates up to a mated connector 730-735 that mounts to the individual boards 740-745 in a chassis. Mechanical connectors 720-725 may also contain an OADM 541 ring adaptor previously depicted in FIG. 5 and FIG. 6. - FIG. 8 illustrates an overlay module according to an embodiment of the present invention. The
overlay module 710 includes a series of connectors 720-N mounted to the front of the enclosure. Theoverlay module 710 contains theoptical fibers 521 that form the dualcounter-rotating ring 520. Referring to FIG. 7 and FIG. 8, each connector 720-725 has transmit 521A and receive 521B optical fibers for each direction on thecounter-rotation ring 520. Each connector 720-725 (720-N) has transmit 521 A and receivefiber 521 B in each direction to each of its nearest neighbors.Fibers 521 pass between the twoend connectors 720, 735 (720, N) to complete therings 520. - FIG. 9 illustrates a mechanical drawing of the position of the
optical overlay module 710 in theAdvancedTCA Eurocard chassis 910 according to an embodiment of the present invention. The boards 740-N used in this approach are a "Euro-card" style front insertion type of board. As theboard 740 is inserted to thechassis 910, theoptical connectors 730 on the board's back edge align with and mate into theconnector 720 provided by theoverlay module 710. Theoverlay module 710 sits at the top in the rear panel access area. Once theboard 740 is inserted, the board's optical transceivers (not shown) interconnect to theoptical fiber ring 520 that is contained in theoverlay module 710. All other boards 741-N inserted to thischassis 910 will also connect to theoverlay module 710 and become peer members of the optical ring. Theoptical overlay module 710 will mount in anyEurocard chassis 910 and can function in addition to a traditional copper backplane that would provide power and management to the individual boards. - While the description above refers to particular embodiments of the present invention, it will be understood that many modifications may be made. The accompanying claims are intended to cover such modifications as would fall within the true scope of the present invention. The presently disclosed embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims, rather than the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.
Claims (13)
- A high speed serial fabric backplane comprising:a plurality of circuit boards (201-218) interconnected to one another through a fabric of conductors (220), each circuit board containing a fabric interface chip, FIC (201-208,400);characterised by:each FIC having a direct individual connection via a conductor of the fabric of conductors to all other FICs,each FIC providing an interface between a conductor of the fabric of conductors on a fabric side and a local node on a local side,each FIC including a plurality of local ports (490) to interface with a plurality of local nodes, and a plurality of destination specific fabric ports (421-436) to interface with a plurality of conductors of the fabric of conductors (220), andeach FIC including a plurality of destination specific send queues (460) and a plurality of receive queues (470) to interface with the plurality of local ports (490) and the plurality of destination specific fabric ports (421-436),wherein a destination specific send queue (460) is arranged to interface with both a local port (490) and a destination specific fabric port (421-436), wherein a receive queue (470) is arranged to interface with both a local port (490) and a destination specific fabric port (421-436), and wherein each destination specific fabric port (421-436) is arranged to interface with multiple destination specific send queues (460).
- The high speed serial fabric backplane according to claim 1, wherein the multiple destination specific send queues (460) per destination specific fabric port (421-436) includes best effort queues, expedited forwarding queues, and assured forwarding queues.
- The high speed serial fabric backplane according to claim 2, wherein the FIC (201-208,400) includes a network processor (450) to classify, prioritize, route, and multiplex signals from the plurality of local ports and the plurality of destination specific fabric ports.
- The high speed serial fabric backplane according to claim 3, wherein the network processor (450) functions as a bridge between network protocols on the local side and network protocols on the fabric side.
- The high speed serial fabric backplane according to claim 4, wherein the fabric of conductors (220) includes a plurality of optical fibers.
- The high speed serial fabric backplane according to claim 5, wherein the plurality of optical fibers is arranged in a dual counter rotating ring topology.
- A multi-port Fabric Interconnect Chip (FIC) comprising:a network processor (450);characterised by:a plurality of local ports (490);a plurality of destination specific fabric ports (421-436);a plurality of destination specific send queues (460); anda plurality of receive queues (470);wherein the plurality of local ports (490) are arranged to interface with a plurality of local nodes and the plurality of destination specific fabric ports (421-436) are arranged to interface with a plurality of conductors of a fabric of conductors (220), wherein the plurality of destination specific send queues (460) and the plurality of receive queues (470) are arranged to interface with the plurality of local ports (490) and the plurality of destination specific fabric ports (421-436), wherein a destination specific send queue (460) is arranged to interface with both a local port (490) and a destination specific fabric port (421-436), wherein a receive queue (470) is arranged to interface with both a local port (490) and a destination specific fabric port (421-436), and wherein each destination specific fabric port (421-436) is arranged to interface with multiple destination specific send queues (460).
- The multi-port FIC according to claim 7, wherein the multiple destination specific send queues (460) per destination specific fabric port (421-436) includes best effort queues, expedited forwarding queues, and assured forwarding queues.
- The multi-port FTC according to claim 8, wherein the network processor (450) classifies, prioritizes, routes, and multiplexes signals from the plurality of local ports (490) and the plurality of destination specific fabric ports (421-436).
- The multi-port FIC according to claim 9, wherein the network processor (450) functions as a bridge between network protocols used on the local nodes and network protocols used on the fabric of conductors.
- An optical backplane according to any one of claims 1 to 6, wherein the fabric of conductors is a fabric of optical fibers.
- The optical fabric backplane according to claim 11, wherein a FIC transmits a separate wavelength of light per all other FICs via a optical fiber of the fabric of optical fibers to all other FICs.
- The optical fabric backplane according to claim 12, wherein each FIC has associated with it a plurality of optical add/drop multiplexer (OADM) each corresponding to a FIC of the plurality of FICs.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/231,832 US7218640B2 (en) | 2002-08-30 | 2002-08-30 | Multi-port high-speed serial fabric interconnect chip in a meshed configuration |
US231832 | 2002-08-30 | ||
PCT/US2003/027224 WO2004021200A1 (en) | 2002-08-30 | 2003-08-29 | Multi-port high-speed serial fabric interconnect chip in a meshed configuration |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1535168A1 EP1535168A1 (en) | 2005-06-01 |
EP1535168B1 true EP1535168B1 (en) | 2007-05-09 |
Family
ID=31976833
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP03749266A Expired - Lifetime EP1535168B1 (en) | 2002-08-30 | 2003-08-29 | Multi-port high-speed serial fabric interconnect chip in a meshed configuration |
Country Status (8)
Country | Link |
---|---|
US (1) | US7218640B2 (en) |
EP (1) | EP1535168B1 (en) |
CN (1) | CN100373368C (en) |
AT (1) | ATE362137T1 (en) |
AU (1) | AU2003268310A1 (en) |
DE (1) | DE60313780T2 (en) |
TW (1) | TWI233018B (en) |
WO (1) | WO2004021200A1 (en) |
Families Citing this family (49)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7076569B1 (en) * | 2002-10-18 | 2006-07-11 | Advanced Micro Devices, Inc. | Embedded channel adapter having transport layer configured for prioritizing selection of work descriptors based on respective virtual lane priorities |
US7802049B2 (en) * | 2002-10-30 | 2010-09-21 | Intel Corporation | Links having flexible lane allocation |
US7415032B2 (en) * | 2002-11-13 | 2008-08-19 | Intel Corporation | Aggregatable connectivity |
US7266083B2 (en) * | 2003-02-26 | 2007-09-04 | International Business Machines Corporation | Method and apparatus for implementing queue pair connection protection over infiniband |
US7573898B2 (en) * | 2003-12-29 | 2009-08-11 | Fujitsu Limited | Method and apparatus to double LAN service unit bandwidth |
JP4524120B2 (en) * | 2004-02-03 | 2010-08-11 | 富士通株式会社 | Blade type optical transmission equipment |
US7466716B2 (en) * | 2004-07-13 | 2008-12-16 | International Business Machines Corporation | Reducing latency in a channel adapter by accelerated I/O control block processing |
US7564869B2 (en) * | 2004-10-22 | 2009-07-21 | Cisco Technology, Inc. | Fibre channel over ethernet |
US7830793B2 (en) * | 2004-10-22 | 2010-11-09 | Cisco Technology, Inc. | Network device architecture for consolidating input/output and reducing latency |
US7969971B2 (en) | 2004-10-22 | 2011-06-28 | Cisco Technology, Inc. | Ethernet extension for the data center |
US7602720B2 (en) * | 2004-10-22 | 2009-10-13 | Cisco Technology, Inc. | Active queue management methods and devices |
US7801125B2 (en) | 2004-10-22 | 2010-09-21 | Cisco Technology, Inc. | Forwarding table reduction and multipath network forwarding |
US8238347B2 (en) * | 2004-10-22 | 2012-08-07 | Cisco Technology, Inc. | Fibre channel over ethernet |
US7496698B2 (en) * | 2005-03-22 | 2009-02-24 | International Business Machines Corporation | System and method for efficient implementation of a shared receive queue |
US7843830B1 (en) * | 2005-05-05 | 2010-11-30 | Force 10 Networks, Inc | Resilient retransmission of epoch data |
US7961621B2 (en) | 2005-10-11 | 2011-06-14 | Cisco Technology, Inc. | Methods and devices for backward congestion notification |
US7631133B2 (en) * | 2006-03-31 | 2009-12-08 | Intel Corporation | Backplane interconnection system and method |
WO2007139426A1 (en) * | 2006-05-31 | 2007-12-06 | Intel Corporation | Multiple phase buffer enlargement for rdma data transfer |
WO2008012790A1 (en) * | 2006-07-27 | 2008-01-31 | Contextream Inc. | Distributed edge network |
US9014563B2 (en) | 2006-12-11 | 2015-04-21 | Cisco Technology, Inc. | System and method for providing an Ethernet interface |
US8259720B2 (en) | 2007-02-02 | 2012-09-04 | Cisco Technology, Inc. | Triple-tier anycast addressing |
US20080222351A1 (en) * | 2007-03-07 | 2008-09-11 | Aprius Inc. | High-speed optical connection between central processing unit and remotely located random access memory |
US20080307149A1 (en) * | 2007-06-08 | 2008-12-11 | Tomonori Hirai | Clustering System and Flexible Interconnection Architecture Thereof |
US8149710B2 (en) | 2007-07-05 | 2012-04-03 | Cisco Technology, Inc. | Flexible and hierarchical dynamic buffer allocation |
US8121038B2 (en) * | 2007-08-21 | 2012-02-21 | Cisco Technology, Inc. | Backward congestion notification |
US8929372B2 (en) * | 2007-10-30 | 2015-01-06 | Contextream Ltd. | Grid router |
US7719969B1 (en) | 2007-12-28 | 2010-05-18 | Qlogic, Corporation | System and method for assigning network device port address based on link rate |
US8155520B1 (en) | 2008-04-16 | 2012-04-10 | Cyan, Inc. | Multi-fabric shelf for a transport network |
US8064200B1 (en) | 2008-04-16 | 2011-11-22 | Cyan Optics, Inc. | Cooling a chassis by moving air through a midplane between two sets of channels oriented laterally relative to one another |
US8390993B1 (en) | 2008-04-16 | 2013-03-05 | Cyan, Inc. | Light source in chassis to provide frontal illumination of a faceplate on the chassis |
US8467295B2 (en) * | 2008-08-21 | 2013-06-18 | Contextream Ltd. | System and methods for distributed quality of service enforcement |
US7944812B2 (en) | 2008-10-20 | 2011-05-17 | International Business Machines Corporation | Redundant intermediary switch solution for detecting and managing fibre channel over ethernet FCoE switch failures |
WO2010128958A1 (en) * | 2009-05-06 | 2010-11-11 | Hewlett-Packard Development Company, L.P. | Bus-based scalable optical fabrics |
US8379516B2 (en) * | 2009-12-24 | 2013-02-19 | Contextream Ltd. | Grid routing apparatus and method |
CN101751368B (en) * | 2009-12-30 | 2013-08-07 | 中兴通讯股份有限公司 | Single-board communication method, device and system |
CN102860033B (en) | 2010-02-12 | 2016-10-26 | Adc电信公司 | Communication bladed panel system |
WO2012058270A2 (en) * | 2010-10-28 | 2012-05-03 | Compass Electro Optical Systems Ltd. | Router and switch architecture |
US8934483B2 (en) * | 2011-01-20 | 2015-01-13 | Broadcom Corporation | Data center switch |
US8687971B2 (en) * | 2011-01-31 | 2014-04-01 | Telefonaktiebolaget Lm Ericsson (Publ) | System, laser-on-CMOS chip, and method for setting a wavelength to be used by the laser-on-CMOS chip |
US20120327952A1 (en) * | 2011-06-23 | 2012-12-27 | Exar Corporation | Ethernet tag approach to support networking task offload |
US9104639B2 (en) | 2012-05-01 | 2015-08-11 | SEAKR Engineering, Inc. | Distributed mesh-based memory and computing architecture |
EP2720423B1 (en) * | 2012-10-12 | 2015-12-16 | Alcatel Lucent | Multistage switching system with a physical ring structure |
FR3007863B1 (en) * | 2013-06-28 | 2016-08-05 | Online | BLADE INTERCONNECTION ARCHITECTURE FOR A MODULAR CHASSIS TYPE NETWORK EQUIPMENT, PARTICULARLY OF ATCA TYPE |
US9648148B2 (en) * | 2013-12-24 | 2017-05-09 | Intel Corporation | Method, apparatus, and system for QoS within high performance fabrics |
US9674090B2 (en) * | 2015-06-26 | 2017-06-06 | Microsoft Technology Licensing, Llc | In-line network accelerator |
US10114792B2 (en) * | 2015-09-14 | 2018-10-30 | Cisco Technology, Inc | Low latency remote direct memory access for microservers |
PL3353667T3 (en) * | 2015-09-22 | 2021-05-31 | Ovh | Modular backplane |
CN107872340B (en) * | 2016-09-27 | 2021-09-07 | 华为技术有限公司 | Single board registration method, single board and forwarding equipment |
CN115269487B (en) * | 2022-09-26 | 2023-01-10 | 中诚华隆计算机技术有限公司 | Inter-chip data high-speed transmission method and device |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4866704A (en) * | 1988-03-16 | 1989-09-12 | California Institute Of Technology | Fiber optic voice/data network |
US6016211A (en) * | 1995-06-19 | 2000-01-18 | Szymanski; Ted | Optoelectronic smart pixel array for a reconfigurable intelligent optical interconnect |
US6108338A (en) * | 1995-12-28 | 2000-08-22 | Dynarc Inc. | Method and device for dynamic synchronous transfer mode in a dual ring topology |
US6031843A (en) | 1996-11-21 | 2000-02-29 | Alcatel Data Networks Inc. | Digital communications switching fabric |
US6766381B1 (en) * | 1999-08-27 | 2004-07-20 | International Business Machines Corporation | VLSI network processor and methods |
US7075927B2 (en) * | 2000-05-05 | 2006-07-11 | Fujitsu Limited | Method and system for quality of service (QoS) support in a packet-switched network |
US6928470B1 (en) * | 2000-07-31 | 2005-08-09 | Western Digital Ventures, Inc. | Transferring scheduling data from a plurality of disk storage devices to a network switch before transferring data associated with scheduled requests between the network switch and a plurality of host initiators |
US6754757B1 (en) | 2000-12-22 | 2004-06-22 | Turin Networks | Full mesh interconnect backplane architecture |
US6725296B2 (en) * | 2001-07-26 | 2004-04-20 | International Business Machines Corporation | Apparatus and method for managing work and completion queues using head and tail pointers |
-
2002
- 2002-08-30 US US10/231,832 patent/US7218640B2/en not_active Expired - Fee Related
-
2003
- 2003-08-28 TW TW092123699A patent/TWI233018B/en not_active IP Right Cessation
- 2003-08-29 EP EP03749266A patent/EP1535168B1/en not_active Expired - Lifetime
- 2003-08-29 AT AT03749266T patent/ATE362137T1/en not_active IP Right Cessation
- 2003-08-29 DE DE60313780T patent/DE60313780T2/en not_active Expired - Lifetime
- 2003-08-29 CN CNB038248271A patent/CN100373368C/en not_active Expired - Fee Related
- 2003-08-29 WO PCT/US2003/027224 patent/WO2004021200A1/en active IP Right Grant
- 2003-08-29 AU AU2003268310A patent/AU2003268310A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
ATE362137T1 (en) | 2007-06-15 |
CN1695130A (en) | 2005-11-09 |
US20040042448A1 (en) | 2004-03-04 |
WO2004021200A1 (en) | 2004-03-11 |
DE60313780D1 (en) | 2007-06-21 |
DE60313780T2 (en) | 2008-01-24 |
CN100373368C (en) | 2008-03-05 |
TWI233018B (en) | 2005-05-21 |
TW200413927A (en) | 2004-08-01 |
EP1535168A1 (en) | 2005-06-01 |
US7218640B2 (en) | 2007-05-15 |
AU2003268310A1 (en) | 2004-03-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1535168B1 (en) | Multi-port high-speed serial fabric interconnect chip in a meshed configuration | |
US7324537B2 (en) | Switching device with asymmetric port speeds | |
US7042891B2 (en) | Dynamic selection of lowest latency path in a network switch | |
US11368768B2 (en) | Optical network system | |
US7400590B1 (en) | Service level to virtual lane mapping | |
US7296093B1 (en) | Network processor interface system | |
US7394814B2 (en) | Method and apparatus for rendering a cell-based switch useful for frame based application protocols | |
US9756407B2 (en) | Network employing multi-endpoint optical transceivers | |
US20050089054A1 (en) | Methods and apparatus for provisioning connection oriented, quality of service capabilities and services | |
US20020118692A1 (en) | Ensuring proper packet ordering in a cut-through and early-forwarding network switch | |
US20030200330A1 (en) | System and method for load-sharing computer network switch | |
US7436845B1 (en) | Input and output buffering | |
KR20120004993A (en) | Method and system for providing a logical network layer for input / output data transfer | |
US7210056B2 (en) | Low latency comma detection and clock alignment | |
US20070110088A1 (en) | Methods and systems for scalable interconnect | |
CN113841364A (en) | System, method and architecture for data center network switching | |
US7539184B2 (en) | Reconfigurable interconnect/switch for selectably coupling network devices, media, and switch fabric | |
US7639616B1 (en) | Adaptive cut-through algorithm | |
US7257758B1 (en) | Stumping mechanism | |
US7733855B1 (en) | Community separation enforcement | |
EP1322079A2 (en) | System and method for providing gaps between data elements at ingress to a network element | |
US7483442B1 (en) | VCRC checking and generation | |
US7990987B2 (en) | Network processor having bypass capability | |
KR100489945B1 (en) | Apparatus and method for Synchronizing a Plurality of Processors in a Processor Array | |
US7286532B1 (en) | High performance interface logic architecture of an intermediate network node |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20050324 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK |
|
DAX | Request for extension of the european patent (deleted) | ||
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 Ref country code: CH Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |
|
REF | Corresponds to: |
Ref document number: 60313780 Country of ref document: DE Date of ref document: 20070621 Kind code of ref document: P |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070809 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: ES Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070820 |
|
ET | Fr: translation filed | ||
NLV1 | Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act | ||
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071009 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070809 Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20080212 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20070831 Ref country code: IT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070810 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20070829 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20070829 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070509 Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071110 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20090814 Year of fee payment: 7 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20090826 Year of fee payment: 7 |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20100829 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: ST Effective date: 20110502 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100831 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100829 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20180814 Year of fee payment: 16 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R119 Ref document number: 60313780 Country of ref document: DE |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200303 |