US20120008944A1 - Optical switching network - Google Patents

Publication number
US20120008944A1
Authority
US
United States
Prior art keywords
network
routing
hop
traffic
tor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/078,979
Inventor
Ankit Singla
Atul Singh
Kishore Ramachandran
Lei Xu
Yueping Zhang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Laboratories America Inc
Original Assignee
NEC Laboratories America Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Laboratories America Inc
Priority to US13/078,979
Publication of US20120008944A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04J MULTIPLEX COMMUNICATION
    • H04J14/00 Optical multiplex systems
    • H04J14/02 Wavelength-division multiplex systems
    • H04J14/0201 Add-and-drop multiplexing
    • H04J14/0202 Arrangements therefor
    • H04J14/021 Reconfigurable arrangements, e.g. reconfigurable optical add/drop multiplexers [ROADM] or tunable optical add/drop multiplexers [TOADM]
    • H04J14/0212 Reconfigurable arrangements, e.g. reconfigurable optical add/drop multiplexers [ROADM] or tunable optical add/drop multiplexers [TOADM] using optical switches or wavelength selective switches [WSS]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04J MULTIPLEX COMMUNICATION
    • H04J14/00 Optical multiplex systems
    • H04J14/02 Wavelength-division multiplex systems
    • H04J14/0201 Add-and-drop multiplexing
    • H04J14/0202 Arrangements therefor
    • H04J14/0204 Broadcast and select arrangements, e.g. with an optical splitter at the input before adding or dropping
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04J MULTIPLEX COMMUNICATION
    • H04J14/00 Optical multiplex systems
    • H04J14/02 Wavelength-division multiplex systems
    • H04J14/0201 Add-and-drop multiplexing
    • H04J14/0202 Arrangements therefor
    • H04J14/0205 Select and combine arrangements, e.g. with an optical combiner at the output after adding or dropping
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04J MULTIPLEX COMMUNICATION
    • H04J14/00 Optical multiplex systems
    • H04J14/02 Wavelength-division multiplex systems
    • H04J14/0227 Operation, administration, maintenance or provisioning [OAMP] of WDM networks, e.g. media access, routing or wavelength allocation
    • H04J14/0254 Optical medium access
    • H04J14/0256 Optical medium access at the optical channel layer
    • H04J14/0257 Wavelength assignment algorithms
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04J MULTIPLEX COMMUNICATION
    • H04J14/00 Optical multiplex systems
    • H04J14/02 Wavelength-division multiplex systems
    • H04J14/0227 Operation, administration, maintenance or provisioning [OAMP] of WDM networks, e.g. media access, routing or wavelength allocation
    • H04J14/0254 Optical medium access
    • H04J14/0267 Optical signaling or routing
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04J MULTIPLEX COMMUNICATION
    • H04J14/00 Optical multiplex systems
    • H04J14/02 Wavelength-division multiplex systems
    • H04J14/0201 Add-and-drop multiplexing
    • H04J14/0215 Architecture aspects
    • H04J14/0217 Multi-degree architectures, e.g. having a connection degree greater than two

Definitions

  • the present invention relates to an optical switching network.
  • DCN data center network
  • systems and methods are disclosed for a method to communicate over an optical network by using hop-by-hop routing over an optical network; and dynamically constructing a network topology.
  • a method to communicate over an optical network includes dynamically constructing a network topology based on traffic demands and hop-by-hop routing; and constructing a dynamically changing data center network (DCN) architecture.
  • a method for interconnecting a data center network includes using hop-by-hop routing over an optical network.
  • a method for interconnecting a data center network includes using hop-by-hop routing over an optical network; and using bidirectional optical network devices to enable bidirectional communication over fiber.
  • a method for interconnecting a data center network includes using hop-by-hop routing over an optical network; using bidirectional optical network devices to enable bidirectional communication over fiber; and dynamically constructing a network topology.
  • a method for interconnecting a data center with an optical network includes using bidirectional optical network devices to enable bidirectional communication over fiber.
  • the system is the first-ever all-optical switching architecture for data center networks (DCNs).
  • DCNs data center networks
  • the system addresses these drawbacks of static network topologies by providing a dynamic DCN architecture that can adapt to application traffic demands in an efficient manner while also supporting high bandwidth server-to-server connectivity.
  • the key feature is that it allows any subset of servers to be connected at full bandwidth in an on-demand manner without requiring static, all-to-all full-bandwidth connectivity.
  • the preferred embodiment can adapt the network topology based on application traffic demands, while also supporting high bandwidth connectivity between any subset of servers.
  • the system uses three basic building blocks: (1) an innovative placement of optical devices, (2) algorithms for adaptive network reconfiguration (Procedure 2(a), 2(b), 3, and 5) based on traffic demand dynamics, and (3) hop-by-hop routing (Procedure 6).
  • optical devices allow this preferred embodiment to use re-configurable optical paths. This enables the system to be flexible in terms of path and capacity assignment between the servers. Exactly how these paths are re-configured to interconnect servers, as well as the capacity of each path, is controlled by our adaptive network re-configuration algorithms. By extensively using optical fibers that have the ability to support higher bandwidths simply by adding wavelengths, higher throughputs can be supported without re-wiring. As Proteus does not impose the requirement of underlying all-to-all electrical connectivity between the servers, and due to the physical limitation on the number of possible optical paths between servers, the inclusion of hop-by-hop routing is necessary in our design. The intuition here is that if a direct optical path does not exist, a hop-by-hop path can be used instead. For this purpose, we include a multi-hop routing protocol that uses source-routing.
  • On-demand flexibility Proteus does not make any assumption on traffic patterns and is able to adaptively reconstruct network communication paths based on traffic demand. This makes the preferred embodiment highly appealing to future data centers where both the network and application may evolve over time.
  • High server-to-server throughput Proteus significantly improves the communication bandwidth between any pair of servers. Once the optical circuit path is set up, a bit rate transparent communication pipe becomes available. With current technologies, per channel bit rate in optical fiber communications can be as high as 40 Gb/s or 100 Gb/s, and the total capacity per fiber with DWDM technologies can reach 69 Tb/s.
  • Network paths are dynamically constructed based on traffic demand in such a way that overall network-wide traffic can be maximally served. This global optimization overcomes network resource fragmentation incurred by today's tree-based DCN architectures and other existing approaches where local optimization is adopted.
  • FIG. 1 shows an exemplary system with optical interconnects in a data center network.
  • FIG. 2 shows the optical component of FIG. 1 in more detail.
  • FIG. 3 shows an exemplary control manager for the system of FIG. 1 .
  • FIG. 4 shows an exemplary Greedy-Tree method to dynamically reconstruct routing paths according to changing network traffic demand.
  • FIG. 5 shows an exemplary Darwinian method to dynamically reconstruct routing paths according to changing network traffic demand.
  • FIG. 6 shows an exemplary fault-tolerant routing method.
  • FIG. 7 shows an exemplary wavelength assignment method.
  • FIG. 1 shows an exemplary system with optical interconnects in a data center network.
  • An optical switch matrix (OSM) 102 allows a plurality of optical ports to communicate with each other through optical components 110 .
  • Each optical component 110 in turn communicates with a top of rack (ToR) switch.
  • ToR top of rack
  • Each ToR switch in turn is connected to a plurality of servers and to other ToRs.
  • the system of FIG. 1 uses hop-by-hop routing, in which traffic that cannot be provisioned with a direct end-to-end circuit is routed to the destination by traversing multiple hops (i.e., ToR switches). Each ToR switch not only receives traffic destined for servers located in its own rack, but also forwards transit traffic targeted at servers residing in other racks. This mechanism allows the system of FIG. 1 to achieve connectivity between any pair of origin and destination servers. This approach is in contrast to conventional optical communication systems, in which only single-hop routing is employed.
  • each ToR switch is a conventional switch with 64 10-GigE ports. Of these 64 ports at each ToR, 32 are connected to servers via existing intra-ToR interconnects. Each of the remaining 32 ports is used to connect to the optical interconnect between ToRs. Each inter-ToR port is attached to transceivers associated with a fixed wavelength for sending and receiving data. Excluding the ToR switches, all the remaining interconnect elements are optical. These optical elements allow for reconfiguration, making the network highly adaptive to changes in the underlying traffic requirements.
  • optical network elements support on-demand provisioning of connectivity and capacity where required in the network, thus permitting the construction of thin, but malleable interconnects for large server pools.
  • Optical links can support higher bit-rates over longer distances using less power than copper cables.
  • optical switches run cooler than electrical ones, implying lower heat dissipation and cheaper cooling cost.
  • FIG. 2 shows the optical component 110 in more detail.
  • each circuit over the MEMS is bidirectional.
  • optical circulators 126 and 136 are placed between the ToR and MEMS ports.
  • a circulator 126 connects the send channel of the transceiver from a ToR 120 to the MEMS port 102 (after the channel has passed through the WSS 124 ). It simultaneously delivers the traffic incoming towards a ToR from the MEMS, to this ToR.
  • the inter-ToR ports attach themselves to two transceivers so that they can send and receive data simultaneously. As shown in the left half of FIG.
  • the optical fiber from the “send” transceivers from each of the 32 ports at a ToR 120 is connected to an optical multiplexer 122 .
  • Each port is associated with a wavelength, unique across ports at the ToR 120 , in order to exploit wavelength division multiplexing (WDM). This allows data from different ports to be multiplexed into one fiber without contention.
  • WDM wavelength division multiplexing
  • This fiber is then connected to a 1×4 Wavelength Selective Switch (WSS) 124 .
  • WSS 124 is typically an optical component, consisting of one common port and wavelength ports. It partitions the set of wavelengths coming in through the common port among the wavelength ports and the mapping is runtime-configurable (in a few milliseconds).
  • the WSS 124 can split the set of 32 wavelengths it sees into four groups, each group being transmitted out on its own fiber.
  • This fiber is connected to the MEMS optical switch 102 through a circulator 126 to enable bidirectional traffic through it.
  • the circulators enable bidirectional optical transmission over a fiber, allowing more efficient use of the ports of optical switches.
  • An optical circulator is a three-port device: one port is a shared fiber or switching port, and the other two ports serve as send and receive ports.
  • Optical transceivers can be of two types: coarse WDM (CWDM) and dense WDM (DWDM).
  • CWDM coarse WDM
  • DWDM dense WDM
  • One embodiment uses DWDM-based transceivers, which support higher bit-rates and more wavelength channels in a single piece of fiber compared to CWDM.
  • the receiving infrastructure (shown in the right half of FIG. 2 ) has a coupler 136 connected to a demultiplexer 132 which separates multiple incoming wavelengths, each then delivered to a different port.
  • four receive fibers from each of four circulators are connected to a power coupler 134 which combines their wavelengths onto one optical fiber. This fiber feeds into a demultiplexer 132 which splits each incoming wavelength to its associated port for a TOR 130 .
  • the interconnect of FIG. 1 uses a 320-port micro-electro-mechanical systems (MEMS) switch to connect 80 ToRs with a total of 2560 servers.
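  • The sizing figures quoted above follow from the per-ToR port counts given in the text. The short sketch below reproduces that arithmetic; the function name `deployment_size` and the dictionary keys are illustrative choices, not part of the disclosure.

```python
# Sizing check for the example deployment, using figures stated in the text.
PORTS_PER_TOR = 64      # 10-GigE ports on each ToR
SERVER_PORTS = 32       # ports facing servers
OPTICAL_PORTS = PORTS_PER_TOR - SERVER_PORTS  # ports facing the optical interconnect
WSS_FANOUT = 4          # a 1x4 WSS gives each ToR four MEMS-facing fibers
PORT_RATE_GBPS = 10


def deployment_size(tors=80):
    """Return server count, MEMS port count, and optical capacity per ToR."""
    return {
        "servers": tors * SERVER_PORTS,
        "mems_ports": tors * WSS_FANOUT,
        "optical_gbps_per_tor": OPTICAL_PORTS * PORT_RATE_GBPS,
    }


print(deployment_size())
```

With 80 ToRs this gives 2560 servers and 320 MEMS ports, matching the numbers in the text.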
  • MEMS micro-electrical mechanical systems
  • a number of channels or wavelengths can be transmitted over a single piece of fiber in the conventional or C-band.
  • each wavelength is rate-limited by the electrical port it is connected to.
  • the OSM modules in optical communications can be bipartite switching matrices where any input port can be connected to any one of the output ports.
  • A micro-electro-mechanical systems (MEMS) switch can be used as an OSM; it achieves reconfigurable one-to-one circuits between its input and output ports by mechanically adjusting micro-mirrors.
  • the system of FIG. 2 offers highly flexible bandwidth. Every ToR has degree k. If each edge had fixed bandwidth, multiple edges would need to be utilized for this ToR to communicate with another ToR at a rate higher than a single edge supports. To overcome this problem, the system combines the capability of optical fibers to carry multiple wavelengths at the same time (WDM) with the dynamic reconfigurability of the WSS. Consequently, a ToR is connected to MEMS through a multiplexer and a WSS unit.
  • ToR A wants to communicate with ToR B using w times the line speed of a single port.
  • the ToR will use w ports, each associated with a (unique) wavelength, to serve this request.
  • WDM enables these w wavelengths, together with the rest from this ToR, to be multiplexed into one optical fiber that feeds the WSS.
  • the WSS splits these w wavelengths to the appropriate MEMS port which has a circuit to ToR B (doing likewise for k−1 other sets of wavelengths).
  • a w × (line-speed) capacity circuit is set up from A to B, at runtime.
  • each ToR can communicate simultaneously with any four other ToRs.
  • the MEMS switch 102 can construct all possible 4-regular ToR interconnection graphs.
  • each of these four links' capacity can be varied in {0, 10, 20, . . . , 320} Gbps, provided the sum does not exceed 320 Gbps.
  • both the path between servers as well as the capacity of these paths can be varied in this architecture.
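  • As a rough illustration of the capacity flexibility described above, the sketch below partitions a ToR's 32 wavelengths among its four WSS output ports according to requested link capacities. The function name `partition_wavelengths` and the choice of contiguous wavelength indices are assumptions made for illustration; the text does not prescribe how wavelengths are grouped.

```python
import math


def partition_wavelengths(demands_gbps, num_wavelengths=32, rate_gbps=10, fanout=4):
    """Split wavelength indices into one group per MEMS-facing link.

    demands_gbps lists the requested capacity of each link. Each wavelength
    carries rate_gbps, so a link needs ceil(demand / rate) wavelengths.
    Contiguous index ranges are an arbitrary illustrative choice.
    """
    if len(demands_gbps) > fanout:
        raise ValueError("more links requested than WSS output ports")
    counts = [math.ceil(d / rate_gbps) for d in demands_gbps]
    if sum(counts) > num_wavelengths:
        raise ValueError("total demand exceeds the ToR's optical capacity")
    groups, start = [], 0
    for count in counts:
        groups.append(list(range(start, start + count)))
        start += count
    return groups


groups = partition_wavelengths([200, 60, 40, 20])
print([len(g) for g in groups])
```

A request that sums to more than 320 Gbps is rejected, which mirrors the constraint that the four link capacities together cannot exceed the ToR's optical capacity.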
  • each ToR port (facing the optical interconnect) is assigned a wavelength unique across ports at the ToR.
  • the same wavelength is used to receive traffic as well: each port thus sends and receives traffic at one fixed wavelength.
  • the same set of wavelengths is recycled across ToRs. This allows all wavelengths at one ToR to be multiplexed and delivered after demultiplexing to individual ports at the destination ToR. This wavelength-port association is a static, design/build time decision.
  • Each ToR is a conventional electrical switch with 64 10-GigE non-blocking ports. 32 of these ports are connected to servers, while the remaining face the optical interconnect.
  • Each port facing the optical interconnect has a transceiver associated with a fixed and unique wavelength for sending and receiving data.
  • the transceiver uses separate fibers to connect to the send and receive infrastructures.
  • the send fiber from the transceivers from each of the 32 ports at a ToR is connected to an optical multiplexer.
  • the multiplexer feeds a 1×4 WSS.
  • the WSS splits the set of 32 wavelengths it sees into 4 groups, each group being transmitted on its own fiber. These fibers are connected to the MEMS switch through circulators to enable bidirectional traffic through them.
  • the 4 receive fibers from each of 4 circulators corresponding to a ToR are connected to a power coupler (similar to a multiplexer, but simpler), which combines their wavelengths onto one fiber. This fiber feeds a demultiplexer, which splits each incoming wavelength to its associated port on the ToR.
  • each ToR can communicate simultaneously with any 4 other ToRs.
  • MEMS reconfigurations allow us to construct all possible 4-regular ToR graphs.
  • each of these 4 links' capacity can be varied in {0, 10, 20, . . . , 320} Gbps.
  • these configurations are decided by a centralized manager. The manager obtains the traffic matrix from the ToR switches, calculates appropriate configurations, and pushes them to the MEMS, WSS, and ToRs. This requires direct, out-of-band connections between the manager and these units.
  • ToR Top-of-Rack
  • the system of FIGS. 1-2 achieves topology flexibility by exploiting the reconfigurability of the MEMS.
  • the system uses hop-by-hop stitching of such circuits to achieve network connectivity.
  • To reach ToRs not directly connected to it through the MEMS, a ToR uses one of its existing connections to a first-hop ToR.
  • This first-hop ToR receives the transmission over fiber, converts it to electrical signals, reads the packet header, and routes it towards the destination.
  • O-E-O optics
  • Such conversion can be done at the sub-nanosecond level.
  • the aggregate transit, incoming and outgoing traffic cannot exceed the port's capacity in each direction. So, high-volume connections must use a minimal number of hops.
  • the system manages the topology to adhere to this requirement.
  • the flexible DCN architecture of FIG. 1 also needs a topology manager that (a) configures the MEMS to adjust the topology to localize high traffic volumes, (b) configures the WSS at each ToR to adjust the capacity of its four outgoing links to provision bandwidth where it is most gainful, and (c) picks routes between ToR pairs to achieve high throughput, low latency and minimal network congestion.
  • a traffic demand D between ToRs, where D_ij is the desired bandwidth from ToR i to ToR j.
  • S_ij have end-to-end meaning, while v_ijk have hop-to-hop significance.
  • k ∈ 1, 2, . . .
  • a wavelength λ_k can only be used between two ToRs if they are connected through MEMS:
  • ToR i can receive/send λ_k from/to at most one ToR (this is illustrated in FIG. 3 ):
  • ToR i is connected to exactly W other ToRs:
  • Hop-by-hop traffic is limited by port capacities (C_port), wavelength capacity (C_λ), and provisioning:
  • the outgoing transit traffic (total traffic flowing out, minus total traffic for which ToR i is the origin) equals incoming transit traffic at ToR i :
  • MILP mixed-integer linear program
  • FIG. 3 shows an exemplary control manager 200 that controls the system 100 of FIG. 1 .
  • the control system includes a module 202 that estimates traffic demand.
  • the module 202 provides input to a module 204 that assigns pairs with heavy communications to direct links.
  • a module 206 performs the connectivity accordingly.
  • the manager 200 controls the MEMS optical switch 102 to adjust the network topology.
  • a module 210 identifies routing paths and sends all the ToRs these paths in order to set up their routing tables.
  • a module 214 determines the capacity demand on each link and a module 216 then determines the wavelength assignment scheme.
  • the software estimates the traffic demand according to max-min fair bandwidth allocation for TCP flows in an ideal non-blocking network. All the flows are only limited by the sender or receiver network interface cards (NICs).
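  • The demand-estimation idea above can be sketched with a progressive-filling computation of max-min fair rates, where the only bottlenecks are the sender and receiver NICs. This is a minimal sketch under that assumption; the function name `max_min_fair` and the uniform 10 Gb/s NIC rate are illustrative, and the manager's actual estimator is not specified in the text.

```python
def max_min_fair(flows, nic_gbps=10.0):
    """Max-min fair rates for flows limited only by sender/receiver NICs.

    flows is a list of (sender, receiver) pairs. All unfrozen flows grow at
    the same rate until some NIC direction saturates; flows through a
    saturated NIC are then frozen (progressive filling).
    """
    rate = [0.0] * len(flows)
    cap = {}
    for s, d in flows:
        cap[(s, "tx")] = nic_gbps
        cap[(d, "rx")] = nic_gbps
    active = set(range(len(flows)))
    while active:
        # Number of still-growing flows using each NIC direction.
        load = {}
        for f in active:
            s, d = flows[f]
            for res in ((s, "tx"), (d, "rx")):
                load[res] = load.get(res, 0) + 1
        # Largest equal increment before some NIC direction fills up.
        inc = min(cap[res] / n for res, n in load.items())
        for f in active:
            rate[f] += inc
        for res, n in load.items():
            cap[res] -= inc * n
        full = {res for res in load if cap[res] <= 1e-9}
        active = {f for f in active
                  if (flows[f][0], "tx") not in full
                  and (flows[f][1], "rx") not in full}
    return rate


print(max_min_fair([("A", "B"), ("A", "C"), ("A", "D"), ("E", "D")]))
```

In this example the three flows leaving A share its NIC equally, and the flow from E to D takes the receive capacity that remains at D.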
  • NICs network interface cards
  • the manager assigns direct links for heavy communicating pairs.
  • High-volume communicating pairs i.e., ToR switches
  • Weighted b-matching is a graph theoretic problem for which an elegant polynomial-time algorithm is known. In one embodiment, the weighted b-matching algorithm is approximated using multiple 1-matchings.
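  • To make the degree-constrained link selection concrete, the sketch below picks heavy ToR pairs subject to a per-ToR degree bound b. Note that it swaps in a simple greedy rule rather than an exact weighted b-matching or the multiple 1-matchings approximation mentioned above, so its result may be suboptimal; the function name `greedy_b_matching` is hypothetical.

```python
def greedy_b_matching(demand, b=4):
    """Greedily select heavy undirected ToR pairs with at most b links per ToR.

    demand maps (i, j) to traffic volume; both directions of a pair are
    summed. This is a greedy stand-in, not an optimal b-matching.
    """
    weight = {}
    for (i, j), w in demand.items():
        if i != j:
            key = tuple(sorted((i, j)))
            weight[key] = weight.get(key, 0) + w
    degree, chosen = {}, []
    # Heaviest pairs first; ties broken by node names for determinism.
    for (u, v), _ in sorted(weight.items(), key=lambda kv: (-kv[1], kv[0])):
        if degree.get(u, 0) < b and degree.get(v, 0) < b:
            chosen.append((u, v))
            degree[u] = degree.get(u, 0) + 1
            degree[v] = degree.get(v, 0) + 1
    return chosen


demand = {("A", "B"): 5, ("B", "A"): 3, ("B", "C"): 9, ("C", "D"): 4, ("A", "D"): 1}
print(greedy_b_matching(demand, b=1))
print(greedy_b_matching(demand, b=2))
```

The selected pairs are the candidates for direct MEMS circuits; connectivity of the resulting graph is not guaranteed by this step alone.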
  • Connectivity is achieved through the edge-exchange operation as follows. First, the method locates all connected components. If the graph is not connected, the method selects two edges a-b and c-d with the lowest weights in different connected components, and replaces links a-b and c-d with links a-c and b-d to connect them. A check is done to make sure that the links removed are not themselves cuts in the graph. The output of steps 2 and 3 is used to tell the MEMS optical switch 102 how to configure the network topology.
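  • The edge-exchange step can be sketched as follows. Edges are stored as sorted tuples, and the helper names (`components`, `lightest_removable_edge`, `connect_by_edge_exchange`) are illustrative. Treating newly created links as having zero weight is an assumption of this sketch, not something the text specifies.

```python
def components(nodes, edges):
    """Connected components of an undirected graph, via depth-first search."""
    adj = {n: set() for n in nodes}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, comps = set(), []
    for n in nodes:
        if n in seen:
            continue
        comp, stack = set(), [n]
        while stack:
            x = stack.pop()
            if x not in comp:
                comp.add(x)
                stack.extend(adj[x] - comp)
        seen |= comp
        comps.append(comp)
    return comps


def lightest_removable_edge(nodes, edges, weights, comp):
    """Lowest-weight edge in comp whose removal does not cut the component."""
    base = len(components(nodes, edges))
    inside = sorted((e for e in edges if e[0] in comp),
                    key=lambda e: weights.get(e, 0))
    for e in inside:
        if len(components(nodes, edges - {e})) == base:
            return e
    raise ValueError("component has no removable edge")


def connect_by_edge_exchange(nodes, weights):
    """Swap a-b and c-d for a-c and b-d until the graph is connected.

    Each swap preserves every node's degree and merges two components.
    """
    edges = set(weights)
    while True:
        comps = components(nodes, edges)
        if len(comps) == 1:
            return edges
        a, b = lightest_removable_edge(nodes, edges, weights, comps[0])
        c, d = lightest_removable_edge(nodes, edges, weights, comps[1])
        edges -= {(a, b), (c, d)}
        edges |= {tuple(sorted((a, c))), tuple(sorted((b, d)))}


w = {(1, 2): 5, (2, 3): 1, (1, 3): 4, (4, 5): 2, (5, 6): 7, (4, 6): 3}
print(sorted(connect_by_edge_exchange([1, 2, 3, 4, 5, 6], w)))
```

In the example, two disjoint triangles are joined into a single six-node ring while every node keeps degree two.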
  • the MEMS optical switch configuration is known.
  • the method finds routes using any of the standard routing schemes such as the shortest path or a low congestion routing scheme. Some of the routes are single-hop MEMS connection while others are multi-hop MEMS connections.
  • the standard shortest path technique is used to calculate the routing paths.
  • the framework can be readily applied to any other routing scheme.
  • the output is used to tell ToRs on how to configure their routing tables.
  • Given the routing and the estimated traffic demand (aggregated) between each pair of ToRs, the method computes the link capacity desired on each link. To satisfy the capacity demand on each link, multiple wavelengths may be used. However, the sum of capacity demands of all links associated with a ToR switch must not exceed the capacity of this ToR.
  • One implementation requires at least one wavelength to be assigned to each edge on the physical topology. This guarantees an available path between any ToR pair, which may be required for mice/bursty flows.
  • the output is used to tell WSS on how to assign wavelengths.
  • the system's behavior depends on a tunable threshold, defined as the ratio of the expected throughput achieved via link capacity adjustment to that achieved via a network topology change. If the throughput obtained by only adjusting link capacity is significant enough compared to that obtained by rearranging the topology, the system can adjust link capacity while keeping the current topology. This is cheaper than changing the topology, since topology changes necessitate changes in the routing tables of ToRs. It is possible that the traffic pattern has fundamentally changed so that only adjusting the link capacity cannot provide a satisfactory throughput. In this case, the system reconfigures the network topology. In practice, the system can modify this threshold on demand to satisfy different performance requirements.
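  • The decision rule above can be expressed as a small comparison. The sketch below is illustrative only: the function name `choose_action`, the default threshold of 0.9, and the returned labels are assumptions, since the text does not give a concrete threshold value.

```python
def choose_action(throughput_capacity_only, throughput_new_topology, threshold=0.9):
    """Pick the cheaper reconfiguration when it recovers enough throughput.

    threshold is a hypothetical tuning value: the minimum fraction of the
    topology-change throughput that capacity adjustment alone must achieve.
    """
    if throughput_new_topology <= 0:
        return "adjust_capacity"
    ratio = throughput_capacity_only / throughput_new_topology
    return "adjust_capacity" if ratio >= threshold else "reconfigure_topology"


print(choose_action(95, 100))
print(choose_action(50, 100))
```

Raising the threshold makes the system reconfigure the topology more readily; lowering it favors the cheaper wavelength-only adjustment.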
  • routing can be easily realized in a centralized manner, where the manager is responsible for calculating and updating the routing table for each ToR.
  • the manager employs shortest path routing with failover paths.
  • any other sophisticated routing algorithms can be readily applied.
  • the flexibility of the architecture of FIG. 1 can be used not only to meet the changing traffic patterns, but also to handle failures (e.g., a WSS port failure can be taken care of via dynamically assigning that port's wavelength to remaining ports).
  • the system graphs are inherently fault-tolerant due to their path redundancy and we demonstrate, via simulations, appealing performance in the presence of a large percentage of link and/or node failures.
  • FIG. 4 shows another exemplary GreedyTree method to dynamically adjust the topology according to changing network traffic demand, different from the above method.
  • This mechanism is a tree-inspired design and attempts to form a tree in such a way that traffic is concentrated towards the leaves, so that voluminous flows do not traverse a large number of hops.
  • the input is a traffic matrix D (traffic demand between any pair of racks) where Di,j denotes traffic travelling from ToR i to ToR j.
  • D is asymmetric due to the directional nature of network traffic.
  • the method initializes a virtual node set V ( 302 ).
  • the method checks if V has only one element ( 304 ) and if so, exits processing.
  • the method determines a traffic matrix M over the set V ( 306 ), and then applies maximum weighted bipartite matching to determine which pairs of nodes should be connected to form a higher level virtual node ( 308 ).
  • standard matching is used to determine the real underlying nodes to connect ( 310 ). If there are not enough wavelengths to connect the nodes, the method reassigns least used wavelengths from the lower levels while maintaining connectivity ( 310 ). The method loops back to 304 until all elements are processed.
  • the method attempts to connect pairs of virtual nodes that yield the maximum benefit by finding a matching.
  • the initial set of virtual nodes is the same as the set of ToRs.
  • pairs of virtual nodes from the previous stage are connected.
  • the total bandwidth demand across two virtual-nodes is first computed by summing demands from the real nodes in each virtual-node to the other. These pair-wise demands are used as weights for a standard matching algorithm (such as Edmonds' algorithm, among others) to obtain the best set of virtual-edges.
  • Each virtual edge can have one or more real edges and a number of wavelengths.
  • edges and wavelengths are determined by a heuristic-based function which uses matching restricted to only the sets of nodes in the two virtual-nodes being connected. If more wavelengths and links are required than are available from the two virtual-nodes, then links and wavelengths from the lower-level are harvested (least useful at lower-level first) while preserving connectivity. The algorithm iterates until it has built one large virtual node. Once the method terminates, all configurations are pushed to the optical elements.
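  • The level-by-level merging in GreedyTree can be sketched as below. The demand between two virtual nodes is the sum of real-node demands in both directions, as described above. For brevity the sketch pairs virtual nodes greedily by demand instead of running a maximum weighted matching, and it omits the selection of real edges and wavelengths; the function names are hypothetical.

```python
from itertools import combinations


def virtual_demand(D, group_a, group_b):
    """Total demand between two virtual nodes, summed over both directions."""
    return sum(D.get((i, j), 0) + D.get((j, i), 0)
               for i in group_a for j in group_b)


def merge_level(D, virtual_nodes):
    """Pair up virtual nodes for one tree level (greedy stand-in for matching)."""
    pairs = sorted(
        combinations(range(len(virtual_nodes)), 2),
        key=lambda p: -virtual_demand(D, virtual_nodes[p[0]], virtual_nodes[p[1]]),
    )
    used, merged = set(), []
    for a, b in pairs:
        if a in used or b in used:
            continue
        used |= {a, b}
        merged.append(virtual_nodes[a] | virtual_nodes[b])
    # An odd node out is carried up to the next level unchanged.
    merged.extend(virtual_nodes[i] for i in range(len(virtual_nodes)) if i not in used)
    return merged


D = {("A", "B"): 10, ("C", "D"): 8, ("A", "C"): 1, ("B", "D"): 2}
level = [frozenset(t) for t in "ABCD"]
while len(level) > 1:
    level = merge_level(D, level)
    print([sorted(v) for v in level])
```

The loop stops once a single virtual node covers all ToRs, mirroring the termination condition checked at step 304.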
  • FIG. 5 shows an exemplary Darwinian method to dynamically reconstruct routing paths according to changing network traffic demand.
  • the method initializes a virtual node set V ( 330 ).
  • the method determines a traffic matrix M over the set V ( 332 ), and then applies a 4 matching technique to determine which pairs of nodes should be connected to form a higher level virtual node ( 334 ).
  • the method makes the graph connected using edge-exchange operations ( 336 ).
  • the Darwinian heuristic attempts to localize high-volume flows over direct circuit links. This is accomplished by using a weighted matching restricted to a degree of 4 (i.e., weighted 4-matching), representing the number of connections each ToR has to the MEMS. However, this does not impose connectivity. Connectivity is ensured using the edge-exchange operation on the edges of lowest weight across pairs of components, thus connecting them. This edge-exchange operation is repeated until connectivity is achieved between all source-destination pairs.
  • the Darwinian heuristic is based on the idea of starting out with a structured topology (like a k-regular circulant graph, a Kautz digraph, an incomplete hypercube, or even a DCell-like topology) from which the topology keeps evolving. Over this topology, it is possible to use degree-preserving operations to better conform to the traffic matrix. So if two ToRs which seek to establish a high bandwidth connection are connected to two other ToRs and are not serving much transit traffic, they can be connected directly, by breaking one of their current links.
  • the advantage of this method is that it is iterative and each iteration should be computationally inexpensive. It is also unlikely that many large flows change simultaneously, so a large number of such operations should rarely be required. It is possible to use this method as a continuous background optimization. The objective is to ensure that a weighted sum of path lengths is minimized.
  • the GreedyTree and Darwinian heuristics or processes reconstruct the network topology in adaptation to changing traffic demand and can deal with arbitrary traffic patterns. This is in contrast to conventional systems where a particular traffic pattern is assumed.
  • the GreedyTree method intelligently utilizes the switching and reconfiguration functionalities of WSS and adaptively redistributes wavelength assignment to cope with topology and routing changes. This is also the first application of WSS in data center networks.
  • the system finds routes using any of the standard routing schemes: shortest path or, preferably, a low-congestion routing scheme.
  • FPR Fault-tolerant Proteus Routing
  • the input is the topology represented by a graph G(V, E), the edge weights w, the source node s, and the destination node d.
  • the weight of each edge is set to one ( 350 ).
  • ; and P_Failover = shortest_path(G, s, d, w). Finally, the method returns P_Primary and P_Failover as the result ( 356 ).
  • The basic idea of FPR is simple. Using network status information, the Manager is responsible for calculating the routing table for each ToR switch. In one embodiment, for simplicity, the shortest path routing method of FIG. 6 is used for routing table construction. However, the scheme can readily be applied with any other, more sophisticated routing calculation. Once link or node failures happen, the affected devices report to the Manager, and the Manager reacts by invoking the control software to rearrange the link capacity or topology (based on the degree of failures) to bypass the failed parts. In this sense, FPR is a simple and flexible way to handle failures, largely due to the architecture of FIG. 1 .
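  • A primary and a failover path can be computed with two shortest-path runs, as in the sketch below. All edge weights start at one, as stated above. How the weights are changed between the two runs is not recoverable from the text, so penalizing the edges of the primary path is an assumption of this sketch, as are the function names and the penalty value.

```python
import heapq


def shortest_path(adj, s, d, w):
    """Dijkstra over an undirected graph; w maps frozenset({u, v}) to weight."""
    dist, prev, heap = {s: 0}, {}, [(0, s)]
    while heap:
        du, u = heapq.heappop(heap)
        if u == d:
            break
        if du > dist[u]:
            continue  # stale heap entry
        for v in adj[u]:
            nd = du + w[frozenset((u, v))]
            if nd < dist.get(v, float("inf")):
                dist[v], prev[v] = nd, u
                heapq.heappush(heap, (nd, v))
    if d not in dist:
        return None
    path = [d]
    while path[-1] != s:
        path.append(prev[path[-1]])
    return path[::-1]


def primary_and_failover(adj, s, d, penalty=1000):
    """Primary path on unit weights, then a failover avoiding its edges."""
    w = {frozenset((u, v)): 1 for u in adj for v in adj[u]}
    primary = shortest_path(adj, s, d, w)
    if primary is None:
        return None, None
    # Assumption: discourage reuse of primary edges by inflating their weight.
    for u, v in zip(primary, primary[1:]):
        w[frozenset((u, v))] += penalty
    return primary, shortest_path(adj, s, d, w)


ring = {"A": ["B", "D"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C", "A"]}
print(primary_and_failover(ring, "A", "C"))
```

If no edge-disjoint alternative exists, the failover path falls back to reusing penalized edges rather than failing outright.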
  • FIG. 7 shows an exemplary wavelength assignment method.
  • the input is a system graph and capacity demand on each link.
  • the method determines the number n of wavelengths to satisfy the capacity demand and replaces the link with n parallel directed links ( 380 ).
  • the method converts the resulting directed graph to an undirected graph by merging anti-parallel links ( 382 ).
  • the method then applies a standard edge-coloring heuristic on this graph, where wavelengths are the colors used to color the edges ( 384 ). If the resulting coloring uses one extra color, the method removes the color (i.e., wavelength) that is least used ( 386 ).
  • the system provisions or allocates wavelengths to serve capacity requirements.
  • the system first decides the necessary number (say n) of wavelengths allocated to each optical fiber to meet the capacity requirements and replaces this link with n parallel directed links in the graph. For instance, if each wavelength maximally carries 10 Gb/s and the capacity requirement of a particular link is 45 Gb/s, then the system replaces this link with 5 parallel links in the graph. This way, after this operation, we obtain a graph with degree of 32 for each node.
  • the system converts the resulting directed graph to an undirected graph by merging anti-parallel links, i.e., merging the directed link from node u to v and the one from v to u.
  • the system gets a new undirected graph with node degree 32.
  • the system applies a standard edge-coloring heuristic on this graph, where wavelengths are the colors used to color the edges. Since the heuristic may end up coloring the graph with one extra color (i.e., 33 colors), the final step is to remove the color (i.e., wavelength) that is least used.
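  • The wavelength assignment steps above can be sketched as follows. This is a minimal illustration, not the patented procedure: the function name, the dictionary-based graph representation, and the rule of merging anti-parallel links by taking the larger of the two wavelength counts are all assumptions. It also swaps in a simple greedy coloring that fails if the wavelength budget is exceeded, rather than coloring with one extra color and then dropping the least-used one.

```python
import math
from collections import defaultdict


def assign_wavelengths(demand_gbps, per_wavelength_gbps=10, num_wavelengths=32):
    """Greedy edge-coloring sketch (hypothetical helper, not from the source).

    demand_gbps maps a directed link (u, v) to its capacity demand in Gb/s.
    Returns a dict mapping each undirected edge to its list of wavelength indices.
    """
    # Step 1: number of wavelengths each directed link needs.
    need = {link: math.ceil(d / per_wavelength_gbps)
            for link, d in demand_gbps.items()}

    # Step 2: merge anti-parallel links u->v and v->u into one undirected edge.
    # Assumption: the merged edge needs the larger of the two counts.
    merged = defaultdict(int)
    for (u, v), n in need.items():
        key = (min(u, v), max(u, v))
        merged[key] = max(merged[key], n)

    # Step 3: color edges greedily, largest demand first. A wavelength may be
    # used at most once at each node, which is the edge-coloring constraint.
    used = defaultdict(set)
    colors = {}
    for (u, v), n in sorted(merged.items(), key=lambda kv: -kv[1]):
        free = [c for c in range(num_wavelengths)
                if c not in used[u] and c not in used[v]]
        if len(free) < n:
            raise ValueError(f"not enough free wavelengths for edge {(u, v)}")
        chosen = free[:n]
        colors[(u, v)] = chosen
        used[u].update(chosen)
        used[v].update(chosen)
    return colors


plan = assign_wavelengths({("A", "B"): 45, ("B", "A"): 20, ("A", "C"): 10})
```

  • In this example the 45 Gb/s demand from A to B is rounded up to 5 wavelengths, matching the arithmetic in the text, and the wavelengths chosen for A-C do not overlap with those of A-B because both edges meet at ToR A.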
  • This method automatically generates hop-by-hop routing protocols based on network topology changes. This is also a breakthrough in optical communications especially in the context of data center networks, where only point-to-point optical communication is considered.
  • If a direct optical path does not exist, a hop-by-hop path can be used instead.
  • a multi-hop routing protocol is used. Once a suitable configuration and paths have been computed, these are pushed to all ToRs. ToRs thus know their routes to all other ToRs and use source routing. Each packet from a server destined to some other server outside the ToR is tunneled through this source-routing protocol between ToRs. At the source ToR, a sequence of destination ToRs is specified in the header and sent to the first ToR through the local forwarding table. The first hop then looks at the next hop in sequence and sends the packet to it and this is repeated until the data reaches the destination.
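  • The source-routing behavior described above can be sketched as follows. This is an illustrative model only: the packet layout (a dictionary with a "route" list) and the function names are assumptions, and the local forwarding table lookup at each ToR is not modeled.

```python
def make_packet(payload, tor_path):
    """Build a packet at the source ToR.

    tor_path is the full ToR sequence from source to destination; the header
    carries every ToR after the source (hypothetical layout).
    """
    return {"route": list(tor_path[1:]), "payload": payload}


def next_hop(packet):
    """Pop and return the next ToR in the header, or None at the destination."""
    if not packet["route"]:
        return None
    return packet["route"].pop(0)


def deliver(payload, tor_path):
    """Walk the packet hop by hop and return the ToRs it visited."""
    packet = make_packet(payload, tor_path)
    visited = [tor_path[0]]
    while True:
        hop = next_hop(packet)
        if hop is None:
            return visited
        visited.append(hop)


trace = deliver("data", ["ToR1", "ToR5", "ToR9"])
```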
  • the all-optical network described herein can be easily supplemented with other forms of network connectivity including wireless and electrical networks.

Abstract

Systems and methods are disclosed for a method to communicate over an optical network by using hop-by-hop routing over an optical network; and dynamically constructing a network topology.

Description

  • The present application claims priority to Provisional Application Ser. Nos. 61/362,482, filed Jul. 8, 2010, and 61/436,283, filed on Jan. 26, 2011, the contents of which are incorporated by reference.
  • BACKGROUND
  • The present invention relates to an optical switching network.
  • Two key challenges faced by existing data center network (DCN) architectures are (a) balancing the demand for high bandwidth connectivity between all pairs of servers with the associated high cost, and (b) having the flexibility to support a variety of applications and their traffic demand.
  • Many online services, such as those offered by Amazon, Google, Facebook, and eBay, are powered by massive data centers hosting tens to hundreds of thousands of servers. The network interconnect of the data center plays a key role in the performance and scalability of these services. As application traffic and the number of hosted applications grow, the industry is constantly looking for larger server-pools, higher bit-rate network-interconnects, and smarter workload placement approaches to effectively utilize the network resources. To meet these goals, a careful examination of traffic characteristics, operator requirements, and network technology trends is critical.
  • High bandwidth, static network connectivity between all server pairs ensures that the network can support an arbitrary application mix. However, static network topologies that provide such connectivity tend to be quite expensive (in terms of both the startup as well as recurring costs), and cannot scale beyond a certain number of interconnected servers. Further, for many applications, all-to-all connectivity at all times is not needed, and hence static network connectivity can be quite wasteful in these cases. Finally, such topologies also suffer from the need to “re-wire” the network to support greater network bandwidth demands from future applications.
  • Existing DCN architecture proposals attempt to address these challenges by using a hybrid approach that combines small-scale, all-to-all connectivity using electrical interconnects with alternative data transmission technologies (e.g. high-speed wireless or optical switching) that provide flexibility in terms of adapting to traffic demands. In these approaches, the workload is split between the electrical and optical network paths such that peak traffic is offloaded to the extra paths (could be wireless/optical/electrical). This use of optical or wireless transmission technologies as an add-on, as opposed to a fundamental component of the architecture, limits the applicability of these solutions to today's network traffic patterns and bandwidth demands—the base network topology is not flexible and is built on the assumption that average traffic patterns are known in advance. In addition, these solutions also suffer from the need to re-wire the electrical network to support higher throughputs.
  • SUMMARY
  • In one aspect, systems and methods are disclosed for a method to communicate over an optical network by using hop-by-hop routing over an optical network; and dynamically constructing a network topology.
  • In one aspect, a method to communicate over an optical network includes dynamically constructing a network topology based on traffic demands and hop-by-hop routing; and constructing a dynamically changing data center network (DCN) architecture.
  • In another aspect, a method for interconnecting a data center network includes using hop-by-hop routing over an optical network.
  • In yet another aspect, a method for interconnecting a data center network includes using hop-by-hop routing over an optical network; and using bidirectional optical network devices to enable bidirectional communication over fiber.
  • In a further aspect, a method for interconnecting a data center network includes using hop-by-hop routing over an optical network; using bidirectional optical network devices to enable bidirectional communication over fiber; and dynamically constructing a network topology.
  • In yet another aspect, a method for interconnecting a data center with an optical network includes using bidirectional optical network devices to enable bidirectional communication over fiber.
  • Advantages of the preferred embodiment may include one or more of the following. The system is the first-ever all-optical switching architecture for data center networks (DCNs). By exploiting runtime reconfigurable optical devices, the system can dynamically change network topology as well as link capacities, thus achieving unprecedented flexibility to adapt to different traffic patterns.
  • The system addresses these drawbacks of static network topologies by providing a dynamic DCN architecture that can adapt to application traffic demands in an efficient manner while also supporting high bandwidth server-to-server connectivity. The key feature is that it allows any subset of servers to be connected at full bandwidth in an on-demand manner without requiring static, all-to-all full bandwidth connectivity.
  • The preferred embodiment can adapt the network topology based on application traffic demands, while also supporting high bandwidth connectivity between any subset of servers. To accomplish these challenging tasks, the system uses three basic building blocks: (1) an innovative placement of optical devices, (2) algorithms for adaptive network reconfiguration (Procedure 2(a), 2(b), 3, and 5) based on traffic demand dynamics, and (3) hop-by-hop routing (Procedure 6).
  • The innovative placement of optical devices allows this preferred embodiment to use re-configurable optical paths. This enables the system to be flexible in terms of path and capacity assignment between the servers. Exactly how these paths are re-configured to interconnect servers, as well as the capacity of each path, is controlled by our adaptive network re-configuration algorithms. By extensively using optical fibers that have the ability to support higher bandwidths simply by adding wavelengths, higher throughputs can be supported without re-wiring. As Proteus does not impose the requirement of underlying all-to-all electrical connectivity between the servers, and due to the physical limitation on the number of possible optical paths between servers, the inclusion of hop-by-hop routing is necessary in our design. The intuition here is that if a direct optical path does not exist, a hop-by-hop path can be used instead. For this purpose, we include a multi-hop routing protocol that uses source-routing.
  • Other advantages of the preferred embodiment may include one or more of the following:
  • 1) On-demand flexibility: Proteus does not make any assumption on traffic patterns and is able to adaptively reconstruct network communication paths based on traffic demand. This makes the preferred embodiment highly appealing to future data centers where both the network and application may evolve over time.
  • 2) High server-to-server throughput: Proteus significantly improves the communication bandwidth between any pair of servers. Once the optical circuit path is set up, a bit rate transparent communication pipe becomes available. With current technologies, per channel bit rate in optical fiber communications can be as high as 40 Gb/s or 100 Gb/s, and the total capacity per fiber with DWDM technologies can reach 69 Tb/s.
  • 3) Efficient network resource utilization: Network paths are dynamically constructed based on traffic demand in such a way that overall network-wide traffic can be maximally served. This global optimization overcomes network resource fragmentation incurred by today's tree-based DCN architectures and other existing approaches where local optimization is adopted.
  • 4) Cabling simplicity: One of the challenges faced by current data center networks is the high complexity of a large number of connecting cables. With the adoption of optical fiber cabling, network upgrades and expansion can be achieved by adding wavelengths instead of cables.
  • 5) Lower power consumption: Optical components generally consume a fraction of energy relative to their electrical counterparts, and since this preferred embodiment uses optical components extensively, the overall DCN power consumption should be lowered significantly.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 shows an exemplary system with optical interconnects in a data center network.
  • FIG. 2 shows in more detail the optical component of FIG. 1.
  • FIG. 3 shows an exemplary control manager for the system of FIG. 1.
  • FIG. 4 shows an exemplary Greedy-Tree method to dynamically reconstruct routing paths according to changing network traffic demand.
  • FIG. 5 shows an exemplary Darwinian method to dynamically reconstruct routing paths according to changing network traffic demand.
  • FIG. 6 shows an exemplary fault-tolerant routing method.
  • FIG. 7 shows an exemplary wavelength assignment method.
  • DETAILED DESCRIPTION
  • FIG. 1 shows an exemplary system with optical interconnects in a data center network. An optical switch matrix (OSM) 102 allows a plurality of optical ports to communicate with each other through optical components 110. Each optical component 110 in turn communicates with a top of rack (ToR) switch. Each ToR switch in turn is connected to a plurality of servers and to other ToRs.
  • The system of FIG. 1 uses hop-by-hop routing, in which traffic that cannot be provisioned with a direct end-to-end circuit is routed to the destination by traversing multiple hops (i.e., ToR switches). Each ToR switch not only receives traffic destined for servers located in its own rack, but also forwards transit traffic targeted at servers residing in other racks. This mechanism allows the system of FIG. 1 to achieve connectivity between any pair of origin and destination servers. This approach is in contrast to conventional optical communication systems, in which only single-hop routing is employed.
  • In one particular instantiation, each ToR switch is a conventional switch with 64 10-GigE ports. Of these 64 ports at each ToR, 32 are connected to servers via existing intra-ToR interconnects. Each of the remaining 32 ports is used to connect to the optical interconnect between ToRs. Each inter-ToR port is attached to transceivers associated with a fixed wavelength for sending and receiving data. Excluding the ToR switches, all the remaining interconnect elements are optical. These optical elements allow for reconfiguration, making the network highly adaptive to changes in the underlying traffic requirements.
  • The system of FIG. 1 uses all optical interconnects. In contrast to their electrical counterparts, optical network elements support on-demand provisioning of connectivity and capacity where required in the network, thus permitting the construction of thin, but malleable interconnects for large server pools. Optical links can support higher bit-rates over longer distances using less power than copper cables. Moreover, optical switches run cooler than electrical ones, implying lower heat dissipation and cheaper cooling cost.
  • FIG. 2 shows in more detail the optical component 110. To make full use of the MEMS ports, each circuit over the MEMS is bidirectional. For this, optical circulators 126 and 136 are placed between the ToR and MEMS ports. A circulator 126 connects the send channel of the transceiver from a ToR 120 to the MEMS port 102 (after the channel has passed through the WSS 124). It simultaneously delivers the traffic incoming towards a ToR from the MEMS to this ToR. Even though the MEMS edges are bidirectional, the capacities of the two directions are independent of each other. The inter-ToR ports attach themselves to two transceivers so that they can send and receive data simultaneously. As shown in the left half of FIG. 2, the optical fiber from the “send” transceivers from each of the 32 ports at a ToR 120 is connected to an optical multiplexer 122. Each port is associated with a wavelength, unique across ports at the ToR 120, in order to exploit wavelength division multiplexing (WDM). This allows data from different ports to be multiplexed into one fiber without contention. This fiber is then connected to a 1×4 Wavelength Selective Switch (WSS) 124. The WSS 124 is typically an optical component consisting of one common port and several wavelength ports. It partitions the set of wavelengths coming in through the common port among the wavelength ports, and the mapping is runtime-configurable (in a few milliseconds). The WSS 124 can split the set of 32 wavelengths it sees into four groups, each group being transmitted out on its own fiber. Each such fiber is connected to the MEMS optical switch 102 through a circulator 126 to enable bidirectional traffic through it. The circulators enable bidirectional optical transmission over a fiber, allowing more efficient use of the ports of optical switches. An optical circulator is a three-port device: one port is a shared fiber or switching port, and the other two ports serve as send and receive ports. Optical transceivers can be of two types: coarse WDM (CWDM) and dense WDM (DWDM). One embodiment uses DWDM-based transceivers, which support higher bit-rates and more wavelength channels in a single piece of fiber compared to CWDM.
  • The receiving infrastructure (shown in the right half of FIG. 2) has a coupler 136 connected to a demultiplexer 132 which separates multiple incoming wavelengths, each then delivered to a different port. In one embodiment, four receive fibers from each of four circulators are connected to a power coupler 134 which combines their wavelengths onto one optical fiber. This fiber feeds into a demultiplexer 132 which splits each incoming wavelength to its associated port for a ToR 130. In one embodiment, the interconnect of FIG. 1 uses a 320-port micro-electro-mechanical systems (MEMS) switch to connect 80 ToRs with a total of 2560 servers.
  • Depending on the channel spacing, using WDM, a number of channels or wavelengths can be transmitted over a single piece of fiber in the conventional or C-band. In one embodiment, each wavelength is rate-limited by the electrical port it is connected to. The OSM modules in optical communications can be bipartite switching matrices where any input port can be connected to any one of the output ports. A Micro-Electro-Mechanical Switch (MEMS) can be used as an OSM; it achieves reconfigurable one-to-one circuits between its input and output ports by mechanically adjusting micro mirrors.
  • The system of FIG. 2 offers highly flexible bandwidth. Every ToR has degree k. If each edge had fixed bandwidth, multiple edges would need to be utilized for this ToR to communicate with another ToR at a rate higher than a single edge supports. To overcome this problem, the system combines the capability of optical fibers to carry multiple wavelengths at the same time (WDM) with the dynamic reconfigurability of the WSS. Consequently, a ToR is connected to MEMS through a multiplexer and a WSS unit.
  • Specifically, suppose ToR A wants to communicate with ToR B using w times the line speed of a single port. The ToR will use w ports, each associated with a (unique) wavelength, to serve this request. WDM enables these w wavelengths, together with the rest from this ToR, to be multiplexed into one optical fiber that feeds the WSS. The WSS splits these w wavelengths to the appropriate MEMS port which has a circuit to ToR B (doing likewise for k−1 other sets of wavelengths). Thus, a w×(line-speed) capacity circuit is set up from A to B, at runtime. By varying the value of w for every MEMS circuit connection, the system offers dynamic capacity for every edge.
  • In one embodiment, each ToR can communicate simultaneously with any four other ToRs. Thus, the MEMS switch 102 can construct all possible 4-regular ToR interconnection graphs. Secondly, through WSS configuration, each of these four links' capacity can be varied in {0, 10, 20, . . . , 320} Gbps, provided the sum does not exceed 320 Gbps. Thus, both the path between servers as well as the capacity of these paths can be varied in this architecture.
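  • The capacity arithmetic above can be sketched as follows. The function name and the list-based WSS mapping are assumptions made for illustration; the sketch only shows that assigning each of the 32 wavelengths to at most one of the four WSS ports yields per-link capacities in multiples of 10 Gbps whose sum cannot exceed 320 Gbps.

```python
def wss_link_capacities(wavelength_to_port, num_ports=4,
                        num_wavelengths=32, line_rate_gbps=10):
    """Per-link capacity implied by a WSS mapping (hypothetical helper).

    wavelength_to_port[k] is the WSS output port (0..num_ports-1) that
    wavelength k is steered to, or None if wavelength k is unused.
    """
    if len(wavelength_to_port) != num_wavelengths:
        raise ValueError("one entry is required per wavelength")
    capacity = [0] * num_ports
    for port in wavelength_to_port:
        if port is None:
            continue
        if not 0 <= port < num_ports:
            raise ValueError(f"invalid WSS port {port}")
        # Each wavelength contributes one port's line rate to its link.
        capacity[port] += line_rate_gbps
    return capacity


# 20 wavelengths to link 0, 8 to link 1, 4 to link 2, none to link 3.
caps = wss_link_capacities([0] * 20 + [1] * 8 + [2] * 4)
```

  • Because every wavelength is steered to at most one port, the 320 Gbps total holds by construction and needs no separate check.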
  • To enable a ToR pair to communicate using all available wavelengths, each ToR port (facing the optical interconnect) is assigned a wavelength unique across ports at the ToR. The same wavelength is used to receive traffic as well: each port thus sends and receives traffic at one fixed wavelength. The same set of wavelengths is recycled across ToRs. This allows all wavelengths at one ToR to be multiplexed and delivered after demultiplexing to individual ports at the destination ToR. This wavelength-port association is a static, design/build time decision.
  • One exemplary specific instantiation of FIG. 1 deploys N=80 ToRs, W=32 wavelengths and k=4 ToR-degree using a 320 port MEMS to support 2560 servers. Each ToR is a conventional electrical switch with 64 10-GigE non-blocking ports. 32 of these ports are connected to servers, while the remaining face the optical interconnect. Each port facing the optical interconnect has a transceiver associated with a fixed and unique wavelength for sending and receiving data. The transceiver uses separate fibers to connect to the send and receive infrastructures. The send fiber from the transceivers from each of the 32 ports at a ToR is connected to an optical multiplexer. The multiplexer feeds a 1×4 WSS. The WSS splits the set of 32 wavelengths it sees into 4 groups, each group being transmitted on its own fiber. These fibers are connected to the MEMS switch through circulators to enable bidirectional traffic through them. The 4 receive fibers from each of 4 circulators corresponding to a ToR are connected to a power coupler (similar to a multiplexer, but simpler), which combines their wavelengths onto one fiber. This fiber feeds a demultiplexer, which splits each incoming wavelength to its associated port on the ToR.
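  • As a quick check of the figures in this instantiation, the sizing works out as below. The only assumption added here is that each server-facing port connects one server, which is consistent with the stated total of 2560.

```latex
\begin{aligned}
\text{MEMS ports} &= N \times k = 80 \times 4 = 320,\\
\text{servers} &= N \times 32 = 80 \times 32 = 2560,\\
\text{optical capacity per ToR} &= W \times 10\ \text{Gb/s} = 32 \times 10\ \text{Gb/s} = 320\ \text{Gb/s}.
\end{aligned}
```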
  • In this interconnect, each ToR can communicate simultaneously with any 4 other ToRs. This implies that MEMS reconfigurations allow us to construct all possible 4-regular ToR graphs. Second, through WSS configuration, each of these 4 links' capacity can be varied in {0, 10, 20, . . . , 320} Gbps. As discussed in more detail below, these configurations are decided by a centralized manager. The manager obtains the traffic matrix from the ToR switches, calculates appropriate configurations, and pushes them to the MEMS, WSS, and ToRs. This requires direct, out-of-band connections between the manager and these units. The implementation is highly flexible: given a number N of Top-of-Rack (ToR) switches and a design-time-fixed parameter k, the system can assume any k-regular topology over the N ToRs. To illustrate how many options this gives, consider that for just N=20, there are over 12 billion (non-isomorphic) connected 4-regular graphs. In addition, the system allows the capacity of each edge in this k-regular topology to be varied from a few Gb/s to a few hundred Gb/s. Simulations show that the system can always deliver full bisection bandwidth for low-degree (e.g., inter-ToR ≤ 4) traffic patterns, and even over 60% of the throughput of a non-blocking network in the case of moderately high-degree (e.g., inter-ToR ∈ [4, 20]) traffic patterns. Furthermore, it enables lower (50%) power consumption and lower (20%) cabling complexity compared to a fat-tree connecting a similar number of servers. While at current retail prices the system is marginally more costly (10%) than a fat-tree (at 10 GigE per port), a cost advantage should materialize as optical equipment sees commoditization and higher bit-rates gain traction.
  • With a larger number of MEMS and WSS ports, topologies with higher degrees and/or larger numbers of ToRs can be built. It is also possible to make heterogeneous interconnects—a few nodes can have larger degree than the rest.
  • The system of FIGS. 1-2 achieves topology flexibility by exploiting the reconfigurability of the MEMS. Given a ToR-graph connected by optical circuits through the MEMS, the system uses hop-by-hop stitching of such circuits to achieve network connectivity. To reach ToRs not directly connected to it through the MEMS, a ToR uses one of its connections. This first-hop ToR receives the transmission over fiber, converts it to electrical signals, reads the packet header, and routes it towards the destination. At each hop, every packet experiences conversion from optics to electronics and then back to optics (O-E-O). Such conversion can be done at the sub-nanosecond level. At any port, the aggregate transit, incoming and outgoing traffic cannot exceed the port's capacity in each direction. So, high-volume connections must use a minimal number of hops. The system manages the topology to adhere to this requirement.
  • To support adapting to a wider variety of traffic patterns, the flexible DCN architecture of FIG. 1 also needs a topology manager that (a) configures the MEMS to adjust the topology to localize high traffic volumes, (b) configures the WSS at each ToR to adjust the capacity of its four outgoing links to provision bandwidth where it is most gainful, and (c) picks routes between ToR-pairs to achieve high throughput, low latency and minimal network congestion.
  • The control software run by the topology manager solves this problem of topology management, which can be formulated as a mixed-integer linear program. In the following discussion, the input is a traffic demand matrix D between ToRs, where Dij is the desired bandwidth from ToRi to ToRj.
  • Variables: There are four classes of variables: $l_{ij} = 1$ if ToR $i$ is connected to ToR $j$ through the MEMS and 0 otherwise; $w_{ijk} = 1$ if $l_{ij}$ carries wavelength $\lambda_k$ in the $i \to j$ direction and 0 otherwise; a traffic-served matrix $S$, where $S_{ij}$ is the bandwidth provisioned (possibly over multiple paths) from ToR $i$ to ToR $j$; and $v_{ijk}$, the volume of traffic carried by wavelength $\lambda_k$ along $i \to j$. Among the latter two sets of variables, $S_{ij}$ have end-to-end meaning, while $v_{ijk}$ have hop-to-hop significance. For all variables, $k \in \{1, 2, \ldots, \lambda_{\mathrm{Total}}\}$; $i, j \in \{1, 2, \ldots, \#\mathrm{ToRs}\}$, $i \neq j$; $l_{ij}$ are the only variables for which $l_{ij} = l_{ji}$ always holds; all other variables are directional.
  • Objective: A simplistic objective is to maximize the traffic served (constrained by demand, see (6)):
  • $\text{Maximize } \sum_{i,j} S_{ij}.$  (1)
  • Constraints:
  • A wavelength $\lambda_k$ can only be used between two ToRs if they are connected through the MEMS:
  • $\forall i, j, k: \; w_{ijk} \le l_{ij}.$  (2)
  • ToR $i$ can receive/send $\lambda_k$ from/to at most one ToR (this is illustrated in FIG. 3):
  • $\forall i, k: \; \sum_{j} w_{jik} \le 1; \quad \sum_{j} w_{ijk} \le 1.$  (3)
  • If the number of ports of the WSS units is $W$, then ToR $i$ is connected to exactly $W$ other ToRs:
  • $\forall i: \; \sum_{j} l_{ij} = W.$  (4)
  • Hop-by-hop traffic is limited by port capacities ($C_{port}$), wavelength capacity ($C_{\lambda}$), and provisioning:
  • $\forall i, j, k: \; v_{ijk} \le \min\{C_{port},\, C_{\lambda} \times w_{ijk}\}.$  (5)
  • A further constraint is to never provision more traffic than demanded:
  • $\forall i, j: \; S_{ij} \le D_{ij}.$  (6)
  • The outgoing transit traffic (total traffic flowing out, minus total traffic for which ToR $i$ is the origin) equals the incoming transit traffic at ToR $i$:
  • $\forall i: \; \sum_{j,k} v_{ijk} - \sum_{j} S_{ij} = \sum_{j,k} v_{jik} - \sum_{j} S_{ji}.$  (7)
  • The above mixed-integer linear program (MILP) can be seen as a maximum multi-commodity flow problem with degree bounds, further generalized to allow constrained choices in edge capacities. While several variants of the degree-bounded subgraph and maximum flow problems have known polynomial-time algorithms, even trivial combinations of the two are known to be NP-hard. Thus, to simplify the computation, heuristic approaches are presented below by which the control software finds an optimized topology and link capacity assignment to meet the changing traffic patterns. The control software tightly interacts with the OSM/MEMS, WSS and ToR switches to control the network topology, link capacity and routing.
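  • To make the constraints concrete, the following sketch checks a candidate solution against constraints (2) through (7). It is a verifier, not a solver, and it is not part of the described system: the function name, argument names, and nested-list representation are assumptions, and indices are zero-based here rather than one-based as in the text.

```python
def violated_constraints(n, num_lambdas, l, w, v, S, D,
                         wss_ports, c_port, c_lambda):
    """Return the sorted constraint numbers (2..7) that a candidate violates.

    n is the number of ToRs and num_lambdas the number of wavelengths.
    l[i][j], w[i][j][k], v[i][j][k], S[i][j] and D[i][j] follow the
    notation of the text (hypothetical data layout).
    """
    bad = set()
    ks = range(num_lambdas)
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for j in others:
            for k in ks:
                if w[i][j][k] > l[i][j]:
                    bad.add(2)  # wavelength used without a MEMS circuit
                if v[i][j][k] > min(c_port, c_lambda * w[i][j][k]):
                    bad.add(5)  # traffic exceeds port or wavelength capacity
            if S[i][j] > D[i][j]:
                bad.add(6)      # more traffic provisioned than demanded
        for k in ks:
            if sum(w[j][i][k] for j in others) > 1:
                bad.add(3)      # wavelength k received from several ToRs
            if sum(w[i][j][k] for j in others) > 1:
                bad.add(3)      # wavelength k sent to several ToRs
        if sum(l[i][j] for j in others) != wss_ports:
            bad.add(4)          # ToR degree differs from WSS port count
        out_transit = (sum(v[i][j][k] for j in others for k in ks)
                       - sum(S[i][j] for j in others))
        in_transit = (sum(v[j][i][k] for j in others for k in ks)
                      - sum(S[j][i] for j in others))
        if abs(out_transit - in_transit) > 1e-9:
            bad.add(7)          # transit traffic is not conserved
    return sorted(bad)
```

  • For example, with two ToRs joined by one circuit carrying one 10 Gb/s wavelength, serving exactly the 10 Gb/s demand satisfies every constraint, while claiming to serve 20 Gb/s violates (6) and (7).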
  • FIG. 3 shows an exemplary control manager 200 that controls the system 100 of FIG. 1. The control system includes a module 202 that estimates traffic demand. The module 202 provides input to a module 204 that assigns pairs with heavy communications to direct links. Next a module 206 performs the connectivity accordingly. Through modules 204-206, the manager 200 controls the MEMS optical switch 102 to adjust the network topology. Next, a module 210 identifies routing paths and sends all the ToRs these paths in order to set up their routing tables. A module 214 then determines the capacity demand on each link and a module 216 then determines the wavelength assignment scheme.
  • In one embodiment, as conventionally done, the software estimates the traffic demand according to max-min fair bandwidth allocation for TCP flows in an ideal non-blocking network. All the flows are only limited by the sender or receiver network interface cards (NICs).
  • The manager assigns direct links to heavily communicating pairs. High-volume communicating pairs (i.e., ToR switches) are placed on direct MEMS circuit links. This is accomplished by using a weighted b-matching, where b represents the number of connections that each ToR has to the MEMS (b=4 in our example scenario). It is easy to cast the problem of localizing high-volume ToR-connections to b-matching: in the ToR graph, assign the edge-weight between two ToRs as the estimated flow-size between them. Weighted b-matching is a graph theoretic problem for which an elegant polynomial-time algorithm is known. In one embodiment, the weighted b-matching algorithm is approximated using multiple 1-matchings.
  • Connectivity is achieved through the edge-exchange operation as follows. First, the method locates all connected components. If the graph is not connected, the method selects two edges a→b and c→d with the lowest weights in different connected components, and simply replaces links a→b and c→d with links a→c and b→d to connect them. A check is done to make sure that the links removed are not themselves cuts in the graph. The output of steps 2 and 3 is used to tell the MEMS optical switch 102 how to configure the network topology.
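  • The casting of direct-link assignment as weighted b-matching can be illustrated with the sketch below. Note that it swaps in a simple greedy heuristic (heaviest edge first, subject to the degree bound b) rather than an exact b-matching algorithm or the multiple 1-matchings approximation mentioned above; the function name and the dictionary input format are assumptions.

```python
def greedy_b_matching(weights, b=4):
    """Greedy approximation of weighted b-matching (hypothetical helper).

    weights maps an unordered ToR pair (u, v) to the estimated flow size
    between them. Returns the chosen direct links, heaviest first, such
    that no ToR appears in more than b links.
    """
    degree = {}
    chosen = []
    ranked = sorted(weights.items(), key=lambda kv: kv[1], reverse=True)
    for (u, v), _flow in ranked:
        if u == v:
            continue  # a ToR never needs a circuit to itself
        if degree.get(u, 0) < b and degree.get(v, 0) < b:
            chosen.append((u, v))
            degree[u] = degree.get(u, 0) + 1
            degree[v] = degree.get(v, 0) + 1
    return chosen


flows = {("A", "B"): 9, ("A", "C"): 7, ("B", "C"): 5, ("A", "D"): 1}
links = greedy_b_matching(flows, b=2)
```

  • With b=2, ToR A is saturated after taking its two heaviest partners, so the light A-D flow is left to be served over a multi-hop path.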
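  • A minimal sketch of the edge-exchange operation follows. The helper names and the tuple-based edge list are assumptions; the cut check is done naively by removing a candidate edge and re-testing connectivity, and newly created links are given weight 0 because no weight is specified for them in the text.

```python
def components(nodes, edges):
    """Connected components of an undirected graph, via depth-first search."""
    adj = {n: set() for n in nodes}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, comps = set(), []
    for start in nodes:
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:
            n = stack.pop()
            if n in comp:
                continue
            comp.add(n)
            stack.extend(adj[n] - comp)
        seen |= comp
        comps.append(comp)
    return comps


def lightest_non_cut_edge(comp, edges, weight):
    """Lowest-weight edge inside comp whose removal keeps comp connected."""
    inside = [e for e in edges if e[0] in comp and e[1] in comp]
    for e in sorted(inside, key=lambda e: weight.get(e, 0)):
        rest = [x for x in inside if x != e]
        if len(components(comp, rest)) == 1:
            return e
    return None


def connect_by_edge_exchange(nodes, edges, weight):
    """Repeat a-b, c-d -> a-c, b-d swaps until the graph is connected."""
    edges = list(edges)
    while True:
        comps = components(nodes, edges)
        if len(comps) == 1:
            return edges
        e1 = lightest_non_cut_edge(comps[0], edges, weight)
        e2 = lightest_non_cut_edge(comps[1], edges, weight)
        if e1 is None or e2 is None:
            raise ValueError("no non-cut edge available for exchange")
        (a, b), (c, d) = e1, e2
        edges.remove(e1)
        edges.remove(e2)
        edges += [(a, c), (b, d)]  # node degrees are unchanged by the swap
```

  • Each swap merges two components and preserves every node's degree, so the b-matching degree bound continues to hold after connectivity is restored.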
  • Once connectivity is determined, the MEMS optical switch configuration is known. The method finds routes using any of the standard routing schemes such as the shortest path or a low congestion routing scheme. Some of the routes are single-hop MEMS connection while others are multi-hop MEMS connections. In one implementation, the standard shortest path technique is used to calculate the routing paths. However, the framework can be readily applied to any other routing scheme. The output is used to tell ToRs on how to configure their routing tables.
  • Given the routing and the estimated traffic demand (aggregated) between each pair of ToRs, the method computes the link capacity desired on each link. To satisfy the capacity demand on each link, multiple wavelengths may be used. However, the sum of capacity demands of all links associated with a ToR switch must not exceed the capacity of this ToR.
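  • The link capacity computation can be sketched as below. The function names and data layout are assumptions, and the capacity check considers only outgoing links at each ToR, which is one possible reading of "all links associated with a ToR switch".

```python
from collections import defaultdict


def link_loads(routes, demand):
    """Sum the demand routed over each directed link (hypothetical helper).

    routes[(s, d)] is the ToR path [s, ..., d]; demand[(s, d)] is the
    estimated aggregate traffic from s to d in Gb/s.
    """
    load = defaultdict(float)
    for pair, path in routes.items():
        for u, v in zip(path, path[1:]):
            load[(u, v)] += demand.get(pair, 0.0)
    return dict(load)


def overloaded_tors(load, tor_capacity_gbps=320.0):
    """ToRs whose outgoing link demands sum to more than the ToR capacity."""
    out = defaultdict(float)
    for (u, _v), gbps in load.items():
        out[u] += gbps
    return sorted(t for t, total in out.items() if total > tor_capacity_gbps)


routes = {("A", "C"): ["A", "B", "C"], ("A", "B"): ["A", "B"]}
demand = {("A", "C"): 30.0, ("A", "B"): 20.0}
load = link_loads(routes, demand)
```

  • Here the A→B link carries both the direct A→B demand and the transit traffic from A to C, which is why multi-hop routes raise the capacity needed on intermediate links.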
  • After figuring out the desired capacity on each link, the system needs to provision wavelengths appropriately to serve these demands. This problem is reduced to an edge-coloring problem on a multigraph. Multiple edges correspond to volume of traffic between two nodes, and wavelengths are the colors to be used to color these edges. For instance, D→A and B→A cannot both use the same wavelength. This constraint stems from the fact that two data-flows encoded over the same wavelength can not share the same optical fiber in the same direction. Various fast edge-coloring heuristics can be used, and an algorithm based on Vizing's theorem is used in one embodiment due to speed and code availability.
  • One implementation requires at least one wavelength to be assigned to each edge on the physical topology. This guarantees an available path between any ToR-pair, which may be required for mice/bursty flows. The output is used to tell the WSS how to assign wavelengths.
  • During operation, the system works based on the value of η. η is defined as the expected throughput achieved via link capacity adjustment relative to that achieved via network topology change. If the throughput obtained by only adjusting link capacity is significant enough compared to that obtained by rearranging the topology, the system can adjust link capacity while keeping the current topology. This is cheaper than changing the topology, since topology changes necessitate changes in the routing tables of ToRs. It is possible that the traffic pattern has fundamentally changed so that adjusting the link capacity alone cannot provide a satisfactory throughput. In this case, the system reconfigures the network topology. In practice, the system can modify η on demand to satisfy different performance requirements.
  • Due to the easy availability of network state (e.g., topology, traffic demand, etc.) at the manager, routing can be easily realized in a centralized manner, where the manager is responsible for calculating and updating the routing table for each ToR. For simplicity, the manager employs shortest path routing with failover paths. However, any other sophisticated routing algorithm can be readily applied. The flexibility of the architecture of FIG. 1 can be used not only to meet the changing traffic patterns, but also to handle failures (e.g., a WSS port failure can be taken care of by dynamically assigning that port's wavelengths to the remaining ports). In addition, the system graphs are inherently fault-tolerant due to their path redundancy and we demonstrate, via simulations, appealing performance in the presence of a large percentage of link and/or node failures.
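  • Shortest path routing with a failover path can be sketched as follows. How the failover weights are chosen is not stated in this passage, so the sketch makes an explicit assumption: the failover path is found by re-running a shortest path search after adding a large penalty to every link of the primary path. The function names and the adjacency-dictionary graph format are also illustrative.

```python
import heapq


def shortest_path(graph, s, d, weight):
    """Dijkstra's algorithm.

    graph[u] lists the neighbors of u; weight[(u, v)] is the link cost.
    Returns the node list from s to d, or None if d is unreachable.
    """
    dist, prev = {s: 0.0}, {}
    heap = [(0.0, s)]
    while heap:
        du, u = heapq.heappop(heap)
        if u == d:
            break
        if du > dist.get(u, float("inf")):
            continue  # stale heap entry
        for nbr in graph[u]:
            nd = du + weight[(u, nbr)]
            if nd < dist.get(nbr, float("inf")):
                dist[nbr], prev[nbr] = nd, u
                heapq.heappush(heap, (nd, nbr))
    if d not in dist:
        return None
    path = [d]
    while path[-1] != s:
        path.append(prev[path[-1]])
    return path[::-1]


def primary_and_failover(graph, s, d, weight, penalty=1e6):
    """Primary path plus a failover that avoids primary links if it can."""
    primary = shortest_path(graph, s, d, weight)
    if primary is None:
        return None, None
    w = dict(weight)
    for u, nbr in zip(primary, primary[1:]):
        # Assumption: penalize both directions of each primary link.
        w[(u, nbr)] = weight[(u, nbr)] + penalty
        if (nbr, u) in w:
            w[(nbr, u)] = weight[(nbr, u)] + penalty
    return primary, shortest_path(graph, s, d, w)
```

  • If no link-disjoint alternative exists, the penalized search still returns a path that reuses some primary links, so a caller that needs strict disjointness would have to check for overlap.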
  • FIG. 4 shows another exemplary GreedyTree method to dynamically adjust the topology according to changing network traffic demand, different from the above method. This mechanism is a tree inspired design and attempts to form a tree in such a way that traffic is concentrated towards the leaves, so that voluminous flows don't occupy large of hops. In this method, the input is a traffic matrix D (traffic demand between any pair of racks) where Di,j denotes traffic travelling from ToR i to ToR j. D is asymmetric due to the directional nature of network traffic. First, the method initializes a virtual node set V (302). Next, the method checks if V has only one element (304) and if so, exits processing. Alternatively, the method determines a traffic matrix M over the set V (306), and then applies maximum weighted bipartite matching to determine which pairs of nodes should be connected to form a higher level virtual node (308). Next, for each pair of nodes to connect, standard matching is used to determine the real underlying nodes to connect (310). If there are not enough wavelengths to connect the nodes, the method reassigns least used wavelengths from the lower levels while maintaining connectivity (310). The method loops back to 304 until all elements are processed.
  • In one embodiment, in each iteration the method attempts to connect the pairs of virtual nodes that yield the maximum benefit by finding a matching. The initial set of virtual nodes is the same as the set of ToRs. At every stage, pairs of virtual nodes from the previous stage are connected. The total bandwidth demand across two virtual nodes is first computed by summing the demands from the real nodes in each virtual node to the real nodes in the other. These pair-wise demands are used as weights for a standard matching algorithm (such as Edmonds' algorithm, among others) to obtain the best set of virtual edges. Each virtual edge can have one or more real edges and a number of wavelengths. These edges and wavelengths are determined by a heuristic-based function that uses matching restricted to the sets of nodes in the two virtual nodes being connected. If more wavelengths and links are required than are available from the two virtual nodes, then links and wavelengths from the lower level are harvested (least useful at the lower level first) while preserving connectivity. The algorithm iterates until it has built one large virtual node. Once the method terminates, all configurations are pushed to the optical elements.
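  • The level-by-level merging described above can be sketched as follows. This is a minimal illustration, not the patented procedure: the function names are hypothetical, a simple greedy pairing is swapped in for the maximum weighted matching (e.g., Edmonds' algorithm) named in the text, and the selection of real edges and wavelengths for each virtual edge is omitted.

```python
from itertools import combinations


def pair_demand(D, group_a, group_b):
    """Total demand between two virtual nodes: sum of D[a][b] + D[b][a]."""
    return sum(D[a][b] + D[b][a] for a in group_a for b in group_b)


def greedy_pairing(D, virtual_nodes):
    """Greedy stand-in for maximum weighted matching (NOT Edmonds' algorithm):
    repeatedly take the heaviest remaining pair of unmatched virtual nodes."""
    candidates = sorted(
        combinations(range(len(virtual_nodes)), 2),
        key=lambda p: pair_demand(D, virtual_nodes[p[0]], virtual_nodes[p[1]]),
        reverse=True,
    )
    matched, pairs = set(), []
    for i, j in candidates:
        if i not in matched and j not in matched:
            matched.update((i, j))
            pairs.append((i, j))
    return pairs, matched


def greedy_tree_levels(D):
    """Merge virtual nodes level by level until one remains; return all levels."""
    level = [frozenset([t]) for t in range(len(D))]  # one virtual node per ToR
    levels = [level]
    while len(level) > 1:
        pairs, matched = greedy_pairing(D, level)
        merged = [level[i] | level[j] for i, j in pairs]
        # With an odd count, the unmatched virtual node is carried up unchanged.
        merged += [level[k] for k in range(len(level)) if k not in matched]
        level = merged
        levels.append(level)
    return levels


# Asymmetric 4-ToR traffic matrix: D[i][j] is traffic from ToR i to ToR j.
D = [
    [0, 9, 1, 0],
    [2, 0, 0, 1],
    [0, 1, 0, 8],
    [1, 0, 3, 0],
]
levels = greedy_tree_levels(D)
```

  • In this example the heaviest pairs (ToRs 0 and 1, ToRs 2 and 3) are joined first, so the high-volume flows end up one hop apart near the leaves, and the two resulting virtual nodes are joined at the next level.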
  • Another heuristic alternative to FIG. 4 is discussed next. FIG. 5 shows an exemplary Darwinian method to dynamically reconstruct routing paths according to changing network traffic demand. First, the method initializes a virtual node set V (330). Next, the method determines a traffic matrix M over the set V (332), and then applies a 4-matching technique to determine which pairs of nodes should be connected to form a higher-level virtual node (334). Next, the method makes the graph connected using edge-exchange operations (336).
  • The Darwinian heuristic attempts to localize high-volume flows over direct circuit links. This is accomplished by using a weighted matching restricted to a degree of 4 (i.e., weighted 4-matching), representing the number of connections each ToR has to the MEMS. However, this does not impose connectivity. Connectivity is ensured using the edge-exchange operation on the edges of lowest weight across pairs of components, thus connecting them. This edge-exchange operation is repeated until connectivity is achieved between all source-destination pairs.
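  • A rough sketch of these two steps follows. It rests on several assumptions that are not in the source: a greedy heaviest-first selection stands in for true weighted 4-matching, "edges of lowest weight across pairs of components" is read as the lowest-weight edge inside each of two components, every component is assumed to contain at least one edge, and all function names are hypothetical.

```python
def components(nodes, edges):
    """Connected components of an undirected graph via union-find."""
    parent = {n: n for n in nodes}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for u, v in edges:
        parent[find(u)] = find(v)
    groups = {}
    for n in nodes:
        groups.setdefault(find(n), set()).add(n)
    return list(groups.values())


def greedy_b_matching(demand, nodes, degree=4):
    """Greedy approximation of weighted b-matching: heaviest pairs first,
    skipping any pair whose endpoint already has `degree` links."""
    deg = {n: 0 for n in nodes}
    edges = []
    for (u, v), _ in sorted(demand.items(), key=lambda kv: -kv[1]):
        if deg[u] < degree and deg[v] < degree:
            edges.append((u, v))
            deg[u] += 1
            deg[v] += 1
    return edges


def edge_exchange(demand, edges, comp_a, comp_b):
    """Degree-preserving swap: remove the lightest edge (a, b) of comp_a and
    (c, d) of comp_b, then add (a, c) and (b, d)."""
    def lightest(comp):
        inside = [e for e in edges if e[0] in comp]
        return min(inside, key=lambda e: demand.get(e, 0))

    (a, b), (c, d) = lightest(comp_a), lightest(comp_b)
    rest = [e for e in edges if e not in ((a, b), (c, d))]
    return rest + [(a, c), (b, d)]


def connect(demand, nodes, edges, max_rounds=100):
    """Repeat edge exchanges until the graph is connected. If both removed
    edges are bridges, one swap may not suffice, hence the re-check loop."""
    for _ in range(max_rounds):
        comps = components(nodes, edges)
        if len(comps) == 1:
            break
        edges = edge_exchange(demand, edges, comps[0], comps[1])
    return edges


nodes = list(range(6))
# Symmetric pair demands keyed by (u, v) with u < v; degree 2 keeps it small.
demand = {(0, 1): 10, (1, 2): 9, (0, 2): 8,
          (3, 4): 7, (4, 5): 6, (3, 5): 5, (2, 3): 1}
matched = greedy_b_matching(demand, nodes, degree=2)
final = connect(demand, nodes, matched)
```

  • Here the matching alone produces two disjoint triangles; a single exchange of their lightest edges joins them into one ring while every ToR keeps exactly two links.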
  • The Darwinian heuristic is based on the idea of starting out with a structured topology (such as a k-regular circulant graph, a Kautz digraph, an incomplete hypercube, or even a DCell-like topology) from which the topology keeps evolving. Over this topology, degree-preserving operations can be used to better conform to the traffic matrix. For example, if two ToRs that seek to establish a high-bandwidth connection are each connected to another ToR and are not serving much transit traffic, they can be connected directly by breaking one of their current links. The advantage of this method is that it is iterative and each iteration should be computationally inexpensive. It is also likely that a large number of large flows do not change simultaneously, so a large number of such operations should rarely be required. It is possible to use this method as a continuous background optimization. The objective is to ensure that a weighted sum of path lengths is minimized.
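  • One way to write this objective is given below. The source does not specify the weights, so this sketch assumes they are the traffic demands D_{i,j}; h_G(i,j) (the hop count of the route from ToR i to ToR j in topology G) and k (the fixed number of links per ToR) are notation introduced here for illustration only.

```latex
\min_{G} \; \sum_{i \neq j} D_{i,j} \, h_G(i,j)
\qquad \text{subject to} \qquad \deg_G(v) = k \ \text{ for every ToR } v
```

  • Under this reading, a degree-preserving exchange is worth applying when it lowers the sum, for example by turning a heavy multi-hop flow into a one-hop flow at the cost of lengthening lighter ones.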
  • The GreedyTree and Darwinian heuristics or processes reconstruct the network topology in adaptation to changing traffic demand and can deal with arbitrary traffic patterns. This is in contrast to conventional systems where a particular traffic pattern is assumed. The GreedyTree method intelligently utilizes the switching and reconfiguration functionalities of WSS and adaptively redistributes wavelength assignment to cope with topology and routing changes. This is also the first application of WSS in data center networks.
  • Once connectivity is achieved, the MEMS configuration is known. The system then finds routes using any standard routing scheme, such as shortest path or, preferably, a low-congestion routing scheme. In one embodiment shown in FIG. 6, a simple yet effective shortest path routing scheme called Fault-tolerant Proteus Routing (FPR) is used.
  • In FIG. 6, the input is the topology represented by a graph G(V, E), the edge weights w, the source node s, and the destination node d. During initialization, the weight of each edge is set to one (350). Next, the method determines the primary path between s and d: PPrimary=shortest_path(G, s, d, w) (352). The method then determines the failover path between s and d (354). In one embodiment, this is done by setting w(e)=w(e)+|E| for each edge e on the primary path PPrimary and then computing PFailover=shortest_path(G, s, d, w). Finally, the method returns PPrimary and PFailover as the result (356).
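  • The steps of FIG. 6 can be sketched as follows. This is a minimal illustration that assumes an undirected graph and uses Dijkstra's algorithm for shortest_path; the source does not say which shortest-path algorithm is used, and the function name fpr and the edge-list input format are hypothetical.

```python
import heapq


def shortest_path(adj, s, d, w):
    """Dijkstra on an undirected graph; w maps frozenset({u, v}) to a weight.
    Returns the node list from s to d, or None if d is unreachable."""
    dist, prev = {s: 0}, {}
    heap = [(0, s)]
    while heap:
        du, u = heapq.heappop(heap)
        if u == d:
            break
        if du > dist.get(u, float("inf")):
            continue  # stale heap entry
        for v in adj[u]:
            nd = du + w[frozenset((u, v))]
            if nd < dist.get(v, float("inf")):
                dist[v], prev[v] = nd, u
                heapq.heappush(heap, (nd, v))
    if d not in dist:
        return None
    path = [d]
    while path[-1] != s:
        path.append(prev[path[-1]])
    return path[::-1]


def fpr(edges, s, d):
    """Return (primary, failover) paths between s and d."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, []).append(v)
        adj.setdefault(v, []).append(u)
    w = {frozenset(e): 1 for e in edges}        # step 350: unit weights
    primary = shortest_path(adj, s, d, w)        # step 352
    if primary is None:
        return None, None
    for u, v in zip(primary, primary[1:]):       # step 354: w(e) = w(e) + |E|
        w[frozenset((u, v))] += len(edges)
    failover = shortest_path(adj, s, d, w)
    return primary, failover                     # step 356


edges = [("A", "B"), ("B", "D"), ("A", "C"), ("C", "E"), ("E", "D")]
primary, failover = fpr(edges, "A", "D")
```

  • The |E| penalty works because a simple path uses at most |E| edges: with unit weights, any path that avoids the primary edges costs at most |E|, while any path that reuses even one primary edge costs at least |E|+1. The failover path therefore avoids the primary path's edges whenever such a route exists.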
  • The basic idea of FPR is simple. Leveraging network status, the Manager is responsible for calculating the routing table for each ToR switch. In one embodiment, for simplicity, the shortest path routing method of FIG. 6 is used for routing table construction. However, the scheme can readily be applied with any other, more sophisticated routing calculation. When link or node failures happen, the affected devices report to the Manager, and the Manager reacts by invoking the control software to rearrange the link capacity or topology (based on the degree of failures) to bypass the failed parts. In this sense, FPR is a simple and flexible way to handle failures, largely due to the architecture of FIG. 1.
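  • The centralized table construction and failure reaction can be illustrated with the sketch below. It is a simplification under stated assumptions: the Manager class and its method names are hypothetical, tables hold only a next hop per destination, and a reported link failure triggers recomputation over the surviving links only, whereas the text also allows link capacity or topology to be rearranged.

```python
from collections import deque


def next_hop_table(adj, src):
    """BFS from src; table[dst] is the first hop on a shortest path to dst."""
    table, queue, seen = {}, deque(), {src}
    for nbr in sorted(adj[src]):
        seen.add(nbr)
        table[nbr] = nbr
        queue.append(nbr)
    while queue:
        u = queue.popleft()
        for v in sorted(adj[u]):
            if v not in seen:
                seen.add(v)
                table[v] = table[u]  # inherit the first hop used to reach u
                queue.append(v)
    return table


class Manager:
    """Hypothetical central manager holding the ToR-level topology."""

    def __init__(self, links):
        self.adj = {}
        for u, v in links:
            self.adj.setdefault(u, set()).add(v)
            self.adj.setdefault(v, set()).add(u)

    def routing_tables(self):
        """One next-hop table per ToR, to be pushed to the switches."""
        return {tor: next_hop_table(self.adj, tor) for tor in self.adj}

    def report_link_failure(self, u, v):
        """Drop the failed link and recompute every table."""
        self.adj[u].discard(v)
        self.adj[v].discard(u)
        return self.routing_tables()


manager = Manager([(0, 1), (1, 2), (2, 3), (3, 0)])  # four ToRs in a ring
before = manager.routing_tables()
after = manager.report_link_failure(0, 1)
```

  • In the ring example, ToR 0 initially reaches ToR 2 through ToR 1; after the 0-1 link is reported as failed, all of ToR 0's routes go through ToR 3.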
  • FIG. 7 shows an exemplary wavelength assignment method. Turning now to FIG. 7, the input is a system graph and the capacity demand on each link. For each link, the method determines the number n of wavelengths needed to satisfy the capacity demand and replaces the link with n parallel directed links (380). Next, the method converts the resulting directed graph to an undirected graph by merging anti-parallel links (382). The method then applies a standard edge-coloring heuristic to this graph, where wavelengths are the colors used to color the edges (384). If the resulting coloring uses one extra color, the method removes the color (i.e., wavelength) that is least used (386).
  • Using the method of FIG. 7, the system provisions or allocates wavelengths to serve capacity requirements. In one example, the system first decides the number n of wavelengths that must be allocated to each optical fiber link to meet its capacity requirement and replaces that link with n parallel directed links in the graph. For instance, if each wavelength carries at most 10 Gb/s and the capacity requirement of a particular link is 45 Gb/s, then the system replaces this link with 5 parallel links in the graph. After this operation, we obtain a graph in which each node has degree 32. In the second step, the system converts the resulting directed graph to an undirected graph by merging anti-parallel links, i.e., merging the directed link from node u to v with the one from v to u. The result is a new undirected graph with node degree 32. Then, the system applies a standard edge-coloring heuristic to this graph, where wavelengths are the colors used to color the edges. Since the heuristic may end up coloring the graph with one extra color (i.e., 33), the final step is simply to remove the color (i.e., wavelength) that is least used.
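  • Steps 380 to 384 can be sketched as below. Several choices here are assumptions rather than details from the source: the merged count for a pair of anti-parallel links is taken as the larger of the two directions, a simple first-fit greedy coloring stands in for the unspecified "standard edge-coloring heuristic", and step 386 (dropping the least used color) is omitted. The function name and input format are hypothetical.

```python
import math
from collections import Counter

WAVELENGTH_GBPS = 10  # per-wavelength capacity from the example in the text


def assign_wavelengths(demand_gbps):
    """demand_gbps maps a directed link (u, v) to its demand in Gb/s.
    Returns {(u, v): [wavelength indices]} for each merged undirected link."""
    # Step 380: wavelengths (parallel directed links) needed per directed link.
    need = {link: math.ceil(d / WAVELENGTH_GBPS)
            for link, d in demand_gbps.items()}

    # Step 382: merge anti-parallel links. ASSUMPTION: u->v and v->u share
    # wavelengths, so the merged count is the larger of the two directions.
    merged = Counter()
    for (u, v), n in need.items():
        key = (min(u, v), max(u, v))
        merged[key] = max(merged[key], n)

    # Step 384: first-fit edge coloring. Each parallel edge takes the smallest
    # color (wavelength index) not yet used at either endpoint.
    used, coloring = {}, {}
    for (u, v), n in sorted(merged.items()):
        used_u = used.setdefault(u, set())
        used_v = used.setdefault(v, set())
        colors, c = [], 0
        while len(colors) < n:
            if c not in used_u and c not in used_v:
                colors.append(c)
                used_u.add(c)
                used_v.add(c)
            c += 1
        coloring[(u, v)] = colors
    return coloring


demand = {("A", "B"): 45, ("B", "A"): 20, ("B", "C"): 10, ("A", "C"): 25}
coloring = assign_wavelengths(demand)
```

  • The 45 Gb/s link needs ceil(45/10) = 5 wavelengths, matching the example in the text. No wavelength is reused at any node, which is the edge-coloring constraint that lets each ToR port carry distinct wavelengths.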
  • Next, a hop-by-hop routing method is discussed. This method automatically generates hop-by-hop routing protocols based on network topology changes. This is also a breakthrough in optical communications especially in the context of data center networks, where only point-to-point optical communication is considered.
  • Because the system does not require underlying all-to-all electrical connectivity between the servers, and because the number of possible optical paths between servers is physically limited, hop-by-hop routing is a necessary part of the design. If a direct optical path does not exist, a hop-by-hop path can be used instead. For this purpose, a multi-hop routing protocol is used. Once a suitable configuration and paths have been computed, these are pushed to all ToRs. ToRs thus know their routes to all other ToRs and use source routing. Each packet from a server destined to a server outside its ToR is tunneled through this source-routing protocol between ToRs. At the source ToR, the sequence of ToRs to traverse is specified in the header, and the packet is sent to the first ToR through the local forwarding table. The first hop then looks at the next hop in the sequence and sends the packet to it, and this is repeated until the data reaches the destination.
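  • The source-routing behavior described above can be modeled with the short sketch below. The Packet structure, the trace field, and the function names are hypothetical illustrations; the source does not specify a header format.

```python
from dataclasses import dataclass, field


@dataclass
class Packet:
    payload: str
    route: list  # remaining ToR hops, ending at the destination ToR
    trace: list = field(default_factory=list)  # ToRs visited, for illustration


def forward(packet, current_tor):
    """Process a packet at current_tor; return the next ToR, or None once the
    route is exhausted (current_tor is then the destination)."""
    packet.trace.append(current_tor)
    if not packet.route:
        return None
    return packet.route.pop(0)


def send(payload, source_tor, path):
    """path is the manager-computed sequence of ToRs after source_tor."""
    packet = Packet(payload, list(path))
    tor = source_tor
    while tor is not None:
        tor = forward(packet, tor)
    return packet.trace


visited = send("hello", "ToR1", ["ToR4", "ToR7"])
```

  • Each intermediate ToR only needs to read the next entry in the header, so no per-destination state beyond the local forwarding table is required at transit ToRs.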
  • The all-optical network described herein can be easily supplemented with other forms of network connectivity including wireless and electrical networks.
  • It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit or scope of the invention. Thus, it is intended that the present invention covers the modifications and variations of this invention provided they come within the scope of the appended claims and their equivalents.

Claims (24)

1. A method to communicate data, comprising
forming an all optical network backbone; and
performing hop-by-hop routing (multi-hop routing) over the optical network backbone.
2. The method of claim 1, comprising receiving a traffic matrix to create on-demand the network topology.
3. The method of claim 2, comprising estimating traffic demand.
4. The method of claim 3, comprising applying a Greedy-Tree heuristic.
5. The method of claim 4, comprising determining a total bandwidth demand across two virtual-nodes by summing demands from the real nodes in each virtual node to the other.
6. The method of claim 4, comprising determining:
$$\mathrm{PairDemand}\left(vN_i^q, vN_j^q\right) = \sum_{a \in vN_i^q,\; b \in vN_j^q} \left( D_{ab} + D_{ba} \right)$$
wherein pairwise demands are used as weights for standard matching to obtain the best set of virtual-edges and wherein each virtual edge can have one or more real edges and a number of wavelengths.
7. The method of claim 3, comprising applying a Darwinian heuristic.
8. The method of claim 7, comprising performing an n-matching technique to determine which pairs of nodes should be connected to form a higher level virtual node and generating graph connectivity using edge-exchange operations, wherein connectivity is ensured using the edge-exchange operation on edges of lowest weight across pairs of components.
9. The method of claim 7, comprising performing weighted matching restricted to a degree of N (i.e., weighted N-matching), where N is the number of connections to other top-of-racks (ToRs).
10. The method of claim 1, comprising performing fault-tolerant routing.
11. The method of claim 10, comprising performing shortest path routing for routing table construction and rearranging a link capacity or topology based on the degree of failures to bypass a failed node.
12. The method of claim 11, comprising determining a primary path between nodes s and d by determining PPrimary=shortest_path(G, s, d, w).
13. The method of claim 11, comprising determining a fail-over path between nodes s and d by determining, for each edge e on the primary path PPrimary, calculate w(e)=w(e)+|E|; and PFailover=shortest_path(G, s, d, w).
14. The method of claim 1, comprising assigning a wavelength to serve a capacity demand.
15. The method of claim 14, comprising determining n wavelengths to satisfy the capacity demand and replacing a link with n parallel directed links.
16. The method of claim 14, comprising merging anti-parallel links.
17. The method of claim 14, comprising applying a standard edge-coloring heuristics on a graph, wherein wavelengths correspond to colors to be used to color edges.
18. The method of claim 14, comprising removing a color corresponding to a wavelength that is least used.
19. The method of claim 1, comprising applying the multi-hop routing to form an optimal network topology that maximally serves overall network traffic demand.
20. The method of claim 19, wherein the multi-hop routing comprises source-routing.
21. The method of claim 1, comprising routing data over a supplementary electrical network or wireless network.
22. A method for interconnecting a data center network, said method comprising:
using hop-by-hop routing over an optical network; and
using bidirectional optical network devices to enable bidirectional communication over fiber.
23. The method of claim 22, comprising dynamically constructing a network topology.
24. A method to communicate over an optical network, comprising
dynamically constructing a network topology based on traffic demands and hop-by-hop routing; and
constructing a dynamically changing data center network (DCN) architecture.
US13/078,979 2010-07-08 2011-04-03 Optical switching network Abandoned US20120008944A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/078,979 US20120008944A1 (en) 2010-07-08 2011-04-03 Optical switching network

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US36248210P 2010-07-08 2010-07-08
US201161436283P 2011-01-26 2011-01-26
US13/078,979 US20120008944A1 (en) 2010-07-08 2011-04-03 Optical switching network

Publications (1)

Publication Number Publication Date
US20120008944A1 true US20120008944A1 (en) 2012-01-12

Family

ID=45438666

Family Applications (3)

Application Number Title Priority Date Filing Date
US13/078,979 Abandoned US20120008944A1 (en) 2010-07-08 2011-04-03 Optical switching network
US13/078,980 Active 2031-10-31 US8705954B2 (en) 2010-07-08 2011-04-03 Optical switching network
US13/078,978 Abandoned US20120008943A1 (en) 2010-07-08 2011-04-03 Optical switching network

Country Status (1)

Country Link
US (3) US20120008944A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014066241A1 (en) 2012-10-26 2014-05-01 Sodero Networks, Inc. Method and apparatus for implementing a multi-dimensional optical circuit switching fabric
US20140270761A1 (en) * 2012-10-26 2014-09-18 Sodero Networks, Inc. Method and Apparatus for Efficient and Transparent Network Management and Application Coordination for Software Defined Optical Switched Data Center Networks
US20150181317A1 (en) * 2013-12-24 2015-06-25 Nec Laboratories America, Inc. Scalable hybrid packet/circuit switching network architecture
US10158929B1 (en) * 2017-02-17 2018-12-18 Capital Com SV Investments Limited Specialized optical switches utilized to reduce latency in switching between hardware devices in computer systems and methods of use thereof
US10581736B1 (en) * 2018-11-13 2020-03-03 At&T Intellectual Property I, L.P. Traffic matrix prediction and fast reroute path computation in packet networks
US20230208517A1 (en) * 2021-12-28 2023-06-29 Mellanox Technologies, Ltd. Devices, systems, and methods for free space key exchange

Families Citing this family (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5633281B2 (en) * 2010-09-29 2014-12-03 富士通株式会社 Optical communication system, optical network management apparatus, and optical network management method
JP5842428B2 (en) * 2011-07-21 2016-01-13 富士通株式会社 Optical network and optical connection method
US8867915B1 (en) * 2012-01-03 2014-10-21 Google Inc. Dynamic data center network with optical circuit switch
US8965203B1 (en) * 2012-01-09 2015-02-24 Google Inc. Flexible non-modular data center with reconfigurable extended-reach optical network fabric
US20130250802A1 (en) * 2012-03-26 2013-09-26 Praveen Yalagandula Reducing cabling costs in a datacenter network
US8983293B2 (en) * 2012-04-25 2015-03-17 Ciena Corporation Electro-optical switching fabric systems and methods
CN103686466B (en) * 2012-09-12 2016-12-21 华为技术有限公司 The method and apparatus generating forwarding-table item for the equipment in optical-fiber network
CN103685399B (en) * 2012-09-17 2018-03-23 腾讯科技(深圳)有限公司 A kind of methods, devices and systems for logging in class Unix virtual containers
US20140241203A1 (en) * 2013-02-22 2014-08-28 Microsoft Corporation Programmable Physical Network Topologies for Datacenters
US9602434B1 (en) 2013-02-27 2017-03-21 Juniper Networks, Inc. Data center architecture utilizing optical switches
US9692639B1 (en) * 2013-03-15 2017-06-27 Google Inc. Achieving full bandwidth usage and max-min fairness in a computer network
US9184999B1 (en) * 2013-03-15 2015-11-10 Google Inc. Logical topology in a dynamic data center network
US9584885B2 (en) * 2013-05-10 2017-02-28 Huawei Technologies Co., Ltd. System and method for photonic switching
CN103338414B (en) * 2013-05-28 2016-05-25 苏州大学 A kind of method that minimizes IP over WDM network energy consumption
US9246760B1 (en) 2013-05-29 2016-01-26 Google Inc. System and method for reducing throughput loss responsive to network expansion
US20150043905A1 (en) * 2013-08-07 2015-02-12 Futurewei Technologies, Inc. System and Method for Photonic Switching and Controlling Photonic Switching in a Data Center
US9960878B2 (en) * 2013-10-01 2018-05-01 Indian Institute Of Technology Bombay Scalable ultra dense hypergraph network for data centers
US9520961B2 (en) * 2014-01-17 2016-12-13 Telefonaktiebolaget L M Ericsson (Publ) System and methods for optical lambda flow steering
US10116558B2 (en) 2014-01-24 2018-10-30 Fiber Mountain, Inc. Packet switch using physical layer fiber pathways
US9166692B1 (en) 2014-01-28 2015-10-20 Google Inc. Network fabric reconfiguration
US9678800B2 (en) 2014-01-30 2017-06-13 International Business Machines Corporation Optimum design method for configuration of servers in a data center environment
US10277496B2 (en) 2014-03-28 2019-04-30 Fiber Mountain, Inc. Built in alternate links within a switch
CN105025399A (en) * 2014-04-21 2015-11-04 江苏艾思特信息科技有限公司 Passive optical interconnection structure
WO2015164799A1 (en) * 2014-04-25 2015-10-29 Huawei Technologies Co., Ltd. Apparatus and methods for scalable photonic packet architectures using pic switches
US9537714B1 (en) * 2014-05-09 2017-01-03 Google Inc. Randomized rotation striping for direct connect networks
US9491526B1 (en) * 2014-05-12 2016-11-08 Google Inc. Dynamic data center network with a mesh of wavelength selective switches
CN103984660B (en) * 2014-05-19 2018-02-23 浪潮电子信息产业股份有限公司 A kind of design method exchanged based on light with the whole machine cabinet framework of distributed network
SG11201700004PA (en) 2014-07-03 2017-01-27 Fiber Mountain Inc Data center path switch with improved path interconnection architecture
CN104168204B (en) * 2014-07-10 2017-07-07 深圳大学 The energy-conservation traffic grooming method and system of the obstruction IP transmission networks based on WDM
US9989724B2 (en) 2014-09-29 2018-06-05 Fiber Mountain, Inc. Data center network
US10382845B2 (en) 2014-09-29 2019-08-13 Fiber Mountain, Inc. System for increasing fiber port density in data center applications
US10038514B2 (en) 2014-12-23 2018-07-31 Telefonaktiebolaget Lm Ericsson (Publ) Datacentre for processing a service
WO2016164769A1 (en) * 2015-04-09 2016-10-13 Fiber Mountain, Inc. Data center endpoint network device with built in switch
US10491973B2 (en) * 2015-04-24 2019-11-26 Rockley Photonics Limited Optoelectronic switch
US9722694B2 (en) 2015-09-11 2017-08-01 Microsoft Technology Licensing, Llc Backup communications scheme in computer networks
CN105610494B (en) * 2015-10-29 2018-05-04 电子科技大学 A kind of monitoring mark design method based on node pre-selection in all-optical network
US9894427B2 (en) * 2015-11-11 2018-02-13 Juniper Networks, Inc. Methods and apparatus for a flattened data center network employing wavelength-agnostic endpoints
CN106817288B (en) * 2015-11-30 2019-06-14 华为技术有限公司 A kind of data centre network system and signal transmission system
US10306344B2 (en) * 2016-07-04 2019-05-28 Huawei Technologies Co., Ltd. Method and system for distributed control of large photonic switched networks
US10791174B2 (en) * 2016-07-28 2020-09-29 Intel Corporation Mechanism for efficient discovery of storage resources in a rack scale architecture system
WO2018053179A1 (en) 2016-09-14 2018-03-22 Fiber Mountain, Inc. Intelligent fiber port management
TWI668557B (en) 2017-02-14 2019-08-11 美商莫仕有限公司 Server system
CN108199977A (en) * 2017-12-29 2018-06-22 国网湖南省电力有限公司 A kind of multihop routing and dispatching method of dual-active data center
US10687130B2 (en) * 2018-06-11 2020-06-16 Delta Electronics, Inc. Intelligence-defined optical tunnel network system controller and control method thereof
US10491302B1 (en) * 2018-08-06 2019-11-26 Hewlett Packard Enterprise Development Lp Rack-level photonic solution
US10623101B1 (en) 2018-08-07 2020-04-14 Hewlett Packard Enterprise Development Lp Hyperscale photonics connectivity solution
US10757041B2 (en) 2018-12-03 2020-08-25 Hewlett Packard Enterprise Development Lp Full server-level redundancy using a single network interface controller(NIC) and a single NIC card
CN109818792B (en) * 2019-01-29 2022-02-15 西安电子科技大学 Controller based on second-order linear system time-varying coupling complex dynamic network model
US20220394362A1 (en) * 2019-11-15 2022-12-08 The Regents Of The University Of California Methods, systems, and devices for bandwidth steering using photonic devices
US11381891B2 (en) * 2020-04-30 2022-07-05 Hewlett Packard Enterprise Development Lp Virtual fiber adapter for wavelength-as-a-service communications
TWI739635B (en) * 2020-10-20 2021-09-11 國立陽明交通大學 Reliability evaluation method of multi-state distributed network system
US20220174000A1 (en) * 2020-12-01 2022-06-02 Mellanox Technologies Tlv Ltd. Routing with a fixed matchings switch
EP4009564B1 (en) * 2020-12-03 2023-12-06 Hon Lin Technology Co., Ltd. Method for allocating wireless resources based on sensitivity to inter-cell interference and apparatus thereof
US11800266B2 (en) * 2021-05-19 2023-10-24 Mellanox Technologies, Ltd. Hybrid optoelectrical switches
US20220417624A1 (en) * 2021-06-29 2022-12-29 Nokia Solutions And Networks Oy Dynamic network topology control
CN114584868B (en) * 2022-02-12 2023-07-18 国网宁夏电力有限公司电力科学研究院 Data center photoelectric hybrid architecture upgrading method
US11711270B1 (en) * 2022-04-19 2023-07-25 Ciena Corporation Creating an optimal node interconnect topology given certain constraints
WO2024038544A1 (en) * 2022-08-18 2024-02-22 Nippon Telegraph And Telephone Corporation Interconnection apparatus and switching system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7340169B2 (en) * 2003-11-13 2008-03-04 Intel Corporation Dynamic route discovery for optical switched networks using peer routing
US7363353B2 (en) * 2001-07-06 2008-04-22 Juniper Networks, Inc. Content service aggregation device for a data center
US7787770B2 (en) * 2003-01-27 2010-08-31 Ciena Corporation Methods for co-modelling and analyzing packet networks operating over optical networks

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6208443B1 (en) * 1996-10-03 2001-03-27 International Business Machines Corporation Dynamic optical add-drop multiplexers and wavelength-routing networks with improved survivability and minimized spectral filtering
JP4571933B2 (en) * 2006-12-28 2010-10-27 富士通株式会社 Optical transmission apparatus and optical transmission method
KR101668573B1 (en) * 2007-12-21 2016-10-21 칼 짜이스 에스엠테 게엠베하 Illumination system for a microlithographic projection exposure apparatus
US8155520B1 (en) * 2008-04-16 2012-04-10 Cyan, Inc. Multi-fabric shelf for a transport network

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7363353B2 (en) * 2001-07-06 2008-04-22 Juniper Networks, Inc. Content service aggregation device for a data center
US7787770B2 (en) * 2003-01-27 2010-08-31 Ciena Corporation Methods for co-modelling and analyzing packet networks operating over optical networks
US7340169B2 (en) * 2003-11-13 2008-03-04 Intel Corporation Dynamic route discovery for optical switched networks using peer routing

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9332324B2 (en) * 2012-10-26 2016-05-03 Guohua Liu Method and apparatus for efficient and transparent network management and application coordination for software defined optical switched data center networks
US20140119728A1 (en) * 2012-10-26 2014-05-01 Sodero Networks, Inc. Method and apparatus for implementing a multi-dimensional optical circuit switching fabric
US20140270761A1 (en) * 2012-10-26 2014-09-18 Sodero Networks, Inc. Method and Apparatus for Efficient and Transparent Network Management and Application Coordination for Software Defined Optical Switched Data Center Networks
WO2014066241A1 (en) 2012-10-26 2014-05-01 Sodero Networks, Inc. Method and apparatus for implementing a multi-dimensional optical circuit switching fabric
CN104813603A (en) * 2012-10-26 2015-07-29 索德若网络有限公司 Method and apparatus for implementing a multi-dimensional optical circuit switching fabric
US9332323B2 (en) * 2012-10-26 2016-05-03 Guohua Liu Method and apparatus for implementing a multi-dimensional optical circuit switching fabric
US20150181317A1 (en) * 2013-12-24 2015-06-25 Nec Laboratories America, Inc. Scalable hybrid packet/circuit switching network architecture
US9654852B2 (en) * 2013-12-24 2017-05-16 Nec Corporation Scalable hybrid packet/circuit switching network architecture
US10158929B1 (en) * 2017-02-17 2018-12-18 Capital Com SV Investments Limited Specialized optical switches utilized to reduce latency in switching between hardware devices in computer systems and methods of use thereof
US20190387292A1 (en) * 2017-02-17 2019-12-19 Capital Com SV Investments Limited Specialized optical switches utilized to reduce latency in switching between hardware devices in computer systems and methods of use thereof
US10581736B1 (en) * 2018-11-13 2020-03-03 At&T Intellectual Property I, L.P. Traffic matrix prediction and fast reroute path computation in packet networks
US10805214B2 (en) * 2018-11-13 2020-10-13 At&T Intellectual Property I, L.P. Traffic matrix prediction and fast reroute path computation in packet networks
US11303565B2 (en) * 2018-11-13 2022-04-12 At&T Intellectual Property I, L.P. Traffic matrix prediction and fast reroute path computation in packet networks
US20230208517A1 (en) * 2021-12-28 2023-06-29 Mellanox Technologies, Ltd. Devices, systems, and methods for free space key exchange

Also Published As

Publication number Publication date
US20120008943A1 (en) 2012-01-12
US20120008945A1 (en) 2012-01-12
US8705954B2 (en) 2014-04-22

Similar Documents

Publication Publication Date Title
US8705954B2 (en) Optical switching network
Gerstel et al. Cost-effective traffic grooming in WDM rings
Singla et al. Proteus: a topology malleable data center network
US6535313B1 (en) Dynamically assignable optical signal access control apparatus
EP1263258B1 (en) Communications network for a metropolitan area
US9332323B2 (en) Method and apparatus for implementing a multi-dimensional optical circuit switching fabric
US8503879B2 (en) Hybrid optical/electrical switching system for data center networks
US8818198B2 (en) Photonic link information collection and advertisement systems and methods
US8249451B2 (en) Methods for characterizing optical switches and multiplexers/demultiplexers
US20020159114A1 (en) Method and apparatus for routing signals through an optical network
Assi et al. Integrated routing algorithms for provisioning “sub-wavelength” connections in IP-over-WDM networks
Chen et al. Hybrid switching and p-routing for optical burst switching networks
Yang et al. Interdomain dynamic wavelength routing in the next-generation translucent optical Internet
US7706383B2 (en) Minimum contention distributed wavelength assignment in optical transport networks
Barry et al. Optical switching in datacenters: architectures based on optical circuit switching
Patel et al. Optical-layer traffic grooming in flexible grid WDM networks
WO2013178006A1 (en) Path establishment method and device thereof
Xu et al. Optically interconnected data center networks
Qin et al. QoS for virtual private networks (VPN) over optical WDM networks
Xu et al. Multicarrier-collaboration-based emergency packet transport network construction in disaster recovery
US20230209230A1 (en) Transport of packets over optical networks
Sumedh et al. Design and analysis of an optical transit network
Yamanaka et al. Traffic engineering and signaling technologies in photonic-GMPLS-router networks
Assi et al. Designing a survivable IP-over-WDM network
JP2003198609A (en) Optical communication network, node equipment and path route calculating method used therein

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION