JP2013179429A

JP2013179429A - Parallel packet processing method and device for switching distribution destination

Info

Publication number: JP2013179429A
Application number: JP2012041507A
Authority: JP
Inventors: Hiroki Date; 拓紀伊達; Takashi Kurimoto; 崇栗本; Kenichi Higuchi; 健一樋口; Hitoshi Kaneko; 斉金子; Akira Misawa; 明三澤
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2012-02-28
Filing date: 2012-02-28
Publication date: 2013-09-09
Anticipated expiration: 2032-02-28
Also published as: JP5672504B2

Abstract

PROBLEM TO BE SOLVED: To change a distribution destination when processing packets by parallel processing by a plurality of processing cores, and to reduce influence of processing delay generated when changing the distribution destination.SOLUTION: In a distribution part 107 for distributing flow to a plurality of processing cores, utilization of two old and new distribution algorithms is made possible. The state of applying the old distribution algorithm and distributing the flow is switched to the state of applying the new distribution algorithm to the flow by temporally changing a threshold for a hash value calculated from the flow so as to increase the probability of applying the new distribution algorithm to the flow with the lapse of time for each flow.

Description

本発明は、ネットワーク内でデータを送受信および中継する方法及び装置に関し、特に、Ｌ７（レイヤ７）／Ｌ４（レイヤ４）スイッチのような、転送データでの広い範囲を参照して転送データを修正する処理や参照データの情報に基づきデータをある方路へ転送または廃棄する処理での、並列処理による性能向上の方法と、そのような方法に基づいて構成された装置に関する。 The present invention relates to a method and apparatus for transmitting / receiving and relaying data in a network, and in particular, corrects transfer data with reference to a wide range of transfer data such as L7 (Layer 7) / L4 (Layer 4) switches. The present invention relates to a method for improving performance by parallel processing in processing for transferring data or processing for transferring or discarding data to a certain path based on information of reference data, and an apparatus configured based on such a method.

パケット転送は、一般に、ネットワーク内に設けられるノード装置によって処理される。パケット転送処理に対する要求の多様化と高度化により、パケット転送処理をソフトウェアベースで柔軟に実現しながらも、複数のノード装置を設けたり、ノード装置をマルチプロセッサ構成としたり、ノード装置内のプロセッサとして１チップに複数のコアが集積されたマルチコア構成のものを用いたりして並列処理によるスケールアウトを行い、性能向上を図ることが重要となっている。さらに、仮想化に合わせた柔軟な処理割当てなどを実現するために、動的に各処理プログラムに対する並列数を変更する、または、低消費電力化のために並列数を変化させて電源のオン／オフ制御を実行するという要求も高まってきている。ルータやＬ７スイッチでのパケット処理のように、短時間に大量の単純処理を実行しなければならない場合には、並列処理が特に有効である。 The packet transfer is generally processed by a node device provided in the network. With the diversification and sophistication of packet transfer processing requirements, packet transfer processing can be implemented flexibly on a software basis, but multiple node devices can be installed, node devices can be configured as multiprocessors, and the processor within the node device It is important to improve performance by scaling out by parallel processing using a multi-core configuration in which a plurality of cores are integrated on one chip. Furthermore, in order to realize flexible processing allocation in accordance with virtualization, the parallel number for each processing program is dynamically changed, or the parallel number is changed in order to reduce power consumption. There is also an increasing demand to execute off control. Parallel processing is particularly effective when a large amount of simple processing must be executed in a short time, such as packet processing at a router or L7 switch.

パケット転送処理での一般的な並列処理は、実際に処理を行う複数のパケット処理部と、それらのパケット処理部に対して負荷分散などを考慮してパケットあるいはデータを振り分ける振分部で実現される。マルチコア構成のプロセッサが用いられるとすれば、パケット処理部は、パケット処理用のプログラムをロードしたコアによって実現され、振分部は振分用のプログラムをロードしたコアによって実現される。したがって、パケット処理部は処理コアとも呼ばれ、振分部は振分コアとも呼ばれる。 General parallel processing in packet transfer processing is realized by a plurality of packet processing units that actually perform processing and a distribution unit that distributes packets or data to these packet processing units in consideration of load distribution and the like. The If a processor having a multi-core configuration is used, the packet processing unit is realized by a core loaded with a packet processing program, and the distribution unit is realized by a core loaded with a distribution program. Therefore, the packet processing unit is also called a processing core, and the sorting unit is also called a sorting core.

図１は、上述した並列処理を実行するパケット処理装置１０１の一般的な構成を示している。ネットワークとの接続に用いられる２つのインタフェース（Ｉ／Ｆ）部１０２，１０３が設けられており、インタフェース部１０２，１０３は、いずれもパケット入力部１０４とパケット出力部１０５とに接続している。パケット入力部１０４は、パケット処理の対象となるパケットを外部から受信するための受信インタフェース回路であり、パケット出力部１０５は、パケット処理がなされたパケットを外部に送信するための送信インタフェース回路である。パケット入力部１０４とパケット出力部１０５は、内部バス１０６に接続し、内部バス１０６には、振分部（振分コア）１０７と複数のパケット処理部（処理コア）１０８も接続している。パケット処理の実施において必要とするためにデータを保持するデータ保持部１０９が外部メモリとして設けられており、複数のパケット処理部１０８は、メモリバス１１０を介してデータ保持部１０９に接続する。データ保持部１０９は、複数のパケット処理部１０８に対する共有メモリとして機能するものである。この構成においてパケット入力部１０４、パケット出力部１０５、内部バス１０６、振分部１０７、パケット処理部１０８及びメモリバス１１０は、マルチコア構成のプロセッサ（例えばマルチコア構成のＣＰＵ（中央演算ユニット））１１１内に形成されている。 FIG. 1 shows a general configuration of a packet processing apparatus 101 that performs the parallel processing described above. Two interface (I / F) units 102 and 103 used for connection to a network are provided, and both interface units 102 and 103 are connected to a packet input unit 104 and a packet output unit 105. The packet input unit 104 is a reception interface circuit for receiving a packet to be subjected to packet processing from the outside, and the packet output unit 105 is a transmission interface circuit for transmitting the packet subjected to packet processing to the outside. . The packet input unit 104 and the packet output unit 105 are connected to an internal bus 106, and a distribution unit (distribution core) 107 and a plurality of packet processing units (processing cores) 108 are also connected to the internal bus 106. A data holding unit 109 is provided as an external memory to hold data to be necessary for performing packet processing, and the plurality of packet processing units 108 are connected to the data holding unit 109 via the memory bus 110. The data holding unit 109 functions as a shared memory for the plurality of packet processing units 108. In this configuration, the packet input unit 104, the packet output unit 105, the internal bus 106, the distribution unit 107, the packet processing unit 108, and the memory bus 110 are included in a multi-core processor (for example, a multi-core CPU (central processing unit)) 111. Is formed.

以下の説明において、個々のパケット処理部１０８を区別して説明する場合には、コア１、コア２、…などのように「コア」の用語に数字を付したものを使用するものとする。 In the following description, when the individual packet processing units 108 are described separately, the term “core” with a number such as core 1, core 2,... Is used.

各パケット処理部１０８で実行されるパケット処理は、例えば、ファイアウォール（ＦＷ）処理、ＱｏＳ(Quality of Service)処理、ＮＡＰＴ(Network Address Port Translation)処理、暗号／複号処理などが挙げられ、各パケット処理部１０８には、これらの処理を実行するためのプログラムが実装される。この構成では、ネットワーク側から受信したパケットは、パケット入力部１０４を介して、振分部１０７によって指定されたパケット処理部１０８に送られる。パケットが送られてきたパケット処理部１０８は、そのパケットに対してパケット処理を実行し、パケット処理がなされたパケットは、パケット出力部１０５を介してネットワーク側に送信される。 Examples of packet processing executed by each packet processing unit 108 include firewall (FW) processing, QoS (Quality of Service) processing, NAPT (Network Address Port Translation) processing, encryption / decoding processing, and the like. The processing unit 108 is installed with a program for executing these processes. In this configuration, a packet received from the network side is sent to the packet processing unit 108 designated by the distribution unit 107 via the packet input unit 104. The packet processing unit 108 that has received the packet performs packet processing on the packet, and the packet that has been subjected to the packet processing is transmitted to the network side via the packet output unit 105.

各パケット処理部１０８は、パケット処理の実施に際して、転送テーブルやアドレス変換テーブルなどのデータを参照するためにデータ保持部１０９にアクセスするが、このデータ保持部１０９は、複数のパケット処理部１０８からのアクセスが集中するため、ボトルネックとなりやすく、また、そもそも外部メモリであるデータ保持部１０９にアクセスする時間自体も性能低下を引き起こす。このボトルネックを回避するために、各パケット処理部１０８では、データコピー部として機能するキャッシュメモリが設けられており、このキャッシュメモリにデータのコピーの一部を保持することで高速な処理を実現している。ここでのキャッシュメモリは、例えば、マルチコアのプロセッサ１１１における各コアごとのキャッシュメモリとして設けられているものである。しかしキャッシュメモリの容量は限られていて必要な全データのコピーをキャッシュメモリ上に置くことはできず、キャッシュにおけるミスヒットが発生する。このようなミスヒットが発生すると、パケット処理部１０８がデータ保持部１０９へのアクセスを行う必要が生じて性能が低下する。図２は、上述したパケット処理装置における並列処理動作を示している。図２に示したものでは、各パケット処理部１０８（処理コア１〜４）は同一プログラムの実行が可能であるとし、振分部１０７は、ハッシュ関数などを用いた単純なアルゴリズムで後述するフロー単位で振分先を決定するものとしている。図２は、キャッシュミスヒットが発生した場合に、データ保持部１０９への問い合わせが発生することを示している。 Each packet processing unit 108 accesses the data holding unit 109 to refer to data such as a transfer table and an address conversion table when performing packet processing. The data holding unit 109 receives data from a plurality of packet processing units 108. The access to the data storage unit 109 is likely to become a bottleneck, and the time itself for accessing the data holding unit 109, which is an external memory, causes degradation in performance. In order to avoid this bottleneck, each packet processing unit 108 is provided with a cache memory that functions as a data copy unit, and high-speed processing is realized by holding a part of the data copy in this cache memory. doing. The cache memory here is provided as a cache memory for each core in the multi-core processor 111, for example. However, the capacity of the cache memory is limited, and a copy of all necessary data cannot be placed on the cache memory, causing a miss hit in the cache. When such a mishit occurs, the packet processing unit 108 needs to access the data holding unit 109, and the performance deteriorates. FIG. 2 shows a parallel processing operation in the packet processing apparatus described above. In the example shown in FIG. 2, it is assumed that each packet processing unit 108 (processing cores 1 to 4) can execute the same program, and the allocating unit 107 uses a simple algorithm using a hash function or the like to be described later. The allocation destination is determined in units. FIG. 2 shows that an inquiry to the data holding unit 109 occurs when a cache miss hit occurs.

できるかぎりキャッシュミスヒットを低減させるためには、振分部１０７において、同じ処理が施されることになるパケットを可能な限り同じパケット処理部へ振り分けるようにすればよい。このため振分部１０７は、パケットのヘッダ（5-tuple等）を参照し、同じパケット処理の対象となる一連のパケット群（フロー）を特定して、フロー単位に振り分けを行う。 In order to reduce cache miss hits as much as possible, the distribution unit 107 may distribute packets to be subjected to the same processing to the same packet processing unit as much as possible. For this reason, the allocating unit 107 refers to a packet header (5-tuple or the like), identifies a series of packet groups (flows) to be subjected to the same packet processing, and distributes them in units of flows.

また、複数のパケット処理部１０８で処理することになるパケットの振り分けを行うことから、振分部１０７には負荷が集中しやすい。振分処理がボトルネックとなることを防ぐためには、振分部１０７での振分処理として、単純な動作のものを用いる必要がある。個々のフローとそれらのフローに対して割り当てる処理コア（割当処理コア）のＩＤ（識別番号）とをメモリ上のテーブルで管理し、割当処理コアをメモリアクセスを伴う検索により特定するような振分動作は、メモリアクセスが多くてボトルネックとなりやすい。それよりも、非特許文献１に記載されるように、パケットヘッダでのフローを特定する領域からハッシュ関数などによりハッシュ値を算出して、そのハッシュ値に基づき、割当処理コアのＩＤが記述されたメモリアドレス番地を引く、もしくは、ハッシュ値からある程度まとめたフロー群で割当処理コアＩＤを検索する、といった動作から割当コアを求める動作の方が好ましく、これらの動作を用いることによって、メモリアクセス数を削減し、高速に処理可能とすることが可能である。 Further, since the packets to be processed by the plurality of packet processing units 108 are distributed, the load tends to concentrate on the distribution unit 107. In order to prevent the sorting process from becoming a bottleneck, it is necessary to use a simple operation as the sorting process in the sorting unit 107. Distribution in which individual flows and IDs (identification numbers) of processing cores (assignment processing cores) assigned to those flows are managed in a table on the memory, and the assignment processing cores are specified by a search involving memory access The operation tends to be a bottleneck due to many memory accesses. Instead, as described in Non-Patent Document 1, a hash value is calculated from an area for specifying a flow in the packet header by a hash function or the like, and the ID of the allocation processing core is described based on the hash value. It is preferable to obtain an allocation core from an operation such as subtracting a memory address or searching for an allocation processing core ID by a flow group collected to some extent from hash values. By using these operations, the number of memory accesses And can be processed at high speed.

このようにハッシュ関数を用いて振分を行う場合、柔軟な処理割当や省電力化のためにパケット処理部１０８の並列数を増減させると、新たに加えたパケット処理部１０８にも処理割当が行われ、あるいは、削減したパケット処理部には処理割当が行われないように、振分アルゴリズム（振分条件）を変更する。具体的には、ハッシュ値を求めるために使用するハッシュ関数の変更、ハッシュ値からフローをまとめる際に用いるしきい値の変更、まとまったフロー群と割当処理コアＩＤとの対応関係の変更などを行う必要がある。このような振分条件の変更を行うと、並列数の変更前後で多くのフローが異なったパケット処理部へ割り当てられることとなり、その結果、短時間にキャッシュミスヒットが集中して発生し、処理遅延が大きくなる。例えば、並列数を１つ増加させた場合は、新しいパケット処理部１０８に割り当てられたフローの処理では、全て、キャッシュミスヒットが発生することから、データ保持部１０９へのアクセス所要時間が長いことに起因して、パケット処理部１０８での処理遅延が発生する。 In this way, when allocating using a hash function, if the number of parallel processing of the packet processing units 108 is increased or decreased for flexible processing allocation or power saving, the newly allocated packet processing units 108 are also allocated for processing allocation. The distribution algorithm (distribution condition) is changed so that processing allocation is not performed for the packet processing units that are performed or reduced. Specifically, the change of the hash function used to obtain the hash value, the change of the threshold value used when collecting flows from the hash value, the change of the correspondence between the group of flows and the allocation processing core ID, etc. There is a need to do. When such a distribution condition is changed, many flows are assigned to different packet processing units before and after the change in the number of parallel processes. As a result, cache misses are concentrated in a short time, and processing is performed. Delay increases. For example, when the number of parallel processes is increased by 1, a cache miss hit occurs in all of the flow processes assigned to the new packet processing unit 108, so that the time required for accessing the data holding unit 109 is long. As a result, a processing delay in the packet processing unit 108 occurs.

図３は、パケット処理部１０８を１つ追加したときのキャッシュミスヒットの発生を説明する図である。（Ａ）に示すように、コア１〜コア３までの３個のパケット処理部（処理コア）１０８が設けられており、振分部１０７はそれらの処理コアに対してフローＡ〜Ｄを振分けているものとする。ここで新たにコア４を処理コアとして追加したとすると、振分部１０７は、振分アルゴリズムを３分割用のものから４分割用のものに切り替えて、フローＡ〜Ｄを４個のコア１〜４に改めて振分ける。その結果、（Ｂ）に示すように、各コアには従前のフローとは異なるフローが割り当てられることとなって、各コアにおいてキャッシュミスヒットが発生することになる。その結果、（Ｃ）に示すように、変更タイミングの直後に、処理遅延が急増することになる。 FIG. 3 is a diagram for explaining the occurrence of a cache miss when one packet processing unit 108 is added. As shown in (A), three packet processing units (processing cores) 108 from core 1 to core 3 are provided, and the distribution unit 107 distributes flows A to D to these processing cores. It shall be. Here, if the core 4 is newly added as a processing core, the distribution unit 107 switches the distribution algorithm from one for three divisions to one for four divisions, and flows A to D with four cores 1. Reassign to ~ 4. As a result, as shown in (B), a flow different from the previous flow is assigned to each core, and a cache miss hit occurs in each core. As a result, as shown in (C), the processing delay increases rapidly immediately after the change timing.

もう一例として、並列数を１つ減少させた場合は、残ったパケット処理部の全てに、減らしたパケット処理部の担当してたフローが割り当てられ、その結果、キャッシュミスヒットが複数のパケット処理部で発生し、データ保持部へのアクセス競合による待ち時間に起因して、各パケット処理部での処理遅延が発生する。個々のパケットに対する処理量は小さいものの多数のパケットに対して低遅延で順序逆転なく処理を実行することが求められるパケット転送処理において、このような振分先の変更時に発生する遅延は大きな課題である。 As another example, when the number of parallel processes is reduced by 1, the flow handled by the reduced packet processing unit is assigned to all of the remaining packet processing units. Processing delay in each packet processing unit due to a waiting time due to contention for access to the data holding unit. Although the amount of processing for each packet is small, the delay that occurs when changing the distribution destination is a major issue in packet transfer processing that requires processing with a low delay and no order inversion for many packets. is there.

特表２００２−５３８７２４号公報JP-T-2002-538724

パケット転送処理において並列処理を採用する場合、並列数の変更に伴って振分先を変更する際に大きな遅延が発生する。このような遅延の発生は、パケットの正常な転送を阻害する恐れがある。 When parallel processing is adopted in packet transfer processing, a large delay occurs when the distribution destination is changed in accordance with the change of the parallel number. The occurrence of such a delay may hinder the normal transfer of packets.

本発明の目的は、並列処理によってパケットを処理する際に振分先が変更可能であって、かつ、振分先の変更時に発生する処理遅延の影響を低減した並列パケット処理方法を提供することにある。 An object of the present invention is to provide a parallel packet processing method in which a distribution destination can be changed when a packet is processed by parallel processing, and the influence of a processing delay generated when the distribution destination is changed is reduced. It is in.

本発明の別の目的は、並列処理によってパケットを処理する際に振分先が変更可能であって、かつ、振分先の変更時に発生する処理遅延の影響を低減した並列パケット処理装置を提供することにある。 Another object of the present invention is to provide a parallel packet processing apparatus that can change the distribution destination when processing packets by parallel processing and that reduces the influence of processing delay that occurs when the distribution destination is changed. There is to do.

本発明の第１の並列パケット処理方法は、複数のパケット処理手段に対してフローを振り分けて複数のパケット処理手段間で並列にフローに対するパケット処理を実行する並列パケット処理方法であって、複数のパケット処理手段に対してフローを振り分ける、相互に異なる第１及び第２の振分アルゴリズムを用意し、第１の振分アルゴリズムをフローに適用している状態から、フローごとにそのフローに対して第２の振分アルゴリズムが適用される確率が時間経過とともに高くなるようにして、第２の振分アルゴリズムがフローに適用される状態に切り替える。第１の並列パケット処理方法の例示的な態様においては、第１の振分アルゴリズムによって振分先となるパケット処理手段の数と第２の振分アルゴリズムによって振分先となるパケット処理手段の数とが異なるようにしてもよい。 A first parallel packet processing method of the present invention is a parallel packet processing method for distributing a flow to a plurality of packet processing means and executing packet processing on the flow in parallel between the plurality of packet processing means, From the state in which the first distribution algorithm different from each other and the first distribution algorithm different from each other are allocated to the packet processing means, and the first distribution algorithm is applied to the flow, The probability that the second distribution algorithm is applied is increased with the passage of time so that the second distribution algorithm is applied to the flow. In an exemplary aspect of the first parallel packet processing method, the number of packet processing means to be assigned by the first distribution algorithm and the number of packet processing means to be assigned by the second distribution algorithm And may be different.

本発明の第２の並列パケット処理方法は、複数のパケット処理手段に対してフローを振り分けて複数のパケット処理手段間で並列にフローに対するパケット処理を実行する並列パケット処理方法であって、複数のパケット処理手段に対してフローを振り分ける、相互に異なる第１及び第２の振分アルゴリズムを用意し、第１の振分アルゴリズムを用いてフローを複数のパケット処理手段に振り分け、振り分けられたフローのうち、特定のパケット処理手段に振り分けられることとなったフローに対して第２の振分アルゴリズムを適用して再振分を実行する。第２の並列パケット処理方法の例示的な態様においては、第２の振分アルゴリズムは、特定のパケット処理手段を除いた複数のパケット処理手段にフローを振り分ける振分アルゴリズムであるようにしてもよい。 A second parallel packet processing method of the present invention is a parallel packet processing method for distributing a flow to a plurality of packet processing means and executing packet processing on the flow in parallel between the plurality of packet processing means, Prepare different first and second distribution algorithms that distribute flows to packet processing means, and distribute flows to a plurality of packet processing means using the first distribution algorithm. Among them, the second distribution algorithm is applied to the flow that has been distributed to the specific packet processing means, and redistribution is executed. In an exemplary aspect of the second parallel packet processing method, the second distribution algorithm may be a distribution algorithm that distributes a flow to a plurality of packet processing means excluding a specific packet processing means. .

本発明の第３の並列パケット処理方法は、複数のパケット処理手段に対してフローを振り分けて複数のパケット処理手段間で並列にフローに対するパケット処理を実行する並列パケット処理方法であって、複数のパケット処理手段に対してフローを振り分ける振分アルゴリズムを変更可能とし、振分アルゴリズムを変更する前に、その変更によって各フローが複数のパケット処理手段のいずれにおいて処理されるように変化するかを判断し、判断の結果に基づいて事前に、変更後の各フローの処理に必要なデータを対応するパケット処理手段に保持させ、その後、振分アルゴリズムの変更を行う。 A third parallel packet processing method of the present invention is a parallel packet processing method for distributing a flow to a plurality of packet processing means and executing packet processing on the flow in parallel between the plurality of packet processing means, The distribution algorithm for distributing flows to packet processing means can be changed, and before changing the distribution algorithm, it is determined which of the multiple packet processing means will change each flow due to the change. Then, based on the determination result, the data necessary for the processing of each changed flow is held in advance in the corresponding packet processing means, and then the distribution algorithm is changed.

本発明の第１の並列パケット処理装置は、フローに対してパケット処理を行う複数のパケット処理手段と、相互に異なる第１及び第２の振分アルゴリズムにより複数のパケット処理手段にフローを振り分ける振分手段と、第１の振分アルゴリズムをフローに適用している状態から、フローごとにそのフローに対して第２の振分アルゴリズムが適用される確率が時間経過とともに高くなるようにして第２の振分アルゴリズムがフローに適用される状態に切り替えるように、振分手段を制御する制御手段と、を有する。第１の並列パケット処理装置の例示的な態様においては、第１の振分アルゴリズムによって振分先となるパケット処理手段の数と第２の振分アルゴリズムによって振分先となるパケット処理手段の数とが異なっており、制御手段は、複数のパケット処理手段での並列数の変更を契機として、振分手段に状態の切り替えを指示するようにしてもよい。 The first parallel packet processing device according to the present invention includes a plurality of packet processing means for performing packet processing on a flow, and a plurality of packet processing means for distributing flows to the plurality of packet processing means using mutually different first and second distribution algorithms. From the state in which the distribution means and the first distribution algorithm are applied to the flow, the probability that the second distribution algorithm is applied to the flow for each flow is increased with the passage of time. Control means for controlling the distribution means so as to switch to a state in which the distribution algorithm is applied to the flow. In the exemplary aspect of the first parallel packet processing apparatus, the number of packet processing means that are the distribution destinations by the first distribution algorithm and the number of packet processing means that are the distribution destinations by the second distribution algorithm The control unit may instruct the distribution unit to change the state when the parallel number is changed by the plurality of packet processing units.

本発明の第２の並列パケット処理装置は、フローに対してパケット処理を行う複数のパケット処理手段と、第１の振分アルゴリズムにより複数のパケット処理手段にフローを振り分る振分手段と、振分手段を制御する制御手段と、を備え、振分手段は、第１の振分アルゴリズムを適用して振り分けられたフローについてそのフローの振分先に基づいて再振分の要否を判断する再振分判断部を有し、再振分が必要と判断したときに、第１の振分アルゴリズムとは異なる第２の振分アルゴリズムでそのフローを複数のパケット処理手段に再振分し、制御手段は、第１の振分アルゴリズムによって特定のパケット処理手段に割り振られることとなったフローの再振分を振分手段に指示する。第２の並列パケット処理装置の例示的な態様においては、第２の振分アルゴリズムは、特定のパケット処理手段を除いた複数のパケット処理手段にフローを振り分ける振分アルゴリズムであるようにしてもよい。 A second parallel packet processing device of the present invention includes a plurality of packet processing means for performing packet processing on a flow, a distribution means for distributing the flow to the plurality of packet processing means by a first distribution algorithm, Control means for controlling the distribution means, and the distribution means determines whether or not the redistribution is necessary based on the distribution destination of the flow for the flow distributed by applying the first distribution algorithm. A redistribution determination unit that redistributes the flow to a plurality of packet processing means using a second distribution algorithm different from the first distribution algorithm when it is determined that redistribution is necessary. The control unit instructs the distribution unit to redistribute the flow that has been allocated to the specific packet processing unit by the first distribution algorithm. In an exemplary aspect of the second parallel packet processing device, the second distribution algorithm may be a distribution algorithm that distributes a flow to a plurality of packet processing means excluding a specific packet processing means. .

本発明の第３の並列パケット処理装置は、フローに対してパケット処理を行う複数のパケット処理手段と、変更可能な振分アルゴリズムにより複数のパケット処理手段にフローを振り分ける振分手段と、振分アルゴリズム情報と振り分け対象となるパケット処理手段の数と各フロー情報を保持し、振分アルゴリズムの変更後に各フローがいずれのパケット処理手段により処理されるように変化するかを判断し、判断の結果に基づいて必要なデータを予め保持するよう各パケット処理手段に指示した後、振分先を変更するように振分手段を制御する制御手段と、を有する。 A third parallel packet processing device of the present invention includes a plurality of packet processing means for performing packet processing on a flow, a distribution means for distributing the flow to the plurality of packet processing means by a changeable distribution algorithm, Stores the algorithm information, the number of packet processing means to be distributed and each flow information, and determines which packet processing means each flow changes after the distribution algorithm is changed. And control means for controlling the distribution means so as to change the distribution destination after instructing each packet processing means to hold necessary data in advance.

本発明では、振分先の変更に時間の要素を取り入れ、少数のフロー単位に時間的に分散させて第１の振分アルゴリズムが適用される状態から第２の振分アルゴリズムが適用される状態へと振分先変更を行うことで、キャッシュミスヒットの発生の集中を防ぎ、振分先変更時の処理遅延の発生の低減が可能となる。また本発明では、特定の振分先に振り分けられたフローに対して再振分を行うことで、振分処理におけるフローとパケット処理コアの割当管理の負担を軽減しつつ、キャッシュミスヒットの発生の集中を防ぎ、振分先変更時の処理遅延の発生を低減することが可能となる。また本発明では、振分先の変更により各フローがいずれのパケット処理手段で処理されるようになるかを予測し、その予測結果に基づいて各フローのパケット処理に必要なデータを予め各パケット処理手段にキャッシュさせることで、キャッシュミスヒットの発生を防ぎ、振分先変更時の処理遅延の発生を低減することが可能となる。 In the present invention, a state in which the second distribution algorithm is applied from the state in which the first distribution algorithm is applied by incorporating the time element into the change of the distribution destination and temporally distributing it to a small number of flow units. By changing the distribution destination, it is possible to prevent the occurrence of cache miss hits and reduce the processing delay when changing the distribution destination. In addition, in the present invention, by performing redistribution on a flow allocated to a specific distribution destination, a cache miss hit occurs while reducing the burden of allocation management between the flow and the packet processing core in the distribution processing. It is possible to reduce the occurrence of processing delay when changing the distribution destination. In the present invention, it is predicted which packet processing means each flow will be processed by changing the distribution destination, and data necessary for packet processing of each flow is previously stored in each packet based on the prediction result. By causing the processing means to cache, it is possible to prevent the occurrence of a cache miss and reduce the occurrence of a processing delay when changing the distribution destination.

パケット処理装置の一般的な構成の一例を示すブロック図である。It is a block diagram which shows an example of the general structure of a packet processing apparatus. マルチコア構成のパケット処理装置における並列処理動作を説明する図である。It is a figure explaining the parallel processing operation | movement in the packet processing apparatus of a multi-core structure. 並列数を増加させたときのフローの再振分とキャッシュミスヒットの発生とを説明する図である。It is a figure explaining the redistribution of the flow when the number of parallel is increased, and generation | occurrence | production of a cache miss hit. 本発明の実施の一形態のパケット処理装置におけるパケット処理を説明する図である。It is a figure explaining the packet processing in the packet processing apparatus of one Embodiment of this invention. 振分部での判断ロジックを説明する図である。It is a figure explaining the judgment logic in a distribution part. 特定のコアにおいて障害が発生した場合のフローの再振分を説明する図である。It is a figure explaining the re-distribution of the flow when a failure occurs in a specific core. 特定のコアにおいて処理の偏りが発生した場合のフローの再振分を説明する図である。It is a figure explaining the re-distribution of the flow when the bias of a process generate | occur | produces in a specific core. パケット処理装置の第１の構成例を示すブロック図である。It is a block diagram which shows the 1st structural example of a packet processing apparatus. パケット処理装置の第２の構成例を示すブロック図である。It is a block diagram which shows the 2nd structural example of a packet processing apparatus. パケット処理装置の第３の構成例を示すブロック図である。It is a block diagram which shows the 3rd structural example of a packet processing apparatus. パケット処理装置の第４の構成例を示すブロック図である。It is a block diagram which shows the 4th structural example of a packet processing apparatus. パケット処理装置の第５の構成例を示すブロック図である。It is a block diagram which shows the 5th structural example of a packet processing apparatus. パケット処理装置の第６の構成例を示すブロック図である。It is a block diagram which shows the 6th structural example of a packet processing apparatus. マルチプロセッサ構成としたパケット処理装置の一例を示すブロック図である。It is a block diagram which shows an example of the packet processing apparatus made into the multiprocessor structure. マルチノード構成によるパケット処理システムの一例を示すブロック図である。It is a block diagram which shows an example of the packet processing system by a multinode structure.

次に、本発明の好ましい実施の形態について、図面を参照して説明する。 Next, a preferred embodiment of the present invention will be described with reference to the drawings.

図４は本発明の実施の一形態のパケット処理装置におけるパケット処理を示す図であって、（Ａ）は実施の一形態のパケット処理装置の要部を示すブロック図、（Ｂ）は新旧振分処理を説明する図、（Ｃ）はパケット処理における遅延の時間変化を示すグラフである。 4A and 4B are diagrams showing packet processing in the packet processing apparatus according to the embodiment of the present invention. FIG. 4A is a block diagram showing the main part of the packet processing apparatus according to the embodiment, and FIG. The figure explaining minute processing, (C) is a graph which shows the time change of the delay in packet processing.

本実施形態のパケット処理装置は、図１に示したものと同様に、ネットワークとの接続に用いられる複数のインタフェース部と、パケット処理を並列に実行するマルチコア構成のプロセッサ１１１と、パケット処理の実施において必要とするためにデータを保持する外部メモリとしてデータ保持部１０９とを備えたものである。図１に示したものとの相違点は、プロセッサ１１１における各プロセッサコアがどのように使用されるかという点であるので、図４の（Ａ）では、本実施形態のパケット処理装置のうち、プロセッサ１１１とデータ保持部（外部メモリ）１０９の部分だけが示されている。 The packet processing apparatus according to the present embodiment has a plurality of interface units used for connection to a network, a multi-core processor 111 that executes packet processing in parallel, and implementation of packet processing, as shown in FIG. The data holding unit 109 is provided as an external memory for holding data because it is necessary for the above. Since the difference from what is shown in FIG. 1 is how each processor core in the processor 111 is used, in FIG. 4A, among the packet processing devices of this embodiment, Only the processor 111 and the data holding unit (external memory) 109 are shown.

プロセッサ１１１には、図１に示したものと同様に、パケット入力部１０４、パケット出力部１０５、内部バス１０６、振分部（振分コア）１０７、複数のパケット処理部（処理コア）１０８、及びメモリバス１１０が設けられ、さらに、各パケット処理部１０８でのパケット処理と振分部１０７でのフローの振分とを制御するために、制御部１１２が設けられている。制御部１１２は、マルチコア構成のプロセッサ１１１において、制御用のプログラムをロードしたコアによって実現されるのものであり、制御コアとも呼ばれるものである。制御部１１２は、内部バス１０６を介して各パケット処理部１０８と振分部１０７とに接続しており、各パケット処理部１０８における処理プログラムを動的に変更し、さらに、振分部１０７に対して振分方法の変更を指示することで、プログラムのロード状況などに合わせて並列処理数を動的に変更することができ、また、外的要因で並列処理数が変化した場合にその変化にパケット処理装置が対応できるようにする。各パケット処理部１０８での処理プログラムの動的な変更は、例えば、処理プログラムのロードやリロードによって行われる。特に、本実施形態のパケット処理装置では、複数のパケット処理部１０８に対するフローの振分での振分先の変更を行う際に、時間の要素を取り入れ、少数のフロー単位に時間的に分散させて振分先変更を行うことで、キャッシュミスヒットの発生の集中を防ぎ、振分先変更時の処理遅延の発生の低減を可能としている。時間の要素を取り入れた振分先の変更を時間変動切替と呼ぶ。また、このパケット処理装置では、特定のパケット処理部に割り当てられていたフローのみを対象として、再度、振分の処理を実行することによって、振分処理におけるフローとパケット処理部の割当管理の負担を軽減しつつ、キャッシュミスヒットの発生の集中を防ぎ、振分先変更時の処理遅延の発生の低減を可能としている。特定のパケット処理部に割り当てられていたフローに対して再度、振分の処理を行うことを再振分と呼ぶ。また、このパケット処理装置では、振分先の変更により各フローがいずれのパケット処理手段で処理されるようになるかを予測し、予測結果に応じて各フローのパケット処理に必要なデータを予め各パケット処理手段にキャッシュさせることでキャッシュミスヒットの発生を防ぎ、振分先変更時の処理遅延の発生を低減することを可能としている。以下、本実施形態での振分先の変更について説明する。 The processor 111 includes a packet input unit 104, a packet output unit 105, an internal bus 106, a distribution unit (distribution core) 107, a plurality of packet processing units (processing cores) 108, as shown in FIG. And a memory bus 110, and a control unit 112 for controlling packet processing in each packet processing unit 108 and flow distribution in the distribution unit 107. The control unit 112 is realized by a core loaded with a control program in the processor 111 having a multi-core configuration, and is also called a control core. The control unit 112 is connected to each packet processing unit 108 and the distribution unit 107 via the internal bus 106, dynamically changes the processing program in each packet processing unit 108, and further to the distribution unit 107. By instructing the change of the distribution method, the number of parallel processes can be changed dynamically according to the program load status, etc., and when the number of parallel processes changes due to external factors, the change So that the packet processing apparatus can cope with this. The dynamic change of the processing program in each packet processing unit 108 is performed, for example, by loading or reloading the processing program. In particular, in the packet processing apparatus according to the present embodiment, when changing the distribution destination in the flow distribution to a plurality of packet processing units 108, a time element is taken in and dispersed in a small number of flow units. By changing the distribution destination, it is possible to prevent the occurrence of cache miss hits and reduce the processing delay when changing the distribution destination. Changing the distribution destination that incorporates the time element is called time variation switching. Further, in this packet processing device, by executing the distribution process again only for the flow allocated to the specific packet processing unit, the burden of allocation management of the flow and the packet processing unit in the distribution process is performed. This reduces the occurrence of cache miss hits and reduces processing delays when changing the distribution destination. Performing distribution processing again on a flow assigned to a specific packet processing unit is called redistribution. Further, in this packet processing apparatus, it is predicted which packet processing means each flow will be processed by changing the distribution destination, and data necessary for packet processing of each flow is preliminarily stored according to the prediction result. By causing each packet processing means to cache, the occurrence of a cache miss hit can be prevented, and the occurrence of processing delay when changing the distribution destination can be reduced. Hereinafter, the change of the distribution destination in the present embodiment will be described.

まず、時間変動切替による振分アルゴリズムの切替について説明する。 First, switching of the distribution algorithm by time variation switching will be described.

図４（Ａ）に示した構成において、当初は３個のパケット処理部１０８、具体的にはコア１〜３のみがパケット処理に用いられていたとする。その状態では、振分部１０７は、入力したフローを３つのコアに振り分けるアルゴリズムを使用して、コア１〜コア３にフローを振り分けている。ここで、制御部１１２が、並列処理に参加するパケット処理部（処理コア）の数を並列数変更指示を出力することによって、コア４の新規起動あるいは割当変更を行い、あるいは、並列数の変更を検出してコア４が新たに加わったことを認識したものとする（図４の［１］）。このとき制御部１１２は、振分部１０７に対して、振分先切替の開始と、利用する振分アルゴリズムとを指示する（図４の［２］）。振分アルゴリズムの指示は、具体的には、利用するハッシュ関数の指示などである。 In the configuration shown in FIG. 4A, it is assumed that initially three packet processing units 108, specifically, only the cores 1 to 3 are used for packet processing. In this state, the distribution unit 107 distributes the flows to the cores 1 to 3 using an algorithm that distributes the input flow to the three cores. Here, the control unit 112 outputs a parallel number change instruction for the number of packet processing units (processing cores) participating in the parallel processing, thereby newly starting the core 4 or changing the allocation, or changing the parallel number. Is detected and it is recognized that the core 4 is newly added ([1] in FIG. 4). At this time, the control unit 112 instructs the distribution unit 107 to start distribution destination switching and a distribution algorithm to be used ([2] in FIG. 4). The instruction of the distribution algorithm is specifically an instruction of a hash function to be used.

振分部１０７は、制御部１１２からの制御によって、フローを３つのコアに振り分ける振分アルゴリズム（３分割用振分アルゴリズム）と、フローを４つのコアに振り分ける振分アルゴリズム（４分割用振分アルゴリズム）とを実行可能であるとともに、フローごとに２つの振分アルゴリズムのどちらを適用するかを決定する振分アルゴリズム（新旧振分アルゴリズム）を実行可能である。特に新旧振分アルゴリズムは、切替開始指示からの時間経過をしきい値に利用して、新しい振分に対応する振分アルゴリズム（ここでは４分割用振分アルゴリズム）に渡されるフローが徐々に大きくなるように、２つの振分アルゴリズム（３分割用振分アルゴリズムと４分割用振分アルゴリズム）に対してフローを割り当てる。具体的には、図４（Ｂ）に示すように、フローからハッシュ値を計算し、ハッシュ値に応じてそのフローを（旧）３分割用振分アルゴリズムに振り分けるか（新）４分割用振分アルゴリズムに振り分けるかを決定し、この決定のしきい値を時間の経過に応じて変化させている。３分割用振分アルゴリズムと４分割用振分アルゴリズムは、新旧振分アルゴリズムよって振り分けられたフローをパケット処理部（処理コア）に振り分ける。このように、新しい振分に対応する振分アルゴリズムが適用されるフロー数を徐々に増やすようにすることにより、コア１〜４から外部メモリへのアクセスの集中を防ぎながら、振分先の変更を行うことができる。 The distribution unit 107 is controlled by the control unit 112 to distribute the flow to the three cores (distribution algorithm for three divisions) and the distribution algorithm for distributing the flows to the four cores (distribution for four divisions). Algorithm) and a sorting algorithm (new and old sorting algorithm) that determines which of the two sorting algorithms to apply to each flow. In particular, the old and new distribution algorithms use the elapsed time from the switching start instruction as a threshold value, and the flow passed to the distribution algorithm corresponding to the new distribution (here, the four-segment distribution algorithm) gradually increases. In this manner, flows are assigned to two distribution algorithms (a three-segment distribution algorithm and a four-division distribution algorithm). Specifically, as shown in FIG. 4B, a hash value is calculated from the flow, and the flow is distributed to the (old) 3-division distribution algorithm according to the hash value (new). Whether to distribute to the minute algorithm is determined, and the threshold value of this determination is changed with the passage of time. The 3-division distribution algorithm and 4-division distribution algorithm distribute the flows distributed by the old and new distribution algorithms to the packet processing unit (processing core). In this way, by gradually increasing the number of flows to which the distribution algorithm corresponding to the new distribution is applied, the distribution destination is changed while preventing concentration of accesses from the cores 1 to 4 to the external memory. It can be performed.

図４（Ｃ）は、本実施形態でのパケットの処理遅延の時間変化を示すグラフである。このグラフからも分かるように、フローに対して新しい振分を徐々に適用しているので、変更タイミングの直後に処理遅延が急増することはなく、パケット処理性能への振分先変更の影響が小さく、パケット処理に要求される時間的制約を十分に満たすことが可能となる。 FIG. 4C is a graph showing a change over time in packet processing delay in the present embodiment. As can be seen from this graph, since new distribution is gradually applied to the flow, the processing delay does not increase immediately after the change timing, and the influence of changing the distribution destination on the packet processing performance is not affected. It is small and can sufficiently satisfy the time constraint required for packet processing.

図５は、振分部における判断ロジックの具体的な例を示している。切替開始指示の時刻をｔ₀とし、現在の時刻をｔとすると、切替開始からの経過時間Δｔは、
Δｔ＝ｔ−ｔ₀
で表される。そこで、経過時間Δｔの関数Ｆ(Δｔ)として、しきい値を表すことにする。フローから求められるハッシュ値（出力Index値）がハッシュ空間においてIndex 0とIndex Xとの間をとるものとすると、出力Indexと、時間変化するしきい値Ｆ(Δｔ)とによって、フローを新旧いずれの振分アルゴリズムに振り分けるかを判断する。しきい値を表す関数Ｆとしては、Ｆ(０)では値がIndex 0であり、所定の時間をＴとしてＦ(Ｔ)では値がIndex Xであり、その間は経過時間Δｔに応じて例えば直線的に変化するものを用いることができる。 FIG. 5 shows a specific example of determination logic in the distribution unit. When the time of the switching start instruction is t ₀ and the current time is t, the elapsed time Δt from the switching start is
Δt = t−t ₀
It is represented by Therefore, the threshold value is expressed as a function F (Δt) of the elapsed time Δt. Assuming that the hash value (output index value) obtained from the flow is between Index 0 and Index X in the hash space, the flow can be changed between old and new by the output index and the time-varying threshold value F (Δt). Judge whether to assign to the sorting algorithm. As a function F representing the threshold value, F (0) has a value of Index 0, a predetermined time is T, and F (T) has a value of Index X, and during that time, for example, a straight line according to the elapsed time Δt. Can be used.

時間変動切替では、柔軟な処理割当や省電力化のためにパケット処理部の並列数を動的に増減させることを、パケット処理性能への影響を少なく実施することができる。 In the time variation switching, dynamically increasing or decreasing the parallel number of packet processing units for flexible process allocation and power saving can be implemented with less influence on packet processing performance.

また、時間変動切替は、パケット処理部への振分フロー数に偏りが発生した場合に、振分処理におけるフローとパケット処理部の割当変更の負担を軽減しつつ、より偏りのない振分方法へと変更するために用いることができる。パケット処理部間で処理量の偏りが生じた時、その偏りは制御部１１２によって検出される。その検出に基づいて制御部１１２は、より偏りが生じないように、フロー振分におけるアルゴリズムの変更（例えば、ハッシュ値を求めるために用いられるハッシュ関数の変更、ハッシュ値からフローをまとめる際に用いられるしきい値の変更、など）を振分部１０７に指示する。振分部１０７は、この指示に応じて、時間変化するしきい値を用いて、新振分アルゴリズムに振り分けられるフローが徐々に増加するように、フローを新旧の振分アルゴリズムの間で振り分ける。 In addition, when the time variation switching is biased in the number of distribution flows to the packet processing unit, the distribution method is more evenly distributed while reducing the burden of allocation change between the flow and the packet processing unit in the distribution process. Can be used to change When a processing amount deviation occurs between the packet processing units, the deviation is detected by the control unit 112. Based on the detection, the control unit 112 changes the algorithm in the flow distribution (for example, changes in the hash function used to obtain the hash value, used when collecting the flows from the hash value so that the bias does not occur more. The distribution unit 107 is instructed. In response to this instruction, the distribution unit 107 distributes the flow between the old and new distribution algorithms so that the flow allocated to the new distribution algorithm gradually increases using a time-varying threshold value.

図４に示した構成では、並列数変更指示による振分の切替を行う際、制御部１１２は、処理コアの並列数を変更するためのパケット処理用のプログラムをプロセッサ１１１にリロードするときに、振分部１０７に対して振分先変更の契機を指示している。処理コアの数を減少させる場合には、振分先の変更が完了してからプログラムのリロードを行い、処理コアの数を増やす橋には、プログラムのリロードの完了後に振分先変更を開始させる。 In the configuration shown in FIG. 4, when switching the allocation according to the parallel number change instruction, the control unit 112 is configured to reload the processor 111 with a packet processing program for changing the parallel number of processing cores. The distribution unit 107 is instructed to change the distribution destination. If you want to reduce the number of processing cores, reload the program after the change of the allocation destination is completed, and for the bridge that increases the number of processing cores, start the allocation destination change after the completion of the program reload. .

フローの偏りが発生したことに基づいて振分の切替を行う場合には、振分部１０７が各処理コアに振り分けたフローの数を制御部１１２にフィードバックし、制御部１１２は、そのフィードバック情報に例えばしきい値を適用して偏りを検出し、偏りを検出したこによって振分先変更を開始させる。あるいは、各コアの負荷状況（例えばＣＰＵ使用率）を制御部１１２が収集し、負荷状況に対して例えばしきい値を適用して偏りを検出するようにしてもよい。 When switching distribution based on the occurrence of flow bias, the number of flows allocated to each processing core by the distribution unit 107 is fed back to the control unit 112, and the control unit 112 provides feedback information. For example, a bias is detected by applying a threshold value, and the distribution destination change is started when the bias is detected. Alternatively, the load status (for example, CPU usage rate) of each core may be collected by the control unit 112, and a bias may be detected by applying a threshold value to the load status, for example.

次に、再振分による振分アルゴリズムの切替について説明する。 Next, switching of the distribution algorithm by redistribution will be described.

図６は、特定のパケット処理装置（処理コア）において障害が発生した場合のフローの再振分を説明する図である。図６（Ａ）は、図４に示したパケット処理装置において、４個のパケット処理部（コア１〜４）がパケット処理を行っている状態を示している。このとき、図６（Ｂ）に示すように、フローは、例えば、コア１、コア２及びコア４に振り分けられている。ここで、コア４において障害が発生したとする。すると、制御部１１２は、障害発生を検出し（図６の［１］）、障害発生の検出に応じて、振分部１０７に対し、再振分の開始と、利用する振分アルゴリズムとを指示する（図６の［２］）。具体的には、既存の振分アルゴリズムで振り分けたフローに対して再振分を要するかどうかの再振分判断を行い、障害を発生したパケット処理部（ここではコア４）に振り分けられたフローについて、障害を発生していないパケット処理部（ここではコア１〜３）に振り分けるように、制御部１１２は振分部１０７に指示する。その結果、図６の（Ｃ）に示すように、振分部１０７は、コア４に行くとされていたフローについて、再振分判断の結果、再振分を要するものとして、コア４以外の別のコアに振り分けるようにする。障害発生前に図６の（Ｂ）に示すようにフローが振り分けられていたとして、再振分の結果、図６の（Ｄ）に示すように、コア４あてのフローをコア３に振り分ける。なお、障害の発生の検出は、制御部１１２が処理コアの正常性を常時監視することによって、容易に行うことができる。 FIG. 6 is a diagram for explaining flow redistribution when a failure occurs in a specific packet processing device (processing core). FIG. 6A shows a state where four packet processing units (cores 1 to 4) are performing packet processing in the packet processing apparatus shown in FIG. At this time, as shown in FIG. 6B, the flow is distributed to the core 1, the core 2, and the core 4, for example. Here, it is assumed that a failure has occurred in the core 4. Then, the control unit 112 detects the occurrence of the failure ([1] in FIG. 6), and in response to the detection of the occurrence of the failure, the control unit 112 instructs the distribution unit 107 to start redistribution and to use a distribution algorithm. It instructs ([2] in FIG. 6). Specifically, the redistribution judgment is made as to whether redistribution is required for the flow allocated by the existing distribution algorithm, and the flow allocated to the packet processing unit (in this case, core 4) in which the failure has occurred. , The control unit 112 instructs the distribution unit 107 to distribute the packet processing units (cores 1 to 3 in this case) that have not failed. As a result, as shown in FIG. 6C, the distribution unit 107 determines that the flow that was supposed to go to the core 4 requires redistribution as a result of the redistribution determination. Try to distribute to another core. Assuming that the flow is distributed as shown in (B) of FIG. 6 before the failure occurs, the flow directed to the core 4 is distributed to the core 3 as shown in (D) of FIG. The occurrence of a failure can be easily detected by the control unit 112 constantly monitoring the normality of the processing core.

このように、特定のパケット処理部に障害が発生した場合に、その障害が発生したパケット処理部を除いてフローを振り返るような振分に切り替えることにより、振分処理におけるフローとパケット処理部の割当変更の負担を軽減しつつ、パケット処理性能への影響を低く抑えたまま、すべてのフローに対するパケット処理を継続することができるようになる。再振分での振分アルゴリズムとしては、１つを除くパケット処理部に振分が行われるような振分関数が用いられる。振分関数は、例えば、並列処理に参加するすべてのパケット処理部からある１つのパケット処理部を除き、残ったパケット処理部をハッシュ関数から導出されるアドレス番地に保存することによって構成される。どのパケット処理部が除かれたかだけが異なるこのような振分関数をパケット処理部の数だけ用意し、障害が発生したパケット処理部に応じてそれらの振分関数の中から該当するものを選択して再振分に使用することによって、障害が発生したパケット処理部に振り分けられたフローを正常なパケット処理部に再振分することができる。 In this way, when a failure occurs in a specific packet processing unit, by switching to a distribution that looks back at the flow except for the packet processing unit in which the failure has occurred, the flow in the distribution processing and the packet processing unit Packet processing for all flows can be continued while reducing the burden of allocation change and keeping the impact on packet processing performance low. As a distribution algorithm in the re-distribution, a distribution function is used in which distribution is performed in the packet processing unit except for one. The distribution function is configured by, for example, excluding one packet processing unit from all the packet processing units participating in the parallel processing and storing the remaining packet processing unit at an address address derived from the hash function. Prepare such distribution functions as many as the number of packet processing units, which differ only in which packet processing units have been removed, and select the appropriate distribution function from those depending on the packet processing unit in which a failure has occurred. By using it for redistribution, it is possible to redistribute the flow allocated to the packet processing unit in which a failure has occurred to a normal packet processing unit.

また、再振分による振分アルゴリズムの切替は、複数のパケット処理部のうちの特定のパケット処理部にフローが集中するような偏りが発生した場合に、その偏りを解消するためにも用いることができる。図７は、偏りの解消のための振分先変更の一例を示している。図７の（Ａ）は、コア１〜コア４のうち、コア４に大量のフローが割り当てられている状態を示している。このような偏りを制御部１１２が検出すると、障害発生の場合と同様に制御部１１２は、振分部１０７に対し、再振分の開始と、利用する振分アルゴリズムとを指示する。その結果、振分部１０７は、図７の（Ｂ）に示すように、コア４に行くとされていたフローについて、再振分判断の結果、再振分を要するものとして、コア４での偏りが解消するように、コア１〜コア４の全体にフローが分散するようにフローの再振分を実行する。再振分のアルゴリズムとしては、例えば、異なるハッシュ関数によりフローからハッシュ値を求めて再振分するものが挙げられる。コア４で障害が起きているわけではないので、再振分により一部のフローは依然としてコア４に振り分けられ続けることになる。再振分による振分アルゴリズム切替を行うことにより、パケット処理部への振分フロー数に偏りが発生した場合において、振分処理におけるフローとパケット処理部の割当変更の負担を軽減しつつ、パケット処理性能への影響を低く抑えたまま、より偏りのない振分方法へと変更することができる。 In addition, switching of the distribution algorithm by redistribution should also be used to eliminate a bias that causes a flow to concentrate on a specific packet processing unit among a plurality of packet processing units. Can do. FIG. 7 shows an example of changing the distribution destination for eliminating the bias. FIG. 7A shows a state in which a large number of flows are assigned to the core 4 among the cores 1 to 4. When the control unit 112 detects such a bias, the control unit 112 instructs the distribution unit 107 to start redistribution and to use a distribution algorithm, as in the case of a failure. As a result, as shown in (B) of FIG. 7, the distribution unit 107 determines that the flow that was supposed to go to the core 4 requires redistribution as a result of the redistribution determination. Flow redistribution is performed so that the flow is distributed throughout the cores 1 to 4 so that the bias is eliminated. Examples of the redistribution algorithm include an algorithm for obtaining a hash value from a flow using different hash functions and performing redistribution. Since the failure does not occur in the core 4, a part of the flow is still allocated to the core 4 due to the redistribution. By switching the distribution algorithm by redistribution, when there is a bias in the number of distribution flows to the packet processing unit, while reducing the burden of changing the allocation of the flow and packet processing unit in the distribution processing, It is possible to change to a distribution method without bias while keeping the influence on the processing performance low.

フローの偏りの発生の検出は、例えば、振分部１０７が各処理コアに振り分けたフローの数を制御部１１２にフィードバックし、制御部１１２が、そのフィードバック情報に例えばしきい値を適用することによって実現できる。あるいは、各コアの負荷状況（例えばＣＰＵ使用率）を制御部１１２が収集し、負荷状況に対して例えばしきい値を適用するようにしてもよい。 For example, detection of the occurrence of a flow deviation is performed by feeding back the number of flows distributed to each processing core by the distribution unit 107 to the control unit 112, and the control unit 112 applies, for example, a threshold value to the feedback information. Can be realized. Alternatively, the control unit 112 may collect the load status (for example, CPU usage rate) of each core and apply a threshold value to the load status, for example.

次に、各フローの振分先の変更後のパケット処理部１０８を予測し、その予測に基づいて予め各フローのパケット処理に必要なデータを各パケット処理部１０８に保持させる切替について説明する。これは振分アルゴリズムには依らないが、ここでは図４（Ａ）を例に説明する。 Next, switching for predicting the packet processing unit 108 after the change of the distribution destination of each flow and holding each packet processing unit 108 in advance with data necessary for packet processing of each flow based on the prediction will be described. This does not depend on the distribution algorithm, but here, FIG. 4A will be described as an example.

制御部１１２が、データ保持部１０９の各フローのフロー情報と、振分アルゴリズム、振り分け対象となるパケット処理部１０８の数とから、振分アルゴリズムまたは振分先となるパケット処理部１０８の数の変更により各フローがどのパケット処理部１０８に振り分けられるように変更されるかを予測し、振分部１０７に対して振分アルゴリズムの変更の実施を指示する前に、予め、各パケット処理部１０８のキャッシュ内に、変更後に処理することとなる各フローのパケット処理に必要なデータを保持させる。例えば、パケットのヘッダの5-tupleからハッシュを計算しそのハッシュ値から振分先のパケット処理部を決めている場合には、データ保持部１０９内の各フローの5-tupleとハッシュ計算に用いるハッシュ関数とハッシュ値からパケット処理部１０８を決定する割り当て方法とが分かれば、各フローがどのパケット処理部１０８で処理されることになるかが判断可能である。 Based on the flow information of each flow in the data holding unit 109, the distribution algorithm, and the number of packet processing units 108 to be distributed, the control unit 112 determines the number of packet processing units 108 that are the distribution algorithm or the distribution destination. The packet processing unit 108 is predicted in advance to which packet processing unit 108 the flow is changed by the change, and before the distribution unit 107 is instructed to execute the distribution algorithm change. In this cache, data necessary for packet processing of each flow to be processed after the change is held. For example, when a hash is calculated from the 5-tuple of the packet header and the packet processing unit to be distributed is determined from the hash value, the 5-tuple of each flow in the data holding unit 109 and the hash calculation are used. If the allocation method for determining the packet processing unit 108 from the hash function and the hash value is known, it is possible to determine which packet processing unit 108 is to process each flow.

さらに、時間変動振分や再振分によって同時に振分先が変更されることとなるフロー数を削減することで、予めパケット処理部１０８が保持すべきデータ量を削減することができ、パケット処理部１０８のキャッシュの必要サイズを削減することができる。 Furthermore, by reducing the number of flows whose assignment destinations are changed at the same time due to time-variant distribution or redistribution, the amount of data that the packet processing unit 108 should hold in advance can be reduced. The required size of the cache of the unit 108 can be reduced.

なお、ＱｏＳ処理のシェイピングのように、パケットごとにテーブル情報が更新される処理に関するデータについては、各パケット処理部１０８のキャッシュとデータ保持部１０９との間での情報同期を行い、振分先変更時の情報不整合を防ぐようにする。 For data related to processing in which table information is updated for each packet, such as shaping of QoS processing, information synchronization is performed between the cache of each packet processing unit 108 and the data holding unit 109, and the distribution destination Prevent inconsistencies in information when changing.

次に、本発明に基づくパケット処理装置の具体的な構成について説明する。図８〜図１３は、それぞれ、パケット処理装置の構成例を示している。これらのパケット処理装置は、図４に示したものと同様のものであり、振分部１０７、複数のパケット処理部１０８及び制御部１１２の具体的な構成がそれぞれ明らかになっている点で図４に示したものと異なっているから、図８〜図１２では、パケット処理装置のうち、パケット入力部１０３、パケット出力部１０４、振分部１０７、複数のパケット処理部１０８及び制御部１１２の部分のみが示されている。図１３では、パケット入力部１０３、パケット出力部１０４、振分部１０７、複数のパケット処理部１０８及び制御部１１２に加え、データ保持部１０９も示されている。なお、複数のパケット処理部１０８によって並列処理部１１３が構成されている。また、パケット処理部（処理コア）１０８ごとにそのコアを特定するコアＩＤ（識別番号）が付与されているものとする。 Next, a specific configuration of the packet processing device according to the present invention will be described. 8 to 13 each show a configuration example of the packet processing device. These packet processing apparatuses are the same as those shown in FIG. 4, and the specific configurations of the distribution unit 107, the plurality of packet processing units 108, and the control unit 112 are clarified. 8 to 12, the packet input unit 103, the packet output unit 104, the distribution unit 107, the plurality of packet processing units 108, and the control unit 112 are included in the packet processing apparatuses in FIGS. 8 to 12. Only the part is shown. In FIG. 13, in addition to the packet input unit 103, the packet output unit 104, the distribution unit 107, the plurality of packet processing units 108, and the control unit 112, a data holding unit 109 is also shown. A plurality of packet processing units 108 constitute a parallel processing unit 113. Further, it is assumed that each packet processing unit (processing core) 108 is assigned a core ID (identification number) that identifies the core.

図８は、時間変動切替を実行するパケット処理装置の構成の一例を示している。 FIG. 8 shows an example of the configuration of a packet processing device that executes time variation switching.

振分部１０７は、それぞれ異なった振分アルゴリズムでフローを振り分ける第１振分処理部１２２及び第２振分処理部１２３と、パケット入力部１０４に入力したパケットをフロー単位で第１振分処理部１２２及び第２振分処理部１２３に振り分けて受け渡す振分切替部１２１と、を備えている。振分切替部１２１は、振分数の変更などを行っていない定常時には、第１振分処理部１２２及び第２振分処理部１２３のいずれか一方にのみ、フローを受け渡す。特に振分切替部１２１は、図４及び図５に示した振分部１０７において新旧振分を行うものに相当し、制御部１１２からの指示により、第１振分処理部１２２及び第２振分処理部１２３のうちの一方にフローを受け渡す状態から、徐々に、それらのうちの他方にフローを受け渡す状態に遷移する。 The allocating unit 107 performs a first allocating process on the packets input to the first allocating processing unit 122 and the second allocating processing unit 123 that distribute the flows by different allocating algorithms, and the packet input unit 104 in units of flows. A distribution switching unit 121 that distributes and transfers the data to the unit 122 and the second distribution processing unit 123. The distribution switching unit 121 delivers the flow only to one of the first distribution processing unit 122 and the second distribution processing unit 123 in a steady state where the distribution number is not changed. In particular, the distribution switching unit 121 corresponds to the distribution unit 107 shown in FIGS. 4 and 5 that performs old and new distribution, and in accordance with instructions from the control unit 112, the first distribution processing unit 122 and the second distribution unit. The state where the flow is delivered to one of the minute processing units 123 is gradually changed to the state where the flow is delivered to the other one of them.

ここに示す例では、パケット処理部へのフローの振分のために、ハッシュ値から直接導出されるアドレス番地にコアＩＤを格納するものとする。各振分処理部１２２，１２３は、フローに対してハッシュ関数を適用してハッシュ値を求め、そのハッシュ値で表されるアドレス番地に格納されたコアＩＤに対応するパケット処理部１０８にそのフローを振り分けるようになっている。第１振分処理部１２２は、例えば、３つのアドレス番地のいずれかがフローに応じて導出されるようなハッシュ関数を使用し、第２振分処理部１２３は、例えば、４つのアドレス番地のいずれかがフローに応じて導出されるようなハッシュ関数を使用する。この場合、第１振分処理部１２２は３分割用振分アルゴリズムによるフロー振分を行い、第２振分処理部１２３は４分割用振分アルゴリズムによるフロー振分を行うことになる。振分部１０７においてこれらのハッシュ関数や振分先数、利用するアドレス番地、コアＩＤは、制御部１１２から指示・設定される。 In the example shown here, it is assumed that the core ID is stored at an address address directly derived from the hash value for distribution of the flow to the packet processing unit. Each distribution processing unit 122, 123 obtains a hash value by applying a hash function to the flow, and sends the flow to the packet processing unit 108 corresponding to the core ID stored at the address address represented by the hash value. Is supposed to be distributed. For example, the first distribution processing unit 122 uses a hash function such that any one of three address addresses is derived according to the flow, and the second distribution processing unit 123 uses, for example, four address addresses. Use a hash function that either is derived according to the flow. In this case, the first distribution processing unit 122 performs flow distribution by the three-divided distribution algorithm, and the second distribution processing unit 123 performs flow distribution by the four-divided distribution algorithm. In the distribution unit 107, these hash functions, the number of distribution destinations, the address address to be used, and the core ID are instructed and set from the control unit 112.

制御部１１２は、情報収集部１４１と振分指示部１４２とから構成されている。情報収集部１４１は、各パケット処理部１０８の情報を収集・管理し、振分の切替契機を抽出し（図８の［１］）、切替契機とパケット処理部１０８との状況とを振分切替契機通知として振分指示部１４２に通知する（図８の［２］）。振分の切替契機としては、例えば、並列処理数の変動、各パケット処理部での処理フロー数の偏りなどが挙げられ、また、制御部１１２から並列処理部１１３に対して並列数変更指示を出したことも含まれる。振分指示部１４２は、情報収集部１４１の振分切替契機通知から振分方法（例えば、振分先、ハッシュ関数など）を判断し、第１振分処理部１２２及び第２振分処理部１２３のうちの動作していない方に新しい振分方法（振分アルゴリズム）を指示し（図８の［３］）、振分切替部１２１に対して振分切替開始を指示する（図８の［４］）。その際に、振分指示部１４２は、新しい振分アルゴリズムに対して一定時間ごとに少数のフローずつ移っていくように、タイマを参照し、判断しきい値の変更を逐次指示する。 The control unit 112 includes an information collection unit 141 and a distribution instruction unit 142. The information collecting unit 141 collects and manages the information of each packet processing unit 108, extracts the switching trigger of distribution ([1] in FIG. 8), and distributes the switching trigger and the status of the packet processing unit 108. The distribution instruction unit 142 is notified as a switching opportunity notification ([2] in FIG. 8). Examples of the switching trigger of the distribution include a change in the number of parallel processes, a deviation in the number of processing flows in each packet processing unit, and the like. In addition, the control unit 112 instructs the parallel processing unit 113 to change the parallel number. It is also included. The distribution instruction unit 142 determines a distribution method (for example, a distribution destination, a hash function, etc.) from the distribution switching trigger notification of the information collection unit 141, and the first distribution processing unit 122 and the second distribution processing unit. A new distribution method (distribution algorithm) is instructed to the one not operating in 123 ([3] in FIG. 8), and the distribution switching unit 121 is instructed to start distribution switching (FIG. 8). [4]). At that time, the distribution instructing unit 142 refers to the timer and sequentially instructs the new distribution algorithm to change the judgment threshold so that a small number of flows are transferred at regular intervals.

このように振分指示部１４２から指示を受けると、振分切替部１２１は、図５に示したように、パケットデータからハッシュ関数により求められるハッシュ値と、時間変動する判断しきい値から、フローのパケット処理部１０８への振分を第１振分処理部１２２で実施するか第２振分処理部１２３で実施するかをフローごとに判断し、該当する振分処理部にそのフローを受け渡す。上述したように、振分切替部１２１は、制御部１１２からの振分先切替の開始指示から振分先の切り替えを開始する。判断しきい値は、一定時間ごとに少数フローずつ振分先のコアが変更されるように変動し、これによって、フローの受け渡し先は、動作中だった振分処理部からそれまで動作していなかった振分処理部へと徐々に変化し、最終的には、それまで動作していなかった方の振分処理部に全フローが受け渡されることになる。 When receiving an instruction from the distribution instruction unit 142 in this way, the distribution switching unit 121, as shown in FIG. 5, from the hash value obtained from the packet data by the hash function and the determination threshold that varies with time, It is determined for each flow whether the flow is distributed to the packet processing unit 108 by the first distribution processing unit 122 or the second distribution processing unit 123, and the flow is transferred to the corresponding distribution processing unit. Deliver. As described above, the distribution switching unit 121 starts switching the distribution destination from the distribution destination switching start instruction from the control unit 112. The decision threshold fluctuates so that the core of the distribution destination is changed by a small number of flows at regular time intervals, so that the flow delivery destination is operating from the distribution processing section that was operating until then. The flow gradually changes to the distribution processing unit that did not exist, and finally, the entire flow is delivered to the distribution processing unit that has not been operating.

第１振分処理部１２２が３分割用振分アルゴリズムに対応し、第２振分処理部１２３が４分割用振分アルゴリズムに対応する場合、パケット処理に関する並列数が３であれば、第１振分処理部１２２のみにフローが受け渡されている。ここで例えば、パケット処理に関する並列数が３から４に増えると、制御部１１２は新しいパケット処理部を用意し、振分部１０７に振分の切替開始を指示する。それにより、振分切替部１２１による切替が開始され、まず、少数のフローのみが第２振分処理部１２３に渡されるようになる。制御部１１２は、一定時間経過後に、第２振分処理部１２３へ渡されるフローが増えるようにしきい値を変更させ、このしきい値の変更を繰り返す。その結果、最終的には全フローが第２振分処理部１２３に渡されるようになり、３並列から４並列への移行が完了する。 When the first distribution processing unit 122 corresponds to the three-division distribution algorithm and the second distribution processing unit 123 corresponds to the four-division distribution algorithm, if the parallel number related to packet processing is 3, the first The flow is delivered only to the distribution processing unit 122. Here, for example, when the number of parallel processes related to packet processing increases from 3 to 4, the control unit 112 prepares a new packet processing unit and instructs the distribution unit 107 to start distribution switching. Thereby, switching by the distribution switching unit 121 is started, and only a small number of flows are first passed to the second distribution processing unit 123. The control unit 112 changes the threshold value so that the flow passed to the second distribution processing unit 123 increases after a certain period of time, and repeats this threshold value change. As a result, all the flows are finally passed to the second distribution processing unit 123, and the transition from 3 parallel to 4 parallel is completed.

図９は、時間変動切替を実行するパケット処理装置の構成の別の例を示している。図９に示すものは、図８に示すものと同様のものであるが、制御部１１２の振分指示部１４２は振分部１０７に対して、判断しきい値の逐次指示を行わず、その代わり、振分部１０７内に振分変更部１２４が設けられている点で図８のものと異なっている。振分変更部１２４は、振分指示部１４２から切替開始指示を受けると（図９の［４］）、その内蔵するタイマを参照し、一定時間経過ごとに、振分切替部１２１に対して判断しきい値の変更を指示する（図９の［５］）。この判断しきい値の変更は、フローの受け渡し先が、動作中だった振分処理部からそれまで動作していなかった振分処理部へと徐々に変化し、最終的にはそれまで動作していなかった方の振分処理部に全フローが受け渡されるように行われる。図８に示したものでは判断しきい値が制御部１１２からの逐一の指示により変更されるのに対し、図９に示したものでは、振分部１０７において、タイマ参照により自律的に変更されることになる。 FIG. 9 shows another example of the configuration of the packet processing device that executes time variation switching. 9 is the same as that shown in FIG. 8, but the distribution instruction unit 142 of the control unit 112 does not sequentially instruct the determination threshold value to the distribution unit 107. Instead, the distribution changing unit 124 is provided in the distribution unit 107, which is different from that of FIG. When receiving a switching start instruction from the distribution instruction unit 142 ([4] in FIG. 9), the distribution changing unit 124 refers to the built-in timer and instructs the distribution switching unit 121 after a certain period of time has elapsed. An instruction to change the judgment threshold is given ([5] in FIG. 9). This decision threshold change is made by gradually changing the flow delivery destination from the distribution processing unit that was operating to the distribution processing unit that was not operating until then. It is performed so that the entire flow is delivered to the distribution processing unit that was not. In the example shown in FIG. 8, the judgment threshold value is changed by a single instruction from the control unit 112, whereas in the example shown in FIG. Will be.

図１０は、時間変動切替によるパケット処理装置であるが、パケット処理部の数に比べて十分に大きい数の仮想ノードを想定し、各仮想ノードに対してコアＩＤを割り当てておき、フローの振分に対しては、フローのハッシュ値から仮想ノードを導出し、導出された仮想ノードがどのコアＩＤに対応しているかに応じて、そのフローをパケット処理部に振り分けるようにしたものである。この構成では、振分部１０７は、仮想ノード振分部１２５と実ノード振分部１２６を備えている。仮想ノード振分部１２５は、パケットデータにハッシュ関数を適用してハッシュ値を求め、そのハッシュ値から、仮想ノードに対応づけられたアドレス番地を導出する。仮想ノードの数は、振分対象となるパケット処理部１０８の数よりも十分に多いが、それでも、複数のフローが同一の仮想ノードに振り分けられる。実ノード振分部１２６は、パケットデータが仮想ノードに振り分けられると、その仮想ノードに対応づけられたアドレス番地に保存されているコアＩＤを検索し、そのパケットデータのフローを検索されたコアＩＤのパケット処理部１０８（すなわち実ノード）へ受け渡す。上述した仮想ノードとコアＩＤとの対応関係は、実ノード振分部１２６内に記録されている。 FIG. 10 shows a packet processing device based on time variation switching. Assuming a sufficiently large number of virtual nodes compared to the number of packet processing units, a core ID is assigned to each virtual node, and the flow allocation is performed. For the minutes, a virtual node is derived from the hash value of the flow, and the flow is distributed to the packet processing unit according to which core ID the derived virtual node corresponds to. In this configuration, the distribution unit 107 includes a virtual node distribution unit 125 and a real node distribution unit 126. The virtual node distribution unit 125 obtains a hash value by applying a hash function to the packet data, and derives an address address associated with the virtual node from the hash value. Although the number of virtual nodes is sufficiently larger than the number of packet processing units 108 to be distributed, a plurality of flows are still distributed to the same virtual node. When the packet data is distributed to the virtual node, the real node distribution unit 126 searches for the core ID stored at the address address associated with the virtual node, and the searched core ID for the flow of the packet data To the packet processing unit 108 (that is, the real node). The correspondence relationship between the virtual node and the core ID described above is recorded in the real node distribution unit 126.

制御部１１２としては、図８に示したものと同様のものが用いられており、割当変更部１４２は、実ノード振分部１２６に対し、仮想ノードとコアＩＤとの割り当ての変更を指示する（図１０の［３］）。このとき割当変更部１４２は、タイマを参照し、一定時間ごとに１つの仮想ノードずつ振分先の処理コアを変更するように、実ノード振分部１２６に指示する。これにより、振分部１０７は、振分先の切り替えを開始し、切替開始指示からの時間経過に応じて新しい振分に基づいて振り分けられるフローが徐々に多くなるように、フローの各パケット処理部１０８への振分が実行されることになる。 The control unit 112 is the same as that shown in FIG. 8, and the allocation changing unit 142 instructs the real node allocation unit 126 to change the allocation of the virtual node and the core ID. ([3] in FIG. 10). At this time, the allocation changing unit 142 refers to the timer and instructs the real node allocating unit 126 to change the processing core of the allocation destination by one virtual node at regular time intervals. As a result, the distribution unit 107 starts switching the distribution destination, and processes each packet of the flow so that the flow allocated based on the new distribution gradually increases as time elapses from the switching start instruction. The distribution to the unit 108 is executed.

図１０に示したものでは、実ノード振分部１２６での割当変更は、制御部１１２からの逐次の指示によって行われていたが、振分部１０７自体が自律的に割当変更を行うようにしてもよい。図１１は、振分部１０７自体が自律的に割当変更を行うようにした構成を示している。図１１に示すものでは、制御部１１２には割当変更部は設けられず、その代わり、振分部１０７に割当変更部１２７が設けられている。制御部１１２から振分部１０７に対して振分切替契機通知が直接送られており（図１１の［２］）、振分部１０７の割当変更部１２７は、振分切替契機通知を受けて、実ノード振分部１２６に割当変更を指示する（図１１の［３］）。 In the example shown in FIG. 10, the allocation change in the real node allocation unit 126 has been performed in accordance with sequential instructions from the control unit 112, but the allocation unit 107 itself performs the allocation change autonomously. May be. FIG. 11 shows a configuration in which the allocating unit 107 itself changes assignments autonomously. In the configuration shown in FIG. 11, the control unit 112 is not provided with an assignment changing unit, and instead, the distribution unit 107 is provided with an assignment changing unit 127. The distribution switching trigger notification is directly sent from the control unit 112 to the distribution unit 107 ([2] in FIG. 11), and the allocation changing unit 127 of the distribution unit 107 receives the distribution switching trigger notification. Then, the real node allocation unit 126 is instructed to change the allocation ([3] in FIG. 11).

図１２は、特定のパケット処理部にあてたフローを再振分するように構成したパケット処理装置の構成の一例を示している。 FIG. 12 shows an example of a configuration of a packet processing device configured to redistribute a flow assigned to a specific packet processing unit.

図１２に示す装置では、制御部１１２には、図８及び図９に示すものと同様に、情報収集部１４１と振分指示部１４２が設けられている。振分部１０７には、第１振分処理部１２８と第２振分処理部１２９と再振分判断部１３０が設けられている。 In the apparatus shown in FIG. 12, the control unit 112 is provided with an information collection unit 141 and a distribution instruction unit 142, similar to those shown in FIGS. The distribution unit 107 includes a first distribution processing unit 128, a second distribution processing unit 129, and a redistribution determination unit 130.

制御部１１２において情報収集部１４１は、各パケット処理部１０８の情報を収集・管理し、再振分を実施する契機を抽出するものである。情報収集部１４１は、各パケット処理部１０８から情報収集を行い、障害発生やフロー数の偏りなどの変化を検出し（図１２の［１］）、フローの再振分を行うべきかどうかを判断して、各パケット処理部１０８の状況に関する情報とともに再振分契機通知を振分指示部１４２に通知する（図１２の［２］）。振分指示部１４２は、再振分契機通知を受け取ると振分指示部１４２は、再振分契機通知から振分方法（例えば、振分先、ハッシュ関数など）を判断し、第１振分処理部１２８及び第２振分処理部１２９に振分方法（振分アルゴリズム）を指示し（図１２の［３］）、再振分判断部１３０に対して、再振分の対象となるコアＩＤと再振分の開始指示などを送る（図１２の［４］）。 In the control unit 112, the information collection unit 141 collects and manages information of each packet processing unit 108, and extracts a trigger for performing redistribution. The information collecting unit 141 collects information from each packet processing unit 108, detects a change such as occurrence of a failure or a deviation in the number of flows ([1] in FIG. 12), and determines whether or not to redistribute the flow. It judges and notifies the redistribution opportunity notification to the distribution instruction section 142 together with information on the status of each packet processing section 108 ([2] in FIG. 12). When the distribution instruction unit 142 receives the redistribution opportunity notification, the distribution instruction unit 142 determines a distribution method (for example, distribution destination, hash function, etc.) from the redistribution opportunity notification, and the first distribution The distribution unit (distribution algorithm) is instructed to the processing unit 128 and the second distribution processing unit 129 ([3] in FIG. 12), and the redistribution determination unit 130 is a target of redistribution. An ID and re-distribution start instruction are sent ([4] in FIG. 12).

第１振分処理部１２８は、振分指示部１４２からの指示に基づく振分アルゴリズム（振分先、ハッシュ関数など）でパケットデータをフロー単位に各パケット処理機能へ振り分け、そのフローのコアＩＤを対応付けるものであるが、データそのものは再振分判断部１３０に渡す。第２振分処理部１２９は、振分指示部１４２からの指示に基いて、第１振分処理部１２８におけるものとな異なる振分アルゴリズム（振分先、ハッシュ関数など）で、パケットデータをフロー単位に各パケット処理部１０８へ振り分ける機能を有する。再振分判断部１３０は、振分指示部１４２からの指示により再振分を行う状態となっている場合には、第１振分処理部１２８から受け取ったフローについて、そのフローに関連づけられているコアＩＤから、そのフローが再振分の対象となっているパケット処理部１０８に振り分けられているか否かを判断する。再振分の対象となっているパケット処理部に振り分けられたフローでなければ、再振分判断部１３０は、そのフローを、フローに関連付けられたコアＩＤで示すパケット処理部１０８に受け渡す。一方、第１振分処理部１２８から受け取ったフローが再振分の対象となるものである場合には、再振分判断部１３０は、そのフローを第２振分処理部１２９に受け渡す。 The first distribution processing unit 128 distributes packet data to each packet processing function in units of flows using a distribution algorithm (distribution destination, hash function, etc.) based on an instruction from the distribution instruction unit 142, and the core ID of the flow However, the data itself is transferred to the redistribution determination unit 130. Based on the instruction from the distribution instruction unit 142, the second distribution processing unit 129 uses a different distribution algorithm (distribution destination, hash function, etc.) from that in the first distribution processing unit 128 to process the packet data. It has a function of distributing to each packet processing unit 108 in units of flows. When the redistribution determination unit 130 is in a state of performing redistribution in accordance with an instruction from the distribution instruction unit 142, the flow received from the first distribution processing unit 128 is associated with the flow. It is determined whether or not the flow is distributed to the packet processing unit 108 to be redistributed from the core ID. If the flow is not distributed to the packet processing unit to be redistributed, the redistribution determining unit 130 transfers the flow to the packet processing unit 108 indicated by the core ID associated with the flow. On the other hand, when the flow received from the first distribution processing unit 128 is to be redistributed, the redistribution determination unit 130 passes the flow to the second distribution processing unit 129.

図１２に示したパケット処理装置に関し、具体的な動作を説明する。 A specific operation of the packet processing apparatus shown in FIG. 12 will be described.

まず、いずれかのパケット処理部（処理コア）１０８において障害が発生した場合を説明する。第２振分処理部１２９には、あらかじめ、並列処理部１１３内の複数のパケット処理部１０８のうち、１つを除くパケット処理部にフローの振分が行われるような振分関数が組み込まれている。いずれかのパケット処理部１０８での障害発生を制御部１１２が検出すると、再振分の対象とする処理コアを特定しその処理コアを除くように再振分を開始する指示が振分指示部１４２から再振分判断部１３０と第２振分処理部１２９とに送られる。第１振分処理部１２８での振分により障害発生コアに振り分けられたフローは、再振分判断部１３０と第２振分処理部１２９とによって、障害が発生していない処理コアに再振分されることになる。このような再振分は、例えば、障害が発生した処理コアのコアＩＤが保存されていたアドレス番地に、再振分関数を呼び出す命令へのポインタを格納することにより実現できる。 First, a case where a failure occurs in any of the packet processing units (processing cores) 108 will be described. The second distribution processing unit 129 incorporates in advance a distribution function that distributes the flow to a packet processing unit other than one of the plurality of packet processing units 108 in the parallel processing unit 113. ing. When the control unit 112 detects the occurrence of a failure in any one of the packet processing units 108, an instruction to specify a processing core to be redistributed and start redistribution so as to remove the processing core is issued to the distribution instructing unit 142 is sent to the redistribution determination unit 130 and the second distribution processing unit 129. The flow distributed to the faulty core by the distribution in the first distribution processing unit 128 is reassigned to the processing core in which no failure has occurred by the redistribution determination unit 130 and the second distribution processing unit 129. Will be divided. Such redistribution can be realized, for example, by storing a pointer to an instruction for calling a redistribution function at the address where the core ID of the processing core in which the failure has occurred is stored.

次に、いずれかのパケット処理部１０８に対してフローが偏った場合の再振分を説明する。第２振分処理部は、第１振分処理部とは異なるハッシュ関数及びパケットデータの参照範囲などによる振分関数を保持している。再振分の指示が振分指示部１４２から再振分判断部１３０と第２振分処理部１２９とに送られると、偏りが発生した処理コアに割り振られたフローを再振分判断部１３０が検出し、検出したフローを第２振分処理部１２９が再度、振り分けることによって、フローの偏りが是正された振分が行われるようになる。 Next, redistribution when the flow is biased with respect to any one of the packet processing units 108 will be described. The second distribution processing unit holds a hash function different from the first distribution processing unit and a distribution function based on a reference range of packet data. When the redistribution instruction is sent from the distribution instruction unit 142 to the redistribution determination unit 130 and the second distribution processing unit 129, the flow allocated to the processing core in which the bias has occurred is redistributed determination unit 130. Is detected, and the second distribution processing unit 129 distributes the detected flow again, so that the distribution with the corrected flow deviation is performed.

図１３は、図８に示したパケット処理装置と同様のものであるが、振分先変更を行う場合に、その変更後に各フローがどのパケット処理部１０８に振り分けられるかを予測し、予測結果に基づき、変更後の各フローのパケット処理に必要なデータを各パケット処理部１０８に予め保持させる切替を行うように構成したパケット処理装置の構成の一例を示している。 FIG. 13 is the same as the packet processing apparatus shown in FIG. 8, but when changing the distribution destination, predicts which packet processing unit 108 each flow is allocated to after the change, and the prediction result 1 shows an example of a configuration of a packet processing device configured to perform switching so that data necessary for packet processing of each flow after change is stored in advance in each packet processing unit 108.

制御部１１２は、まず、データ保持部１０９に保持されているフロー情報を参照し、制御部１１２において管理している振分アルゴリズムと、振分先となるパケット処理部１０８の数とから、振分アルゴリズムの変更または振分先となるパケット処理部１０８の数の変更により、各フローがどのパケット処理部１０８に振り分けられるようになるかを予測する。そして、制御部１１２は、振分部１１２に対して振分アルゴリズムの変更の実施を指示する前に、各パケット処理部１０８に対し、変更後に処理することになる各フローのパケット処理に必要なデータ（特定データ）を予め保持するよう指示する。その結果、各パケット処理部１０８は、データ保持部１０９から事前の情報取得を行う。 First, the control unit 112 refers to the flow information held in the data holding unit 109, and determines the allocation algorithm managed by the control unit 112 and the number of packet processing units 108 that are the allocation destinations. It is predicted to which packet processing unit 108 each flow will be distributed by changing the minute algorithm or changing the number of packet processing units 108 to be distributed. The control unit 112 is necessary for packet processing of each flow to be processed after the change to each packet processing unit 108 before instructing the distribution unit 112 to change the distribution algorithm. Instructs to hold data (specific data) in advance. As a result, each packet processing unit 108 acquires prior information from the data holding unit 109.

また、制御部１１２は、時間変動振分や再振分を実行する際に、そのような振分切替によって同時に振分先が変更されることとなるフローの数が多くならないような振分指示を行い、それに合わせて各パケット処理部１０８に特定データの取得指示を行うことで、パケット処理部１０８が予め保持しなければならないデータ量を削減することができ、パケット処理部１０８のキャッシュの必要サイズを削減することができる。 In addition, when executing the time-variable distribution or the re-distribution, the control unit 112 performs the distribution instruction so that the number of flows whose distribution destinations are simultaneously changed by such distribution switching is not increased. And instructing each packet processing unit 108 to acquire specific data accordingly can reduce the amount of data that the packet processing unit 108 must hold in advance, and the packet processing unit 108 needs to be cached. The size can be reduced.

以上、本発明の実施形態について、パケット処理が単一のプロセッサ内の複数の処理コア（パケット処理部）によて行われる場合を例に挙げて説明した。しかしながら、本発明の実施の形態は上記のものに限られるのものではない。例えば、図１４に示すように、パケット処理装置内に複数のプロセッサを設けてこれらの複数のプロセッサにおいてパケット処理を並列に実行するようにした場合にも、本発明の並列パケット処理方法を適用することができる。図１４に示したパケット処理装置１５１は、図４に示したパケット処理装置における処理コア（パケット処理部１０８）をそれぞれパケット処理用プロセッサ１５３に置き換え、また、振分部１０７及び制御部１１２についても、それぞれ独立のプロセッサ（振分用プロセッサ１５２及び制御用プロセッサ１５５）に置き換えたものである。この構成は、マルチプロセッサ構成のものであり、内部バス１０６及びメモリバス１１０は、いわゆるプロセッサ内バスではなく、プロセッサ間バスとして、パケット処理装置１５１内に設けられることになる。また、データ保持部１５４は、複数のプロセッサ１５３に対して共通の共有メモリとして設けられている。さらに、本発明の並列パケット処理方法は、ネットワークに設けられた複数のノードに分散させてパケット処理を行う場合にも適用できる。図１５は、ネットワークを含むシステムとして構成されたパケット処理システムを示している。図１５に示すシステムでは、ネットワーク１６１に対して、振分ノード１６２と、複数のパケット処理ノード１６３と、データサーバ１６４と、制御ノード１６５とが設けられている。パケット処理の対象となるパケットはネットワーク１６１に対して、直接、入出力している。ここで、振分ノード１６２は振分部１０７と同様の処理を行い、パケット処理ノード１６３はパケット処理部１０８と同様の処理を行い、データサーバ１６４はデータ保持部１０９と同等の機能を有し、制御ノード１６５は制御部１１２と同様の機能を有している。図１５に示すようなシステムも、本発明の並列パケット処理装置の範疇に含まれるものである。 As described above, the embodiment of the present invention has been described by taking as an example a case where packet processing is performed by a plurality of processing cores (packet processing units) in a single processor. However, the embodiment of the present invention is not limited to the above. For example, as shown in FIG. 14, the parallel packet processing method of the present invention is applied even when a plurality of processors are provided in the packet processing apparatus and the packet processing is executed in parallel by the plurality of processors. be able to. The packet processing device 151 shown in FIG. 14 replaces the processing core (packet processing unit 108) in the packet processing device shown in FIG. 4 with the packet processing processor 153, and the distribution unit 107 and the control unit 112 also. These are replaced with independent processors (distribution processor 152 and control processor 155). This configuration is of a multiprocessor configuration, and the internal bus 106 and the memory bus 110 are provided in the packet processing device 151 as interprocessor buses rather than so-called intraprocessor buses. The data holding unit 154 is provided as a common shared memory for the plurality of processors 153. Furthermore, the parallel packet processing method of the present invention can also be applied to packet processing distributed to a plurality of nodes provided in a network. FIG. 15 shows a packet processing system configured as a system including a network. In the system shown in FIG. 15, a distribution node 162, a plurality of packet processing nodes 163, a data server 164, and a control node 165 are provided for the network 161. Packets subject to packet processing are directly input / output from / to the network 161. Here, the distribution node 162 performs the same process as the distribution unit 107, the packet processing node 163 performs the same process as the packet processing unit 108, and the data server 164 has the same function as the data holding unit 109. The control node 165 has the same function as the control unit 112. A system as shown in FIG. 15 is also included in the category of the parallel packet processing apparatus of the present invention.

１０１，１５１パケット処理装置
１０２，１０３インタフェース部
１０４パケット入力部
１０５パケット出力部
１０６内部バス
１０７振分部
１０８パケット処理部
１０９，１５４データ保持部
１１０メモリバス
１１１プロセッサ
１１２制御部
１１３並列処理部
１２１振分切替部
１２２，１２８第１振分処理部
１２３，１２９第２振分処理部
１２４振分変更部
１２５仮想ノード振分部
１２６実ノード振分部
１２７，１４３割当変更部
１３０再振分判断部
１４１情報収集部
１４２振分指示部
１５２振分用プロセッサ
１５３パケット処理用プロセッサ
１５５制御用プロセッサ
１６１ネットワーク
１６２振分ノード
１６３パケット処理ノード
１６４データサーバ
１６５制御ノード 101, 151 Packet processing device 102, 103 Interface unit 104 Packet input unit 105 Packet output unit 106 Internal bus 107 Distribution unit 108 Packet processing unit 109, 154 Data holding unit 110 Memory bus 111 Processor 112 Control unit 113 Parallel processing unit 121 Minute switching unit 122, 128 First distribution processing unit 123, 129 Second distribution processing unit 124 Distribution change unit 125 Virtual node distribution unit 126 Real node distribution unit 127, 143 Allocation change unit 130 Redistribution determination unit 141 Information Collecting Unit 142 Distribution Instruction Unit 152 Distribution Processor 153 Packet Processing Processor 155 Control Processor 161 Network 162 Distribution Node 163 Packet Processing Node 164 Data Server 165 Control Node

Claims

A parallel packet processing method that distributes flows to a plurality of packet processing means and executes packet processing on the flows in parallel between the plurality of packet processing means,
Preparing different first and second distribution algorithms for distributing the flows to the plurality of packet processing means;
From the state where the first distribution algorithm is applied to the flow, the probability that the second distribution algorithm is applied to the flow for each flow is increased with the passage of time, and the first 2. A parallel packet processing method for switching to a state in which the distribution algorithm of 2 is applied to the flow.

2. The number of the packet processing means that is a distribution destination by the first distribution algorithm is different from the number of the packet processing means that is a distribution destination by the second distribution algorithm. Parallel packet processing method.

A parallel packet processing method that distributes flows to a plurality of packet processing means and executes packet processing on the flows in parallel between the plurality of packet processing means,
Preparing different first and second distribution algorithms for distributing the flows to the plurality of packet processing means;
The flow is distributed to the plurality of packet processing means using the first distribution algorithm, and the second flow is assigned to the flow that has been distributed to a specific packet processing means among the distributed flows. A parallel packet processing method that executes redistribution by applying a distribution algorithm.

The parallel packet processing method according to claim 3, wherein the second distribution algorithm is a distribution algorithm that distributes a flow to the plurality of packet processing units excluding the specific packet processing unit.

Before changing in the distribution algorithm, determine which of the plurality of packet processing means each flow is changed by the change,
6. The information processing apparatus according to any one of claims 1 to 5, wherein data necessary for processing of each changed flow is held in advance in a corresponding packet processing unit based on a result of the determination, and then the change is performed. The parallel packet processing method described.

A parallel packet processing method that distributes flows to a plurality of packet processing means and executes packet processing on the flows in parallel between the plurality of packet processing means,
The distribution algorithm for distributing the flow to the plurality of packet processing means can be changed,
Before changing the distribution algorithm, determine which of the plurality of packet processing means will change each flow by the change,
A parallel packet processing method in which data necessary for the processing of each changed flow is held in advance in a corresponding packet processing unit based on the determination result, and then the change is performed.

A plurality of packet processing means for performing packet processing on the flow;
Distribution means for distributing flows to the plurality of packet processing means by different first and second distribution algorithms;
From the state in which the first distribution algorithm is applied to the flow, the probability that the second distribution algorithm is applied to the flow for each flow is increased with the passage of time. Control means for controlling the allocating means so as to switch to a state in which the allocating algorithm is applied to the flow;
A parallel packet processing apparatus.

The number of the packet processing means that is the distribution destination by the first distribution algorithm is different from the number of the packet processing means that is the distribution destination by the second distribution algorithm, and the control means is The parallel packet processing device according to claim 7, wherein the parallel packet processing device is configured to instruct the distribution unit to switch the state when the parallel number is changed in the plurality of packet processing units.

A plurality of packet processing means for performing packet processing on the flow;
Distribution means for distributing the flow to the plurality of packet processing means by a first distribution algorithm;
Control means for controlling the allocating means;
With
The distribution means includes a redistribution determination unit that determines whether or not redistribution is necessary based on a distribution destination of the flow with respect to a flow distributed by applying the first distribution algorithm. When it is determined that distribution is necessary, the flow is redistributed to the plurality of packet processing means by a second distribution algorithm different from the first distribution algorithm,
The parallel packet processing device, wherein the control means instructs the distribution means to redistribute a flow that has been allocated to a specific packet processing means by the first distribution algorithm.

The parallel packet processing device according to claim 9, wherein the second distribution algorithm is a distribution algorithm that distributes a flow to the plurality of packet processing units excluding the specific packet processing unit.

The control means holds distribution algorithm information, the number of packet processing means to be distributed and each flow information, so that each flow is processed by any packet processing means after the distribution algorithm is changed. And determining whether to change, and instructing each of the packet processing means to hold necessary data in advance based on the result of the determination, and then controlling the distribution means to change the distribution destination. The parallel packet processing device according to any one of 7 to 10.

A plurality of packet processing means for performing packet processing on the flow;
A distribution means for distributing the flow to the plurality of packet processing means by a changeable distribution algorithm;
Holds distribution algorithm information, the number of packet processing means to be distributed and each flow information, and determines which packet processing means each flow changes after the distribution algorithm is changed And, after instructing each packet processing means to hold necessary data in advance based on the result of the determination, a control means for controlling the distribution means to change the distribution destination;
A parallel packet processing apparatus.