JP2018036952A

JP2018036952A - Distribution device and distribution method

Info

Publication number: JP2018036952A
Application number: JP2016170877A
Authority: JP
Inventors: 山田　剛史; Takashi Yamada; 剛史山田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2016-09-01
Filing date: 2016-09-01
Publication date: 2018-03-08
Anticipated expiration: 2036-09-01
Also published as: JP6646338B2

Abstract

PROBLEM TO BE SOLVED: To provide a distribution DB system with high performance by reducing communication between DBs.SOLUTION: There is provided a distribution DB system in which an acquisition part 15a stores pieces of data belonging to a same community in a same DB, and distributes pieces of data belonging to different communities into plural DBs and stores the same and acquires a communication traffic volume following to request of data to the DB from each community, a determination part 15c determines a combination of communities stored in the same DB, in an order in which the communication traffic volume is large among communities. In addition, a creation part 15b creates a community sub group by combining the plural communities, reduces variation of the communication traffic volume from each community sub group in all community sub groups, and the determination part 15cc may determine a combination of the communities stored in the same DB in an order in which the communication traffic volume is large among the created community sub groups.SELECTED DRAWING: Figure 3

Description

本発明は、分散装置および分散方法に関する。 The present invention relates to a dispersion apparatus and a dispersion method.

一般に、大規模なデータを複数のデータベース（ＤＢ）に分散させて収容する分散ＤＢシステムが知られている。例えば、ユーザの要求に対する処理を要求ごとに所定のＷｅｂサーバが実行するシステムにおいて、各ＷｅｂサーバのＤＢに各要求の処理に必要なデータが収容される（特許文献１参照）。 In general, a distributed DB system is known in which large-scale data is distributed and accommodated in a plurality of databases (DB). For example, in a system in which a predetermined Web server executes a process for a user request for each request, data necessary for processing each request is accommodated in the DB of each Web server (see Patent Document 1).

分散ＤＢシステムの中には、ユーザが処理を要求したサーバのＤＢに、要求の処理に必要なデータが収容されていない場合がある。その場合に、ユーザが要求したサーバと所望のデータが収容されているサーバとの間で連携してＤＢ間通信を行って所望のデータを取得する必要がある。例えば、電話サービスにおいて、都道府県別に用意されたＤＢに加入者のデータが収容されている場合に、都道府県間通話の際にはＤＢ間通信により宛先ユーザのデータが取得される。 In a distributed DB system, there is a case where data necessary for processing a request is not accommodated in a DB of a server from which a user has requested processing. In that case, it is necessary to acquire the desired data by performing inter-DB communication in cooperation between the server requested by the user and the server in which the desired data is accommodated. For example, in the telephone service, when subscriber data is accommodated in a DB prepared for each prefecture, the data of the destination user is acquired by inter-DB communication during a call between prefectures.

特開２０１４−０９６１１３号公報JP 2014-096113 A

しかしながら、ＤＢ間の通信量が増加して輻輳すると、要求から処理結果の出力までの遅延すなわちＴＡＴ（Turn Around Time）遅延や処理失敗等が発生する恐れがあった。 However, when the amount of communication between DBs increases and congestion occurs, a delay from a request to the output of a processing result, that is, a TAT (Turn Around Time) delay or a processing failure may occur.

本発明は、上記に鑑みてなされたものであって、ＤＢ間通信を低減して高効率な分散ＤＢシステムを提供することを目的とする。 The present invention has been made in view of the above, and an object of the present invention is to provide a highly efficient distributed DB system by reducing inter-DB communication.

上述した課題を解決し、目的を達成するために、本発明に係る分散装置は、同一のコミュニティに属するデータを同一のデータベースに収容するとともに、異なるコミュニティに属するデータを複数のデータベースに分散して収容するデータベースシステムにおいて、各コミュニティからデータベースに対するデータの要求に伴う通信量を取得する取得部と、取得された前記通信量のうち、前記コミュニティ間での通信量が多い順に、同一のデータベースに収容するコミュニティの組み合わせを決定する決定部と、を備えることを特徴とする。 In order to solve the above-described problems and achieve the object, the distribution device according to the present invention accommodates data belonging to the same community in the same database and distributes data belonging to different communities to a plurality of databases. In the database system to be accommodated, an acquisition unit that acquires a communication amount associated with a data request from each community to the database, and the acquired communication amount, the communication amount between the communities is stored in the same database in descending order. And a determining unit that determines a combination of communities to perform.

本発明によれば、ＤＢ間通信を低減して高効率な分散ＤＢシステムを提供することが可能となる。 According to the present invention, it is possible to provide a highly efficient distributed DB system by reducing inter-DB communication.

図１は、本発明の一実施形態に係る分散装置の処理対象を説明するための説明図である。FIG. 1 is an explanatory diagram for explaining a processing target of a distribution apparatus according to an embodiment of the present invention. 図２は、本実施形態の分散装置の処理概要を説明するための説明図である。FIG. 2 is an explanatory diagram for explaining an outline of processing of the distribution apparatus according to the present embodiment. 図３は、本実施形態に係る分散装置の概略構成を示す模式図である。FIG. 3 is a schematic diagram illustrating a schematic configuration of the dispersion apparatus according to the present embodiment. 図４は、作成部の処理について説明するための説明図である。FIG. 4 is an explanatory diagram for explaining the processing of the creation unit. 図５は、作成部の処理について説明するための説明図である。FIG. 5 is an explanatory diagram for explaining processing of the creation unit. 図６は、決定部の処理について説明するための説明図である。FIG. 6 is an explanatory diagram for explaining the processing of the determination unit. 図７は、決定部の処理について説明するための説明図である。FIG. 7 is an explanatory diagram for explaining the processing of the determination unit. 図８は、分散処理の効果を説明するための説明図である。FIG. 8 is an explanatory diagram for explaining the effect of the distributed processing. 図９は、本実施形態の分散処理手順を示すフローチャートである。FIG. 9 is a flowchart showing the distributed processing procedure of this embodiment. 図１０は、分散プログラムを実行するコンピュータを例示する図である。FIG. 10 is a diagram illustrating a computer that executes a distributed program.

以下、図面を参照して、本発明の一実施形態を詳細に説明する。なお、この実施形態により本発明が限定されるものではない。また、図面の記載において、同一部分には同一の符号を付して示している。 Hereinafter, an embodiment of the present invention will be described in detail with reference to the drawings. In addition, this invention is not limited by this embodiment. Moreover, in description of drawing, the same code | symbol is attached | subjected and shown to the same part.

［分散装置の処理概要］
まず、図１を参照して、本実施形態に係る分散装置の処理対象について説明する。本実施形態において、分散装置は、例えば電話サービスの加入者のデータを、加入者の属する都道府県等のコミュニティ単位で複数のＤＢに分散して収容する分散ＤＢシステムを処理対象とする。図１に例示する分散ＤＢシステム１では、コミュニティＡ、コミュニティＢおよびコミュニティＣに属するデータがＤＢ＃１に収容され、コミュニティＤ、コミュニティＥおよびコミュニティＦに属するデータがＤＢ＃２に収容されている。 [Distributed device processing overview]
First, a processing target of the distribution apparatus according to this embodiment will be described with reference to FIG. In this embodiment, for example, the distributed device targets a distributed DB system that accommodates telephone service subscriber data distributed in a plurality of DBs in units of communities such as prefectures to which the subscriber belongs. In the distributed DB system 1 illustrated in FIG. 1, data belonging to community A, community B, and community C is accommodated in DB # 1, and data belonging to community D, community E, and community F is accommodated in DB # 2. .

ここで、コミュニティＡに属する加入者がコミュニティＢに属する加入者宛に発呼した場合に取得される宛先の加入者のデータは、コミュニティＡに属するデータＡと同一のＤＢ＃１に収容されている。このように、コミュニティＡからコミュニティＢに属するデータＢが要求された場合、所望のデータＢは、要求元のコミュニティＡが要求するＤＢ＃１と同一のＤＢに収容されているため、図１に破線で示すようにＤＢ＃１内におけるＤＢ内通信により取得される。 Here, when the subscriber belonging to the community A makes a call to the subscriber belonging to the community B, the data of the destination subscriber acquired is accommodated in the same DB # 1 as the data A belonging to the community A. Yes. Thus, when the data B belonging to the community B is requested from the community A, the desired data B is accommodated in the same DB as the DB # 1 requested by the requesting community A. As shown by the broken line, it is acquired by intra-DB communication in DB # 1.

一方、コミュニティＣに属する加入者がコミュニティＤに属する加入者を宛先にして発呼した場合等、コミュニティＣからコミュニティＤに属するデータが要求された場合、所望のデータＤはコミュニティＣが要求するＤＢ＃１とは異なるＤＢ＃２に収容されている。したがって、所望のデータＤは、図１に一点鎖線で示すように、ＤＢ＃１とＤＢ＃２との間で連携してＤＢ間通信を行うことにより取得される。 On the other hand, when data belonging to the community D is requested from the community C, such as when a subscriber belonging to the community C makes a call with a subscriber belonging to the community D as a destination, the desired data D is the DB requested by the community C. It is stored in DB # 2, which is different from # 1. Therefore, the desired data D is acquired by performing inter-DB communication in cooperation between DB # 1 and DB # 2, as indicated by a one-dot chain line in FIG.

このようなＤＢ間通信が行われている間には、要求元の処理が待たされる。また、ＤＢ間の通信量が増えて輻輳すると、ＴＡＴ遅延が発生したり処理が失敗に終わったりする恐れが生じる。そこで、本実施形態の分散装置は、コミュニティ間での通信量が多いコミュニティ同士のデータを同一のＤＢに収容するように、同一のＤＢにデータを収容するコミュニティの組み合わせを決定することにより、ＤＢ間通信を低減する。なお、コミュニティには、都道府県の他、市、会社、または部門等が例示される。 While such inter-DB communication is performed, the request source process is awaited. Further, when the communication amount between DBs increases and congestion occurs, there is a possibility that a TAT delay occurs or the process ends in failure. Therefore, the distributed apparatus according to the present embodiment determines the combination of communities that store data in the same DB so that the data of communities having a large amount of communication between communities is stored in the same DB. Reduce intercommunication. The community is exemplified by a city, a company, a department, etc. in addition to a prefecture.

次に、図２を参照して、本実施形態に係る分散装置の処理概要について説明する。図２（ａ）は、各コミュニティのコミュニティ内でのデータの要求に伴う通信量およびコミュニティ間でのデータの要求に伴う通信量を示す。図２（ａ）において、要求元のコミュニティが列方向に示され、要求先のコミュニティが行方向に示されている。図２（ａ）には、例えば、コミュニティＡのコミュニティ内通信量が１であり、コミュニティＡからコミュニティＣへのコミュニティ間通信量が８であることが例示されている。 Next, with reference to FIG. 2, an outline of processing of the distribution apparatus according to the present embodiment will be described. FIG. 2A shows the amount of communication associated with a request for data within the community of each community and the amount of communication associated with the request for data between communities. In FIG. 2A, the requesting communities are shown in the column direction, and the requesting communities are shown in the row direction. FIG. 2A illustrates, for example, that the intra-community traffic volume of the community A is 1 and the inter-community traffic volume from the community A to the community C is 8.

ここで、ＤＢ内通信には、コミュニティ内での通信と同一ＤＢに収容されたコミュニティ間での通信とが含まれる。図２（ｂ）は、図２（ａ）に例示された通信量のうち、各コミュニティのＤＢ内通信になり得る通信量を例示している。例えば、コミュニティＡについて、ＤＢ内通信になり得る通信量は、図２（ａ）の要求元がコミュニティＡの行の通信量を合計し、１＋１＋８＋７＋７＋２＝２６と集計される。 Here, intra-DB communication includes communication within a community and communication between communities accommodated in the same DB. FIG. 2B illustrates the amount of communication that can be the intra-DB communication of each community out of the amount of communication illustrated in FIG. For example, for the community A, the amount of communication that can be in-DB communication is totaled as 1 + 1 + 8 + 7 + 7 + 2 = 26, where the request source in FIG.

また、ＤＢ間通信には、異なるＤＢに収容されたコミュニティ間での通信が含まれる。図２（ｃ）は、図２（ａ）に例示された通信量のうち、各コミュニティ間での通信量を例示している。すなわち、図２（ｃ）は、ＤＢ間通信になり得る通信量を例示している。例えば、コミュニティＡとコミュニティＣとの交点におけるコミュニティ間通信量は、上述したように８である。 The inter-DB communication includes communication between communities housed in different DBs. FIG. 2C illustrates the amount of communication between the communities among the amount of communication illustrated in FIG. That is, FIG. 2C illustrates the amount of communication that can be communication between DBs. For example, the inter-community traffic at the intersection of the community A and the community C is 8 as described above.

なお、分散ＤＢシステム１が稼働する以上、ＤＢ内通信を低減することはできない。ただし、ＤＢ内通信がいずれかのＤＢに偏って発生することにより、ＤＢ内通信の輻輳が発生する恐れがある。そこで、本実施形態の分散装置は、まず、図２（ｂ）に例示したＤＢ内通信になり得る通信量のばらつきを低減するように、コミュニティを組み合わせたコミュニティサブグループを作成する。 As long as the distributed DB system 1 operates, intra-DB communication cannot be reduced. However, if the intra-DB communication is biased to one of the DBs, there is a possibility that the intra-DB communication may be congested. Therefore, the distribution apparatus according to the present embodiment first creates community subgroups in which communities are combined so as to reduce the variation in the amount of communication that can be the intra-DB communication illustrated in FIG.

次に、分散装置は、各通信量が平準化されたコミュニティサブグループについて、図２（ｃ）と同様に、ＤＢ間通信になり得るコミュニティサブグループ間通信量を集計し、コミュニティサブグループ間通信量が多い順にコミュニティサブグループを組み合わせる。これにより、コミュニティ間での通信量が多いコミュニティ同士のデータを同一のＤＢに収容して、ＤＢ間通信を低減する。 Next, for the community subgroups in which each traffic is leveled, the distributed device aggregates the traffic between the community subgroups that can be inter-DB communication as in FIG. Combine community subgroups in descending order. Thereby, the data of communities with much communication volume between communities are accommodated in the same DB, and communication between DB is reduced.

［分散装置の構成］
次に、図３を参照して、本実施形態に係る分散装置の概略構成を説明する。図３に示すように、本実施形態に係る分散装置１０は、ワークステーションやパソコン等の汎用コンピュータで実現され、入力部１１と出力部１２と通信制御部１３と、記憶部１４と、制御部１５とを備える。この分散装置１０は、後述する分散処理を実行して、コミュニティ間の通信量が多いコミュニティ同士のデータを同一のＤＢに収容するように、同一のＤＢにデータを収容するコミュニティの組み合わせを決定する。 [Distributed device configuration]
Next, a schematic configuration of the dispersion apparatus according to the present embodiment will be described with reference to FIG. As illustrated in FIG. 3, the distributed apparatus 10 according to the present embodiment is realized by a general-purpose computer such as a workstation or a personal computer, and includes an input unit 11, an output unit 12, a communication control unit 13, a storage unit 14, and a control unit. 15. The distribution apparatus 10 executes a distribution process described later, and determines a combination of communities that store data in the same DB so that data of communities having a large amount of communication between communities is stored in the same DB. .

入力部１１は、キーボードやマウス等の入力デバイスを用いて実現され、操作者による入力操作に対応して、制御部１５に対して各種指示情報を入力する。出力部１２は、液晶ディスプレイなどの表示装置、プリンター等の印刷装置、情報通信装置等によって実現され、後述する分散処理の結果等を操作者に対して出力する。 The input unit 11 is realized using an input device such as a keyboard or a mouse, and inputs various instruction information to the control unit 15 in response to an input operation by the operator. The output unit 12 is realized by a display device such as a liquid crystal display, a printing device such as a printer, an information communication device, and the like, and outputs a result of distributed processing described later to the operator.

通信制御部１３は、ＮＩＣ（Network Interface Card）等で実現され、ＬＡＮ（Local Area Network）やインターネットなどの電気通信回線を介した管理サーバ等の外部の装置と制御部１５との通信を制御する。 The communication control unit 13 is realized by a NIC (Network Interface Card) or the like, and controls communication between an external device such as a management server and the control unit 15 via a telecommunication line such as a LAN (Local Area Network) or the Internet. .

記憶部１４は、ＲＡＭ（Random Access Memory）、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。記憶部１４には、分散装置１０を動作させる処理プログラムや、処理プログラムの実行中に使用されるデータなどが予め記憶され、あるいは処理の都度一時的に記憶される。記憶部１４は、通信制御部１３を介して制御部１５と通信する構成でもよい。 The storage unit 14 is realized by a semiconductor memory device such as a RAM (Random Access Memory) or a flash memory, or a storage device such as a hard disk or an optical disk. The storage unit 14 stores in advance a processing program for operating the distribution apparatus 10, data used during the execution of the processing program, or temporarily stored for each processing. The storage unit 14 may be configured to communicate with the control unit 15 via the communication control unit 13.

制御部１５は、ＣＰＵ（Central Processing Unit）等の演算処理装置がメモリに記憶された処理プログラムを実行することにより、図３に例示するように、取得部１５ａ、作成部１５ｂおよび決定部１５ｃとして機能する。 As illustrated in FIG. 3, the control unit 15 performs an acquisition unit 15 a, a creation unit 15 b, and a determination unit 15 c by executing a processing program stored in a memory by an arithmetic processing unit such as a CPU (Central Processing Unit). Function.

取得部１５ａは、同一のコミュニティに属するデータを同一のＤＢに収容するとともに、異なるコミュニティに属するデータを複数のＤＢに分散して収容する分散ＤＢシステム１において、各コミュニティからＤＢに対するデータの要求に伴う通信量を取得する。 In the distributed DB system 1 that stores data belonging to the same community in the same DB and distributes data belonging to different communities to a plurality of DBs, the acquisition unit 15a responds to a request for data from each community to the DB. Get the accompanying traffic.

具体的に、取得部１５ａは、例えば分散ＤＢシステム１の管理サーバ等から入力部１１または通信制御部１３を介して、各コミュニティのコミュニティ内でのデータの要求に伴う通信量およびコミュニティ間でのデータの要求に伴う通信量（図２参照）を取得する。 Specifically, the acquisition unit 15a receives, for example, the amount of communication associated with a request for data within the community of each community from the management server of the distributed DB system 1 via the input unit 11 or the communication control unit 13 and between the communities. The amount of communication accompanying the data request (see FIG. 2) is acquired.

作成部１５ｂは、複数のコミュニティを組み合わせてコミュニティサブグループを作成し、取得された通信量（図２参照）のうち、各コミュニティサブグルーブからの通信量の全コミュニティサブグループ内でのばらつきを低減させる。 The creation unit 15b creates a community subgroup by combining a plurality of communities, and reduces variation in the amount of traffic from each community subgroup among all the community subgroups among the obtained traffic (see FIG. 2). Let

具体的に、図４および図５を参照して、作成部１５ｂの処理について説明する。作成部１５ｂは、図４に例示するように、以下に説明するグルーピング法を適用して、各コミュニティサブグループの通信量のばらつきが低減するように、複数のコミュニティを組み合わせてコミュニティサブグループを作成する。 Specifically, the processing of the creation unit 15b will be described with reference to FIGS. As illustrated in FIG. 4, the creation unit 15b applies a grouping method described below to create a community subgroup by combining a plurality of communities so as to reduce variation in the traffic of each community subgroup. To do.

図５は、グルーピング法を説明するための説明図である。ここでは、図５（ａ）に例示するように、昇順にソートしたグルーピング法の対象データのデータサイズが、ｙ＝ｆ（ｘ）で表され、標準偏差が０．８８である場合について説明する。グルーピング法では、図５（ｂ）に例示するように、データサイズの差分値の重心を求め、データサイズの差分値が重心より小さいデータのデータ数と重心より大きいデータとのデータ数との比率を算出する。図５（ｂ）に示す例では、重心より小さいデータのデータ数と重心より大きいデータのデータ数との比が２６．４対１３．６、すなわち２対１であった。 FIG. 5 is an explanatory diagram for explaining the grouping method. Here, as illustrated in FIG. 5A, a case where the data size of the target data of the grouping method sorted in ascending order is represented by y = f (x) and the standard deviation is 0.88 will be described. . In the grouping method, as illustrated in FIG. 5B, the centroid of the difference value of the data size is obtained, and the ratio between the number of data of the data whose difference value of the data size is smaller than the centroid and the number of data of the data larger than the centroid. Is calculated. In the example shown in FIG. 5B, the ratio of the number of data pieces of data smaller than the centroid and the number of data pieces of data larger than the centroid is 26.4 to 13.6, that is, 2 to 1.

その場合に、グルーピング法では、図５（ｃ）に例示するように、重心より小さいデータのうちの２つと重心より大きいデータのうちの１つとの３つのデータを組み合わせたサブグループを作成する。これにより、各サブグループのデータサイズを平準化することができる。図５（ｃ）に示す例では、各サブグループのデータサイズｙが大凡ａと定数になり、標準偏差が０．１２になった。このように、グルーピング法によれば、データサイズのばらつきを低減するように、サブグループを作成することができる。 In that case, in the grouping method, as illustrated in FIG. 5C, a subgroup is created by combining three data of two pieces of data smaller than the centroid and one piece of data larger than the centroid. Thereby, the data size of each subgroup can be equalized. In the example shown in FIG. 5C, the data size y of each subgroup is approximately a constant with a, and the standard deviation is 0.12. Thus, according to the grouping method, subgroups can be created so as to reduce variation in data size.

決定部１５ｃは、取得された通信量（図２参照）のうち、作成されたコミュニティサブグループ間での通信量が多い順に、同一のＤＢに収容するコミュニティの組み合わせを決定する。 The determining unit 15c determines the combination of communities accommodated in the same DB in the descending order of the traffic volume between the created community subgroups among the acquired traffic volume (see FIG. 2).

具体的に、図６および図７を参照して、決定部１５ｃの処理について説明する。決定部１５ｃは、図６に例示するように、作成部１５ｂが作成したコミュニティサブグループを対象に、コミュニティサブグループ間通信量を集計する。また、決定部１５ｃは、コミュニティサブグループ間通信量が多い順に、要求元および要求先の２つのコミュニティサブグループの組み合わせを同一のＤＢに収容するコミュニティグループとして決定する。 Specifically, the processing of the determination unit 15c will be described with reference to FIGS. As illustrated in FIG. 6, the determination unit 15 c totals the communication amount between community subgroups for the community subgroup created by the creation unit 15 b. In addition, the determination unit 15c determines a combination of two community subgroups of the request source and the request destination as community groups accommodated in the same DB in descending order of the traffic volume between the community subgroups.

図６に示す例では、コミュニティサブグループ１からコミュニティサブグループ２への通信量が最多であることから、コミュニティサブグループ１とコミュニティサブグループ２とは同一のＤＢに収容されることが決定する。決定されたこのコミュニティサブグループ１とコミュニティサブグループ２とを組み合わせたコミュニティグループをコミュニティグループＡとする。 In the example illustrated in FIG. 6, since the communication amount from the community subgroup 1 to the community subgroup 2 is the largest, it is determined that the community subgroup 1 and the community subgroup 2 are accommodated in the same DB. A community group obtained by combining the determined community subgroup 1 and community subgroup 2 is referred to as a community group A.

また、組み合わせが決定したコミュニティサブグループ１およびコミュニティサブグループ２を除いて、次に通信量が多いのは、コミュニティサブグループ４からコミュニティグループ３への通信量であることが例示されている。したがって、コミュニティサブグループ３とコミュニティサブグループ４とを同一のＤＢに収容することが決定する。決定されたコミュニティサブグループ３とコミュニティサブグループ４とを組み合わせたコミュニティグループをコミュニティグループＢとする。 In addition, except for community subgroup 1 and community subgroup 2 that have been determined to be combined, the next largest amount of communication is exemplified by the amount of communication from community subgroup 4 to community group 3. Therefore, it is determined that the community subgroup 3 and the community subgroup 4 are accommodated in the same DB. A community group in which the determined community subgroup 3 and community subgroup 4 are combined is referred to as a community group B.

決定部１５ｃは、図７に例示するように、決定したコミュニティグループを対象に、図６に示した例と同様に、コミュニティグループ間での通信量が多い順に、２つのコミュニティグループの組み合わせを同一のＤＢに収容するコミュニティグループとして決定する。決定部１５ｃは、この処理を繰り返し、分散させるＤＢの数のコミュニティグループを作成する。 As illustrated in FIG. 7, the determination unit 15 c targets the determined community group, and in the same manner as in the example illustrated in FIG. 6, the combination of the two community groups is the same in descending order of the amount of communication between the community groups. It is determined as a community group accommodated in the DB. The determination unit 15c repeats this process to create community groups for the number of DBs to be distributed.

図７に示す例では、コミュニティサブグループ１とコミュニティサブグループ２とを組み合わせてコミュニティグループＡとする。また、コミュニティサブグループ３とコミュニティサブグループ４とを組み合わせてコミュニティグループＢとする。そして、一次組として決定したコミュニティグループＡとコミュニティグループＢとの間の通信量が最多であることから、コミュニティグループＡとコミュニティグループＢとを同一のＤＢに収容するものと決定する。同様に、コミュニティグループＤとコミュニティグループＥとを同一のＤＢに収容するものと決定する。 In the example illustrated in FIG. 7, community subgroup 1 and community subgroup 2 are combined to form community group A. Also, community subgroup 3 and community subgroup 4 are combined to form community group B. Then, since the communication amount between the community group A and the community group B determined as the primary group is the largest, it is determined that the community group A and the community group B are accommodated in the same DB. Similarly, it is determined that the community group D and the community group E are accommodated in the same DB.

また、決定部１５ｃは、決定結果を適当な形式で出力部１２に出力する。例えば、各ＤＢに収容されるコミュニティを一覧可能に表示する。あわせて、各ＤＢのＤＢ間通信量を表示してもよい。 Further, the determination unit 15c outputs the determination result to the output unit 12 in an appropriate format. For example, the communities accommodated in each DB are displayed in a listable manner. In addition, the communication volume between DBs of each DB may be displayed.

なお、作成部１５ｂの処理は必ずしも実施されなくてもよい。その場合、決定部１５ｃは、コミュニティサブグループ間に代えて、コミュニティ間での通信量（図２（ｃ）参照）が多い順に、同一のＤＢに収容するコミュニティの組み合わせを決定する。 Note that the processing of the creation unit 15b is not necessarily performed. In that case, instead of the community subgroups, the determination unit 15c determines the combination of communities accommodated in the same DB in the descending order of the amount of communication between the communities (see FIG. 2C).

すなわち、決定部１５は、コミュニティ間での通信量が多い順に、２つのコミュニティの組み合わせを同一のＤＢに収容するコミュニティグループとして決定する。そして、上記と同様に、決定したコミュニティグループを対象に、コミュニティグループ間での通信量が多い順に、２つのコミュニティグループの組み合わせを同一のＤＢに収容するコミュニティグループとして決定する。作成部１５ｂは、この処理を繰り返し（図７参照）、分散させるＤＢの数のコミュニティグループを作成する。その場合にも、図８（ａ）に例示するように、従来と比較してＤＢ＃１からＤＢ＃２へのＤＢ間通信量およびＤＢ＃２からＤＢ＃１へのＤＢ間通信量が低減される。 That is, the determination unit 15 determines a combination of two communities as a community group accommodated in the same DB in descending order of communication amount between the communities. Similarly to the above, for the determined community group, the combination of the two community groups is determined as a community group accommodated in the same DB in the descending order of the communication amount between the community groups. The creation unit 15b repeats this process (see FIG. 7), and creates community groups as many as the number of DBs to be distributed. Even in this case, as illustrated in FIG. 8A, the inter-DB communication amount from DB # 1 to DB # 2 and the inter-DB communication amount from DB # 2 to DB # 1 are reduced as compared to the conventional case. Is done.

本実施形態では、作成部１５ｂの処理によりＤＢ内通信になりうる通信量のばらつきを低減した上で、決定部１５ｃの処理が実施されるので、図８（ｂ）に例示するように、ＤＢ間通信量のばらつきがさらに低減される。 In the present embodiment, the processing of the determination unit 15c is performed after reducing the variation in the amount of communication that can be in-DB communication by the processing of the creation unit 15b. Therefore, as illustrated in FIG. The variation in inter-communication traffic is further reduced.

［分散処理］
次に、図９を参照して、分散装置１０の分散処理について説明する。図９のフローチャートは、例えば、分散処理の開始を指示する操作入力があったタイミングで開始される。 [Distributed processing]
Next, the distribution process of the distribution apparatus 10 will be described with reference to FIG. The flowchart in FIG. 9 is started, for example, at a timing when there is an operation input instructing the start of distributed processing.

まず、取得部１５ａが、分散ＤＢシステム１の管理サーバ等から、各コミュニティのコミュニティ内でのデータの要求に伴う通信量およびコミュニティ間でのデータの要求に伴う通信量を取得する（ステップＳ１）。 First, the acquisition unit 15a acquires the communication amount associated with the data request within the community of each community and the communication amount associated with the data request between the communities from the management server of the distributed DB system 1 (step S1). .

次に、作成部１５ｂが、各コミュニティサブグループでの通信量のばらつきが低減するように、複数のコミュニティを組み合わせてコミュニティサブグループを作成する（ステップＳ２）。 Next, the creating unit 15b creates a community subgroup by combining a plurality of communities so as to reduce the variation in the amount of communication in each community subgroup (step S2).

次に、決定部１５ｃが、作成されたコミュニティサブグループ間での通信量が多い順に、同一のＤＢに収容するコミュニティの組み合わせを決定する（ステップＳ３）。また、決定部１５ｃが適当な形式で結果を出力する（ステップＳ４）。これにより、一連の分散処理が終了する。 Next, the determination part 15c determines the combination of the community accommodated in the same DB in order with much communication volume between the created community subgroups (step S3). Further, the determination unit 15c outputs the result in an appropriate format (step S4). Thereby, a series of distributed processing ends.

以上、説明したように、本実施形態の分散装置１０では、取得部１５ａが、同一のコミュニティに属するデータを同一のＤＢに収容するとともに、異なるコミュニティに属するデータを複数のＤＢに分散して収容する分散ＤＢシステム１において、各コミュニティからＤＢに対するデータの要求に伴う通信量を取得する。また、決定部１５ｃが、取得された通信量のうち、コミュニティ間での通信量が多い順に、同一のＤＢに収容するコミュニティの組み合わせを決定する。 As described above, in the distribution apparatus 10 of the present embodiment, the acquisition unit 15a stores data belonging to the same community in the same DB, and distributes data belonging to different communities to a plurality of DBs. In the distributed DB system 1, the amount of communication associated with the data request for the DB from each community is acquired. Moreover, the determination part 15c determines the combination of the community accommodated in the same DB in order with the largest communication volume between communities among the acquired communication volumes.

これにより、本実施形態の分散装置１０の分散処理によれば、ＤＢ間通信が低減されるので、ＴＡＴ遅延や処理失敗等の発生が抑制される。このように、高効率な分散ＤＢシステムを提供することができ、増設可能なＤＢ数に対する制約を抑えることができる。 Thereby, according to the distributed process of the distributed apparatus 10 of this embodiment, since communication between DB is reduced, generation | occurrence | production of a TAT delay, a process failure, etc. is suppressed. In this way, a highly efficient distributed DB system can be provided, and restrictions on the number of DBs that can be added can be suppressed.

また、作成部１５ｂが、複数のコミュニティを組み合わせてコミュニティサブグループを作成し、各コミュニティサブグルーブからの通信量の全コミュニティサブグループ内でのばらつきを低減させる。その場合、決定部１５ｃが、作成されたコミュニティサブグループ間での通信量が多い順に、同一のＤＢに収容するコミュニティの組み合わせを決定する。これにより、ＤＢ内通信になり得る通信量のばらつきが低減されるので、さらにＤＢ間通信量のばらつきが低減される。 In addition, the creation unit 15b creates a community subgroup by combining a plurality of communities, and reduces the variation of the traffic from each community subgroup within all the community subgroups. In that case, the determination unit 15c determines a combination of communities accommodated in the same DB in descending order of communication volume between the created community subgroups. As a result, the variation in the communication amount that can be the intra-DB communication is reduced, so that the variation in the inter-DB communication amount is further reduced.

［プログラム］
上記実施形態に係る分散装置１０が実行する処理をコンピュータが実行可能な言語で記述したプログラムを作成することもできる。一実施形態として、分散装置１０は、パッケージソフトウェアやオンラインソフトウェアとして上記の分散処理を実行する分散プログラムを所望のコンピュータにインストールさせることによって実装できる。例えば、上記の分散プログラムを情報処理装置に実行させることにより、情報処理装置を分散装置１０として機能させることができる。ここで言う情報処理装置には、デスクトップ型またはノート型のパーソナルコンピュータが含まれる。また、その他にも、情報処理装置にはスマートフォン、携帯電話機やＰＨＳ（Personal Handyphone System）などの移動体通信端末、さらには、ＰＤＡ（Personal Digital Assistants）などのスレート端末などがその範疇に含まれる。また、ユーザが使用する端末装置をクライアントとし、当該クライアントに上記の分散処理に関するサービスを提供するサーバ装置として実装することもできる。例えば、分散装置１０は、各コミュニティからデータを要求する通信量を入力とし、同一ＤＢに収容するコミュニティの組み合わせを出力する分散処理サービスを提供するサーバ装置として実装される。この場合、分散装置１０は、Ｗｅｂサーバとして実装することとしてもよいし、アウトソーシングによって上記の分散処理に関するサービスを提供するクラウドとして実装することとしてもかまわない。以下に、分散装置１０と同様の機能を実現する分散プログラムを実行するコンピュータの一例を説明する。 [program]
It is also possible to create a program in which processing executed by the distributed apparatus 10 according to the above embodiment is described in a language that can be executed by a computer. As an embodiment, the distribution apparatus 10 can be implemented by installing a distributed program that executes the above distributed processing as package software or online software on a desired computer. For example, the information processing apparatus can function as the distribution apparatus 10 by causing the information processing apparatus to execute the above distributed program. The information processing apparatus referred to here includes a desktop or notebook personal computer. In addition, the information processing apparatus includes mobile communication terminals such as smartphones, mobile phones and PHS (Personal Handyphone System), and slate terminals such as PDA (Personal Digital Assistants). In addition, a terminal device used by a user can be a client, and the client can be implemented as a server device that provides the client with services related to the distributed processing. For example, the distributed device 10 is implemented as a server device that provides a distributed processing service that receives a communication amount requesting data from each community and outputs a combination of communities accommodated in the same DB. In this case, the distributed apparatus 10 may be implemented as a Web server, or may be implemented as a cloud that provides services related to the above distributed processing by outsourcing. Hereinafter, an example of a computer that executes a distributed program that realizes the same function as the distributed apparatus 10 will be described.

図１０に示すように、分散プログラムを実行するコンピュータ１０００は、例えば、メモリ１０１０と、ＣＰＵ１０２０と、ハードディスクドライブインタフェース１０３０と、ディスクドライブインタフェース１０４０と、シリアルポートインタフェース１０５０と、ビデオアダプタ１０６０と、ネットワークインタフェース１０７０とを有する。これらの各部は、バス１０８０によって接続される。 As shown in FIG. 10, a computer 1000 that executes a distributed program includes, for example, a memory 1010, a CPU 1020, a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface. 1070. These units are connected by a bus 1080.

メモリ１０１０は、ＲＯＭ（Read Only Memory）１０１１およびＲＡＭ１０１２を含む。ＲＯＭ１０１１は、例えば、ＢＩＯＳ（Basic Input Output System）等のブートプログラムを記憶する。ハードディスクドライブインタフェース１０３０は、ハードディスクドライブ１０３１に接続される。ディスクドライブインタフェース１０４０は、ディスクドライブ１０４１に接続される。ディスクドライブ１０４１には、例えば、磁気ディスクや光ディスク等の着脱可能な記憶媒体が挿入される。シリアルポートインタフェース１０５０には、例えば、マウス１０５１およびキーボード１０５２が接続される。ビデオアダプタ１０６０には、例えば、ディスプレイ１０６１が接続される。 The memory 1010 includes a ROM (Read Only Memory) 1011 and a RAM 1012. The ROM 1011 stores a boot program such as BIOS (Basic Input Output System). The hard disk drive interface 1030 is connected to the hard disk drive 1031. The disk drive interface 1040 is connected to the disk drive 1041. For example, a removable storage medium such as a magnetic disk or an optical disk is inserted into the disk drive 1041. For example, a mouse 1051 and a keyboard 1052 are connected to the serial port interface 1050. For example, a display 1061 is connected to the video adapter 1060.

ここで、図１０に示すように、ハードディスクドライブ１０３１は、例えば、ＯＳ１０９１、アプリケーションプログラム１０９２、プログラムモジュール１０９３およびプログラムデータ１０９４を記憶する。上記実施形態で説明した各テーブルは、例えばハードディスクドライブ１０３１やメモリ１０１０に記憶される。 Here, as shown in FIG. 10, the hard disk drive 1031 stores, for example, an OS 1091, an application program 1092, a program module 1093, and program data 1094. Each table described in the above embodiment is stored in the hard disk drive 1031 or the memory 1010, for example.

また、分散プログラムは、例えば、コンピュータ１０００によって実行される指令が記述されたプログラムモジュール１０９３として、ハードディスクドライブ１０３１に記憶される。具体的には、上記実施形態で説明した分散装置１０が実行する各処理が記述されたプログラムモジュール１０９３が、ハードディスクドライブ１０３１に記憶される。 Further, the distributed program is stored in the hard disk drive 1031 as a program module 1093 in which a command executed by the computer 1000 is described, for example. Specifically, a program module 1093 describing each process executed by the distribution apparatus 10 described in the above embodiment is stored in the hard disk drive 1031.

また、分散プログラムによる情報処理に用いられるデータは、プログラムデータ１０９４として、例えば、ハードディスクドライブ１０３１に記憶される。そして、ＣＰＵ１０２０が、ハードディスクドライブ１０３１に記憶されたプログラムモジュール１０９３やプログラムデータ１０９４を必要に応じてＲＡＭ１０１２に読み出して、上述した各手順を実行する。 Further, data used for information processing by the distributed program is stored in the hard disk drive 1031 as the program data 1094, for example. Then, the CPU 1020 reads the program module 1093 and the program data 1094 stored in the hard disk drive 1031 to the RAM 1012 as necessary, and executes the above-described procedures.

なお、分散プログラムに係るプログラムモジュール１０９３やプログラムデータ１０９４は、ハードディスクドライブ１０３１に記憶される場合に限られず、例えば、着脱可能な記憶媒体に記憶されて、ディスクドライブ１０４１等を介してＣＰＵ１０２０によって読み出されてもよい。あるいは、分散プログラムに係るプログラムモジュール１０９３やプログラムデータ１０９４は、ＬＡＮ（Local Area Network）やＷＡＮ（Wide Area Network）等のネットワークを介して接続された他のコンピュータに記憶され、ネットワークインタフェース１０７０を介してＣＰＵ１０２０によって読み出されてもよい。 Note that the program module 1093 and the program data 1094 related to the distributed program are not limited to being stored in the hard disk drive 1031, but are stored in, for example, a removable storage medium and read by the CPU 1020 via the disk drive 1041 or the like. May be. Alternatively, the program module 1093 and the program data 1094 related to the distributed program are stored in another computer connected via a network such as a LAN (Local Area Network) or a WAN (Wide Area Network), and the network interface 1070 is used. It may be read by the CPU 1020.

以上、本発明者によってなされた発明を適用した実施形態について説明したが、本実施形態による本発明の開示の一部をなす記述および図面により本発明は限定されることはない。すなわち、本実施形態に基づいて当業者等によりなされる他の実施形態、実施例および運用技術等は全て本発明の範疇に含まれる。 As mentioned above, although embodiment which applied the invention made | formed by this inventor was described, this invention is not limited with the description and drawing which make a part of indication of this invention by this embodiment. That is, other embodiments, examples, operational techniques, and the like made by those skilled in the art based on this embodiment are all included in the scope of the present invention.

１分散データベース（ＤＢ）システム
１０分散装置
１１入力部
１２出力部
１３通信制御部
１４記憶部
１５制御部
１５ａ取得部
１５ｂ作成部
１５ｃ決定部 DESCRIPTION OF SYMBOLS 1 Distributed database (DB) system 10 Distribution apparatus 11 Input part 12 Output part 13 Communication control part 14 Storage part 15 Control part 15a Acquisition part 15b Creation part 15c Determination part

Claims

Acquire data that accompanies requests for data from each community in a database system that accommodates data belonging to the same community in the same database and distributes data belonging to different communities to multiple databases. And
A determining unit that determines a combination of communities accommodated in the same database in order of increasing communication volume between the communities among the acquired communication volumes;
A dispersion apparatus comprising:

The determining unit determines a combination of two communities as a community group in the same database in descending order of the amount of communication between the communities, and in order of increasing communication amount between the determined community groups, The distribution apparatus according to claim 1, wherein a process of determining a combination of two community groups as a community group accommodated in the same database is repeated to determine the number of community groups of the databases to be distributed.

In addition, a community subgroup is created by combining a plurality of communities, and among the acquired communication amount, a creation unit that reduces variation in the entire community subgroup of the communication amount from each community subgroup,
The said determination part determines the combination of the community accommodated in the same database in order with a large communication amount between the created said community subgroups among the acquired said communication amount. Or the dispersion apparatus of 2.

A distributed method executed by a distributed device,
Acquire data that accompanies requests for data from each community in a database system that accommodates data belonging to the same community in the same database and distributes data belonging to different communities to multiple databases. Process,
A determination step of determining a combination of communities accommodated in the same database in order of increasing communication volume between the communities among the acquired communication volumes;
A dispersion method characterized by comprising: