JP2005004474A

JP2005004474A - Method for constructing information storage system for distributed management of defined information element

Info

Publication number: JP2005004474A
Application number: JP2003167193A
Authority: JP
Inventors: Osamu Kondo; 近藤　　治; Kosuke Shono; 功祐庄野; Hiroyuki Sugino; 浩之杉野; Yuhei Teramoto; 雄平寺本
Original assignee: Digital Dream Kk
Current assignee: Digital Dream Kk
Priority date: 2003-06-11
Filing date: 2003-06-11
Publication date: 2005-01-06

Abstract

PROBLEM TO BE SOLVED: To achieve distributed storage, over a network, of defined information belonging to individuals in a ubiquitous information society, so that the information can be used efficiently and shared.

SOLUTION: In an information space belonging to an individual who is uniquely identified over the network, structured information composed of information elements defined by tag names is stored. A master that dynamically processes the information acquisition requests arising one after another on a plurality of terminals on the network is defined, so that information within the corresponding range is efficiently replaced and/or duplicated.

COPYRIGHT: (C)2005, JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、分散ネットワークの情報ストレージシステムに関する。
【０００２】
【従来の技術】
従来のネットワークでは、クライアントに分散しているデータをマスターがマージして総合データを集め、クライアントからの要求でデータを返信する方法や、クライアントからの要求があった時点で、自己のサーバー内で検索し、なければ他のサーバーに検索メッセージを送り、その結果を受け取ったサーバーがクライアントに結果を返信する方法をとっている。
【０００３】
分散情報システムに関して特開平１０−２１２６４号、特開２００１−３１２５０４号、特開２００２−３２４０５６号、特開２００２−３６６５４８などがあるが、これらは、上述のように、情報の意味的構造には無関係にデータを同期・複製するものであり、汎用的なソリューションではあるが、意味的に構造化が行われ、粒度の均質な情報単位の同期・複製について考慮されたものではない。
【０００４】
分散辞書についても、特開平０６−２５９６００号、特開平１０−１９８６８０号、特開２００１−１２５８８８号などがあり、分散情報システムと同様の同期・複製が用いられることも多いが、知識ベースの融合、共有といったレベルを実現するためには、さらに考慮が必要である。
【０００５】
また、従来、分散情報システムにおいては、明確に個人やグループによる情報の所有の概念が定義されていないため、個人的な辞書拡張やカスタマイズが要求される分散辞書との統合について検討されるケースはほとんど見られない。しかし、個人情報管理に存在する固有名詞や住所、クライアントとの折衝履歴（メール）、プロジェクトの進捗ミーティングといった情報が体系化されることにより、グループで共有可能な情報ストレージが生成されると考えられる。
【０００６】
従来のリレーショナルデータベースが持つ辞書構造は、固定的に構造化されたデータが基本となっており、データ構造の拡張や多様なデータを一元的に扱うことが困難なため、個人の自由な情報表現やグループ毎に特徴を持つ情報を融合し、統合的に扱うための能力が欠けている。
【０００７】
【発明が解決しようとする課題】
ユビキタス情報社会においては、個人の情報活動を基本として、いつでも、どこでも、どの端末からも、分散された個人の情報にアクセスできる環境が必要となる。このことは、分散データベースが、サーバー間での同期・複製を実現するのとは異なり、規模の大きなサーバーから、小型の移動型情報端末まで、それぞれの能力と利用形態に応じた情報の同期・複製技術が必要となる。
【０００８】
さらに、このような個人の情報活動を中心とした情報空間を実現するためには、どの情報端末かではなく、「だれ」が、情報端末を使っているのかを特定する必要がある。
【０００９】
個人と個人の情報交換もしくは、コミュニケーションとは、別にグループによる情報共有が有効な局面が多く存在する。これには、辞書共有のようなケースも考えられる。グループの住所録に登録された人名や住所が、あたかも、辞書が拡張されたように自動的に日本語入力システムから利用できるようなことが望ましい。
【００１０】
さらに、グループの住所録には、クライアントの仕事先の電話番号が記載されているが、個人的に親しいクライアントの場合には、携帯電話の電話番号を教えてもらっており、緊急の連絡が可能なようなケースでは、グループの住所録の差分として、個人の住所録を融合できるシステムが期待される。
【００１１】
このような個人情報とグループ情報は、自宅から、情報を修正したものが、会社から参照でき、また、大阪支社に出張中に行った変更が、リアルタイムで、東京本社の自分のチームで共有できるような分散環境での運用が要求される。
【００１２】
【課題を解決するための手段】
本発明の分散情報ストレージシステム構成方法は、ユーザーが生産する情報や知識ベースを、意味づけされた情報要素からなる構造化データとして情報ストレージに蓄積する機能と、ユーザーが情報ストレージを利用するときには、ユーザーがインターフェースから入力する内容に沿ってサービスモジュールを起動し、当該のサービスモジュールがそのサービス内容に対応する情報ストレージ内の意味づけされた情報要素の種別を決定し、分散ネットワーク系の情報ストレージの該情報要素のデータを検索、変更、複製するためのメッセージを発信する機能と、分散ネットワーク上の各ノードでそのメッセージを受信する機能と、受信した内容に従ってデータの参照や変更および同期の指令を実行するサービス機能とを有する。
【００１３】
より具体的には、意味づけされた情報要素からなる構造化データは、データの生成時にその情報が分散ストレージのどの場所にあっても一意であることを判定できるＩＤをデータへの意味づけタグに付加することによって、システム全体で、そのデータの同一性を判定する指標とする。データの変更が生じた場合には、該当する指標にマッチした分散ネットワーク上の同一データに対して同期をとる。
【００１４】
上記ストレージシステムが有する複数のサービス間では、サービスからサービスへの指令を行うとき、その指令をメッセージとして発行する機能を有する。また、サービスの種類においても、ソフトウェア内部のコアロジックを実行するサービスと、ユーザーへの表示や入力補助機能を提供するインタフェースサービスとを有する。前述のメッセージには、セマンテックスを含む構造化データを保持することによって、同じメッセージをコアロジックサービスが受け取っても、ユーザーインタフェースサービスが受け取っても、各サービスがセマンテックスに従って実行することができる。
【００１５】
さらに、本発明の意味づけされた情報要素からなる構造化データは、どの場所からアクセスしてもユーザーは自身の情報にアクセス可能とする。
【００１６】
より具体的には、ネットワーク上に使用者を一意づけるための個人認証サーバーを配置する。当該の個人認証サーバは、ユーザーがログインするときに、個人に対してグルーバルユニークな既存の識別子から一意の値として算出するＩＤを発行し、意味づけされた情報要素にＩＤを付加することによって、情報ストレージ内に、情報の所有者毎の情報保持空間を持つことを実現する。
【００１７】
さらに、本発明の構造化データに対する同期方法は、指定した名前空間によって一意に決定することができるデータの意味づけ（以下、セマンテックス）毎に、サービスが起動され、各々のサービスが構造化データの変更、参照、同期のメッセージを発行し、関係するセマンテックスのデータを選定して、分散ストレージの同期をとる。
【００１８】
より具体的には、本発明の情報ストレージシステムは、各ノードにサーバーおよびクライアント機能を有し、ある特定のセマンテックスについて、データ同期をとるためのマスターを自動的に決定する。マスターを判定する際には、同一ネットワーク内のノードに対し、各ノードの情報ストレージ内容に基づいて自動算出されるセマンテックス毎の優先順位を作成し、その優先順位をもとに、ある特定のセマンテックス毎に同一ネットワークの各ノードの中から一つのマスターが自動的に決まる。決定されるマスターは、各ノードで実行されるサービスが全体の情報ストレージを修正するものであれば、そのサービス内容を判別し、各ノードにデータ同期用のデータを配信する役割を担う。一方、スレーブ側は、配信された情報について、情報ストレージ内に関連するセマンテックスの項目がなければ、データ修正や追加などのデータ同期を行わず、スレーブ側の情報ストレージ内に関係しているデータのみ更新する機能を持つ。
【００１９】
ユーザーが移動して別のノードから当該ネットワークに接続した際には、それまでに使用中の自己情報ストレージは、移動先のノードでサービスを使用したときに、当該サービスに関するセマンテックス毎に対応したデータのみノード間で同期をとる。この方法によって、ユーザーの個人情報の全体の複製をとる必要はなく、使用するサービス機能に関する部分のデータのみが複製される。
【００２０】
さらに、個人の情報空間と複数のグループの情報空間について、情報ストレージの同一階層でリンク情報を保持することによって、グループ内のユーザーは、グループ自体のデータおよびグループ内の個人情報の公開されているデータを共有することを可能とする。
【００２１】
また、ネットワークのトラブルによって、分離したネットワーク上で同一情報に対して変更が行われた場合など、情報の更新履歴が複数に分岐するような場合に、データの矛盾を回避することを可能とする。具体的には、構造化データの情報要素に、情報の更新履歴を付加するモジュールを備え、データの更新履歴が複数に分岐した場合には、データの矛盾を検出するモジュールと、そのデータの変更履歴から自動的なデータのマージあるいはユーザーへの問い合わせを行うことで、データの不整合を解決するメンテナンスモジュールとを有する。
【００２２】
【発明実施の形態】
以下、本発明による実施の形態について、図面を参照しながら説明する。図１は分散した個々の情報ストレージ間の概略の分散ネットワークシステム構成を示す。
【００２３】
個々に意味づけされた情報要素からなる構造化された情報ストレージの表現は、たとえば、図２のように、整形式（Ｗｅｌｌ−ｆｏｒｍｅｄ）ＸＭＬデータ形式で実現される。
【００２４】
本発明のストレージシステムのシステム構成の一例を図３に示す。なお、他のノードも図４と同様の機能を備える。本システムは構造化データを蓄積する情報ストレージ４０と、各種のコアロジックサービス５０と、コアロジックのサービスから情報ストレージ４０へのアクセスをコントロールするストレージサービス６０とユーザーへの表示および入力を行うＧＵＩサービス７０と各サービス間での指令メッセージを監視し、各サービスに配信するメッセージモニター８０と、他のノードとの通信を行う通信サービス９０と、データの不整合を検出するために必要なジャーナル機能１００とを備える。
【００２５】
ユーザーがＧＵＩから入力したデータを情報ストレージ４０に蓄積するときは、ＧＵＩサービス７０が、入力時のインタフェースに対応したセマンテックスのサービス５０を指定して、データの保存を指示する。具体的には、ＧＵＩサービス７０からサービス間のメッセージを発行し、メッセージモニター８０がそのメッセージをメッセージキューに格納する。メッセージの先頭には、指令を渡すべきサービス名を記述しており、メッセージモニターは当該サービスに対してメッセージキューからメッセージを送信する。指令を受け取ったサービスは、そのデータが新しく生成されたものであれば、情報ストレージ４０への保存機能を担うストレージサービスにユニークなＩＤをセマンテックスに付加した構造化データを保存するよう指示する。指示を受け取ったストレージサービス６０はその構造化データを情報ストレージ４０へ格納する。
【００２６】
上記情報ストレージへの保存対象データがすでに存在するデータの編集のような場合には、そのデータのＩＤをもとに、他のノードに対しても通信サービス８０を介して同期指令を発行する。他のノードからの返信メッセージを受け取った通信サービスは、メッセージ内容に対応したサービス５０を起動し、ストレージサービス６０を介して情報ストレージ４０への変更を行う。このとき、自己のノードに対するデータ変更のときに、データ要素に不整合を示すものがあれば、不整合部のデータに対してデータの分岐情報として、データ要素の履歴をジャーナル機能１００によって構造化データに埋め込む。同時に、不整合検出メッセージをメッセージモニター８０へ発行し、データの自動マージを行うサービス５０あるいはユーザーへ選択を促すＧＵＩサービス７０を起動し、データの不整合を解消する機能を提供する。
【００２７】
本発明におけるシステムの利用者は、Ｗｅｂ上で公開されているＰｅｅｒＮａｍｅＳｅｒｖｉｃｅ（以下ＰＮＳ）を使って、メールアドレスから一意に決まるＰｅｅｒＩＤ（以下ＰＩＤ）を取得し、そのＰＩＤを用いてシステムはログインする個人を認識し、ログインしたノードの情報ストレージ内に、セマンテックスにＰＩＤを含むデータ形式で個人のデータを蓄積する。
【００２８】
ローカルの情報ストレージの内部には、上記のＰＮＳを利用した個人のデータを示す個人の構造化データのほかに、任意の個人の集合体として設定されるグループがシステム上で機能するサービスによって作成されて、各々のグループで個人と同じように情報を持つことが可能な複数のグループ情報としての構造化データも構成される。
【００２９】
個人およびグループの情報はＸＭＬデータにおけるタグ付けされたセマンテックスによって判別可能であり、ローカルのシステム内部で実行するサービスでは、個人もしくは個人およびグループの情報を対象とした修正・追加・参照を補助する機能が入力インターフェースを介したサービスとして提供される。
【００３０】
分散ストレージへのデータ追加や修正といった作業は、実施するサービスに関係したセマンテックス単位でアクセスされる。利用者は情報ストレージ内のどの情報にでもアクセスできるわけではなく、分散ストレージの構造化データの中から、自身の個人情報や所属するグループ情報といった、自己に関係したセマンテックスに関するデータのみを検索し、表示するサービスシステムが構築されている。したがって、情報ストレージ内の他人のデータに対するアクセスは、セマンテックスによって、区別されているため、不正にアクセスできないサービスシステムが構成されていることになる。
【００３１】
構造化データのあるセマンテックスの階層において、サービスが実行されるときに分散ストレージへのアクセスと同期の手順について説明する。
【００３２】
まず、サービスに関係するグループについて、ローカルの情報ストレージの中からそのグループを示すセマンテックスを基準としてグループに所属している個人を割り出し、そのノードのそのグループを構成している個人の数をマスター優先順位値としネットワーク上に公開する。各ノードは、一つのグループにつき一つのマスターを特定するために、マスター優先順位値の一番大きいノードをそのグループの同期用マスターとして判定する。
【００３３】
上記の判定は、分散ネットワークのすべてのノードで行われ、同一ネットワークにあるノードの中で一つのマスターが決定されるため、ネットワーク同士が分離しているときにはネットワーク毎にマスターが判定されることとなり、ネットワークが分離したときでも、情報ストレージの同期マスターは必ず存在し、データ同期は行われる。分散していたネットワークが結合したときには、結合したネットワーク内でのマスター優先順位値のもっとも大きいノードがマスターとなる。また、マスター優先順位値の同じノードが複数存在する場合には、マスター優先順位値の履歴情報および利用者のグループ利用頻度から、情報再利用性の高いノードの方が選択されるか、あるいは、起動時間の長いノードがマスターとして選択される。
【００３４】
マスターとしての役割を次に示す。あるサービスをローカルで実行するとき、まず、システムはそのサービスが分散ストレージのどの範囲を使用するか判定する。ローカル内のみの情報ストレージを使用する場合は、他のノードに対してサービスに関するメッセージは送信しない。
【００３５】
そのサービスがローカルのみでなく、ネットワーク上の情報ストレージを利用する場合には、まず、ローカル情報ストレージに対してサービスを実行し、結果を取得する。そして、関係するグループのマスターに対し、グループを示すセマンテックスとサービスの種類、およびローカル情報ストレージを利用したサービス後の結果を送信する。受信したマスター側は、マスターの情報ストレージ内の複数のグループ情報の中から、指定されたグループの構造化データに対して同じサービスを実行して結果を取得したのち、送られてきた結果との差分データを求める。マスターはスレーブからのサービス実行後の取得結果およびマスターでのサービス実行後の取得結果との差分データを、マスター内の情報ストレージへと反映して更新すると共に、分散ネットワークへ当該のデータを告知する。一方で、ネットワークから告知を見つけたノードは、ローカルの情報ストレージにそのグループの構造化データがあれば、マスターから上記のサービスの取得結果をローカルの情報ストレージに反映することで、構造化データのセマンテックスに応じたデータ複製を行う。
【００３６】
この方法によって、ローカル内の情報ストレージを利用するサービスによって構造化データが形成され、分散ネットワーク全体におよぶ情報ストレージの更新を行うときは、マスターを利用してデータ同期を行うが、サービス内容に関係ない部分の構造化データからの情報は送信データには含まれない上、スレーブ側でも自身に蓄積されている構造化データに当該のサービス内容に関係するセマンテックスがない場合にはデータ同期をとらないため、各ノードで個別の知識ベースが構成されることになる。
【００３７】
以上のセマンテックスに対応する構造化データの同期方法の一例を、図４を参照しながら説明する。図４の例では、分散ネットワークのノードは３つで、全ノードの利用者は個人ＡからＫまでの１０人であり、グループはＧｒ１、Ｇｒ２、Ｇｒ３の３つが存在している。ローカルのノードにおける情報ストレージ内には、当該ノードで利用した個人による生産活動やサービスの利用によって、個人の情報が構造化データとして追加され、各ノードの情報ストレージに図４のような個人、グループの構造化データが構成されている状態とする。
【００３８】
グループ情報の構造化データの中には、所属する個人リストが含まれており、各ノードの個人情報の配置から、それまでのログイン状況がわかるため、グループ毎のマスターの優先順位はノード内で図４のように算出されている。この優先順位値をネットワーク上に公開しているため、グループのマスターがどのノードか判明する。
【００３９】
ノード３にログインした個人ＡがグループＧｒ２に関する構造化データを操作するサービスを行ったときの処理ステップは次のとおりである。
【００４０】
ノード３でＧｒ２情報内の項目αに該当したすべてのデータβをデータγに変更する（Ｓ１０）。ネットワーク内の当該グループの優先順位値を参照し、マスターをノード１と判定する（Ｓ２０）。ノード３からノード１に対して、ステップ１のサービス内容と変更部分の内容を送信する（Ｓ３０）。マスターであるノード１は、自己の情報ストレージに対して、同じサービスを実行し、項目αに該当するすべてのデータβをデータγに変更する（Ｓ４０）。ノード１は、サービス内容と変更箇所を各ノードにブロードキャストする（Ｓ５０）。受信した各ノードは、変更箇所のすべてを受信するが、項目αに該当する内容がそのノードになければ変更は行わない（Ｓ６０）。
【００４１】
ステップＳ６０のように変更しない場合があるのは、そのノードにすべてのマスターの複製データがあるわけではなく、そのノードの使用者のグループ情報の利用範囲が異なるため、それまでに利用したセマンテックスに関する部分のみマスターノードの構造化データを複製していることに起因している。
【００４２】
当該グループ情報をほとんど利用していないノードにとっては、マスターからのデータ複製を行う必要性が小さく、このステップ５）のようにデータの複製をスレーブ側で除外することによって、情報ストレージにおける利用度の少ない領域の膨張を抑えることが可能となる。また、グループへの加入や脱退は一つのサービスであるため、マスターを固定せずに、自動的に算出されるマスター優先順位値をもとに動的にマスターを判定することがデータの再利用性を高めることにも貢献している。
【００４３】
本発明の分散情報ストレージシステムは、使用者があるノードにログインして作成した個人およびグループの情報は、当該のノードで構造化データが作成され保存されるが、利用者が移動して他のノードからログインしたときには、次のように同期をとる。利用者が作成した個人やグループの情報は、前述のＰｉｄをセマンテックスに含む構造化データとして形成しているため、利用者が情報ストレージを利用するサービスを実行した場合には、当該サービスが関係するＰｉｄを含んだセマンテックスを基にネットワーク内で問い合わせをするため、個人情報が検索されてデータ同期をとることが可能となる。
【００４４】
【発明の効果】
本発明の情報ストレージシステム構成方法によれば、いつでもどこでもどの端末からでも自分の情報とグループで共有した情報に効率的にアクセスすることができるユビキタス環境が実現される。
【００４５】
さまざまな端末から、ログオンして必要な情報にアクセスするだけで、必要な範囲の情報が、同期・複製され、ネットワークに障害が起こった場合にも、ローカルに複製された情報へのアクセスが可能であり、ネットワークが利用可能な状況になれば、再び、自動的にデータの同期が行われる。
【００４６】
このようにグループや個人といったデータの所有者の概念を明確にし、セマンテックスを考慮した構造化データの同期することによって、情報ストレージを知識ベースや辞書として利用することができる。
【００４７】
さらに個人の情報空間と複数のグループの情報空間を融合することが可能なことから、上記知識ベースや辞書を融合して共有することができる。
【図面の簡単な説明】
【図１】分散ネットワークを示した平面図
【図２】個々のノード内にある情報ストレージのデータ構造を示した図
【図３】ノード内の情報ストレージシステム構成の一例を示す概略図
【図４】マスター・スレーブによるデータ同期の手順の一例を示す情報ストレージの概略状態図
【符号の説明】
１０分散ネットワーク
２０ノード群
３０ＰＮＳ（ＰｅｅｒＮａｍｅＳｅｒｖｅｒ）
４０情報ストレージ
５０コアサービス
６０ストレージサービス
７０ＧＵＩサービス
８０メッセージモニター
９０通信サービス
１００ジャーナル機能

[0001]
[Technical field of the invention]
The present invention relates to an information storage system for a distributed network.
[0002]
[Prior art]
Conventional networks use one of two methods. In the first, a master merges the data distributed among the clients into an aggregate data set and returns data in response to client requests. In the second, when a client request arrives, a server searches its own storage and, if nothing is found, sends a search message to other servers; the server that receives the results returns them to the client.
[0003]
Regarding distributed information systems, there are JP-A-10-21264, JP-A-2001-312504, JP-A-2002-324056, JP-A-2002-366548, and others. As described above, these synchronize and replicate data without regard to the semantic structure of the information. Although they are general-purpose solutions, they do not consider the synchronization and replication of semantically structured information units of uniform granularity.
[0004]
As for distributed dictionaries, there are JP 06-259600 A, JP 10-198680 A, JP 2001-125888 A, and others. Synchronization and replication similar to those of distributed information systems are often used, but further consideration is needed to achieve the fusion and sharing of knowledge bases.
[0005]
Conventionally, distributed information systems have not clearly defined the concept of ownership of information by individuals or groups, so integration with distributed dictionaries, which require personal dictionary extension and customization, has hardly ever been considered. However, systematizing information found in personal information management, such as proper nouns and addresses, histories of negotiations with clients (e-mail), and project progress meetings, is expected to produce information storage that can be shared by a group.
[0006]
The dictionary structure of conventional relational databases is based on rigidly structured data, which makes it difficult to extend the data structure or to handle diverse data in a unified manner. Such databases therefore lack the ability to fuse individuals' free expression of information with information that is characteristic of each group and to handle them in an integrated way.
[0007]
[Problems to be solved by the invention]
In the ubiquitous information society, an environment is needed in which, with personal information activities as the basis, distributed personal information can be accessed anytime, anywhere, and from any terminal. Unlike distributed databases, which synchronize and replicate between servers, this requires synchronization and replication technology suited to the capability and usage pattern of each device, from large servers to small mobile information terminals.
[0008]
Furthermore, to realize such an information space centered on personal information activities, it is necessary to identify not which information terminal is in use but who is using it.
[0009]
Apart from information exchange or communication between individuals, there are many situations in which information sharing by a group is effective. Dictionary sharing is one such case: it is desirable that personal names and addresses registered in the group address book can be used automatically from the Japanese input system, as if the dictionary had been extended.
[0010]
Furthermore, suppose the group address book lists a client's work telephone number, but a user who is personally close to that client has been given the client's mobile phone number and can make contact in an emergency. In such a case, a system that can merge the personal address book as a difference against the group address book is desirable.
[0011]
Such personal and group information must be operable in a distributed environment in which information corrected at home can be referenced from the office, and changes made during a business trip to the Osaka branch can be shared in real time with one's own team at the Tokyo head office.
[0012]
[Means for Solving the Problems]
The distributed information storage system configuration method of the present invention has: a function of accumulating the information and knowledge base produced by a user in the information storage as structured data composed of meaningful information elements; a function by which, when the user uses the information storage, a service module is activated according to what the user inputs through the interface, the service module determines the type of meaningful information element in the information storage that corresponds to its service content, and a message for searching, changing, or replicating the data of that information element in the information storage of the distributed network is transmitted; a function of receiving the message at each node on the distributed network; and a service function that executes commands to reference, change, and synchronize data according to the received content.
[0013]
More specifically, for structured data composed of meaningful information elements, an ID that allows the information to be determined as unique wherever it resides in the distributed storage is added to the data's semantic tag when the data is generated. This ID serves system-wide as an index for determining the identity of the data. When the data is changed, the identical data on the distributed network that matches the index is synchronized.
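The ID-based identity check can be illustrated with a minimal sketch. The specification does not say how the ID is generated or where it is attached, so the use of `uuid4`, the `id` attribute name, and the helper names `new_element`, `same_data`, and `synchronize` are all assumptions made for illustration.

```python
import uuid
import xml.etree.ElementTree as ET


def new_element(tag: str, text: str) -> ET.Element:
    """Create an information element and stamp its tag with a fresh ID.

    The "id" attribute name and the use of uuid4 are illustrative assumptions.
    """
    elem = ET.Element(tag, {"id": uuid.uuid4().hex})
    elem.text = text
    return elem


def same_data(a: ET.Element, b: ET.Element) -> bool:
    """Two copies are the same data when tag and ID match, whatever their content."""
    return a.tag == b.tag and a.get("id") == b.get("id")


def synchronize(changed: ET.Element, replicas: list) -> int:
    """Copy the changed text to every replica with the same identity.

    Returns the number of replicas that were updated.
    """
    updated = 0
    for replica in replicas:
        if same_data(changed, replica):
            replica.text = changed.text
            updated += 1
    return updated


phone = new_element("phone", "03-0000-0000")
# A replica on another node carries the same tag and the same ID.
copy_on_other_node = ET.fromstring(ET.tostring(phone))
# A different element of the same type gets its own ID.
unrelated = new_element("phone", "06-0000-0000")

phone.text = "03-1111-1111"
count = synchronize(phone, [copy_on_other_node, unrelated])
```

Only the replica that shares the ID is updated; the unrelated element of the same type is left alone.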
[0014]
Among the plurality of services in the storage system, a command from one service to another is issued as a message. The services include those that execute the core logic inside the software and interface services that provide display and input assistance to the user. Because the message holds structured data that includes semantics, each service can act according to the semantics whether the same message is received by a core logic service or by a user interface service.
[0015]
Furthermore, the structured data composed of meaningful information elements of the present invention allows a user to access his or her own information from any location.
[0016]
More specifically, a personal authentication server for uniquely identifying each user is placed on the network. When a user logs in, the personal authentication server issues an ID calculated as a unique value from an existing globally unique identifier of the individual. Adding this ID to the meaningful information elements provides, within the information storage, an information holding space for each information owner.
[0017]
Furthermore, in the synchronization method for structured data of the present invention, a service is activated for each data meaning (hereinafter, "semantics") that can be uniquely determined by a specified namespace. Each service issues messages to change, reference, and synchronize structured data, selects the data of the relevant semantics, and synchronizes the distributed storage.
[0018]
More specifically, in the information storage system of the present invention, each node has both server and client functions, and a master for data synchronization is determined automatically for each specific semantics. To determine the master, a priority for each semantics, calculated automatically from the information storage contents of each node, is created for the nodes in the same network. On the basis of that priority, one master is automatically determined from among the nodes of the same network for each specific semantics. When a service executed at a node modifies the information storage as a whole, the determined master identifies the service content and distributes the data for synchronization to each node. A slave, on the other hand, does not perform data synchronization such as correction or addition if its information storage contains no item of the relevant semantics for the distributed information; it updates only the data that relates to its own information storage.
[0019]
When a user moves and connects to the network from another node, the user's own information storage used up to that point is synchronized between nodes only for the data corresponding to the semantics of a service, at the time that service is used on the destination node. With this method there is no need to replicate the user's entire personal information; only the data relevant to the service functions in use is replicated.
[0020]
Furthermore, by holding link information at the same level of the information storage for the personal information space and the information spaces of a plurality of groups, users in a group can share the data of the group itself and the data that is made public from the personal information of group members.
[0021]
In addition, data contradictions can be avoided when the update history of information branches into several lines, for example when the same information is changed on networks that have been separated by a network failure. Specifically, the system includes a module that adds an update history to the information elements of structured data, a module that detects data contradictions when the update history has branched, and a maintenance module that resolves data inconsistencies by automatically merging data from the change history or by querying the user.
[0022]
[Embodiments of the Invention]
Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 shows a schematic distributed network system configuration between distributed individual information storages.
[0023]
The representation of the structured information storage composed of individually meaningful information elements is realized in a well-formed XML data format as shown in FIG. 2, for example.
[0024]
An example of the system configuration of the storage system of the present invention is shown in FIG. 3. The other nodes also have the same functions as in FIG. 4. The system includes an information storage 40 that stores structured data; various core logic services 50; a storage service 60 that controls access from the core logic services to the information storage 40; a GUI service 70 that handles display to and input from the user; a message monitor 80 that monitors command messages between services and delivers them to each service; a communication service 90 that communicates with other nodes; and a journal function 100 needed to detect data inconsistencies.
[0025]
When the data input from the GUI by the user is stored in the information storage 40, the GUI service 70 designates the semantic service 50 corresponding to the interface at the time of input and instructs the storage of the data. Specifically, a message between services is issued from the GUI service 70, and the message monitor 80 stores the message in a message queue. The name of the service to which the command is to be passed is described at the head of the message, and the message monitor transmits the message from the message queue to the service. The service that receives the command instructs the storage service having the function of saving to the information storage 40 to store the structured data in which the unique ID is added to the semantics if the data is newly generated. Upon receipt of the instruction, the storage service 60 stores the structured data in the information storage 40.
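The queue-and-dispatch behavior of the message monitor can be sketched as follows. This is an illustration under assumptions, not the implementation described in the specification: the class name `MessageMonitor`, the tuple message layout (service name first, then the payload), and the placeholder ID value are all hypothetical.

```python
from collections import deque
from typing import Callable, Deque, Dict, Tuple

# A message names its target service first, followed by a structured payload.
Message = Tuple[str, dict]


class MessageMonitor:
    """Queues inter-service messages and delivers each to the named service."""

    def __init__(self) -> None:
        self._queue: Deque[Message] = deque()
        self._services: Dict[str, Callable[[dict], None]] = {}

    def register(self, name: str, handler: Callable[[dict], None]) -> None:
        self._services[name] = handler

    def post(self, target: str, payload: dict) -> None:
        self._queue.append((target, payload))

    def dispatch(self) -> int:
        """Deliver queued messages in order; returns how many were delivered."""
        delivered = 0
        while self._queue:
            target, payload = self._queue.popleft()
            handler = self._services.get(target)
            if handler is not None:
                handler(payload)
                delivered += 1
        return delivered


storage = []  # stands in for information storage 40
monitor = MessageMonitor()
# The storage service simply appends what it is told to save.
monitor.register("storage", storage.append)
# A core service adds an ID to new data, then asks the storage service to save it.
monitor.register(
    "address_book",
    lambda p: monitor.post("storage", {**p, "id": "hypothetical-id-1"}),
)

# The GUI service issues a message addressed to the core service.
monitor.post("address_book", {"name": "Kondo"})
delivered = monitor.dispatch()
```

Two messages are delivered here: the one from the GUI to the core service, and the follow-up from the core service to the storage service.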
[0026]
When the data to be stored in the information storage is an edit of data that already exists, a synchronization command is also issued to the other nodes via the communication service 80, based on the ID of the data. A communication service that receives a reply message from another node activates the service 50 corresponding to the message content and changes the information storage 40 via the storage service 60. If, when data on the node itself is changed, any data element shows an inconsistency, the journal function 100 embeds the history of the data element in the structured data as branch information for the inconsistent part. At the same time, an inconsistency detection message is issued to the message monitor 80, which activates either a service 50 that merges the data automatically or the GUI service 70 that prompts the user to choose, thereby resolving the data inconsistency.
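One way to detect a branched update history is to compare the revision lists kept for the same data element on two nodes. The sketch below assumes, purely for illustration, that the journal records an ordered list of revision labels per element; the specification does not describe the journal format, and the function name `compare_histories` is hypothetical.

```python
def compare_histories(local: list, remote: list) -> str:
    """Classify two update histories of the same data element.

    Each history is an ordered list of revision labels (an assumed format).
    If one history is a prefix of the other, the longer one is simply ahead.
    If they differ within their common length, the history has branched.
    """
    shorter = min(len(local), len(remote))
    if local[:shorter] != remote[:shorter]:
        return "conflict"  # branched: needs an automatic merge or a user decision
    if len(local) == len(remote):
        return "same"
    return "remote_ahead" if len(remote) > len(local) else "local_ahead"


# Node A and node B both edited revision r1 while the network was split.
status_split = compare_histories(["r1", "r2a"], ["r1", "r2b"])
# Node B simply has one more revision than node A.
status_behind = compare_histories(["r1", "r2"], ["r1", "r2", "r3"])
```

A `"conflict"` result corresponds to the case in which the inconsistency detection message would be issued; the other results can be resolved by taking the longer history.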
[0027]
A user of the system of the present invention uses the PeerNameService (hereinafter, PNS) published on the Web to obtain a PeerID (hereinafter, PID) that is uniquely determined from the user's e-mail address. The system uses the PID to recognize the individual who logs in, and accumulates the individual's data in the information storage of the node where the login took place, in a data format whose semantics include the PID.
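The specification states only that the PID is determined uniquely from the e-mail address; it does not give the calculation. As a sketch under that gap, the example below assumes a SHA-256 digest of the normalized address. The hash choice, the normalization (trimming and lowercasing), and the function name `peer_id` are assumptions, and a hash is unique only up to collisions.

```python
import hashlib


def peer_id(email: str) -> str:
    """Derive a deterministic ID from an e-mail address.

    Assumption: SHA-256 over the trimmed, lowercased address. The real PNS
    calculation is not described in the specification.
    """
    normalized = email.strip().lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


# The same person gets the same PID from any node.
pid_home = peer_id("user@example.com")
pid_office = peer_id("  User@Example.com ")
pid_other = peer_id("someone.else@example.com")
```

Because the ID depends only on the address, any node can recompute it at login without contacting the node where the data was first created.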
[0028]
In addition to the personal structured data that represents an individual's data using the above PNS, the local information storage also contains structured data for a plurality of groups. A group is defined as an arbitrary set of individuals and is created by a service running on the system; each group can hold information in the same way as an individual.
[0029]
Personal and group information can be distinguished by the tagged semantics in the XML data. Services executed inside the local system provide, through the input interface, functions that assist in modifying, adding, and referencing information for an individual alone or for an individual and groups.
[0030]
Operations such as adding or correcting data in the distributed storage are performed in units of the semantics related to the service being executed. A user cannot access just any information in the information storage; the service system is built to search for and display, from the structured data of the distributed storage, only data whose semantics relate to the user, such as the user's own personal information and the information of groups to which the user belongs. Because access to other people's data in the information storage is separated by semantics, the service system is configured so that such data cannot be accessed improperly.
[0031]
The procedure for accessing and synchronizing the distributed storage when a service is executed at a given semantic level of the structured data is described below.
[0032]
First, for the group related to the service, the individuals belonging to the group are identified from the local information storage using the semantics that indicate the group, and the number of individuals making up that group on that node is published on the network as the master priority value. To identify one master per group, each node determines the node with the largest master priority value to be the synchronization master for that group.
[0033]
This determination is made at every node of the distributed network, and one master is determined among the nodes in the same network. When networks are separated from each other, a master is therefore determined for each network, so even then a synchronization master for the information storage always exists and data synchronization is carried out. When separated networks are rejoined, the node with the largest master priority value in the joined network becomes the master. If several nodes have the same master priority value, either the node with higher information reusability is selected, judged from the history of master priority values and the user's frequency of group use, or the node that has been running longest is selected as the master.
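The master determination can be sketched as a maximum over the published priority values. The node data below is hypothetical and does not reproduce FIG. 4; the names `NodeInfo`, `priority`, and `elect_master` are illustrative, and of the two tie-breaking options in the text only the running-time rule is implemented.

```python
from dataclasses import dataclass


@dataclass
class NodeInfo:
    name: str
    group_members: dict  # group name -> set of individuals held on this node
    uptime_seconds: int


def priority(node: NodeInfo, group: str) -> int:
    """Master priority value: number of the group's individuals on this node."""
    return len(node.group_members.get(group, ()))


def elect_master(nodes: list, group: str) -> NodeInfo:
    """Pick the node with the largest priority value; break ties by uptime."""
    return max(nodes, key=lambda n: (priority(n, group), n.uptime_seconds))


# Hypothetical network of three nodes.
nodes = [
    NodeInfo("node1", {"Gr1": {"A"}, "Gr2": {"A", "B", "C"}}, 500),
    NodeInfo("node2", {"Gr1": {"E", "F"}, "Gr2": {"A"}}, 900),
    NodeInfo("node3", {"Gr2": {"B", "C", "D"}}, 800),
]

master_gr1 = elect_master(nodes, "Gr1").name
master_gr2 = elect_master(nodes, "Gr2").name
# If node3 is cut off, the remaining partition still elects its own master.
master_gr2_partition = elect_master(nodes[:2], "Gr2").name
```

Every node can run the same function on the same published values and reach the same result, which is why no separate coordination step is needed.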
[0034]
The role as a master is as follows. When running a service locally, the system first determines what range of distributed storage the service uses. When using information storage only in the local area, messages regarding services are not transmitted to other nodes.
[0035]
When the service uses not only local information storage but also information storage on the network, the service is first executed on the local information storage and the result is obtained. Then the semantics indicating the group, the type of service, and the result of the service on the local information storage are sent to the master of the relevant group. The master that receives them executes the same service on the structured data of the specified group, selected from the group information in its own information storage, obtains the result, and computes the difference from the result that was sent. The master applies the difference between the slave's result and its own result to its information storage and announces the data to the distributed network. A node that sees the announcement on the network, if its local information storage holds structured data for that group, applies the service result from the master to its local information storage, so that data is replicated according to the semantics of the structured data.
[0036]
With this method, structured data is formed by services that use the local information storage, and the master is used for data synchronization when the information storage is updated across the whole distributed network. Information from parts of the structured data unrelated to the service content is not included in the transmitted data, and a slave does not synchronize if the structured data it holds contains no semantics related to the service content. As a result, each node builds up its own individual knowledge base.
[0037]
An example of the method of synchronizing structured data according to semantics described above is explained with reference to FIG. 4. In the example of FIG. 4, the distributed network has three nodes, the users of all nodes are the ten individuals A through K, and there are three groups, Gr1, Gr2, and Gr3. In the information storage of each local node, personal information has been added as structured data through the production activities and service use of the individuals who used that node, and the personal and group structured data in the information storage of each node is assumed to be as shown in FIG. 4.
[0038]
The structured data of the group information contains a list of the individuals who belong to the group, and the placement of personal information on each node reveals the login history up to that point, so the master priority for each group is calculated within each node as shown in FIG. 4. Because these priority values are published on the network, it is known which node is the master of each group.
[0039]
The processing steps when the individual A who logs into the node 3 performs a service for operating the structured data related to the group Gr2 are as follows.
[0040]
At node 3, all data β under item α in the Gr2 information is changed to data γ (S10). The priority values for the group on the network are consulted and node 1 is determined to be the master (S20). Node 3 sends the service content of step 1 and the changed portion to node 1 (S30). Node 1, as the master, executes the same service on its own information storage and changes all data β under item α to data γ (S40). Node 1 broadcasts the service content and the changed portion to every node (S50). Each receiving node receives all of the changes, but makes no change if it holds no content corresponding to item α (S60).
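Steps S10 through S60 can be traced with a small simulation. The data layout (each node's Gr2 information as a dictionary from item name to a list of values) and the function names `replace_values` and `run_service` are assumptions for illustration; the master is passed in directly rather than elected, standing in for S20.

```python
def replace_values(store: dict, item: str, old: str, new: str) -> int:
    """Replace every `old` value under `item`; returns the number replaced."""
    values = store.get(item)
    if values is None:
        return 0
    changed = 0
    for i, value in enumerate(values):
        if value == old:
            values[i] = new
            changed += 1
    return changed


def run_service(nodes: dict, origin: str, master: str,
                item: str, old: str, new: str) -> list:
    """Run the replace service from `origin`; returns nodes that skipped it."""
    replace_values(nodes[origin], item, old, new)        # S10: local change
    # S20: the master is assumed to be already determined from priority values.
    if master != origin:
        replace_values(nodes[master], item, old, new)    # S30/S40: master repeats
    skipped = []
    for name, store in nodes.items():                    # S50: broadcast
        if name in (origin, master):
            continue
        if item in store:                                # S60: apply only if held
            replace_values(store, item, old, new)
        else:
            skipped.append(name)
    return skipped


# Hypothetical Gr2 information held on four nodes.
gr2 = {
    "node1": {"alpha": ["beta", "delta", "beta"]},
    "node2": {"other": ["beta"]},
    "node3": {"alpha": ["beta"]},
    "node4": {"alpha": ["beta", "beta"]},
}
skipped = run_service(gr2, origin="node3", master="node1",
                      item="alpha", old="beta", new="gamma")
```

Node 2 holds nothing under item α, so it receives the broadcast but leaves its storage untouched, while node 4 applies the change.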
[0041]
A change may be skipped as in step S60 because a node does not hold a replica of all of the master's data. The users of each node use the group information to different extents, so a node has replicated only those parts of the master node's structured data whose semantics it has used so far.
[0042]
For a node that hardly uses the group information, there is little need to replicate data from the master. Excluding replication on the slave side, as in step 5), suppresses the growth of rarely used areas of the information storage. In addition, because joining and leaving a group is itself a service, determining the master dynamically from the automatically calculated master priority value, rather than fixing the master, also helps improve the reusability of data.
[0043]
In the distributed information storage system of the present invention, the personal and group information that a user creates while logged in to a node is created and stored as structured data on that node. When the user moves and logs in from another node, synchronization is performed as follows. The personal and group information created by the user is formed as structured data whose semantics include the aforementioned Pid. When the user executes a service that uses the information storage, a query is issued within the network based on the semantics, including the Pid, to which the service relates, so the personal information is found and the data can be synchronized.
[0044]
[Effects of the Invention]
According to the information storage system configuration method of the present invention, a ubiquitous environment is realized in which users can efficiently access their own information and information shared with their groups anytime, anywhere, and from any terminal.
[0045]
Simply by logging on from various terminals and accessing the required information, the necessary range of information is synchronized and replicated. Even if a network failure occurs, the locally replicated information remains accessible, and once the network becomes available again, data synchronization resumes automatically.
[0046]
Thus, by clarifying the concept of data owners such as groups and individuals and synchronizing structured data in consideration of semantics, information storage can be used as a knowledge base or dictionary.
[0047]
Furthermore, since it is possible to fuse an individual information space and a plurality of groups of information spaces, the knowledge base and the dictionary can be fused and shared.
[Brief description of the drawings]
FIG. 1 is a plan view showing a distributed network.
FIG. 2 is a diagram showing the data structure of the information storage in each node.
FIG. 3 is a schematic diagram showing an example of an information storage system configuration in a node.
FIG. 4 is a schematic state diagram of the information storage showing an example of the data synchronization procedure by master and slave.
[Explanation of symbols]
10 Distributed network
20 Node group
30 PNS (PeerNameServer)
40 Information storage
50 Core service
60 Storage service
70 GUI service
80 Message monitor
90 Communication service
100 Journal function

Claims

A method for configuring a distributed information storage system, characterized in that, in an information storage system for a distributed network that has a storage structure capable of holding an independent information space for each information owner and that manages services and devices usable for information processing: a system-wide unique identifier is added to each piece of structured data generated by users and services, the structured data including ID tags that indicate the types of individually meaningful information elements, so that all data bearing the same identifier is recognized as the same information; data is thereby replicated at the locations on the distributed network where the user uses the relevant information; and, when data is manipulated by the user or by execution of a service, data synchronization is performed across the entire network for all data recognized as the same information, regardless of the user's access location.

The method for configuring a distributed information storage system according to claim 1, characterized by comprising a messaging system that integrates, within the independent information space of the information owner, messaging between software and messages addressed to the information owner, both realized by exchanging structured data consisting of individually defined information elements.

The method for configuring a distributed information storage system according to claim 1, wherein a personal authentication system for uniquely identifying a user on the network is provided; the structured data of the information storage includes an ID calculated as a unique value from an existing globally unique identifier of the individual; the data is stored in a distributed manner on each terminal of the user who created it; and, when the user moves to another terminal and connects to the network, only the data in the structured storage that corresponds to the same kind of semantic hierarchy used by the user is synchronized between the terminals.

The method for configuring a distributed information storage system according to claim 1, 2 or 3, wherein each terminal or program has a network connection function, a function of sending a request to another terminal or program and receiving and processing the reply, and a server function of waiting for request messages from other terminals or programs and, when a request arrives, performing the corresponding reception processing, reply, and internal processing, these functions being executable asynchronously and in parallel even when the terminal is not connected to the network; individual information is stored in a data structure that includes both semantics, which indicate the type and affiliation of the data, and data values; and, in a distributed network system having a function of synchronizing such information between individual terminals and programs: information created by the various functions and operations implemented in individually executed programs is stored locally; when the information storage is to be reused at the timing required by a function or operation executed in a program, a service message for each semantics is sent over the network according to that function or operation; from among the information storages within the connected range of the network, a master for the semantics is determined automatically according to the state of each information storage; the master sends a copy of the required data to each terminal as a reply to the service message; and each terminal stores the received information in its local data as reusable information storage and uses the local data in the program.

The method for configuring a distributed information storage system according to claim 4, wherein, as the method of selecting the master according to semantics: a node holding a larger amount of data related to the same kind of semantics in the information storage of each node in the distributed network is dynamically given a higher master priority value; when the software on an individual terminal is stopped and the terminal is isolated from the network, or when the network is divided into several parts by a network failure or disconnection, the node with the highest master priority within each closed network becomes the master; when multiple nodes have the same master priority value, the node with the longer uptime or the node with the higher frequency of access to the data is determined to be the master; the master sends data synchronization messages to each node within its network; and each node acquires, from the received message, the data of the semantics it requires and reflects that data in its local information storage.
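By way of a non-limiting illustration, the master selection recited above might be sketched as follows. The names `NodeState`, `master_priority`, and `elect_master` are hypothetical. The claim allows either uptime or access frequency as the tie-breaker; applying uptime first and access frequency second is an assumption made only for this sketch.

```python
from dataclasses import dataclass, field


@dataclass
class NodeState:
    name: str
    # Hypothetical bookkeeping: semantics tag -> number of records held.
    records: dict = field(default_factory=dict)
    uptime_s: float = 0.0
    access_count: int = 0


def master_priority(node, semantics):
    """More data for the given semantics means a higher master priority value."""
    return node.records.get(semantics, 0)


def elect_master(partition, semantics):
    """Pick the master among the nodes reachable within one closed network.
    Ties on priority are broken by uptime, then by access frequency
    (this ordering is an assumption of the sketch)."""
    return max(
        partition,
        key=lambda n: (master_priority(n, semantics), n.uptime_s, n.access_count),
    )


# Illustrative values only.
a = NodeState("a", {"address": 40}, uptime_s=3600, access_count=5)
b = NodeState("b", {"address": 40}, uptime_s=7200, access_count=2)
c = NodeState("c", {"address": 10}, uptime_s=9000, access_count=9)

whole = elect_master([a, b, c], "address")   # a and b tie on priority; b has run longer
left = elect_master([a, c], "address")       # after a network split
right = elect_master([b], "address")
```

Because the election is evaluated per closed network, each partition obtains its own master after a split without any fixed master assignment.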

The method for configuring a distributed information storage system according to claim 1, 2, 3, 4 or 5, comprising a journal function that, when a service corresponding to the semantics is executed during the synchronization required when a divided network is rejoined into a wider network and an inconsistency of information is detected in the structured data, holds the branched update history in the structured data.

The method for configuring a distributed information storage system according to claim 6, wherein, for structured data having an update history branched by the journal function, the service corresponding to the semantics of the data in which the inconsistency was detected is started, and the data inconsistency is eliminated by automatically merging the change histories or by querying the user.
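As a non-limiting illustration of eliminating such an inconsistency, the following sketch performs a three-way merge of two branches of one structured record. It simplifies the journal to three snapshots (the common ancestor and the head of each branch) rather than a full update history; the function `merge_branches` and the callback `ask_user` are hypothetical names, and a value of `None` is used here to mean that a field is absent.

```python
def merge_branches(base, ours, theirs, ask_user):
    """Merge two branched versions of one structured record.

    base, ours, theirs: dict mapping field name -> value.
    ask_user(key, our_value, their_value) is called only when both
    branches changed the same field in different ways.
    """
    merged = dict(base)
    for key in sorted(set(base) | set(ours) | set(theirs)):
        b, o, t = base.get(key), ours.get(key), theirs.get(key)
        if o == t:
            value = o                    # both branches agree
        elif o == b:
            value = t                    # only the other branch changed it
        elif t == b:
            value = o                    # only our branch changed it
        else:
            value = ask_user(key, o, t)  # true conflict: query the user
        if value is None:
            merged.pop(key, None)        # field removed or absent
        else:
            merged[key] = value
    return merged


# Illustrative data only.
base = {"title": "Kickoff", "room": "A"}
ours = {"title": "Kickoff", "room": "B"}
theirs = {"title": "Kickoff meeting", "room": "C"}

asked = []

def prefer_ours(key, our_value, their_value):
    asked.append(key)
    return our_value

merged = merge_branches(base, ours, theirs, prefer_ours)
```

In this example the title is merged automatically because only one branch changed it, while the room, changed differently on both branches, is referred to the user callback.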

The method for configuring a distributed information storage system according to any one of claims 1 to 7, wherein a personal information space with clear ownership is linked with the information spaces of the multiple groups in which the individual participates, each group information space having multiple information owners, thereby realizing seamless linkage between the distributed information storages and an information space fused in a hierarchical manner.