JP4137493B2

JP4137493B2 - Data management apparatus, program, and recording medium

Info

Publication number: JP4137493B2
Application number: JP2002109979A
Authority: JP
Inventors: 俊和大和田
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2002-04-12
Filing date: 2002-04-12
Publication date: 2008-08-20
Anticipated expiration: 2022-04-12
Also published as: JP2003303122A

Description

【０００１】
【発明の属する技術分野】
この発明は、階層構造を持つノードによって構成され、各ノードが固有の識別情報とその親ノードの識別情報とを有する形式のデータを管理するデータ管理装置、コンピュータをこのようなデータ管理装置として機能させるためのプログラム及び、このようなプログラムを記録した記録媒体に関する。
【０００２】
【従来の技術】
従来から、ファイル管理装置や文書管理装置等のデータ管理装置（あるいはシステム）において、データに階層構造を持たせて管理することが行われている。そして、このような階層構造は、各階層にデータの配置単位としてノードを設け、各ノードに固有のＩＤ番号等の識別情報とその親ノードの識別情報とを持たせることにより、ノード自身のデータ量をさほど増加させることなく表現できる。なお、あるノードの親ノードとは、そのノードの直上位に位置するノードのことである。逆に、あるノードの直下位に位置するノードをそのノードの子ノード、子ノードの直下位に位置するノードを孫ノード等と呼ぶ。そして、あるノードより下位にあり、かつそのノードから上位ノードを経由することなく辿れるノードを、そのノードの子孫ノードと呼ぶ。
【０００３】
このような階層構造のデータを管理する場合、各ノードについてあらかじめそのノードの子孫ノードを調べ、親ノードとその子孫ノードとの対応データを階層情報として記憶しておくことが考えられる。このようにすることにより、ある特定のノードの子孫ノードすべてを取得する、といったような操作の高速化を図ることができる。
【０００４】
ところで、このような方式を採用した場合、通常は実際の階層構造とこの階層情報との間に不整合が生じることはないが、予期しないトラブル等によって不整合が生じてしまうことも考えられる。そして、不整合が生じてしまうとその部分についてはデータへのアクセスが正常に行えなくなる場合があるので、この不整合が生じていないかどうかを必要に応じて検査し、生じていた場合にはその修復を行わなければならない。
このような不整合の検出や修復に関連する技術としては、例えば特開平１１−１２００５８号公報に開示されているファイル管理装置や、特開２０００−２８４９９５号公報に開示されているデータ処理装置が挙げられる。ただし、これらの装置は、ファイルシステムの復旧に関するものであり、階層構造と階層情報との不整合を高速に検出するものではない。
【０００５】
【発明が解決しようとする課題】
階層構造と階層情報との不整合を検出する方式としては、例えば、実際の階層構造を辿って検出を行うノードの子孫ノードの情報を取得すると共に、記憶してある階層情報からも同じノードの子孫ノードの情報を取得し、取得したこれらの子孫ノードの情報をそれぞれ配列して、その配列間に差があるか否かを調べる方式が考えられる。
しかし、この方式において、配列間の差の有無を調べる処理、さらには配列の一方にあって一方にない要素を特定する処理は、単純に配列の要素をスキャンしたのでは時間がかかるという問題があった。
この発明は、このような問題を解決し、階層構造を持つノードによって構成され、各ノードが固有の識別情報とその親ノードの識別情報とを有する形式のデータを管理する場合において、実際の階層構造と記憶してある階層情報との整合性を高速に検査できるようにすることを目的とする。
【０００６】
【課題を解決するための手段】
上記の目的を達成するため、この発明は、階層構造を持つノードによって構成される形式のデータを管理するデータ管理装置において、各ノードについて固有の整数値からなる識別情報と、そのノードの直上位の親ノードの上記識別情報とをノード情報として記憶する手段と、上記各ノードについて、そのノードよりも下位であり且つそのノードの上位ノードを経由せずに辿れるノードである子孫ノードの各々の上記識別情報を、そのノードの上記識別情報と１対１で対応づけてノード階層情報として記憶する手段と、上記ノード情報をもとに、任意のノードの子孫ノードの上記識別情報を全て収集した第１の子孫ノードデータを作成する第１の子孫ノードデータ作成手段と、上記ノード階層情報をもとに、上記任意のノードの子孫ノードの上記識別情報を全て収集した第２の子孫ノードデータを作成する第２の子孫ノードデータ作成手段と、上記第１の子孫ノードデータ作成手段によって作成された第１の子孫ノードデータを、その第１の子孫ノードデータに含まれる子孫ノードの上記識別情報に従い、その識別情報の値に対応する位置のビットを立てたビットマップ形式のデータに変換する第１の変換手段と、上記第２の子孫ノードデータ作成手段によって作成された第２の子孫ノードデータを、その第２の子孫ノードデータに含まれる子孫ノードの上記識別情報に従い、その識別情報の値に対応する位置のビットを立てたビットマップ形式のデータに変換する第２の変換手段と、上記第１の変換手段によって変換されたビットマップ形式の第１の子孫ノードデータと上記第２の変換手段によって変換されたビットマップ形式の第２の子孫ノードデータとの排他的論理和を求めることにより、上記第１の子孫ノードデータと上記第２の子孫ノードデータとを比較する比較手段とを有することを特徴とする。
【０００７】
このようなデータ管理装置において、上記比較手段による比較が不一致だった場合に、その比較結果に基づいて上記ノード階層情報を修正する修正手段をさらに設けるとよい。
【０００８】
また、この発明のプログラムは、コンピュータを、階層構造を持つノードによって構成される形式のデータを管理するデータ管理手段と、各ノードについて固有の整数値からなる識別情報と、そのノードの直上位の親ノードの上記識別情報とをノード情報として記憶する手段と、上記各ノードについて、そのノードよりも下位であり且つそのノードの上位ノードを経由せずに辿れるノードである子孫ノードの各々の上記識別情報を、そのノードの上記識別情報と１対１で対応づけてノード階層情報として記憶する手段と、上記ノード情報をもとに、任意のノードの子孫ノードの上記識別情報を全て収集した第１の子孫ノードデータを作成する第１の子孫ノードデータ作成手段と、上記ノード階層情報をもとに、上記任意のノードの子孫ノードの上記識別情報を全て収集した第２の子孫ノードデータを作成する第２の子孫ノードデータ作成手段と、上記第１の子孫ノードデータ作成手段によって作成された第１の子孫ノードデータを、その第１の子孫ノードデータに含まれる子孫ノードの上記識別情報に従い、その識別情報の値に対応する位置のビットを立てたビットマップ形式のデータに変換する第１の変換手段と、上記第２の子孫ノードデータ作成手段によって作成された第２の子孫ノードデータを、その第２の子孫ノードデータに含まれる子孫ノードの上記識別情報に従い、その識別情報の値に対応する位置のビットを立てたビットマップ形式のデータに変換する第２の変換手段と、上記第１の変換手段によって変換されたビットマップ形式の第１の子孫ノードデータと上記第２の変換手段によって変換されたビットマップ形式の第２の子孫ノードデータとの排他的論理和を求めることにより、上記第１の子孫ノードデータと上記第２の子孫ノードデータとを比較する比較手段として機能させるためのものである。
このようなプログラムにおいて、コンピュータを、上記比較手段による比較が不一致だった場合に、その比較結果に基づいて上記ノード階層情報を修正する修正手段として機能させるためのプログラムをさらに含むようにするとよい。
この発明の記録媒体は、上記の各プログラムを記録したコンピュータ読み取り可能な記録媒体である。
【０００９】
【発明の実施の形態】
以下、この発明の好ましい実施の形態を図面を参照して説明する。
〔第１の実施形態：図１乃至図１０〕
初めに、この発明のデータ管理装置の第１の実施形態である文書管理サーバについて説明する。まず、その概略構成について、図１乃至図６を用いて説明する。図１はその文書管理サーバにおけるこの発明の特徴部分の機能構成を示す機能ブロック図、図２はその文書管理サーバの概略構成を示すブロック図、図３はその文書管理サーバにおけるデータの管理方式について説明するための図、図４は同じく子孫ノードデータの作成について説明するための図、図５は同じく子孫ノードデータの比較について説明するための図、図６は同じく子孫ノードデータのビットマップ形式への変換について説明するための図である。
【００１０】
この文書管理サーバ１０は、ハードウェア構成としては公知のサーバ装置を用いて構成することができる。すなわち、文書管理サーバ１０は、図２に示すように、ＣＰＵ１１，ＲＯＭ１２，ＲＡＭ１３，ハードディスクドライブ（ＨＤＤ）１４，ネットワークインタフェース（Ｉ／Ｆ）１５，データ蓄積手段１６を備え、これらがシステムバス１７によって接続されている。
ＣＰＵ１１は、この文書管理サーバ１０全体を制御する制御部であり、ＲＯＭ１２やＨＤＤ１４に格納された各種制御プログラムやアプリケーションプログラムを実行して装置の制御、クライアント装置２０によるデータへのアクセスの制御、データの管理等の動作を行う。
【００１１】
ＲＯＭ１２は、主に装置の制御プログラムを記憶する記憶手段である。そして、ＲＡＭ１３はＣＰＵ１１のワークメモリや一時的なデータの記憶に用いる記憶手段である。
ＨＤＤ１４は、各種アプリケーションプログラムやデータを記憶する記憶手段である。
ネットワークＩ／Ｆ１５は、文書管理サーバ１０をインターネットやローカルエリアネットワーク（ＬＡＮ）等のネットワークに接続するためのインタフェースであり、文書管理サーバ１０はこのネットワークＩ／Ｆ１５を介してパーソナルコンピュータ（ＰＣ）等のクライアント装置２０とのデータの授受を行う。
【００１２】
データ蓄積手段１６は、文書データ等の大量のデータを蓄積・記憶することのできる不揮発性記憶手段であるが、データ蓄積手段１６については、ＨＤＤ１４がこの役割を兼ねるようにしてもよい。
なお、図２には操作部や表示部を示していないが、キーボードやマウス等による操作部や、液晶やブラウン管（ＣＲＴ）による表示部を設け、ユーザからの入力の受付や動作結果の表示を行うことができるようにしてもよい。
【００１３】
このような文書管理サーバ１０は、データ蓄積手段１６に蓄積した文書データの管理を行い、クライアント装置２０からの要求に応じて管理下の文書データを送信する装置である。そして、この文書管理サーバ１０においては、管理対象である文書データは、図３（ａ）に示すように階層構造を持つノードの集合として、データ蓄積手段に記憶されている。ここで、ノードとは、データにアクセスする際の所定の単位であり、一般的なファイルシステムにおいてはファイルやフォルダに相当する単位であるが、これより細かい単位をノードとして扱うようにしてもよい。また、この管理動作は、ＲＯＭ１２やＨＤＤ１４に記憶された所要のデータ管理プログラムを実行してデータ管理手段として機能するＣＰＵ１１が行う。
【００１４】
データ蓄積手段１６には、これらのノードの階層構造を管理するためのデータとして、ノードデータとノード階層データとがそれぞれテーブルの形で記憶されている。
ノードデータはノードデータであり、図３（ｂ）に示すように、ノード毎に、固有の整数値からなる識別情報であるノードＩＤと、ノードの名称と、そのノードの直上位に位置する親ノードのノードＩＤとで構成されている。この文書管理サーバ１０は、このノードデータを参照することによって各ノードの配置を把握することができ、このノードデータはノードの階層構造を定義するための基本的なデータである。
【００１５】
一方、ノード階層データは、ノード階層情報であり、図３（ｃ）に示すように、ノードの親子関係に整数値のデータＩＤを付して記憶させたものである。そして、ノードの親子関係は、親ノードのノードＩＤと子ノードのノードＩＤの組み合わせとして記憶される。また、親ノードの直下位の子ノードだけでなく、さらに下位の孫ノード、ひ孫ノード等、親ノードより下位にあり、かつそのノードから上位ノードを経由することなく辿れる子孫ノードは、全てその親ノードの子ノードであるとして親子関係を記憶させている。
このようなノード階層データは、ノードの親子関係及び子孫関係の情報を、ノードデータに含まれる各ノードの親ノードの識別情報を参照して収集することによって作成できる。そして、このようなデータを予め作成して記憶しておくことにより、あるノードの下位に位置する全てのノードを抽出する処理を高速に行うことができるようになり、範囲を限定したノードの検索や、ノード階層構造の組換え等の処理を高速に行うことができる。
【００１６】
これらのノードデータとノード階層データは、通常は整合性が保たれるように管理されているが、予期しないトラブル等によって不整合が生じてしまうことがある。このような場合にはノードの階層構造が正しく認識できなくなり、文書管理動作に支障を来すことになるので、この不整合が生じていないかどうかを必要に応じて検査しなければならない。この検査を高速に行うことができるようにした点がこの発明の特徴である。
【００１７】
この高速な検査の処理は、ＣＰＵ１１にＲＯＭ１２あるいはＨＤＤ１４等に記憶している所要のプログラムを実行させ、図１に示した各部の機能を果たさせることによって実現している。ここで、その機能について説明する。
まず、整合性検査要求受付部１０１は、所定の時間毎、所定のイベントが生じる毎、あるいはユーザの指示等により整合性検査処理が要求された場合、その要求を受け付ける。そして、ノード抽出部１０２にその旨を伝達する機能を有する。
【００１８】
ノード抽出部１０２は、整合性検査要求受付部１０１から検査処理要求が伝達された場合、ノードデータテーブル１１１からノードＩＤを順次抽出し、検査対象ノードとして階層情報再作成部１０３及び階層情報抽出部１０４に伝達する機能を有する。
階層情報再作成部１０３は、検査対象ノードの子孫ノードのノードＩＤをノードデータテーブル１１１を参照して全て収集し、図４（ａ）に示すように配列の形式にまとめて検査対象ノードの子孫ノードを示す第１の子孫ノードデータを作成する機能を有する。この収集は例えば、まず対象ノードを親ノードとするノードを全て抽出し、次に抽出したノードを親ノードとするノードを全て抽出し・・・、という工程を、抽出できるノードがなくなるまで行うことによって実現できる。
【００１９】
階層情報抽出部１０４は、検査対象ノードの子孫ノードのノードＩＤを、ノード階層データテーブル１１２を参照して全て収集し、図４（ｂ）に示すように配列の形式にまとめて検査対象ノードの子孫ノードを示す第２の子孫ノードデータを作成する機能を有する。この収集は例えば、ノード階層データテーブル１１２において対象ノードを親ノードとする親子関係（子孫関係）を全て抽出することによって実現できる。
ビットマップ化処理部１０５は、階層情報再作成部１０３の作成した第１の子孫ノードデータおよび階層情報抽出部１０４の作成した第２の子孫ノードデータを、それぞれ配列形式のデータからビットマップ形式のデータに変換する機能を有する。ここで、ビットマップ形式のデータとは、所定の個数の０又は１の値を持つデータ（ビット）によって構成されるデータのことである。そして、ビットマップ形式への変換は、例えば図６に示すように、ノードＩＤの最大値の数だけ０のビットを持つビットマップを用意し、配列に含まれるノードＩＤ番目（図の例では２番目，３番目，４番目）のビットに１をセットすることによって行うことができる。なお、ここでは、図で左側のビットから順に、１番目，２番目，・・・としている。
【００２０】
ビットマップ比較部１０６は、ビットマップ化処理部１０５によって変換されたビットマップ形式の第１の子孫ノードデータとビットマップ形式の第２の子孫ノードデータとを比較する機能を有する。この比較は、２つのビットマップについての排他的論理和を求めることによって行うことができる。
排他的論理和は、対応するビットが同じであれば０、異なれば１になるので、図４及び図６に示したようにノードデータとノード階層データに不整合がない場合には、両者から得た子孫ノードデータは一致し、表１（ａ）に示すように排他的論理和は全てのビットで０となる。一方、不整合が起きている場合には、両者から得た子孫ノードデータには異なる部分があるので、表１（ｂ）及び（ｃ）に示すように、その部分のビットで排他的論理和が１となる。
【００２１】
【表１】

【００２２】
結果表示・記憶部１０７は、ビットマップ比較部１０６による比較結果を表示手段に表示させたり記憶手段に記憶させたりする機能を有する。すなわち、例えば不整合があった場合にその旨や箇所を知らせるメッセージを表示させたり、ログにその旨を記録したりする。不整合がなかった場合でも、その旨のメッセージを表示させるようにしてもよい。
以上の各部により、ノード抽出部１０２が抽出した全てのノードについて第１の子孫ノードデータと第２の子孫ノードデータとを比較することにより、ノード階層構造全体について、ノードデータとノード階層データとの間で不整合が生じていないか否かを検査することができる。
【００２３】
ここで、この検査処理を高速で行うために、第１の子孫ノードデータと第２の子孫ノードデータとの比較処理を高速で行うことが重要となる。
単純にこれらを配列形式のデータのままで比較することを考えてみると、例えば図５に示すように、第１の子孫ノードデータの配列に含まれる要素の１つ１つについて、第２の子孫ノードデータの配列の要素と順次比較（スキャン）し、同じ要素がある（ヒットする）か否かを確認しなければならない。その処理に必要な時間は、最悪の場合、（第１の子孫ノードデータの配列の要素数）×（第２の子孫ノードデータの配列全体のスキャンにかかる時間）に比例する。
【００２４】
また、第２の子孫ノードデータの配列に第１の子孫ノードデータに含まれない余計なデータがあるか否かを確認するためには、両者の立場を入れ替えて再度スキャンを行うか、第２の子孫ノードデータの配列をスキャンした時にヒットした要素を何らかの手段で記憶しておき、あとで全ての要素がヒットしているか否かを確認する必要がある。
このように、配列形式のデータのままで比較処理を行うと、比較に多くの処理が必要になる。
【００２５】
これに対しこの発明では、図６に示すように、配列形式の子孫ノードデータを一旦ビットマップ形式のデータに変換してから比較を行うようにしている。ビットマップ形式への変換は、配列の各要素を一度ずつ参照するだけで行うことができる。そして、ビットマップ同士の比較はコンピュータシステムの基礎となっているビット演算によって排他的論理和を求めることによって行うことができるので、ハードウェアでの高速動作が実現可能である。従って、配列形式のデータのままで比較を行う場合に比べて処理量をおよそ配列の要素数分の１に低減できるので、高速に比較を実行でき、ひいては整合性の検査も高速に行うことができる。
【００２６】
次に、図７乃至図１０のフローチャートも用いて、この文書管理サーバにおけるノードデータとノード階層データとの整合性検査の処理について、さらに詳しく説明する。図７乃至図１０は、この整合性検査の処理を示すフローチャートである。
整合性検査処理の要求があると、ＣＰＵ１１はその要求を受け付ける。この処理要求の受け付けは、ＣＰＵ１１がＲＯＭ１２あるいはＨＤＤ１４等に記憶しているデータ管理プログラムの一部を実行することによって常に行っている。
そして、この要求を受けると、ＣＰＵ１１は、データ管理プログラムの他の部分も実行し、図７のフローチャートに示す処理を開始する。すなわちＣＰＵ１１は、データ管理プログラムに含まれる適当なプログラムを実行することにより、以下に説明する各手段として機能する。
【００２７】
まず、ステップＳ１でノードデータを参照して初めのノードのノードＩＤを取得し、そのノードを検査対象とする。ここで、どのノードを初めとしてもよいが、例えば、ノードＩＤの最も小さいものとすればよい。
次のステップＳ２では、ノードがあったか否か判断するが、通常初めはノードがあるので、次のステップＳ３に進み、検査対象のノードについての第１の子孫ノードデータ作成処理を行う。この処理は、図８に示すものであるが、詳細については後述する。ここでは、ＣＰＵ１１が第１の子孫ノードデータ作成手段として機能する。
【００２８】
次に、ステップＳ４で、検査対象のノードについての第２の子孫ノードデータ作成処理を行う。この処理は、図９に示すものであるが、詳細については後述する。ここでは、ＣＰＵ１１が第２の子孫ノードデータ作成手段として機能する。これらのステップＳ３とステップＳ４で作成された第１及び第２の子孫ノードデータは、配列形式のデータであるが、次のステップＳ５では、これらのデータをそれぞれビットマップ形式のデータに変換する。この変換処理は、図１０に示すものであるが、詳細については後述する。ここでは、ＣＰＵ１１が第１及び第２の変換手段として機能する。
【００２９】
次に、ステップＳ６に進み、変換後の第１の子孫ノードデータと変換後の第２の子孫ノードデータとを比較する。この比較は、既に述べたように２つのデータの排他的論理和を求めることによって行うことができる。ここでは、ＣＰＵ１１が比較手段として機能する。
そして、ステップＳ７に進んで相違があるか否か判断し、相違があればステップＳ８に進んでその相違を表示手段に表示させたり記憶手段に記憶させたりする。すなわち、例えば不整合があった場合にその旨や箇所を知らせるメッセージを表示させたり、ログにその旨を記録したりする。不整合がなかった場合でも、その旨のメッセージを表示させるようにしてもよい。
【００３０】
その後、ステップＳ９に進む。ステップＳ７で相違がなければ、そのままステップＳ９に進む。
ステップＳ９では、ノードデータを参照して次のノードのノードＩＤを取得し、そのノードを検査対象とする。ここで、次に取得すべきノードは、任意の基準で定めればよい。
そして、ステップＳ２に戻り、取得するノードがあったか否か判断する。あれば、ステップＳ３以降の処理を繰り返し、なければ、全てのノードについて整合性の検査が終了したものと判断して処理を終了する。
【００３１】
次に、図７のステップＳ３における第１の子孫ノードデータ作成処理について、図３に示した構成のノードにおいてノードＡが検査対象のノード（対象ノード）である場合の処理例も交えて説明する。
第１の子孫ノードデータ作成処理は、図８のフローチャートに示す処理であり、まずステップＳ１１で、ノードデータを検索して対象ノードを親ノードとするノードのノードＩＤを抽出する。この抽出は、ノードデータがＳＱＬ（Structured Query Language）に対応したデータベーステーブルに記憶されている場合には、例えば表２に示したスキーマ定義と検索文とを用いて行うことができる。そして、図３に示した例でノードＡが対象であると、ノードＢとノードＣのノードＩＤとして「２」と「３」が抽出される。
【００３２】
【表２】

【００３３】
次に、ステップＳ１２でノードＩＤが抽出できたか否か判断する。そして、できていれば、ステップＳ１３に進んで抽出結果から初めのノードＩＤを選択する。ここでも、初めにどのノードＩＤを選択してもよいが、例えば、ノードＩＤの最も小さいものを選択すればよい。ここでは「２」となる。
次に、ステップＳ１４でノードＩＤが選択できたか否か判断する。そして、できていれば、ステップＳ１５で第１の子孫ノードデータの配列に選択したノードＩＤをセット（追加）する。そして、ステップＳ１６に進み、選択したノードＩＤのノードを対象ノードとして第１の子孫ノードデータ作成処理、すなわちこの図８のフローチャートの処理を再帰的に実行する。
【００３４】
この再帰実行は再びステップＳ１１から開始するが、ここでは対象ノードはノードＢ（ＩＤ「２」）であるので、ノードＤのノードＩＤとして「４」が抽出される。そして、ステップＳ１２，Ｓ１３と進んでこの「４」が選択され、ステップＳ１５でこの値が第１の子孫ノードデータの配列にセットされる。なお、第１の子孫ノードデータの配列は１つであり、初めの実行時も再帰実行時も、同一の配列に対してセットが行われる。ただし、新たに図７のステップＳ３の処理を開始する場合にリセットすることは、もちろんである。
そしてステップＳ１６では、今度は対象ノードをノードＤ（ＩＤ「４」）として第１の子孫ノードデータ作成処理を再度再帰実行する。
【００３５】
この再帰実行は再びステップＳ１１から開始するが、ここでは対象ノードはノードＤ（ＩＤ「４」）であるので、これを親ノードとするノードはなく、ステップＳ１１ではノードＩＤが抽出されない。
従って、ステップＳ１２の判断はＮＯになるので、元の処理に戻る。すなわち、１段階前のノードＢを対象ノードとする第１の子孫ノードデータ作成処理のステップＳ１６に戻り、ステップＳ１７に進む。
【００３６】
このステップＳ１７では、ステップＳ１１での抽出結果から次のノードＩＤを選択する。ここで、次に抽出すべきノードは任意の基準で定めればよいが、ノードＢを対象ノードとする処理においては、ステップＳ１１で抽出したノードＩＤは「４」のみである。従って、選択対象がないことになる。
そこで、そのままステップＳ１４に戻るが、ここでの判断はＮＯになり、再度元の処理に戻る。すなわち、１段階前のノードＡを対象ノードとする第１の子孫ノードデータ作成処理のステップＳ１６に戻り、ステップＳ１７に進む。
【００３７】
このステップＳ１７では、ステップＳ１１での抽出結果から次のノードＩＤを選択するが、ノードＡを対象ノードとする処理においてはステップＳ１１で抽出したノードＩＤは「２」と「３」であるので、まだ選択していない「３」を選択することになる。
そして、ステップＳ１４に戻り、ここでの判断はＹＥＳとなるのでステップＳ１５に進んで選択した「３」を第１の子孫ノードデータの配列にセットする。
ステップＳ１６では、今度は対象ノードをノードＣ（ＩＤ「３」）として第１の子孫ノードデータ作成処理を再帰実行するが、この処理で抽出されるノードＩＤはないので、説明を省略する。
【００３８】
そしてステップＳ１７に進むが、もう選択すべきノードＩＤはないのでそのままステップＳ１４に戻り、ここでの判断がＮＯになるので、元の処理に戻る。すなわち、第１の子孫ノードデータ作成処理を終了して図７の処理に戻る。
説明に用いた例の場合、処理の終了時に第１の子孫ノードデータの配列には「２」，「３」，「４」がセットされており、これがこの処理によって作成された第１の子孫ノードデータである。なお、このデータは最終的にはビットマップ形式に変換して用いるので、この時点でのノードＩＤの配列順は、どのような順番であっても構わない。
【００３９】
次に、図７のステップＳ４における第２の子孫ノードデータ作成処理について、図３に示した構成のノードにおいてノードＡが対象ノードである場合の処理例も交えて説明する。
第２の子孫ノードデータ作成処理は、図９のフローチャートに示す処理であり、まずステップＳ２１で、ノード階層データを検索して対象のノードを親ノードとする組み合わせの子ノードＩＤを抽出する。この抽出は、ノード階層データがＳＱＬ（Structured Query Language）に対応したデータベーステーブルに記憶されている場合には、例えば表３に示したスキーマ定義と検索文とを用いて行うことができる。そして、図３に示した例でノードＡが対象であると、ノードＩＤ「１」を親ノードとする子ノードのＩＤとして「２」，「３」，「４」が抽出される。
【００４０】
【表３】

【００４１】
そして、ステップＳ２２で第２の子孫ノードデータの配列に抽出したノードＩＤをセット（追加）する。以上で第２の子孫ノードデータ作成処理は終了し、元の処理すなわち図７の処理に戻る。ノード階層データには、直接の親子関係だけでなく、子孫関係のデータも含まれているので、ノードデータを参照する場合のように再帰的な検索を行う必要がなく、このような簡単な処理で子孫ノードデータを作成できるのである。
説明に用いた例の場合、処理の終了時に第２の子孫ノードデータの配列には「２」，「３」，「４」がセットされており、これがこの処理によって作成された第２の子孫ノードデータである。なお、このデータについても、最終的にはビットマップ形式に変換して用いるので、この時点でのノードＩＤの配列順は、どのような順番であっても構わない。
【００４２】
次に、図７のステップＳ５における変換処理について、図６に示した子孫ノードデータを変換する場合の処理例も交えて説明する。
変換処理は、図１０のフローチャートに示す処理であり、まずステップＳ３１で、変換対象の子孫ノードデータの配列から初めの要素（ノードＩＤ）を取得する。
そして、ステップＳ３２に進んで取得できたか否か判断する。あれば、ステップＳ３３に進み、ビットマップの取得したノードＩＤ番目のビットに「１」をセットする。例えば取得したノードＩＤが「２」であれば２番目、「３」であれば３番目のビットに「１」をセットする。なお、ビットマップの初期値としては、第１及び第２の子孫ノードデータの配列に含まれる最大のノードＩＤの数だけ「０」のビットを並べたデータを用意しておくものとする。例えば、図６に示す例の場合には、最大のノードＩＤは「４」であるので、４個の「０」を並べたデータを用意しておく。
【００４３】
次に、ステップＳ３４に進み、配列の次の要素を取得してステップＳ３２に戻る。ステップＳ３４で要素を取得できた場合には、ステップＳ３２の判断がＹＥＳとなるのでステップＳ３３以降の処理を繰り返すが、次の要素がなくて取得できなかった場合には、ＮＯとなるので、全ての要素についての処理が終了したものとして変換処理を終了し、もとの処理に戻る。
ここで、図７のステップＳ５では、第１の子孫ノードデータと第２の子孫ノードデータのそれぞれについて、図１０の変換処理を実行するものとする。
また、この変換処理においては、配列の要素をどのような順で取得しても、最終的には各要素に対応したビットに「１」をセットしたデータを生成することができる。従って、配列形式の状態で各要素がどのような順で並んでいても、同じ要素によって構成されていれば、生成されるビットマップは同一になる。
【００４４】
この文書管理サーバ１０は、以上のような処理を行い、配列形式の子孫ノードデータを一旦ビットマップ形式のデータに変換してから比較を行うようにしたことにより、上述のように、配列形式のデータのままで比較を行う場合に比べて処理量をおよそ配列の要素数分の１に低減できるので、高速に比較を実行でき、ひいてはノードデータとノード階層データとの整合性すなわち実際の階層構造と記憶してある階層情報との整合性の検査も高速に行うことができる。
なお、図７のフローチャートの処理において、ステップＳ３乃至ステップＳ５の処理は必ずしもこの順で行う必要はない。例えば、第２の子孫ノードデータを先に作成するようにしてもよいし、作成した子孫ノードデータをただちに変換処理に供するようにしてもよい。
【００４５】
〔第２の実施形態：図１１乃至図１５〕
次に、この発明のデータ管理装置の第２の実施形態である文書管理サーバについて説明する。図１１はその文書管理サーバにおけるこの発明の特徴部分の機能構成を示す機能ブロック図、図１２及び図１３はその文書管理サーバにおける誤りのあるノード階層データの例を示す図、図１４及び図１５はこの文書管理サーバにおけるノードデータとノード階層データとの整合性検査の処理を示すフローチャートである。
この実施形態の文書管理サーバは、機能構成としてはノード階層データ修正部１０８を設けた点が異なるのみであり、ＣＰＵ１１による処理もこの点に伴って変更が加えられたのみであり、他の点は第１の実施形態の文書管理サーバ１０と同様であるので、相違点以外の説明は簡単にするか省略する。また、第１の実施形態の文書管理サーバ１０と対応する構成には同一の符号を用いる。
【００４６】
この実施形態の文書管理サーバにおけるノード階層データ修正部１０８は、ビットマップ比較部１０６における比較結果とビットマップ形式の第１の子孫ノードデータとに基づいてノード階層データの修正を行う機能を有する。
具体例を挙げると、例えば図３に示したノード階層構成において、ノード階層データが図１２に示すものであった場合、データＩＤが「４」のデータは、実際の階層構成と対応しておらず、誤ったデータである。このような状態で整合性検査を行うと、ノードＩＤが「１」のノードＡを対象としたとき、表４に示すように、ノードデータを参照して作成する第１の子孫ノードデータは「０１１１０」に、ノード階層データを参照して作成する第２の子孫ノードデータは「０１１１１」になるので、比較結果の排他的論理和に１のビットが出現し、不整合が検出される。
【００４７】
【表４】

【００４８】
ここで、１のビットは５番目であることから、ノードＩＤが「５」のノードについての子孫関係に不整合があることがわかる。一方、第１の子孫ノードデータの５番目のビットは「０」であるので、必然的に第２の子孫ノードデータの５番目のビットは「１」であり、実際にはノードＩＤが「５」のノードはノードＡの子孫ノードではなく、ノード階層データに余分なデータが含まれているのだということがわかる。従って、ノード階層データ修正部１０８は、親ノードＩＤが「１」，子ノードＩＤが「５」であるノード階層データを削除する修正を行うことにより、不整合を解消することができる。この修正は、例えばノード階層データが、ＳＱＬに対応し、表３に示したスキーマ定義を用いたデータベーステーブルに記憶されている場合には、
delete from NodeHierarchyData where ParentID = 1 and ChildID = 5
のようなコマンドによって実行できる。
【００４９】
別の例として、例えば図３に示したノード階層構成において、ノード階層データが図１３に示すものであった場合、親ノードＩＤが「１」，子ノードＩＤが「４」であるデータが不足している。このような状態で整合性検査を行うと、ノードＩＤが「１」のノードＡを対象としたとき、表５に示すように、ノードデータを参照して作成する第１の子孫ノードデータは「０１１１」に、ノード階層データを参照して作成する第２の子孫ノードデータは「０１１０」になるので、やはり比較結果の排他的論理和に１のビットが出現し、不整合が検出される。
【００５０】
【表５】

【００５１】
ここで、１のビットは４番目であることから、ノードＩＤが「４」のノードについての子孫関係に不整合があることがわかる。一方、第１の子孫ノードデータの４番目のビットは「１」であるので、必然的に第２の子孫ノードデータの４番目のビットは「０」であり、実際にはノードＩＤが「４」のノードはノードＡの子孫ノードであるのに、ノード階層データに対応するデータが含まれていないことがわかる。従って、ノード階層データ修正部１０８は、親ノードＩＤが「１」，子ノードＩＤが「４」であるノード階層データを追加する修正を行うことにより、不整合を解消することができる。この修正は、例えばノード階層データが、ＳＱＬに対応し、表３に示したスキーマ定義を用いたデータベーステーブルに記憶されている場合には、
insert into NodeHierarchyData (ID, ParentID, ChildID) values (4, 1, 4)
のようなコマンドによって実行できる。
【００５２】
このように、ノード階層データ修正部１０８は、比較結果の排他的論理和で「１」が立っているビットの位置によって不整合に係る子孫ノードのＩＤを取得し、第１の子孫ノードデータの対応するビットを参照することによってノード階層データに追加が必要か削除が必要かを判断して、適切な修正を行うことができる。
なお、ノードデータを修正しないのは、このデータがノードの階層構造を定義するための基本的なデータであり、このデータを変更すると、階層構造そのものを変更することになってしまうためである。一方、ノード階層データは、ノードデータを参照して作成されたものであり、階層構造を効率よく把握するためのデータであるので、こちらを変更しても階層構造を変更することにはならない。
【００５３】
次に、この文書管理サーバにおけるノードデータとノード階層データの整合性検査の処理について、フローチャートも用いてさらに詳しく説明する。
整合性検査処理の要求があると、ＣＰＵ１１はその要求を受け付ける。この処理要求の受け付けは、ＣＰＵ１１がＲＯＭ１２あるいはＨＤＤ１４等に記憶しているデータ管理プログラムの一部を実行することによって常に行っている。
そして、この要求を受けると、ＣＰＵ１１は、データ管理プログラムの他の部分も実行し、図１４のフローチャートに示す処理を開始する。すなわちＣＰＵ１１は、データ管理プログラムに含まれる適当なプログラムを実行することにより、以下に説明する各手段として機能する。
【００５４】
図１４のフローチャートに示す処理は、ステップＳ７で相違があった場合にステップＳ８の処理の次にステップＳ４１のノード階層データ修正処理を行う点を除き、第１の実施形態で図７乃至図１０のフローチャートを用いて説明した処理と同様であるので、このノード階層データ修正処理についてのみ説明する。この処理においては、ＣＰＵ１１が修正手段として機能する。
【００５５】
ノード階層データ修正処理は、図１５のフローチャートに示す処理である。
ここでは、まずステップＳ５１で、比較結果の排他的論理和のビットマップから初めのビットを取得する。
そして、ステップＳ５２で最後のビットを越えたか否か判断する。超えていなければ、ステップＳ５３に進み、取得したビットが１であるか否か判断する。１であれば、そのビットに対応するノードについて不整合があったと判断してステップＳ５４に進み、第１の子孫ノードデータのビットマップの対応ビットを参照する。
【００５６】
次に、ステップＳ５５でそのビットが０であるか判断する。０であれば、ノード階層データに余分なデータが含まれていたものと判断し、ステップＳ５６に進む。そして、ノード階層データから、その時点での検査対象ノードを親ノードとし、ステップＳ５４で参照したビットに対応するノード（例えばその時点で参照しているビットが４番目のビットであれば、ノードＩＤが「４」のノード）を子孫ノードとするデータを削除する。そして、ステップＳ５８に進んで比較結果の排他的論理和のビットマップから次のビットを取得してステップＳ５２に戻り、処理を繰り返す。
【００５７】
ステップＳ５５で０でなければ、ノード階層データに必要なデータが不足していたものと判断し、ステップＳ５７に進む。そして、ノード階層データに、その時点での検査対象ノードを親ノードとし、ステップＳ５４で参照したビットに対応するノードを子孫ノードとするデータを追加する。そして、ステップＳ５８に進んで比較結果の排他的論理和のビットマップから次のビットを取得してステップＳ５２に戻り、処理を繰り返す。
ステップＳ５３で１でなかった場合には、そのままステップＳ５８に進んで比較結果の排他的論理和のビットマップから次のビットを取得し、ステップＳ５２に戻って処理を繰り返す。
ステップＳ５２で最後のビットを越えていれば、現在の検査対象ノードについて必要な修正は全て完了したと判断し、元の図１４の処理に戻る。
【００５８】
この実施形態の文書管理サーバは、以上のような処理を行い、比較結果のビットマップを参照してノード階層データの不整合箇所を特定して修正を行うようにしたので、実際の階層構造と記憶してある階層情報との整合性の検査のみならず、不整合箇所の修正も高速に行うことができる。
【００５９】
なお、以上説明した各実施形態において、データ蓄積手段１６に蓄積した文書データの管理を行う文書管理サーバ１０を例として説明したが、ＨＤＤ１４等に格納したデータも一括して管理するようにしてもよいことはもちろんであり、外部の装置のものも含め、複数の記憶手段に記憶しているデータを一括して管理するようにしてもよい。これ以外にも、この発明のデータ管理装置は文書管理サーバに限られるものではもちろんなく、階層構造を持つノードによって構成される何らかのデータを管理する装置であればどのようなものにでも適用できることは、もちろんである。
【００６０】
また、上述した文書管理サーバ１０のＣＰＵ１１を上述した各手段として機能させるためのプログラムは、予め文書管理サーバ１０のＨＤＤ１４等に記憶させておくほか、記録媒体であるＣＤ−ＲＯＭあるいは光磁気（ＭＯ）ディスク，ミニディスク（ＭＤ），ＤＶＤ（Digital Versatile Disk），フレキシブルディスク，磁気テープ，ＨＤＤ等の不揮発性記録媒体（メモリ）にプログラムソースや実行形式のバイナリデータとして記録して提供することもできる。そのメモリに記録されたプログラムを文書管理サーバ１０にインストールしてＣＰＵ１１に実行させるか、ＣＰＵ１１にそのメモリからこのプログラムを読み出して実行させることにより、上述した各手段として機能させることができる。
さらに、このプログラムを記録した記録媒体を備える外部機器あるいはこのプログラムを記憶手段に記憶した外部機器からダウンロードして実行させることも可能である。
【００６１】
【発明の効果】
以上説明してきたように、この発明のデータ管理装置によれば、ノードの階層構造と記憶してある階層情報との整合性の検査を簡単な処理で行うことができるので、高速に検査を行うことができる。
また、この発明のプログラムによれば、コンピュータを上記のようなデータ管理装置として機能させることができる。
また、この発明の記録媒体によれば、上記のプログラムを記憶していないコンピュータにそのプログラムを記憶させ、その実行によって上記のようなデータ管理装置として機能させることができる。
【図面の簡単な説明】
【図１】この発明のデータ管理装置の第１の実施形態である文書管理サーバにおけるこの発明の特徴部分の機能構成を示す機能ブロック図である。
【図２】その文書管理サーバの概略構成を示すブロック図である。
【図３】その文書管理サーバにおけるデータの管理方式について説明するための図である。
【図４】同じく子孫ノードデータの作成について説明するための図である。
【図５】同じく子孫ノードデータの比較について説明するための図である。
【図６】同じく子孫ノードデータのビットマップ形式への変換について説明するための図である。
【図７】この発明のデータ管理装置の第１の実施形態である文書管理サーバにおける整合性検査の処理を示すフローチャートである。
【図８】図７のフローチャートに示した第１の子孫ノードデータ作成処理の内容を示すフローチャートである。
【図９】図７のフローチャートに示した第２の子孫ノードデータ作成処理の内容を示すフローチャートである。
【図１０】図７のフローチャートに示した変換処理であるビットマップ化処理の内容を示すフローチャートである。
【図１１】この発明のデータ管理装置の第２の実施形態である文書管理サーバにおけるこの発明の特徴部分の機能構成を示す機能ブロック図である。
【図１２】その文書管理サーバにおける誤りのあるノード階層データの例を示す図である。
【図１３】その別の例を示す図である。
【図１４】この発明のデータ管理装置の第２の実施形態である文書管理サーバにおける整合性検査の処理を示すフローチャートである。
【図１５】図１４のフローチャートに示したノード階層データ修正処理の内容を示すフローチャートである。
【符号の説明】
１０：文書管理サーバ
１１：ＣＰＵ
１２：ＲＯＭ
１３：ＲＡＭ
１４：ＨＤＤ
１５：ネットワークＩ／Ｆ
１６：データ蓄積手段
１７：システムバス
１０１：整合性検査要求受付部
１０２：ノード抽出部
１０３：階層情報再作成部
１０４：階層情報抽出部
１０５：ビットマップ化処理部
１０６：ビットマップ比較部
１０７：結果表示・記憶部
１０８：ノード階層データ修正部
１１１：ノードデータテーブル
１１２：ノード階層データテーブル
[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a data management apparatus that manages data in a format composed of hierarchically structured nodes, in which each node has its own unique identification information and the identification information of its parent node, to a program for causing a computer to function as such a data management apparatus, and to a recording medium on which such a program is recorded.
[0002]
[Prior art]
Data management apparatuses (or systems) such as file management apparatuses and document management apparatuses have conventionally managed data by giving it a hierarchical structure. Such a hierarchical structure can be expressed, without greatly increasing the amount of data held by each node, by providing nodes as units of data arrangement at each level of the hierarchy and giving each node identification information such as a unique ID number together with the identification information of its parent node. The parent node of a given node is the node located immediately above it. Conversely, a node located immediately below a given node is called a child node of that node, a node located immediately below a child node is called a grandchild node, and so on. A node that is below a given node and can be reached from it without passing through any higher node is called a descendant node of that node.
[0003]
When managing data with such a hierarchical structure, one conceivable approach is to determine the descendant nodes of each node in advance and to store the correspondence between each parent node and its descendant nodes as hierarchical information. Doing so speeds up operations such as obtaining all descendant nodes of a specific node.
[0004]
When such a method is adopted, no inconsistency normally arises between the actual hierarchical structure and this hierarchical information, but an inconsistency may be introduced by an unexpected fault or the like. Where an inconsistency exists, the affected data may become impossible to access normally, so it is necessary to check for inconsistencies as needed and to repair any that are found.
Techniques related to detecting and repairing such inconsistencies include, for example, the file management device disclosed in Japanese Patent Laid-Open No. 11-120058 and the data processing device disclosed in Japanese Patent Laid-Open No. 2000-284995. However, these devices concern the recovery of a file system and do not detect inconsistencies between a hierarchical structure and hierarchical information at high speed.
[0005]
[Problems to be solved by the invention]
One conceivable method for detecting an inconsistency between the hierarchical structure and the hierarchical information is to obtain the descendant nodes of the node under inspection by traversing the actual hierarchical structure, to obtain the descendant nodes of the same node from the stored hierarchical information, to place each set of descendant node information in an array, and to check whether the two arrays differ.
However, with this method, checking whether the arrays differ, and moreover identifying the elements that are present in one array but not in the other, takes a long time if done by simply scanning the array elements.
An object of the present invention is to solve this problem and, when managing data in a format composed of hierarchically structured nodes in which each node has unique identification information and the identification information of its parent node, to enable the consistency between the actual hierarchical structure and the stored hierarchical information to be checked at high speed.
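The cost of the scanning approach can be made concrete with a minimal sketch. The function name `diff_by_scanning` and the use of Python lists for the two descendant node arrays are assumptions made purely for illustration; the specification does not prescribe any particular code.

```python
def diff_by_scanning(first, second):
    """Compare two arrays of descendant node IDs by plain scanning.

    Returns (IDs in `first` but not in `second`,
             IDs in `second` but not in `first`).
    """
    only_in_first = []
    for node_id in first:          # one outer step per element of `first`
        hit = False
        for other in second:       # linear scan of `second` for each element
            if node_id == other:
                hit = True
                break
        if not hit:
            only_in_first.append(node_id)

    # Extra entries in `second` are only found by a second pass with the
    # roles of the two arrays swapped (`in` on a list is again a linear scan).
    only_in_second = [n for n in second if n not in first]
    return only_in_first, only_in_second
```

In the worst case the work grows with the product of the two array lengths, which is the overhead the invention sets out to avoid.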
[0006]
[Means for Solving the Problems]
To achieve the above object, the present invention provides a data management apparatus for managing data in a format composed of nodes having a hierarchical structure, comprising: means for storing, as node information, identification information consisting of a unique integer value for each node together with the identification information of the parent node immediately above that node; means for storing, as node hierarchy information, for each node, the identification information of each of its descendant nodes, that is, nodes that are below that node and can be reached without passing through any node above it, in one-to-one association with the identification information of that node; first descendant node data creating means for creating, based on the node information, first descendant node data in which the identification information of all descendant nodes of an arbitrary node is collected; second descendant node data creating means for creating, based on the node hierarchy information, second descendant node data in which the identification information of all descendant nodes of the arbitrary node is collected; first conversion means for converting the first descendant node data created by the first descendant node data creating means into bitmap format data in which, according to the identification information of the descendant nodes included in the first descendant node data, the bit at the position corresponding to the value of each piece of identification information is set; second conversion means for converting the second descendant node data created by the second descendant node data creating means into bitmap format data in which, according to the identification information of the descendant nodes included in the second descendant node data, the bit at the position corresponding to the value of each piece of identification information is set; and comparison means for comparing the first descendant node data with the second descendant node data by obtaining the exclusive OR of the bitmap-format first descendant node data converted by the first conversion means and the bitmap-format second descendant node data converted by the second conversion means.
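The conversion and comparison steps can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names are hypothetical, a Python integer stands in for the bitmap, and node ID n is mapped to bit n-1 counted from the least significant end (the specification only requires that the bit position correspond to the ID value).

```python
def to_bitmap(node_ids):
    """Set one bit per node ID; each array element is visited exactly once."""
    bits = 0
    for node_id in node_ids:
        bits |= 1 << (node_id - 1)   # assumption: ID n maps to bit n-1
    return bits


def xor_compare(first, second):
    """Exclusive OR of the two bitmaps; 0 means the two sets of IDs agree."""
    return to_bitmap(first) ^ to_bitmap(second)


def differing_ids(xor_bits):
    """List the node IDs whose bit is 1 in the XOR result."""
    ids = []
    position = 1
    while xor_bits:
        if xor_bits & 1:
            ids.append(position)
        xor_bits >>= 1
        position += 1
    return ids
```

Because the bitmap depends only on which IDs are present, the order of the elements in either array does not affect the result.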
[0007]
In such a data management apparatus, it is preferable to further provide correction means for correcting the node hierarchy information based on the comparison result when the comparison by the comparison means finds a mismatch.
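The description of the second embodiment decides between deletion and addition by looking, for each bit that is 1 in the exclusive OR, at the corresponding bit of the first descendant node data: if that bit is 0 the node hierarchy information holds a surplus entry, and if it is 1 an entry is missing. A minimal sketch of that decision follows; the function names, the tuple format of the returned actions, and the integer bitmap layout (ID n at bit n-1) are assumptions for illustration.

```python
def to_bitmap(node_ids):
    """Integer bitmap with bit n-1 set for each node ID n (assumed layout)."""
    bits = 0
    for node_id in node_ids:
        bits |= 1 << (node_id - 1)
    return bits


def plan_corrections(parent_id, first, second):
    """Return the edits that bring the node hierarchy information
    (source of `second`) into line with the node information
    (source of `first`) for one inspected node."""
    first_bits = to_bitmap(first)
    xor_bits = first_bits ^ to_bitmap(second)
    actions = []
    position = 1
    while xor_bits >> (position - 1):        # stop past the highest 1 bit
        mask = 1 << (position - 1)
        if xor_bits & mask:                  # this node ID is inconsistent
            if first_bits & mask:
                # A real descendant is missing from the hierarchy information.
                actions.append(("insert", parent_id, position))
            else:
                # The hierarchy information holds a surplus entry.
                actions.append(("delete", parent_id, position))
        position += 1
    return actions
```

Only the node hierarchy information is edited; the node information defines the hierarchy itself and is left untouched.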
[0008]
The program of the present invention is a program for causing a computer to function as: data management means for managing data in a format composed of nodes having a hierarchical structure; means for storing, as node information, identification information consisting of a unique integer value for each node together with the identification information of the parent node immediately above that node; means for storing, as node hierarchy information, for each node, the identification information of each of its descendant nodes, that is, nodes that are below that node and can be reached without passing through any node above it, in one-to-one association with the identification information of that node; first descendant node data creating means for creating, based on the node information, first descendant node data in which the identification information of all descendant nodes of an arbitrary node is collected; second descendant node data creating means for creating, based on the node hierarchy information, second descendant node data in which the identification information of all descendant nodes of the arbitrary node is collected; first conversion means for converting the first descendant node data created by the first descendant node data creating means into bitmap format data in which, according to the identification information of the descendant nodes included in the first descendant node data, the bit at the position corresponding to the value of each piece of identification information is set; second conversion means for converting the second descendant node data created by the second descendant node data creating means into bitmap format data in which, according to the identification information of the descendant nodes included in the second descendant node data, the bit at the position corresponding to the value of each piece of identification information is set; and comparison means for comparing the first descendant node data with the second descendant node data by obtaining the exclusive OR of the bitmap-format first descendant node data converted by the first conversion means and the bitmap-format second descendant node data converted by the second conversion means.
Such a program preferably further includes a program for causing the computer to function as correction means for correcting the node hierarchy information based on the comparison result when the comparison by the comparison means finds a mismatch.
The recording medium of the present invention is a computer-readable recording medium on which the above programs are recorded.
[0009]
DETAILED DESCRIPTION OF THE INVENTION
Preferred embodiments of the present invention will be described below with reference to the drawings.
[First Embodiment: FIGS. 1 to 10]
First, a document management server that is a first embodiment of the data management apparatus of the present invention will be described. Its schematic configuration is described first with reference to FIGS. 1 to 6. FIG. 1 is a functional block diagram showing the functional configuration of the characteristic part of the present invention in the document management server, FIG. 2 is a block diagram showing the schematic configuration of the document management server, FIG. 3 is a diagram for explaining the data management method in the document management server, FIG. 4 is a diagram for explaining the creation of descendant node data, FIG. 5 is a diagram for explaining the comparison of descendant node data, and FIG. 6 is a diagram for explaining the conversion of descendant node data into bitmap format.
[0010]
In terms of hardware, the document management server 10 can be configured using a known server device. That is, as shown in FIG. 2, the document management server 10 includes a CPU 11, a ROM 12, a RAM 13, a hard disk drive (HDD) 14, a network interface (I/F) 15, and a data storage unit 16, which are connected by a system bus 17.
The CPU 11 is a control unit that controls the entire document management server 10. It executes the various control programs and application programs stored in the ROM 12 and the HDD 14 to control the apparatus, control access to data by the client device 20, manage data, and so on.
[0011]
The ROM 12 is a storage unit that mainly stores a control program for the apparatus. The RAM 13 is a storage means used for the work memory of the CPU 11 and temporary data storage.
The HDD 14 is a storage unit that stores various application programs and data.
The network I/F 15 is an interface for connecting the document management server 10 to a network such as the Internet or a local area network (LAN). The document management server 10 exchanges data with a client device 20, such as a personal computer (PC), via this network I/F 15.
[0012]
The data storage unit 16 is a non-volatile storage unit capable of accumulating and storing a large amount of data such as document data; the HDD 14 may also serve as the data storage unit 16.
Although FIG. 2 does not show an operation unit or a display unit, an operation unit such as a keyboard and mouse and a display unit such as a liquid crystal display or cathode ray tube (CRT) may be provided so that input from a user can be accepted and operation results can be displayed.
[0013]
The document management server 10 configured in this way is a device that manages the document data stored in the data storage unit 16 and transmits managed document data in response to requests from the client device 20. In the document management server 10, the document data to be managed is stored in the data storage unit as a set of nodes having a hierarchical structure, as shown in FIG. 3(a). Here, a node is a predetermined unit used when accessing data; it corresponds to a file or folder in a typical file system, although finer units may also be handled as nodes. This management operation is performed by the CPU 11, which functions as data management means by executing the required data management program stored in the ROM 12 or the HDD 14.
[0014]
In the data storage unit 16, node data and node hierarchy data are stored in the form of tables as data for managing the hierarchical structure of these nodes.
The node data is the node information. As shown in FIG. 3(b), it consists, for each node, of a node ID, which is identification information consisting of a unique integer value, the name of the node, and the node ID of the parent node located immediately above that node. By referring to this node data, the document management server 10 can determine where each node is placed; the node data is thus the basic data that defines the hierarchical structure of the nodes.
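As an illustration of how descendant IDs can be gathered from node data alone, the sketch below repeatedly extracts the nodes whose parent is the target, then the nodes whose parent is one of those, and so on until nothing more is found. The sample rows follow the example used in the description of FIG. 3 (node A with ID 1, its children B and C with IDs 2 and 3, and D with ID 4 under B); the tuple layout, the use of `None` for the root's parent, and the function name are assumptions.

```python
# (node ID, name, parent node ID); None marks the root (assumed convention)
NODE_DATA = [
    (1, "A", None),
    (2, "B", 1),
    (3, "C", 1),
    (4, "D", 2),
]


def collect_descendants(node_data, target_id):
    """Collect the IDs of all descendants of `target_id` from parent links.

    Assumes the parent links contain no cycles.
    """
    descendants = []
    frontier = [target_id]
    while frontier:
        # Every node whose parent is in the current frontier is one level down.
        next_frontier = [
            node_id for node_id, _name, parent in node_data if parent in frontier
        ]
        descendants.extend(next_frontier)
        frontier = next_frontier
    return descendants
```

Each level of the hierarchy costs one pass over the node data, which is why a precomputed table of descendant relationships is attractive for frequent lookups.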
[0015]
On the other hand, the node hierarchy data is the node hierarchy information, and is stored by attaching an integer data ID to each parent-child relationship between nodes, as shown in FIG. 3. Each parent-child relationship is stored as a combination of the node ID of the parent node and the node ID of the child node. Here, not only the child nodes immediately below a parent node but also all of its descendant nodes, that is, lower nodes such as grandchild and great-grandchild nodes that can be traced from the parent node without going through a higher node, are treated as child nodes of that node, and each such relationship is stored as a parent-child relationship.
Such node hierarchy data can be created by collecting information on the parent-child and descendant relationships of the nodes with reference to the identification information of the parent node of each node included in the node data. By creating and storing such data in advance, the process of extracting all the nodes located below a certain node can be performed at high speed, and processing such as searching for nodes within a limited range or rearranging the node hierarchical structure can also be performed at high speed.
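The creation of node hierarchy data from node data described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the dictionary layout and the function name build_node_hierarchy are assumptions introduced here, and the node IDs (A=1, B=2, C=3, D=4) follow the example discussed in the text.

```python
# Node data for the example hierarchy: node ID -> parent node ID.
# None marks the root. The dict layout is an assumption for illustration.
node_data = {1: None, 2: 1, 3: 1, 4: 2}

def build_node_hierarchy(node_data):
    """Return sorted (ancestor ID, descendant ID) pairs for every node."""
    pairs = []
    for node_id in node_data:
        ancestor = node_data[node_id]
        # Walk upward through the parent links until the root is passed,
        # recording one pair per ancestor (parent, grandparent, ...).
        while ancestor is not None:
            pairs.append((ancestor, node_id))
            ancestor = node_data[ancestor]
    return sorted(pairs)

hierarchy = build_node_hierarchy(node_data)
# hierarchy == [(1, 2), (1, 3), (1, 4), (2, 4)]
```

Node D (ID 4) appears both under its parent B (ID 2) and under its grandparent A (ID 1), which is what lets all descendants of a node be read out with a single lookup.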
[0016]
These node data and node hierarchy data are normally managed so as to maintain consistency, but inconsistencies may occur due to unexpected troubles. In such a case, the hierarchical structure of the nodes cannot be recognized correctly, and the document management operation will be hindered. Therefore, it is necessary to check whether or not this inconsistency has occurred. A feature of the present invention is that this inspection can be performed at high speed.
[0017]
This high-speed inspection process is realized by causing the CPU 11 to execute a required program stored in the ROM 12 or the HDD 14 so as to perform the functions of the respective units shown in FIG. 1. These functions are described here.
First, the consistency inspection request accepting unit 101 has a function of accepting a request for consistency inspection processing, which is made every predetermined time, every time a predetermined event occurs, or by a user instruction or the like, and transmitting the fact to the node extraction unit 102.
[0018]
When the inspection processing request is transmitted from the consistency inspection request accepting unit 101, the node extraction unit 102 sequentially extracts node IDs from the node data table 111 and has a function of transmitting each of them, as an inspection target node, to the hierarchy information re-creating unit 103 and the hierarchy information extraction unit 104.
The hierarchy information re-creating unit 103 has a function of collecting all the node IDs of the descendant nodes of the inspection target node with reference to the node data table 111 and creating first descendant node data that indicates those descendant nodes in an array format as shown in FIG. 4. This collection can be realized, for example, by first extracting all nodes having the target node as their parent node, and then repeatedly extracting all nodes having an extracted node as their parent node until no more nodes can be extracted.
[0019]
The hierarchy information extraction unit 104 has a function of collecting all the node IDs of the descendant nodes of the inspection target node with reference to the node hierarchy data table 112 and creating second descendant node data that indicates those descendant nodes in an array format as shown in FIG. 4. This collection can be realized, for example, by extracting all parent-child relationships (descendant relationships) having the target node as the parent node from the node hierarchy data table 112.
The bitmap processing unit 105 has a function of converting the first descendant node data created by the hierarchy information re-creating unit 103 and the second descendant node data created by the hierarchy information extraction unit 104 from array format data into bitmap format data. Here, bitmap format data is data composed of a predetermined number of bits, each having a value of 0 or 1. The conversion to the bitmap format can be done, for example, as shown in FIG. 6, by preparing a bitmap in which as many 0 bits as the maximum value of the node ID are arranged, and setting to 1 the bits at the positions given by the node IDs included in the array (the second, third, and fourth bits in the example in the figure). Here, the bits are counted as the first, second, and so on from the leftmost bit in the figure.
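The array-to-bitmap conversion just described might look like the following sketch. The function name to_bitmap and the use of a Python list of 0/1 values are assumptions made for illustration; the bit numbering (first bit = node ID 1, counted from the left) follows the text.

```python
def to_bitmap(node_ids, max_node_id):
    """Convert an array of node IDs to a bitmap whose first bit is node ID 1."""
    bits = [0] * max_node_id      # prepare max_node_id bits, all 0
    for node_id in node_ids:      # each array element is referenced once
        bits[node_id - 1] = 1     # set the node_id-th bit (1-based position)
    return bits

bitmap = to_bitmap([2, 3, 4], max_node_id=4)
# bitmap == [0, 1, 1, 1], that is, "0111" read from the left
```

Because each element only sets its own bit, the order of the elements in the array does not affect the result.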
[0020]
The bitmap comparison unit 106 has a function of comparing the first descendant node data in the bitmap format converted by the bitmap processing unit 105 with the second descendant node data in the bitmap format. This comparison can be done by finding the exclusive OR of the two bitmaps.
The exclusive OR is 0 where the corresponding bits are the same and 1 where they differ. Therefore, if there is no mismatch between the node data and the node hierarchy data, as in the example of FIG. 3, the descendant node data obtained from the two match, and the exclusive OR is 0 for all bits as shown in Table 1 (a). On the other hand, when there is a mismatch, the descendant node data obtained from the two differ in some part, so the exclusive OR is 1 at the bits of that part, as shown in Tables 1 (b) and (c).
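The comparison by exclusive OR can be illustrated with Python integers used as bitmaps, so that the XOR is a single machine-level operation. This is only a sketch: the helper names ids_to_mask and mask_to_ids are assumptions, node ID n is mapped to bit n-1 of the integer, and the two mismatch cases are hypothetical inputs rather than the contents of Table 1.

```python
def ids_to_mask(node_ids):
    """Pack node IDs into an integer bitmap (node ID n -> bit n-1)."""
    mask = 0
    for node_id in node_ids:
        mask |= 1 << (node_id - 1)
    return mask

def mask_to_ids(mask):
    """List the node IDs whose bits are set in the bitmap."""
    return [i + 1 for i in range(mask.bit_length()) if (mask >> i) & 1]

first = ids_to_mask([2, 3, 4])                       # from the node data

consistent = first ^ ids_to_mask([4, 3, 2])          # same set, other order
extra_row = first ^ ids_to_mask([2, 3, 4, 5])        # hierarchy data has an extra ID
missing_row = first ^ ids_to_mask([2, 3])            # hierarchy data lacks an ID

# consistent == 0, so no mismatch is reported.
# mask_to_ids(extra_row) == [5] and mask_to_ids(missing_row) == [4].
```

A result of 0 means the two sets of descendants agree; any set bit directly names a node ID on which they disagree.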
[0021]
[Table 1]

[0022]
The result display / storage unit 107 has a function of displaying the comparison result by the bitmap comparison unit 106 on the display unit or storing the result in the storage unit. That is, for example, when there is a mismatch, a message notifying the fact or location is displayed, or the fact is recorded in the log. Even if there is no inconsistency, a message to that effect may be displayed.
By having the units described above compare the first descendant node data and the second descendant node data for all the nodes extracted by the node extraction unit 102, it is possible to check whether there is any mismatch between the node data and the node hierarchy data over the entire node hierarchy structure.
[0023]
Here, in order to perform this inspection processing at high speed, it is important to perform comparison processing between the first descendant node data and the second descendant node data at high speed.
If these were simply compared as array data, it would be necessary, for example as shown in FIG. 5, to compare (scan) each element of the array of the first descendant node data sequentially against the elements of the array of the second descendant node data to check whether the same element exists (hits). In the worst case, the time required for this processing is proportional to (number of elements in the array of the first descendant node data) × (time required to scan the entire array of the second descendant node data).
[0024]
Furthermore, to check whether the array of the second descendant node data contains extra data that is not included in the first descendant node data, it is necessary either to swap the roles of the two arrays and scan a second time, or to record by some means the elements that were hit when the array of the second descendant node data was scanned and check afterwards whether all of its elements were hit.
Thus, if the comparison is performed on the array format data as it is, a large amount of processing is required.
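For contrast, the array-based comparison described above can be sketched as a nested scan that also records which elements of the second array were hit. The function name compare_arrays_naively is an assumption; the point of the sketch is only that the inner loop runs once per element of the first array, which is the cost the bitmap approach avoids.

```python
def compare_arrays_naively(first, second):
    """Return (IDs only in first, IDs only in second) by scanning the arrays."""
    hit = [False] * len(second)        # remembers which second elements were hit
    only_in_first = []
    for a in first:                    # for each element of the first array ...
        found = False
        for j, b in enumerate(second): # ... scan the second array
            if a == b:
                hit[j] = True
                found = True
                break
        if not found:
            only_in_first.append(a)
    # Elements of the second array that were never hit are extra data.
    only_in_second = [b for j, b in enumerate(second) if not hit[j]]
    return only_in_first, only_in_second

result = compare_arrays_naively([2, 3, 4], [4, 2, 5])
# result == ([3], [5])
```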
[0025]
On the other hand, in the present invention, as shown in FIG. 6, the descendant node data in the array format is first converted into data in the bitmap format and then compared. Conversion to the bitmap format requires referring to each element of the array only once. The comparison between the bitmaps is performed by obtaining an exclusive OR, a bit operation that is fundamental to computer systems, so it can be executed at high speed in hardware. Accordingly, the processing amount can be reduced to about 1/(number of elements of the array) of that required when the array format data is compared as it is, so the comparison, and therefore the consistency check, can be performed at high speed.
[0026]
Next, the consistency check process between the node data and the node hierarchy data in the document management server will be described in more detail with reference to the flowcharts of FIGS. 7 to 10, which show the consistency check process.
When there is a request for consistency check processing, the CPU 11 accepts the request. Such a request can be accepted at any time because the CPU 11 constantly executes a part of the data management program stored in the ROM 12, the HDD 14, or the like.
Upon accepting this request, the CPU 11 also executes other portions of the data management program and starts the processing shown in the flowchart of FIG. 7. That is, the CPU 11 functions as each of the units described below by executing the appropriate program included in the data management program.
[0027]
First, in step S1, the node ID of the first node is obtained by referring to the node data, and that node is set as the inspection target. Any node may be taken first; for example, the node having the smallest node ID may be used.
In the next step S2, it is determined whether or not a node was obtained. Since a node normally exists at first, the process proceeds to the next step S3, where the first descendant node data creation process is performed for the node to be inspected. This process is shown in FIG. 8, and details will be described later. Here, the CPU 11 functions as first descendant node data creation means.
[0028]
Next, in step S4, a second descendant node data creation process for the node to be inspected is performed. This process is shown in FIG. 9, and details will be described later. Here, the CPU 11 functions as second descendant node data creation means. The first and second descendant node data created in step S3 and step S4 are array format data. In the next step S5, these data are converted into bitmap format data, respectively. This conversion process is shown in FIG. 10, and details will be described later. Here, the CPU 11 functions as first and second conversion means.
[0029]
In step S6, the converted first descendant node data is compared with the converted second descendant node data. This comparison can be performed by obtaining an exclusive OR of two data as already described. Here, the CPU 11 functions as a comparison unit.
Then, the process proceeds to step S7 to determine whether or not there is a difference. If there is a difference, the process proceeds to step S8 and the difference is displayed on the display means or stored in the storage means. That is, for example, when there is a mismatch, a message notifying the fact or location is displayed, or the fact is recorded in the log. Even if there is no inconsistency, a message to that effect may be displayed.
[0030]
Thereafter, the process proceeds to step S9. If there is no difference in step S7, the process directly proceeds to step S9.
In step S9, the node ID of the next node is obtained by referring to the node data, and that node is set as the inspection target. Here, the node to be acquired next may be determined based on an arbitrary standard.
The process then returns to step S2, where it is determined whether or not a next node was obtained. If so, the processing from step S3 is repeated; if not, it is determined that the consistency check has been completed for all the nodes, and the process is terminated.
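The overall loop of steps S1 to S9 can be summarized in the following sketch. It is an illustration under stated assumptions, not the actual program: the in-memory data layout (a dict for the node data, a list of pairs for the node hierarchy data) and all function names are hypothetical, and the comments map each part to the step numbers used in the text.

```python
def descendants_from_node_data(node_data, target):
    """Step S3: collect descendants by following parent links recursively."""
    result = []
    for node_id, parent in node_data.items():
        if parent == target:
            result.append(node_id)
            result.extend(descendants_from_node_data(node_data, node_id))
    return result

def descendants_from_hierarchy(hierarchy, target):
    """Step S4: read descendants directly from the stored pairs."""
    return [child for parent, child in hierarchy if parent == target]

def to_mask(node_ids):
    """Step S5: convert an array of node IDs to an integer bitmap."""
    mask = 0
    for node_id in node_ids:
        mask |= 1 << (node_id - 1)
    return mask

def check_consistency(node_data, hierarchy):
    """Return {inspected node ID: XOR bitmap} for every node with a mismatch."""
    mismatches = {}
    for node_id in sorted(node_data):            # S1, S9; loop end = S2
        first = to_mask(descendants_from_node_data(node_data, node_id))
        second = to_mask(descendants_from_hierarchy(hierarchy, node_id))
        diff = first ^ second                    # S6
        if diff:                                 # S7
            mismatches[node_id] = diff           # S8 (recorded instead of displayed)
    return mismatches

node_data = {1: None, 2: 1, 3: 1, 4: 2}
good = check_consistency(node_data, [(1, 2), (1, 3), (1, 4), (2, 4)])
bad = check_consistency(node_data, [(1, 2), (1, 3), (2, 4)])  # (1, 4) is missing
# good == {} and bad == {1: 8}; bit value 8 corresponds to node ID 4
```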
[0031]
Next, the first descendant node data creation process in step S3 of FIG. 7 will be described using a processing example in which node A is the node to be inspected (target node) among the nodes having the configuration shown in FIG. 3.
The first descendant node data creation process is the process shown in the flowchart of FIG. 8. First, in step S11, the node data is searched to extract the node IDs of the nodes having the target node as their parent node. When the node data is stored in a table of a database that supports SQL (Structured Query Language), this extraction can be performed using, for example, the schema definition and the search statement shown in Table 2. If node A is the target in the example shown in FIG. 3, “2” and “3” are extracted as the node IDs of nodes B and C.
[0032]
[Table 2]

[0033]
Next, in step S12, it is determined whether or not any node ID was extracted. If so, the process proceeds to step S13, and the first node ID is selected from the extraction result. Again, any node ID may be selected first; for example, the smallest node ID may be selected. Here, it is “2”.
Next, in step S14, it is determined whether or not a node ID was selected. If so, the selected node ID is set (added) to the array of the first descendant node data in step S15. In step S16, the first descendant node data creation process, that is, the process of the flowchart of FIG. 8, is recursively executed with the node having the selected node ID as the target node.
[0034]
This recursive execution starts again from step S11. Here, since the target node is node B (ID “2”), “4” is extracted as the node ID of node D. Then, the process proceeds through steps S12 and S13, where this “4” is selected, and in step S15 this value is set in the array of the first descendant node data. Note that there is only one array of first descendant node data, and values are set in the same array during both the initial execution and the recursive executions. Naturally, however, the array is reset when the process of step S3 in FIG. 7 is newly started.
In step S16, the first descendant node data creation process is recursively executed again with the target node as the node D (ID “4”).
[0035]
This recursive execution starts again from step S11. Here, since the target node is the node D (ID “4”), there is no node having this as a parent node, and the node ID is not extracted in step S11.
Accordingly, the determination in step S12 is NO, and the process returns to the original process. That is, the process returns to step S16 of the first descendant node data creation process in which the node B one stage before is the target node, and proceeds to step S17.
[0036]
In step S17, the next node ID is selected from the extraction result in step S11. Here, the node to be extracted next may be determined based on an arbitrary criterion. However, in the process in which node B is the target node, the node ID extracted in step S11 is only “4”. Therefore, there is no selection target.
Therefore, the process directly returns to step S14, but the determination here is NO, and the process returns to the original process again. That is, the process returns to step S16 of the first descendant node data creation process with the node A one stage before as the target node, and proceeds to step S17.
[0037]
In this step S17, the next node ID is selected from the extraction result in step S11. In the process where node A is the target node, the node IDs extracted in step S11 are “2” and “3”. “3” that has not yet been selected is selected.
Then, the process returns to step S14, and the determination here is YES, so the process proceeds to step S15, and the selected “3” is set in the array of the first descendant node data.
In step S16, the first descendant node data creation process is recursively executed this time with the target node as node C (ID "3"), but the description is omitted because there is no node ID extracted in this process.
[0038]
Then, the process proceeds to step S17. Since there is no node ID left to select, the process returns to step S14 as it is, the determination there is NO, and the process returns to the original process. That is, the first descendant node data creation process is terminated and the process returns to the process of FIG. 7.
In the case of the example used for the description, “2”, “3”, and “4” are set in the array of the first descendant node data at the end of the process, and this is the first descendant created by this process. Node data. Since this data is finally converted into a bitmap format and used, the order of arrangement of the node IDs at this time may be any order.
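The recursive walk-through above (steps S11 to S17) can be reproduced with a short sketch. The dictionary representation of the node data and the function name collect_descendants are assumptions made for illustration; selecting the smallest node ID first is the example ordering mentioned in the text, not a requirement.

```python
# Example hierarchy from the text: A=1, B=2, C=3, D=4 (node ID -> parent ID).
node_data = {1: None, 2: 1, 3: 1, 4: 2}

def collect_descendants(target, node_data, out):
    """Append all descendants of target to the shared array out."""
    # S11: extract the node IDs whose parent is the target node.
    children = sorted(n for n, parent in node_data.items() if parent == target)
    # S12 to S14 and S17: select the extracted IDs one by one.
    for child in children:
        out.append(child)                             # S15: set in the array
        collect_descendants(child, node_data, out)    # S16: recursive execution
    return out

first_descendants = collect_descendants(1, node_data, [])
# first_descendants == [2, 4, 3]: B, then its child D, then C
```

The array holds the same IDs "2", "3", and "4" as in the text, only in visiting order, which is harmless because the later bitmap conversion is independent of element order.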
[0039]
Next, the second descendant node data creation process in step S4 of FIG. 7 will be described using a processing example in which node A is the target node among the nodes having the configuration shown in FIG. 3.
The second descendant node data creation process is the process shown in the flowchart of FIG. 9. First, in step S21, the node hierarchy data is searched to extract a child node ID of a combination having the target node as a parent node. When the node hierarchy data is stored in a database table corresponding to SQL (Structured Query Language), this extraction can be performed using, for example, the schema definition and the search sentence shown in Table 3. Then, if node A is the target in the example shown in FIG. 3, “2”, “3”, and “4” are extracted as the IDs of the child nodes having the node ID “1” as the parent node.
[0040]
[Table 3]

[0041]
In step S22, the extracted node IDs are set (added) to the array of the second descendant node data. The second descendant node data creation process then ends, and the process returns to the original process, that is, the process of FIG. 7. Since the node hierarchy data includes data on descendant relationships as well as direct parent-child relationships, there is no need for a recursive search like the one performed when referring to the node data, and the descendant node data can be created with this simple processing.
In the case of the example used for the explanation, “2”, “3”, “4” are set in the array of the second descendant node data at the end of the process, and this is the second descendant created by this process. Node data. Since this data is also finally converted into a bitmap format and used, the order in which node IDs are arranged at this time may be any order.
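Steps S21 and S22 can be tried out with an in-memory SQLite database from Python's standard sqlite3 module. This is a sketch under assumptions: the table and column names NodeHierarchyData, ID, ParentID, and ChildID are taken from the commands quoted later in the text, but the column types, the data ID values, and the exact query are guesses, since the schema of Table 3 is not reproduced here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Assumed schema: integer data ID plus parent and child (descendant) node IDs.
conn.execute(
    "CREATE TABLE NodeHierarchyData "
    "(ID INTEGER PRIMARY KEY, ParentID INTEGER, ChildID INTEGER)"
)
# Hypothetical rows for the example hierarchy A=1, B=2, C=3, D=4.
conn.executemany(
    "INSERT INTO NodeHierarchyData (ID, ParentID, ChildID) VALUES (?, ?, ?)",
    [(1, 1, 2), (2, 1, 3), (3, 1, 4), (4, 2, 4)],
)

def second_descendants(conn, target_id):
    """S21: select the child IDs of the target; S22: return them as an array."""
    rows = conn.execute(
        "SELECT ChildID FROM NodeHierarchyData WHERE ParentID = ?",
        (target_id,),
    )
    return [row[0] for row in rows]

second = second_descendants(conn, 1)
# sorted(second) == [2, 3, 4]; the row order itself is not guaranteed
```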
[0042]
Next, the conversion process in step S5 of FIG. 7 will be described using a processing example in which the descendant node data shown in FIG. 6 is converted.
The conversion process is the process shown in the flowchart of FIG. 10. First, in step S31, the first element (node ID) is acquired from the array of descendant node data to be converted.
The process then proceeds to step S32, where it is determined whether or not an element was acquired. If so, the process proceeds to step S33, and “1” is set in the bit of the bitmap whose position equals the acquired node ID. For example, if the acquired node ID is “2”, the second bit is set to “1”, and if it is “3”, the third bit is set to “1”. As the initial value of the bitmap, data is prepared in which as many “0” bits are arranged as the largest node ID included in the arrays of the first and second descendant node data. For example, in the example shown in FIG. 6, since the largest node ID is “4”, data in which four “0” bits are arranged is prepared.
[0043]
Next, the process proceeds to step S34, where the next element of the array is acquired, and then returns to step S32. If an element was acquired in step S34, the determination in step S32 is YES, so the processing from step S33 is repeated. If there was no next element to acquire, the determination is NO, the processing is regarded as complete for all elements, the conversion process is terminated, and the process returns to the original process.
Here, in step S5 of FIG. 7, the conversion process of FIG. 10 is executed for each of the first descendant node data and the second descendant node data.
Further, in this conversion process, even if the elements of the array are acquired in any order, finally, data in which “1” is set to the bit corresponding to each element can be generated. Therefore, regardless of the order in which the elements are arranged in an array format, the generated bitmaps are the same as long as they are composed of the same elements.
[0044]
As described above, the document management server 10 converts the descendant node data in the array format into data in the bitmap format and then performs the comparison. Compared with comparing the array format data as it is, the processing amount can be reduced to about 1/(number of elements of the array), so the comparison can be executed at high speed, and the consistency between the node data and the node hierarchy data, that is, between the actual hierarchical structure and the stored hierarchy information, can also be checked at high speed.
In the process of the flowchart of FIG. 7, the processes in steps S3 to S5 are not necessarily performed in this order. For example, the second descendant node data may be created first, or the created descendant node data may be immediately used for the conversion process.
[0045]
[Second Embodiment: FIGS. 11 to 15]
Next, a document management server according to a second embodiment of the data management apparatus of the present invention will be described. FIG. 11 is a functional block diagram showing the functional configuration of the characteristic part of the present invention in the document management server, FIGS. 12 and 13 show examples of erroneous node hierarchy data in the document management server, and FIGS. 14 and 15 are flowcharts showing the processing for checking consistency between node data and node hierarchy data in the document management server.
The document management server of this embodiment differs from the document management server 10 of the first embodiment only in that a node hierarchy data correction unit 108 is provided in the functional configuration and in that the processing by the CPU 11 is changed accordingly. The rest is the same as in the first embodiment, so the description of everything other than the differences will be simplified or omitted. The same reference numerals are used for the components corresponding to those of the document management server 10 of the first embodiment.
[0046]
The node hierarchy data correction unit 108 in the document management server of this embodiment has a function of correcting node hierarchy data based on the comparison result in the bitmap comparison unit 106 and the first descendant node data in the bitmap format.
Specifically, for example, in the node hierarchy configuration shown in FIG. 3, if the node hierarchy data is as shown in FIG. 12, the data with the data ID “4” is incorrect data that does not correspond to the actual hierarchy configuration. If the consistency check is performed in this state with node A, whose node ID is “1”, as the target, then, as shown in Table 4, the first descendant node data created by referring to the node data is “01110”, whereas the second descendant node data created by referring to the node hierarchy data is “01111”. A bit of 1 therefore appears in the exclusive OR of the comparison result, and an inconsistency is detected.
[0047]
[Table 4]

[0048]
Here, since the bit that is 1 is the fifth bit, it can be seen that there is an inconsistency in the descendant relationship for the node with the node ID “5”. Moreover, since the fifth bit of the first descendant node data is “0”, the fifth bit of the second descendant node data is necessarily “1”. That is, the node with the node ID “5” is not actually a descendant node of node A, yet the node hierarchy data contains extra data saying that it is. Therefore, the node hierarchy data correction unit 108 can eliminate the inconsistency by deleting the node hierarchy data having the parent node ID “1” and the child node ID “5”. For example, when the node hierarchy data is stored in a table of a database that supports SQL using the schema definition shown in Table 3, this correction can be executed by a command such as
delete from NodeHierarchyData where ParentID = 1 and ChildID = 5
[0049]
As another example, in the node hierarchy configuration shown in FIG. 3, if the node hierarchy data is as shown in FIG. 13, the data with the parent node ID “1” and the child node ID “4” is missing. If the consistency check is performed in this state with node A, whose node ID is “1”, as the target, then, as shown in Table 5, the first descendant node data created by referring to the node data is “0111”, whereas the second descendant node data created by referring to the node hierarchy data is “0110”. A bit of 1 therefore appears in the exclusive OR of the comparison result, and an inconsistency is detected.
[0050]
[Table 5]

[0051]
Here, since the bit that is 1 is the fourth bit, it can be seen that there is an inconsistency in the descendant relationship for the node with the node ID “4”. Moreover, since the fourth bit of the first descendant node data is “1”, the fourth bit of the second descendant node data is necessarily “0”. That is, the node with the node ID “4” is actually a descendant node of node A, but the node hierarchy data contains no corresponding data. Therefore, the node hierarchy data correction unit 108 can eliminate the inconsistency by adding the node hierarchy data having the parent node ID “1” and the child node ID “4”. For example, when the node hierarchy data is stored in a table of a database that supports SQL using the schema definition shown in Table 3, this correction can be executed by a command such as
insert into NodeHierarchyData (ID, ParentID, ChildID) values (4, 1, 4)
[0052]
In this way, the node hierarchy data correction unit 108 acquires the ID of the descendant node related to the mismatch according to the bit position where “1” is set in the exclusive OR of the comparison result, and the first descendant node data By referring to the corresponding bit, it can be determined whether the node hierarchy data needs to be added or deleted, and appropriate correction can be made.
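The decision rule summarized above can also be written with two bitwise operations on integer bitmaps: mismatch bits that are set in the first descendant node data indicate rows to add, and mismatch bits that are clear there indicate rows to delete. The sketch below is only an illustration; the function names and the integer encoding (node ID n as bit n-1) are assumptions.

```python
def ids_to_mask(node_ids):
    """Pack node IDs into an integer bitmap (node ID n -> bit n-1)."""
    mask = 0
    for node_id in node_ids:
        mask |= 1 << (node_id - 1)
    return mask

def mask_to_ids(mask):
    """List the node IDs whose bits are set in the bitmap."""
    return [i + 1 for i in range(mask.bit_length()) if (mask >> i) & 1]

def classify_mismatches(first_mask, second_mask):
    """Return (descendant IDs to add, descendant IDs to delete)."""
    diff = first_mask ^ second_mask
    to_add = diff & first_mask         # mismatch where the first bitmap has 1
    to_delete = diff & ~first_mask     # mismatch where the first bitmap has 0
    return mask_to_ids(to_add), mask_to_ids(to_delete)

# The two cases discussed in the text, with node A (ID 1) as the target.
extra_case = classify_mismatches(ids_to_mask([2, 3, 4]), ids_to_mask([2, 3, 4, 5]))
missing_case = classify_mismatches(ids_to_mask([2, 3, 4]), ids_to_mask([2, 3]))
# extra_case == ([], [5]) and missing_case == ([4], [])
```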
The reason why the node data is not modified is that this data is basic data for defining the hierarchical structure of the node, and if this data is changed, the hierarchical structure itself is changed. On the other hand, the node hierarchy data is created by referring to the node data and is data for efficiently grasping the hierarchy structure. Therefore, even if this is changed, the hierarchy structure is not changed.
[0053]
Next, processing for checking consistency between node data and node hierarchy data in the document management server will be described in more detail with reference to a flowchart.
When there is a request for consistency check processing, the CPU 11 accepts the request. Such a request can be accepted at any time because the CPU 11 constantly executes a part of the data management program stored in the ROM 12, the HDD 14, or the like.
Upon accepting this request, the CPU 11 also executes other portions of the data management program and starts the processing shown in the flowchart of FIG. 14. That is, the CPU 11 functions as each of the units described below by executing the appropriate program included in the data management program.
[0054]
The process shown in the flowchart of FIG. 14 is the same as that of the first embodiment except that the node hierarchy data correction process of step S41 is performed after the process of step S8 when there is a difference in step S7. Therefore, only the node hierarchy data correction process will be described. In this process, the CPU 11 functions as a correction unit.
[0055]
The node hierarchy data correction process is the process shown in the flowchart of FIG. 15.
Here, first, in step S51, the first bit is acquired from the exclusive OR bit map of the comparison result.
In step S52, it is determined whether or not the last bit has been exceeded. If not, the process proceeds to step S53 to determine whether or not the acquired bit is 1. If it is 1, it is determined that there is a mismatch with respect to the node corresponding to the bit, and the process proceeds to step S54 to refer to the corresponding bit in the bitmap of the first descendant node data.
[0056]
Next, in step S55, it is determined whether or not that bit is 0. If it is 0, it is determined that extra data is included in the node hierarchy data, and the process proceeds to step S56. There, the data in which the node under inspection at that time is the parent node and the node corresponding to the bit referenced in step S54 (for example, the node with the node ID “4” if the referenced bit is the fourth bit) is the descendant node is deleted from the node hierarchy data. Then, the process proceeds to step S58, the next bit is acquired from the exclusive OR bitmap of the comparison result, the process returns to step S52, and the processing is repeated.
[0057]
If the bit is not 0 in step S55, it is determined that necessary data is missing from the node hierarchy data, and the process proceeds to step S57. There, data in which the node under inspection at that time is the parent node and the node corresponding to the bit referenced in step S54 is the descendant node is added to the node hierarchy data. Then, the process proceeds to step S58, the next bit is acquired from the exclusive OR bitmap of the comparison result, the process returns to step S52, and the processing is repeated.
If the bit is not 1 in step S53, the process proceeds directly to step S58, the next bit is acquired from the exclusive OR bitmap of the comparison result, the process returns to step S52, and the processing is repeated.
If the last bit has been passed in step S52, it is determined that all necessary corrections for the node currently under inspection have been completed, and the processing returns to the original processing of FIG. 14.
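The correction loop of steps S51 to S58 can be sketched against an in-memory SQLite table as follows. The table and column names come from the commands quoted in the text; everything else is an assumption for illustration, including the column types, the function name correct_hierarchy, the sample rows, and the choice to let SQLite assign the data ID of an added row rather than supplying it explicitly.

```python
import sqlite3

def correct_hierarchy(conn, target_id, diff_bits, first_bits):
    """Correct the rows of one inspection target (lists start at node ID 1)."""
    for node_id, bit in enumerate(diff_bits, start=1):  # S51, S58; loop end = S52
        if bit != 1:                                     # S53: no mismatch here
            continue
        if first_bits[node_id - 1] == 0:                 # S54, S55
            # S56: the hierarchy data has an extra row, so delete it.
            conn.execute(
                "DELETE FROM NodeHierarchyData WHERE ParentID = ? AND ChildID = ?",
                (target_id, node_id),
            )
        else:
            # S57: the hierarchy data lacks a row, so add it.
            conn.execute(
                "INSERT INTO NodeHierarchyData (ParentID, ChildID) VALUES (?, ?)",
                (target_id, node_id),
            )

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE NodeHierarchyData "
    "(ID INTEGER PRIMARY KEY, ParentID INTEGER, ChildID INTEGER)"
)
# Hypothetical damaged state: row (1, 4) is missing and row (1, 5) is extra.
conn.executemany(
    "INSERT INTO NodeHierarchyData (ID, ParentID, ChildID) VALUES (?, ?, ?)",
    [(1, 1, 2), (2, 1, 3), (3, 2, 4), (4, 1, 5)],
)
first_bits = [0, 1, 1, 1, 0]    # descendants of node 1 per the node data
second_bits = [0, 1, 1, 0, 1]   # descendants of node 1 per the hierarchy data
diff_bits = [a ^ b for a, b in zip(first_bits, second_bits)]
correct_hierarchy(conn, 1, diff_bits, first_bits)

children = sorted(
    row[0]
    for row in conn.execute("SELECT ChildID FROM NodeHierarchyData WHERE ParentID = 1")
)
# children == [2, 3, 4] after the correction
```

Only the node hierarchy data is modified; the node data, which defines the hierarchy itself, is left untouched, in line with the reasoning given in the text.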
[0058]
Since the document management server of this embodiment performs the processing described above, referring to the bitmap of the comparison result to identify and correct the inconsistent portions of the node hierarchy data, it can not only check the consistency between the actual hierarchical structure and the stored hierarchy information but also correct inconsistent portions at high speed.
[0059]
In each of the embodiments described above, the document management server 10 that manages the document data stored in the data storage unit 16 has been described as an example. However, data stored in the HDD 14 or the like may be managed together with it, and needless to say, data stored in a plurality of storage means, including those of external devices, may be managed collectively. In addition, the data management apparatus of the present invention is of course not limited to a document management server, and can be applied to any apparatus that manages data composed of nodes having a hierarchical structure.
[0060]
A program for causing the CPU 11 of the document management server 10 to function as each of the above-described means may be stored in advance in the HDD 14 or the like of the document management server 10. Alternatively, it can be provided as program source or as binary data in an executable format recorded on a non-volatile recording medium (memory) such as a CD-ROM, magneto-optical (MO) disk, mini disk (MD), DVD (Digital Versatile Disk), flexible disk, magnetic tape, or HDD. The program recorded in such a memory can be installed in the document management server 10 and executed by the CPU 11, or the CPU 11 can read the program out of the memory and execute it, thereby functioning as each of the means described above.
Furthermore, it is also possible to download the program from an external device provided with a recording medium that records the program, or from an external device that stores the program in a storage unit.
[0061]
【The invention's effect】
As described above, according to the data management apparatus of the present invention, the consistency check between the hierarchical structure of the nodes and the stored hierarchy information can be performed with simple processing, so the check can be carried out at high speed.
Moreover, according to the program of the present invention, a computer can be made to function as the data management apparatus described above.
Further, according to the recording medium of the present invention, the program can be loaded into a computer that does not already store it, and executing the program causes that computer to function as the data management apparatus described above.
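The consistency check summarized in this section can be sketched as follows. This is an illustrative sketch only: it assumes the node information is a mapping from node ID to parent ID that forms a tree without cycles, and that the node hierarchy information is a list of (ancestor ID, descendant ID) pairs. All function and variable names are hypothetical.

```python
def descendants_from_parents(parent_of, root):
    """First descendant node data: follow the parent links downward from root."""
    children = {}
    for node, parent in parent_of.items():
        children.setdefault(parent, []).append(node)
    found, stack = [], list(children.get(root, []))
    while stack:                      # assumes the parent links contain no cycle
        node = stack.pop()
        found.append(node)
        stack.extend(children.get(node, []))
    return found


def descendants_from_hierarchy(rows, root):
    """Second descendant node data: read the stored (ancestor, descendant) pairs."""
    return [d for a, d in rows if a == root]


def bitmap_of(ids):
    """Set the bit whose position equals each integer node ID."""
    bitmap = 0
    for node_id in ids:
        bitmap |= 1 << node_id
    return bitmap


def compare_descendants(parent_of, rows, root):
    """Return the XOR of the two bitmaps; 0 means the two sources agree."""
    first = bitmap_of(descendants_from_parents(parent_of, root))
    second = bitmap_of(descendants_from_hierarchy(rows, root))
    return first ^ second


# Tree: 1 is the root, 2 and 4 are its children, 3 is a child of 2.
parent_of = {1: None, 2: 1, 3: 2, 4: 1}
good_rows = [(1, 2), (1, 3), (1, 4), (2, 3)]
stale_rows = [(1, 2), (1, 3), (2, 3)]   # the pair (1, 4) has been lost

ok = compare_descendants(parent_of, good_rows, 1)
bad = compare_descendants(parent_of, stale_rows, 1)
```

Because the comparison is a single XOR over integers rather than an element-by-element scan of two arrays, the order in which descendants are collected does not matter, and any non-zero bit in the result marks the ID of a node on which the two sources disagree.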
[Brief description of the drawings]
FIG. 1 is a functional block diagram showing a functional configuration of a characteristic part of the present invention in a document management server which is a first embodiment of a data management apparatus of the present invention.
FIG. 2 is a block diagram showing a schematic configuration of the document management server.
FIG. 3 is a diagram for explaining a data management method in the document management server.
FIG. 4 is a diagram for explaining generation of descendant node data in the document management server.
FIG. 5 is a diagram for explaining comparison of descendant node data.
FIG. 6 is a diagram for explaining conversion of descendant node data into a bitmap format.
FIG. 7 is a flowchart showing a consistency check process in the document management server according to the first embodiment of the data management apparatus of the present invention.
FIG. 8 is a flowchart showing the contents of the first descendant node data creation process shown in the flowchart of FIG. 7.
FIG. 9 is a flowchart showing the contents of the second descendant node data creation process shown in the flowchart of FIG. 7.
FIG. 10 is a flowchart showing the contents of the bitmap processing, which is the conversion process shown in the flowchart of FIG. 7.
FIG. 11 is a functional block diagram showing a functional configuration of a characteristic part of the present invention in a document management server which is a second embodiment of the data management apparatus of the present invention.
FIG. 12 is a diagram showing an example of erroneous node hierarchy data in the document management server.
FIG. 13 is a diagram showing another example of erroneous node hierarchy data.
FIG. 14 is a flowchart showing a consistency check process in a document management server according to a second embodiment of the data management apparatus of the present invention.
FIG. 15 is a flowchart showing the contents of the node hierarchy data correction process shown in the flowchart of FIG. 14.
[Explanation of symbols]
10: Document management server 11: CPU
12: ROM 13: RAM
14: HDD 15: Network I/F
16: Data storage means 17: System bus
101: Consistency check request reception unit 102: Node extraction unit
103: Hierarchy information re-creation unit 104: Hierarchy information extraction unit
105: Bitmap processing unit
106: Bitmap comparison unit
107: Result display/storage unit
108: Node hierarchy data correction unit
111: Node data table
112: Node hierarchy data table

Claims

A data management device that manages data in a format composed of nodes having a hierarchical structure, the device comprising:
means for storing, as node information, identification information consisting of a unique integer value for each node and the identification information of the parent node immediately above that node;
means for storing, as node hierarchy information, for each of the nodes, the identification information of each descendant node, which is a node lower than that node and reachable without passing through an upper node of that node, in one-to-one association with the identification information of that node;
first descendant node data creating means for creating, based on the node information, first descendant node data that collects all the identification information of the descendant nodes of an arbitrary node;
second descendant node data creating means for creating, based on the node hierarchy information, second descendant node data that collects all the identification information of the descendant nodes of the arbitrary node;
first conversion means for converting the first descendant node data created by the first descendant node data creating means into bitmap-format data in which, in accordance with the identification information of the descendant nodes included in the first descendant node data, the bit at the position corresponding to the value of that identification information is set;
second conversion means for converting the second descendant node data created by the second descendant node data creating means into bitmap-format data in which, in accordance with the identification information of the descendant nodes included in the second descendant node data, the bit at the position corresponding to the value of that identification information is set; and
comparing means for comparing the first descendant node data with the second descendant node data by obtaining an exclusive OR of the bitmap-format first descendant node data converted by the first conversion means and the bitmap-format second descendant node data converted by the second conversion means.

The data management device according to claim 1,
further comprising correction means for correcting the node hierarchy information based on the comparison result when the comparison by the comparing means finds an inconsistency.

A program for causing a computer to function as:
data management means for managing data in a format composed of nodes having a hierarchical structure;
means for storing, as node information, identification information consisting of a unique integer value for each node and the identification information of the parent node immediately above that node;
means for storing, as node hierarchy information, for each of the nodes, the identification information of each descendant node, which is a node lower than that node and reachable without passing through an upper node of that node, in one-to-one association with the identification information of that node;
first descendant node data creating means for creating, based on the node information, first descendant node data that collects all the identification information of the descendant nodes of an arbitrary node;
second descendant node data creating means for creating, based on the node hierarchy information, second descendant node data that collects all the identification information of the descendant nodes of the arbitrary node;
first conversion means for converting the first descendant node data created by the first descendant node data creating means into bitmap-format data in which, in accordance with the identification information of the descendant nodes included in the first descendant node data, the bit at the position corresponding to the value of that identification information is set;
second conversion means for converting the second descendant node data created by the second descendant node data creating means into bitmap-format data in which, in accordance with the identification information of the descendant nodes included in the second descendant node data, the bit at the position corresponding to the value of that identification information is set; and
comparing means for comparing the first descendant node data with the second descendant node data by obtaining an exclusive OR of the bitmap-format first descendant node data converted by the first conversion means and the bitmap-format second descendant node data converted by the second conversion means.

The program according to claim 3,
further causing the computer to function as correction means for correcting the node hierarchy information based on the comparison result when the comparison by the comparing means finds an inconsistency.

A computer-readable recording medium on which the program according to claim 3 or 4 is recorded.