JP2008203986A

JP2008203986A - Data management apparatus and program

Info

Publication number: JP2008203986A
Application number: JP2007036708A
Authority: JP
Inventors: Hideto Miyahara; 秀人宮原
Original assignee: Bank of Tokyo Mitsubishi UFJ Trust Co
Current assignee: MUFG Bank Ltd
Priority date: 2007-02-16
Filing date: 2007-02-16
Publication date: 2008-09-04
Anticipated expiration: 2027-02-16
Also published as: JP4503031B2

Abstract

<P>PROBLEM TO BE SOLVED: To use a combination of simple processes to achieve determination of whether or not pieces of data with same information on a first data item set therefor and different information on a second data item set therefor are present, among multiple pieces of data each with information set therefor regarding the plurality of data items. <P>SOLUTION: A data management apparatus, for extracting, from a customer information DB, customer information including a same telephone number TEL NO, and a different customer name NAME, sorts respective records in the DB by telephone number, thereby grouping a plurality of groups comprising records for the same telephone number (cf. (B)), as for a flag FLG, sets "1" for records having the starting record and a customer name identical to each other in the starting record and subsequent records in each group, and sets "0" for records having a starting record and the customer name different from each other in the second and subsequent records (cf. (D)), sorts records in ascending order of flag in each group (cf. (E)), and determines corresponding customer information (cf. (F)) based on the value of the flag of the starting record in each group. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明はデータ管理装置及びプログラムに係り、特に、複数のデータ項目について各々情報が設定されて成る複数のデータの中に、一定の条件に合致するデータが存在しているか否かを判断するデータ管理装置、及び、コンピュータを前記データ管理装置として機能させるためのデータ管理プログラムに関する。 The present invention relates to a data management apparatus and program, and in particular, data for determining whether or not there is data that meets a certain condition among a plurality of data in which information is set for a plurality of data items. The present invention relates to a management apparatus and a data management program for causing a computer to function as the data management apparatus.

現在、データベースは広範に普及し、多種多様な業務で用いられている。データベースの管理に関する技術として、例えば特許文献１には、少なくとも氏名を含む各種の項目データから成る複数の名刺データを氏名データの五十音順に並び替えてＲＡＭの編集領域に格納し直し、編集領域の先頭から順に隣接する２つの名刺データの氏名を比較し、一致するものがあった場合は該当データをＲＡＭのチェック結果領域に格納する処理を繰り返すことで、氏名が重複する名刺データの有無を調べ、該当する名刺データが有った場合は表示部に一覧表示して内容確認表示や削除を行う技術が開示されている。 Currently, databases are widely used and used in a wide variety of businesses. As a technique related to database management, for example, in Patent Document 1, a plurality of business card data including various item data including at least a name is rearranged in the alphabetical order of the name data and stored in the editing area of the RAM. The names of two adjacent business card data are compared in order from the beginning of the card, and if there is a match, the process of storing the corresponding data in the check result area of the RAM is repeated, so that the presence or absence of business card data with a duplicate name is determined. A technique is disclosed in which, when there is a corresponding business card data, a list is displayed on the display unit and a content confirmation display or deletion is performed.

また、特許文献２には、名刺レコードからＩＤと一部の項目データを抽出して簡易名刺レコードを作成し、作成した簡易名刺レコードを一定の条件で集めて簡易名刺データを複数生成し、その特徴毎に名前が付与された名刺箱に入れておき、簡易名刺データを利用することで、視覚的で判り易く操作性に優れた名刺データの管理（重複チェックを含む）を実現する技術が開示されている。 Also, in Patent Document 2, an ID and some item data are extracted from a business card record to create a simple business card record, the created simple business card records are collected under certain conditions, and a plurality of simple business card data are generated. Disclosed is a technology for managing business card data (including duplication check) that is visually easy to understand and easy to use by putting it in a business card box with a name for each feature and using simple business card data. Has been.

一方、特許文献３には、表計算処理において、表を構成する各セルに、そのセルを処理対象とするか否かを指示する属性情報を設定すると共に、各セルの属性情報に有効又は無効を示すフラグを設定しておき、或るセルの出力内容を求める際に、当該セルに参照先として設定されている他のセルの属性情報を参照し、属性情報のフラグの内容により前記他のセルを処理対象から除外するか否かを判定する技術が開示されている。
特開平７−２９６００２号公報特開２０００−３５７１７５号公報特開平８−３０６９６号公報 On the other hand, in Patent Document 3, in spreadsheet processing, attribute information that indicates whether or not to process the cell is set for each cell constituting the table, and the attribute information of each cell is valid or invalid. When the output content of a certain cell is obtained, the attribute information of another cell set as a reference destination for the cell is referred to, and the other information is determined by the content of the flag of the attribute information. A technique for determining whether to exclude a cell from a processing target is disclosed.
JP 7-296002 A JP 2000-357175 A JP-A-8-30696

ところで、データベース（の編成やデータベースにアクセスするデータベース管理プログラムの処理内容等）は、一般に、当該データベースを利用する利用者の利用目的等に応じて設計・構築されるが、既に稼働しているデータベースに対し、当初は想定していなかった或る条件に合致するデータの有無を確認する必要が突発的に生ずることがある。 By the way, the database (the organization of the database and the processing contents of the database management program for accessing the database, etc.) is generally designed and constructed in accordance with the purpose of use of the user who uses the database. On the other hand, it may occur suddenly that it is necessary to confirm the presence or absence of data that meets a certain condition that was not initially assumed.

具体的には、例えば金融機関は個々の顧客に関する情報（例えば氏名又は名称や住所、電話番号等）を、個々の顧客を識別するための顧客ＩＤと共にデータベース（顧客データベース）に登録・管理しており、通常の業務で顧客データベースにアクセスする際には、顧客ＩＤ等をキーとしてアクセス対象のデータを検索する処理が行われる。しかし、セグメント別の顧客データ処理・分析や、マーケティング、顧客宛のアンケートの実施、各種サービスの提案等の業務を行うにあたり、例えば「電話番号が同一で氏名又は名称が相違している」等の新たな条件に合致するデータの有無を確認する必要が生ずることがある。しかし、顧客データベースと共に設計・開発された管理プログラムには、上記の新たな条件に合致するデータを検索する機能は設けられていない。 Specifically, for example, a financial institution registers and manages information (for example, name or name, address, telephone number, etc.) about each customer in a database (customer database) together with a customer ID for identifying each customer. Therefore, when accessing the customer database in a normal operation, a process for searching access target data using a customer ID or the like as a key is performed. However, when performing operations such as customer data processing and analysis by segment, marketing, conducting questionnaires addressed to customers, and proposing various services, for example, “phone numbers are the same but names or names are different” It may be necessary to check for the presence of data that meets the new conditions. However, the management program designed and developed together with the customer database does not have a function for searching for data that meets the above new conditions.

このように、データベースに対して「第１のデータ項目の情報（の一部又は全体）が同一かつ第２のデータ項目の情報（の一部又は全体）が相違している」という条件に合致するデータの有無を確認する必要が生じたものの、既存のデータベース管理プログラムに、上記の新たな条件に合致するデータを検索する機能が設けられていない場合には、上記の新たな条件に合致するデータを検索するプログラムを開発した後に、開発したプログラムを用いて該当するデータの検索を行うことになる。 In this way, the database satisfies the condition that the information (part or whole) of the first data item is the same and the information (part or whole) of the second data item is different. If the existing database management program does not have a function to search for data that matches the above new conditions, the new conditions are met. After developing a program for retrieving data, the corresponding data is retrieved using the developed program.

しかし、上記の検索を実現するプログラムは、一般に、データベースから順にデータを読み出してワークメモリに貯えながら、読み出したデータをワークメモリに既に貯えられているデータと各々比較することを繰り返すことで、該当するデータを探す等のアルゴリズムが採用されるが、このようなアルゴリズムで動作するプログラムは規模が大きく、デバッグを含めて開発に時間が掛るので、特に結果を迅速に得たい場合に適していない。 However, the program for realizing the above search is generally applicable by repeatedly reading the data from the database and storing it in the work memory, repeatedly comparing the read data with the data already stored in the work memory. Although an algorithm such as searching for data to be used is employed, a program that operates with such an algorithm is large in scale and takes a long time to develop including debugging, and therefore is not suitable particularly when it is desired to obtain a result quickly.

なお、前述した特許文献１，２に記載の技術は、データベースに登録されているデータの重複の有無を確認する技術であり、また特許文献３に記載の技術は、表計算処理における処理対象のセルを切り替える技術であり、何れも、「第１のデータ項目の情報（の一部又は全体）が同一かつ第２のデータ項目の情報（の一部又は全体）が相違している」という条件に合致するデータの有無を確認したい場合には適用できない。 The techniques described in Patent Documents 1 and 2 described above are techniques for confirming whether data registered in a database is duplicated, and the technique described in Patent Document 3 is a processing target in spreadsheet processing. This is a technology for switching cells, and in any case, the condition is that the information (part or whole) of the first data item is the same and the information (part or whole) of the second data item is different. It is not applicable when you want to check the existence of data that matches

本発明は上記事実を考慮して成されたもので、複数のデータ項目について各々情報が設定されて成る複数のデータの中に、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータが存在しているか否かを判断することを、簡易な処理の組み合わせで実現できるデータ管理装置及びデータ管理プログラムを得ることが目的である。 The present invention has been made in consideration of the above-described facts, and among a plurality of data in which information is set for each of a plurality of data items, part or all of the information of the first data item is the same and An object of the present invention is to provide a data management device and a data management program that can realize whether or not there is data in which part or all of the information of the two data items is different by a combination of simple processes. It is.

上記目的を達成するために請求項１記載の発明に係るデータ管理装置は、複数のデータ項目について各々情報が設定されて成る複数のデータを、前記複数のデータ項目のうちの第１の項目の情報に基づいてソートすることで、前記複数のデータを、前記第１のデータ項目の情報の一部又は全体が同一のデータで各々構成された複数のグループに分ける第１ソート手段と、前記第１ソート手段によるソートを経た前記複数のデータの各々に対し、前記複数のデータ項目のうちの第２のデータ項目の情報の一部又は全体が、同一のグループに属する特定データと相違しているか否かに応じて値の異なるフラグを各々設定するフラグ設定手段と、前記フラグ設定手段によって個々のデータに設定されたフラグの値に基づいて、個々のグループを構成するデータを個々のグループ内でソートする第２ソート手段と、前記第２ソート手段によるソートを経た前記個々のグループの先頭又は末尾のデータに設定されているフラグの値に基づいて、前記第１のデータ項目の情報の一部又は全体が同一かつ前記第２のデータ項目の情報の一部又は全体が相違しているデータの有無を判断する判断手段と、を含んで構成されている。 In order to achieve the above object, a data management device according to the first aspect of the present invention provides a plurality of pieces of data in which information is set for each of a plurality of data items, the first item among the plurality of data items. A first sorting means for sorting the plurality of data into a plurality of groups each including a part or all of the information of the first data item, each of which is composed of the same data, by sorting based on the information; Whether part or all of the information of the second data item of the plurality of data items differs from the specific data belonging to the same group for each of the plurality of data that has been sorted by one sort means Flag setting means for setting different flags depending on whether or not, and each group is configured based on the flag values set for individual data by the flag setting means Second sorting means for sorting the data within each group, and the first sorting means based on the value of the flag set in the first or last data of the individual group that has been sorted by the second sorting means. And determining means for determining whether or not there is data in which part or all of the information of the data item is the same and part or all of the information of the second data item is different.

請求項１記載の発明に係るデータ管理装置は、複数のデータ項目について各々情報が設定されて成る複数のデータを処理対象としており、第１ソート手段は、複数のデータ項目のうちの第１の項目の情報に基づいて複数のデータをソートすることで、複数のデータを、第１のデータ項目の情報の一部又は全体が同一のデータで各々構成された複数のグループに分ける。また、フラグ設定手段は、第１ソート手段によるソートを経た複数のデータの各々に対し、複数のデータ項目のうちの第２のデータ項目の情報の一部又は全体が、同一のグループに属する特定データと相違しているか否かに応じて値の異なるフラグを各々設定し、第２ソート手段は、フラグ設定手段によって個々のデータに設定されたフラグの値に基づいて、個々のグループを構成するデータを個々のグループ内でソートする。 The data management device according to the first aspect of the present invention is intended for processing a plurality of data each having information set for a plurality of data items, and the first sorting means is a first of the plurality of data items. By sorting the plurality of data based on the item information, the plurality of data is divided into a plurality of groups each composed of the same or part of the information of the first data item. In addition, the flag setting means specifies that part or all of the information of the second data item of the plurality of data items belongs to the same group for each of the plurality of data that has been sorted by the first sorting means. Flags having different values are set according to whether or not they differ from the data, and the second sorting means configures individual groups based on the flag values set for the individual data by the flag setting means. Sort data within individual groups.

これにより、個々のグループ内に、複数のデータ項目のうちの第２のデータ項目の情報の一部又は全体が、同一のグループに属する特定データと相違しているデータが存在していれば、当該データは、第２ソート手段によるソートによって個々のグループ内の先頭又は末尾へ移動する。このため、判断手段は、第２ソート手段によるソートを経た個々のグループの先頭又は末尾のデータに設定されているフラグの値に基づいて、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータの有無を判断する。 Thereby, in each group, if there is data in which part or all of the information of the second data item of the plurality of data items is different from the specific data belonging to the same group, The data moves to the head or tail in each group by sorting by the second sorting means. For this reason, the judging means uses the same or part of the information of the first data item based on the flag value set in the head or tail data of each group that has been sorted by the second sorting means. In addition, it is determined whether there is data in which part or all of the information of the second data item is different.

上記の第１ソート手段及び第２ソート手段によるソート、フラグ設定手段によるフラグ設定、判断手段による判断は、何れもごく簡単な処理であり、請求項１記載の発明によれば、複数のデータ項目について各々情報が設定されて成る複数のデータの中に、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータが存在しているか否かを判断することを、簡易な処理の組み合わせで実現することができる。そして、前述のように第１ソート手段及び第２ソート手段によるソート、フラグ設定手段によるフラグ設定、判断手段による判断が何れもごく簡単な処理であることから、第１ソート手段、フラグ設定手段、第２ソート手段及び判断手段を、各々コンピュータによって実行されるプログラムとして開発する場合にも、非常に短い期間で開発を完了させることができ、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータの有無を確認する必要が突発的に生じた等の場合にも、該当するデータの有無を迅速に確認することが可能となる。 The sorting by the first sorting means and the second sorting means, the flag setting by the flag setting means, and the judgment by the judging means are all simple processes. According to the invention of claim 1, a plurality of data items There is data in which a part or the whole of the information of the first data item is the same and a part or the whole of the information of the second data item is different from among the plurality of data in which information is set for each It can be realized by a combination of simple processes. Since the sorting by the first sorting means and the second sorting means, the flag setting by the flag setting means, and the judgment by the judging means are all simple processes as described above, the first sorting means, the flag setting means, Even when the second sorting means and the judging means are developed as programs that are executed by a computer, the development can be completed in a very short period of time, and part or all of the information of the first data item is the same. In addition, even when there is a sudden need to check for the presence or absence of data that is partly or entirely different from the information in the second data item, it is possible to quickly check the presence or absence of the corresponding data. It becomes.

なお、請求項１記載の発明において、判断手段が有無を判断するデータは、第１のデータ項目の情報の一部が同一の情報であってもよいし、第１のデータ項目の情報全体が同一の情報であってもよいが、判断手段が、第１のデータ項目の情報全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータの有無を判断する場合には、第１ソート手段を、例えば請求項２に記載したように、第１の項目の情報全体に基づいて複数のデータをソートすることで、複数のデータを、第１のデータ項目の情報全体が同一のデータで各々構成された複数のグループに分けるように構成すればよい。これにより、個々のグループが、第１のデータ項目の情報全体が同一のデータによって各々構成されることになり、フラグ設定手段によって設定されたフラグの値に基づいて、第２ソート手段が個々のグループを構成するデータを個々のグループ内でソートすることで、個々のグループの先頭又は末尾のデータに設定されているフラグの値が、第１のデータ項目の情報全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータの有無を表す値となるので、第１のデータ項目の情報全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータの有無を判断手段が判断することが可能となる。 In the first aspect of the invention, the data for which the determination means determines whether or not the information in the first data item may be the same information, or the entire information in the first data item may be the same. The same information may be used, but the determination means determines whether or not there is data in which the entire information of the first data item is the same and part or all of the information of the second data item is different. The first sorting means sorts a plurality of data based on the entire information of the first item, for example, as described in claim 2, thereby converting the plurality of data into the entire information of the first data item. May be divided into a plurality of groups each composed of the same data. As a result, the entire information of the first data item is composed of the same data, and the second sorting means is individually set based on the flag value set by the flag setting means. By sorting the data constituting the group within each group, the value of the flag set in the first or last data of each group is the same as the second data and the entire information of the first data item is the same. Since it is a value indicating the presence or absence of data in which part or all of the item information is different, the whole information of the first data item is the same and part or all of the information of the second data item is different. The determination means can determine the presence or absence of data.

また、請求項１記載の発明において、判断手段が、第１のデータ項目の情報の一部が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータの有無を判断する場合には、第１ソート手段を、例えば請求項３に記載したように、第１の項目の情報のうち一定位置に位置している一部の情報に基づいて複数のデータをソートすることで、複数のデータを、第１のデータ項目の情報の一部が同一のデータで各々構成された複数のグループに分けるように構成すればよい。これにより、個々のグループが、第１のデータ項目の情報の一部が同一のデータによって各々構成されることになり、フラグ設定手段によって設定されたフラグの値に基づいて、第２ソート手段が個々のグループを構成するデータを個々のグループ内でソートすることで、個々のグループの先頭又は末尾のデータに設定されているフラグの値が、第１のデータ項目の情報の一部が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータの有無を表す値となるので、第１のデータ項目の情報の一部が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータの有無を判断手段が判断することが可能となる。 In the first aspect of the present invention, the determination means determines whether there is data in which a part of the information of the first data item is the same and a part or the whole of the information of the second data item is different. In this case, the first sorting means sorts a plurality of data based on a part of information located at a certain position among the information of the first item, for example, as described in claim 3. The plurality of data may be configured so as to be divided into a plurality of groups each including a part of information of the first data item composed of the same data. As a result, each group is composed of a part of the information of the first data item by the same data, and the second sorting means is based on the flag value set by the flag setting means. By sorting the data constituting each group within each group, the flag value set in the data at the beginning or end of each group is the same as part of the information of the first data item and Since a part or all of the information of the second data item is a value indicating the presence or absence of data that is different, a part of the information of the first data item is the same and a part of the information of the second data item Alternatively, the determination means can determine the presence or absence of data that is entirely different.

また、請求項１記載の発明において、第１ソート手段は、例えば請求項４に記載したように、複数のデータを複数のグループに分けた後に、単一のデータから成るグループに属するデータを後段の処理対象（フラグ設定手段や第２ソート手段、判断手段の処理対象）から除外する処理を行うように構成することが好ましい。単一のデータから成るグループが存在していた場合、当該グループに属するデータは、第１のデータ項目の情報の一部又は全体が同一の他のデータが存在していない、すなわち、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータに該当しないと判断できるので、このデータを後段の処理対象から除外することで、後段の処理対象のデータの数が絞り込まれることになり、後段の処理の高速化を実現することができる。 Further, in the first aspect of the invention, the first sorting means, as described in the fourth aspect, for example, after dividing a plurality of data into a plurality of groups, It is preferable that the processing to be excluded from the processing target (the processing target of the flag setting unit, the second sorting unit, and the determination unit) is performed. When a group consisting of a single data exists, the data belonging to the group does not have any other data in which part or all of the information of the first data item is the same. Since it can be determined that the data item information is partly or entirely the same and the second data item information is partly or entirely different, it is possible to exclude this data from the subsequent processing target. As a result, the number of data to be processed in the subsequent stage is narrowed down, and the speed of the subsequent process can be increased.

更に、請求項１記載の発明において、判断手段が有無を判断するデータは、第２のデータ項目の情報の一部が同一の情報であってもよいし、第２のデータ項目の情報全体が同一の情報であってもよいが、判断手段が、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報全体が相違しているデータの有無を判断する場合には、フラグ設定手段を、例えば請求項５に記載したように、複数のデータの各々に対し、第２のデータ項目の情報全体を特定データと比較し、比較結果に応じてフラグを設定するように構成すればよい。これにより、フラグ設定手段によって設定されたフラグの値に基づいて、第２ソート手段が個々のグループを構成するデータを個々のグループ内でソートすることにより、個々のグループの先頭又は末尾のデータに設定されているフラグの値が、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報全体が相違しているデータの有無を表す値となるので、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報全体が相違しているデータの有無を判断手段が判断することが可能となる。 Further, in the first aspect of the invention, the data for which the determination means determines the presence or absence of the second data item may be the same information, or the entire information of the second data item may be the same. The same information may be used, but the determination means determines whether or not there is data in which part or all of the information of the first data item is the same and the entire information of the second data item is different. For example, as described in claim 5, the flag setting unit compares the entire information of the second data item with the specific data for each of the plurality of data, and sets the flag according to the comparison result. What is necessary is just to comprise. As a result, the second sorting means sorts the data constituting each group within each group based on the flag value set by the flag setting means, so that the first or last data of each group is obtained. Since the value of the set flag is a value indicating the presence or absence of data in which a part or the whole of the information of the first data item is the same and the whole information of the second data item is different, the first data item The determination means can determine whether or not there is data in which part or all of the information of the data item is the same and the entire information of the second data item is different.

また、請求項１記載の発明において、判断手段が、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部が相違しているデータの有無を判断する場合には、フラグ設定手段は、例えば請求項６にも記載したように、複数のデータの各々に対し、第２のデータ項目の情報のうち一定位置に位置している一部の情報を特定データと比較し、比較結果に応じてフラグを設定するように構成すればよい。これにより、フラグ設定手段によって設定されたフラグの値に基づいて、第２ソート手段が個々のグループを構成するデータを個々のグループ内でソートすることにより、個々のグループの先頭又は末尾のデータに設定されているフラグの値が、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部が相違しているデータの有無を表す値となるので、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部が相違しているデータの有無を判断手段が判断することが可能となる。 In the first aspect of the present invention, the determination means determines whether or not there is data in which part or all of the information of the first data item is the same and part of the information of the second data item is different. In this case, for example, as described in claim 6, the flag setting unit specifies a part of information located at a certain position among the information of the second data item for each of the plurality of data. What is necessary is just to comprise so that a flag may be set according to a comparison result compared with data. As a result, the second sorting means sorts the data constituting each group within each group based on the flag value set by the flag setting means, so that the first or last data of each group is obtained. Since the value of the set flag is a value indicating the presence or absence of data in which part or all of the information of the first data item is the same and part of the information of the second data item is different, The determination means can determine whether or not there is data in which part or all of the information of one data item is the same and part of the information of the second data item is different.

また、請求項１記載の発明において、フラグ設定手段は、例えば請求項７に記載したように、個々のグループの特定データに対しては、第２のデータ項目の情報の一部又は全体が特定データと同一のデータに前記フラグとして設定する値と同一の値（すなわち、第２のデータ項目の情報の一部又は全体が特定データと相違しているデータに前記フラグとして設定する値と異なる値）を、フラグとして無条件に設定するように構成することができる。 In the first aspect of the invention, the flag setting means may specify a part or all of the information of the second data item for the specific data of each group as described in, for example, the seventh aspect. The same value as the value set as the flag in the same data as the data (that is, a value different from the value set as the flag in the data in which part or all of the information of the second data item is different from the specific data) ) Can be configured unconditionally as a flag.

請求項８記載の発明に係るデータ管理プログラムは、コンピュータを、複数のデータ項目について各々情報が設定されて成る複数のデータを、前記複数のデータ項目のうちの第１の項目の情報に基づいてソートすることで、前記複数のデータを、前記第１のデータ項目の情報の一部又は全体が同一のデータで各々構成された複数のグループに分ける第１ソート手段、前記第１ソート手段によるソートを経た前記複数のデータの各々に対し、前記複数のデータ項目のうちの第２のデータ項目の情報の一部又は全体が、同一のグループに属する特定のデータと相違しているか否かに応じて値の異なるフラグを各々設定するフラグ設定手段、前記フラグ設定手段によって個々のデータに設定されたフラグの値に基づいて、個々のグループを構成するデータを個々のグループ内でソートする第２ソート手段、及び、前記第２ソート手段によるソートを経た前記個々のグループの先頭又は末尾のデータに設定されているフラグの値に基づいて、前記第１のデータ項目の情報の一部又は全体が同一かつ前記第２のデータ項目の情報の一部又は全体が相違しているデータの有無を判断する判断手段として機能させる。 According to an eighth aspect of the present invention, there is provided a data management program comprising: a computer, a plurality of pieces of data each having information set for a plurality of data items, based on information of a first item of the plurality of data items; Sorting by the first sorting means, the first sorting means for sorting the plurality of data into a plurality of groups, each of which includes a part or all of the information of the first data item, each composed of the same data For each of the plurality of data that has passed through, whether or not part or all of the information of the second data item of the plurality of data items is different from the specific data belonging to the same group Flag setting means for setting flags having different values respectively, and individual groups are configured based on the flag values set in the individual data by the flag setting means. Second sorting means for sorting the data within each group, and based on the value of the flag set in the first or last data of the individual group that has been sorted by the second sorting means, It functions as a determination means for determining whether or not there is data in which part or all of the information of one data item is the same and part or all of the information of the second data item is different.

請求項８記載の発明に係るデータ管理プログラムは、コンピュータを、上記の第１ソート手段、フラグ設定手段、第２ソート手段及び判断手段として機能させるためのプログラムであるので、コンピュータが請求項８記載の発明に係るデータ管理プログラムを実行することで、コンピュータが請求項１に記載のデータ管理装置として機能することになり、請求項１記載の発明と同様に、複数のデータ項目について各々情報が設定されて成る複数のデータの中に、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータが存在しているか否かを判断することを、簡易な処理の組み合わせで実現することができる。 The data management program according to the invention described in claim 8 is a program for causing a computer to function as the first sort means, flag setting means, second sort means, and determination means. By executing the data management program according to the present invention, the computer functions as the data management device according to claim 1, and as with the invention according to claim 1, information is set for each of a plurality of data items. Whether there is data in which a part or the whole of the information of the first data item is the same and a part or the whole of the information of the second data item is different in the plurality of data formed It can be realized by a combination of simple processes.

以上説明したように本発明は、複数のデータ項目について各々情報が設定されて成る複数のデータを、第１の項目の情報に基づいてソートすることで、複数のデータを、第１のデータ項目の情報の一部又は全体が同一のデータで各々構成された複数のグループに分け、複数のデータの各々に対し、第２のデータ項目の情報の一部又は全体が、同一のグループに属する特定データと相違しているか否かに応じて値の異なるフラグを各々設定し、個々のデータに設定されたフラグの値に基づいて、個々のグループを構成するデータを個々のグループ内でソートし、前記個々のグループの先頭又は末尾のデータに設定されているフラグの値に基づいて、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータの有無を判断するので、複数のデータ項目について各々情報が設定されて成る複数のデータの中に、第１のデータ項目の情報の一部又は全体が同一かつ第２のデータ項目の情報の一部又は全体が相違しているデータが存在しているか否かを判断することを、簡易な処理の組み合わせで実現できる、という優れた効果を有する。 As described above, the present invention sorts a plurality of pieces of data in which information is set for a plurality of data items based on the information of the first item, so that the plurality of pieces of data are converted into the first data item. A part or all of the information is divided into a plurality of groups each composed of the same data, and for each of the plurality of data, a part or all of the information of the second data item belongs to the same group Set different flags with different values depending on whether they differ from the data, sort the data that make up each group within each group based on the flag value set for each data, Based on the value of the flag set in the head or tail data of each individual group, part or whole of the information of the first data item is the same and part or whole of the information of the second data item is phase Since the presence or absence of the data is determined, the second data item in which a part or all of the information of the first data item is the same among the plurality of data in which information is set for each of the plurality of data items. It has an excellent effect that it can be realized by a combination of simple processes to determine whether or not there is data in which part or all of the information is different.

以下、図面を参照して本発明の実施形態の一例を詳細に説明する。図１には本実施形態に係る、特定金融機関に設けられたコンピュータ・システム１０が示されている。コンピュータ・システム１０は、特定金融機関の情報センタ等に設置されたホスト・コンピュータ１２と、特定金融機関内に構築されたコンピュータ・ネットワーク１８を含んで構成されている。なお、ホスト・コンピュータ１２は請求項８に記載のコンピュータに対応しており、後述するストレージ１４と共に本発明に係るデータ管理装置に対応している。 Hereinafter, an example of an embodiment of the present invention will be described in detail with reference to the drawings. FIG. 1 shows a computer system 10 provided in a specific financial institution according to this embodiment. The computer system 10 includes a host computer 12 installed in an information center of a specific financial institution and a computer network 18 constructed in the specific financial institution. The host computer 12 corresponds to the computer described in claim 8 and corresponds to the data management apparatus according to the present invention together with the storage 14 described later.

ホスト・コンピュータ１２は汎用の大型コンピュータから成り、ＣＰＵ１２Ａ、ＲＡＭ等から成るメモリ１２Ｂ、磁気ディスク等から成る不揮発性の記憶部１２Ｃ、ネットワークインタフェース（Ｉ／Ｆ）部１２Ｄを備えている。ホスト・コンピュータ１２は、ネットワークＩ／Ｆ部１２Ｄに接続された通信回線を介してコンピュータ・ネットワーク１８（詳しくはネットワーク１８内のブランチ・サーバ２０）に接続されている。また、ホスト・コンピュータ１２には、大容量の磁気ディスク等から成り顧客情報データベース（顧客情報ＤＢ）を記憶するための記憶領域が設けられたストレージ１４と、セットされたＭＴ（磁気テープ）１５に対して情報の読み出しや書き込みを行うＭＴ装置１６が各々接続されている。また、ホスト・コンピュータ１２の記憶部１２Ｃには、ホスト・コンピュータ１２のＣＰＵ１２Ａによって顧客情報ＤＢ操作処理を行うための顧客情報ＤＢ操作プログラムと、ホスト・コンピュータ１２のＣＰＵ１２Ａによってデータ抽出処理を行うためのデータ抽出プログラムが各々記憶されている。なお、上記各プログラムのうち、データ抽出プログラムは請求項８に記載のデータ管理プログラムに対応している。 The host computer 12 is a general-purpose large computer, and includes a CPU 12A, a memory 12B including a RAM, a non-volatile storage unit 12C including a magnetic disk, and a network interface (I / F) unit 12D. The host computer 12 is connected to a computer network 18 (specifically, a branch server 20 in the network 18) via a communication line connected to the network I / F unit 12D. The host computer 12 includes a storage 14 composed of a large-capacity magnetic disk or the like and provided with a storage area for storing a customer information database (customer information DB), and a set MT (magnetic tape) 15. An MT device 16 for reading and writing information is connected to each other. Further, in the storage unit 12C of the host computer 12, a customer information DB operation program for performing a customer information DB operation process by the CPU 12A of the host computer 12 and a data extraction process by the CPU 12A of the host computer 12 are performed. Each data extraction program is stored. Of the above programs, the data extraction program corresponds to the data management program described in claim 8.

一方、コンピュータ・ネットワーク１８は、特定金融機関の各支店に各々設置されたブランチ・サーバ２０（ＰＣ、ワークステーション、大型コンピュータの何れでもよい）が通信回線２０を介して互いに接続されて構成されており、個々のブランチ・サーバ２０には、個々のブランチ・サーバ２０と同一の支店に設置された複数台のＡＴＭ（Automatic Teller Machine：現金自動預け払い機）２２及び複数台の営業店端末（金融機関の従業員が操作するための端末）２４が各々接続されている。ＡＴＭ２２には、特定金融機関に口座を開設している顧客が所持しているキャッシュカードを装填可能で、装填されたキャッシュカードに磁気的に記録された情報を読み取り可能なカードリーダが取り付けられており、営業店端末２４には、特定金融機関に口座を開設している顧客が所持している通帳を装填可能で、装填された通帳に磁気的に記録された情報を読み取り可能で、且つ任意の情報を通帳に記録することも可能な記帳機が取り付けられている。 On the other hand, the computer network 18 is configured by connecting branch servers 20 (PCs, workstations, and large computers) installed in each branch of a specific financial institution to each other via a communication line 20. Each branch server 20 includes a plurality of ATMs (Automatic Teller Machines) 22 installed in the same branch as each branch server 20 and a plurality of branch office terminals (financials). Terminals 24 for operation by the employees of the institution are connected to each other. The ATM 22 is equipped with a card reader that can be loaded with a cash card possessed by a customer who has opened an account with a specific financial institution and can read information magnetically recorded on the loaded cash card. The branch terminal 24 can be loaded with a passbook held by a customer who has opened an account with a specific financial institution, can read information recorded magnetically in the loaded passbook, and is optional. A bookkeeping machine that can record the information in the passbook is installed.

次に本実施形態の作用を説明する。本実施形態に係る顧客情報ＤＢは、特定金融機関の個々の顧客に関する情報を登録・管理するためのデータベースであり、単一の顧客の顧客情報として、顧客名や電話番号、住所、顧客が特定金融機関に開設している全ての口座の種別、支店番号、口座番号等の情報が顧客ＩＤと対応付けられて単一のレコードに各々登録される。本実施形態に係る特定金融機関では、通常の業務の遂行に伴い、顧客情報ＤＢに対して顧客情報の登録、読み出し、更新等のアクセスを行う必要が生ずるが、通常業務での顧客情報ＤＢに対するアクセスは、ホスト・コンピュータ１２によって顧客情報ＤＢ操作プログラムが実行されることで実現される。 Next, the operation of this embodiment will be described. The customer information DB according to the present embodiment is a database for registering and managing information related to individual customers of a specific financial institution. As customer information of a single customer, the customer name, telephone number, address, and customer are specified. Information such as the types, branch numbers, and account numbers of all accounts opened in financial institutions is registered in a single record in association with customer IDs. In the specific financial institution according to the present embodiment, it is necessary to access the customer information DB for registration, reading, updating, etc. with the execution of the normal business. Access is realized by executing a customer information DB operation program by the host computer 12.

具体的には、例えば新規顧客によって特定金融機関の口座開設が指示されたり、特定金融機関に既に口座を開設している顧客によって別口座の開設が指示されたり、顧客から住所や電話番号等の変更が通知された場合、顧客情報ＤＢへのアクセスを要求する電文がＡＴＭ２２又は営業店端末２４から送信される。この電文がコンピュータ・ネットワーク１８やブランチ・サーバ２０を経由してホスト・コンピュータ１２で受信されると、顧客情報ＤＢ操作プログラムがＣＰＵ１２Ａによって実行され、顧客情報ＤＢに対して要求されたアクセスを行う（顧客情報ＤＢ操作処理）。なお、要求されたアクセスが顧客情報ＤＢに対する顧客情報の新規登録以外（読み出しや更新等）であれば、ＡＴＭ２２や営業店端末２４から受信する電文にアクセス対象の顧客ＩＤが設定されているので、顧客情報ＤＢ操作処理では、電文に設定されている顧客ＩＤをキーとして顧客情報ＤＢを検索することで、アクセス対象の顧客情報を顧客情報ＤＢから抽出した後に、要求されたアクセスを行う。そして、要求されたアクセスが完了すると、要求元のＡＴＭ２２又は営業店端末２４へ応答（電文）を送信する。 Specifically, for example, a new customer is instructed to open an account with a specific financial institution, a customer who has already opened an account with a specific financial institution is instructed to open another account, or a customer has an address, telephone number, etc. When the change is notified, a message requesting access to the customer information DB is transmitted from the ATM 22 or the branch terminal 24. When the electronic message is received by the host computer 12 via the computer network 18 or the branch server 20, the customer information DB operation program is executed by the CPU 12A to perform the requested access to the customer information DB ( Customer information DB operation processing). If the requested access is other than new registration of customer information to the customer information DB (reading, updating, etc.), the customer ID to be accessed is set in the message received from the ATM 22 or the branch terminal 24. In the customer information DB operation process, the customer information DB is searched using the customer ID set in the message as a key, and the requested access is performed after extracting the customer information to be accessed from the customer information DB. When the requested access is completed, a response (message) is transmitted to the requesting ATM 22 or the branch terminal 24.

ところで、特定金融機関では、セグメント別の顧客データ処理・分析や、マーケティング、顧客宛のアンケートの実施、各種サービスの提案等の業務を行うにあたって急遽、顧客情報ＤＢから「電話番号が同一で顧客名が相違している顧客情報」を抽出する必要が生じた。しかし、特定金融機関における通常業務では上記の条件に合致する顧客情報を顧客情報ＤＢから抽出する必要はなく、顧客情報ＤＢ及び顧客情報ＤＢ操作プログラムの設計開発時には、顧客情報ＤＢに対して上記の条件に合致する顧客情報を検索・抽出することを想定していなかったため、顧客情報ＤＢ操作プログラムには、上記の条件に合致する顧客情報を検索する機能は設けられていない。このため、特定金融機関では、顧客情報ＤＢに対して上記の条件に合致する顧客情報を検索・抽出するために顧客情報抽出プログラムを開発した。以下、ホスト・コンピュータ１２のＣＰＵ１２Ａによって顧客情報抽出プログラムが実行されることで実現される顧客情報抽出処理について、図２を参照して説明する。 By the way, in a specific financial institution, when performing business such as customer data processing and analysis by segment, marketing, conducting questionnaires addressed to customers, proposals for various services, etc. Need to extract "customer information". However, it is not necessary to extract customer information that meets the above conditions from the customer information DB in normal operations at a specific financial institution. When designing and developing the customer information DB and the customer information DB operation program, the customer information DB Since it is not assumed that customer information that matches the conditions is searched / extracted, the customer information DB operation program is not provided with a function for searching for customer information that meets the above conditions. For this reason, a specific financial institution has developed a customer information extraction program in order to search and extract customer information that matches the above conditions from the customer information DB. Hereinafter, customer information extraction processing realized by executing the customer information extraction program by the CPU 12A of the host computer 12 will be described with reference to FIG.

顧客情報抽出処理では、まずステップ５０において、顧客情報ＤＢの全てのレコードをストレージ１４からメモリ１２Ｂに順次読み出すと共に、読み出した各レコードを電話番号（第１キー：本発明に係る第１のデータ項目に相当）及び顧客名（第２キー：本発明に係る第２のデータ項目に相当）に基づいてソートすることで、各レコードを電話番号（の全体）が同一のレコードから成るグループにグループ化し、処理結果を第１中間データとして記憶部１２Ｃに書き込む。図３(Ａ)にも示すように、顧客情報ＤＢ内には複数の顧客情報（複数のレコード）が電話番号TELNO及び顧客名NAMEに関してランダムに配列されており、図３(Ａ)に示す９個の顧客情報（９個のレコード）に対し、ステップ５０で電話番号TELNOの昇順かつ顧客名NAMEの昇順にソートを行った場合、９個のレコードは、図３(Ｂ)に示すように、各々電話番号TELNO＝"1111","2222","3333","4444"のレコードから成る４個のグループにグループ化されることになる（図３では各グループの境界を太線で示している）。 In the customer information extraction process, first, in step 50, all the records of the customer information DB are sequentially read from the storage 14 to the memory 12B, and each read record is read by telephone number (first key: first data item according to the present invention). ) And customer names (second key: corresponding to the second data item according to the present invention) to sort each record into a group consisting of records having the same telephone number (the whole). The processing result is written in the storage unit 12C as the first intermediate data. As shown in FIG. 3A, a plurality of pieces of customer information (a plurality of records) are randomly arranged in the customer information DB with respect to the telephone number TELNO and the customer name NAME. When the customer information (9 records) is sorted in the ascending order of the telephone number TELNO and the ascending order of the customer name NAME in step 50, the 9 records are as shown in FIG. Each group is grouped into four groups of records with telephone numbers TELNO = “1111”, “2222”, “3333”, “4444” (in FIG. 3, the boundaries of each group are indicated by bold lines). ).

なお、上記のステップ５０は本発明に係る第１ソート手段（詳しくは請求項２に記載の第１ソート手段）に対応している。また、電話番号TELNOは本発明に係る第１の項目の情報に、顧客名NAMEは本発明に係る第２の項目の情報に各々対応している。また、ステップ５０では電話番号（第１のデータ項目）及び顧客名（第２のデータ項目）に基づいてソートしているが、電話番号（第１のデータ項目）のみに基づいてソートするようにしてもよい。また、図３では図面を簡単にするために、各レコードの情報として電話番号TELNO及び顧客名NAME以外の図示を省略しているが、実際には各レコードに住所等の他の情報も付加されていることは言うまでもない。 The above step 50 corresponds to the first sorting means (specifically, the first sorting means described in claim 2) according to the present invention. The telephone number TELNO corresponds to the information of the first item according to the present invention, and the customer name NAME corresponds to the information of the second item according to the present invention. In step 50, sorting is performed based on the telephone number (first data item) and the customer name (second data item). However, sorting is performed based only on the telephone number (first data item). May be. Further, in FIG. 3, for the sake of simplicity of the drawing, illustrations other than the telephone number TELNO and the customer name NAME are omitted as information of each record, but actually other information such as an address is added to each record. Needless to say.

次のステップ５２では、ステップ５０で記憶部１２Ｃに書き込んだ第１中間データを記憶部１２Ｃからメモリ１２Ｂに順次読み出し、ステップ５０の処理によって読み出した第１中間データ内に形成されている複数のグループのうち単一のレコードから成るグループを検索し、この検索によって抽出されたグループに属するレコードを除去する。そして、処理結果を第２中間データとして記憶部１２Ｃに書き込む。 In the next step 52, the first intermediate data written to the storage unit 12C in step 50 is sequentially read from the storage unit 12C to the memory 12B, and a plurality of groups formed in the first intermediate data read by the processing of step 50 A group consisting of a single record is searched, and records belonging to the group extracted by this search are removed. Then, the processing result is written in the storage unit 12C as second intermediate data.

例として、記憶手段１２Ｃから読み出した第１中間データが、図３(Ｂ)に示すように、各々電話番号TELNO＝"1111","2222","3333","4444"のレコードから成る４個のグループが形成されたデータであった場合、これらのグループのうち単一のレコードから成るグループは電話番号TELNO＝"4444"のレコードから成るグループであるので、図３(Ｃ)にも示すように、当該グループに属するレコード（電話番号TELNO＝"4444"、顧客名NAME＝"かかかか"のレコード）が除去される（図３(Ｃ)では該当するレコードの出力対象情報O/P＝"×"となっているが、これは該当するレコードを第２中間データとして出力しないことを意味し、図３(Ｄ)にも示すように該当するレコードは中間データ上から除去されている）。 As an example, as shown in FIG. 3B, the first intermediate data read from the storage means 12C is composed of 4 records of telephone numbers TELNO = “1111”, “2222”, “3333”, “4444”, respectively. In the case of data in which a plurality of groups are formed, a group consisting of a single record among these groups is a group consisting of records of the telephone number TELNO = “4444”, and is also shown in FIG. In this way, the records belonging to the group (the telephone number TELNO = “4444”, the customer name NAME = “high / low” record) are removed (the output target information O / of the corresponding record in FIG. 3C). P = “×” means that the corresponding record is not output as the second intermediate data, and the corresponding record is removed from the intermediate data as shown in FIG. )

ステップ５２で除去されたレコードは次のステップ５４以降の処理の対象から除外されるので、ステップ５２は請求項４に記載の第１ソート手段に対応している。また、レコードそのものを中間データから除去することに代えて、単一のレコードから成るグループに属するレコードに所定の情報を付加して中間データとして出力し、次のステップ５４以降の処理において、中間データのうち所定の情報が付加されているレコードを処理対象から除外するようにしてもよい。 Since the record removed in step 52 is excluded from the processing target in the next step 54 and subsequent steps, step 52 corresponds to the first sorting means described in claim 4. Also, instead of removing the record itself from the intermediate data, predetermined information is added to the records belonging to the group consisting of a single record and output as intermediate data. Of these, records to which predetermined information is added may be excluded from the processing target.

次のステップ５４では、ステップ５２で記憶部１２Ｃに書き込んだ第２中間データを記憶部１２Ｃからメモリ１２Ｂに順次読み出し、読み出した第２中間データの各レコードに各々フラグを設定する。このフラグの設定は、個々のグループ毎に以下のようにして行われる。すなわち、第２中間データ内の個々のグループの先頭レコードに対してはフラグとして無条件に"1"を設定し、第２中間データ内の個々のグループの２番目以降の各レコードに対しては、同一グループの先頭レコードと顧客名NAME（の全体）を比較し、同一グループの先頭レコードと顧客名NAME（の全体）が同一であればフラグとして"1"を、同一グループの先頭レコードと顧客名NAME（の全体）が相違していればフラグとして"0"を各々設定する。そして、処理結果を第３中間データとして記憶部１２Ｃに書き込む。 In the next step 54, the second intermediate data written in the storage unit 12C in step 52 is sequentially read from the storage unit 12C to the memory 12B, and a flag is set for each record of the read second intermediate data. This flag is set for each group as follows. That is, “1” is unconditionally set as the flag for the first record of each group in the second intermediate data, and for the second and subsequent records of each group in the second intermediate data. Compare the first record of the same group with the customer name NAME (entire), and if the first record of the same group and the customer name NAME (entire) are the same, set the flag to "1", the first record of the same group and the customer If the names NAME are different, "0" is set for each flag. Then, the processing result is written in the storage unit 12C as the third intermediate data.

例として、記憶手段１２Ｃから読み出した第２中間データが、図３(Ｄ)に示すように、各々電話番号TELNO＝"1111","2222","3333"のレコードから成る３個のグループが形成されたデータであった場合、電話番号TELNO＝"1111"のグループについては、当該グループの先頭レコード（顧客名NAME＝"ああああ"）にフラグFLGとして無条件に"1"が設定され、２番目のレコード（顧客名NAME＝"いいいい"）は顧客名NAME（の全体）が先頭レコードと相違しているので、フラグFLGとして"0"が設定される。また、電話番号TELNO＝"2222"のグループについては、当該グループの先頭レコード（顧客名NAME＝"うううう"）にフラグFLGとして無条件に"1"が設定され、２番目のレコード（顧客名NAME＝"うううう"）は顧客名NAME（の全体）が先頭レコードと同一であるのでフラグFLGとして"1"が設定される。更に、電話番号TELNO＝"3333"のグループについては、当該グループの先頭レコード（顧客名NAME＝"ええええ"）にフラグFLGとして無条件に"1"が設定され、２番目のレコード（顧客名NAME＝"ええええ"）は顧客名NAME（の全体）が先頭レコードと同一であるのでフラグFLGとして"1"が設定され、３，４番目のレコード（何れも顧客名NAME＝"おおおお"）は顧客名NAME（の全体）が先頭レコードと相違しているのでフラグFLGとして"0"が各々設定される。 As an example, as shown in FIG. 3D, the second intermediate data read from the storage means 12C includes three groups each consisting of records of telephone numbers TELNO = “1111”, “2222”, “3333”. In the case of the formed data, for the group with the telephone number TELNO = “1111”, “1” is unconditionally set as the flag FLG in the first record of the group (customer name NAME = “Oh Ah”). Since the first record (customer name NAME = “good”) is different from the first record in customer name NAME (entire), “0” is set as the flag FLG. For the group with the telephone number TELNO = "2222", "1" is unconditionally set as the flag FLG in the first record (customer name NAME = "Uu Uu") of the group, and the second record (customer name) NAME = “Uu Uu”), since the customer name NAME (the whole) is the same as the first record, “1” is set as the flag FLG. Furthermore, for the group with the telephone number TELNO = “3333”, “1” is unconditionally set as the flag FLG in the first record (customer name NAME = “yeah yeah”) of the group, and the second record (customer name) Since the customer name NAME (whole) is the same as the first record in NAME = “Yes yeah”), the flag FLG is set to “1”, and the third and fourth records (both customer names NAME = “Oh”) ) Is different from the first record in the customer name NAME (whole), so “0” is set as the flag FLG.

上記のステップ５４は本発明に係るフラグ設定手段（詳しくは請求項５及び請求項７に記載のフラグ設定手段）に対応している。なお、上記では個々のグループの先頭レコード、及び、個々のグループの２番目以降のレコードのうち同一グループの先頭レコードと顧客名NAME（の全体）が同一のレコードに対してはフラグとして"1"を設定し、個々のグループの２番目以降のレコードのうち同一グループの先頭レコードと顧客名NAME（の全体）が相違しているレコードに対してはフラグとして"0"を設定しているが、設定するフラグの値は上記で別の値を設定したレコードを区別可能な値であればよく、例えば値を逆に設定してもよいし、"0","1"以外の別の値を用いてもよい。 The step 54 corresponds to flag setting means according to the present invention (specifically, flag setting means according to claims 5 and 7). In the above, the first record of each group, and the first record of the same group and the record with the same customer name NAME (the whole) among the second and subsequent records of each group are set to “1” as a flag. , And among the records after the second of each group, the first record of the same group and the customer name NAME (whole) are different, but "0" is set as a flag. The flag value to be set only needs to be a value that can distinguish a record set with another value as described above. For example, the value may be set in reverse, or another value other than “0” and “1” may be set. It may be used.

次のステップ５６では、ステップ５４で記憶部１２Ｃに書き込んだ第３中間データを記憶部１２Ｃからメモリ１２Ｂに順次読み出し、読み出した第３中間データの個々のグループ内を、ステップ５４で設定したフラグの値の昇順にソートする。そして、処理結果を第４中間データとして記憶部１２Ｃに書き込む。例えば、記憶手段１２Ｃから読み出した第３中間データが図３(Ｄ)に示すような内容であった場合、個々のグループ内をフラグの値の昇順にソートすることで、図３(Ｅ)に示すように、電話番号TELNO＝"1111"のグループについては、フラグFLGとして"0"が設定された顧客名NAME＝"いいいい"のレコード（図３(Ｄ)において、当該グループの２番目に位置しているレコード）が先頭レコードとなり、電話番号TELNO＝"3333"のグループについては、フラグFLGとして"0"が設定された顧客名NAME＝"おおおお"の２個のレコード（図３(Ｄ)において、当該グループの３番目及び４番目に位置しているレコード）が各々先頭レコード、２番目のレコードとなる。 In the next step 56, the third intermediate data written in the storage unit 12C in step 54 is sequentially read from the storage unit 12C to the memory 12B, and the individual groups of the read third intermediate data are set in the flag set in step 54. Sort in ascending order of value. Then, the processing result is written in the storage unit 12C as the fourth intermediate data. For example, when the third intermediate data read from the storage unit 12C has the contents as shown in FIG. 3D, the individual groups are sorted in ascending order of the flag values to obtain the result shown in FIG. As shown, for the group with the telephone number TELNO = “1111”, the customer name NAME = “good” record with the flag FLG set to “0” (in FIG. 3D, the second of the group) For the group with the telephone number TELNO = “3333”, the two records with the customer name NAME = “Oo” with the flag FLG set to “0” (FIG. 3 ( In D), the third and fourth records in the group are the first record and the second record, respectively.

上記のステップ５６は本発明に係る第２ソート手段に対応している。なお、上記では個々のグループ内をフラグの値の昇順にソートしているが、個々のグループ内をフラグの値の降順にソートするようにしてもよい（この場合、次のステップ５８において、個々のグループの末尾のレコードに設定されているフラグの値に基づいて、個々のグループを出力対象とするか否かを判断するようにすればよい）。 Step 56 above corresponds to the second sorting means according to the present invention. In the above, the individual groups are sorted in ascending order of the flag values, but the individual groups may be sorted in descending order of the flag values (in this case, in the next step 58, the individual groups are sorted in descending order). Based on the flag value set in the last record of the group, it is sufficient to determine whether or not each group is to be output).

また、ステップ５８では、ステップ５６で記憶部１２Ｃに書き込んだ第４中間データを記憶部１２Ｃからメモリ１２Ｂに順次読み出し、読み出した第４中間データにおいて、個々のグループの先頭レコードに設定されているフラグの値に基づいて、個々のグループを出力対象とするか否かを判断する。すなわち、或るグループの先頭レコードに設定されているフラグの値が"1"の場合、このグループ内には、フラグの値が"0"のレコード、すなわち同一グループ内の他のレコードと顧客名NAMEが相違しているレコードが存在していないと判断できるので、このグループに属する各レコードを非出力対象に設定する。一方、或るグループの先頭レコードに設定されているフラグの値が"0"の場合、このグループ内には、同一グループ内の他のレコードと顧客名NAMEが相違しているレコードが混在していると判断できるので、このグループに属する各レコードを出力対象に設定する。そして、第４中間データのうち、出力対象に設定したグループのレコードのみを出力データとして、ＭＴ装置１６を介してＭＴ１５に書き込み、顧客情報抽出処理を終了する。 In step 58, the fourth intermediate data written in the storage unit 12C in step 56 is sequentially read from the storage unit 12C to the memory 12B, and the flag set in the first record of each group in the read fourth intermediate data. Whether or not each group is an output target is determined based on the value of. That is, if the value of the flag set in the first record of a group is "1", the record with the flag value "0" in this group, that is, the customer name with other records in the same group Since it can be determined that there is no record having a different NAME, each record belonging to this group is set as a non-output target. On the other hand, if the value of the flag set for the first record of a group is "0", this group contains records with different customer names NAME from other records in the same group. Therefore, each record belonging to this group is set as an output target. Then, among the fourth intermediate data, only the record of the group set as the output target is written as output data to the MT 15 via the MT device 16, and the customer information extraction process is terminated.

例えば、記憶手段１２Ｃから読み出した第４中間データが図３(Ｅ)に示すような内容であった場合、電話番号TELNO＝"1111"のグループについては先頭レコードにフラグFLGとして"0"が設定されていることで、図３(Ｆ)に示すように、当該グループに属する全レコード（２個のレコード）が出力対象に設定され（出力対象情報O/P＝"○"に設定され）、電話番号TELNO＝"2222"のグループについては先頭レコードにフラグFLGとして"1"が設定されていることで、当該グループに属する全レコード（２個のレコード）が非出力対象に設定され（出力対象情報O/P＝"×"に設定され）、電話番号TELNO＝"3333"のグループについては先頭レコードにフラグFLGとして"0"が設定されていることで、当該グループに属する全レコード（４個のレコード）が出力対象に設定される（出力対象情報O/P＝"○"に設定される）。そして、出力対象に設定された（出力対象情報O/P＝"○"に設定された）レコードのみが出力データとしてＭＴ１５に書き込まれる。なお、出力データはＭＴ１５以外の他の媒体へ出力するようにしてもよいことは言うまでもない。 For example, when the fourth intermediate data read from the storage means 12C has the contents as shown in FIG. 3E, for the group with the telephone number TELNO = "1111", "0" is set as the flag FLG in the first record. By doing so, as shown in FIG. 3 (F), all records (two records) belonging to the group are set as output targets (output target information O / P = “O” is set), For the group with the telephone number TELNO = "2222", all records belonging to the group (2 records) are set as non-output targets because the flag FLG is set to "1" in the first record (output target) Information O / P = “×”), and for the group with the telephone number TELNO = “3333”, “0” is set as the flag FLG in the first record, so all the records belonging to the group (4 records) Records) are set as output targets (output) Force target information O / P = “O”). Only the record set as the output target (output target information O / P = “◯”) is written in the MT 15 as output data. Needless to say, the output data may be output to a medium other than MT15.

これにより、顧客情報ＤＢに登録されている各レコード（各顧客情報）のうち、顧客情報抽出処理における抽出対象である、「電話番号TELNOが同一で顧客名NAMEが相違している顧客情報（レコード）」が混在しているグループのレコードのみが、出力データとしてＭＴ１５に書き込まれることになる。上述した顧客情報抽出処理の各ステップは何れもごく簡単な処理であり、特にステップ５０，５６におけるソートはデータ処理で多用される定型処理であるので、既存のプログラムを流用できる可能性が非常に高く、顧客情報抽出処理を実現する顧客情報抽出プログラムは、非常に短い期間で開発を完了させることができる。従って、上記で説明したように「電話番号TELNOが同一で顧客名NAMEが相違している顧客情報（レコード）」を抽出する場合のみならず、本発明に係るアルゴリズム（顧客情報抽出処理のアルゴリズム）を利用可能な条件に合致するデータを抽出する場合にも、該当するデータの有無を迅速に確認したり、該当するデータを抽出することができる。 As a result, among the records (each customer information) registered in the customer information DB, “customer information (records with the same telephone number TELNO and different customer names NAME), which is the extraction target in the customer information extraction process. Only the records in the group in which “)” are mixed are written in the MT 15 as output data. Each step of the customer information extraction process described above is a very simple process, and the sorting in steps 50 and 56 is a standard process frequently used in data processing, so there is a great possibility that an existing program can be diverted. The customer information extraction program that realizes the customer information extraction process can be completed in a very short period of time. Therefore, as described above, not only the case of extracting “customer information (record) with the same telephone number TELNO but different customer name NAME”, the algorithm according to the present invention (algorithm for customer information extraction processing) In the case of extracting data that matches the conditions that can be used, it is possible to quickly confirm the presence or absence of the corresponding data or to extract the corresponding data.

なお、上記では本発明に係る第１のデータ項目として電話番号TELNOを、第２のデータ項目として顧客名NAMEを適用した態様を説明したが、これに限定されるものではなく、例えば第１のデータ項目として住所を、第２のデータ項目として支店番号を適用する等のように、第１及び第２のデータ項目としては任意の情報を適用可能である。また、上記では処理対象のデータとして顧客情報ＤＢに登録されている顧客情報を例に説明したが、これに限定されるものでもなく、任意のデータを処理対象とすることが可能である。また、上記では特定の条件に合致するデータの抽出を行う契機として、セグメント別の顧客データ処理・分析や、マーケティング、顧客宛のアンケートの実施、各種サービスの提案等の業務を例に説明したが、本発明の用途がこれに限定されるものでもないことは言うまでもない。 In addition, although the aspect which applied telephone number TELNO as a 1st data item which concerns on this invention above, and applied customer name NAME as a 2nd data item was demonstrated, it is not limited to this, For example, it is 1st Arbitrary information can be applied as the first and second data items, such as applying an address as a data item and a branch number as a second data item. In the above description, customer information registered in the customer information DB as data to be processed has been described as an example. However, the present invention is not limited to this, and arbitrary data can be processed. In addition, in the above, as an opportunity to extract data that matches a specific condition, the explanation was given by taking as an example the operations such as customer data processing and analysis by segment, marketing, conducting questionnaires addressed to customers, proposals for various services, etc. Needless to say, the application of the present invention is not limited to this.

また、上記では第１のデータ項目としての電話番号TELNO全体が同一かつ第２のデータ項目としての顧客名NAME全体が相違しているデータ（顧客情報）の有無の確認・抽出を例に説明したが、本発明はこれに限定されるものではなく、第１のデータ項目の情報の一部が同一のデータを対象としてもよい。これは、第１の項目の情報に基づくソート（図２におけるステップ５０）において、第１の項目の情報のうちの一部の情報のみを比較してソート（及びグループ化）を行うことで実現できる。具体的には、例えば第１のデータ項目として住所を適用した態様において、住所のうち町名までの情報を比較して（番地等の情報を比較対象から除外して）ソート（及びグループ化）を行う。この場合、住所のうち少なくとも町名迄が同一のデータ毎にグループを形成することができる。上記態様は請求項３記載の発明に対応している。また、第２のデータ項目の情報の一部が相違するデータを対象としてもよい。これは、第２のデータ項目の情報に基づくフラグ設定（図２におけるステップ５４）において、第２の項目の情報のうちの一部の情報のみを比較し、比較結果に応じてフラグを設定することで実現できる。具体的には、例えば第２のデータ項目として顧客名を適用した態様において、顧客名のうち苗字のみを比較して（名前を比較対象から除外して）フラグを設定する。この場合、個々のグループ内でフラグの値に基づいてデータをソートした後に、個々のグループ内の先頭又は末尾のデータに設定されているフラグの値を参照することで、各グループ内に顧客名のうちの少なくとも苗字が相違するデータが混在しているか否かを判断することができる。上記態様は請求項６記載の発明に対応している。 In addition, the above description has been made by taking as an example confirmation / extraction of the presence / absence of data (customer information) in which the entire telephone number TELNO as the first data item is the same and the entire customer name NAME as the second data item is different. However, the present invention is not limited to this, and data having a part of information of the first data item may be the same. This is realized by sorting (and grouping) by comparing only a part of the information of the first item in the sorting based on the information of the first item (step 50 in FIG. 2). it can. Specifically, for example, in an aspect in which an address is applied as the first data item, sorting (and grouping) is performed by comparing information up to the town name in the address (excluding information such as addresses from the comparison target). Do. In this case, a group can be formed for each piece of data having the same address at least up to the town name. The above aspect corresponds to the invention described in claim 3. Further, data in which part of the information of the second data item is different may be targeted. In the flag setting based on the information of the second data item (step 54 in FIG. 2), only a part of the information of the second item is compared, and the flag is set according to the comparison result. This can be achieved. Specifically, for example, in the aspect in which the customer name is applied as the second data item, the flag is set by comparing only the last name of the customer name (excluding the name from the comparison target). In this case, after sorting the data based on the flag value in each group, by referring to the flag value set in the first or last data in each group, the customer name in each group It is possible to determine whether or not data having different last names is mixed. The above aspect corresponds to the invention described in claim 6.

また、上記では顧客情報ＤＢから読み出した各レコードに対し、ステップ５０で電話番号TELNO及び顧客名NAMEに基づいてソートすることでグループ化した後に、ステップ５２において、単一のレコードから成るグループを検索し、該当するグループ（に属するレコード）を除去する態様を説明したが、これに限定されるものではなく、例えば該当するグループ（レコード）の数が比較的少数であることが予想される等の場合には、単一のレコードから成るグループの検索及び該当するグループ（に属するレコード）の除去を省略するようにしてもよい。 Further, in the above, after the records read from the customer information DB are grouped by sorting based on the telephone number TELNO and customer name NAME in step 50, a group consisting of a single record is searched in step 52. However, the mode of removing the corresponding group (records belonging to) has been described, but the present invention is not limited to this. For example, it is expected that the number of corresponding groups (records) is relatively small. In this case, the search for a group consisting of a single record and the removal of the corresponding group (records belonging to it) may be omitted.

更に、上記では顧客情報ＤＢから全レコードを読み出し、第１のデータ項目の情報に基づくソート（グループ化）やフラグFLGの設定、フラグFLGの値に基づく個々のグループ内のソート等の処理を経て、第１のデータ項目の情報が同一かつ第２のデータ項目の情報が相違しているレコードが混在しているグループに属する全レコードを出力データとして出力する態様を説明したが、この処理方式では、図３での図示は省略しているものの、各レコードに第１のデータ項目の情報及び第２のデータ項目の情報以外の他の情報（例えば住所等）も付加された状態で各処理を行っているので、各中間データのサイズが大きく、各処理を行う毎に中間データを記憶部１２Ｃに書き込む必要が生ずる（なお、中間データを記憶部１２Ｃに書き込む必要が有るか否かは、個々のレコードのサイズや数（処理対象のデータの規模）、メモリ１２Ｂ等のリソースのサイズにも左右されるので、上記処理方式でも中間データを記憶部１２Ｃに書き込まずに済む可能性もある）。 Further, in the above, all records are read from the customer information DB, and after processing such as sorting (grouping) based on the information of the first data item, setting of the flag FLG, sorting within individual groups based on the value of the flag FLG, etc. In the processing method, all records belonging to a group in which records having the same first data item information and different second data item information are mixed are described as output data. Although not shown in FIG. 3, each process is performed in a state where other information (for example, an address) other than the information of the first data item and the information of the second data item is added to each record. Therefore, the size of each intermediate data is large, and it is necessary to write the intermediate data to the storage unit 12C every time each process is performed (the intermediate data needs to be written to the storage unit 12C). Whether or not it depends on the size and number of individual records (scale of data to be processed) and the size of resources such as the memory 12B, it is not necessary to write intermediate data to the storage unit 12C even in the above processing method. Possible).

このため、リソースとして使用可能な記憶部１２Ｃのサイズに制限がある等の場合には、顧客情報ＤＢの各レコードから第１のデータ項目の情報及び第２のデータ項目の情報のみを抽出することで各レコードのインデックスを作成し、このインデックスに対して第１のデータ項目の情報に基づくソート（グループ化）、フラグFLGの設定、フラグFLGの値に基づく個々のグループ内のソート、第１のデータ項目の情報が同一かつ第２のデータ項目の情報が相違しているレコードの判断、の各処理を順に行った後に、前記判断の結果に基づいて第１のデータ項目の情報が同一かつ第２のデータ項目の情報が相違しているレコードの実データを顧客情報ＤＢから順に読み出し、出力データとして出力するようにしてもよい。 For this reason, when the size of the storage unit 12C that can be used as a resource is limited, only the information on the first data item and the information on the second data item are extracted from each record of the customer information DB. To create an index for each record, and sort (grouping) based on the information of the first data item for this index, set flag FLG, sort within individual groups based on the value of flag FLG, first After sequentially determining each of the records having the same data item information and different second data item information, the first data item information is the same and first based on the result of the determination. The actual data of the records in which the information of the two data items is different may be sequentially read from the customer information DB and output as output data.

また、上記では第１のデータ項目及び第２のデータ項目として各々単一のデータ項目を適用した態様を説明したが、本発明はこれに限定されるものでもなく、第１のデータ項目及び第２のデータ項目の少なくとも一方が複数のデータ項目から構成されていてもよい。第１のデータ項目が複数のデータ項目から構成されている場合には、第１の項目の情報に基づくソート（図２におけるステップ５０）において、第１のデータ項目を構成する複数のデータ項目に基づいてソートを行い、処理対象の複数のデータを、第１のデータ項目を構成する複数のデータ項目が各々同一のデータで構成された複数のグループに分けるようにすればよい。また、第２のデータ項目が複数のデータ項目から構成されている場合は、第２のデータ項目の情報に基づくフラグの設定（図２におけるステップ５４）において、第２のデータ項目を構成する複数のデータ項目の情報を各々比較し、第２のデータ項目を構成する複数のデータ項目の情報が各々相違しているか否か（或いは、第２のデータ項目を構成する複数のデータ項目の情報のうちの少なくとも１つが相違しているか否か）に応じて値の異なるフラグを設定するようにすればよい。 Moreover, although the aspect which applied each single data item as a 1st data item and a 2nd data item was demonstrated above, this invention is not limited to this, A 1st data item and a 1st data item At least one of the two data items may be composed of a plurality of data items. When the first data item is composed of a plurality of data items, in the sorting based on the information of the first item (step 50 in FIG. 2), the plurality of data items constituting the first data item are Based on the sorting, the plurality of data to be processed may be divided into a plurality of groups in which a plurality of data items constituting the first data item are each composed of the same data. Further, when the second data item is composed of a plurality of data items, a plurality of data constituting the second data item is set in the flag setting (step 54 in FIG. 2) based on the information of the second data item. Are compared with each other, whether the information of the plurality of data items constituting the second data item is different from each other (or the information of the plurality of data items constituting the second data item). Different flags may be set according to whether at least one of them is different.

また、上記では本発明に係るデータ管理プログラムに対応する顧客情報抽出プログラムが記憶部１２Ｃに予め記憶されている態様を説明したが、本発明に係るデータ管理プログラムは、ＣＤ−ＲＯＭやＤＶＤ−ＲＯＭ、或いはＭＴ等の記録媒体に記録されている形態で提供することも可能である。 In the above description, the customer information extraction program corresponding to the data management program according to the present invention is stored in the storage unit 12C in advance. However, the data management program according to the present invention can be a CD-ROM or DVD-ROM. Alternatively, it may be provided in a form recorded on a recording medium such as MT.

本実施形態に係るコンピュータ・システムの概略構成を示すブロック図である。It is a block diagram which shows schematic structure of the computer system which concerns on this embodiment. 顧客情報抽出処理の内容を示すフローチャートである。It is a flowchart which shows the content of a customer information extraction process. 顧客情報抽出処理によって得られるデータの一例を示す概略図である。It is the schematic which shows an example of the data obtained by a customer information extraction process.

Explanation of symbols

１０コンピュータ・システム
１２ホスト・コンピュータ
１２ＡＣＰＵ
１２Ｂメモリ
１２Ｃ記憶部
１４ストレージ
１５ＭＴ 10 Computer System 12 Host Computer 12A CPU
12B Memory 12C Storage unit 14 Storage 15 MT

Claims

By sorting a plurality of data each having information set for a plurality of data items based on information of a first item of the plurality of data items, the plurality of data are converted into the first data. A first sorting means for dividing a part or the whole of the item information into a plurality of groups each composed of the same data;
For each of the plurality of data that has been sorted by the first sorting means, part or all of the information of the second data item of the plurality of data items is different from the specific data belonging to the same group. Flag setting means for setting different flags depending on whether or not
Second sorting means for sorting the data constituting the individual groups within the individual groups based on the flag values set for the individual data by the flag setting means;
Based on the value of the flag set in the first or last data of the individual group that has been sorted by the second sorting means, part or all of the information of the first data item is the same and the second A determination means for determining the presence or absence of data in which some or all of the data item information is different;
Data management device including

The first sorting unit is configured to determine whether or not there is data in which the entire information of the first data item is the same and part or all of the information of the second data item is different. By sorting the plurality of data based on the entire information of the first item, the plurality of data is divided into a plurality of groups each composed of the same information as the entire information of the first data item. The data management apparatus according to claim 1, wherein the data management apparatus is divided.

In the first sorting unit, the determining unit determines whether there is data in which a part of the information of the first data item is the same and a part or the whole of the information of the second data item is different. In this case, by sorting the plurality of data based on a part of information located at a fixed position among the information of the first item, the plurality of data is converted into the information of the first data item. 2. The data management apparatus according to claim 1, wherein a part of the data is divided into a plurality of groups each constituted by the same data.

2. The first sorting unit, after dividing the plurality of data into the plurality of groups, performing a process of excluding data belonging to a group consisting of a single data from a subsequent processing target. The data management device described.

The flag setting means, when the determination means determines the presence or absence of data in which part or all of the information of the first data item is the same and the entire information of the second data item is different, 2. The data management apparatus according to claim 1, wherein for each of the plurality of data, the entire information of the second data item is compared with the specific data, and the flag is set according to the comparison result.

The flag setting means determines whether or not there is data in which a part or all of the information of the first data item is the same and a part of the information of the second data item is different. In addition, for each of the plurality of data, a part of the information of the second data item that is located at a certain position is compared with the specific data, and the flag is set according to the comparison result. The data management apparatus according to claim 1.

The flag setting means, for the specific data of each group, a part or all of the information of the second data item is the same value as the value set as the flag in the same data as the specific data The data management apparatus according to claim 1, wherein the flag is unconditionally set as the flag.

Computer
By sorting a plurality of data each having information set for a plurality of data items based on information of a first item of the plurality of data items, the plurality of data are converted into the first data. A first sorting means for dividing a part or the whole of the item information into a plurality of groups each composed of the same data;
For each of the plurality of data that has been sorted by the first sorting means, part or all of the information of the second data item of the plurality of data items is different from the specific data belonging to the same group. Flag setting means for setting different flags depending on whether or not
Second sorting means for sorting the data constituting the individual groups within the individual groups based on the flag values set for the individual data by the flag setting means;
And based on the value of the flag set in the data at the beginning or end of the individual group that has been sorted by the second sorting means, part or all of the information of the first data item is the same and the A data management program that functions as a determination unit that determines whether there is data in which part or all of the information of the second data item is different.