JP2010165004A

JP2010165004A - Data storage system and data management method

Info

Publication number: JP2010165004A
Application number: JP2009004297A
Authority: JP
Inventors: Yoshihiko Kinoshita; 佳彦木下
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2009-01-13
Filing date: 2009-01-13
Publication date: 2010-07-29
Anticipated expiration: 2029-01-13
Also published as: JP5402000B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a data storage system, etc., for streamlining storage data in a physical disk and virtually generating latest data at a specified time point in the past or tabulation data during a period, based on the storage data. <P>SOLUTION: Not the whole of past data are held and only the new or updated data are held concerning the data to be stored in an analysis data storage database server. Consequently, the physical disk to be used is streamlined. Besides, the latest state data on any designated days in the past is displayed and also the number of cases within a designated period is tabulated by devising designation in retrieving the data. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、ＤＷＨ（データウェアハウス）の概念を利用するにあたり、物理ディスク内の蓄積データを効率化したり、この蓄積データより過去の特定時点での最新データ等を仮想的に作成する技術に関する。 The present invention relates to a technique for making the accumulated data in a physical disk more efficient or virtually creating the latest data at a specific point in the past from the accumulated data when using the concept of DWH (data warehouse).

通常、分析用のデータ蓄積においては、日々、週時、月次、四半期、年次等の一定期間を定義し、それぞれの期間ごとに分析に必要な全データを蓄積し、それらの変化の推移や統計取得等、自由に分析するため活用する。 In general, in the accumulation of data for analysis, a certain period such as daily, weekly, monthly, quarterly, or annual is defined, and all data necessary for analysis is accumulated for each period, and the change of those changes And use it for free analysis such as obtaining statistics.

しかし、データを蓄積するデータベースを持つサーバおける物理ディスク容量が応対履歴データベース（オンラインでデータを格納／更新する側のデータベース）に比べ遥かに大容量の物理ディスク容量を確保しておくことが必要不可欠であった。 However, it is essential to ensure that the physical disk capacity of the server with the database that stores data is much larger than that of the response history database (online data storage / update database). Met.

また、上記を要因とするディスクの制限により、過去のある時点での最新データの持ち方に制限（Ｘ日分、Ｘヶ月分、Ｘ年分等）が生じる懸念もあった。 In addition, there is a concern that restrictions on the latest data at a certain point in the past (X days, X months, X years, etc.) may occur due to disk limitations due to the above.

関連技術として、予めＣＤ−ＲＯＭからＤＢがハードディスクにコピーされ、次に該ＤＢについての差分情報が配布され、該差分情報を基に該ハードディスクの内容を書き換える技術が提案されている（例えば、特許文献１参照）。
また、利用者端末から商品ＤＢを検索する場合、予め配布されたＣＤ−ＲＯＭを検索し次にＮＷ経由で送られた差分情報を検索して所望のデータを得る技術が提案されている（例えば、特許文献２参照）。 As a related technique, a technique has been proposed in which a DB is copied from a CD-ROM to a hard disk in advance, then difference information about the DB is distributed, and the contents of the hard disk are rewritten based on the difference information (for example, patents). Reference 1).
Also, when searching for a product DB from a user terminal, a technique has been proposed in which a CD-ROM distributed in advance is searched, and then difference information sent via the NW is searched to obtain desired data (for example, , See Patent Document 2).

特開２００３−１４０９３１号公報JP 2003-140931 A 特開２００４−２４０５６２号公報JP 2004-240562 A

しかしながら、上述の関連技術は、コンタクトセンター等において顧客データや相談・苦情の内容を分析し、マーケティング等で活用するためにＤＷＨ（データウェアハウス）の概念を利用するにあたり、分析データ蓄積データベースサーバ用に調達した機器のディスク容量が応対履歴データベースサーバと同等しかなかったことにより、物理ディスク内の蓄積データを効率化することや、この蓄積データより過去の特定時点での最新データ又は期間中の集計データを仮想的に作成することが困難であった。 However, the above-mentioned related technology uses the DWH (data warehouse) concept to analyze customer data and consultation / complaint content at a contact center, etc., and use it in marketing etc. Since the disk capacity of the equipment procured in the past was equal to the response history database server, the accumulated data in the physical disk can be made more efficient, the latest data at a specific point in the past than this accumulated data, or the total during the period It was difficult to create data virtually.

本発明は、以上のような課題を解決するためになされたもので、物理ディスク内の蓄積データを効率化することや、この蓄積データより過去の特定時点での最新データ又は期間中の集計データを仮想的に作成することができるデータ蓄積システムおよびデータ管理方法を提供することを目的とする。 The present invention has been made in order to solve the above-described problems, and improves the efficiency of the accumulated data in the physical disk, and the latest data at a specific point in the past or the aggregated data during the period from the accumulated data. An object of the present invention is to provide a data storage system and a data management method capable of virtually creating a data.

本発明の第一のデータ蓄積システムは、全案件のデータコピーを実施せず、前回のコピー時より新規追加または変更のあった案件データについてのみコピーするよう、レコードの最終登録更新日付にて判定を行い、対象を抽出し、
更新データであった場合、更新前状態のデータもＤＷＨ−ＤＢ側ではそのまま保持し、
抽出時には前記ＤＷＨ―ＤＢ側にて、別途専用の一意となるコードを割り振るデータ蓄積システムにおいて、
指定された対象日付以前の最終更新日付にマッチするレコードをすべて検索し、案件が重複した場合は案件コードにて名寄せを行い、最新データのみ抽出・表示することを特徴とする。 The first data storage system of the present invention does not perform data copy of all cases, but determines only the case data that has been newly added or changed since the previous copy, based on the last registration update date of the record. To extract the target,
If it is update data, the data before update is also kept on the DWH-DB side,
At the time of extraction, in the data storage system that allocates a dedicated unique code separately on the DWH-DB side,
All records that match the last update date before the specified target date are searched, and if there are duplicated items, name identification is performed with the item code, and only the latest data is extracted and displayed.

本発明の第二のデータ蓄積システムは、全案件のデータコピーを実施せず、前回のコピー時より新規追加または変更のあった案件データについてのみコピーするよう、レコードの最終登録更新日付にて判定を行い、対象を抽出し、
更新データであった場合、更新前状態のデータもＤＷＨ−ＤＢ側ではそのまま保持し、
抽出時には前記ＤＷＨ―ＤＢ側にて、別途専用の一意となるコードを割り振るデータ蓄積システムにおいて、
指定期間に合致する初期受付日付の全レコードをすべて検索し、案件が重複した場合は案件コードと初期受付日付にて名寄せを行い、最新データのみ抽出・表示することを特徴とする。 The second data storage system of the present invention does not perform data copy of all cases, but determines only the case data that has been newly added or changed since the previous copy, based on the record last registration update date. To extract the target,
If it is update data, the data before update is also kept on the DWH-DB side,
At the time of extraction, in the data storage system that allocates a dedicated unique code separately on the DWH-DB side,
All records of the initial reception date that match the specified period are searched. If there are duplicate cases, name identification is performed based on the case code and the initial reception date, and only the latest data is extracted and displayed.

本発明の第一のデータ管理方法は、全案件のデータコピーを実施せず、前回のコピー時より新規追加または変更のあった案件データについてのみコピーするよう、レコードの最終登録更新日付にて判定を行い、対象を抽出し、
更新データであった場合、更新前状態のデータもＤＷＨ−ＤＢ側ではそのまま保持し、
抽出時には前記ＤＷＨ―ＤＢ側にて、別途専用の一意となるコードを割り振るデータ管理方法において、
指定された対象日付以前の最終更新日付にマッチするレコードをすべて検索し、案件が重複した場合は案件コードにて名寄せを行い、最新データのみ抽出・表示することを特徴とする。 In the first data management method of the present invention, the data of all cases is not copied, and only the case data that has been newly added or changed since the previous copy is copied based on the last registration update date of the record. To extract the target,
If it is update data, the data before update is also kept on the DWH-DB side,
In the data management method of allocating a dedicated unique code separately on the DWH-DB side at the time of extraction,
All records that match the last update date before the specified target date are searched, and if there are duplicated items, name identification is performed with the item code, and only the latest data is extracted and displayed.

本発明の第二のデータ管理方法は、全案件のデータコピーを実施せず、前回のコピー時より新規追加または変更のあった案件データについてのみコピーするよう、レコードの最終登録更新日付にて判定を行い、対象を抽出し、
更新データであった場合、更新前状態のデータもＤＷＨ−ＤＢ側ではそのまま保持し、
抽出時には前記ＤＷＨ―ＤＢ側にて、別途専用の一意となるコードを割り振るデータ管理方法において、
指定期間に合致する初期受付日付の全レコードをすべて検索し、案件が重複した場合は案件コードと初期受付日付にて名寄せを行い、最新データのみ抽出・表示することを特徴とする。 In the second data management method of the present invention, the data of all cases is not copied, and only the case data that has been newly added or changed since the previous copy is copied based on the last registration update date of the record. To extract the target,
If it is update data, the data before update is also kept on the DWH-DB side,
In the data management method of allocating a dedicated unique code separately on the DWH-DB side at the time of extraction,
All records of the initial reception date that match the specified period are searched. If there are duplicate cases, name identification is performed based on the case code and the initial reception date, and only the latest data is extracted and displayed.

本発明によれば、物理ディスク内の蓄積データを効率化することや、この蓄積データより過去の特定時点での最新データ又は期間中の集計データを仮想的に作成することができる。 According to the present invention, the accumulated data in the physical disk can be made more efficient, and the latest data at a specific point in the past or the total data during the period can be virtually created from the accumulated data.

本発明の実施の形態に係るシステム構成を示す図である。It is a figure which shows the system configuration | structure which concerns on embodiment of this invention. 本発明の実施の形態に係るシステムの全体の動作を示すフローチャートである。It is a flowchart which shows the operation | movement of the whole system which concerns on embodiment of this invention.

以下、本発明の第一の実施の形態について図面を参照して詳細に説明する。図１に示す本実施の形態におけるデータ蓄積システムは、応対履歴データ入力用端末１０と、応対履歴データベースサーバ２０と、分析用データ蓄積データベースサーバ３０と、分析用端末４０とから構成されている。これらの応対履歴データ入力用端末１０と、応対履歴データベースサーバ２０と、分析用データ蓄積データベースサーバ３０および分析用端末４０は、プログラム制御により動作し、ローカルエリアネットワーク１００を介して相互に接続されている。 Hereinafter, a first embodiment of the present invention will be described in detail with reference to the drawings. The data storage system in the present embodiment shown in FIG. 1 includes a response history data input terminal 10, a response history database server 20, an analysis data storage database server 30, and an analysis terminal 40. These response history data input terminal 10, response history database server 20, analysis data storage database server 30 and analysis terminal 40 operate under program control and are connected to each other via local area network 100. Yes.

応対履歴データ入力用端末１０は、コンタクトセンターに設置されるパーソナルコンピュータ等の情報処理装置であり、コンタクトセンターに電話で寄せられる相談・苦情等の内容を入力し、この内容をネットワークを介して応対履歴データベースサーバ２０へ登録する機能を有する。また同様に、応対履歴データベースサーバ２０にアクセスして登録された案件の検索表示又は一度登録されたデータの更新を行うことも可能とする。 The response history data input terminal 10 is an information processing device such as a personal computer installed in the contact center, and inputs contents such as consultations and complaints received by telephone to the contact center, and receives the contents via the network. It has a function of registering in the history database server 20. Similarly, it is possible to access the response history database server 20 to search and display registered cases or update registered data once.

応対履歴データベースサーバ２０は、データセンターに設置されているサーバ等の情報処理装置であり、ミドルウェアであるリレーショナルデータベースがインストールされている。コンタクトセンターにて実施した応対の全履歴をリレーショナルデータベースへ格納する機能を有する。なお、データは１つの案件につき１レコード保有し、案件番号によって一意に識別される。 The response history database server 20 is an information processing apparatus such as a server installed in a data center, and has a relational database as middleware installed therein. It has a function to store the entire history of responses conducted at the contact center in a relational database. The data has one record per case and is uniquely identified by the case number.

分析用データ蓄積データベースサーバ３０は、マーケティング等を実施する部署に設置される、サーバ等の情報処理装置であり、ミドルウェアであるリレーショナルデータベースがインストールされている。業務時間外に応対履歴データベースサーバ２０のデータ内容を抽出し、格納する機能を有する。このため、業務中はデータの更新が発生せず、検索・集計専用として使用される。 The analysis data storage database server 30 is an information processing apparatus such as a server installed in a department that performs marketing or the like, and a relational database that is middleware is installed therein. It has a function of extracting and storing data contents of the response history database server 20 outside business hours. For this reason, data is not updated during work, and is used exclusively for search and aggregation.

分析用端末４０は、マーケティング等を実施する部署に設置される、パーソナルコンピュータ等の情報処理装置であり、分析用データ蓄積データベースサーバ３０にアクセスして項目と時系列ごとの分析を自由に行えるＯＬＡＰ機能を有する。この分析により、たとえばある地域や季節におけるある年代はある商品への興味が多い傾向にある等、データの傾向によりマーケティングや顧客サービスへの活用に利用する。 The analysis terminal 40 is an information processing apparatus such as a personal computer installed in a department that conducts marketing, etc., and is an OLAP that can access the analysis data storage database server 30 and freely analyze items and time series. It has a function. Based on this analysis, for example, there is a tendency that a certain age in a certain region or season has a lot of interest in a certain product, and the data is used for marketing or customer service depending on the tendency of data.

次に、図１〜図２を参照して本実施の形態の動作について詳細に説明する。
応対履歴データ入力用端末１０は、コンタクトセンター等において顧客より相談・苦情等の電話を受けると、応対履歴データベースサーバ２０にアクセスし、かかってきた電話の番号にマッチする顧客に係る情報（氏名、住所、電話番号、年代等）を抽出し、画面表示する（ステップＡ１、Ａ２、Ａ３）。なお新規顧客の場合は表示しないため、新規顧客として手作業による入力を行う。 Next, the operation of the present embodiment will be described in detail with reference to FIGS.
When receiving a phone call for consultation / complaint from a customer at a contact center or the like, the response history data input terminal 10 accesses the response history database server 20 and receives information related to the customer that matches the number of the incoming call (name, name, Address, telephone number, age, etc.) are extracted and displayed on the screen (steps A1, A2, A3). In addition, since it is not displayed in the case of a new customer, manual input is performed as a new customer.

その後、電話における相談・苦情の内容、回答内容を入力し（ステップＡ４）、応対履歴データベースサーバ２０へ登録する。なお登録時には、システムにより自動で案件番号、初回受付日時、最終登録日時が付与される（ステップＡ５、Ａ６）。 Thereafter, the contents of the consultation / complaint over the telephone and the contents of the answer are input (step A4) and registered in the response history database server 20. At the time of registration, the system automatically assigns the case number, the first reception date and time, and the last registration date and time (steps A5 and A6).

上記により登録・更新された応対履歴データベースサーバ２０に格納されたデータについて、毎日決められた時間（業務運用していない時間帯）に分析用データ蓄積データベースサーバ３０へコピーを実施するが、この際に以下の取り決めによりデータの抽出及び蓄積を行う（ステップＡ７、Ａ８、Ａ９）。 The data stored in the response history database server 20 registered and updated as described above is copied to the data storage database server 30 for analysis at a time determined every day (time period when the business is not operated). The data is extracted and stored according to the following rules (steps A7, A8, A9).

（１）全案件のデータコピーを実施せず、前回のコピー時より新規追加または変更のあった案件データについてのみコピーするよう、レコードの最終登録更新日付にて判定を行い、対象を抽出する。 (1) The data is not copied for all items, and the record is determined based on the last registration update date of the record so as to copy only the item data that has been newly added or changed since the previous copy, and the target is extracted.

（２）更新データであった場合、更新前状態のデータもＤＷＨ−ＤＢ側ではそのまま保持する。 (2) If it is update data, the data in the pre-update state is also retained on the DWH-DB side.

（３）抽出時にはＤＷＨ―ＤＢ側にて、別途専用の一意となるコードを割り振る（案件が更新データであった場合、案件番号が重複して一意識別できなくなるため）。 (3) At the time of extraction, a dedicated unique code is allocated on the DWH-DB side (if the case is update data, the case number is duplicated and cannot be uniquely identified).

このようにして分析用データ蓄積データベースサーバ３０に蓄積されたデータについて、分析用端末４０よりアクセスして分析又は集計を行うが、この分析対象の抽出は以下の方法にて行う（ステップＡ１０、Ａ１１、Ａ１２、Ａ１３）。 The data stored in the analysis data storage database server 30 in this way is accessed from the analysis terminal 40 for analysis or totalization. The analysis target is extracted by the following method (steps A10 and A11). , A12, A13).

１）指定日での最新全データ表示
指定された対象日付以前（指定日を含める）の最終更新日付にマッチするレコードをすべて検索し、案件が重複した場合は案件コードにて名寄せを行い、最新データのみ抽出・表示する。この抽出データが指定日時点での最新データとなるため、これを基にした各種分析を行う。 1) Displaying the latest all data on the specified date Search for all records that match the last update date before the specified target date (including the specified date). Extract and display data only. Since this extracted data becomes the latest data as of the designated date, various analyzes are performed based on this.

２）指定期間内におけるデータの集計
指定期間に合致する初期受付日付の全レコードをすべて検索し、案件が重複した場合は案件コードと初期受付日付にて名寄せを行い、最新データのみ抽出・表示する。これを基にした検索又は集計を行う。
最終更新日付で指定期間内での名寄せを行った場合、別期間に同一案件が重複した場合に別途計上してしまい、トータル件数が案件数と合致しないため、初期受付日付を使用する。 2) Aggregation of data within the specified period All records with the initial reception date that match the specified period are searched. If there are duplicates, the name is sorted by the case code and the initial reception date, and only the latest data is extracted and displayed. . Search or tabulate based on this.
If name identification is performed within the specified period on the last update date, it will be separately recorded if the same case is duplicated in another period, and the total number of cases does not match the number of cases, so the initial reception date is used.

上記により、データ保持期間を限定しないシステム導入から全期間全日の最新状態を仮想的に保有し、分析及び集計の対象とすることを可能とする。 As described above, it is possible to virtually hold the latest state of the entire day from the introduction of the system without limiting the data retention period, and make it the target of analysis and aggregation.

上記の方法によってデータを蓄積した場合に、全データを保有する場合と比較した例として、使用するディスクサイズの試算を行った。 When data was accumulated by the above method, a trial calculation of the disk size to be used was performed as an example compared with the case where all data was retained.

なお、前提条件は以下とする。
・１ヶ月の案件数を２７７，５００（件）、１年間の案件数を２，９１８，０００（件）とする
・案件１件あたり容量想定を５ＫＢとする
・初回登録された案件が更新される割合を３０％とする The prerequisites are as follows.
・ The number of cases per month is 277,500 (cases), the number of cases per year is 2,918,000 (cases) ・ Capacity assumption per case is 5KB ・ The first registered cases are updated 30%

(１) 全データを格納した場合の蓄積データ量
前掲および下記の条件を前提とする。
・過去データについては、日時：６０日分、月次：２４か月分、年次：５年分を保有する
・保有期限切れデータについては、削除する
ア応対履歴ＤＢ１日の増加量
１ヶ月の案件数 ÷ 30（日） × 案件1件あたり容量想定
より、
2,918,000（件） ÷ 360（日） × 5（KB）＝ 40527(KB) ＝ 39.6（MB）
イ過去データ量の想定値（日次）
日次で保存するデータは、前日時点の最新分データも含め日に１度のデータが６０日分蓄積される。
１日目：[ 39.6 (MB) × 1(日)] ＝ 39.6（MB）
２日目：[ 39.6 (MB) × 2(日)] ＝ 79.2（MB）
・・・
1741日目：｛ 39.6 (MB) × 1741(日) ｝＝ 68943.6（MB）
・・・
1800日目：｛ 39.6 (MB) × 1800(日) ｝＝ 71280（MB）
このうち最大想定データは、1741日目〜1800日目の合計値となるので、
68943.6（MB）＋・・・＋ 71280（MB）＝４１０８．１（ＧＢ）となる。
ウ過去データ量の想定値（月次）
月次で保存する過去データは月に1度のデータが２年間(２４ヶ月)分蓄積される。
１ヶ月目：[ 39.6 (MB) × 30(日) ] ＝ 1188（MB）
２ヶ月目：[ 39.6 (MB) × 60(日) ] ＝ 2376（MB）
・・・
37ヶ月目：｛ 39.6 (MB) × 1410(日) ｝＝ 55836（MB）
・・・
60ヶ月目：｛ 39.6 (MB) × 1800(日) ｝＝ 71280（MB）
このうち、最大想定データは、37ヶ月〜60ヶ月目の合計値となるので、
55836（MB）＋・・・＋ 71280（MB）＝1489.6（GB）となる。
エ過去データ量の想定値（年次）
年次で保存する過去データは年に１度のデータが５年間分蓄積される
１年目：[ 39.6 (MB) × 360（日）] ＝ 14256（MB）
２年目：｛ 39.6 (MB) × 720（日）｝＝ 28512（MB）
・・・
５年目：｛39.6 (MB) × 1800（日）｝＝ 71280（MB）
最大想定データは、この１〜５年の全合計値となることより、
14256 (MB) ＋ 28512 (MB) ＋・・・＋ 71280（MB）＝ 208.8（GB）
となる。
オＤＷＨデータ量の想定値（最大値）
上記イ〜エの合計値によりＤＷＨデータ量想定値が求められる。
4108.1(GB)+ 1489.6(GB) + 208.8(GB) = 5806.5(GB) (1) Amount of accumulated data when all data is stored The above conditions and the following conditions are assumed.
・ For past data, date / time: 60 days, monthly: 24 months, annual: 5 years are retained ・ Delete expired data a) Response history DB daily increase 1 month projects Number ÷ 30 (days) × Capacity assumption per project
2,918,000 (cases) ÷ 360 (days) × 5 (KB) = 40527 (KB) = 39.6 (MB)
B Estimated value of past data volume (daily)
Data stored daily is accumulated for 60 days once a day, including the latest data as of the previous day.
Day 1: [39.6 (MB) x 1 (day)] = 39.6 (MB)
Day 2: [39.6 (MB) x 2 (Sun)] = 79.2 (MB)
...
Day 1741: {39.6 (MB) x 1741 (Sunday)} = 68943.6 (MB)
...
Day 1800: {39.6 (MB) x 1800 (Sun)} = 71280 (MB)
Of these, the maximum expected data is the total value from the 1741st day to the 1800th day,
68943.6 (MB) +... +71280 (MB) = 4108.1 (GB).
C Estimated value of past data volume (monthly)
As for past data to be stored monthly, data is accumulated once a month for two years (24 months).
First month: [39.6 (MB) x 30 (days)] = 1188 (MB)
Month 2: [39.6 (MB) x 60 (Sun)] = 2376 (MB)
...
Month 37: {39.6 (MB) x 1410 (Sun)} = 55836 (MB)
...
60th month: {39.6 (MB) x 1800 (Sun)} = 71280 (MB)
Of these, the maximum expected data is the total for the 37th to 60th months,
55836 (MB) + ... + 71280 (MB) = 1489.6 (GB).
D Expected value of past data volume (annual)
The past data to be saved annually is accumulated once a year for five years. First year: [39.6 (MB) x 360 (days)] = 14256 (MB)
Second year: {39.6 (MB) × 720 (Sun)} = 28512 (MB)
...
5th year: {39.6 (MB) x 1800 (Sun)} = 71280 (MB)
The maximum expected data is the total value for this 1-5 year,
14256 (MB) + 28512 (MB) + ... + 71280 (MB) = 208.8 (GB)
It becomes.
E Estimated value (maximum value) of DWH data volume
The estimated DWH data amount is obtained from the total value of the above (i) to (d).
4108.1 (GB) + 1489.6 (GB) + 208.8 (GB) = 5806.5 (GB)

（２）本発明を使用した場合の蓄積データ量
前掲の条件を前提とする。
カ応対履歴ＤＢ１日の増加量
１ヶ月の案件数 ÷ 30（日） × 案件1件あたり容量想定
より、
2,918,000（件） ÷ 360（日） × 5（KB）＝ 40527(KB) ＝ 39.6(MB)
キデータ量の想定値（初回登録データ）
キより、新規で追加される想定データ5年間(1800日)分は、イと同様の結果となり、
(39.6(MB) × 1800(日)) ＝ 70.0(GB)となる。
クデータ量の想定値（既存案件の更新データ）
既存の応対履歴データが更新された場合、データ蓄積データベースサーバ側では更新前のデータと併せて保持する必要がある。
作成されたレコードが更新される割合（30 %）をイの結果に乗算し、
70.0(GB) × 30% ＝ 21.0(GB)となる。
ケＤＷＨデータ量の想定値（最大値）
上記キ、クの合計値によりＤＷＨデータ量想定値が求められる。
70.0（GB）＋ 21.0（GB）＝ 91.0(GB)
従って、上記例ではこれまでの手法５８０６．５（ＧＢ）に対し９１．０（ＧＢ）となり、１／６３．８程度まで効率化でき、かつ指定日が全日において可能となる結果が得られた。 (2) Accumulated data amount when the present invention is used The above conditions are assumed.
Increase in daily response history DB Number of projects per month ÷ 30 (days) × Capacity assumption per project
2,918,000 (cases) ÷ 360 (days) × 5 (KB) = 40527 (KB) = 39.6 (MB)
Expected value of data volume (initial registration data)
From Ki, the expected data to be newly added for 5 years (1800 days) will be the same result as i,
(39.6 (MB) x 1800 (day)) = 70.0 (GB).
Estimated value of data volume (update data for existing projects)
When existing response history data is updated, the data storage database server side needs to store it together with the data before update.
Multiply the result of i by the rate at which the records created are updated (30%)
70.0 (GB) x 30% = 21.0 (GB).
D Expected value of DWH data volume (maximum value)
The estimated DWH data amount is obtained from the total value of the keys.
70.0 (GB) + 21.0 (GB) = 91.0 (GB)
Therefore, in the above example, the result is 91.0 (GB) compared to the conventional method 5806.5 (GB), and the efficiency can be improved to about 1 / 63.8, and the designated date can be obtained in all days. .

なお、上述する各実施の形態は、本発明の好適な実施の形態であり、本発明の要旨を逸脱しない範囲内において種々変更実施が可能である。例えば、各装置の機能を実現するためのプログラムを各装置に読込ませて実行することにより各装置の機能を実現する処理を行ってもよい。さらに、そのプログラムは、コンピュータ読み取り可能な記録媒体であるＣＤ−ＲＯＭまたは光磁気ディスクなどを介して、または伝送媒体であるインターネット、電話回線などを介して伝送波により他のコンピュータシステムに伝送されてもよい。また、各装置の機能が他の装置によりまとめて実現されたり、追加の装置により機能が分散されて実現される形態も本発明の範囲内である。 Each of the above-described embodiments is a preferred embodiment of the present invention, and various modifications can be made without departing from the scope of the present invention. For example, a process for realizing the function of each device may be performed by causing each device to read and execute a program for realizing the function of each device. Further, the program is transmitted to another computer system by a transmission wave via a computer-readable recording medium such as a CD-ROM or a magneto-optical disk, or via a transmission medium such as the Internet or a telephone line. Also good. In addition, it is also within the scope of the present invention that the functions of each device are realized by other devices collectively or the functions are distributed by additional devices.

本発明は、顧客情報と新たな商品開発等に活用できるデータを蓄積しているシステムでデータウェアハウスの概念を活用しデータ分析を実施する場合で、分析対象データ件数または容量が多い場合に有効となる。 The present invention is effective when data analysis is performed using a data warehouse concept in a system that accumulates customer information and data that can be used for new product development, etc. It becomes.

１０応対履歴データ入力用端末
２０応対履歴データベースサーバ
３０分析用データ蓄積データベースサーバ
４０分析用端末
１００ローカルエリアネットワーク 10 Response History Data Input Terminal 20 Response History Database Server 30 Analysis Data Storage Database Server 40 Analysis Terminal 100 Local Area Network

Claims

Do not copy the data of all the matters, but make a judgment based on the last registration update date of the record so as to copy only the matter data that has been newly added or changed since the previous copy, extract the target,
If it is update data, the data before update is also kept on the DWH-DB side,
At the time of extraction, in the data storage system that allocates a dedicated unique code separately on the DWH-DB side,
A data storage system that searches for all records that match the last update date before the specified target date, and if there are duplicates of items, performs a name identification with the item code and extracts and displays only the latest data.

Do not copy the data of all the matters, but make a judgment based on the last registration update date of the record so as to copy only the matter data that has been newly added or changed since the previous copy, extract the target,
If it is update data, the data before update is also kept on the DWH-DB side,
At the time of extraction, in the data storage system that allocates a dedicated unique code separately on the DWH-DB side,
A data storage system that searches for all records of the initial reception date that match the specified period, and if there are duplicates of items, performs a name identification with the case code and the initial reception date, and extracts and displays only the latest data.

3. The data storage system according to claim 1, wherein a search or a total is performed based on the latest data extracted and displayed.

Do not copy the data of all the matters, but make a judgment based on the last registration update date of the record so as to copy only the matter data that has been newly added or changed since the previous copy, extract the target,
If it is update data, the data before update is also kept on the DWH-DB side,
In the data management method of allocating a dedicated unique code separately on the DWH-DB side at the time of extraction,
A data management method characterized by searching for all records that match the last update date before the specified target date, and extracting and displaying only the latest data by performing name identification when the items are duplicated.

Do not copy the data of all the matters, but make a judgment based on the last registration update date of the record so as to copy only the matter data that has been newly added or changed since the previous copy, extract the target,
If it is update data, the data before update is also kept on the DWH-DB side,
In the data management method of allocating a dedicated unique code separately on the DWH-DB side at the time of extraction,
A data management method characterized in that all records of the initial reception date that match the specified period are searched, and if a case is duplicated, name identification is performed with the case code and the initial reception date, and only the latest data is extracted and displayed.

6. The data management method according to claim 4, wherein a search or a total is performed based on the latest data extracted and displayed.