JP2015072637A

JP2015072637A - Program, information processing device, and information processing method

Info

Publication number: JP2015072637A
Application number: JP2013208574A
Authority: JP
Inventors: 崇志西谷; Takashi Nishitani; 高大矢和田; Takahiro Yawata; 誠佐々原; Makoto Sasahara; 志乃廣澤; Shino Hirosawa
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2013-10-03
Filing date: 2013-10-03
Publication date: 2015-04-16
Anticipated expiration: 2033-10-03
Also published as: JP6390085B2

Abstract

PROBLEM TO BE SOLVED: To provide a program or the like capable of determining data attributes with high accuracy.SOLUTION: A computer extracts object data to be an analysis object from a data group. The computer determines attributes to be a candidate for the object data and evaluation values of the attributes on the basis of contents of the extracted data, data adjacent to the object data and a position. Further, the computer extracts a position at which the object data is arranged, refers to a storage part storing the attributes and the evaluation values in association with a position at which data is arranged, reads the attributes and the evaluation values of the object data, and determines attributes to be a candidate for the object data and evaluation values of the attributes on the basis of the read attributes and evaluation values.

Description

本発明は、コンピュータに各種処理を実行させるプログラム、情報処理装置、及び、情報処理方法に関する。 The present invention relates to a program for causing a computer to execute various processes, an information processing apparatus, and an information processing method.

従来、文書または各種帳票に表が記載されている場合に、当該表のイメージデータから必要なデータを整理して読み取る表処理方法が提案されている（例えば、特許文献１参照）。 Conventionally, when a table is described in a document or various forms, a table processing method that arranges and reads necessary data from image data of the table has been proposed (see, for example, Patent Document 1).

特開平１０−１１６３１４号公報Japanese Patent Laid-Open No. 10-116314

しかしながら、従来の技術では、データの属性について精度良く決定することができない。 However, the conventional technique cannot accurately determine the data attribute.

一つの側面では、本発明はデータの属性を精度良く決定することが可能なプログラム等を提供することを目的とする。 In one aspect, an object of the present invention is to provide a program or the like that can accurately determine data attributes.

本願に開示するプログラムは、コンピュータに、データ群から解析対象となる対象データを抽出し、抽出した対象データの内容、該対象データに隣接するデータ及び位置に基づき、前記対象データの候補となる属性及び属性の評価値を決定する処理を実行させる。 The program disclosed in the present application extracts target data to be analyzed from a data group to a computer, and based on the content of the extracted target data, data adjacent to the target data, and the attribute that is a candidate for the target data And processing for determining the evaluation value of the attribute.

一つの側面では、データの属性を精度良く決定することが可能となる。 In one aspect, data attributes can be determined with high accuracy.

情報処理装置のハードウェア群を示すブロック図である。It is a block diagram which shows the hardware group of information processing apparatus. 帳票データのイメージを示す説明図である。It is explanatory drawing which shows the image of form data. 帳票データの一部を示す説明図である。It is explanatory drawing which shows a part of form data. 位置テーブルのレコードレイアウトを示す説明図である。It is explanatory drawing which shows the record layout of a position table. 隣接テーブルのレコードレイアウトを示す説明図である。It is explanatory drawing which shows the record layout of an adjacent table. 数値テーブルのレコードレイアウトを示す説明図である。It is explanatory drawing which shows the record layout of a numerical value table. キーワードテーブルのレコードレイアウトを示す説明図である。It is explanatory drawing which shows the record layout of a keyword table. 対象データテーブルのレコードレイアウトを示す説明図である。It is explanatory drawing which shows the record layout of a target data table. 対象データの抽出処理手順を示すフローチャートである。It is a flowchart which shows the extraction process procedure of object data. 対象データの抽出処理手順を示すフローチャートである。It is a flowchart which shows the extraction process procedure of object data. 位置に基づく属性及び評価値決定処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the attribute based on a position, and evaluation value determination processing. 隣接対象データの存否に基づく属性及び評価値決定処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the attribute and evaluation value determination process based on the presence or absence of adjacent object data. 対象データの文字及び数字内容に基づく属性及び評価値決定処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the attribute and evaluation value determination process based on the character and number content of object data. 対象データの文字及び数字内容に基づく属性及び評価値決定処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the attribute and evaluation value determination process based on the character and number content of object data. 合計評価値の算出処理手順を示すフローチャートである。It is a flowchart which shows the calculation process procedure of a total evaluation value. 対象データ名の決定処理手順を示すフローチャートである。It is a flowchart which shows the determination processing procedure of object data name. 対象データ名の決定処理手順を示すフローチャートである。It is a flowchart which shows the determination processing procedure of object data name. 帳票データのイメージを示す説明図である。It is explanatory drawing which shows the image of form data. 実施の形態２に係る対象データテーブルのレコードレイアウトを示す説明図である。It is explanatory drawing which shows the record layout of the object data table which concerns on Embodiment 2. FIG. 設定タブの表示処理手順を示すフローチャートである。It is a flowchart which shows the display processing procedure of a setting tab. 評価値の補正処理手順を示すフローチャートである。It is a flowchart which shows the correction process procedure of an evaluation value. 評価値の補正処理手順を示すフローチャートである。It is a flowchart which shows the correction process procedure of an evaluation value. 上述した形態のコンピュータの動作を示す機能ブロック図である。It is a functional block diagram which shows operation | movement of the computer of the form mentioned above. 実施の形態３に係るコンピュータのハードウェア群を示すブロック図である。FIG. 10 is a block diagram illustrating a hardware group of a computer according to a third embodiment.

実施の形態１
以下実施の形態を、図面を参照して説明する。図１は情報処理装置１のハードウェア群を示すブロック図である。情報処理装置１は例えば、パーソナルコンピュータ、サーバコンピュータ、携帯電話機、スマートフォン、ゲーム機またはＰＤＡ（Personal Digital Assistant）等である。以下では、情報処理装置１をパーソナルコンピュータ１（以下、コンピュータ１という）に適用する例を挙げて説明する。コンピュータ１は制御部としてのＣＰＵ（Central Processing Unit）１１、ＲＡＭ(Random Access Memory)１２、入力部１３、表示部１４、時計部１８、記憶部１５、及び通信部１６等を含む。ＣＰＵ１１は、バス１７を介してハードウェア各部と接続されている。ＣＰＵ１１は記憶部１５に記憶された制御プログラム１５Ｐに従いハードウェア各部を制御する。ＲＡＭ１２は例えばＳＲＡＭ（Static RAM）、ＤＲＡＭ(Dynamic RAM)、フラッシュメモリ等である。ＲＡＭ１２は、記憶部としても機能し、ＣＰＵ１１による各種プログラムの実行時に発生する種々のデータを一時的に記憶する。 Embodiment 1
Hereinafter, embodiments will be described with reference to the drawings. FIG. 1 is a block diagram illustrating a hardware group of the information processing apparatus 1. The information processing apparatus 1 is, for example, a personal computer, a server computer, a mobile phone, a smartphone, a game machine, or a PDA (Personal Digital Assistant). Hereinafter, an example in which the information processing apparatus 1 is applied to a personal computer 1 (hereinafter referred to as a computer 1) will be described. The computer 1 includes a central processing unit (CPU) 11 as a control unit, a random access memory (RAM) 12, an input unit 13, a display unit 14, a clock unit 18, a storage unit 15, a communication unit 16, and the like. The CPU 11 is connected to each part of the hardware via the bus 17. The CPU 11 controls each part of the hardware according to the control program 15P stored in the storage unit 15. The RAM 12 is, for example, SRAM (Static RAM), DRAM (Dynamic RAM), flash memory, or the like. The RAM 12 also functions as a storage unit, and temporarily stores various data generated when the CPU 11 executes various programs.

入力部１３はマウスまたはキーボード、マウスまたはタッチパネル等の入力デバイスであり、受け付けた操作情報をＣＰＵ１１へ出力する。表示部１４は液晶ディスプレイまたは有機ＥＬ（electroluminescence）ディスプレイ等であり、ＣＰＵ１１の指示に従い各種情報を表示する。通信部１６は通信モジュールであり、インターネットまたは公衆回線網等の通信網を介して図示しない他のコンピュータとの間で情報の送受信を行う。 The input unit 13 is an input device such as a mouse or a keyboard, a mouse or a touch panel, and outputs received operation information to the CPU 11. The display unit 14 is a liquid crystal display, an organic EL (electroluminescence) display, or the like, and displays various information according to instructions from the CPU 11. The communication unit 16 is a communication module, and transmits / receives information to / from another computer (not shown) via a communication network such as the Internet or a public line network.

時計部１８は日時をＣＰＵ１１へ出力する。記憶部１５はハードディスクまたは大容量メモリであり、上述した制御プログラム１５Ｐの他、位置テーブル１５１、隣接テーブル１５２、数値テーブル１５３、キーワードテーブル１５４及び対象データテーブル１５５等を含む。なお、本実施形態ではコンピュータ１の記憶部１５に各種テーブルを記憶する例を挙げたがこれに限るものではない。例えば通信網を介して接続される図示しないデータベースサーバ等に上述した各種テーブルを適宜記憶しても良い。この場合、ＣＰＵ１１は、必要に応じてデータベースサーバにアクセスし、データの書き込み及び読み出しを行う。ＣＰＵ１１は、記憶部１５からデータ群を読み出し、制御プログラム１５Ｐに従い各データについて解析を行う。以下詳細を説明する。 The clock unit 18 outputs the date and time to the CPU 11. The storage unit 15 is a hard disk or a large-capacity memory, and includes a position table 151, an adjacent table 152, a numerical table 153, a keyword table 154, a target data table 155, and the like in addition to the control program 15P described above. In the present embodiment, an example in which various tables are stored in the storage unit 15 of the computer 1 is described, but the present invention is not limited to this. For example, the various tables described above may be stored as appropriate in a database server (not shown) connected via a communication network. In this case, the CPU 11 accesses the database server as necessary, and writes and reads data. The CPU 11 reads a data group from the storage unit 15 and analyzes each data according to the control program 15P. Details will be described below.

図２は帳票データのイメージを示す説明図である。帳票データは例えばエクセル（登録商標）等の表計算ソフトウェア、文書入力ソフトウェアまたはＰＤＦ（Portable Document Format：登録商標）等のファイル閲覧ソフトウェアにより表示されるデータ群である。タイトル、項目及び数値データ等の各種データ群が表内にテキスト形式で記述されている。図２の例は企業内従業員の個人就業実績を示している。中央上部に表示されている「個人就業実績」は帳簿データのタイトルであり、タイトル左下には、職員氏名「日本花子」と、「日本花子」を特定するための固有の識別情報（以下、職員ＩＤという）と、が記述されている。 FIG. 2 is an explanatory diagram showing an image of the form data. The form data is, for example, a data group displayed by spreadsheet software such as Excel (registered trademark), document input software, or file browsing software such as PDF (Portable Document Format: registered trademark). Various data groups such as titles, items, and numerical data are described in a text format in the table. The example of FIG. 2 shows the individual employment performance of employees in the company. The “Personal Employment Record” displayed in the upper center is the title of the book data. In the lower left corner of the title is the staff name “Nippon Hanako” and unique identification information for identifying “Nippon Hanako” ID)).

右上隅の「第２開発部」は職員の所属部署を示す。職員名の下側の表には、当該職員の出勤、退勤、所定内労働（以下、所定内という）、残業、休日出勤等の項目と、各項目に対応付けて、出勤時間、退勤時刻、所定内時間、残業時間、休日出勤時間等の数値データが記述されている。下側には項目「総計」に対応付けて、所定内時間、残業時間、休日出勤時間等の合計に係る数値データが記述されている。ＣＰＵ１１は、記憶部１５に記憶された、または、通信部１６を介して受信した帳票データを解析する。なお、本実施形態では帳票データを対象としたが、これに限るものではない。医療カルテデータ、財務データ、または各種統計データ等であっても良い。 The “second development department” in the upper right corner indicates the department to which the staff belongs. The table below the staff name shows items such as attendance, leaving work, scheduled work (hereinafter referred to as designated work), overtime, holiday work, etc. Numerical data such as predetermined hours, overtime hours, and holiday work hours are described. On the lower side, numerical data relating to the total of predetermined hours, overtime hours, holiday work hours, etc. are described in association with the item “total”. The CPU 11 analyzes the form data stored in the storage unit 15 or received via the communication unit 16. In the present embodiment, the form data is targeted, but the present invention is not limited to this. It may be medical chart data, financial data, or various statistical data.

ＣＰＵ１１は、１行分の帳票データを読み込む。１行目の場合、「個人就業実績」及び「第２開発部」が読み込まれる。ＣＰＵ１１は、１行内の単語を順次抽出する。以下では例としてＣＰＵ１１が、「個人就業実績」を抽出し、「個人就業実績」の属性及び評価値を決定する処理を説明する。なお、単語の抽出は文字間の距離が記憶部１５に記憶した所定距離以上の場合、または、文字間に罫線が存在する場合に、単語の区切りであると判断すればよい。なお、単語間の距離は、フォントサイズに応じて適宜の長さ（座標数またはピクセル数）とすればよい。以下では、ＣＰＵ１１が帳票データを読み込み、文字認識により単語間で分割されたタイトル、項目及び数値データ等の内、解析対象となるデータを対象データという。 The CPU 11 reads form data for one line. In the case of the first line, “individual employment record” and “second development department” are read. The CPU 11 sequentially extracts words in one line. Hereinafter, as an example, a process in which the CPU 11 extracts “individual employment performance” and determines the attribute and evaluation value of “individual employment performance” will be described. In addition, what is necessary is just to judge that the extraction of a word is a word division | segmentation, when the distance between characters is more than the predetermined distance memorize | stored in the memory | storage part 15, or when a ruled line exists between characters. The distance between words may be an appropriate length (the number of coordinates or the number of pixels) according to the font size. Hereinafter, data to be analyzed among titles, items, numerical data, and the like divided between words by character recognition by the CPU 11 is referred to as target data.

図３は帳票データの一部を示す説明図である。項目「退勤」、「所定内」及び「残業」についてそれぞれ退勤時刻、所定内時間及び残業時間の数値データが記憶されている。例えば一列目の数値データでは、退勤時刻が17時15分、所定内時間が8時間、残業時間が０時間であることが理解できる。２つ目の対象データは数値データ一列目の残業時間「０．００」を例として説明する。ＣＰＵ１１は、数値データ「０．００」の属性及び評価値を決定する。ＣＰＵ１１は、１行分の単語を順次抽出する。ＣＰＵ１１は、17:15、8.00及び0.00を検出し、そのうち0.00を抽出し解析を行う。なお、左上の隣接する対象データ「所定内」、左で隣接する対象データ「8.00」及び上で隣接する対象データ「残業」についての属性及び評価値は既に解析済みであるものとする。 FIG. 3 is an explanatory diagram showing a part of the form data. For the items “leaving work”, “within a predetermined time”, and “overtime”, numerical data of the work time, the predetermined time, and the overtime time are stored. For example, in the numerical data in the first column, it can be understood that the working time is 17:15, the predetermined time is 8 hours, and the overtime time is 0 hours. The second target data will be described by taking the overtime “0.00” in the first column of numerical data as an example. The CPU 11 determines the attribute and evaluation value of the numerical data “0.00”. The CPU 11 sequentially extracts words for one line. The CPU 11 detects 17:15, 8.00, and 0.00, and extracts and analyzes 0.00. It is assumed that the attributes and evaluation values have already been analyzed for the target data “within predetermined” on the upper left, the target data “8.00” adjacent on the left, and the target data “overtime” adjacent on the upper side.

図４は位置テーブル１５１のレコードレイアウトを示す説明図である。位置テーブル１５１は座標領域フィールド、属性フィールド及び評価値フィールド等を含む。座標領域フィールドには帳票内の対象データが記述されている領域の座標が記憶されている。本実施形態では帳票のファイルの左上端を原点とし、右に向かう方向をＸ軸正方向とし、下に向かう方向をＹ軸正方向とする。また座標領域は対象データが存在する領域の左上の座標及び右下の座標を記憶しておけばよい。 FIG. 4 is an explanatory diagram showing a record layout of the position table 151. The position table 151 includes a coordinate area field, an attribute field, an evaluation value field, and the like. The coordinate area field stores the coordinates of the area in which the target data in the form is described. In this embodiment, the upper left corner of the form file is the origin, the rightward direction is the X-axis positive direction, and the downward direction is the Y-axis positive direction. The coordinate area may store the upper left coordinates and lower right coordinates of the area where the target data exists.

属性フィールドには、座標領域に対応付けて属性が記憶されている。属性は、対象データの性質または特徴を示すものである。本実施形態では、タイトル、項目、数値データ及び所属等を一例として用いている。後述するが数値データにはさらに下位属性として時刻、電話番号、価格、職員ＩＤ、時間、日数、日付、数値、西暦等がある。評価値フィールドには、座標領域に対応付けて対象データの属性の確からしさを評価する値である評価値が記憶されている。本実施形態では値が大きいほど、属性の確からしさが向上するものとして説明する。 In the attribute field, attributes are stored in association with coordinate areas. The attribute indicates the property or characteristic of the target data. In this embodiment, titles, items, numerical data, affiliations, and the like are used as examples. As will be described later, numerical data further includes time, telephone number, price, staff ID, time, number of days, date, numerical value, year, etc. as lower attributes. The evaluation value field stores an evaluation value that is a value for evaluating the likelihood of the attribute of the target data in association with the coordinate area. In the present embodiment, the description will be made assuming that the greater the value, the more accurate the attribute.

位置テーブル１５１の内容は適宜入力部１３から、値を設定することが可能である。例えば、上部中段付近の座標領域には、一般にタイトルが記述されることが多い。従って、当該領域に対象データが存在する場合には属性が「タイトル」、評価値は「３」と決定される。ＣＰＵ１１は、対象データの中心点または領域を求める。ＣＰＵ１１は、位置テーブル１５１を参照し、中心点が属する座標領域を抽出する。ＣＰＵ１１は、座標領域に対応する属性及び評価値を読み出す。例えば、上述した「個人就業実績」の場合、ＣＰＵ１１は、中心点及び座標領域から、属性「タイトル」及び「評価値」を読み出す。また対象データ「0.00」の場合も、中心点及び座標領域から対応する属性「数値データ」及び評価値「１」を読み出す。なお、本実施形態では対象データの中心点を用いる例を挙げたがこれに限るものではなく、対象データの任意の座標を用いても良い。またＣＰＵ１１は、対象データの座標領域と、位置テーブルの座標領域フィールドに記憶された座標領域との重複する座標数を計数し、最も計数値が多い座標領域に対応する属性及び評価値を選択するようにしても良い。 The contents of the position table 151 can be appropriately set from the input unit 13. For example, a title is often written in a coordinate area near the upper middle stage. Therefore, when the target data exists in the area, the attribute is determined as “title” and the evaluation value is determined as “3”. CPU11 calculates | requires the center point or area | region of object data. The CPU 11 refers to the position table 151 and extracts a coordinate area to which the center point belongs. The CPU 11 reads the attribute and evaluation value corresponding to the coordinate area. For example, in the case of the above-mentioned “individual work performance”, the CPU 11 reads the attributes “title” and “evaluation value” from the center point and the coordinate area. In the case of the target data “0.00”, the corresponding attribute “numerical data” and evaluation value “1” are read from the center point and the coordinate area. In the present embodiment, an example in which the center point of the target data is used has been described. However, the present invention is not limited to this, and arbitrary coordinates of the target data may be used. The CPU 11 counts the number of overlapping coordinates between the coordinate area of the target data and the coordinate area stored in the coordinate area field of the position table, and selects an attribute and an evaluation value corresponding to the coordinate area having the largest count value. You may do it.

図５は隣接テーブル１５２のレコードレイアウトを示す説明図である。隣接テーブル１５２には左の隣接対象データフィールド、左上の隣接対象データフィールド、上の隣接対象データフィールド及び属性評価値フィールド等を含む。隣接する対象データ(以下、場合により隣接対象データと略す)の有無に対応付けて属性及び評価値が記憶されている。例えば、左、左上及び上のどこにも隣接対象データが存在しない場合は、タイトルである可能性が高いことから、属性「タイトル」に評価値「１．０」が記憶されている。なお、本実施形態では左、左上及び上の隣接対象データの有無を利用する形態を挙げたが、これに限るものではない。全方向の隣接対象データを利用するほか、右上の隣接対象データ等、少なくとも一つの隣接対象データを利用するものであればよい。 FIG. 5 is an explanatory diagram showing a record layout of the adjacent table 152. The adjacent table 152 includes a left adjacent target data field, an upper left adjacent target data field, an upper adjacent target data field, an attribute evaluation value field, and the like. Attributes and evaluation values are stored in association with the presence / absence of adjacent target data (hereinafter abbreviated as adjacent target data in some cases). For example, when there is no adjacent target data anywhere on the left, upper left, and above, there is a high possibility of being a title, and therefore an evaluation value “1.0” is stored in the attribute “title”. In the present embodiment, the form using the presence / absence of adjacent target data on the left, upper left and upper is described, but the present invention is not limited to this. In addition to using the adjacent target data in all directions, any data that uses at least one adjacent target data such as the upper right adjacent target data may be used.

上述した「個人就業実績」の場合、ＣＰＵ１１は、隣接する対象データが存在しないため、属性「タイトル」及び評価値「１．０」を読み出す。また「０．００」の場合、左、左上及び上の隣接対象データが存在するため、ＣＰＵ１１は、属性「数値データ」及び評価値「１．０」が読み出される。なお、隣接テーブル１５２の内容は入力部１３から適宜変更することが可能である。 In the case of the above-mentioned “individual work record”, the CPU 11 reads the attribute “title” and the evaluation value “1.0” because there is no adjacent target data. In the case of “0.00”, the left, upper left, and upper adjacent target data exist, so the CPU 11 reads the attribute “numerical data” and the evaluation value “1.0”. Note that the contents of the adjacent table 152 can be changed as appropriate from the input unit 13.

図６は数値テーブル１５３のレコードレイアウトを示す説明図である。数値テーブル１５３は、表示形式フィールド、属性フィールド、及び評価値フィールド等を含む。表示形式フィールドには、数値、記号及び条件等が記憶されている。属性フィールドには、表示形式に対応付けて、属性「数値データ」の下位属性である時刻、電話番号、価格、職員ＩＤ、時間、日数等が記憶されている。評価値フィールドには表示形式に対応付けて、評価値が記憶されている。例えば、表示形式「：」を含む対象データの場合、時刻の可能性が高いことから属性は「時刻」、評価値は「２」と記憶されている。 FIG. 6 is an explanatory diagram showing a record layout of the numerical value table 153. The numerical value table 153 includes a display format field, an attribute field, an evaluation value field, and the like. In the display format field, numerical values, symbols, conditions, and the like are stored. The attribute field stores time, telephone number, price, staff ID, time, number of days, and the like, which are lower attributes of the attribute “numerical data”, in association with the display format. An evaluation value is stored in the evaluation value field in association with the display format. For example, in the case of target data including the display format “:”, since the possibility of time is high, the attribute is stored as “time” and the evaluation value is stored as “2”.

また、表示形式の内条件である「：の右の数値が６０以下」は例えば数値データ「２：５６」の「５６」が６０以下の条件を満たす場合、時刻2時56分である可能性が高く、属性は「時刻」、評価値は「１」と記憶されている。その他、「am」、「pm」を対象データが含む場合も属性は「時刻」と判断される。表示形式「−」を含む場合、電話番号である可能性が高いことから、属性は「電話番号」、評価値は「１」と記憶されている。また表示形式に係る条件「最初の数字が０」は、例えば数値データ「０９０−ＸＸＸ―ＸＸＸ」の場合、当該条件を満たすことから電話番号を示す可能性が高く、属性は「電話番号」、評価値は「１」と記憶されている。 Further, the display condition “: numeric value to the right of“: 60 or less ”may be, for example, 2:56 when“ 56 ”of numeric data“ 2:56 ”satisfies the condition of 60 or less. Is high, the attribute is “time”, and the evaluation value is “1”. In addition, when the target data includes “am” and “pm”, the attribute is determined to be “time”. When the display format “-” is included, since it is highly likely that the number is a telephone number, the attribute is stored as “telephone number” and the evaluation value is stored as “1”. In addition, the condition “first number is 0” related to the display format, for example, in the case of numerical data “090-XXX-XXX”, there is a high possibility of indicating a telephone number because the condition is satisfied, and the attribute is “phone number”, The evaluation value is stored as “1”.

表示形式「＄」または「￥」を対象データが含む場合、価格である可能性が高いことから、属性は「価格」、評価値は「３」と記憶されている。表示形式「ＩＤ」を対象データが含む場合、職員ＩＤである可能性が高いことから、属性「職員ＩＤ」、評価値「３」が記憶されている。表示形式「.」を対象データが含む場合、時間または日数である可能性が高いことから属性は「時間」または「日数」が記憶され、評価値はそれぞれ「２」、「１」と記憶されている。 When the target data includes the display format “$” or “¥”, since the possibility of the price is high, the attribute is stored as “price” and the evaluation value is stored as “3”. When the target data includes the display format “ID”, the attribute “staff ID” and the evaluation value “3” are stored because there is a high possibility of the staff ID. When the target data includes the display format “.”, The attribute is stored as “hour” or “days” because the possibility of being the time or the number of days is high, and the evaluation values are stored as “2” and “1”, respectively. ing.

ＣＰＵ１１は、対象データが数値を含むか否かを判断する。ＣＰＵ１１は、抽出した対象データを一字毎に分割する。ＣＰＵ１１は、分割した字の総数及び、数値の数を計数する。ＣＰＵ１１は、総数に対する数値の割合が６０％以上の場合、対象データは数値データであると判断する。例えば「２：５０」は数字が７５％、記号が２５％であるため、数値データとなる。一方、「平成２５年計画表」は数値の割合が２５％と低いため、数値データとは判断されない。なお、数値の割合により対象データが数値データか否かを判断する例を挙げて説明するが、これに限るものではない。数値が所定数存在する場合に、対象データが数値データと判断するようにしても良い。 The CPU 11 determines whether the target data includes a numerical value. The CPU 11 divides the extracted target data for each character. The CPU 11 counts the total number of divided characters and the number of numerical values. When the ratio of the numerical value to the total number is 60% or more, the CPU 11 determines that the target data is numerical data. For example, “2:50” is numeric data because the number is 75% and the symbol is 25%. On the other hand, the “2013 plan table” is not determined as numerical data because the numerical ratio is as low as 25%. An example of determining whether the target data is numerical data based on a numerical ratio will be described. However, the present invention is not limited to this. When a predetermined number of numerical values exist, the target data may be determined as numerical data.

上述の「個人就業実績」の場合、ＣＰＵ１１は、数値データと判断しない。上述の「０．００」の場合、ＣＰＵ１１は、数値の割合が所定値よりも高いため、数値データと判断する。そしてＣＰＵ１１は、数値テーブル１５３を参照し、表示形式に一致するデータが存在するか否かを判断する。図６の例では「.」を含むため、属性は「時刻」、評価値は「２」となる。また、属性は「日数」、評価値は「１」となる。すなわち対象データ「０．００」の属性の第１候補は時刻、第２候補は日数となる。なお、数値テーブル１５３の記憶内容は、入力部１３から適宜の値を設定することが可能である。 In the case of the above “individual work performance”, the CPU 11 does not determine that the data is numerical data. In the case of “0.00” described above, the CPU 11 determines that the numerical value data because the ratio of numerical values is higher than a predetermined value. Then, the CPU 11 refers to the numerical value table 153 and determines whether there is data that matches the display format. Since the example of FIG. 6 includes “.”, The attribute is “time” and the evaluation value is “2”. The attribute is “days” and the evaluation value is “1”. That is, the first candidate of the attribute of the target data “0.00” is the time, and the second candidate is the number of days. Note that the storage contents of the numerical value table 153 can be set to appropriate values from the input unit 13.

図７はキーワードテーブル１５４のレコードレイアウトを示す説明図である。キーワードテーブル１５４はキーワードフィールド、属性フィールド及び評価値フィールド等を含む。キーワードフィールドには、対象データ中に含まれるキーワードが記憶されている。例えば、就業実績、事業部、残業、山田等である。属性フィールドには、キーワードに対応付けてタイトル、所属、項目、氏名等の属性が記憶されている。評価値フィールドには、キーワードに対応付けて評価値が記憶されている。ＣＰＵ１１は、対象データが数値データでないと判断した場合、キーワードテーブル１５４を参照し、対象データ中にキーワードが含まれているか否かを判断する。 FIG. 7 is an explanatory diagram showing a record layout of the keyword table 154. The keyword table 154 includes a keyword field, an attribute field, an evaluation value field, and the like. In the keyword field, keywords included in the target data are stored. For example, working results, business divisions, overtime, Yamada, etc. In the attribute field, attributes such as title, affiliation, item, and name are stored in association with the keyword. In the evaluation value field, an evaluation value is stored in association with the keyword. When the CPU 11 determines that the target data is not numerical data, the CPU 11 refers to the keyword table 154 and determines whether or not a keyword is included in the target data.

ＣＰＵ１１は、キーワードが含まれていると判断した場合、キーワードに対応する属性及び評価値を読み出す。例えば上述した「個人就業実績」の場合、ＣＰＵ１１は、キーワード「就業実績」に対応する属性「タイトル」及び評価値「３」を読み出す。 When the CPU 11 determines that the keyword is included, the CPU 11 reads the attribute and evaluation value corresponding to the keyword. For example, in the case of the above “individual work record”, the CPU 11 reads the attribute “title” and the evaluation value “3” corresponding to the keyword “work record”.

図８は対象データテーブル１５５のレコードレイアウトを示す説明図である。対象データＩＤフィールドには対象データを特定するための固有の識別情報(以下、対象データＩＤという)が記憶されている。抽出データフィールドには、文字認識により抽出された対象データが、対象データＩＤに対応付けて記憶されている。対象データ名フィールドにはタグＩＤに対応付けて対象データの名称（以下、対象データ名）が記憶されている。属性フィールドには対象データＩＤに対応付けて候補となる属性が記憶されている。合計評価値フィールドには、上述した処理により決定した評価値の合計値が、対象データＩＤ及び属性に対応付けて記憶されている。すなわち、対象データのデータ内容、位置及び沈設する対象データの有無に応じて決定された属性及び評価値に基づき、ＣＰＵ１１は、属性の評価値の合計値を算出する。上述した「個人就業実績」の例では、ＣＰＵ１１は、上述した処理により属性「タイトル」と決定した際の評価値の合計値を合計評価値フィールドに記憶する。またＣＰＵ１１は、同様に上述した処理により「項目」と決定した際の評価値の合計値を合計評価値フィールドに記憶する。 FIG. 8 is an explanatory diagram showing a record layout of the target data table 155. In the target data ID field, unique identification information (hereinafter referred to as target data ID) for specifying the target data is stored. In the extracted data field, target data extracted by character recognition is stored in association with the target data ID. In the target data name field, the name of the target data (hereinafter referred to as the target data name) is stored in association with the tag ID. In the attribute field, candidate attributes are stored in association with the target data ID. In the total evaluation value field, the total value of the evaluation values determined by the above-described processing is stored in association with the target data ID and the attribute. In other words, the CPU 11 calculates the total value of the attribute evaluation values based on the data contents, the position of the target data, and the attributes and evaluation values determined according to the presence or absence of target data to be laid. In the above-described example of “individual work performance”, the CPU 11 stores the total value of evaluation values when the attribute “title” is determined by the above-described processing in the total evaluation value field. Similarly, the CPU 11 stores the total value of the evaluation values when “item” is determined by the above-described processing in the total evaluation value field.

上述した「０．００」の例では、ＣＰＵ１１は、上述した処理により属性「時間」と決定した際の評価値の合計値を合計評価値フィールドに記憶する。同様に、ＣＰＵ１１は、上述した処理により属性「日数」と決定した際の評価値の合計値を合計評価値フィールドに記憶する。またＣＰＵ１１は、同様に上述した処理により「金額」と決定した際の評価値の合計値を合計評価値フィールドに記憶する。また、ＣＰＵ１１は、属性を「数値データ」と判断した場合、下位属性である「時刻」、「電話番号」、「価格」、「職員ＩＤ」、「時間」及び「日数」の評価値に属性「数値データ」と判断した属性値を加算する。なお、ＣＰＵ１１は、複数の属性が存在する場合、合計評価値が最も高いもの順にソートして記憶しておけばよい。 In the example of “0.00” described above, the CPU 11 stores the total value of the evaluation values when the attribute “time” is determined by the above-described processing in the total evaluation value field. Similarly, the CPU 11 stores the total value of the evaluation values when the attribute “days” is determined by the above-described processing in the total evaluation value field. Similarly, the CPU 11 stores the total value of the evaluation values when the “money amount” is determined by the above-described processing in the total evaluation value field. When the CPU 11 determines that the attribute is “numerical data”, the attribute is assigned to the evaluation values of the subordinate attributes “time”, “phone number”, “price”, “staff ID”, “time”, and “days”. Add the attribute value determined to be “numeric data”. In addition, CPU11 should just sort and memorize | store in order with the highest total evaluation value, when a some attribute exists.

上フィールド、左フィールド及び左上フィールドには、対象データの上、左または左上で隣接する対象データの有無と、存在する場合は対応する対象データＩＤとを、対象データＩＤに対応付けて記憶している。対象データＩＤ「１」の「個人就業実績」では、隣接する対象データが存在しないため、全て「なし」と記憶されている。一方、対象データＩＤ「１８」の「０．００」では、隣接する対象データが存在するため、上フィールドには対象データＩＤ「１０」、左フィールドには対象データＩＤ「１７」、左上フィールドには対象データＩＤ「９」と記憶されている。座標領域フィールドには、対象データがファイル上で占める座標領域を対象データＩＤに対応付けて記憶している。 In the upper field, the left field, and the upper left field, the presence or absence of target data adjacent to the target data on the left or upper left, and the corresponding target data ID, if present, are stored in association with the target data ID. Yes. In the “individual work record” of the target data ID “1”, since there is no adjacent target data, “none” is stored. On the other hand, when the target data ID “18” is “0.00”, there is adjacent target data, so the upper field is the target data ID “10”, the left field is the target data ID “17”, and the upper left field is the target data ID “18”. Is stored as the target data ID “9”. In the coordinate area field, the coordinate area occupied by the target data on the file is stored in association with the target data ID.

続いて対象データ名の決定処理について説明する。ＣＰＵ１１は、抽出した対象データの属性の内、最も合計評価値の高い属性がタイトル、項目または名前である場合、当該対象データを含む対象データ名を生成する。例えば対象データＩＤ「０１」の属性は「タイトル」である。この場合、抽出した対象データ「個人就業実績」が対象データ名「個人就業実績」として、対象データテーブル１５５に記憶される。また対象データＩＤ「９」の属性は項目であることから、抽出した対象データ「所定内」が対象データ名として記憶される。 Next, the target data name determination process will be described. CPU11 produces | generates the target data name containing the said target data, when the attribute with the highest total evaluation value is a title, an item, or a name among the attributes of the extracted target data. For example, the attribute of the target data ID “01” is “title”. In this case, the extracted target data “individual employment performance” is stored in the target data table 155 as the target data name “individual employment performance”. Further, since the attribute of the target data ID “9” is an item, the extracted target data “within a predetermined” is stored as the target data name.

一方、ＣＰＵ１１は、抽出した対象データの属性の内、最も合計評価値の高い属性が時間、日数、または金額である場合、隣接するデータの対象データ名に基づき、対象データ名を決定する。例えば対象データＩＤ「１８」の属性は「時間」である。この場合、上で隣接する対象データ（対象データＩＤ「１０」）の対象データ名「残業」を含む対象データ名「残業」とする。 On the other hand, when the attribute with the highest total evaluation value is the time, the number of days, or the amount of money among the extracted attributes of the target data, the CPU 11 determines the target data name based on the target data name of adjacent data. For example, the attribute of the target data ID “18” is “time”. In this case, the target data name “overtime” including the target data name “overtime” of the adjacent target data (target data ID “10”) is used.

ＣＰＵ１１は、隣接する対象データの対象データ名の内、上または左に対象データ名が存在する場合、いずれか一方の対象データ名を利用する。例えば図２において、上述した対象データＩＤ「１８」の下側の対象データ「0.00」は属性が時間であり、また上に対象データ名が存在するため、ＣＰＵ１１は、対象データ名を同様に「残業」とする。また図２の下方に存在する対象データ「総計」の右側の「119.00」は属性が時間である場合、左の対象データ名「総計」と同じ対象データ名「総計」が記憶される。なお、ＣＰＵ１１は、上及び左の両方に対象データ名が存在する場合、予め定めたいずれか一方の対象データ名を利用すれば良い。上、左及び左上の全てに対象データが存在しない場合、不明としておけばよい。本実施形態では上の対象データ名を利用する例を挙げて説明する。 The CPU 11 uses one of the target data names when the target data name exists above or to the left of the target data names of the adjacent target data. For example, in FIG. 2, the object data “0.00” below the object data ID “18” has the attribute “time” and the object data name exists above the CPU 11. Overtime ”. In addition, when the attribute “time” is “119.00” on the right side of the target data “total” existing in the lower part of FIG. 2, the same target data name “total” as the target data name “total” on the left is stored. Note that, when the target data names exist on both the upper and left sides, the CPU 11 may use any one of the predetermined target data names. If there is no target data in all of the top, left, and top left, it may be set as unknown. In this embodiment, an example using the above target data name will be described.

以上のハードウェア群において各種ソフトウェア処理手順を、フローチャートを用いて説明する。図９及び図１０は対象データの抽出処理手順を示すフローチャートである。ＣＰＵ１１は、左上の行からスキャンを行い、対象データを抽出する（ステップＳ９１）。ＣＰＵ１１は、新たな対象データを抽出した場合、対象データＩＤを生成する（ステップＳ９２）。ＣＰＵ１１は、生成した対象データＩＤを対象データテーブル１５５に記憶する（ステップＳ９３）。ＣＰＵ１１は、抽出した対象データを対象データテーブル１５５に記憶する（ステップＳ９４）。 Various software processing procedures in the above hardware group will be described with reference to flowcharts. 9 and 10 are flowcharts showing the target data extraction processing procedure. The CPU 11 scans from the upper left row and extracts target data (step S91). CPU11 produces | generates object data ID, when new object data is extracted (step S92). The CPU 11 stores the generated target data ID in the target data table 155 (step S93). The CPU 11 stores the extracted target data in the target data table 155 (step S94).

ＣＰＵ１１は、対象データの座標領域を対象データテーブル１５５に記憶する（ステップＳ９５）。ＣＰＵ１１は、上に隣接する対象データが存在するか否かを判断する（ステップＳ９６）。具体的には、ＣＰＵ１１は、対象データテーブル１５５の座標領域を参照し、対象となる対象データの座標領域に対し、Ｙ軸負方向の所定範囲を座標領域に含む対象データＩＤを抽出する。その他、座標領域同士が隣接する対象データＩＤを抽出し、Ｘ軸方向で隣接する対象データＩＤを最終的に上側で隣接する対象データＩＤとして抽出しても良い。ＣＰＵ１１は、上に隣接する対象データが存在すると判断した場合（ステップＳ９６でＹＥＳ）、処理をステップＳ９７へ移行させる。ＣＰＵ１１は、上に隣接する対象データの対象データＩＤを対象データテーブル１５５に記憶する（ステップＳ９７）。一方、上に隣接する対象データが存在しないと判断した場合（ステップＳ９６でＮＯ）、ステップＳ９７をスキップし、処理をステップＳ９８へ移行させる。 The CPU 11 stores the coordinate area of the target data in the target data table 155 (step S95). The CPU 11 determines whether or not there is target data adjacent above (step S96). Specifically, the CPU 11 refers to the coordinate area of the target data table 155 and extracts a target data ID that includes a predetermined range in the negative Y-axis direction in the coordinate area with respect to the coordinate area of the target data. In addition, target data IDs adjacent to each other in the coordinate area may be extracted, and target data IDs adjacent in the X-axis direction may be finally extracted as target data IDs adjacent on the upper side. If the CPU 11 determines that there is target data adjacent above (YES in step S96), the process proceeds to step S97. The CPU 11 stores the target data ID of the target data adjacent above in the target data table 155 (step S97). On the other hand, if it is determined that there is no target data adjacent above (NO in step S96), step S97 is skipped and the process proceeds to step S98.

ＣＰＵ１１は、左に隣接する対象データが存在するか否かを判断する（ステップＳ９８）。具体的には、ＣＰＵ１１は、対象データテーブル１５５の座標領域を参照し、対象となる対象データの座標領域に対し、Ｘ軸負方向の所定範囲を座標領域に含む対象データＩＤを抽出する。ＣＰＵ１１は、左に隣接する対象データが存在すると判断した場合（ステップＳ９８でＹＥＳ）、処理をステップＳ９９へ移行させる。ＣＰＵ１１は、左に隣接する対象データの対象データＩＤを対象データテーブル１５５に記憶する（ステップＳ９９）。一方、左に隣接する対象データが存在しないと判断した場合（ステップＳ９８でＮＯ）、ステップＳ９９をスキップし、処理をステップＳ１０１へ移行させる。 The CPU 11 determines whether there is target data adjacent to the left (step S98). Specifically, the CPU 11 refers to the coordinate area of the target data table 155, and extracts a target data ID that includes a predetermined range in the negative X-axis direction in the coordinate area of the target target data coordinate area. If the CPU 11 determines that there is target data adjacent to the left (YES in step S98), the process proceeds to step S99. The CPU 11 stores the target data ID of the target data adjacent to the left in the target data table 155 (step S99). On the other hand, if it is determined that there is no target data adjacent to the left (NO in step S98), step S99 is skipped, and the process proceeds to step S101.

ＣＰＵ１１は、左上に隣接する対象データが存在するか否かを判断する（ステップＳ１０１）。具体的には、ＣＰＵ１１は、対象データテーブル１５５の座標領域を参照し、対象となる対象データの座標領域に対し、Ｘ軸負方向及びＹ軸負方向の所定範囲を座標領域に含む対象データＩＤを抽出する。ＣＰＵ１１は、左上に隣接する対象データが存在すると判断した場合（ステップＳ１０１でＹＥＳ）、処理をステップＳ１０２へ移行させる。ＣＰＵ１１は、左上に隣接する対象データの対象データＩＤを対象データテーブル１５５に記憶する（ステップＳ１０２）。一方、左上に隣接する対象データが存在しないと判断した場合（ステップＳ１０１でＮＯ）、ステップＳ１０２をスキップし、一連の処理を終了する。ＣＰＵ１１は、以上の処理を全ての抽出した対象データに対して行う。 The CPU 11 determines whether there is adjacent target data on the upper left (step S101). Specifically, the CPU 11 refers to the coordinate area of the target data table 155, and the target data ID includes a predetermined range in the negative X-axis direction and negative Y-axis direction in the coordinate area with respect to the target coordinate area of the target data. To extract. If the CPU 11 determines that there is target data adjacent to the upper left (YES in step S101), the process proceeds to step S102. The CPU 11 stores the target data ID of the target data adjacent to the upper left in the target data table 155 (step S102). On the other hand, if it is determined that there is no target data adjacent to the upper left (NO in step S101), step S102 is skipped, and the series of processes is terminated. CPU11 performs the above process with respect to all the extracted object data.

図１１は位置に基づく属性及び評価値決定処理の手順を示すフローチャートである。ＣＰＵ１１は、対象データテーブル１５５から座標領域を読み出す（ステップＳ１１１）。ＣＰＵ１１は、座標領域の中心点を算出する（ステップＳ１１２）。具体的には、ＣＰＵ１１は、左上のＸ座標と右下のＸ座標の平均値を中心点のＸ座標とし、左上のＹ座標と右下のＹ座標の平均値を中心点のＹ座標とすればよい。ＣＰＵ１１は、位置テーブル１５１を参照し、中心点が属する座標領域を抽出する（ステップＳ１１３）。 FIG. 11 is a flowchart showing a procedure of position-based attribute and evaluation value determination processing. The CPU 11 reads the coordinate area from the target data table 155 (step S111). The CPU 11 calculates the center point of the coordinate area (step S112). Specifically, the CPU 11 sets the average value of the upper left X coordinate and the lower right X coordinate as the X coordinate of the center point, and sets the average value of the upper left Y coordinate and the lower right Y coordinate as the Y coordinate of the center point. That's fine. The CPU 11 refers to the position table 151 and extracts a coordinate area to which the center point belongs (step S113).

ＣＰＵ１１は、抽出した座標領域に対応する属性及び評価値を読み出す（ステップＳ１１４）。ＣＰＵ１１は、対象データＩＤに対応付けてＲＡＭ１２に属性及び評価値を記憶する（ステップＳ１１５）。ＣＰＵ１１は、上述した処理を対象データテーブル１５５に記憶した全ての対象データＩＤに対し、処理を終了したか否かを判断する（ステップＳ１１６）。ＣＰＵ１１は、処理を全て終えていないと判断した場合（ステップＳ１１６でＮＯ）、処理をステップＳ１１７へ移行させる。ＣＰＵ１１は、次の対象データＩＤの座標領域を対象データテーブル１５５から読み出す（ステップＳ１１７）。ＣＰＵ１１は、その後処理をステップＳ１１２に戻す。以上の処理を繰り返すことにより、全ての対象データＩＤについての属性及び評価値が決定する。ＣＰＵ１１は、全ての対象データＩＤについて処理を終了したと判断した場合（ステップＳ１１６でＹＥＳ）、一連の処理を終了する。 CPU11 reads the attribute and evaluation value corresponding to the extracted coordinate area (step S114). The CPU 11 stores the attribute and evaluation value in the RAM 12 in association with the target data ID (step S115). The CPU 11 determines whether or not the processing has been completed for all the target data IDs stored in the target data table 155 in the above-described processing (step S116). If the CPU 11 determines that all the processes have not been completed (NO in step S116), the process proceeds to step S117. The CPU 11 reads the coordinate area of the next target data ID from the target data table 155 (step S117). CPU11 returns a process to step S112 after that. By repeating the above processing, attributes and evaluation values for all target data IDs are determined. If the CPU 11 determines that the process has been completed for all target data IDs (YES in step S116), the series of processes is terminated.

図１２は隣接対象データの存否に基づく属性及び評価値決定処理の手順を示すフローチャートである。ＣＰＵ１１は、対象データテーブル１５５から対象データＩＤを読み出す（ステップＳ１２１）。ＣＰＵ１１は、対象データテーブル１５５から、読み出した対象データＩＤに対応する隣接する対象データＩＤを読み出す（ステップＳ１２２）。ＣＰＵ１１は、隣接テーブル１５２を参照し、対応する属性及び評価値を読み出す（ステップＳ１２３）。具体的には、ＣＰＵ１１は、隣接対象データの存否に基づき、隣接テーブル１５２から対応する属性及び評価値を読み出す。 FIG. 12 is a flowchart showing a procedure of attribute and evaluation value determination processing based on the presence / absence of adjacent target data. The CPU 11 reads the target data ID from the target data table 155 (step S121). The CPU 11 reads the adjacent target data ID corresponding to the read target data ID from the target data table 155 (step S122). The CPU 11 refers to the adjacent table 152 and reads the corresponding attribute and evaluation value (step S123). Specifically, the CPU 11 reads the corresponding attribute and evaluation value from the adjacency table 152 based on the presence / absence of adjacency target data.

ＣＰＵ１１は、対象データＩＤに対応付けてＲＡＭ１２に属性及び評価値を記憶する（ステップＳ１２４）。なお、複数の属性の候補が存在する場合は、それぞれの属性及び評価値を記憶しておけばよい。例えば、左の隣接対象データ及び左上の隣接対象データが存在せず、上の隣接データのみが存在する場合、対象データＩＤに対応付けて、「属性数値データ、評価値０．７」と「属性項目、評価値０．３」と記憶される。 The CPU 11 stores the attribute and the evaluation value in the RAM 12 in association with the target data ID (step S124). When there are a plurality of attribute candidates, each attribute and evaluation value may be stored. For example, if the left adjacent target data and the upper left adjacent target data do not exist, but only the upper adjacent data exists, “attribute numeric data, evaluation value 0.7” and “attribute are associated with the target data ID. Item, evaluation value 0.3 "is stored.

ＣＰＵ１１は、上述した処理を対象データテーブル１５５に記憶した全ての対象データＩＤに対し、処理を終了したか否かを判断する（ステップＳ１２５）。ＣＰＵ１１は、処理を全て終えていないと判断した場合（ステップＳ１２５でＮＯ）、処理をステップＳ１２６へ移行させる。ＣＰＵ１１は、次の対象データＩＤを対象データテーブル１５５から読み出す（ステップＳ１２６）。ＣＰＵ１１は、その後処理をステップＳ１２２に戻す。以上の処理を繰り返すことにより、全ての対象データＩＤについての属性及び評価値が決定する。ＣＰＵ１１は、全ての対象データＩＤについて処理を終了したと判断した場合（ステップＳ１２５でＹＥＳ）、一連の処理を終了する。 The CPU 11 determines whether or not the processing has been completed for all the target data IDs stored in the target data table 155 in the above-described processing (step S125). If the CPU 11 determines that all the processes have not been completed (NO in step S125), the process proceeds to step S126. The CPU 11 reads the next target data ID from the target data table 155 (step S126). CPU11 returns a process to step S122 after that. By repeating the above processing, attributes and evaluation values for all target data IDs are determined. If the CPU 11 determines that the processing has been completed for all the target data IDs (YES in step S125), the series of processing ends.

図１３及び図１４は対象データの文字及び数字内容に基づく属性及び評価値決定処理の手順を示すフローチャートである。ＣＰＵ１１は、対象データテーブル１５５から対象データＩＤ及び対象データを読み出す（ステップＳ１３１）。ＣＰＵ１１は、対象データの数字が占める割合を算出する（ステップＳ１３２）。ＣＰＵ１１は、記憶部１５に記憶した閾値を読み出す（ステップＳ１３３）。ＣＰＵ１１は、算出した割合が閾値以上か否かを判断する（ステップＳ１３４）。 FIG. 13 and FIG. 14 are flowcharts showing the procedure of attribute and evaluation value determination processing based on the character and number contents of the target data. The CPU 11 reads out the target data ID and the target data from the target data table 155 (step S131). CPU11 calculates the ratio which the number of object data occupies (step S132). The CPU 11 reads out the threshold value stored in the storage unit 15 (step S133). The CPU 11 determines whether or not the calculated ratio is greater than or equal to a threshold value (step S134).

ＣＰＵ１１は、閾値以上と判断した場合（ステップＳ１３４でＹＥＳ）、対象データは数値データであると認識し、ステップＳ１３５へ移行する。ＣＰＵ１１は、数値テーブル１５３を参照し、対象データが条件を満たす属性及び評価値を抽出する（ステップＳ１３５）。具体的には、ＣＰＵ１１は、対象データを構成する数字、記号または文字が数値テーブル１５３の表示形式フィールドに含まれているか否かを判断する。またＣＰＵ１１は、対象データを構成する数字、記号または文字が、表示形式フィールドに記述されている条件を満たすか否かを判断する。ＣＰＵ１１は、表示形式フィールドに含まれていると判断した場合、または、条件を満たすと判断した場合、対応する属性及び評価値を抽出する。 If the CPU 11 determines that it is equal to or greater than the threshold (YES in step S134), the CPU 11 recognizes that the target data is numerical data, and proceeds to step S135. The CPU 11 refers to the numerical value table 153, and extracts attributes and evaluation values that satisfy the target data (step S135). Specifically, the CPU 11 determines whether numbers, symbols, or characters constituting the target data are included in the display format field of the numerical value table 153. Further, the CPU 11 determines whether or not the numbers, symbols, or characters constituting the target data satisfy the conditions described in the display format field. When the CPU 11 determines that it is included in the display format field or determines that the condition is satisfied, the CPU 11 extracts the corresponding attribute and evaluation value.

ＣＰＵ１１は、ステップＳ１３１で読み出した対象データＩＤに対応付けてＲＡＭ１２に属性及び評価値を記憶する（ステップＳ１３６）。ＣＰＵ１１は、閾値以上でないと判断した場合（ステップＳ１３４でＮＯ）、対象データは文字データと判断し、処理をステップＳ１３７へ移行する。ＣＰＵ１１は、キーワードテーブル１５４を参照し、対応する属性及び評価値を抽出する（ステップＳ１３７）。具体的にはＣＰＵ１１は、キーワードフィールドに記憶されたキーワードが、対象データに含まれているか、または一致するか否かを判断する。ＣＰＵ１１は、キーワードに一致または含まれていると判断した場合、当該キーワードに対応する属性及び評価値を読み出す。 The CPU 11 stores the attribute and the evaluation value in the RAM 12 in association with the target data ID read in step S131 (step S136). If the CPU 11 determines that it is not equal to or greater than the threshold value (NO in step S134), it determines that the target data is character data, and the process proceeds to step S137. The CPU 11 refers to the keyword table 154 and extracts the corresponding attribute and evaluation value (step S137). Specifically, the CPU 11 determines whether the keyword stored in the keyword field is included in the target data or matches. When the CPU 11 determines that the keyword matches or is included, the CPU 11 reads the attribute and evaluation value corresponding to the keyword.

ＣＰＵ１１は、対象データＩＤに対応付けてＲＡＭ１２に属性及び評価値を記憶する（ステップＳ１３８）。ステップＳ１３８またはステップＳ１３６の処理後、ＣＰＵ１１は、上述した処理を対象データテーブル１５５に記憶した全ての対象データＩＤに対し、処理を終了したか否かを判断する（ステップＳ１３９）。ＣＰＵ１１は、処理を全て終えていないと判断した場合（ステップＳ１３９でＮＯ）、処理をステップＳ１４１へ移行させる。ＣＰＵ１１は、次の対象データＩＤ及び対象データを対象データテーブル１５５から読み出す（ステップＳ１４１）。ＣＰＵ１１は、その後処理をステップＳ１３２に戻す。以上の処理を繰り返すことにより、全ての対象データＩＤについての属性及び評価値が決定する。ＣＰＵ１１は、全ての対象データＩＤについて処理を終了したと判断した場合（ステップＳ１３９でＹＥＳ）、一連の処理を終了する。 The CPU 11 stores the attribute and the evaluation value in the RAM 12 in association with the target data ID (step S138). After the process of step S138 or step S136, the CPU 11 determines whether or not the process has been completed for all the target data IDs stored in the target data table 155 (step S139). If the CPU 11 determines that all the processes have not been completed (NO in step S139), the process proceeds to step S141. The CPU 11 reads the next target data ID and target data from the target data table 155 (step S141). CPU11 returns a process to step S132 after that. By repeating the above processing, attributes and evaluation values for all target data IDs are determined. If the CPU 11 determines that the processing has been completed for all the target data IDs (YES in step S139), the series of processing ends.

図１５は合計評価値の算出処理手順を示すフローチャートである。ＣＰＵ１１は、ステップＳ１１５、Ｓ１２４、Ｓ１３６、及びＳ１３８にて記憶した対象データＩＤに対応する属性及び評価値を、ＲＡＭ１２から読み出す（ステップＳ１５１）。ＣＰＵ１１は、候補となる属性毎に評価値を加算する（ステップＳ１５２）。ＣＰＵ１１は、属性「数値データ」の評価値を、属性「時刻」、「電話番号」、「価格」、「職員ＩＤ」、「時間」、「日数」の各評価値に加算する（ステップＳ１５３）。ＣＰＵ１１は、属性及び合計評価値を、対象データテーブル１５５に記憶する（ステップＳ１５４）。なお、属性「数値データ」の評価値については係数を乗じた値を加算しても良い。また、ＣＰＵ１１は、合計評価値が高い順に属性を、対象データテーブル１５５に記憶する。 FIG. 15 is a flowchart showing the procedure for calculating the total evaluation value. The CPU 11 reads the attribute and evaluation value corresponding to the target data ID stored in steps S115, S124, S136, and S138 from the RAM 12 (step S151). The CPU 11 adds an evaluation value for each candidate attribute (step S152). The CPU 11 adds the evaluation value of the attribute “numerical data” to the evaluation values of the attributes “time”, “phone number”, “price”, “staff ID”, “time”, and “days” (step S153). . The CPU 11 stores the attribute and the total evaluation value in the target data table 155 (step S154). Note that the value obtained by multiplying the evaluation value of the attribute “numerical data” by a coefficient may be added. Further, the CPU 11 stores the attributes in the target data table 155 in descending order of the total evaluation value.

ＣＰＵ１１は、上述した処理をＲＡＭ１２に記憶した全ての対象データＩＤに対し、処理を終了したか否かを判断する（ステップＳ１５５）。ＣＰＵ１１は、処理を全て終えていないと判断した場合（ステップＳ１５５でＮＯ）、処理をステップＳ１５６へ移行させる。ＣＰＵ１１は、次の対象データＩＤ、当該対象データＩＤに対応する属性及び評価値を対象データテーブル１５５から読み出す（ステップＳ１５６）。ＣＰＵ１１は、その後処理をステップＳ１５２に戻す。以上の処理を繰り返すことにより、全ての対象データＩＤについての属性及び合計評価値が決定する。ＣＰＵ１１は、全ての対象データＩＤについて処理を終了したと判断した場合（ステップＳ１５５でＹＥＳ）、一連の処理を終了する。なお、ＣＰＵ１１は、罫線の存否に基づき、属性及び評価値を決定しても良い。ＣＰＵ１１は、罫線が対象データの両辺または上下に存在すると判断した場合、属性「数値データ」、評価値「１」としても良い。 The CPU 11 determines whether or not the process has been completed for all the target data IDs stored in the RAM 12 (step S155). If the CPU 11 determines that all the processes have not been completed (NO in step S155), the process proceeds to step S156. The CPU 11 reads the next target data ID, the attribute corresponding to the target data ID, and the evaluation value from the target data table 155 (step S156). CPU11 returns a process to step S152 after that. By repeating the above processing, attributes and total evaluation values for all target data IDs are determined. If the CPU 11 determines that the processing has been completed for all the target data IDs (YES in step S155), the series of processing ends. Note that the CPU 11 may determine the attribute and the evaluation value based on the presence or absence of ruled lines. When the CPU 11 determines that the ruled line exists on both sides or above and below the target data, the CPU 11 may set the attribute “numerical data” and the evaluation value “1”.

図１６及び図１７は対象データ名の決定処理手順を示すフローチャートである。ＣＰＵ１１は、対象データテーブル１５５から対象データＩＤ及び合計評価値の最も高い属性を読み出す（ステップＳ１６１）。ＣＰＵ１１は、属性はタイトル、項目、または名前であるか否かを判断する（ステップＳ１６２）。ＣＰＵ１１は、属性がタイトル、項目、または名前であると判断した場合（ステップＳ１６２でＹＥＳ）、処理をステップＳ１６３へ移行させる。ＣＰＵ１１は、抽出データフィールドのデータ、すなわち対象データを、対象データ名フィールドにコピーする（ステップＳ１６３）。 16 and 17 are flowcharts showing the procedure for determining the target data name. The CPU 11 reads the target data ID and the attribute having the highest total evaluation value from the target data table 155 (step S161). The CPU 11 determines whether the attribute is a title, an item, or a name (step S162). If the CPU 11 determines that the attribute is a title, an item, or a name (YES in step S162), the CPU 11 proceeds to step S163. The CPU 11 copies the data in the extracted data field, that is, the target data to the target data name field (step S163).

ＣＰＵ１１は、属性がタイトル、項目、または名前でないと判断した場合（ステップＳ１６２でＮＯ）、処理をステップＳ１６４へ移行させる。ＣＰＵ１１は、対象データテーブル１５５を参照し、上で隣接する対象データが存在するか否かを判断する（ステップＳ１６４）。ＣＰＵ１１は、存在すると判断した場合（ステップＳ１６４でＹＥＳ）、処理をステップＳ１６５へ移行させる。ＣＰＵ１１は、上で隣接する対象データＩＤを読み出す（ステップＳ１６５）。ＣＰＵ１１は、読み出した対象データＩＤに対応する対象データ名を読み出す（ステップＳ１６６）。ＣＰＵ１１は、読み出した対象データ名を対象データ名フィールドにコピーする（ステップＳ１６７）。すなわち、上で隣接する対象データ名と同名とする。 If the CPU 11 determines that the attribute is not a title, item, or name (NO in step S162), the CPU 11 proceeds to step S164. The CPU 11 refers to the target data table 155 and determines whether there is target data adjacent above (step S164). If the CPU 11 determines that it exists (YES in step S164), the process proceeds to step S165. The CPU 11 reads out the target data ID adjacent above (step S165). The CPU 11 reads a target data name corresponding to the read target data ID (step S166). The CPU 11 copies the read target data name to the target data name field (step S167). That is, the name is the same as the name of the target data adjacent above.

ＣＰＵ１１は、上に隣接する対象データが存在しないと判断した場合（ステップＳ１６４でＮＯ）、処理をステップＳ１６８へ移行させる。ＣＰＵ１１は左で隣接する対象データが存在するか否か判断する（ステップＳ１６８）。ＣＰＵ１１は存在すると判断した場合（ステップＳ１６８でＹＥＳ）、ステップＳ１６９へ移行する。ＣＰＵ１１は、左で隣接する対象データＩＤを読み出す（ステップＳ１６９）。ＣＰＵ１１は、読み出した対象データＩＤに対応する対象データ名を読み出す（ステップＳ１７１）。ＣＰＵ１１は、読み出した対象データ名を対象データ名フィールドにコピーする（ステップＳ１７２）。すなわち、左で隣接する対象データ名と同名とする。なお、本実施形態ではステップＳ１６４〜Ｓ１６７の処理を、ステップＳ１６８〜Ｓ１７２の処理よりも先に行ったか、左の隣接対象データを優先すべく、先にステップＳ１６８〜Ｓ１７２の処理を実行しても良い。 If the CPU 11 determines that there is no adjacent target data (NO in step S164), the process proceeds to step S168. The CPU 11 determines whether there is target data adjacent on the left (step S168). If the CPU 11 determines that it exists (YES in step S168), the process proceeds to step S169. The CPU 11 reads the target data ID adjacent on the left (step S169). The CPU 11 reads a target data name corresponding to the read target data ID (step S171). The CPU 11 copies the read target data name to the target data name field (step S172). That is, the same name as the name of the target data adjacent on the left. In the present embodiment, the processing of steps S164 to S167 is performed before the processing of steps S168 to S172, or the processing of steps S168 to S172 is executed first in order to prioritize the left adjacent target data. good.

ＣＰＵ１１は、左で隣接する対象データが存在しないと判断した場合（ステップＳ１６８でＮＯ）、処理をステップＳ１７３に移行させる。ＣＰＵ１１は、対象データ名フィールドに不明と記憶する（ステップＳ１７３）。ＣＰＵ１１は、上述した処理をＲＡＭ１２に記憶した全ての対象データＩＤに対し、処理を終了したか否かを判断する（ステップＳ１７４）。ＣＰＵ１１は、処理を全て終えていないと判断した場合（ステップＳ１７４でＮＯ）、処理をステップＳ１７５へ移行させる。また、ＣＰＵ１１は、ステップＳ１６３、Ｓ１６７及びＳ１７２の後も、ステップＳ１７４へ処理を移行させる。 If the CPU 11 determines that there is no target data adjacent on the left (NO in step S168), the process proceeds to step S173. The CPU 11 stores “unknown” in the target data name field (step S173). The CPU 11 determines whether or not the process has been completed for all the target data IDs stored in the RAM 12 (step S174). If the CPU 11 determines that all the processes have not been completed (NO in step S174), the process proceeds to step S175. Further, the CPU 11 shifts the processing to step S174 even after steps S163, S167, and S172.

ＣＰＵ１１は、次の対象データＩＤ、及び、最も合計評価値の高い属性を対象データテーブル１５５から読み出す（ステップＳ１７５）。ＣＰＵ１１は、その後処理をステップＳ１６２に戻す。以上の処理を繰り返すことにより、全ての対象データＩＤについての対象データ名が決定する。ＣＰＵ１１は、全ての対象データＩＤについて処理を終了したと判断した場合（ステップＳ１７４でＹＥＳ）、一連の処理を終了する。以上の処理により、抽出した対象データの属性を高い精度で特定することが可能となる。 The CPU 11 reads the next target data ID and the attribute having the highest total evaluation value from the target data table 155 (step S175). The CPU 11 then returns the process to step S162. By repeating the above processing, target data names for all target data IDs are determined. If the CPU 11 determines that the process has been completed for all target data IDs (YES in step S174), the series of processes is terminated. Through the above processing, it is possible to specify the attribute of the extracted target data with high accuracy.

実施の形態２
実施の形態２はユーザの使用履歴に応じて評価値を補正する形態に関する。図１８は帳票データのイメージを示す説明図である。ＣＰＵ１１は、入力部１３から対象データの選択を受け付ける。図１８の例では、項目「残業」の残業時間が複数選択されている。なお、本実施形態では複数の対象データを選択する例を挙げて説明するが、一の対象データを選択するようにしても良い。ＣＰＵ１１は、選択された対象データについて、プロパティの表示要求を、入力部１３から受け付ける。ＣＰＵ１１は、プロパティの表示要求を受け付けた場合、図１８に示す如く、設定タブ１９１を記憶部１５から読み出し、表示部１４に表示する。 Embodiment 2
The second embodiment relates to a form in which an evaluation value is corrected according to a user's usage history. FIG. 18 is an explanatory diagram showing an image of form data. The CPU 11 receives selection of target data from the input unit 13. In the example of FIG. 18, a plurality of overtime hours for the item “overtime” are selected. In the present embodiment, an example of selecting a plurality of target data will be described. However, one target data may be selected. The CPU 11 receives a property display request for the selected target data from the input unit 13. When receiving a property display request, the CPU 11 reads the setting tab 191 from the storage unit 15 and displays it on the display unit 14 as shown in FIG.

図１９は実施の形態２に係る対象データテーブル１５５のレコードレイアウトを示す説明図である。ＣＰＵ１１は、対象データテーブル１５５を参照し、選択された対象データの対象データＩＤを読み出す。ＣＰＵ１１は、対象データＩＤに対応する対象データ名を読み出す。ＣＰＵ１１は、設定タブ１９１上の対象データ名ボックス１９４に対象データ名を記述する。図１８の例では「残業」と表示されている。ＣＰＵ１１は、対象データＩＤに対応する、属性を読み出す。ＣＰＵ１１は、合計評価値の高い順に読み出した属性を属性プルダウンボックス１９２に記述する。図１８に示すように属性プルダウンボックス１９２には、最も合計評価値の高い属性「時間」を先頭に、「金額」、「職員ＩＤ」が記述されている。なお、本実施形態では選択した複数の対象データの対象データ名、属性及び属性の合計評価値に伴う順位が一致しているものとして説明する。また属性プルダウンボックス１９２には、タイトル、項目等、合計評価値がゼロの他の属性をも表示しても良い。 FIG. 19 is an explanatory diagram showing a record layout of the target data table 155 according to the second embodiment. The CPU 11 refers to the target data table 155 and reads the target data ID of the selected target data. The CPU 11 reads the target data name corresponding to the target data ID. The CPU 11 describes the target data name in the target data name box 194 on the setting tab 191. In the example of FIG. 18, “Overtime” is displayed. The CPU 11 reads the attribute corresponding to the target data ID. The CPU 11 describes the attributes read out in descending order of the total evaluation value in the attribute pull-down box 192. As shown in FIG. 18, the attribute pull-down box 192 describes “money amount” and “staff ID” starting with the attribute “time” having the highest total evaluation value. In the present embodiment, description will be made on the assumption that the target data names, attributes, and ranks of attribute total evaluation values of a plurality of selected target data match. The attribute pull-down box 192 may also display other attributes such as titles and items whose total evaluation value is zero.

ユーザは入力部１３から属性プルダウンボックス１９２に表示された属性を選択する。例えば当初は属性「時間」と定義されていたが、属性「金額」に変更を希望する場合、属性「金額」を選択する。ユーザは、属性を選択した後、ＯＫボタン１９３を入力する。ＣＰＵ１１は、ＯＫボタン１９３の入力及び属性の入力を受け付ける。ＣＰＵ１１は、対象データＩＤ及び入力された属性を参照し、対象データテーブル１５５の選択回数フィールドの数をインクリメントする。 The user selects an attribute displayed in the attribute pull-down box 192 from the input unit 13. For example, the attribute “time” was initially defined, but when the attribute “amount” is desired to be changed, the attribute “amount” is selected. After selecting the attribute, the user inputs an OK button 193. The CPU 11 accepts input of the OK button 193 and input of attributes. The CPU 11 refers to the target data ID and the input attribute, and increments the number of selection times field of the target data table 155.

ＣＰＵ１１は、選択回数が更新された場合、または、一定時間経過後に、選択回数を合計評価値に反映させる処理を行う。ＣＰＵ１１は、属性の選択回数が更新される度に、増加した数を、合計評価値に加算する。例えば属性「時間」が１回選択された場合、属性「時間」の合計評価値を１増加させる。なお、ＣＰＵ１１は、係数を乗じ、乗算後の値を合計評価値に加算するようにしても良い。その他、一定時間経過後に、複数の属性の選択回数に基づき、加算値を算出するようにしても良い。 The CPU 11 performs a process of reflecting the number of selections in the total evaluation value when the number of selections is updated or after a predetermined time has elapsed. The CPU 11 adds the increased number to the total evaluation value every time the number of attribute selections is updated. For example, when the attribute “time” is selected once, the total evaluation value of the attribute “time” is increased by one. The CPU 11 may multiply the coefficient and add the value after multiplication to the total evaluation value. In addition, the addition value may be calculated based on the number of selections of a plurality of attributes after a predetermined time has elapsed.

具体的には、ＣＰＵ１１は、合計評価値の最も高い属性の選択回数と、その他の属性の選択回数とを読み出す。ＣＰＵ１１は、その他の属性の選択回数を、合計評価値の最も高い属性の選択回数で除した値を、当該その他の属性の加算値として求める。ＣＰＵ１１は、求めた加算値を、当該その他の属性の合計評価値に加算する。ＣＰＵ１１は、その他の属性の選択回数の合計値を算出する。ＣＰＵ１１は、合計評価値の最も高い属性の選択回数から、合計値を減じ、減じた値を合計評価値の最も高い属性の選択回数で除し、当該除した値を、合計評価値の最も高い属性の加算値とする。ＣＰＵ１１は、合計評価値の最も高い属性の合計評価値に加算値を加算する。ＣＰＵ１１は、選択回数を考慮した合計表価値を求めた後に、合計評価値の高い順に属性を並び替える。 Specifically, the CPU 11 reads the number of times of selecting the attribute having the highest total evaluation value and the number of times of selecting other attributes. CPU11 calculates | requires the value which remove | divided the selection frequency of the other attribute by the selection frequency of the attribute with the highest total evaluation value as an addition value of the said other attribute. The CPU 11 adds the obtained addition value to the total evaluation value of the other attributes. The CPU 11 calculates the total value of the number of selections of other attributes. The CPU 11 subtracts the total value from the number of selections of the attribute having the highest total evaluation value, divides the subtracted value by the number of selections of the attribute having the highest total evaluation value, and the divided value has the highest total evaluation value. The added value of the attribute. The CPU 11 adds the added value to the total evaluation value of the attribute having the highest total evaluation value. The CPU 11 rearranges the attributes in descending order of the total evaluation value after obtaining the total table value considering the number of selections.

図２０は設定タブ１９１の表示処理手順を示すフローチャートである。ＣＰＵ１１は、記憶部１５から図２に示す個人就業実績ファイルを読み出す。ＣＰＵ１１は、対象データの選択を入力部１３から受け付ける（ステップＳ２０１）。ＣＰＵ１１は、選択した対象データについてのプロパティの表示要求を受け付けたか否かを判断する（ステップＳ２０２）。ＣＰＵ１１は、受け付けていないと判断した場合（ステップＳ２０２でＮＯ）、受け付けるまで待機する。ＣＰＵ１１は、受け付けたと判断した場合（ステップＳ２０２でＹＥＳ）、処理をステップＳ２０３へ移行させる。 FIG. 20 is a flowchart showing the display processing procedure of the setting tab 191. The CPU 11 reads the personal work performance file shown in FIG. CPU11 receives selection of object data from input part 13 (Step S201). The CPU 11 determines whether a property display request for the selected target data has been received (step S202). If the CPU 11 determines that it has not been accepted (NO in step S202), it waits until it is accepted. If the CPU 11 determines that it has been received (YES in step S202), the process proceeds to step S203.

ＣＰＵ１１は、選択された対象データの対象データＩＤを読み出す（ステップＳ２０３）。ＣＰＵ１１は、読み出した対象データＩＤに対応する対象データ名を読み出す（ステップＳ２０４）。ＣＰＵ１１は、記憶部１５から設定タブ１９１のテンプレートを読み出す。ＣＰＵ１１は、設定タブ１９１の対象データ名ボックス１９４に対象データ名を記述する（ステップＳ２０５）。ＣＰＵ１１は、対象データＩＤに対応する属性及び合計評価値を全て読み出す（ステップＳ２０６）。ＣＰＵ１１は、設定タブ１９１の属性プルダウンボックス１９２に合計評価値の最も高い順に、属性を記述する（ステップＳ２０７）。 The CPU 11 reads out the target data ID of the selected target data (step S203). The CPU 11 reads the target data name corresponding to the read target data ID (step S204). The CPU 11 reads the setting tab 191 template from the storage unit 15. The CPU 11 describes the target data name in the target data name box 194 of the setting tab 191 (step S205). The CPU 11 reads all attributes and total evaluation values corresponding to the target data ID (step S206). The CPU 11 describes the attributes in the attribute pull-down box 192 of the setting tab 191 in descending order of the total evaluation value (step S207).

ＣＰＵ１１は、記述を終えた設定タブ１９１を表示部１４に表示する（ステップＳ２０８）。ＣＰＵ１１は、入力部１３から属性の選択を受け付けたか否かを判断する（ステップＳ２０９）。ＣＰＵ１１は、属性の選択を受け付けていないと判断した場合（ステップＳ２０９でＮＯ）、属性を受け付けるまで待機する。ＣＰＵ１１は、属性の選択を受け付けたと判断した場合（ステップＳ２０９でＹＥＳ）、ＯＫボタン１９３の入力を条件に、ステップＳ２０３で読み出した対象データＩＤ及び受け付けた属性に対応する選択回数をインクリメントする（ステップＳ２０１０）。ＣＰＵ１１は、設定タブ１９１の表示を消去する（ステップＳ２０１１）。 The CPU 11 displays the setting tab 191 whose description has been completed on the display unit 14 (step S208). CPU11 judges whether selection of an attribute was received from input part 13 (Step S209). If the CPU 11 determines that no attribute selection has been received (NO in step S209), the CPU 11 waits until an attribute is received. If the CPU 11 determines that the selection of the attribute has been accepted (YES in step S209), the CPU 11 increments the number of selections corresponding to the target data ID read in step S203 and the accepted attribute on the condition that the OK button 193 is input (step S209). S2010). The CPU 11 deletes the display of the setting tab 191 (step S2011).

図２１及び図２２は評価値の補正処理手順を示すフローチャートである。以下では、定期的（例えば３日ごと）に合計評価値を補正する例を挙げて説明する。ＣＰＵ１１は、対象データテーブル１５５を参照し、対象データＩＤを読み出す（ステップＳ２１１）。ＣＰＵ１１は、対象データＩＤに対応する属性、合計評価値及び選択回数を読み出す（ステップＳ２１２）。ＣＰＵ１１は、最も合計評価値の高い属性の選択回数（以下、最大選択回数という）を記憶部１５に記憶する（ステップＳ２１３）。ＣＰＵ１１は、最大選択回数を除く他の属性の選択回数の合計値を算出する（ステップＳ２１４）。 21 and 22 are flowcharts showing the evaluation value correction processing procedure. Hereinafter, an example in which the total evaluation value is corrected periodically (for example, every three days) will be described. The CPU 11 reads the target data ID with reference to the target data table 155 (step S211). The CPU 11 reads the attribute, total evaluation value, and number of selections corresponding to the target data ID (step S212). The CPU 11 stores the number of selections of the attribute having the highest total evaluation value (hereinafter referred to as the maximum selection number) in the storage unit 15 (step S213). The CPU 11 calculates the total value of the selection counts of other attributes excluding the maximum selection count (step S214).

ＣＰＵ１１は、最大選択回数から合計値を減算する（ステップＳ２１５）。ＣＰＵ１１は、減算値を最大選択回数で除す（ステップＳ２１６）。ＣＰＵ１１は、除した値を合計評価値に加算する（ステップＳ２１７）。なお、除した値に係数をさらに乗じても良い。ＣＰＵ１１は、属性に対応付けて加算値を合計評価値フィールドに上書きする（ステップＳ２１８）。ＣＰＵ１１は、次に合計評価値の高い属性の選択回数を、最大選択回数で除す（ステップＳ２１９）。ＣＰＵ１１は、除した値を、ステップＳ２１９の属性に対応する合計評価値に加算する（ステップＳ２２１）。 The CPU 11 subtracts the total value from the maximum number of selections (step S215). The CPU 11 divides the subtraction value by the maximum number of selections (step S216). The CPU 11 adds the divided value to the total evaluation value (step S217). Note that the divided value may be further multiplied by a coefficient. The CPU 11 overwrites the added value in the total evaluation value field in association with the attribute (step S218). The CPU 11 divides the number of selections of the attribute having the next highest total evaluation value by the maximum number of selections (step S219). The CPU 11 adds the divided value to the total evaluation value corresponding to the attribute in step S219 (step S221).

ＣＰＵ１１は、属性に対応付けて加算値を合計評価値フィールドに上書きする（ステップＳ２２２）。ＣＰＵ１１は、全ての属性の合計評価値について、ステップＳ２１９〜Ｓ２２２の処理を終了したか否かを判断する（ステップＳ２２３）。ＣＰＵ１１は、処理を終了していないと判断した場合（ステップＳ２２３でＮＯ）、処理をステップＳ２１９に戻し、さらに次に合計値の高い属性の選択回数を、最大選択回数で除す処理を行う。ＣＰＵ１１は、全ての属性の合計評価値について処理を終了したと判断した場合（ステップＳ２２３でＹＥＳ）、処理をステップＳ２２４へ移行させる。 The CPU 11 overwrites the added value in the total evaluation value field in association with the attribute (step S222). CPU11 judges whether the process of step S219-S222 was complete | finished about the total evaluation value of all the attributes (step S223). If the CPU 11 determines that the process has not ended (NO in step S223), it returns the process to step S219, and further performs a process of dividing the number of times of selection of the attribute having the next highest total value by the maximum number of times of selection. If the CPU 11 determines that the process has been completed for the total evaluation value of all attributes (YES in step S223), the process proceeds to step S224.

ＣＰＵ１１は、合計評価値の高い順に、対象データテーブル１５５の属性及び合計評価値を並び替える（ステップＳ２２４）。ＣＰＵ１１は、選択回数フィールドの値を消去する（ステップＳ２２５）。ＣＰＵ１１は、全ての対象ＩＤについて処理を終了したか否かを判断する（ステップＳ２２６）。ＣＰＵ１１は、処理を終えていないと判断した場合（ステップＳ２２６でＮＯ）、処理をステップＳ２２７へ移行させる。ＣＰＵ１１は、未処理の次の対象ＩＤを読み出す（ステップＳ２２７）。ＣＰＵ１１は、その後処理をステップＳ２１２に戻す。ＣＰＵ１１は、全ての対象ＩＤについて処理を終了したと判断した場合（ステップＳ２２６でＹＥＳ）、一連の処理を終了する。これにより、帳票データから特定することが可能な属性に加え、ユーザの使用実績をも反映して対象データの属性を精度良く特定することが可能となる。また例えば属性「残業」のデータが「0.00,8.00・・」のような時間の場合と、「7,500、15,000・・」のような金額の場合との複数種類の属性が考えられるところ、本実施形態では対象データの内容と選択実績に基づき精度良く属性を特定することが可能となる。 The CPU 11 rearranges the attributes and the total evaluation values of the target data table 155 in descending order of the total evaluation values (step S224). The CPU 11 deletes the value in the selection number field (step S225). The CPU 11 determines whether the processing has been completed for all target IDs (step S226). If the CPU 11 determines that the process has not been completed (NO in step S226), the process proceeds to step S227. The CPU 11 reads the next unprocessed target ID (step S227). Thereafter, the CPU 11 returns the process to step S212. If the CPU 11 determines that the process has been completed for all target IDs (YES in step S226), the series of processes is terminated. As a result, in addition to the attributes that can be specified from the form data, it is possible to accurately specify the attributes of the target data reflecting the user's usage record. In addition, for example, there are multiple types of attributes where the data of the attribute “Overtime” is a time such as “0.00,8.00 ...” and the amount of money such as “7,500, 15,000 ...”. In the form, the attribute can be specified with high accuracy based on the content of the target data and the selection record.

本実施の形態２は以上の如きであり、その他は実施の形態１と同様であるので、対応する部分には同一の参照番号を付してその詳細な説明を省略する。 The second embodiment is as described above, and the other parts are the same as those of the first embodiment. Therefore, the corresponding parts are denoted by the same reference numerals, and detailed description thereof is omitted.

実施の形態３
図２３は上述した形態のコンピュータ１の動作を示す機能ブロック図である。ＣＰＵ１１が制御プログラム１５Ｐを実行することにより、コンピュータ１は以下のように動作する。抽出部２３１はデータ群から解析対象となる対象データを抽出する。決定部２３２は抽出した対象データの内容、該対象データに隣接するデータ及び位置に基づき、前記対象データの候補となる属性及び属性の評価値を決定する。 Embodiment 3
FIG. 23 is a functional block diagram showing the operation of the computer 1 of the above-described form. When the CPU 11 executes the control program 15P, the computer 1 operates as follows. The extraction unit 231 extracts target data to be analyzed from the data group. The determining unit 232 determines attributes and attribute evaluation values that are candidates for the target data based on the content of the extracted target data, data adjacent to the target data, and position.

図２４は実施の形態３に係るコンピュータ１のハードウェア群を示すブロック図である。コンピュータ１を動作させるためのプログラムは、ディスクドライブ等の読み取り部１０ＡにCD-ROM、DVD（Digital Versatile Disc）ディスク、メモリーカード、またはUSB(Universal Serial Bus)メモリ等の可搬型記録媒体１Ａを読み取らせて記憶部１５に記憶しても良い。また当該プログラムを記憶したフラッシュメモリ等の半導体メモリ１Ｂをコンピュータ１内に実装しても良い。さらに、当該プログラムは、インターネット等の通信網を介して接続される他のサーバコンピュータ（図示せず）からダウンロードすることも可能である。以下に、その内容を説明する。 FIG. 24 is a block diagram illustrating a hardware group of the computer 1 according to the third embodiment. A program for operating the computer 1 reads a portable recording medium 1A such as a CD-ROM, a DVD (Digital Versatile Disc) disk, a memory card, or a USB (Universal Serial Bus) memory into a reading unit 10A such as a disk drive. It may be stored in the storage unit 15. Further, a semiconductor memory 1B such as a flash memory storing the program may be mounted in the computer 1. Further, the program can be downloaded from another server computer (not shown) connected via a communication network such as the Internet. The contents will be described below.

図２４に示すコンピュータ１は、上述した各種ソフトウェア処理を実行するプログラムを、可搬型記録媒体１Ａまたは半導体メモリ１Ｂから読み取り、或いは、通信網を介して他のサーバコンピュータ（図示せず）からダウンロードする。当該プログラムは、制御プログラム１５Ｐとしてインストールされ、ＲＡＭ１２にロードして実行される。これにより、上述したコンピュータ１として機能する。 The computer 1 shown in FIG. 24 reads a program for executing the above-described various software processes from the portable recording medium 1A or the semiconductor memory 1B or downloads it from another server computer (not shown) via a communication network. . The program is installed as the control program 15P, loaded into the RAM 12, and executed. Thereby, it functions as the computer 1 described above.

本実施の形態３は以上の如きであり、その他は実施の形態１と同様であるので、対応する部分には同一の参照番号を付してその詳細な説明を省略する。 The third embodiment is as described above, and the other parts are the same as those of the first embodiment. Therefore, the corresponding parts are denoted by the same reference numerals, and detailed description thereof is omitted.

以上の実施の形態１から３を含む実施形態に関し、さらに以下の付記を開示する。 With respect to the embodiments including the above first to third embodiments, the following additional notes are further disclosed.

（付記１）
コンピュータに、
データ群から解析対象となる対象データを抽出し、
抽出した対象データの内容、該対象データに隣接するデータ及び位置に基づき、前記対象データの候補となる属性及び属性の評価値を決定する
処理を実行させるプログラム。 (Appendix 1)
On the computer,
Extract the target data to be analyzed from the data group,
A program for executing a process of determining attributes and attribute evaluation values as candidates for the target data based on the contents of the extracted target data, data and positions adjacent to the target data.

（付記２）
対象データが配置された位置を抽出し、
データの配置される位置に対応付けて属性及び評価値を記憶した記憶部を参照し、前記対象データの属性及び評価値を読み出し、
読み出した属性及び評価値に基づき、前記対象データの候補となる属性及び属性の評価値を決定する
付記１に記載のプログラム。 (Appendix 2)
Extract the location where the target data is placed,
Refer to the storage unit that stores the attribute and the evaluation value in association with the position where the data is arranged, and read the attribute and the evaluation value of the target data,
The program according to appendix 1, wherein attributes and evaluation values that are candidates for the target data are determined based on the read attributes and evaluation values.

（付記３）
隣接するデータの存否を判断し、
隣接するデータの存否に応じて属性及び評価値を記憶した記憶部を参照し、前記対象データの属性及び評価値を読み出し、
読み出した属性及び評価値に基づき、前記対象データの候補となる属性及び属性の評価値を決定する
付記１または２に記載のプログラム。 (Appendix 3)
Determine the presence or absence of adjacent data,
Refer to the storage unit storing the attribute and evaluation value according to the presence or absence of adjacent data, read the attribute and evaluation value of the target data,
The program according to claim 1 or 2, wherein attributes and evaluation values that are candidates for the target data are determined based on the read attributes and evaluation values.

（付記４）
隣接するデータが存在する場合に、前記隣接するデータの属性を取得し、
取得した属性に基づき、前記対象データの候補となる属性および属性の評価値を決定する
付記１から３のいずれか一つに記載のプログラム。 (Appendix 4)
If adjacent data exists, obtain the attribute of the adjacent data,
The program according to any one of appendices 1 to 3, wherein an attribute that is a candidate for the target data and an evaluation value of the attribute are determined based on the acquired attribute.

（付記５）
対象データが数値を含む数値データであるか否かを判断し、
数値データであると判断した場合、対象データの表示形式に対応付けて時間、日数または金額を含む属性及び評価値を記憶した記憶部を参照し、前記対象データの属性及び評価値を読み出し、
読み出した属性及び評価値に基づき、前記対象データの候補となる属性及び属性の評価値を決定する
付記１から４のいずれか一つに記載のプログラム。 (Appendix 5)
Determine whether the target data is numeric data including numeric values,
If it is determined that the data is numerical data, refer to the storage unit storing the attribute and evaluation value including the time, the number of days or the amount in association with the display format of the target data, and read the attribute and the evaluation value of the target data,
The program according to any one of appendices 1 to 4, wherein attributes and evaluation values that are candidates for the target data are determined based on the read attributes and evaluation values.

（付記６）
キーワードに対応付けてタイトル、項目または名前を含む属性及び評価値を記憶した記憶部を参照し、前記対象データの属性及び評価値を読み出し、
読み出した属性及び評価値に基づき、前記対象データの候補となる属性及び属性の評価値を決定する
付記１から５のいずれか一つに記載のプログラム。 (Appendix 6)
Refer to the storage unit storing the attribute and evaluation value including the title, item or name in association with the keyword, and read the attribute and evaluation value of the target data,
The program according to any one of appendices 1 to 5, wherein attributes and evaluation values that are candidates for the target data are determined based on the read attributes and evaluation values.

（付記７）
決定した属性がタイトル、項目または名前である場合、前記対象データを含む対象データ名を生成する
付記１から６のいずれか一つに記載のプログラム。 (Appendix 7)
The program according to any one of appendices 1 to 6, wherein when the determined attribute is a title, an item, or a name, a target data name including the target data is generated.

（付記８）
決定した属性が時間、日数または金額である場合、隣接するデータの対象データ名に基づき前記対象データの対象データ名を生成する
付記１から７のいずれか一つに記載のプログラム。 (Appendix 8)
The program according to any one of appendices 1 to 7, wherein when the determined attribute is time, number of days, or amount of money, the target data name of the target data is generated based on the target data name of adjacent data.

（付記９）
決定した候補となる属性を出力し、
各属性に対する選択回数を計数し、
各属性の選択回数に応じて、各属性の評価値を補正する
付記１から８のいずれか一つに記載のプログラム。 (Appendix 9)
Output the candidate attributes that have been determined,
Count the number of selections for each attribute,
The program according to any one of appendices 1 to 8, wherein the evaluation value of each attribute is corrected according to the number of times each attribute is selected.

（付記１０）
データ群から解析対象となる対象データを抽出する抽出部と、
抽出した対象データの内容、該対象データに隣接するデータ及び位置に基づき、前記対象データの候補となる属性及び属性の評価値を決定する決定部と
を備える情報処理装置。 (Appendix 10)
An extraction unit for extracting target data to be analyzed from the data group;
An information processing apparatus comprising: a determination unit that determines an attribute that is a candidate for the target data and an evaluation value of the attribute based on the content of the extracted target data, data adjacent to the target data, and a position.

（付記１１）
情報処理装置を用いた情報処理方法において、
データ群から解析対象となる対象データを抽出し、
抽出した対象データの内容、該対象データに隣接するデータ及び位置に基づき、前記対象データの候補となる属性及び属性の評価値を決定する
情報処理方法。 (Appendix 11)
In an information processing method using an information processing device,
Extract the target data to be analyzed from the data group,
An information processing method for determining an attribute that is a candidate for the target data and an evaluation value of the attribute based on the content of the extracted target data, data adjacent to the target data, and a position.

１コンピュータ
１Ａ可搬型記録媒体
１Ｂ半導体メモリ
１０Ａ読み取り部
１１ＣＰＵ
１２ＲＡＭ
１３入力部
１４表示部
１５記憶部
１５Ｐ制御プログラム
１６通信部
１８時計部
１５１位置テーブル
１５２隣接テーブル
１５３数値テーブル
１５４キーワードテーブル
１５５対象データテーブル
１９１設定タブ
１９２属性プルダウンボックス
１９３ＯＫボタン
１９４対象データ名ボックス DESCRIPTION OF SYMBOLS 1 Computer 1A Portable recording medium 1B Semiconductor memory 10A Reading part 11 CPU
12 RAM
13 Input unit 14 Display unit 15 Storage unit 15P Control program 16 Communication unit 18 Clock unit 151 Position table 152 Adjacent table 153 Numerical table 154 Keyword table 155 Target data table 191 Setting tab 192 Attribute pull-down box 193 OK button 194 Target data name box

Claims

On the computer,
Extract the target data to be analyzed from the data group,
A program for executing a process of determining attributes and attribute evaluation values as candidates for the target data based on the contents of the extracted target data, data and positions adjacent to the target data.

Extract the location where the target data is placed,
Refer to the storage unit that stores the attribute and the evaluation value in association with the position where the data is arranged, and read the attribute and the evaluation value of the target data,
The program according to claim 1, wherein the attribute and the evaluation value of the attribute that are candidates for the target data are determined based on the read attribute and the evaluation value.

Determine the presence or absence of adjacent data,
Refer to the storage unit storing the attribute and evaluation value according to the presence or absence of adjacent data, read the attribute and evaluation value of the target data,
The program according to claim 1 or 2, wherein attributes and evaluation values that are candidates for the target data are determined based on the read attributes and evaluation values.

An extraction unit for extracting target data to be analyzed from the data group;
An information processing apparatus comprising: a determination unit that determines an attribute that is a candidate for the target data and an evaluation value of the attribute based on the content of the extracted target data, data adjacent to the target data, and a position.

In an information processing method using an information processing device,
Extract the target data to be analyzed from the data group,
An information processing method for determining an attribute that is a candidate for the target data and an evaluation value of the attribute based on the content of the extracted target data, data adjacent to the target data, and a position.