JP6187236B2

JP6187236B2 - Data identification program, data identification method, and information processing apparatus

Info

Publication number: JP6187236B2
Application number: JP2013262166A
Authority: JP
Inventors: 田邊　浩靖; 浩靖田邊; 美枝子 ▲高▼橋; 春奈前▲原▼; 伸弘齋藤; 奈央大井; 俊宏石井; 亜希高岡; 泰士捧
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2013-12-19
Filing date: 2013-12-19
Publication date: 2017-08-30
Anticipated expiration: 2033-12-19
Also published as: JP2015118591A

Description

本発明は、データ特定プログラム、データ特定方法および情報処理装置に関する。 The present invention relates to a data specifying program, a data specifying method, and an information processing apparatus.

従来、複数のサイトの情報を１画面に集約して表示するアカウントアグリゲーションと呼ばれるサービスがある。アカウントアグリゲーションによれば、例えば、インターネットバンキングなどに預金者が保有している異なる金融機関の複数の口座の情報を一覧画面に集約して表示することができる。 Conventionally, there is a service called account aggregation that aggregates and displays information on a plurality of sites on one screen. According to account aggregation, for example, information on a plurality of accounts of different financial institutions held by depositors in Internet banking or the like can be aggregated and displayed on a list screen.

関連する先行技術としては、例えば、サーバが、画像から矩形領域の画像を切り出して、ＯＣＲ処理によりテキストを認識し、ｈｔｍｌファイルのソースから認識されたテキストと最も一致度の高いテキストを抽出してクライアント端末へ送信する技術がある。また、ユーザが必要としている個所を抽出するためのウェブページ情報抽出システムがある。 As related prior art, for example, a server cuts out an image of a rectangular area from an image, recognizes the text by OCR processing, and extracts the text having the highest degree of coincidence with the text recognized from the source of the html file. There is a technique for transmitting to a client terminal. In addition, there is a web page information extraction system for extracting a part that a user needs.

また、複数のサイトから取得したｈｔｍｌ文書に対し、切り出しルール、語彙情報、推論演算に基づき、ｈｔｍｌ文書のタグを頼りに抽出データオブジェクトを取り出す技術がある。また、時間変化する情報を含む画面を画像データとして取り込み、画像データに対して文字認識を利用することにより、時間変化する情報を取得して蓄積し、画面上の指定された領域に表示する技術がある。 Further, there is a technique for extracting an extracted data object from an html document acquired from a plurality of sites based on a cut-out rule, vocabulary information, and an inference operation, using a tag of the html document. Technology that captures and stores time-varying information by capturing a screen containing time-varying information as image data and using character recognition for the image data, and displays it in a specified area on the screen There is.

特開２０１１−１２３７４０号公報JP 2011-123740 A 特開２００３−３０８２７５号公報JP 2003-308275 A 特開２００４−０６２４４６号公報JP 2004-062446 A 特開２００７−２０６９０８号公報JP 2007-206908 A

しかしながら、従来技術によれば、ユーザにより指定される文字列と同一内容のデータがサイトのｈｔｍｌデータ内に複数存在すると、サイトのｈｔｍｌデータにおける、サイトから取得する情報の位置を特定することができない場合がある。 However, according to the prior art, if there is a plurality of data having the same content as the character string specified by the user in the html data of the site, the position of the information acquired from the site in the html data of the site cannot be specified. There is a case.

一つの側面では、本発明は、サイトの画面情報における、サイトから取得する情報の位置を正確に特定することができるデータ特定プログラム、データ特定方法および情報処理装置を提供することを目的とする。 In one aspect, an object of the present invention is to provide a data specifying program, a data specifying method, and an information processing apparatus that can accurately specify the position of information acquired from a site in screen information of the site.

本発明の一側面によれば、サイトの画面の画像データ上で選択を受け付けた範囲の画像データから得られるテキストデータと同一内容のテキストデータを、前記画面の画面情報から検索し、前記画面の画面情報内の検索したテキストデータを異なるテキストデータに変更し、変更後の前記画面の画面情報に基づく前記画面の画像データ上の、前記選択を受け付けた範囲と同一の範囲の画像データから得られるテキストデータが、前記異なるテキストデータと一致するか否かを判定することにより、前記画面の画面情報から前記選択を受け付けた範囲に対応するテキストデータを特定するデータ特定プログラム、データ特定方法および情報処理装置が提案される。 According to one aspect of the present invention, text data having the same content as text data obtained from image data in a range in which selection has been received on image data of a screen of a site is searched from screen information of the screen, The searched text data in the screen information is changed to different text data, and is obtained from the image data in the same range as the selection received on the screen image data based on the screen information after the change. A data specifying program, a data specifying method, and an information processing for specifying text data corresponding to a range in which the selection is accepted from screen information of the screen by determining whether text data matches the different text data A device is proposed.

本発明の一態様によれば、サイトの画面情報における、サイトから取得する情報の位置を正確に特定することができるという効果を奏する。 According to one aspect of the present invention, it is possible to accurately specify the position of information acquired from a site in screen information of the site.

図１は、実施の形態にかかるデータ特定方法の一実施例を示す説明図である。FIG. 1 is an explanatory diagram of an example of the data specifying method according to the embodiment. 図２は、システム２００のシステム構成例を示す説明図である。FIG. 2 is an explanatory diagram illustrating a system configuration example of the system 200. 図３は、情報処理装置１０１のハードウェア構成例を示すブロック図である。FIG. 3 is a block diagram illustrating a hardware configuration example of the information processing apparatus 101. 図４は、サーバ２０１のハードウェア構成例を示すブロック図である。FIG. 4 is a block diagram illustrating a hardware configuration example of the server 201. 図５は、アカウントアグリゲーション情報ＤＢ２２０の記憶内容の一例を示す説明図である。FIG. 5 is an explanatory diagram showing an example of the contents stored in the account aggregation information DB 220. 図６は、サイト別目的データ属性ＤＢ２３０の記憶内容の一例を示す説明図である。FIG. 6 is an explanatory diagram showing an example of the contents stored in the site-specific purpose data attribute DB 230. 図７は、一覧情報ＤＢ２４０の記憶内容の一例を示す説明図である。FIG. 7 is an explanatory diagram showing an example of the contents stored in the list information DB 240. 図８は、一覧設定画面の画面例を示す説明図である。FIG. 8 is an explanatory diagram illustrating a screen example of a list setting screen. 図９は、領域初期設定画面の画面例を示す説明図である。FIG. 9 is an explanatory diagram illustrating a screen example of a region initial setting screen. 図１０は、一覧画面の画面例を示す説明図である。FIG. 10 is an explanatory diagram illustrating a screen example of a list screen. 図１１は、情報処理装置１０１の機能的構成例を示すブロック図である。FIG. 11 is a block diagram illustrating a functional configuration example of the information processing apparatus 101. 図１２は、領域再設定画面の画面例を示す説明図である。FIG. 12 is an explanatory diagram illustrating a screen example of the area resetting screen. 図１３は、情報処理装置１０１の情報提供処理手順の一例を示すフローチャートである。FIG. 13 is a flowchart illustrating an example of an information provision processing procedure of the information processing apparatus 101. 図１４は、新規登録処理の具体的処理手順の一例を示すフローチャート（その１）である。FIG. 14 is a flowchart (part 1) illustrating an example of a specific processing procedure of the new registration processing. 図１５は、新規登録処理の具体的処理手順の一例を示すフローチャート（その２）である。FIG. 15 is a flowchart (part 2) illustrating an example of a specific processing procedure of the new registration processing. 図１６は、一覧表示処理の具体的処理手順の一例を示すフローチャートである。FIG. 16 is a flowchart illustrating an example of a specific processing procedure of the list display processing. 図１７は、目的データ設定処理の具体的処理手順の一例を示すフローチャートである。FIG. 17 is a flowchart illustrating an example of a specific processing procedure of the target data setting process. 図１８は、領域再設定画面表示処理の具体的処理手順の一例を示すフローチャートである。FIG. 18 is a flowchart illustrating an example of a specific processing procedure of the area reset screen display process.

以下に図面を参照して、本発明にかかるデータ特定プログラム、データ特定方法および情報処理装置の実施の形態を詳細に説明する。 Exemplary embodiments of a data specifying program, a data specifying method, and an information processing apparatus according to the present invention will be described below in detail with reference to the drawings.

（データ特定方法の一実施例）
図１は、実施の形態にかかるデータ特定方法の一実施例を示す説明図である。図１において、情報処理装置１０１は、複数のサイトの情報を一画面に集約して出力する機能を有するコンピュータである。サイトは、ページまたはページの集合であり、例えば、Ｗｅｂサイトである。 (One example of data identification method)
FIG. 1 is an explanatory diagram of an example of the data specifying method according to the embodiment. In FIG. 1, an information processing apparatus 101 is a computer having a function of collecting and outputting information on a plurality of sites on one screen. A site is a page or a set of pages, for example, a website.

ページは、ネットワーク上に公開される情報であり、例えば、Ｗｅｂページである。ページは、ｈｔｍｌ（ＨｙｐｅｒＴｅｘｔＭａｒｋｕｐＬａｎｇｕａｇｅ）またはｘｈｔｍｌ（ＥｘｔｅｎｓｉｂｌｅＨｙｐｅｒＴｅｘｔＭａｒｋｕｐＬａｎｇｕａｇｅ）によって記述された電子文書（ｈｔｍｌデータ、ｘｈｔｍｌデータ）や画像データなどを含む。 The page is information disclosed on the network, for example, a web page. The page includes an electronic document (html data, xhtml data) or image data described by html (HyperText Markup Language) or xhtml (Extensible HyperText Markup Language).

ここで、銀行サイトＳ１、証券会社サイトＳ２および年金サイトＳ３の情報を一覧画面に集約して出力する場合を想定する。この場合、ユーザは、銀行サイトＳ１、証券会社サイトＳ２および年金サイトＳ３の各サイトについて、各サイトのどのページのどの部分の情報を取得するのかを設定する。 Here, it is assumed that information on the bank site S1, the securities company site S2, and the pension site S3 is collected and output on a list screen. In this case, the user sets, for each site of the bank site S1, the securities company site S2, and the pension site S3, which part of which page of each site is to be acquired.

一例として、年金サイトＳ３の年金の支払額を取得する場合を想定する。この場合、例えば、年金サイトＳ３の画面の画像データ１１０において、ユーザの操作入力により、一覧画面に表示する情報を含む範囲（以下、「領域Ｔ１」と称する）を選択することによって、一覧画面に表示する文字列「１８７６５４０」のテキストデータを取得することができる。 As an example, it is assumed that the payment amount of the pension of the pension site S3 is acquired. In this case, for example, in the image data 110 on the screen of the pension site S3, by selecting a range including information to be displayed on the list screen (hereinafter referred to as “region T1”) by a user operation input, the list screen is displayed. The text data of the character string “18776540” to be displayed can be acquired.

また、年金サイトＳ３の画面のｈｔｍｌデータ１２０から文字列「１８７６５４０」のテキストデータを含むｈｔｍｌ要素のタグを特定することにより、ｈｔｍｌデータ１２０における文字列「１８７６５４０」の位置を特定することが考えられる。タグとは、予め定められた記法により文書に埋め込む形で記述される付加情報である。 Further, it is conceivable to specify the position of the character string “18776540” in the html data 120 by specifying the tag of the html element including the text data of the character string “18776540” from the html data 120 on the screen of the pension site S3. . A tag is additional information described in a form embedded in a document using a predetermined notation.

ｈｔｍｌデータ（または、ｘｈｔｍｌデータ）では、元になる文書に「＜」と「＞」とで囲まれた半角英数字をタグとして埋め込むことにより、ブラウザに対して文書構造、書式、文字飾りなどを指示したり、画像や他の文書へのリンクを埋め込むことができる。また、ｈｔｍｌ要素は、ｈｔｍｌデータを構成する要素であり、例えば、開始タグと内容と終了タグを含む。 In html data (or xhtml data), by embedding half-width alphanumeric characters enclosed in “<” and “>” as tags in the original document, the document structure, format, character decoration, etc. are given to the browser. You can instruct and embed images and links to other documents. The html element is an element constituting html data, and includes, for example, a start tag, contents, and an end tag.

ところが、年金サイトＳ３の画面の中に、ユーザにより指定された文字列「１８７６５４０」と同じ文字列が偶然存在する場合がある。この場合、ユーザにより指定された文字列「１８７６５４０」のテキストデータだけでは、ｈｔｍｌデータ１２０から抽出すべき情報を含むｈｔｍｌ要素のタグを一意に特定することができないことがある。 However, the same character string as the character string “18776540” specified by the user may exist by chance in the screen of the pension site S3. In this case, the tag of the html element including the information to be extracted from the html data 120 may not be uniquely specified only by the text data of the character string “18776540” designated by the user.

そこで、本実施の形態では、サイトのｈｔｍｌデータにおける、ユーザにより指定された文字列に対応するテキストデータの位置を正確に特定するデータ特定方法について説明する。以下、情報処理装置１０１のデータ特定処理の一実施例について説明する。 Therefore, in this embodiment, a data specifying method for accurately specifying the position of text data corresponding to a character string designated by the user in the html data of the site will be described. Hereinafter, an embodiment of the data specifying process of the information processing apparatus 101 will be described.

（１）情報処理装置１０１は、予め記録されたサイトＳの識別情報を参照して、サイトＳの画面情報を取得する。ここで、サイトＳは、一画面に情報を集約して表示する複数のサイトのいずれかのサイトである。サイトＳの識別情報とは、サイトＳを識別する情報であり、例えば、サイトＳのＵＲＬ（ＵｎｉｆｏｒｍＲｅｓｏｕｒｃｅＬｏｃａｔｏｒ）である。 (1) The information processing apparatus 101 acquires screen information of the site S with reference to the identification information of the site S recorded in advance. Here, the site S is one of a plurality of sites that collect and display information on one screen. The identification information of the site S is information for identifying the site S, and is, for example, a URL (Uniform Resource Locator) of the site S.

より詳細に説明すると、サイトＳの識別情報は、例えば、一覧画面に表示するサイトＳの情報を含むページのＵＲＬである。一覧画面は、複数のサイトの情報を集約して表示する画面である。また、サイトＳの画面情報は、サイトＳの情報を含むページを表示するための情報であり、例えば、サイトＳの情報を含むページのｈｔｍｌデータやｘｈｔｍｌデータである。 More specifically, the identification information of the site S is, for example, a URL of a page including the information of the site S displayed on the list screen. The list screen is a screen that aggregates and displays information on a plurality of sites. The screen information of the site S is information for displaying a page including the information of the site S, and is, for example, html data or xhtml data of a page including the information of the site S.

以下の説明では、一覧画面に表示するサイトＳの情報を「目的データ」と表記する場合がある。また、サイトＳの目的データを含むページを「目的ページ」と表記する場合がある。また、サイトＳの画面情報として「ｈｔｍｌデータ」を例に挙げて説明する。 In the following description, the information on the site S displayed on the list screen may be referred to as “target data”. Further, a page including the target data of the site S may be referred to as “target page”. Further, “html data” will be described as an example of the screen information of the site S.

ここでは、一例として、目的ページを「年金サイトＳ３」とし、目的データを「年金の支払額を示す数字列」とする。この場合、情報処理装置１０１は、年金サイトＳ３のＵＲＬを指定して年金サイトＳ３にアクセスすることにより、年金サイトＳ３のｈｔｍｌデータ１２０を取得する。 Here, as an example, the target page is “pension site S3”, and the target data is “numeric string indicating the amount of pension payment”. In this case, the information processing apparatus 101 acquires the html data 120 of the pension site S3 by specifying the URL of the pension site S3 and accessing the pension site S3.

（２）情報処理装置１０１は、サイトＳの画面の画像データ上に設定された領域Ｔの画像データから得られるテキストデータと同一内容のテキストデータを、サイトＳの画面のｈｔｍｌデータから検索する。ここで、領域Ｔの画像データから得られるテキストデータは、目的データのテキストデータである。 (2) The information processing apparatus 101 searches the html data on the screen of the site S for text data having the same content as the text data obtained from the image data of the region T set on the image data on the screen of the site S. Here, the text data obtained from the image data in the region T is the text data of the target data.

具体的には、例えば、まず、情報処理装置１０１は、年金サイトＳ３の画像データ１１０から、画像データ１１０上に設定された領域Ｔ１の画像データ１１１を抽出する。つぎに、情報処理装置１０１は、抽出した領域Ｔ１の画像データ１１１の文字認識処理を行う。ここで、文字認識処理とは、画像データの中から、文字の形状に基づいて文字を識別し、コンピュータ上で扱える文字データに変換する処理である。 Specifically, for example, first, the information processing apparatus 101 extracts the image data 111 of the region T1 set on the image data 110 from the image data 110 of the pension site S3. Next, the information processing apparatus 101 performs character recognition processing on the image data 111 of the extracted region T1. Here, the character recognition process is a process of identifying a character from image data based on the shape of the character and converting it into character data that can be handled on a computer.

文字認識処理は、例えば、ＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅｃｏｇｎｉｔｉｏｎ）処理である。文字認識処理によれば、領域Ｔ内の文字あるいは文字列をテキストデータとして得ることができる。図１の例では、目的データのテキストデータとして、支払（円）を示す数字列「１８７６５４０」のテキストデータが得られる。そして、情報処理装置１０１は、年金サイトＳ３のｈｔｍｌデータ１２０から、文字認識処理により得られた目的データ「１８７６５４０」のテキストデータと同一内容のテキストデータを検索する。 The character recognition process is, for example, an OCR (Optical Character Recognition) process. According to the character recognition process, characters or character strings in the region T can be obtained as text data. In the example of FIG. 1, text data of a numeric string “18776540” indicating payment (yen) is obtained as text data of the target data. Then, the information processing apparatus 101 searches the html data 120 of the pension site S3 for text data having the same contents as the text data of the target data “18776540” obtained by the character recognition process.

（３）情報処理装置１０１は、サイトＳの画面のｈｔｍｌデータ内の検索したテキストデータを異なるテキストデータに変更する。図１の例では、目的データ「１８７６５４０」のテキストデータ「１８７６５４０」と同一内容のテキストデータとして、「年金の支払額」を示すテキストデータ１２１と、「電話番号」の一部を示すテキストデータ１２２が検索される。 (3) The information processing apparatus 101 changes the searched text data in the html data on the screen of the site S to different text data. In the example of FIG. 1, text data 121 indicating “payment amount of annuity” and text data 122 indicating a part of “telephone number” are text data having the same content as text data “18776540” of target data “18776540”. Is searched.

この場合、情報処理装置１０１は、テキストデータ１２１，１２２のいずれかのテキストデータを異なるテキストデータに変更する。図１の例では、ｈｔｍｌデータ１２０内のテキストデータ１２１が、所定の文字列「ココ？」を示すテキストデータ１２３に変更されている。 In this case, the information processing apparatus 101 changes any text data of the text data 121 and 122 to different text data. In the example of FIG. 1, the text data 121 in the html data 120 is changed to text data 123 indicating a predetermined character string “here?”.

なお、情報処理装置１０１は、例えば、上記（２）において、ｈｔｍｌデータ全体に対するテキストデータの検索が終了した後に、上記（３）の処理を実行することにしてもよい。また、情報処理装置１０１は、例えば、上記（２）において、ｈｔｍｌデータの先頭あるいは末尾からテキストデータの検索を行い、同一内容のテキストデータが検索される度に、その都度上記（３）の処理を実行することにしてもよい。 Note that, for example, the information processing apparatus 101 may execute the process (3) after the text data search for the entire html data is completed in (2). Further, for example, in (2) above, the information processing apparatus 101 searches for text data from the beginning or end of html data, and whenever the text data having the same content is searched, the processing of (3) above is performed. May be executed.

（４）情報処理装置１０１は、変更後のサイトＳのｈｔｍｌデータに基づくサイトＳの画面の画像データ上の領域Ｔの画像データから得られるテキストデータが、変更した異なるテキストデータと一致するか否かを判定する。具体的には、例えば、まず、情報処理装置１０１は、変更後のｈｔｍｌデータ１２０に基づいて、年金サイトＳ３をキャプチャして、年金サイトＳ３の画像データ１３０を取得する。なお、キャプチャとは、ディスプレイに表示される画面イメージを画像データとして保存することである。 (4) The information processing apparatus 101 determines whether the text data obtained from the image data of the region T on the image data of the screen of the site S based on the html data of the site S after the change matches the changed different text data. Determine whether. Specifically, for example, first, the information processing apparatus 101 captures the pension site S3 based on the changed html data 120 and acquires the image data 130 of the pension site S3. Note that capture means saving a screen image displayed on the display as image data.

そして、情報処理装置１０１は、取得した年金サイトＳ３の画像データ１３０から、画像データ１３０上の領域Ｔ１（画像データ１１０上で選択された範囲と同一の範囲）の画像データ１３１を抽出する。つぎに、情報処理装置１０１は、抽出した領域Ｔ１の画像データ１３１の文字認識処理を行う。そして、情報処理装置１０１は、文字認識処理により得られたテキストデータが、テキストデータ１２３と一致するか否かを判定する。 Then, the information processing apparatus 101 extracts the image data 131 of the region T1 on the image data 130 (the same range as the range selected on the image data 110) from the acquired image data 130 of the pension site S3. Next, the information processing apparatus 101 performs character recognition processing on the image data 131 of the extracted region T1. Then, the information processing apparatus 101 determines whether the text data obtained by the character recognition process matches the text data 123.

（５）情報処理装置１０１は、判定した判定結果に基づいて、サイトＳの画面のｈｔｍｌデータから領域Ｔに対応するテキストデータを特定する。ここで、テキストデータが一致する場合は、ｈｔｍｌデータにおいて、異なるテキストデータに変更した箇所が、目的データの位置であることを示す。 (5) The information processing apparatus 101 identifies text data corresponding to the region T from the html data on the screen of the site S based on the determined determination result. Here, when the text data matches, it indicates that the location of the target data is the part of the html data that has been changed to different text data.

このため、情報処理装置１０１は、テキストデータが一致する場合、ｈｔｍｌデータのうち、異なるテキストデータに変更したテキストデータを、領域Ｔに対応するテキストデータとして特定する。図１の例では、文字認識処理により得られたテキストデータが、テキストデータ１２３と一致する。 For this reason, when the text data matches, the information processing apparatus 101 specifies the text data changed to different text data in the html data as the text data corresponding to the region T. In the example of FIG. 1, the text data obtained by the character recognition process matches the text data 123.

この場合、情報処理装置１０１は、ｈｔｍｌデータ１２０内のテキストデータ１２１，１２２のうち、異なるテキストデータ１２３に変更したテキストデータ１２１を、領域Ｔに対応するテキストデータとして特定する。なお、テキストデータが一致しない場合は、異なるテキストデータに変更するテキストデータを切り替えて（例えば、テキストデータ１２２）、上記（３）〜（５）の一連の処理を繰り返す。 In this case, the information processing apparatus 101 specifies the text data 121 changed to the different text data 123 among the text data 121 and 122 in the html data 120 as the text data corresponding to the region T. If the text data does not match, the text data to be changed to different text data is switched (for example, text data 122), and the series of processes (3) to (5) is repeated.

ただし、上述した例では、上記（２）で検索されるテキストデータは、テキストデータ１２１，１２２の２つである。このため、文字認識処理により得られたテキストデータがテキストデータ１２３と一致しない場合は、情報処理装置１０１は、例えば、上記（３）〜（５）の処理を繰り返すことなく、テキストデータ１２２を、領域Ｔに対応するテキストデータとして特定することにしてもよい。 However, in the example described above, the text data searched in (2) is two text data 121 and 122. For this reason, when the text data obtained by the character recognition process does not match the text data 123, the information processing apparatus 101 can change the text data 122 without repeating the processes (3) to (5), for example. The text data corresponding to the region T may be specified.

このように、情報処理装置１０１によれば、年金サイトＳ３のｈｔｍｌデータ１２０から、年金サイトＳ３の画像データ１１０上に設定された領域Ｔ１の画像データ１１１から得られるテキストデータと同一内容のテキストデータを検索することができる。これにより、年金サイトＳ３のｈｔｍｌデータ１２０から、目的データと同一内容のテキストデータを検索することができる。 Thus, according to the information processing apparatus 101, text data having the same content as the text data obtained from the html data 120 of the pension site S3 and the image data 111 of the area T1 set on the image data 110 of the pension site S3. Can be searched. Thereby, text data having the same content as the target data can be searched from the html data 120 of the pension site S3.

また、情報処理装置１０１によれば、複数のテキストデータ１２１，１２２が検索された場合、年金サイトＳ３のｈｔｍｌデータ１２０内の複数のテキストデータ１２１，１２２のいずれかのテキストデータ（例えば、テキストデータ１２１）を異なるテキストデータに変更することができる。また、情報処理装置１０１によれば、変更後の年金サイトＳ３のｈｔｍｌデータ１２０に基づく年金サイトＳ３の画像データ１３０上の領域Ｔ１の画像データ１３１から得られるテキストデータが、変更した異なるテキストデータと一致するか否かを判定することができる。 Further, according to the information processing apparatus 101, when a plurality of text data 121 and 122 are searched, any text data (for example, text data) of the plurality of text data 121 and 122 in the html data 120 of the pension site S3. 121) can be changed to different text data. Further, according to the information processing apparatus 101, the text data obtained from the image data 131 of the region T1 on the image data 130 of the pension site S3 based on the html data 120 of the changed pension site S3 is different from the changed different text data. It can be determined whether or not they match.

また、情報処理装置１０１によれば、変更した異なるテキストデータと一致する場合、年金サイトＳ３のｈｔｍｌデータ１２０のうち、異なるテキストデータに変更したテキストデータ１２１を、領域Ｔ１に対応するテキストデータとして特定することができる。これにより、年金サイトＳ３のｈｔｍｌデータ１２０内に目的データと同一内容のテキストデータが複数存在する場合であっても、年金サイトＳ３のｈｔｍｌデータ１２０における目的データの位置を正確に特定することができる。 Further, according to the information processing apparatus 101, when the text data matches the changed different text data, the text data 121 changed to the different text data among the html data 120 of the pension site S3 is specified as the text data corresponding to the region T1. can do. Thereby, even when there are a plurality of text data having the same contents as the target data in the html data 120 of the pension site S3, the position of the target data in the html data 120 of the pension site S3 can be accurately specified. .

（システム２００のシステム構成例）
つぎに、実施の形態にかかるシステム２００のシステム構成例について説明する。 (System configuration example of system 200)
Next, a system configuration example of the system 200 according to the embodiment will be described.

図２は、システム２００のシステム構成例を示す説明図である。図２において、システム２００は、情報処理装置１０１とサーバ２０１を含む。システム２００において、情報処理装置１０１とサーバ２０１は、有線または無線のネットワーク２１０を介して相互に通信可能に接続される。ネットワーク２１０は、例えば、インターネット、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）、ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）などである。 FIG. 2 is an explanatory diagram illustrating a system configuration example of the system 200. In FIG. 2, a system 200 includes an information processing apparatus 101 and a server 201. In the system 200, the information processing apparatus 101 and the server 201 are connected to each other via a wired or wireless network 210 so that they can communicate with each other. The network 210 is, for example, the Internet, a LAN (Local Area Network), a WAN (Wide Area Network), or the like.

ここで、情報処理装置１０１は、アカウントアグリゲーション情報ＤＢ（データベース）２２０、サイト別目的データ属性ＤＢ２３０および一覧情報ＤＢ２４０を有する。具体的には、例えば、情報処理装置１０１は、ブラウザがインストールされたＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）、ノートＰＣ、タブレット型ＰＣ、スマートフォン、携帯電話機などである。 Here, the information processing apparatus 101 includes an account aggregation information DB (database) 220, a site-specific objective data attribute DB 230, and a list information DB 240. Specifically, for example, the information processing apparatus 101 is a PC (Personal Computer) installed with a browser, a notebook PC, a tablet PC, a smartphone, a mobile phone, or the like.

なお、アカウントアグリゲーション情報ＤＢ２２０、サイト別目的データ属性ＤＢ２３０および一覧情報ＤＢ２４０についての説明は、図５〜図７を用いて後述する。 The account aggregation information DB 220, the site-specific purpose data attribute DB 230, and the list information DB 240 will be described later with reference to FIGS.

サーバ２０１は、情報処理装置１０１からの要求に応じて、ｈｔｍｌデータや画像などを含むサイトＳの画面情報を送信するコンピュータである。情報処理装置１０１は、サーバ２０１からのサイトＳの画面情報に基づいて、サイトＳの画面を表示することができる。具体的には、例えば、サーバ２０１は、Ｗｅｂサーバである。 The server 201 is a computer that transmits screen information of the site S including html data and images in response to a request from the information processing apparatus 101. The information processing apparatus 101 can display the screen of the site S based on the screen information of the site S from the server 201. Specifically, for example, the server 201 is a Web server.

（情報処理装置１０１のハードウェア構成例）
図３は、情報処理装置１０１のハードウェア構成例を示すブロック図である。図３において、情報処理装置１０１は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）３０１と、メモリ３０２と、ディスクドライブ３０３と、ディスク３０４と、ディスプレイ３０５と、Ｉ／Ｆ（Ｉｎｔｅｒｆａｃｅ）３０６と、キーボード３０７と、マウス３０８と、スキャナ３０９と、プリンタ３１０と、を有する。また、各構成部はバス３００によってそれぞれ接続される。 (Hardware configuration example of information processing apparatus 101)
FIG. 3 is a block diagram illustrating a hardware configuration example of the information processing apparatus 101. In FIG. 3, an information processing apparatus 101 includes a CPU (Central Processing Unit) 301, a memory 302, a disk drive 303, a disk 304, a display 305, an I / F (Interface) 306, a keyboard 307, a mouse. 308, a scanner 309, and a printer 310. Each component is connected by a bus 300.

ここで、ＣＰＵ３０１は、情報処理装置１０１の全体の制御を司る。メモリ３０２は、例えば、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）およびフラッシュＲＯＭなどを有する。具体的には、例えば、フラッシュＲＯＭやＲＯＭが各種プログラムを記憶し、ＲＡＭがＣＰＵ３０１のワークエリアとして使用される。メモリ３０２に記憶されるプログラムは、ＣＰＵ３０１にロードされることで、コーディングされている処理をＣＰＵ３０１に実行させる。 Here, the CPU 301 governs overall control of the information processing apparatus 101. The memory 302 includes, for example, a ROM (Read Only Memory), a RAM (Random Access Memory), and a flash ROM. Specifically, for example, a flash ROM or ROM stores various programs, and a RAM is used as a work area for the CPU 301. The program stored in the memory 302 is loaded into the CPU 301 to cause the CPU 301 to execute the coded process.

ディスクドライブ３０３は、ＣＰＵ３０１の制御にしたがってディスク３０４に対するデータのリード／ライトを制御する。ディスク３０４は、ディスクドライブ３０３の制御で書き込まれたデータを記憶する。ディスク３０４としては、例えば、磁気ディスク、光ディスクなどが挙げられる。 The disk drive 303 controls reading / writing of data with respect to the disk 304 according to the control of the CPU 301. The disk 304 stores data written under the control of the disk drive 303. Examples of the disk 304 include a magnetic disk and an optical disk.

ディスプレイ３０５は、カーソル、アイコンあるいはツールボックスをはじめ、文書、画像、機能情報などのデータを表示する。ディスプレイ３０５は、例えば、ＣＲＴ、ＴＦＴ液晶ディスプレイ、プラズマディスプレイなどを採用することができる。 A display 305 displays data such as a document, an image, and function information as well as a cursor, an icon, or a tool box. As the display 305, for example, a CRT, a TFT liquid crystal display, a plasma display, or the like can be adopted.

Ｉ／Ｆ３０６は、通信回線を通じてネットワーク２１０に接続され、ネットワーク２１０を介して他のコンピュータ（例えば、サーバ２０１）に接続される。そして、Ｉ／Ｆ３０６は、ネットワーク２１０と内部のインターフェースを司り、他のコンピュータからのデータの入出力を制御する。Ｉ／Ｆ３０６には、例えば、モデムやＬＡＮアダプタなどを採用することができる。 The I / F 306 is connected to the network 210 via a communication line, and is connected to another computer (for example, the server 201) via the network 210. The I / F 306 controls an internal interface with the network 210 and controls data input / output from other computers. For example, a modem or a LAN adapter may be employed as the I / F 306.

キーボード３０７は、文字、数字、各種指示などの入力のためのキーを備え、データの入力を行う。キーボード３０７は、タッチパネル式の入力パッドやテンキーなどであってもよい。マウス３０８は、カーソルの移動や範囲選択、あるいはウィンドウの移動やサイズの変更などを行う。 The keyboard 307 includes keys for inputting characters, numbers, various instructions, and the like, and inputs data. The keyboard 307 may be a touch panel type input pad or a numeric keypad. The mouse 308 performs cursor movement, range selection, window movement, size change, and the like.

スキャナ３０９は、画像を光学的に読み取り、情報処理装置１０１内に画像データを取り込む。スキャナ３０９は、ＯＣＲ機能を有していてもよい。プリンタ３１０は、画像データや文書データを印刷する。プリンタ３１０には、例えば、レーザプリンタやインクジェットプリンタを採用することができる。なお、情報処理装置１０１は、例えば、上述した構成部のうち、スキャナ３０９、プリンタ３１０などを有さないことにしてもよい。 The scanner 309 optically reads an image and takes in image data into the information processing apparatus 101. The scanner 309 may have an OCR function. The printer 310 prints image data and document data. As the printer 310, for example, a laser printer or an ink jet printer can be employed. Note that the information processing apparatus 101 may not include the scanner 309, the printer 310, or the like among the above-described components.

（サーバ２０１のハードウェア構成例）
図４は、サーバ２０１のハードウェア構成例を示すブロック図である。図４において、サーバ２０１は、ＣＰＵ４０１と、メモリ４０２と、Ｉ／Ｆ４０３と、ディスクドライブ４０４と、ディスク４０５と、を有する。また、各構成部は、バス４００によってそれぞれ接続される。 (Hardware configuration example of server 201)
FIG. 4 is a block diagram illustrating a hardware configuration example of the server 201. In FIG. 4, the server 201 includes a CPU 401, a memory 402, an I / F 403, a disk drive 404, and a disk 405. Each component is connected by a bus 400.

ここで、ＣＰＵ４０１は、サーバ２０１の全体の制御を司る。メモリ４０２は、例えば、ＲＯＭ、ＲＡＭおよびフラッシュＲＯＭなどを有する。具体的には、例えば、フラッシュＲＯＭやＲＯＭが各種プログラムを記憶し、ＲＡＭがＣＰＵ４０１のワークエリアとして使用される。メモリ４０２に記憶されるプログラムは、ＣＰＵ４０１にロードされることで、コーディングされている処理をＣＰＵ４０１に実行させる。 Here, the CPU 401 governs overall control of the server 201. The memory 402 includes, for example, a ROM, a RAM, a flash ROM, and the like. Specifically, for example, a flash ROM or ROM stores various programs, and the RAM is used as a work area of the CPU 401. The program stored in the memory 402 is loaded on the CPU 401 to cause the CPU 401 to execute the coded process.

Ｉ／Ｆ４０３は、通信回線を通じてネットワーク２１０に接続され、ネットワーク２１０を介して他のコンピュータ（例えば、図２に示した情報処理装置１０１）に接続される。そして、Ｉ／Ｆ４０３は、ネットワーク２１０と内部のインターフェースを司り、他のコンピュータからのデータの入出力を制御する。Ｉ／Ｆ４０３には、例えば、モデムやＬＡＮアダプタなどを採用することができる。 The I / F 403 is connected to the network 210 via a communication line, and is connected to another computer (for example, the information processing apparatus 101 shown in FIG. 2) via the network 210. The I / F 403 controls an internal interface with the network 210 and controls input / output of data from other computers. For example, a modem or a LAN adapter may be employed as the I / F 403.

ディスクドライブ４０４は、ＣＰＵ４０１の制御にしたがってディスク４０５に対するデータのリード／ライトを制御する。ディスク４０５は、ディスクドライブ４０４の制御で書き込まれたデータを記憶する。ディスク４０５としては、例えば、磁気ディスク、光ディスクなどが挙げられる。なお、サーバ２０１は、上述した構成部のほか、例えば、キーボード、マウス、ディスプレイなどを有することにしてもよい。 The disk drive 404 controls reading / writing of data with respect to the disk 405 according to the control of the CPU 401. The disk 405 stores data written under the control of the disk drive 404. Examples of the disk 405 include a magnetic disk and an optical disk. Note that the server 201 may include, for example, a keyboard, a mouse, a display, and the like in addition to the components described above.

（アカウントアグリゲーション情報ＤＢ２２０の記憶内容）
つぎに、情報処理装置１０１が有するアカウントアグリゲーション情報ＤＢ２２０の記憶内容について説明する。アカウントアグリゲーション情報ＤＢ２２０は、例えば、図３に示した情報処理装置１０１のメモリ３０２、ディスク３０４などの記憶装置により実現される。 (Contents stored in the account aggregation information DB 220)
Next, the contents stored in the account aggregation information DB 220 included in the information processing apparatus 101 will be described. The account aggregation information DB 220 is realized by a storage device such as the memory 302 and the disk 304 of the information processing apparatus 101 illustrated in FIG.

図５は、アカウントアグリゲーション情報ＤＢ２２０の記憶内容の一例を示す説明図である。図５において、アカウントアグリゲーション情報ＤＢ２２０は、ＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬのフィールドを有する。各フィールドに情報を設定することで、アカウントアグリゲーション情報（アカウントアグリゲーション情報５００−１〜５００−７）がレコードとして記憶される。 FIG. 5 is an explanatory diagram showing an example of the contents stored in the account aggregation information DB 220. In FIG. 5, the account aggregation information DB 220 has fields of ID, PW, login URL, and data URL. By setting information in each field, account aggregation information (account aggregation information 500-1 to 500-7) is stored as a record.

ここで、ＩＤ（ｉｄｅｎｔｉｆｉｃａｔｉｏｎ）は、サイトＳのユーザを識別する識別子である。ＰＷ（ｐａｓｓｗｏｒｄ）は、サイトＳにログインするためのユーザのパスワードである。ログインＵＲＬは、サイトＳにログインするためのＷｅｂページ（いわゆる、ログイン画面）を表示するためのＵＲＬである。 Here, ID (identification) is an identifier for identifying a user of the site S. PW (password) is a user password for logging in to the site S. The login URL is a URL for displaying a Web page (so-called login screen) for logging into the site S.

データＵＲＬは、一覧画面に表示するサイトＳの情報を含むＷｅｂページを表示するためのＵＲＬである。一覧画面は、複数のサイトＳの情報を集約して表示する画面である。ここでは、データＵＲＬは、ＣＧＩ（ＣｏｍｍｏｎＧａｔｅｗａｙＩｎｔｅｒｆａｃｅ）スクリプトのＵＲＬである。 The data URL is a URL for displaying a Web page including information on the site S to be displayed on the list screen. The list screen is a screen that aggregates and displays information on a plurality of sites S. Here, the data URL is a URL of a CGI (Common Gateway Interface) script.

例えば、アカウントアグリゲーション情報５００−１は、ＩＤ「１２３４５」、ＰＷ「Ｐ１１１１１１」、ログインＵＲＬ「Ａ．ｈｔｍｌ」およびデータＵＲＬ「Ａ／１２３４５．ｃｇｉ」を示す。 For example, the account aggregation information 500-1 indicates ID “12345”, PW “P111111”, login URL “A.html”, and data URL “A / 12345.cgi”.

（サイト別目的データ属性ＤＢ２３０の記憶内容）
つぎに、情報処理装置１０１が有するサイト別目的データ属性ＤＢ２３０の記憶内容について説明する。サイト別目的データ属性ＤＢ２３０は、例えば、情報処理装置１０１のメモリ３０２、ディスク３０４などの記憶装置により実現される。 (Storage contents of site-specific purpose data attribute DB 230)
Next, the contents stored in the site-specific purpose data attribute DB 230 of the information processing apparatus 101 will be described. The site-specific purpose data attribute DB 230 is realized by a storage device such as the memory 302 and the disk 304 of the information processing apparatus 101, for example.

図６は、サイト別目的データ属性ＤＢ２３０の記憶内容の一例を示す説明図である。図６において、サイト別目的データ属性ＤＢ２３０は、データＵＲＬ、データ特定ｈｔｍｌ属性およびデータ属性のフィールドを有する。各フィールドに情報を設定することで、サイト別目的データ属性情報（例えば、サイト別目的データ属性情報６００−１〜６００−５）がレコードとして記憶される。 FIG. 6 is an explanatory diagram showing an example of the contents stored in the site-specific purpose data attribute DB 230. In FIG. 6, the site-specific purpose data attribute DB 230 includes fields for a data URL, a data specific html attribute, and a data attribute. By setting information in each field, site-specific target data attribute information (for example, site-specific target data attribute information 600-1 to 600-5) is stored as a record.

ここで、データＵＲＬは、サイトＳの目的ページを表示するためのＵＲＬである。データ特定ｈｔｍｌ属性は、目的データを含むｈｔｍｌ要素のタグを特定するための情報である。データ属性は、目的データの属性である。データ属性としては、例えば、数値、漢字、かな、カナ、アルファベットなどがある。例えば、サイト別目的データ属性情報６００−１は、データＵＲＬ「Ａ／１２３４５．ｃｇｉ」、データ特定ｈｔｍｌ属性「ｔｄ全１２個中の４番目」およびデータ属性「数値」を示す。 Here, the data URL is a URL for displaying the target page of the site S. The data specifying html attribute is information for specifying a tag of an html element including target data. The data attribute is an attribute of the target data. Examples of data attributes include numerical values, kanji, kana, kana, and alphabet. For example, the site-specific purpose data attribute information 600-1 indicates the data URL “A / 12345.cgi”, the data specific html attribute “fourth of all td twelve”, and the data attribute “numerical value”.

（一覧情報ＤＢ２４０の記憶内容）
つぎに、情報処理装置１０１が有する一覧情報ＤＢ２４０の記憶内容について説明する。一覧情報ＤＢ２４０は、例えば、情報処理装置１０１のメモリ３０２、ディスク３０４などの記憶装置により実現される。 (Storage contents of list information DB 240)
Next, the contents stored in the list information DB 240 included in the information processing apparatus 101 will be described. The list information DB 240 is realized by a storage device such as the memory 302 and the disk 304 of the information processing apparatus 101, for example.

図７は、一覧情報ＤＢ２４０の記憶内容の一例を示す説明図である。図７において、一覧情報ＤＢ２４０は、データＵＲＬおよび一覧位置のフィールドを有し、各フィールドに情報を設定することで、一覧情報（例えば、一覧情報７００−１〜７００−５）をレコードとして記憶する。 FIG. 7 is an explanatory diagram showing an example of the contents stored in the list information DB 240. In FIG. 7, the list information DB 240 has fields of data URL and list position, and sets information in each field to store list information (for example, list information 700-1 to 700-5) as a record. .

ここで、データＵＲＬは、サイトＳの目的ページを表示するためのＵＲＬである。一覧位置は、一覧画面におけるサイトＳの目的データを表示する位置を示す情報である。ここでは、一覧位置は、一覧画面内のボックス（例えば、図８に示すボックスＢ１〜Ｂ３）の番号を示す。例えば、一覧情報７００−１は、データＵＲＬ「Ａ／１２３４５．ｃｇｉ」および一覧位置「２」を示す。 Here, the data URL is a URL for displaying the target page of the site S. The list position is information indicating a position where the target data of the site S is displayed on the list screen. Here, the list position indicates the number of a box (for example, boxes B1 to B3 shown in FIG. 8) in the list screen. For example, the list information 700-1 indicates the data URL “A / 12345.cgi” and the list position “2”.

（一覧設定画面の画面例）
つぎに、情報処理装置１０１のディスプレイ３０５に表示される一覧設定画面の画面例について説明する。一覧設定画面は、複数のサイトＳの目的データを表示する一覧画面の画面構成や掲載内容を設定する画面である。 (Example of list setting screen)
Next, a screen example of the list setting screen displayed on the display 305 of the information processing apparatus 101 will be described. The list setting screen is a screen for setting the screen configuration of the list screen displaying the target data of the plurality of sites S and the posting contents.

図８は、一覧設定画面の画面例を示す説明図である。図８において、一覧設定画面８００は、一覧画面に表示する目的データの項目名および表示位置を設定する画面である。一覧設定画面８００において、図３に示したキーボード３０７やマウス３０８を用いたユーザの操作入力により、一覧画面に表示する目的データの項目名を設定することができる。 FIG. 8 is an explanatory diagram illustrating a screen example of a list setting screen. In FIG. 8, a list setting screen 800 is a screen for setting item names and display positions of target data to be displayed on the list screen. In the list setting screen 800, the item name of the target data to be displayed on the list screen can be set by the user's operation input using the keyboard 307 and the mouse 308 shown in FIG.

図８の例では、一覧画面に表示する目的データの項目名「年金加入月数」、「年金受給（見込み）額」および「Ｘ銀行の預金残高」が設定されている。なお、「年金加入月数」と「年金受給（見込み）額」は、ある年金サイトの情報である。また、「Ｘ銀行の預金残高」は、ある銀行サイトの情報である。 In the example of FIG. 8, the item names “number of months of pension participation”, “pension receipt (expected) amount” and “bank balance of X bank” of the target data displayed on the list screen are set. The “months of pension participation” and “pension receipt (expected) amount” are information on a certain pension site. Further, “deposit balance of bank X” is information on a certain bank site.

また、一覧設定画面８００において、ユーザの操作入力により、目的データを表示するボックスを設定することができる。図８の例では、項目名「年金加入月数」の目的データを表示するボックスＢ１、項目名「年金受給（見込み）額」の目的データを表示するボックスＢ２および項目名「Ｘ銀行の預金残高」の目的データを表示するボックスＢ３が設定されている。 In the list setting screen 800, a box for displaying target data can be set by a user operation input. In the example of FIG. 8, the box B1 that displays the target data of the item name “pension enrollment months”, the box B2 that displays the target data of the item name “pension receipt (expected amount)”, and the item name “bank X deposit balance”. The box B3 for displaying the target data “is set.

（領域初期設定画面の画面例）
つぎに、情報処理装置１０１のディスプレイ３０５に表示される領域初期設定画面の画面例について説明する。領域初期設定画面は、目的ページの画面における目的データを含む領域Ｔを設定する画面である。 (Example of initial area setting screen)
Next, a screen example of the area initial setting screen displayed on the display 305 of the information processing apparatus 101 will be described. The area initial setting screen is a screen for setting an area T including target data on the screen of the target page.

図９は、領域初期設定画面の画面例を示す説明図である。図９において、領域初期設定画面９００は、年金サイトの厚生年金情報ページの画面における目的データを含む領域Ｔを設定する画面である。領域初期設定画面９００には、年金サイトの厚生年金情報ページの画面の画像データ９１０が表示されている。 FIG. 9 is an explanatory diagram illustrating a screen example of a region initial setting screen. In FIG. 9, a region initial setting screen 900 is a screen for setting a region T including target data on the screen of the welfare pension information page of the pension site. In the area initial setting screen 900, image data 910 of the screen of the welfare pension information page of the pension site is displayed.

領域初期設定画面９００において、ユーザによる領域指定の操作入力として、画像データ９１０上の目的データを含む範囲の選択を受け付けることにより、厚生年金情報ページの画面における目的データを含む領域Ｔを設定することができる。図９の例では、厚生年金情報ページの画面における目的データを含む領域として、領域Ｔ１，Ｔ２が設定されている。 In the region initial setting screen 900, by accepting selection of a range including the target data on the image data 910 as an operation input for specifying the region by the user, the region T including the target data on the screen of the welfare annuity information page is set. Can do. In the example of FIG. 9, regions T <b> 1 and T <b> 2 are set as regions including target data on the welfare pension information page screen.

ここで、領域Ｔ１は、厚生年金情報ページの加入期間［月］を示す数字列を含む領域である。領域Ｔ２は、厚生年金情報ページの年金額（見込み）［円］を示す数字列を含む領域である。また、領域初期設定画面９００において、ユーザの操作入力により、設定完了ボタン９２０がクリック（押下）されると、領域Ｔの設定が完了する。 Here, area | region T1 is an area | region containing the numerical sequence which shows the enrollment period [month] of an employee pension information page. The region T2 is a region including a numeric string indicating the annual amount (expected) [yen] of the employee pension information page. On the region initial setting screen 900, when the setting completion button 920 is clicked (pressed) by a user operation input, the setting of the region T is completed.

（一覧画面の画面例）
つぎに、情報処理装置１０１のディスプレイ３０５に表示される一覧画面の画面例について説明する。一覧画面は複数のサイトＳの目的データを集約して表示する画面である。 (Screen example of list screen)
Next, a screen example of a list screen displayed on the display 305 of the information processing apparatus 101 will be described. The list screen is a screen that aggregates and displays the target data of a plurality of sites S.

図１０は、一覧画面の画面例を示す説明図である。図１０において、一覧画面１０００は、年金サイトの厚生年金情報ページの目的データと、銀行サイトの口座情報ページの目的データとを集約して表示する画面である。具体的には、一覧画面１０００には、年金サイトの厚生年金情報ページの年金加入月数がボックスＢ１に表示され、年金受給（見込み）額がボックスＢ２に表示されている。 FIG. 10 is an explanatory diagram illustrating a screen example of a list screen. In FIG. 10, a list screen 1000 is a screen that collectively displays the purpose data of the welfare pension information page of the pension site and the purpose data of the account information page of the bank site. Specifically, on the list screen 1000, the number of months of pension participation on the welfare pension information page of the pension site is displayed in box B1, and the amount of pension receipt (expected) is displayed in box B2.

また、一覧画面１０００には、銀行サイトの口座情報ページの預金残高がボックスＢ３に表示されている。一覧画面１０００によれば、ユーザは、年金サイトの厚生年金情報ページの年金加入月数、年金受給（見込み）額および銀行サイトの口座情報ページの預金残高を一目で確認することができる。 On the list screen 1000, the deposit balance on the account information page of the bank site is displayed in a box B3. According to the list screen 1000, the user can confirm at a glance the number of months of pension participation, the pension receipt (expected) amount on the welfare pension information page of the pension site, and the deposit balance on the account information page of the bank site.

（情報処理装置１０１の機能的構成例）
図１１は、情報処理装置１０１の機能的構成例を示すブロック図である。図１１において、情報処理装置１０１は、受付部１１０１と、取得部１１０２と、登録部１１０３と、表示制御部１１０４と、認識部１１０５と、検索部１１０６と、変更部１１０７と、判定部１１０８と、特定部１１０９と、を含む構成である。受付部１１０１〜特定部１１０９は制御部となる機能であり、具体的には、例えば、図３に示したメモリ３０２、ディスク３０４などの記憶装置に記憶されたプログラムをＣＰＵ３０１に実行させることにより、または、Ｉ／Ｆ３０６により、その機能を実現する。各機能部の処理結果は、例えば、メモリ３０２、ディスク３０４などの記憶装置に記憶される。 (Functional configuration example of the information processing apparatus 101)
FIG. 11 is a block diagram illustrating a functional configuration example of the information processing apparatus 101. In FIG. 11, the information processing apparatus 101 includes a reception unit 1101, an acquisition unit 1102, a registration unit 1103, a display control unit 1104, a recognition unit 1105, a search unit 1106, a change unit 1107, and a determination unit 1108. The specifying unit 1109 is included. The receiving unit 1101 to the specifying unit 1109 are functions as control units. Specifically, for example, by causing the CPU 301 to execute a program stored in a storage device such as the memory 302 and the disk 304 illustrated in FIG. Alternatively, the function is realized by the I / F 306. The processing result of each functional unit is stored in a storage device such as the memory 302 and the disk 304, for example.

＜新規登録要求を受け付けた場合＞
まず、新規登録要求を受け付けた場合の各機能部の処理内容について説明する。新規登録要求は、一覧画面に表示する目的データを新規登録する要求である。 <When a new registration request is accepted>
First, processing contents of each functional unit when a new registration request is received will be described. The new registration request is a request for newly registering target data to be displayed on the list screen.

受付部１１０１は、新規登録要求を受け付ける。具体的には、例えば、受付部１１０１は、キーボード３０７やマウス３０８を用いたユーザの操作入力により、新規登録要求を受け付ける。また、受付部１１０１は、外部のコンピュータから新規登録要求を受信することにより、新規登録要求を受け付けることにしてもよい。 The accepting unit 1101 accepts a new registration request. Specifically, for example, the reception unit 1101 receives a new registration request by a user operation input using the keyboard 307 or the mouse 308. The receiving unit 1101 may receive a new registration request by receiving a new registration request from an external computer.

取得部１１０２は、新規登録要求を受け付けたことに応じて、サイトＳのＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬを取得する。具体的には、例えば、取得部１１０２は、ユーザの操作入力により、サイトＳのＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬを取得する。この際、取得部１１０２は、ユーザの操作入力によって目的ページまで画面遷移させることにより、目的ページのデータＵＲＬを取得することにしてもよい。 The acquisition unit 1102 acquires the ID, PW, login URL, and data URL of the site S in response to receiving a new registration request. Specifically, for example, the acquisition unit 1102 acquires the ID, PW, login URL, and data URL of the site S by a user operation input. At this time, the acquisition unit 1102 may acquire the data URL of the target page by causing a screen transition to the target page by a user operation input.

なお、サイトＳのＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬは、新規登録要求に含まれていてもよい。この場合、取得部１１０２は、受け付けられた新規登録要求から、サイトＳのＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬを取得する。 Note that the ID, PW, login URL, and data URL of the site S may be included in the new registration request. In this case, the acquisition unit 1102 acquires the ID, PW, login URL, and data URL of the site S from the accepted new registration request.

登録部１１０３は、取得されたサイトＳのＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬをアカウントアグリゲーション情報ＤＢ２２０に登録する。具体的には、例えば、登録部１１０３は、アカウントアグリゲーション情報ＤＢ２２０の各フィールドに、取得されたサイトＳのＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬを設定する。 The registration unit 1103 registers the acquired ID, PW, login URL, and data URL of the site S in the account aggregation information DB 220. Specifically, for example, the registration unit 1103 sets the acquired site S ID, PW, login URL, and data URL in each field of the account aggregation information DB 220.

これにより、アカウントアグリゲーション情報ＤＢ２２０に新たなアカウントアグリゲーション情報が新規登録される。 As a result, new account aggregation information is newly registered in the account aggregation information DB 220.

取得部１１０２は、目的ページの画面のｈｔｍｌデータを取得する。具体的には、例えば、まず、取得部１１０２は、取得したサイトＳのログインＵＲＬを用いて、サイトＳのログイン画面にアクセスする。そして、取得部１１０２は、取得したサイトＳのＩＤ、ＰＷを用いて、サイトＳにログインする。つぎに、取得部１１０２は、取得したサイトＳのデータＵＲＬを用いて、サイトＳの目的ページのｈｔｍｌデータを取得する。 The acquisition unit 1102 acquires html data of the screen of the target page. Specifically, for example, first, the acquisition unit 1102 accesses the login screen of the site S using the acquired login URL of the site S. Then, the acquisition unit 1102 logs into the site S using the acquired ID and PW of the site S. Next, the acquisition unit 1102 acquires the html data of the target page of the site S using the acquired data URL of the site S.

表示制御部１１０４は、取得された目的ページの画面のｈｔｍｌデータに基づいて、目的ページの画面の画像データを出力する。具体的には、例えば、まず、表示制御部１１０４は、取得したｈｔｍｌデータに基づいて、目的ページの画面をキャプチャすることにより、目的ページの画面の画像データを取得する。そして、表示制御部１１０４は、取得した目的ページの画面の画像データを含む領域初期設定画面（例えば、図９に示した領域初期設定画面９００）をディスプレイ３０５に表示する。 The display control unit 1104 outputs image data of the screen of the target page based on the acquired html data of the screen of the target page. Specifically, for example, first, the display control unit 1104 acquires the image data of the screen of the target page by capturing the screen of the target page based on the acquired html data. Then, the display control unit 1104 displays an area initial setting screen (for example, the area initial setting screen 900 shown in FIG. 9) including the acquired image data of the screen of the target page on the display 305.

受付部１１０１は、出力された目的ページの画面の画像データ上の目的データを含む領域Ｔの選択を受け付ける。具体的には、例えば、受付部１１０１は、領域初期設定画面９００におけるユーザの操作入力により、画像データ９１０（図９参照）上の目的データを含む領域Ｔ（例えば、領域Ｔ１，Ｔ２）の選択を受け付ける。 The accepting unit 1101 accepts selection of an area T including the target data on the image data of the output target page screen. Specifically, for example, the accepting unit 1101 selects an area T (for example, areas T1 and T2) including target data on the image data 910 (see FIG. 9) by a user operation input on the area initial setting screen 900. Accept.

認識部１１０５は、選択された領域Ｔの画像データの文字認識処理を行う。具体的には、例えば、まず、認識部１１０５は、目的ページの画像データから、選択された領域Ｔの画像データを抽出する。そして、認識部１１０５は、抽出した領域Ｔの画像データに対してＯＣＲ処理を行う。これにより、目的データのテキストデータを取得することができる。 The recognition unit 1105 performs character recognition processing on the image data of the selected region T. Specifically, for example, the recognition unit 1105 first extracts the image data of the selected region T from the image data of the target page. Then, the recognition unit 1105 performs OCR processing on the extracted image data of the region T. Thereby, the text data of the target data can be acquired.

また、認識部１１０５は、選択された領域Ｔの位置情報を取得する。ここで、領域Ｔの位置情報とは、目的ページの画像データにおける領域Ｔの位置を示す情報である。例えば、領域Ｔが矩形の場合、領域Ｔの位置情報は、矩形の対角の２頂点の座標（ｘ座標、ｙ座標）である。また、領域Ｔが円の場合、領域Ｔの位置情報は、円の中心の座標と半径である。具体的には、例えば、領域Ｔが矩形の場合、認識部１１０５は、選択された領域Ｔの左上の座標（ｘ座標，ｙ座標）と右下の座標（ｘ座標，ｙ座標）を取得する。 In addition, the recognition unit 1105 acquires the position information of the selected region T. Here, the position information of the area T is information indicating the position of the area T in the image data of the target page. For example, when the region T is a rectangle, the position information of the region T is the coordinates (x coordinate, y coordinate) of two vertices of the diagonal of the rectangle. When the region T is a circle, the position information of the region T is the coordinates and radius of the center of the circle. Specifically, for example, when the region T is rectangular, the recognition unit 1105 acquires the upper left coordinates (x coordinate, y coordinate) and the lower right coordinates (x coordinate, y coordinate) of the selected region T. .

検索部１１０６は、取得された目的ページのｈｔｍｌデータから、認識された文字あるいは文字列と同一内容のテキストデータを検索する。すなわち、検索部１１０６は、目的ページのｈｔｍｌデータから、ＯＣＲ処理により得られる目的データのテキストデータと同一内容のテキストデータを検索する。 The retrieval unit 1106 retrieves text data having the same content as the recognized character or character string from the obtained html data of the target page. That is, the search unit 1106 searches the html data of the target page for text data having the same content as the text data of the target data obtained by the OCR process.

変更部１１０７は、目的ページのｈｔｍｌデータ内の検索されたテキストデータを異なるテキストデータに変更する。具体的には、例えば、変更部１１０７は、複数のテキストデータが検索された場合、目的ページのｈｔｍｌデータ内の複数のテキストデータのいずれかのテキストデータを、所定の文字列（あるいは、文字）を示すテキストデータに変更する。所定の文字列は、任意に設定可能である。例えば、所定の文字列は、ページのｈｔｍｌデータの中に出現しにくい文字列に設定される。 The changing unit 1107 changes the searched text data in the html data of the target page to different text data. Specifically, for example, when a plurality of text data are searched, the changing unit 1107 converts any text data of the plurality of text data in the html data of the target page into a predetermined character string (or character). Change to text data indicating. The predetermined character string can be arbitrarily set. For example, the predetermined character string is set to a character string that hardly appears in the html data of the page.

認識部１１０５は、変更後の目的ページのｈｔｍｌデータに基づく目的ページの画像データ上の領域Ｔの画像データの文字認識処理を行う。具体的には、例えば、まず、認識部１１０５は、変更後のｈｔｍｌデータをメモリ３０２に展開することにより、目的ページをキャプチャして、目的ページの画像データを取得する。つぎに、認識部１１０５は、取得した領域Ｔの位置情報に基づいて、取得した目的ページの画像データから領域Ｔの画像データを抽出する。そして、認識部１１０５は、抽出した領域Ｔの画像データの文字認識処理を行う。 The recognition unit 1105 performs character recognition processing of the image data in the region T on the image data of the target page based on the html data of the target page after the change. Specifically, for example, first, the recognition unit 1105 captures the target page by developing the changed html data in the memory 302, and acquires the image data of the target page. Next, the recognition unit 1105 extracts image data of the region T from the acquired image data of the target page based on the acquired position information of the region T. Then, the recognition unit 1105 performs character recognition processing on the image data of the extracted region T.

判定部１１０８は、変更後の目的ページのｈｔｍｌデータに基づく目的ページの画像データ上の領域Ｔの画像データから得られるテキストデータが、変更した異なるテキストデータと一致するか否かを判定する。具体的には、例えば、判定部１１０８は、文字認識処理により得られたテキストデータが、所定の文字列を示すテキストデータと一致するか否かを判定する。 The determination unit 1108 determines whether or not the text data obtained from the image data in the region T on the target page image data based on the changed target page html data matches the changed different text data. Specifically, for example, the determination unit 1108 determines whether the text data obtained by the character recognition process matches text data indicating a predetermined character string.

特定部１１０９は、目的ページのｈｔｍｌデータから領域Ｔに対応するテキストデータを特定する。具体的には、例えば、特定部１１０９は、検索部１１０６によって１つのテキストデータが検索された場合は、目的ページのｈｔｍｌデータのうち、検索されたテキストデータを、領域Ｔに対応するテキストデータとして特定する。 The specifying unit 1109 specifies text data corresponding to the region T from the html data of the target page. Specifically, for example, when one text data is searched by the search unit 1106, the specifying unit 1109 sets the searched text data among the html data of the target page as text data corresponding to the region T. Identify.

一方、複数のテキストデータが検索された場合には、特定部１１０９は、判定された判定結果に基づいて、検索された複数のテキストデータから領域Ｔに対応するテキストデータを特定する。例えば、特定部１１０９は、テキストデータが一致する場合、目的ページのｈｔｍｌデータのうち、異なるテキストデータに変更したテキストデータを、領域Ｔに対応するテキストデータとして特定する。 On the other hand, when a plurality of text data are searched, the specifying unit 1109 specifies the text data corresponding to the region T from the plurality of searched text data based on the determined determination result. For example, when the text data matches, the specifying unit 1109 specifies text data changed to different text data among the html data of the target page as text data corresponding to the region T.

また、特定部１１０９は、特定した領域Ｔに対応するテキストデータに基づいて、目的ページのｈｔｍｌデータにおけるタグに関する情報を特定する。ここで、タグに関する情報とは、目的ページのｈｔｍｌデータのうち、目的データを含むｈｔｍｌ要素のタグを特定するための情報である。 Further, the specifying unit 1109 specifies information related to the tag in the html data of the target page based on the text data corresponding to the specified region T. Here, the information regarding the tag is information for specifying the tag of the html element including the target data among the html data of the target page.

具体的には、例えば、まず、特定部１１０９は、目的ページのｈｔｍｌデータから、領域Ｔに対応するテキストデータを含むｈｔｍｌ要素を検索する。そして、特定部１１０９は、目的ページのｈｔｍｌデータにおける、検索したｈｔｍｌ要素のタグのデータ特定ｈｔｍｌ属性を、タグに関する情報として特定する。 Specifically, for example, first, the specifying unit 1109 searches for html elements including text data corresponding to the region T from html data of the target page. Then, the specifying unit 1109 specifies the data specifying html attribute of the tag of the searched html element in the html data of the target page as information related to the tag.

データ特定ｈｔｍｌ属性とは、目的ページのｈｔｍｌデータにおける、目的データのテキストデータを含むｈｔｍｌ要素のタグの位置を特定するための情報である。データ特定ｈｔｍｌ属性は、例えば、タグの種類や、ｈｔｍｌデータにおける同一種類のタグ全何個中の先頭から何番目のタグであるかなどを示す。 The data specifying html attribute is information for specifying the position of the tag of the html element including the text data of the target data in the html data of the target page. The data specifying html attribute indicates, for example, the type of tag and the tag number from the top of all tags of the same type in the html data.

また、特定部１１０９は、特定した領域Ｔに対応するテキストデータのデータ属性を特定する。具体的には、例えば、認識部１１０５は、領域Ｔに対応するテキストデータを解析することにより、当該テキストデータのデータ属性（例えば、数値、漢字、かな、カナ、アルファベット）を特定する。 Further, the specifying unit 1109 specifies the data attribute of the text data corresponding to the specified region T. Specifically, for example, the recognizing unit 1105 analyzes the text data corresponding to the region T to identify the data attribute (for example, numeric value, kanji, kana, kana, alphabet) of the text data.

また、受付部１１０１は、目的データの一覧位置を受け付ける機能を有する。具体的には、例えば、受付部１１０１は、ユーザの操作入力により、目的データの一覧位置（例えば、図８に示した一覧設定画面８００のボックスの番号）を受け付ける。 The receiving unit 1101 has a function of receiving a list position of target data. Specifically, for example, the accepting unit 1101 accepts a list position of the target data (for example, a box number on the list setting screen 800 shown in FIG. 8) by a user operation input.

登録部１１０３は、目的ページのデータＵＲＬと対応付けて、特定された目的ページのｈｔｍｌデータにおけるタグに関する情報を記録する。また、登録部１１０３は、目的ページのデータＵＲＬと対応付けて、特定された領域Ｔに対応するテキストデータのデータ属性を記録する。 The registration unit 1103 records information related to the tag in the html data of the specified target page in association with the data URL of the target page. In addition, the registration unit 1103 records the data attribute of the text data corresponding to the specified region T in association with the data URL of the target page.

具体的には、例えば、登録部１１０３は、サイト別目的データ属性ＤＢ２３０の各フィールドに、データＵＲＬ、データ特定ｈｔｍｌ属性およびデータ属性を設定する。これにより、サイト別目的データ属性ＤＢ２３０に新たなサイト別目的データ属性情報が新規登録される。 Specifically, for example, the registration unit 1103 sets a data URL, a data specific html attribute, and a data attribute in each field of the site-specific purpose data attribute DB 230. As a result, new site-specific purpose data attribute information is newly registered in the site-specific purpose data attribute DB 230.

また、登録部１１０３は、目的ページのデータＵＲＬと対応付けて、受け付けた目的データの一覧位置を記録する。具体的には、例えば、登録部１１０３は、一覧情報ＤＢ２４０の各フィールドに、データＵＲＬおよび一覧位置を設定する。これにより、一覧情報ＤＢ２４０に新たな一覧情報が新規登録される。 Also, the registration unit 1103 records the list position of the received target data in association with the data URL of the target page. Specifically, for example, the registration unit 1103 sets a data URL and a list position in each field of the list information DB 240. As a result, new list information is newly registered in the list information DB 240.

なお、上述した説明では、テキストデータを変更する際の所定の文字列が設定されている場合について説明したが、これに限らない。例えば、目的ページのｈｔｍｌデータに予め設定された所定の文字列が偶然含まれる場合がある。このため、変更部１１０７は、例えば、変更前の目的ページのｈｔｍｌデータから、所定の文字列を示すテキストデータを検索し、テキストデータが検索された場合は、所定の文字列を異なる文字列に設定し直すことにしてもよい。 In the above description, a case has been described in which a predetermined character string for changing text data is set. However, the present invention is not limited to this. For example, there is a case where a predetermined character string set in advance is accidentally included in the html data of the target page. For this reason, for example, the changing unit 1107 searches text data indicating a predetermined character string from html data of the target page before the change, and when the text data is searched, the predetermined character string is changed to a different character string. You may decide to set it again.

＜一覧表示要求を受け付けた場合＞
つぎに、一覧表示要求を受け付けた場合の各機能部の処理内容について説明する。一覧表示要求は、複数のサイトＳの目的データを集約して表示する一覧画面（例えば、図１０に示した一覧画面１０００）の表示要求である。 <When a list display request is accepted>
Next, processing contents of each functional unit when a list display request is received will be described. The list display request is a display request for a list screen (for example, the list screen 1000 shown in FIG. 10) that aggregates and displays target data of a plurality of sites S.

受付部１１０１は、一覧表示要求を受け付ける。具体的には、例えば、受付部１１０１は、ユーザの操作入力により、一覧表示要求を受け付ける。また、受付部１１０１は、外部のコンピュータから一覧表示要求を受信することにより、一覧表示要求を受け付けることにしてもよい。 The accepting unit 1101 accepts a list display request. Specifically, for example, the reception unit 1101 receives a list display request by a user operation input. The receiving unit 1101 may receive a list display request by receiving a list display request from an external computer.

取得部１１０２は、一覧表示要求を受け付けたことに応じて、目的ページの画面のｈｔｍｌデータを取得する。具体的には、例えば、まず、取得部１１０２は、アカウントアグリゲーション情報ＤＢ２２０からアカウントアグリゲーション情報（レコード）を取得する。そして、取得部１１０２は、取得したアカウントアグリゲーション情報のログインＵＲＬを用いて、サイトＳのログイン画面にアクセスする。つぎに、取得部１１０２は、取得したアカウントアグリゲーション情報のＩＤ、ＰＷを用いて、サイトＳにログインする。そして、取得部１１０２は、取得したアカウントアグリゲーション情報のデータＵＲＬを用いて、サイトＳの目的ページのｈｔｍｌデータを取得する。 The acquisition unit 1102 acquires html data of the screen of the target page in response to receiving the list display request. Specifically, for example, the acquiring unit 1102 first acquires account aggregation information (record) from the account aggregation information DB 220. Then, the acquisition unit 1102 accesses the login screen of the site S using the login URL of the acquired account aggregation information. Next, the acquisition unit 1102 logs in to the site S using the ID and PW of the acquired account aggregation information. Then, the acquisition unit 1102 acquires html data of the target page of the site S using the data URL of the acquired account aggregation information.

検索部１１０６は、取得された目的ページのｈｔｍｌデータから、目的ページのデータＵＲＬと対応付けて予め記録されたタグに関する情報により特定されるデータ（テキストデータ）を検索する。具体的には、例えば、まず、検索部１１０６は、サイト別目的データ属性ＤＢ２３０から、目的ページのデータＵＲＬに対応するサイト別目的データ属性情報（レコード）を取得する。 The retrieval unit 1106 retrieves data (text data) specified by information on the tag recorded in advance in association with the data URL of the target page from the acquired html data of the target page. Specifically, for example, first, the search unit 1106 acquires site-specific purpose data attribute information (record) corresponding to the data URL of the target page from the site-specific target data attribute DB 230.

そして、検索部１１０６は、目的ページのｈｔｍｌデータから、取得したサイト別目的データ属性情報のデータ特定ｈｔｍｌ属性により特定されるデータを検索する。例えば、サイト別目的データ属性情報６００−１を取得した場合、検索部１１０６は、目的ページのｈｔｍｌデータから、ｔｄ全１２個中の４番目のｔｄのデータを検索する。 Then, the search unit 1106 searches the data specified by the data specifying html attribute of the acquired site-specific target data attribute information from the html data of the target page. For example, when the site-specific target data attribute information 600-1 is acquired, the search unit 1106 searches for the data of the fourth td out of all td 12 from the html data of the target page.

表示制御部１１０４は、検索部１１０６によってデータが検索されなかった場合、取得された目的ページの画面のｈｔｍｌデータに基づく目的ページの画面の画像データを出力する。具体的には、例えば、表示制御部１１０４は、目的ページの画面の画像データを含む領域再設定画面をディスプレイ３０５に表示する。なお、領域再設定画面の画面例については、図１２を用いて後述する。 When the search unit 1106 does not search for data, the display control unit 1104 outputs image data of the target page screen based on the acquired html data of the target page screen. Specifically, for example, the display control unit 1104 displays an area reset screen including image data of the screen of the target page on the display 305. A screen example of the area resetting screen will be described later with reference to FIG.

これにより、目的ページの画面構成や掲載内容が変更されて目的データを取得できなくなった場合に、変更後の目的ページの画面における目的データを含む領域Ｔを再設定するための領域再設定画面をディスプレイ３０５に表示することができる。 As a result, an area reset screen for resetting the area T including the target data on the screen of the target page after the change when the target page cannot be acquired due to the change in the screen configuration or posted content of the target page. It can be displayed on the display 305.

また、表示制御部１１０４は、検索部１１０６によってデータが検索された場合、当該データのデータ属性が、タグに関する情報と対応付けて予め記録されたデータ属性と一致するか否かを判断する。具体的には、例えば、表示制御部１１０４は、検索されたデータのデータ属性が、目的ページのデータＵＲＬに対応するサイト別目的データ属性情報のデータ属性と一致するか否かを判断する。 Further, when data is retrieved by the retrieval unit 1106, the display control unit 1104 determines whether or not the data attribute of the data matches the data attribute recorded in advance in association with the information regarding the tag. Specifically, for example, the display control unit 1104 determines whether the data attribute of the searched data matches the data attribute of the site-specific target data attribute information corresponding to the data URL of the target page.

そして、表示制御部１１０４は、データのデータ属性が一致しない場合、取得された目的ページの画面のｈｔｍｌデータに基づく目的ページの画面の画像データを出力することにしてもよい。これにより、目的ページの画面構成や掲載内容が変更されて領域Ｔのデータのデータ属性が変わった場合に、変更後の目的ページの画面における目的データを含む領域Ｔを再設定するための領域再設定画面をディスプレイ３０５に表示することができる。 If the data attributes of the data do not match, the display control unit 1104 may output image data of the target page screen based on the acquired html data of the target page screen. As a result, when the screen configuration or posted content of the target page is changed and the data attribute of the data of the region T is changed, the region reset for resetting the region T including the target data on the screen of the target page after the change is performed. A setting screen can be displayed on the display 305.

受付部１１０１は、出力された目的ページの画面の画像データ上の目的データを含む領域Ｔの選択を受け付ける。具体的には、例えば、受付部１１０１は、後述する領域再設定画面１２００におけるユーザの操作入力により、画像データ１２１０（図１２参照）上の目的データを含む領域Ｔの選択を受け付ける。 The accepting unit 1101 accepts selection of an area T including the target data on the image data of the output target page screen. Specifically, for example, the accepting unit 1101 accepts selection of an area T including target data on the image data 1210 (see FIG. 12) by a user operation input on an area resetting screen 1200 described later.

認識部１１０５は、選択された領域Ｔの画像データの文字認識処理を行う。文字認識処理の具体的な処理内容は、新規登録要求時と同様である。 The recognition unit 1105 performs character recognition processing on the image data of the selected region T. The specific processing content of the character recognition processing is the same as when a new registration request is made.

特定部１１０９は、目的ページのｈｔｍｌデータにおける目的データを含むｈｔｍｌ要素のタグに関する情報を特定する。タグに関する情報を特定する具体的な処理内容は、新規登録要求時と同様である。また、特定部１１０９は、文字認識処理により認識されたデータのデータ属性を特定する。データ属性を特定する具体的な処理内容は、新規登録要求時と同様である。 The specifying unit 1109 specifies information related to the tag of the html element including the target data in the html data of the target page. The specific processing content for specifying information related to the tag is the same as when a new registration is requested. Further, the specifying unit 1109 specifies the data attribute of the data recognized by the character recognition process. The specific processing content for specifying the data attribute is the same as when a new registration is requested.

登録部１１０３は、特定部１１０９によって特定されたタグに関する情報によって、目的ページのＵＲＬと対応付けて予め記録されたタグに関する情報を更新する。具体的には、例えば、登録部１１０３は、特定されたデータ特定ｈｔｍｌ属性を、目的ページのデータＵＲＬに対応するサイト別目的データ属性ＤＢ２３０内のサイト別目的データ属性情報のデータ特定ｈｔｍｌ属性に上書きする。また、登録部１１０３は、特定されたデータ属性をサイト別目的データ属性情報のデータ属性に上書きする。 The registration unit 1103 updates the information about the tag recorded in advance in association with the URL of the target page with the information about the tag specified by the specifying unit 1109. Specifically, for example, the registration unit 1103 overwrites the specified data specifying html attribute with the data specifying html attribute of the site-specific target data attribute information in the site-specific target data attribute DB 230 corresponding to the data URL of the target page. To do. Also, the registration unit 1103 overwrites the identified data attribute with the data attribute of the site-specific purpose data attribute information.

これにより、サイト別目的データ属性ＤＢ２３０内のサイト別目的データ属性が、目的ページの画面構成や掲載内容の変更に合わせて更新される。 As a result, the site-specific purpose data attribute in the site-specific purpose data attribute DB 230 is updated in accordance with the change in the screen configuration of the target page and the posted content.

表示制御部１１０４は、一覧画面における、領域Ｔの位置情報と対応付けて予め記録された位置に、検索部１１０６によって検索されたデータを挿入した一覧画面を出力する。具体的には、例えば、まず、表示制御部１１０４は、一覧設定画面８００（図８参照）のｈｔｍｌデータに基づいて、目的データが挿入されていない一覧画面１０００のｈｔｍｌデータを生成する。 The display control unit 1104 outputs a list screen in which the data searched by the search unit 1106 is inserted at a position recorded in advance in association with the position information of the region T on the list screen. Specifically, for example, first, the display control unit 1104 generates html data of the list screen 1000 in which the target data is not inserted, based on the html data of the list setting screen 800 (see FIG. 8).

つぎに、表示制御部１１０４は、目的ページのデータＵＲＬに対応するサイト別目的データ属性ＤＢ２３０内のサイト別目的データ属性情報の一覧位置を特定する。そして、表示制御部１１０４は、特定した一覧位置に、検索されたデータを挿入した一覧画面１０００のｈｔｍｌデータを生成してディスプレイ３０５に表示する。これにより、複数のサイトＳの目的データを集約して表示する一覧画面をディスプレイ３０５に表示することができる。 Next, the display control unit 1104 specifies the list position of the site-specific target data attribute information in the site-specific target data attribute DB 230 corresponding to the data URL of the target page. Then, the display control unit 1104 generates html data of the list screen 1000 in which the searched data is inserted at the specified list position and displays it on the display 305. As a result, a list screen can be displayed on the display 305 that aggregates and displays the target data of the plurality of sites S.

なお、上述した説明では、情報処理装置１０１が各機能部１１０１〜１１０９を有することにしたが、サーバ２０１が各機能部１１０１〜１１０９を有することにしてもよい。具体的には、例えば、必要な機能を必要な分だけサービスとして情報処理装置１０１に利用できるようにしたＳａａＳ（ＳｏｆｔｗａｒｅａｓａＳｅｒｖｉｃｅ）により、システム２００を実現することにしてもよい。 In the above description, the information processing apparatus 101 has the function units 1101 to 1109. However, the server 201 may have the function units 1101 to 1109. Specifically, for example, the system 200 may be realized by SaaS (Software as a Service) that enables the information processing apparatus 101 to use a necessary function as a required service.

（領域再設定画面の画面例）
つぎに、情報処理装置１０１のディスプレイ３０５に表示される領域再設定画面の画面例について説明する。領域再設定画面は、目的ページの画面における目的データを含む領域Ｔを再設定する画面である。 (Screen example of area reset screen)
Next, a screen example of the area resetting screen displayed on the display 305 of the information processing apparatus 101 will be described. The area reset screen is a screen for resetting the area T including the target data on the screen of the target page.

図１２は、領域再設定画面の画面例を示す説明図である。図１２において、領域再設定画面１２００は、年金サイトの厚生年金情報ページの画面における目的データを含む領域Ｔ２を再設定する画面である。領域再設定画面１２００には、年金サイトの厚生年金情報ページの画面の画像データ１２１０が表示されている。 FIG. 12 is an explanatory diagram illustrating a screen example of the area resetting screen. In FIG. 12, an area resetting screen 1200 is a screen for resetting an area T2 including target data on the welfare pension information page screen of the pension site. On the area resetting screen 1200, image data 1210 of the screen of the welfare pension information page of the pension site is displayed.

領域再設定画面１２００において、ユーザによる領域指定の操作入力として、画像データ１２１０上の任意の範囲の選択を受け付けることにより、厚生年金情報ページの画面における目的データを含む領域Ｔ２を再設定することができる。 In the area resetting screen 1200, as an operation input for specifying the area by the user, by accepting selection of an arbitrary range on the image data 1210, the area T2 including the target data on the screen of the welfare annuity information page can be reset. it can.

図１２の例では、ユーザの操作入力により、厚生年金情報ページの画面における年金額（見込み）［円］を示す数字列を含む領域Ｔ２が再設定されている。また、領域再設定画面１２００において、ユーザの操作入力により、設定完了ボタン１２２０がクリック（押下）されると、領域Ｔ２の再設定が完了する。 In the example of FIG. 12, the region T2 including a numeric string indicating the annual amount (expected) [yen] on the screen of the welfare annuity information page is reset by the user's operation input. On the region reset screen 1200, when the setting completion button 1220 is clicked (pressed) by a user operation input, the resetting of the region T2 is completed.

このように、領域再設定画面１２００によれば、年金サイトの厚生年金情報ページの画面における目的データを含む領域Ｔ２を再設定することができる。 Thus, according to the area resetting screen 1200, the area T2 including the target data on the screen of the welfare pension information page of the pension site can be reset.

（情報処理装置１０１の情報提供処理手順）
つぎに、情報処理装置１０１の情報提供処理手順について説明する。 (Information provision processing procedure of information processing apparatus 101)
Next, an information provision processing procedure of the information processing apparatus 101 will be described.

図１３は、情報処理装置１０１の情報提供処理手順の一例を示すフローチャートである。図１３のフローチャートにおいて、まず、情報処理装置１０１は、新規登録要求を受け付けたか否かを判断する（ステップＳ１３０１）。 FIG. 13 is a flowchart illustrating an example of an information provision processing procedure of the information processing apparatus 101. In the flowchart of FIG. 13, first, the information processing apparatus 101 determines whether or not a new registration request has been received (step S1301).

ここで、新規登録要求を受け付けた場合（ステップＳ１３０１：Ｙｅｓ）、情報処理装置１０１は、新規登録処理を実行して（ステップＳ１３０２）。本フローチャートによる一連の処理を終了する。新規登録処理の具体的な処理手順については、図１４および図１５のフローチャートを用いて後述する。 If a new registration request is accepted (step S1301: Yes), the information processing apparatus 101 executes a new registration process (step S1302). A series of processing by this flowchart is complete | finished. A specific processing procedure of the new registration processing will be described later with reference to the flowcharts of FIGS.

一方、新規登録要求を受け付けていない場合（ステップＳ１３０１：Ｎｏ）、情報処理装置１０１は、一覧表示要求を受け付けたか否かを判断する（ステップＳ１３０３）。ここで、一覧表示要求を受け付けていない場合（ステップＳ１３０３：Ｎｏ）、情報処理装置１０１は、ステップＳ１３０１に戻る。 On the other hand, if a new registration request has not been received (step S1301: No), the information processing apparatus 101 determines whether a list display request has been received (step S1303). Here, when the list display request is not received (step S1303: No), the information processing apparatus 101 returns to step S1301.

一方、一覧表示要求を受け付けた場合（ステップＳ１３０３：Ｙｅｓ）、情報処理装置１０１は、一覧表示処理を実行して（ステップＳ１３０４）。本フローチャートによる一連の処理を終了する。一覧表示処理の具体的な処理手順については、図１６のフローチャートを用いて後述する。 On the other hand, when a list display request is received (step S1303: Yes), the information processing apparatus 101 executes list display processing (step S1304). A series of processing by this flowchart is complete | finished. A specific processing procedure of the list display processing will be described later with reference to a flowchart of FIG.

＜新規登録処理の具体的処理手順＞
つぎに、図１３に示したステップＳ１３０２の新規登録処理の具体的な処理手順について説明する。 <Specific processing procedure of new registration processing>
Next, a specific processing procedure of the new registration processing in step S1302 shown in FIG. 13 will be described.

図１４および図１５は、新規登録処理の具体的処理手順の一例を示すフローチャートである。図１４のフローチャートにおいて、まず、情報処理装置１０１は、サイトＳのＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬを取得する（ステップＳ１４０１）。 14 and 15 are flowcharts showing an example of a specific processing procedure of the new registration processing. In the flowchart of FIG. 14, first, the information processing apparatus 101 acquires the ID, PW, login URL, and data URL of the site S (step S1401).

そして、情報処理装置１０１は、取得したサイトＳのＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬをアカウントアグリゲーション情報ＤＢ２２０に登録する（ステップＳ１４０２）。これにより、アカウントアグリゲーション情報ＤＢ２２０に新たなアカウントアグリゲーション情報が新規登録される。 The information processing apparatus 101 registers the acquired ID, PW, login URL, and data URL of the site S in the account aggregation information DB 220 (step S1402). As a result, new account aggregation information is newly registered in the account aggregation information DB 220.

つぎに、情報処理装置１０１は、取得したサイトＳのＩＤ、ＰＷ、ログインＵＲＬおよびデータＵＲＬを用いて、サイトＳの目的ページのｈｔｍｌデータを取得する（ステップＳ１４０３）。そして、情報処理装置１０１は、取得したｈｔｍｌデータに基づいて、目的ページをキャプチャすることにより、目的ページの画像データを取得する（ステップＳ１４０４）。 Next, the information processing apparatus 101 acquires html data of the target page of the site S using the acquired ID, PW, login URL, and data URL of the site S (step S1403). Then, the information processing apparatus 101 acquires the target page image data by capturing the target page based on the acquired html data (step S1404).

つぎに、情報処理装置１０１は、取得した目的ページの画像データを含む領域初期設定画面をディスプレイ３０５に表示する（ステップＳ１４０５）。そして、情報処理装置１０１は、ユーザの操作入力により、目的ページの画像データ上の目的データを含む領域Ｔが選択されたか否かを判断する（ステップＳ１４０６）。 Next, the information processing apparatus 101 displays a region initial setting screen including the acquired image data of the target page on the display 305 (step S1405). Then, the information processing apparatus 101 determines whether or not the region T including the target data on the image data of the target page has been selected by the user's operation input (step S1406).

ここで、情報処理装置１０１は、領域Ｔが選択されるのを待つ（ステップＳ１４０６：Ｎｏ）。そして、領域Ｔが選択された場合（ステップＳ１４０６：Ｙｅｓ）、情報処理装置１０１は、目的ページの画像データ上の選択された領域Ｔの位置情報を取得する（ステップＳ１４０７）。 Here, the information processing apparatus 101 waits for the area T to be selected (step S1406: No). If the area T is selected (step S1406: Yes), the information processing apparatus 101 acquires the position information of the selected area T on the image data of the target page (step S1407).

つぎに、情報処理装置１０１は、目的ページの画像データから領域Ｔの画像データを抽出して、領域Ｔの画像データのＯＣＲ処理を行うことにより、目的データのテキストデータを取得する（ステップＳ１４０８）。以下の説明では、ＯＣＲ処理により得られたテキストデータを「領域データ」と表記する場合がある。 Next, the information processing apparatus 101 extracts the image data of the region T from the image data of the target page and performs the OCR process on the image data of the region T, thereby acquiring the text data of the target data (step S1408). . In the following description, text data obtained by the OCR process may be referred to as “region data”.

そして、情報処理装置１０１は、目的ページのｈｔｍｌデータから、領域データと同一内容のテキストデータを検索する（ステップＳ１４０９）。つぎに、情報処理装置１０１は、検索ヒット件数が「１」でないかを判断する（ステップＳ１４１０）。検索ヒット件数は、ステップＳ１４０９において検索されたテキストデータの数である。 The information processing apparatus 101 searches the html data of the target page for text data having the same content as the area data (step S1409). Next, the information processing apparatus 101 determines whether the number of search hits is “1” (step S1410). The number of search hits is the number of text data searched in step S1409.

ここで、検索ヒット件数が「１」の場合（ステップＳ１４１０：Ｎｏ）、情報処理装置１０１は、図１５に示すステップＳ１５０８に移行する。一方、検索ヒット件数が「１」でない場合（ステップＳ１４１０：Ｙｅｓ）、情報処理装置１０１は、検索ヒット件数が「０」であるかを判断する（ステップＳ１４１１）。 If the number of search hits is “1” (step S1410: NO), the information processing apparatus 101 proceeds to step S1508 shown in FIG. On the other hand, when the number of search hits is not “1” (step S1410: Yes), the information processing apparatus 101 determines whether the number of search hits is “0” (step S1411).

ここで、検索ヒット件数が「０」の場合（ステップＳ１４１１：Ｙｅｓ）、情報処理装置１０１は、ユーザの操作入力により領域データの入力を受け付けて（ステップＳ１４１２）、ステップＳ１４０９に戻る。すなわち、検索ヒット件数が「０」の場合は、情報処理装置１０１は、目的データを認識できなかったと判断して、目的データのテキストデータ（領域データ）をユーザに手入力させる。 If the number of search hits is “0” (step S1411: YES), the information processing apparatus 101 accepts an input of area data by a user operation input (step S1412) and returns to step S1409. That is, when the number of search hits is “0”, the information processing apparatus 101 determines that the target data has not been recognized, and causes the user to manually input text data (region data) of the target data.

一方、検索ヒット件数が「０」でない場合（ステップＳ１４１１：Ｎｏ）、情報処理装置１０１は、図１５に示すステップＳ１５０１に移行する。以下の説明では、ステップＳ１４０９において検索されたテキストデータを「検索ヒットデータ」と表記する場合がある。 On the other hand, when the number of search hits is not “0” (step S1411: No), the information processing apparatus 101 proceeds to step S1501 illustrated in FIG. In the following description, the text data searched in step S1409 may be referred to as “search hit data”.

図１５のフローチャートにおいて、まず、情報処理装置１０１は、「ｍ＝１」として（ステップＳ１５０１）、目的ページのｈｔｍｌデータの先頭からｍ番目の検索ヒットデータを、所定の文字列を示すテキストデータに変更する（ステップＳ１５０２）。 In the flowchart of FIG. 15, first, the information processing apparatus 101 sets “m = 1” (step S1501), and converts the m-th search hit data from the top of the html data of the target page into text data indicating a predetermined character string. Change (step S1502).

つぎに、情報処理装置１０１は、変更後の目的ページのｈｔｍｌデータをメモリ３０２に展開することにより、変更後の目的ページをキャプチャして、変更後の目的ページの画像データを取得する（ステップＳ１５０３）。 Next, the information processing apparatus 101 expands the html data of the changed target page in the memory 302, thereby capturing the changed target page and acquiring the changed target page image data (step S1503). ).

そして、情報処理装置１０１は、ステップＳ１４０７において取得された領域Ｔの位置情報に基づいて、変更後の目的ページの画像データから領域Ｔの画像データを抽出して、領域Ｔの画像データのＯＣＲ処理を行うことにより領域データを取得する（ステップＳ１５０４）。つぎに、情報処理装置１０１は、ステップＳ１５０４のＯＣＲ処理により得られた領域データが、所定の文字列を示すテキストデータと一致するか否かを判定する（ステップＳ１５０５）。 Then, the information processing apparatus 101 extracts the image data of the area T from the image data of the target page after the change based on the position information of the area T acquired in step S1407, and performs the OCR process on the image data of the area T To obtain area data (step S1504). Next, the information processing apparatus 101 determines whether or not the area data obtained by the OCR process in step S1504 matches text data indicating a predetermined character string (step S1505).

ここで、所定の文字列を示すテキストデータと一致しない場合（ステップＳ１５０５：Ｎｏ）、情報処理装置１０１は、変更後の検索ヒットデータを、変更前の検索ヒットデータに変更する（ステップＳ１５０６）。そして、情報処理装置１０１は、「ｍ」をインクリメントして（ステップＳ１５０７）、ステップＳ１５０２に戻る。 If the data does not match the text data indicating the predetermined character string (step S1505: No), the information processing apparatus 101 changes the changed search hit data to the search hit data before the change (step S1506). The information processing apparatus 101 increments “m” (step S1507) and returns to step S1502.

一方、ステップＳ１５０５において、所定の文字列を示すテキストデータと一致する場合（ステップＳ１５０５：Ｙｅｓ）、情報処理装置１０１は、ｍ番目の検索ヒットデータを目的データのテキストデータとして特定する（ステップＳ１５０８）。 On the other hand, in step S1505, when the data matches the text data indicating the predetermined character string (step S1505: Yes), the information processing apparatus 101 identifies the mth search hit data as the text data of the target data (step S1508). .

そして、情報処理装置１０１は、領域データのデータ属性を特定する（ステップＳ１５０９）。つぎに、情報処理装置１０１は、変更前の目的ページのｈｔｍｌデータから領域データを含むｈｔｍｌ要素を検索することにより、目的ページのｈｔｍｌデータにおけるｈｔｍｌ要素のタグのデータ特定ｈｔｍｌ属性を特定する（ステップＳ１５１０）。 Then, the information processing apparatus 101 identifies the data attribute of the area data (step S1509). Next, the information processing apparatus 101 specifies the data specification html attribute of the tag of the html element in the html data of the target page by searching the html element including the region data from the html data of the target page before the change (step). S1510).

そして、情報処理装置１０１は、目的ページのデータＵＲＬと対応付けて、特定したデータ特定ｈｔｍｌ属性およびデータ属性をサイト別目的データ属性ＤＢ２３０に登録して（ステップＳ１５１１）、新規登録処理を呼び出したステップに戻る。これにより、サイト別目的データ属性ＤＢ２３０に新たなサイト別目的データ属性情報が新規登録される。 The information processing apparatus 101 registers the specified data specific html attribute and data attribute in the site-specific target data attribute DB 230 in association with the data URL of the target page (step S1511), and calls the new registration process. Return to. As a result, new site-specific purpose data attribute information is newly registered in the site-specific purpose data attribute DB 230.

なお、目的ページのデータＵＲＬに対応する一覧位置については、一覧設定画面（例えば、一覧設定画面８００）において、ユーザの操作入力により受け付けることにより、目的ページのデータＵＲＬに対応付けて一覧情報ＤＢ２４０に設定される。 The list position corresponding to the data URL of the target page is accepted by the user's operation input on the list setting screen (for example, the list setting screen 800), and is associated with the data URL of the target page in the list information DB 240. Is set.

＜一覧表示処理の具体的処理手順＞
つぎに、図１３に示したステップＳ１３０４の一覧表示処理の具体的な処理手順について説明する。 <Specific processing procedure of list display processing>
Next, a specific processing procedure of the list display processing in step S1304 shown in FIG. 13 will be described.

図１６は、一覧表示処理の具体的処理手順の一例を示すフローチャートである。図１６のフローチャートにおいて、まず、情報処理装置１０１は、目的データが挿入されていない一覧画面のｈｔｍｌデータを生成する（ステップＳ１６０１）。 FIG. 16 is a flowchart illustrating an example of a specific processing procedure of the list display processing. In the flowchart of FIG. 16, first, the information processing apparatus 101 generates html data of a list screen in which target data is not inserted (step S1601).

つぎに、情報処理装置１０１は、アカウントアグリゲーション情報ＤＢ２２０のレコード数ｎを取得して（ステップＳ１６０２）、「ｉ＝１」とする（ステップＳ１６０３）。そして、情報処理装置１０１は、目的データ設定処理を実行する（ステップＳ１６０４）。目的データ設定処理の具体的な処理手順については、図１７のフローチャートを用いて後述する。 Next, the information processing apparatus 101 acquires the number n of records in the account aggregation information DB 220 (step S1602) and sets “i = 1” (step S1603). The information processing apparatus 101 executes target data setting processing (step S1604). A specific processing procedure of the target data setting process will be described later with reference to a flowchart of FIG.

つぎに、情報処理装置１０１は、「ｉ」をインクリメントして（ステップＳ１６０５）、「ｉ」が「ｎ」より大きくなったか否かを判断する（ステップＳ１６０６）。ここで、「ｉ」が「ｎ」以下の場合（ステップＳ１６０６：Ｎｏ）、情報処理装置１０１は、ステップＳ１６０４に戻る。 Next, the information processing apparatus 101 increments “i” (step S1605), and determines whether “i” is greater than “n” (step S1606). If “i” is equal to or smaller than “n” (step S1606: NO), the information processing apparatus 101 returns to step S1604.

一方、「ｉ」が「ｎ」より大きくなった場合（ステップＳ１６０６：Ｙｅｓ）、情報処理装置１０１は、一覧画面のｈｔｍｌデータをディスプレイ３０５に表示して（ステップＳ１６０７）、一覧表示処理を呼び出したステップに戻る。 On the other hand, when “i” becomes larger than “n” (step S1606: Yes), the information processing apparatus 101 displays the html data of the list screen on the display 305 (step S1607) and calls the list display process. Return to step.

これにより、複数のサイトＳの目的データを集約した一覧画面（例えば、一覧画面１０００）をディスプレイ３０５に表示することができる。 Thereby, a list screen (for example, list screen 1000) in which target data of a plurality of sites S are aggregated can be displayed on the display 305.

＜目的データ設定処理の具体的処理手順＞
つぎに、図１６に示したステップＳ１６０４の目的データ設定処理の具体的な処理手順について説明する。 <Specific processing procedure of target data setting processing>
Next, a specific processing procedure of the target data setting process in step S1604 shown in FIG. 16 will be described.

図１７は、目的データ設定処理の具体的処理手順の一例を示すフローチャートである。図１７のフローチャートにおいて、まず、情報処理装置１０１は、アカウントアグリゲーション情報ＤＢ２２０のｉ番目のレコード（以下、「レコードＲａ」と称する）を取得する（ステップＳ１７０１）。 FIG. 17 is a flowchart illustrating an example of a specific processing procedure of the target data setting process. In the flowchart of FIG. 17, the information processing apparatus 101 first acquires the i-th record (hereinafter referred to as “record Ra”) of the account aggregation information DB 220 (step S1701).

つぎに、情報処理装置１０１は、取得したレコードＲａのログインＵＲＬを用いて、サイトＳのログイン画面にアクセスし、レコードＲａのＩＤ、ＰＷを用いて、サイトＳにログインする（ステップＳ１７０２）。そして、情報処理装置１０１は、レコードＲａのデータＵＲＬを用いて、サイトＳの目的ページのｈｔｍｌデータを取得する（ステップＳ１７０３）。 Next, the information processing apparatus 101 accesses the login screen of the site S using the acquired login URL of the record Ra, and logs in to the site S using the ID and PW of the record Ra (step S1702). Then, the information processing apparatus 101 acquires the html data of the target page of the site S using the data URL of the record Ra (step S1703).

つぎに、情報処理装置１０１は、サイト別目的データ属性ＤＢ２３０から、取得したレコードＲａのデータＵＲＬに対応するレコード（以下、「レコードＲｂ」と称する）を取得する（ステップＳ１７０４）。そして、情報処理装置１０１は、取得した目的ページのｈｔｍｌデータから、取得したレコードＲｂのデータ特定ｈｔｍｌ属性により特定されるデータを検索する（ステップＳ１７０５）。 Next, the information processing apparatus 101 acquires a record (hereinafter referred to as “record Rb”) corresponding to the data URL of the acquired record Ra from the site-specific objective data attribute DB 230 (step S1704). Then, the information processing apparatus 101 searches the data specified by the data specification html attribute of the acquired record Rb from the acquired html data of the target page (step S1705).

つぎに、情報処理装置１０１は、目的ページのｈｔｍｌデータからデータが検索されたか否かを判断する（ステップＳ１７０６）。ここで、データが検索された場合（ステップＳ１７０６：Ｙｅｓ）、情報処理装置１０１は、検索したデータのデータ属性を特定する（ステップＳ１７０７）。 Next, the information processing apparatus 101 determines whether data is retrieved from the html data of the target page (step S1706). Here, when the data is searched (step S1706: Yes), the information processing apparatus 101 specifies the data attribute of the searched data (step S1707).

そして、情報処理装置１０１は、特定したデータ属性がレコードＲｂのデータ属性と一致するか否かを判断する（ステップＳ１７０８）。ここで、データ属性が一致する場合（ステップＳ１７０８：Ｙｅｓ）、情報処理装置１０１は、一覧情報ＤＢ２４０から、レコードＲａのデータＵＲＬに対応する一覧位置を取得する（ステップＳ１７０９）。 The information processing apparatus 101 determines whether the specified data attribute matches the data attribute of the record Rb (step S1708). If the data attributes match (step S1708: YES), the information processing apparatus 101 acquires a list position corresponding to the data URL of the record Ra from the list information DB 240 (step S1709).

そして、情報処理装置１０１は、特定した一覧位置に基づいて、一覧画面のｈｔｍｌデータに、検索したデータを挿入して（ステップＳ１７１０）、目的データ設定処理を呼び出したステップに戻る。これにより、予め設定された一覧位置にサイトＳの目的データを埋め込んだ一覧画面のｈｔｍｌデータを生成することができる。 The information processing apparatus 101 inserts the retrieved data into the html data on the list screen based on the identified list position (step S1710), and returns to the step that called the target data setting process. As a result, it is possible to generate html data of a list screen in which the target data of the site S is embedded at a preset list position.

また、ステップＳ１７０６において、データが検索されなかった場合（ステップＳ１７０６：Ｎｏ）、情報処理装置１０１は、領域再設定画面表示処理を実行して（ステップＳ１７１１）、ステップＳ１７０１に戻る。領域再設定画面表示処理の具体的な処理手順については、図１８のフローチャートを用いて後述する。 If no data is retrieved in step S1706 (step S1706: No), the information processing apparatus 101 executes region reset screen display processing (step S1711) and returns to step S1701. A specific processing procedure of the area reset screen display process will be described later with reference to the flowchart of FIG.

また、ステップＳ１７０８において、データ属性が一致しない場合（ステップＳ１７０８：Ｎｏ）、情報処理装置１０１は、ステップＳ１７１１に移行する。 If the data attributes do not match in step S1708 (step S1708: No), the information processing apparatus 101 proceeds to step S1711.

＜領域再設定画面表示処理の具体的処理手順＞
つぎに、図１７に示したステップＳ１７１１の領域再設定画面表示処理の具体的な処理手順について説明する。 <Specific processing procedure of area reset screen display processing>
Next, a specific processing procedure of the area reset screen display process in step S1711 shown in FIG. 17 will be described.

図１８は、領域再設定画面表示処理の具体的処理手順の一例を示すフローチャートである。図１８のフローチャートにおいて、まず、情報処理装置１０１は、目的ページのｈｔｍｌデータに基づいて、目的ページをキャプチャすることにより、目的ページの画像データを取得する（ステップＳ１８０１）。 FIG. 18 is a flowchart illustrating an example of a specific processing procedure of the area reset screen display process. In the flowchart of FIG. 18, first, the information processing apparatus 101 acquires image data of the target page by capturing the target page based on the html data of the target page (step S1801).

そして、情報処理装置１０１は、取得した目的ページの画面の画像データを含む領域再設定画面をディスプレイ３０５に表示する（ステップＳ１８０２）。つぎに、情報処理装置１０１は、ユーザの操作入力により、目的ページの画像データ上の目的データを含む領域Ｔが選択されたか否かを判断する（ステップＳ１８０３）。 Then, the information processing apparatus 101 displays an area resetting screen including the image data of the acquired screen of the target page on the display 305 (step S1802). Next, the information processing apparatus 101 determines whether or not the region T including the target data on the image data of the target page has been selected by the user's operation input (step S1803).

ここで、情報処理装置１０１は、領域Ｔが選択されるのを待つ（ステップＳ１８０３：Ｎｏ）。そして、領域Ｔが選択された場合（ステップＳ１８０３：Ｙｅｓ）、情報処理装置１０１は、目的ページの画像データから領域Ｔの画像データを抽出して、領域Ｔの画像データのＯＣＲ処理を行うことにより領域データを取得する（ステップＳ１８０４）。 Here, the information processing apparatus 101 waits for the area T to be selected (step S1803: No). When the region T is selected (step S1803: Yes), the information processing apparatus 101 extracts the image data of the region T from the image data of the target page and performs OCR processing on the image data of the region T. Area data is acquired (step S1804).

つぎに、情報処理装置１０１は、取得した領域データのデータ属性を特定する（ステップＳ１８０５）。そして、情報処理装置１０１は、目的ページのｈｔｍｌデータから、取得した領域データを内容に含むｈｔｍｌ要素を検索する（ステップＳ１８０６）。つぎに、情報処理装置１０１は、検索したｈｔｍｌ要素のタグのデータ特定ｈｔｍｌ属性を特定する（ステップＳ１８０７）。 Next, the information processing apparatus 101 identifies the data attribute of the acquired area data (step S1805). The information processing apparatus 101 searches the html data of the target page for an html element that includes the acquired region data (step S1806). Next, the information processing apparatus 101 specifies the data specifying html attribute of the tag of the searched html element (step S1807).

そして、情報処理装置１０１は、特定したデータ属性およびデータ特定ｈｔｍｌ属性をレコードＲｂに上書きすることにより、サイト別目的データ属性ＤＢ２３０を更新して（ステップＳ１８０８）、領域再設定画面表示処理を呼び出したステップに戻る。これにより、目的ページの画面構成や掲載内容の変更に合わせて、サイト別目的データ属性ＤＢ２３０の記憶内容を更新することができる。 Then, the information processing apparatus 101 updates the site-specific purpose data attribute DB 230 by overwriting the specified data attribute and the data specific html attribute on the record Rb (step S1808), and calls the area reset screen display process. Return to step. Thereby, the storage content of the site-specific purpose data attribute DB 230 can be updated in accordance with the change of the screen configuration of the target page and the posted content.

以上説明したように、実施の形態にかかる情報処理装置１０１によれば、目的ページのｈｔｍｌデータから、目的ページの画像データ上に設定された領域Ｔの画像データから得られるテキストデータと同一内容のテキストデータを検索することができる。これにより、目的ページのｈｔｍｌデータから、目的データと同一内容のテキストデータを検索することができる。 As described above, according to the information processing apparatus 101 according to the embodiment, the same content as the text data obtained from the image data of the region T set on the image data of the target page from the html data of the target page. Text data can be searched. Thereby, text data having the same content as the target data can be searched from the html data of the target page.

また、情報処理装置１０１によれば、複数のテキストデータが検索された場合、目的ページのｈｔｍｌデータ内の複数のテキストデータのいずれかのテキストデータを異なるテキストデータに変更することができる。また、情報処理装置１０１によれば、変更後の目的ページのｈｔｍｌデータに基づく目的ページの画像データ上の領域Ｔの画像データから得られるテキストデータが、変更した異なるテキストデータと一致するか否かを判定することができる。 Further, according to the information processing apparatus 101, when a plurality of text data is searched, any one of the plurality of text data in the html data of the target page can be changed to different text data. Further, according to the information processing apparatus 101, whether or not the text data obtained from the image data in the region T on the image data of the target page based on the html data of the target page after the change matches the changed different text data. Can be determined.

また、情報処理装置１０１によれば、変更した異なるテキストデータと一致する場合、目的ページのｈｔｍｌデータのうち、異なるテキストデータに変更したテキストデータを、領域Ｔに対応するテキストデータとして特定することができる。これにより、目的データと同一内容のテキストデータが複数存在する場合であっても、目的ページのｈｔｍｌデータにおける目的データの位置を正確に特定することができる。 Further, according to the information processing apparatus 101, when the text data matches the changed different text data, the text data changed to the different text data among the html data of the target page can be specified as the text data corresponding to the region T. it can. Thereby, even when there are a plurality of text data having the same content as the target data, the position of the target data in the html data of the target page can be accurately specified.

また、情報処理装置１０１によれば、特定した領域Ｔに対応するテキストデータにより特定される、目的ページのｈｔｍｌデータにおけるタグに関する情報を、目的ページのデータＵＲＬと対応付けて記録することができる。 Further, according to the information processing apparatus 101, information on the tag in the html data of the target page specified by the text data corresponding to the specified region T can be recorded in association with the data URL of the target page.

また、情報処理装置１０１によれば、取得した目的ページのｈｔｍｌデータから、記録したタグに関する情報により特定されるデータを検索することができる。また、情報処理装置１０１によれば、データが検索されなかった場合に、取得した目的ページのｈｔｍｌデータに基づく目的ページの画像データを含む領域再設定画面（例えば、領域再設定画面１２００）をディスプレイ３０５に表示することができる。 Further, according to the information processing apparatus 101, it is possible to search the data specified by the information about the recorded tag from the obtained html data of the target page. Further, according to the information processing apparatus 101, when data is not searched, an area reset screen (for example, the area reset screen 1200) including the image data of the target page based on the acquired html data of the target page is displayed. 305 can be displayed.

これにより、タグに関する情報により特定されるデータを検索できたか否かによって、目的ページの画面構成や掲載内容の変更によりユーザの意図通りの情報を取得できなくなったか否かを判断することができる。また、ユーザの意図通りの情報を取得できなくなった場合に、目的ページのどの部分の情報を取得するのかについての再設定をしやすくして設定変更にかかる手間を削減することができる。 As a result, it is possible to determine whether or not information as intended by the user can no longer be acquired due to a change in the screen configuration of the target page or the posted content, depending on whether or not the data specified by the information related to the tag can be searched. Further, when it becomes impossible to acquire information as intended by the user, it is easy to re-set which part of the target page information is to be acquired, and it is possible to reduce the trouble of changing the setting.

また、情報処理装置１０１によれば、出力した目的ページの画像データ上に設定された領域Ｔのデータにより特定されるタグに関する情報によって、記録したタグに関する情報を更新することができる。これにより、目的ページの画面構成や掲載内容の変更に合わせて、サイト別目的データ属性ＤＢ２３０の記憶内容を更新することができる。 Further, according to the information processing apparatus 101, the information related to the recorded tag can be updated with the information related to the tag specified by the data of the region T set on the output image data of the target page. Thereby, the storage content of the site-specific purpose data attribute DB 230 can be updated in accordance with the change of the screen configuration of the target page and the posted content.

また、情報処理装置１０１によれば、データが検索された場合、当該データのデータ属性が、タグに関する情報と対応付けて予め記録されたデータ属性と一致するか否かを判断することができる。また、情報処理装置１０１によれば、予め記録されたデータ属性と一致しない場合に、取得した目的ページの画像データを含む領域再設定画面をディスプレイ３０５に表示することができる。 Further, according to the information processing apparatus 101, when data is searched, it is possible to determine whether or not the data attribute of the data matches the data attribute recorded in advance in association with the information related to the tag. Further, according to the information processing apparatus 101, when the data attribute does not match the pre-recorded data attribute, the area reset screen including the acquired image data of the target page can be displayed on the display 305.

これにより、データが検索されても、検索されたデータのデータ属性が予め記録されたデータ属性と異なる場合は、ユーザの意図通りの情報を取得できなくなったと判断して、目的ページの画像データを含む領域再設定画面（例えば、領域再設定画面１２００）をディスプレイ３０５に表示することができる。 As a result, even if the data is searched, if the data attribute of the searched data is different from the pre-recorded data attribute, it is determined that the information as intended by the user cannot be obtained, and the image data of the target page is The area reset screen including the area reset screen (for example, the area reset screen 1200) can be displayed on the display 305.

また、情報処理装置１０１によれば、予め記録されたデータ属性と一致する場合には、タグに関する情報と対応付けて予め記録された一覧位置に、検索されたデータを挿入した一覧画面を出力することができる。これにより、複数のサイトＳの目的データを集約した一覧画面（例えば、一覧画面１０００）をディスプレイ３０５に表示することができる。 Further, according to the information processing apparatus 101, when the data attribute matches the pre-recorded data attribute, a list screen in which the searched data is inserted is output at the list position recorded in advance in association with the information related to the tag. be able to. Thereby, a list screen (for example, list screen 1000) in which target data of a plurality of sites S are aggregated can be displayed on the display 305.

なお、本実施の形態で説明したデータ特定方法は、予め用意されたプログラムをパーソナル・コンピュータやワークステーション等のコンピュータで実行することにより実現することができる。本データ特定プログラムは、ハードディスク、フレキシブルディスク、ＣＤ−ＲＯＭ、ＭＯ、ＤＶＤ等のコンピュータで読み取り可能な記録媒体に記録され、コンピュータによって記録媒体から読み出されることによって実行される。また、本データ特定プログラムは、インターネット等のネットワークを介して配布してもよい。 The data specifying method described in this embodiment can be realized by executing a program prepared in advance on a computer such as a personal computer or a workstation. The data specifying program is recorded on a computer-readable recording medium such as a hard disk, a flexible disk, a CD-ROM, an MO, and a DVD, and is executed by being read from the recording medium by the computer. The data specifying program may be distributed through a network such as the Internet.

上述した実施の形態に関し、さらに以下の付記を開示する。 The following additional notes are disclosed with respect to the embodiment described above.

（付記１）コンピュータに、
サイトの画面の画像データ上で選択を受け付けた範囲の画像データから得られるテキストデータと同一内容のテキストデータを、前記画面の画面情報から検索し、
前記画面の画面情報内の検索したテキストデータを異なるテキストデータに変更し、
変更後の前記画面の画面情報に基づく前記画面の画像データ上の、前記選択を受け付けた範囲と同一の範囲の画像データから得られるテキストデータが、前記異なるテキストデータと一致するか否かを判定することにより、前記画面の画面情報から前記選択を受け付けた範囲に対応するテキストデータを特定する、
処理を実行させることを特徴とするデータ特定プログラム。 (Supplementary note 1)
Search the screen information of the screen for text data having the same content as the text data obtained from the image data in the range of selection received on the screen data of the site,
Change the searched text data in the screen information of the screen to different text data,
Determining whether text data obtained from image data in the same range as the selected range on the screen image data based on the screen information after the change matches the different text data By specifying the text data corresponding to the range that received the selection from the screen information of the screen,
A data identification program for executing a process.

（付記２）前記変更する処理は、
複数のテキストデータが検索された場合に、前記画面の画面情報内の前記複数のテキストデータのいずれかのテキストデータを異なるテキストデータに変更し、
前記特定する処理は、
前記異なるテキストデータと一致する場合、前記画面の画面情報内の前記いずれかのテキストデータを、前記選択を受け付けた範囲に対応するテキストデータとして特定することを特徴とする付記１に記載のデータ特定プログラム。 (Supplementary note 2)
When a plurality of text data is searched, the text data of any of the plurality of text data in the screen information of the screen is changed to different text data,
The process to specify is
The data specification according to appendix 1, wherein if any of the different text data matches, the text data in the screen information of the screen is specified as text data corresponding to the range in which the selection is accepted. program.

（付記３）前記コンピュータに、
特定した前記選択を受け付けた範囲に対応するテキストデータにより特定される、前記画面の画面情報におけるタグに関する情報を、前記サイトの識別情報と対応付けて記録する処理を実行させることを特徴とする付記１または２に記載のデータ特定プログラム。 (Supplementary note 3)
A supplementary note that executes processing for recording information related to a tag in screen information of the screen specified by text data corresponding to a range in which the selected selection is received, in association with identification information of the site. The data identification program according to 1 or 2.

（付記４）前記コンピュータに、
記録した前記サイトの識別情報と前記サイトの画面の画面情報におけるタグに関する情報とに基づいて、取得した前記サイトの画面の画面情報から、前記タグに関する情報により特定されるデータを検索し、
前記データが検索されなかった場合に、取得した前記サイトの画面情報に基づく前記サイトの画像データを出力する、
処理を実行させることを特徴とする付記３に記載のデータ特定プログラム。 (Supplementary note 4)
Based on the recorded identification information of the site and information related to the tag in the screen information of the screen of the site, the data specified by the information related to the tag is searched from the acquired screen information of the screen of the site.
When the data is not searched, the image data of the site based on the acquired screen information of the site is output.
4. The data specifying program according to appendix 3, wherein the program is executed.

（付記５）前記コンピュータに、
前記データが検索された場合には、複数のサイトの情報を集約して表示する一覧画面における、前記タグに関する情報と対応付けて記録された位置に、検索した前記データを挿入した前記一覧画面を出力する処理を実行させることを特徴とする付記４に記載のデータ特定プログラム。 (Supplementary note 5)
When the data is searched, the list screen in which the searched data is inserted at the position recorded in association with the information on the tag in the list screen that aggregates and displays information of a plurality of sites is displayed. The data identification program according to appendix 4, wherein the output process is executed.

（付記６）コンピュータが、
サイトの画面の画像データ上で選択を受け付けた範囲の画像データから得られるテキストデータと同一内容のテキストデータを、前記画面の画面情報から検索し、
前記画面の画面情報内の検索したテキストデータを異なるテキストデータに変更し、
変更後の前記画面の画面情報に基づく前記画面の画像データ上の、前記選択を受け付けた範囲と同一の範囲の画像データから得られるテキストデータが、前記異なるテキストデータと一致するか否かを判定することにより、前記画面の画面情報から前記選択を受け付けた範囲に対応するテキストデータを特定する、
処理を実行することを特徴とするデータ特定方法。 (Appendix 6)
Search the screen information of the screen for text data having the same content as the text data obtained from the image data in the range of selection received on the screen data of the site,
Change the searched text data in the screen information of the screen to different text data,
Determining whether text data obtained from image data in the same range as the selected range on the screen image data based on the screen information after the change matches the different text data By specifying the text data corresponding to the range that received the selection from the screen information of the screen,
A data identification method characterized by executing processing.

（付記７）サイトの画面の画像データ上で選択を受け付けた範囲の画像データから得られるテキストデータと同一内容のテキストデータを、前記画面の画面情報から検索し、前記画面の画面情報内の検索したテキストデータを異なるテキストデータに変更し、変更後の前記画面の画面情報に基づく前記画面の画像データ上の、前記選択を受け付けた範囲と同一の範囲の画像データから得られるテキストデータが、前記異なるテキストデータと一致するか否かを判定することにより、前記画面の画面情報から前記選択を受け付けた範囲に対応するテキストデータを特定する制御部、
を有することを特徴とする情報処理装置。 (Supplementary note 7) Text data having the same content as the text data obtained from the image data in the range of selection received on the screen image data of the site is searched from the screen information of the screen, and the search within the screen information of the screen The text data obtained from the image data in the same range as the range on which the selection is received on the image data of the screen based on the screen information of the screen after the change is changed to different text data, A controller that identifies text data corresponding to a range in which the selection is accepted from the screen information of the screen by determining whether or not the text data matches different text data;
An information processing apparatus comprising:

（付記８）コンピュータに、
サイトの画面の画像データ上で選択を受け付けた範囲の画像データから得られるテキストデータと同一内容のテキストデータを、前記画面の画面情報から検索し、
前記画面の画面情報内の検索したテキストデータを異なるテキストデータに変更し、
変更後の前記画面の画面情報に基づく前記画面の画像データ上の、前記選択を受け付けた範囲と同一の範囲の画像データから得られるテキストデータが、前記異なるテキストデータと一致するか否かを判定することにより、前記画面の画面情報から前記選択を受け付けた範囲に対応するテキストデータを特定する、
処理を実行させるデータ特定プログラムを記録したことを特徴とする前記コンピュータに読み取り可能な記録媒体。 (Appendix 8)
Search the screen information of the screen for text data having the same content as the text data obtained from the image data in the range of selection received on the screen data of the site,
Change the searched text data in the screen information of the screen to different text data,
Determining whether text data obtained from image data in the same range as the selected range on the screen image data based on the screen information after the change matches the different text data By specifying the text data corresponding to the range that received the selection from the screen information of the screen,
A computer-readable recording medium in which a data specifying program for executing processing is recorded.

１０１情報処理装置
２００システム
２０１サーバ
２２０アカウントアグリゲーション情報ＤＢ
２３０サイト別目的データ属性ＤＢ
２４０一覧情報ＤＢ
１１０１受付部
１１０２取得部
１１０３登録部
１１０４表示制御部
１１０５認識部
１１０６検索部
１１０７変更部
１１０８判定部
１１０９特定部 101 Information processing apparatus 200 System 201 Server 220 Account aggregation information DB
230 Site-specific purpose data attribute DB
240 List information DB
DESCRIPTION OF SYMBOLS 1101 Reception part 1102 Acquisition part 1103 Registration part 1104 Display control part 1105 Recognition part 1106 Search part 1107 Change part 1108 Determination part 1109 Identification part

Claims

On the computer,
Search the screen information of the screen for text data having the same content as the text data obtained from the image data in the range of selection received on the screen data of the site,
Change the searched text data in the screen information of the screen to different text data,
Determining whether text data obtained from image data in the same range as the selected range on the screen image data based on the screen information after the change matches the different text data By specifying the text data corresponding to the range that received the selection from the screen information of the screen,
A data identification program for executing a process.

The process to change is
When a plurality of text data is searched, the text data of any of the plurality of text data in the screen information of the screen is changed to different text data,
The process to specify is
2. The data according to claim 1, wherein when the text data matches the different text data, the text data in the screen information of the screen is specified as text data corresponding to the range in which the selection is accepted. Specific program.

In the computer,
A process of recording information related to a tag in screen information of the screen specified by text data corresponding to a range in which the selected selection is received in association with identification information of the site is executed. Item 3. The data specifying program according to item 1 or 2.

In the computer,
Based on the recorded identification information of the site and information related to the tag in the screen information of the screen of the site, the data specified by the information related to the tag is searched from the acquired screen information of the screen of the site.
When the data is not searched, the image data of the site based on the acquired screen information of the site is output.
The data specifying program according to claim 3, wherein a process is executed.

In the computer,
When the data is searched, the list screen in which the searched data is inserted at the position recorded in association with the information on the tag in the list screen that aggregates and displays information of a plurality of sites is displayed. The data specifying program according to claim 4, wherein an output process is executed.

Computer
Search the screen information of the screen for text data having the same content as the text data obtained from the image data in the range of selection received on the screen data of the site,
Change the searched text data in the screen information of the screen to different text data,
Determining whether text data obtained from image data in the same range as the selected range on the screen image data based on the screen information after the change matches the different text data By specifying the text data corresponding to the range that received the selection from the screen information of the screen,
A data identification method characterized by executing processing.

Search the screen information of the screen for text data having the same contents as the text data obtained from the image data in the range of selection received on the image data of the screen of the site, and search for the searched text data in the screen information of the screen Change to different text data, text data obtained from image data in the same range as the selected range on the image data of the screen based on the screen information of the screen after the change, the different text data A control unit that identifies text data corresponding to a range in which the selection is accepted from the screen information of the screen by determining whether or not they match;
An information processing apparatus comprising: