JP2002108855A

JP2002108855A - Document processor

Info

Publication number: JP2002108855A
Application number: JP2000300484A
Authority: JP
Inventors: Noriyuki Osuga; 典之大須賀
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 2000-09-29
Filing date: 2000-09-29
Publication date: 2002-04-12

Abstract

PROBLEM TO BE SOLVED: To provide a document processor by which a character part in picture data of a document consisting of text data and picture data is recognized with high precision. SOLUTION: Text data is translated (S3) when text data exists in an HTML document (S2: YES), picture data is analyzed (S5) when picture data exists (S4: YES), a character is segmented (S7) when the character part of picture data exists (S6: YES) and, then, the character is recognized (S8). A recognition candidate is compared with the character in text data, the recognition candidate is re-arranged (S9), the re-arranged recognition candidate is translated (S10) and a translation result is synthesized with the graphic part of picture data (S11).

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、テキストデータと
画像データとを持つデータを解析して翻訳を行う文書処
理装置に関する。[0001] 1. Field of the Invention [0002] The present invention relates to a document processing apparatus for analyzing and translating data having text data and image data.

【０００２】[0002]

【従来の技術】従来、テキストデータと画像データとを
持つデータを解析して翻訳を行う文書処理装置におい
て、テキストデータと画像データの両方を含む文書を翻
訳する際には、テキストデータのみを翻訳して画像デー
タと合成して結果を表示する方法が知られている。例え
ば、図８に示すように、文書の左半分のＡ部分に「セミ
ナーのお知らせ、本日、１時より右会場でセミナーを行
います。」とテキストデータで記述され、また、文書の
右半分のＢ部分に、地図及び「セミナー会場」という文
字の画像からなる画像データが配置されている文書を翻
訳した場合には、図９に示すように、文書の左半分のＣ
部分に「Information of seminar Today there is semi
nar in right place from 1:00」と翻訳され、文書の右
半分のＤ部分には、画像データである地図及び「セミナ
ー会場」の文字がそのまま表示されることになる。2. Description of the Related Art Conventionally, in a document processing apparatus for analyzing and translating data having text data and image data, when translating a document including both text data and image data, only the text data is translated. A method of displaying the result by combining the image data with image data is known. For example, as shown in FIG. 8, in the A portion of the left half of the document, "Seminar announcement, today, a seminar will be held at the right venue from 1 o'clock" is described in text data, and the right half of the document is described. When a document in which image data including a map and an image of a character “seminar venue” is arranged in part B is translated, as shown in FIG.
In the part, `` Information of seminar Today there is semi
The translation is "nar in right place from 1:00", and the map, which is image data, and the characters "seminar venue" are displayed as they are in the D portion of the right half of the document.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、上記の
方法では、画像データ中の文字部分である「セミナー会
場」の文字が翻訳されないために、図９に示すように日
本語と英語が混合した文書になってしまうという問題点
があった。また、特開平１０−１５４１４５号公報に記
載されているように、画像データ中の文字部分を抽出し
て文字認識を行い、翻訳する技術も提案されているが、
必ずしも、文字認識の精度が満足いかないという問題点
があった。However, in the above-mentioned method, since the characters of the "seminar venue" which is the character portion in the image data are not translated, a document in which Japanese and English are mixed as shown in FIG. There was a problem of becoming. Further, as described in Japanese Patent Application Laid-Open No. H10-154145, a technique of extracting a character portion in image data, performing character recognition, and translating the image portion has been proposed.
There was a problem that the accuracy of character recognition was not always satisfactory.

【０００４】本発明は、上記課題を解決するためになさ
れたものであり、テキストデータと画像データとからな
る文書を解析して文字認識を行って、その結果の翻訳を
行う文書処理装置において、画像データ中の文字部分の
文字認識精度の高い文書処理装置を提供することを目的
とする。SUMMARY OF THE INVENTION The present invention has been made to solve the above-mentioned problem. In a document processing apparatus which analyzes a document including text data and image data, performs character recognition, and translates the result, An object of the present invention is to provide a document processing apparatus having high character recognition accuracy for character portions in image data.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するため
の請求項１に係る発明の文書処理装置は、入力されたテ
キストデータとそのテキストデータに付随する画像デー
タとを記憶する記憶手段と、前記記憶手段に記憶された
画像データから画像部分と文字部分とを分離する分離手
段と、前記分離手段が分離した文字部分を文字認識を行
って文字コードに変換する文字認識手段と、前記文字認
識手段の認識結果の候補と前記記憶手段に記憶されたテ
キストデータ内の文字列とを比較し、前記認識結果の候
補と前記テキストデータ内の文字列の一致を検出する候
補比較手段と、前記候補比較手段の比較結果に基づい
て、前記文字認識手段の認識結果の候補を並べ替える候
補並べ替え手段とを備えたことを特徴とする構成となっ
ている。According to a first aspect of the present invention, there is provided a document processing apparatus for storing input text data and image data accompanying the text data. Separating means for separating an image part and a character part from the image data stored in the storage means, character recognition means for performing character recognition on the character part separated by the separating means and converting the character part into a character code; Candidate comparing means for comparing a candidate of the recognition result of the means with a character string in the text data stored in the storage means, and detecting a match between the candidate of the recognition result and the character string in the text data; And a candidate rearranging unit for rearranging the candidates of the recognition result of the character recognizing unit based on the comparison result of the comparing unit.

【０００６】この構成の文書処理装置では、記憶手段は
入力されたテキストデータとそのテキストデータに付随
する画像データとを記憶し、分離手段は記憶手段に記憶
された画像データから画像部分と文字部分とを分離し、
文字認識手段は分離手段が分離した文字部分を文字認識
を行って文字コードに変換し、候補比較手段が、文字認
識手段の認識結果の候補と記憶手段に記憶されたテキス
トデータ内の文字列とを比較し、認識結果の候補とテキ
ストデータ内の文字列の一致を検出すると、候補並べ替
え手段は候補比較手段の比較結果に基づいて文字認識手
段の認識結果の候補を並べ替える。In the document processing apparatus having this configuration, the storage means stores the input text data and the image data accompanying the text data, and the separation means stores the image part and the character part from the image data stored in the storage means. And separate
The character recognizing means performs character recognition on the character portion separated by the separating means and converts it into a character code.The candidate comparing means compares the character string in the text data stored in the storage means with the candidate of the recognition result of the character recognizing means. Are compared with each other, and when a match between the candidate of the recognition result and the character string in the text data is detected, the candidate sorting unit sorts the candidates of the recognition result of the character recognition unit based on the comparison result of the candidate comparison unit.

【０００７】また、請求項２に係る発明の文書処理装置
では、請求項１に記載の文書処理装置の構成に加えて、
前記候補比較手段は、前記文字認識手段が認識した複数
の候補の中の文字の組み合わせと前記記憶手段に記憶さ
れたテキストデータ内の文字列とを比較し、前記候補比
較手段が前記複数の候補の中の文字の組み合わせと前記
テキストデータ内の連続した複数の文字列が一致するこ
とを検出した場合に、前記候補並べ替え手段は、当該連
続した複数の文字列を優先候補として前記文字認識手段
の認識結果の候補を並べ替えることを特徴とする構成と
なっている。According to a second aspect of the present invention, in addition to the configuration of the first aspect,
The candidate comparing unit compares a combination of characters in a plurality of candidates recognized by the character recognizing unit with a character string in text data stored in the storage unit, and the candidate comparing unit compares the plurality of candidates. When it is detected that a combination of characters in the text data matches a plurality of continuous character strings in the text data, the candidate reordering unit sets the plurality of continuous character strings as priority candidates and sets the character recognition unit Are sorted.

【０００８】この構成の文書処理装置では、請求項１に
記載の文書処理装置の作用に加えて、候補比較手段は、
文字認識手段が認識した複数の候補の中の文字の組み合
わせと記憶手段に記憶されたテキストデータ内の文字列
とを比較し、候補比較手段が複数の候補の中の文字の組
み合わせとテキストデータ内の連続した複数の文字列が
一致することを検出した場合に、候補並べ替え手段は、
当該連続した複数の文字列を優先候補として文字認識手
段の認識結果の候補を並べ替える。[0008] In the document processing apparatus having this configuration, in addition to the operation of the document processing apparatus according to the first aspect, the candidate comparing means includes:
The combination of characters in the plurality of candidates recognized by the character recognition unit is compared with the character string in the text data stored in the storage unit, and the candidate comparison unit compares the combination of characters in the plurality of candidates with the character string in the text data. When it is detected that a plurality of consecutive character strings match, the candidate sorting unit
The candidates of the recognition result of the character recognition means are rearranged with the plurality of continuous character strings as priority candidates.

【０００９】また、請求項３に係る発明の文書処理装置
では、請求項１又は２に記載の文書処理装置の構成に加
えて、前記候補並べ替え手段により並べ替えられた認識
結果の候補の文字列を当該文字列の言語とは異なる他の
言語の文字列に翻訳する翻訳手段を備えたことを特徴と
する構成となっている。According to a third aspect of the present invention, in addition to the configuration of the first or second aspect, in addition to the configuration of the document processing apparatus, the candidate character of the recognition result rearranged by the candidate rearranging unit is provided. A translation unit is provided for translating the string into a character string in another language different from the language of the character string.

【００１０】この構成の文書処理装置では、請求項１又
は２に記載の文書処理装置の作用に加えて、翻訳手段
は、候補並べ替え手段により並べ替えられた認識結果の
候補の文字列を当該文字列の言語とは異なる他の言語の
文字列に翻訳する。[0010] In the document processing apparatus of this configuration, in addition to the operation of the document processing apparatus according to claim 1 or 2, the translating means converts the character string of the recognition result candidate rearranged by the candidate rearranging means. Translate to a string in another language different from the language of the string.

【００１１】また、請求項４に係る発明の文書処理装置
では、請求項１乃至３の何れかに記載の文書処理装置の
構成に加えて、前記翻訳手段の翻訳結果と前記分離手段
により分離された画像部分とを合成した合成画像データ
を作成する文書合成手段を備えたことを特徴とする構成
となっている。According to a fourth aspect of the present invention, in addition to the configuration of the document processing apparatus according to any one of the first to third aspects, the translation result of the translation unit is separated from the translation result by the separation unit. And a document synthesizing means for generating synthesized image data obtained by synthesizing the image portion.

【００１２】この構成の文書処理装置では、請求項１乃
至３の何れかに記載の文書処理装置の作用に加えて、文
書合成手段は前記翻訳手段の翻訳結果と分離手段により
分離された画像部分とを合成した合成画像データを作成
する。[0012] In the document processing apparatus having this configuration, in addition to the operation of the document processing apparatus according to any one of claims 1 to 3, the document synthesizing means may include a translation result of the translation means and an image portion separated by the separation means. To generate composite image data.

【００１３】また、請求項５に係る発明の文書処理装置
では、請求項１乃至４の何れかに記載の文書処理装置の
構成に加えて、前記文書合成手段が合成した合成画像デ
ータを表示する表示手段を備えたことを特徴とする構成
となっている。According to a fifth aspect of the present invention, in addition to the configuration of the document processing apparatus according to any one of the first to fourth aspects, the combined image data combined by the document combining means is displayed. The display device is provided with a display means.

【００１４】この構成の文書処理装置では、請求項１乃
至４の何れかに記載の文書処理装置の作用に加えて、表
示手段は、文書合成手段が合成した合成画像データを表
示する。In the document processing apparatus having this configuration, in addition to the operation of the document processing apparatus according to any one of claims 1 to 4, the display means displays the combined image data combined by the document combining means.

【００１５】また、請求項６に係る発明の文書処理装置
では、請求項１乃至５の何れかに記載の文書処理装置の
構成に加えて、前記入力されたテキストデータとそのテ
キストデータに付随する画像データとがＨＴＭＬで記述
されていることを特徴とする構成となっている。According to a sixth aspect of the invention, in addition to the configuration of the document processing apparatus according to any one of the first to fifth aspects, the input text data and the text data are attached. The image data is described in HTML.

【００１６】この構成の文書処理装置では、請求項１乃
至５の何れかに記載の文書処理装置の作用に加えて、入
力されたテキストデータとそのテキストデータに付随す
る画像データとがＨＴＭＬで記述されているので、文書
の分析及び合成が容易にできる。In the document processing apparatus having this configuration, in addition to the operation of the document processing apparatus according to any one of claims 1 to 5, input text data and image data accompanying the text data are described in HTML. This facilitates the analysis and synthesis of documents.

【００１７】[0017]

【発明の実施の形態】以下、本発明の一実施の形態につ
いて図面を参照して説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be described below with reference to the drawings.

【００１８】図１は、文書処理装置１の装置の概略構成
を示すブロック図であり、図２は、文書処理装置１の主
制御のフローチャートであり、図３は、文書処理装置１
の認識候補とテキストデータの比較処理及び候補並び替
えのサブルーチンのフローチャートである。FIG. 1 is a block diagram showing a schematic configuration of the document processing apparatus 1, FIG. 2 is a flowchart of main control of the document processing apparatus 1, and FIG.
5 is a flowchart of a subroutine for comparing a recognition candidate with text data and rearranging candidates.

【００１９】図１に示すように、本発明の第一の実施の
形態の文書処理装置１は、文書処理装置１の制御を司る
ＣＰＵ２に、ＣＰＵ２で実行されるプログラム等を記憶
したＲＯＭ７と、データを一時的に記憶するＲＡＭ８
と、ＣＰＵ２へのデータの入出力を仲介するＩ／Ｏイン
ターフェース３とが接続されている。また、Ｉ／Ｏイン
ターフェース３には、文字を入力するキーボード４と、
マウス５と、ＬＣＤ又はＣＲＴから構成された表示装置
６と、ＣＤ−ＲＯＭドライブ９と、外部のネットワーク
に接続するためのＬＡＮボード１０と、大容量のデータ
を記憶するハードディスクドライブ１２とが接続されて
いる。また、ＬＡＮボード１０は、ケーブル１１を介し
て、図示外のルータに接続され、図示外のルータは、イ
ンターネットに接続されている。As shown in FIG. 1, a document processing apparatus 1 according to a first embodiment of the present invention has a CPU 2 for controlling the document processing apparatus 1, a ROM 7 storing programs executed by the CPU 2, and the like. RAM 8 for temporarily storing data
And an I / O interface 3 that mediates input / output of data to / from the CPU 2. The I / O interface 3 has a keyboard 4 for inputting characters,
A mouse 5, a display device 6 composed of an LCD or CRT, a CD-ROM drive 9, a LAN board 10 for connecting to an external network, and a hard disk drive 12 for storing a large amount of data are connected. ing. The LAN board 10 is connected to a router (not shown) via a cable 11, and the router (not shown) is connected to the Internet.

【００２０】次に、ＲＯＭ７の記憶エリアについて説明
する。ＲＯＭ７には、図１に示すように、ＨＴＭＬファ
イル解析プログラム記憶エリア７ａと、レイアウト解析
プログラム記憶エリア７ｂと、文字切り出しプログラム
記憶エリア７ｃと、文字認識プログラム記憶エリア７
ｄ、文字認識辞書記憶エリア７ｅと、候補比較プログラ
ム記憶エリア７ｆと、候補並べ替えプログラム記憶エリ
ア７ｇと、翻訳プログラム記憶エリア７ｈと、ＨＴＭＬ
ファイル合成プログラム記憶エリア７ｉとが設けられて
いる。Next, the storage area of the ROM 7 will be described. As shown in FIG. 1, the ROM 7 stores an HTML file analysis program storage area 7a, a layout analysis program storage area 7b, a character cutout program storage area 7c, and a character recognition program storage area 7.
d, character recognition dictionary storage area 7e, candidate comparison program storage area 7f, candidate rearrangement program storage area 7g, translation program storage area 7h, HTML
A file synthesizing program storage area 7i is provided.

【００２１】尚、ＨＴＭＬファイル解析プログラム記憶
エリア７ａには、ＨＴＭＬで記述されたファイルを解析
して、テキストデータと画像データとを抽出する処理を
ＣＰＵ２に行わせるＨＴＭＬファイル解析プログラムが
記憶されている。また、レイアウト解析プログラム記憶
エリア７ｂには、抽出された画像データをレイアウト解
析し、図のブロック、文字のブロック、表のブロック等
の解析する処理をＣＰＵ２に行わせるレイアウト解析プ
ログラムが記憶されている。The HTML file analysis program storage area 7a stores an HTML file analysis program that causes the CPU 2 to analyze a file described in HTML and extract text data and image data. . The layout analysis program storage area 7b stores a layout analysis program that causes the CPU 2 to perform a layout analysis of the extracted image data and analyze a block of a figure, a block of a character, a block of a table, and the like. .

【００２２】さらに、文字切り出しプログラム記憶エリ
ア７ｃには、抽出された画像データの文字のブロックか
ら文字を出す処理をＣＰＵ２に行わせる文字切り出しプ
ログラムが記憶されている。また、文字認識プログラム
記憶エリア７ｄには、文字認識を行う処理をＣＰＵ２に
行わせる文字認識プログラムが記憶されている。また、
文字認識辞書記憶エリア７ｅには、日本語の文字認識時
に参照される文字認識辞書が記憶されている。さらに、
候補比較プログラム記憶エリア７ｆには、文字認識結果
の候補文字とテキストデータの比較を行う候補比較処理
をＣＰＵ２に行わせる候補比較プログラムが記憶されて
いる。Further, a character extracting program storage area 7c stores a character extracting program for causing the CPU 2 to perform a process of extracting characters from a character block of the extracted image data. The character recognition program storage area 7d stores a character recognition program for causing the CPU 2 to perform character recognition processing. Also,
The character recognition dictionary storage area 7e stores a character recognition dictionary that is referred to when recognizing Japanese characters. further,
The candidate comparison program storage area 7f stores a candidate comparison program that causes the CPU 2 to perform a candidate comparison process of comparing text data with candidate characters of the character recognition result.

【００２３】候補並べ替えプログラム記憶エリア７ｇに
は、候補比較処理の結果に基づいて、文字認識結果の候
補文字の並び替えを行う処理をＣＰＵ２に行わせる候補
並べ替えプログラムが記憶されている。また、翻訳プロ
グラム記憶エリア７ｈには、日本語から英語又は英語か
ら日本語等の翻訳処理をＣＰＵ２に行わせる翻訳プログ
ラムが記憶されている。さらに、ＨＴＭＬファイル合成
プログラム記憶エリア７ｉには、テキストデータの翻訳
結果と、画像データ中の文字部分の翻訳された結果と、
画像データ中の文字部分以外の部分を合成する処理をＣ
ＰＵ２に行わせるＨＴＭＬファイル合成プログラムが記
憶されている。A candidate rearrangement program storage area 7g stores a candidate rearrangement program for causing the CPU 2 to perform a process of rearranging the candidate characters of the character recognition result based on the result of the candidate comparison process. The translation program storage area 7h stores a translation program that causes the CPU 2 to perform a translation process from Japanese to English or from English to Japanese. Further, in the HTML file synthesizing program storage area 7i, the translation result of the text data, the translation result of the character portion in the image data,
The process of combining parts other than the character part in the image data
An HTML file synthesizing program to be executed by PU2 is stored.

【００２４】次に、ＲＡＭ８の記憶エリアについて説明
する。ＲＡＭ８には、図１に示すように、ＨＴＭＬで記
述されたファイルから抽出したテキストデータを記憶す
るテキストデータ記憶エリア８ａと、ＨＴＭＬで記述さ
れたファイルから抽出した画像データ記憶する画像デー
タ記憶エリア８ｂと、画像データの解析結果を記憶する
解析結果記憶エリア８ｃと、翻訳結果を記憶する翻訳結
果記憶エリア８ｄと、画像データ中の文字部分からの文
字の切り出し結果を記憶する文字切り出し結果記憶エリ
ア８ｅ、文字認識の結果を記憶する認識結果記憶エリア
８ｆと、画像データ中の文字部分の翻訳された結果と画
像データ中の文字部分以外の部分との合成結果を記憶す
る合成結果記憶エリア８ｇと、ＣＰＵ２が行う各種の処
理でデータを記憶するワーキングエリア８ｈとが設けら
れている。Next, the storage area of the RAM 8 will be described. As shown in FIG. 1, the RAM 8 has a text data storage area 8a for storing text data extracted from a file described in HTML, and an image data storage area 8b for storing image data extracted from a file described in HTML. An analysis result storage area 8c for storing analysis results of image data, a translation result storage area 8d for storing translation results, and a character extraction result storage area 8e for storing character extraction results from character portions in image data. A recognition result storage area 8f for storing a result of character recognition, a synthesis result storage area 8g for storing a synthesis result of a translated result of a character portion in the image data and a portion other than the character portion in the image data, A working area 8h for storing data in various processes performed by the CPU 2 is provided.

【００２５】以上のように構成された文書処理装置１の
動作について、図２乃至図８を参照して説明する。図２
は、文書処理装置１が行う主制御のフローチャートであ
り、図３は、認識候補とテキストデータの比較処理及び
候補並び替えのサブルーチンのフローチャートであり、
図４及び図５は、候補並べ替え処理の変形例のフローチ
ャートであり、図６は、文字認識された候補の文字の例
示であり、図７は、翻訳結果が表示装置６のブラウザ画
面上に表示された例を示す図であり、図８は、テキスト
データと画像データとを有するＨＴＭＬファイルが、表
示装置６のブラウザ画面上に表示された例を示す図であ
る。The operation of the document processing apparatus 1 configured as described above will be described with reference to FIGS. FIG.
FIG. 3 is a flowchart of main control performed by the document processing apparatus 1, and FIG. 3 is a flowchart of a subroutine for comparing recognition candidates and text data and rearranging candidates.
4 and 5 are flowcharts of a modification of the candidate rearrangement process. FIG. 6 is an example of candidate characters recognized as characters. FIG. 7 is a diagram showing the translation result displayed on the browser screen of the display device 6. 8 is a diagram illustrating an example in which an HTML file including text data and image data is displayed on a browser screen of the display device 6. FIG.

【００２６】以下、インターネットのホームページを日
本語から英語へ翻訳する場合を例にとって説明する。イ
ンターネットのホームページ上に、図８に示すように、
文書の左半分のＡ部分に「セミナーのお知らせ、本日、
１時より右会場でセミナーを行います。」とテキストデ
ータで記述され、また、文書の右半分のＢ部分に、地図
及び「セミナー会場」という文字の画像からなる画像デ
ータが配置されている画面を表示するＨＴＭＬファイル
があるとする。このＨＴＭＬファイルでは、「「セミナ
ーのお知らせ、本日、１時より右会場でセミナーを行い
ます。」が日本語のテキストデータで記述され、地図及
び「セミナー会場」という文字は、一体の画像データか
ら構成されているとする。Hereinafter, a case where the homepage of the Internet is translated from Japanese to English will be described as an example. On the Internet homepage, as shown in Figure 8,
In the A part of the left half of the document, "Seminar announcement, today,
A seminar will be held at the right venue from 1:00. And an HTML file that displays a screen in which a map and image data composed of images of characters "seminar venue" are arranged in the right half B of the document. In this HTML file, "Seminar announcement, a seminar will be held at the right venue from 1:00 today." Will be described in Japanese text data, and the map and the text "Seminar venue" will be converted from the integrated image data. It is assumed that it is configured.

【００２７】文書処理装置１では、そのインターネット
のホームページ上のＨＴＭＬファイルを、図示外のルー
タを介して、ケーブル１１及びＬＡＮボード１０を経
て、取り込みＲＡＭ８のワーキングエリア８ｈに記憶す
る。すると、表示装置６のブラウザ画面上には、図８に
示すように、画面の左半分のＡ部分に「セミナーのお知
らせ、本日、１時より右会場でセミナーを行います。」
とテキストデータで記述され、また、画面の右半分のＢ
部分に、地図及び「セミナー会場」という文字の画像か
らなる画像データの表示がなされる。The document processing apparatus 1 captures an HTML file on the Internet homepage via a router (not shown) via a cable 11 and a LAN board 10 and stores it in a working area 8h of a RAM 8. Then, on the browser screen of the display device 6, as shown in FIG. 8, “Announcement of a seminar, a seminar will be held at the right venue from 1:00 today.”
And text data, and B in the right half of the screen
In the portion, image data including a map and an image of characters “seminar venue” is displayed.

【００２８】ここで、使用者が、マウス５を用いて、表
示装置６に表示されている。図示外の翻訳ボタンをクリ
ックすると、図８に示す表示画面のソースコードであ
り、ＲＡＭ８のワーキングエリア８ｈに記憶されている
ＨＴＭＬファイルを解析し、そのＨＴＭＬファイルから
テキストデータと画像データとを抽出し、テキストデー
タをＲＡＭ８のテキストデータ記憶エリア８ａに記憶
し、画像データをＲＡＭ８の画像データ記憶エリア８ｂ
に記憶する（Ｓ１）。ここでは、「セミナーのお知ら
せ、本日、１時より右会場でセミナーを行います。」
が、テキストデータとして抽出され、地図及び「セミナ
ー会場」という文字は、一体の画像データとして抽出さ
れる。Here, the user is displayed on the display device 6 using the mouse 5. When a translation button (not shown) is clicked, an HTML file, which is the source code of the display screen shown in FIG. 8 and stored in the working area 8h of the RAM 8, is analyzed, and text data and image data are extracted from the HTML file. The text data is stored in the text data storage area 8a of the RAM 8, and the image data is stored in the image data storage area 8b of the RAM 8.
(S1). Here, "Seminar announcement, seminar will be held at the right venue from 1:00 today."
Are extracted as text data, and the map and the characters “seminar venue” are extracted as integrated image data.

【００２９】次に、Ｓ２の判断処理で、ＨＴＭＬファイ
ルから抽出されたテキストデータが有るか否かを判断す
る。８に示す表示画面では、「セミナーのお知らせ、本
日、１時より右会場でセミナーを行います。」の部分
が、テキストデータとして抽出されているので（Ｓ２：
ＹＥＳ）、周知の機械翻訳法により、当該テキストデー
タを英語に翻訳する（Ｓ３）。すると、「Information
of seminar Today thereis seminar in right place fr
om 1:00」と翻訳され、ＲＡＭ８の翻訳結果記憶エリア
８ｄに記憶される（Ｓ３）。Next, in the determination process of S2, it is determined whether or not there is text data extracted from the HTML file. In the display screen shown in Fig. 8, the part of "Seminar announcement, a seminar will be held at the right venue from 1:00 today" is extracted as text data (S2:
YES), the text data is translated into English by a well-known machine translation method (S3). Then, "Information
of seminar Today thereis seminar in right place fr
om 1:00 "and stored in the translation result storage area 8d of the RAM 8 (S3).

【００３０】次いで、ＨＴＭＬファイルから抽出された
画像データが有るか否かを判断する（Ｓ４）。図８に示
す表示画面では、地図及び「セミナー会場」という文字
は、一体の画像データとして抽出されているので（Ｓ
４：ＹＥＳ）、当該画像データの解析を行う（Ｓ５）。
具体的には、抽出された画像データをレイアウト解析
し、図のブロック、文字のブロック、表のブロック等を
抽出して、解析結果記憶エリア８ｃに記憶する処理を行
う。この解析処理は、レイアウト解析の周知技術を用い
ることができ、例えば、画像データの射影を求めて、そ
の射影の特徴により、図のブロック、文字のブロック、
表のブロック等の解析を行う（Ｓ５）。Next, it is determined whether there is image data extracted from the HTML file (S4). In the display screen shown in FIG. 8, since the map and the characters “seminar venue” are extracted as one piece of image data, (S
4: YES), the image data is analyzed (S5).
More specifically, a layout analysis is performed on the extracted image data to extract a block of a figure, a block of a character, a block of a table, and the like, and store the extracted block in the analysis result storage area 8c. This analysis process can use a well-known technique of layout analysis. For example, a projection of image data is obtained, and a diagram block, a text block,
Analysis of blocks in the table is performed (S5).

【００３１】例えば、文字のブロックには、射影に特徴
のある山谷が生じるが、図のブロックでは、射影に上記
の文字のような特徴が生じないので文字のブロックと図
のブロックは容易に識別できる。図８に示す例では、画
像データ中に文字部（文字のブロック）である「セミナ
ー会場」が存在するので、「セミナー会場」を文字部
（文字のブロック）として、又、地図を図のブロックと
して、解析結果記憶エリア８ｃに記憶する。For example, in a block of characters, peaks and valleys having a characteristic of projection are generated, but in the block of the figure, the character block is not easily distinguished from the block of the figure since the projection does not have the characteristic like the above character. it can. In the example shown in FIG. 8, the “seminar venue” which is a character part (character block) is present in the image data, so the “seminar venue” is a character part (character block) and the map is a block of the figure. Is stored in the analysis result storage area 8c.

【００３２】図８に示す例では、画像データ中に文字部
（文字のブロック）である「セミナー会場」が存在する
ので（Ｓ６：ＹＥＳ）、次に、文字切り出し処理を行う
（Ｓ７）。この文字切り出し処理では、まず、解析結果
記憶エリア８ｃに記憶された文字のブロックである「セ
ミナー会場」の画像データから１文字ずつ文字の切り出
しを行う。この文字の切り出し法は、ＯＣＲの公知の技
術を使用できる。例えば、文字のブロックが横書きであ
れば、まず、横方向に画像を走査し、黒いドットの数を
カウントし、黒いドットの総数が定期的に山と谷を繰り
返すようであれば、その谷の部分が行の切れ目と判断で
きる。その後、１行に対し、縦方向に画像を走査し、同
じように黒いドットをカウントし、黒いドットの総数が
少なくなるところが文字の切れ目と判断して、一文字ず
つ文字を切り出し、文字切り出し結果記憶エリア８ｅに
記憶する（Ｓ７）。In the example shown in FIG. 8, since there is a "seminar venue" which is a character portion (a block of characters) in the image data (S6: YES), a character cutout process is performed (S7). In this character extraction processing, first, characters are extracted one by one from the image data of the “seminar venue” which is a block of characters stored in the analysis result storage area 8c. This character can be cut out by using a known technique of OCR. For example, if a block of characters is written horizontally, first scan the image in the horizontal direction, count the number of black dots, and if the total number of black dots periodically repeats peaks and valleys, The part can be determined as a line break. Thereafter, the image is scanned in the vertical direction for one line, black dots are counted in the same manner, and a place where the total number of black dots is reduced is determined to be a character break, and characters are cut out one by one, and the character cutout result storage is performed. It is stored in the area 8e (S7).

【００３３】次いで、文字認識処理（Ｓ８）では、文字
切り出し結果記憶エリア８ｅに記憶されている切り出さ
れた一文字の文字毎に、文字認識辞書記憶エリア７ｅに
記憶されている文字認識辞書を参照して、オフライン認
識等の周知の文字認識法により、文字の認識を行い、認
識結果を認識結果記憶エリア８ｆに記憶する（Ｓ８）。Next, in the character recognition process (S8), the character recognition dictionary stored in the character recognition dictionary storage area 7e is referred to for each character of the extracted one character stored in the character extraction result storage area 8e. Then, the character is recognized by a well-known character recognition method such as off-line recognition, and the recognition result is stored in the recognition result storage area 8f (S8).

【００３４】上記の文字認識処理（Ｓ８）により、文字
のブロックである「セミナー会場」の画像データの文字
認識が完了すると、例えば、図６に示すように、第１候
補として「ヒミ＋ー会場」、第２候補として「セ三ナ＋
合湯」、第３候補として「ゼシフ÷全易」が認識され、
ＲＡＭ８の認識結果記憶エリア８ｆに記憶される。しか
し、これらの認識結果は、何れも正しい認識結果でな
い。When the character recognition of the image data of the "seminar venue" which is a block of characters is completed by the above-described character recognition processing (S8), for example, as shown in FIG. ”As the second candidate,
Goyu "and" Zesif @ Zenyaku "are recognized as the third candidate,
It is stored in the recognition result storage area 8f of the RAM 8. However, none of these recognition results is a correct recognition result.

【００３５】次に、ＲＡＭ８の認識結果記憶エリア８ｆ
に記憶された認識候補と、テキストデータ記憶エリア８
ａに記憶されたテキストデータの比較及び候補の並び替
えをを行う（Ｓ９）。このＳ９の処理について、図３に
示すサブルーチンのフローチャートを参照して説明す
る。この図３に示すサブルーチンの処理では、認識候補
の組み合わせとテキストデータ内の文字列が連続して２
文字以上一致した場合に、その一致した候補を優先して
並び替えるようにするものである。Next, the recognition result storage area 8f of the RAM 8
Candidate and the text data storage area 8
The comparison of the text data stored in a and the rearrangement of the candidates are performed (S9). The process of S9 will be described with reference to a flowchart of a subroutine shown in FIG. In the processing of the subroutine shown in FIG. 3, the combination of the recognition candidates and the character string
When more than characters match, the matching candidates are sorted in priority.

【００３６】以下の説明では、何文字めの認識結果に注
目しているかを示すカウンタｉの値と、一致した文字数
を示すカウンタｊの値は、ＲＡＭ８のワーキングエリア
８ｈに記憶されるものとする。In the following description, it is assumed that the value of the counter i indicating the number of the recognition result of the character and the value of the counter j indicating the number of matching characters are stored in the working area 8h of the RAM 8. .

【００３７】まず、カウンタｉ＝１として（Ｓ２１）、
１文字めの認識候補が認識結果記憶エリア８ｆに記憶さ
れているか否かを判断する（Ｓ２２）。ここでは、認識
結果記憶エリア８ｆに第１候補「ヒミ＋ー会場」、第２
候補「セ三ナ＋合湯」、第３候補「ゼシフ÷全易」が記
憶されており、各々、１文字めの認識候補が「ヒ」、
「セ」、「ゼ」が存在するので（Ｓ２２：ＹＥＳ）、そ
の第１候補「ヒミ＋ー会場」、第２候補「セ三ナ＋合
湯」、第３候補「ゼシフ÷全易」の各々の１文字めの認
識候補「ヒ」、「セ」、「ゼ」がテキストデータ内にあ
るかどうか検索する（Ｓ２３）。First, assuming that the counter i = 1 (S21),
It is determined whether or not the first character recognition candidate is stored in the recognition result storage area 8f (S22). Here, the first candidate “Himi + − venue” and the second candidate are stored in the recognition result storage area 8f.
The candidate "Sesana + Goyu" and the third candidate "Zesif @ Zenyaku" are stored, and the first character recognition candidate is "H",
Since "se" and "ze" are present (S22: YES), the first candidate "Himi + -Venue", the second candidate "Sesana + goyu", and the third candidate "Zesif @ Zenyaku" A search is made to determine whether the first character recognition candidates "hi", "se", and "ze" are present in the text data (S23).

【００３８】テキストデータ記憶エリア８ａに記憶され
ているテキストデータ「セミナーのお知らせ、本日、１
時より右会場でセミナーを行います。」内に、第２候補
の「セ三ナ＋合湯」の１文字めの「セ」と同じ文字であ
る「セ」があるので（Ｓ２３：ＹＥＳ）、ワーキングエ
リア８ｈに並び替え候補の１文字目として「セ」を記憶
する（Ｓ２４）。尚、テキストデータ「セミナーのお知
らせ、本日、１時より右会場でセミナーを行います。」
内の検索では、「セ」は、２つ見つかるが、最初に出て
きた文字を採用するものとする。また、Ｓ２３の判断で
否と判断された場合には（Ｓ２３：ＮＯ）、ｉの値を
「１」増加し（Ｓ３２）、Ｓ２２に戻る。The text data "seminar notice, today, 1
We will hold a seminar at the right venue from time to time. , There is "se" which is the same character as the first character "se" of the second candidate "sesana + goyu" (S23: YES), so that one of the rearrangement candidates is placed in the working area 8h. "C" is stored as the character (S24). In addition, the text data "Seminar announcement, seminar will be held at the right venue from 1:00 today."
In the search in, two "se" are found, but the character that first appears is adopted. When it is determined that the answer is NO in S23 (S23: NO), the value of i is increased by "1" (S32), and the process returns to S22.

【００３９】次に、カウンタｊ＝１として（Ｓ２５）、
テキストデータ記憶エリア８ａに記憶されているテキス
トデータ「セミナーのお知らせ、本日、１時より右会場
でセミナーを行います。」中で、Ｓ２３の処理で見つか
った「セ」の文字の次の文字である「ミ」が第１候補
「ヒミ＋ー会場」、第２候補「セ三ナ＋合湯」、第３候
補「ゼシフ÷全易」の（ｉ＋ｊ）文字め、即ち、２文字
めに存在するか否かを判断する（Ｓ２６）。ここでは、
第１候補「ヒミ＋ー会場」の２文字めに、「ミ」が存在
するので（Ｓ２６：ＹＥＳ）、ワーキングエリア８ｈに
並び替え候補の２文字目として「ミ」を記憶する（Ｓ２
７）。Next, assuming that the counter j = 1 (S25),
In the text data stored in the text data storage area 8a, "Seminar announcement, today we will hold a seminar at the right venue from 1:00." A certain “mi” exists in the (i + j) character of the first candidate “Himi + —venue”, the second candidate “Sesana + goyu”, and the third candidate “zesif @ zeny”, that is, the second character. It is determined whether or not to perform (S26). here,
Since "mi" is present in the second character of the first candidate "Himi +-venue" (S26: YES), "mi" is stored in the working area 8h as the second character of the sorting candidate (S2).
7).

【００４０】次いで、カウンタｊの値を「１」増加して
「ｊ＝２」とし（Ｓ２８）、テキストデータ内の「セ
ミ」の文字の次の文字である「ナ」が第１候補「ヒミ＋
ー会場」、第２候補「セ三ナ＋合湯」、第３候補「ゼシ
フ÷全易」の（ｉ＋ｊ）文字め、即ち、３文字めに存在
するか否かを判断する（Ｓ２６）。ここでは、第２候補
「セ三ナ＋合湯」の３文字めに、「ナ」が存在するので
（Ｓ２６：ＹＥＳ）、ワーキングエリア８ｈに並び替え
候補の３文字目として「ナ」を記憶する（Ｓ２７）。Next, the value of the counter j is increased by "1" to "j = 2" (S28), and "na", which is the character next to the "semi" character in the text data, is the first candidate "Himi". +
It is determined whether or not the third character is present in the (i + j) -th character of the third candidate “Zesif @ Zen-y”, the second candidate “Sesena + goyu”, and the third candidate (S26). Here, since “na” is present in the third character of the second candidate “sesana + goyu” (S26: YES), “na” is stored in the working area 8h as the third character of the rearrangement candidate. (S27).

【００４１】次いで、カウンタｊの値を「１」増加して
「ｊ＝３」とし（Ｓ２８）、テキストデータ内の「セミ
ナ」の文字の次の文字である「ー」が第１候補「ヒミ＋
ー会場」、第２候補「セ三ナ＋合湯」、第３候補「ゼシ
フ÷全易」の（ｉ＋ｊ）文字め、即ち、４文字めに存在
するか否かを判断する（Ｓ２６）。ここでは、第１候補
「ヒミ＋ー会場」の４文字めに、「ー」が存在するので
（Ｓ２６：ＹＥＳ）、ワーキングエリア８ｈに並び替え
候補の４文字目として「ー」を記憶する（Ｓ２７）。Next, the value of the counter j is increased by "1" to "j = 3" (S28), and the next character "-" after the character "Semina" in the text data is the first candidate "Himi". +
It is determined whether or not the second character is present in the (i + j) -th character, ie, the fourth character of the second candidate “Sesana + goyu” and the third candidate “Zesif @ Zen-y” (S26). Here, since “−” exists in the fourth character of the first candidate “Himi + − venue” (S26: YES), “−” is stored as the fourth character of the rearrangement candidate in the working area 8h ( S27).

【００４２】次いで、カウンタｊの値を「１」増加して
「ｊ＝４」とし（Ｓ２８）、テキストデータ内の「セミ
ナー」の文字の次の文字である「の」が第１候補「ヒミ
＋ー会場」、第２候補「セ三ナ＋合湯」、第３候補「ゼ
シフ÷全易」の（ｉ＋ｊ）文字め、即ち、５文字めに存
在するか否かを判断する（Ｓ２６）。ここでは、見つか
らないので（Ｓ２６：ＮＯ）、Ｓ２９の判断処理に移行
する。Ｓ２９の判断では、ｊが２以上か否かを判断す
る。ｊが２以上の場合には、２文字以上、認識候補の文
字とテキストデータ記憶エリア８ａに記憶されているテ
キストデータ内の文字が一致しているので（Ｓ２９：Ｙ
ＥＳ）、認識結果記憶エリア８ｆに記憶した第１候補の
文字のｉ文字めからｉ＋ｊ−１文字め（上記の例では、
１文字めから４文字め）までを、ワーキングエリア８ｈ
に記憶した並べ替え候補の「セミナー」と並び替える
（Ｓ３０）。Next, the value of the counter j is increased by "1" to "j = 4" (S28), and "no" which is the next character of the character of "seminar" in the text data is the first candidate "Himi". + -Venue ”, the second candidate“ Sesuna + goyu ”, and the third candidate“ Zesif @ Zeny ”(i + j), that is, whether or not the fifth character is present (S26). . Here, since it is not found (S26: NO), the processing shifts to the determination processing of S29. In the determination of S29, it is determined whether j is 2 or more. When j is 2 or more, since two or more characters match the characters of the recognition candidate and the characters in the text data stored in the text data storage area 8a (S29: Y
ES), the i-th to (i + j−1) -th characters of the first candidate character stored in the recognition result storage area 8f (in the above example,
Working area 8h from 1st character to 4th character)
(S30).

【００４３】その後、カウンタｉ＝ｉ＋ｊとする（Ｓ３
１）。即ち、上記の例では、「ｉ＝１＋４＝５」となる
（Ｓ３１）。その後、Ｓ２２に戻って、ｉ＝５に対応し
た５文字めの認識候補が認識結果記憶エリア８ｆに記憶
されているか否かを判断する（Ｓ２２）。５文字めの認
識候補が認識結果記憶エリア８ｆに記憶されているので
（Ｓ２２：ＹＥＳ）、認識結果記憶エリア８ｆに記憶さ
れている第１候補〜第３候補の５文字めの認識候補
「会」、「合」、「全」がテキストデータ内にあるかど
うか検索する（Ｓ２３）。Thereafter, the counter is set to i = i + j (S3
1). That is, in the above example, "i = 1 + 4 = 5" (S31). Thereafter, returning to S22, it is determined whether or not the fifth character recognition candidate corresponding to i = 5 is stored in the recognition result storage area 8f (S22). Since the fifth character recognition candidate is stored in the recognition result storage area 8f (S22: YES), the fifth to fifth character recognition candidates of the first to third candidates stored in the recognition result storage area 8f are displayed. A search is made to determine whether "", "go", and "all" are in the text data (S23).

【００４４】テキストデータ記憶エリア８ａに記憶され
ているテキストデータ「セミナーのお知らせ、本日、１
時より右会場でセミナーを行います。」内の１７文字め
（読点は数えず）に、第１候補の「会」の文字が見つか
るので（Ｓ２３：ＹＥＳ）、ワーキングエリア８ｈに並
び替え候補の５文字目として「会」を記憶する（Ｓ２
４）。The text data “seminar notice, today,
We will hold a seminar at the right venue from time to time. Since the character of the first candidate "kai" is found in the 17th character (without counting the reading points) in "" (S23: YES), "kai" is stored in the working area 8h as the fifth character of the sorting candidate. (S2
4).

【００４５】次に、カウンタｊ＝１として、テキストデ
ータ記憶エリア８ａに記憶されているテキストデータ
「セミナーのお知らせ、本日、１時より右会場でセミナ
ーを行います。」中で、Ｓ２３の処理で見つかった
「会」の文字の次の文字である「場」が第１候補〜第３
候補の（ｉ＋ｊ）文字め、即ち、６文字めに存在するか
否かを判断する（Ｓ２６）。ここでは、第１候補の６文
字めに、「場」が存在するので（Ｓ２６：ＹＥＳ）、ワ
ーキングエリア８ｈに並び替え候補の６文字目として
「場」を記憶する（Ｓ２７）。Next, assuming that the counter j = 1, in the text data stored in the text data storage area 8a, "Seminar announcement, today, a seminar will be held at the right venue from 1:00.""Place", which is the character next to the character of "Kai", is the first to third candidates
It is determined whether or not the candidate exists at the (i + j) th character, that is, the sixth character (S26). Here, since "place" exists in the sixth character of the first candidate (S26: YES), "place" is stored in the working area 8h as the sixth character of the rearrangement candidate (S27).

【００４６】次いで、カウンタｊの値を「１」増加して
「ｊ＝２」とし（Ｓ２８）、テキストデータ内の「会
場」の文字の次の文字である「で」が第１候補〜第３候
補の（ｉ＋ｊ）文字め、即ち、７文字めに存在するか否
かを判断する（Ｓ２６）。ここでは、見つからないので
（Ｓ２６：ＮＯ）、Ｓ２９の判断処理に移行する。Ｓ２
９の判断では、ｊが２以上か否かを判断する。ｊが２以
上の場合には、２文字以上、認識候補の文字とテキスト
データ記憶エリア８ａに記憶されているテキストデータ
内の文字が一致しているので（Ｓ２９：ＹＥＳ）、認識
結果記憶エリア８ｆに記憶した第１候補の文字のｉ文字
めからｉ＋ｊ−１文字め（上記の例では、５文字めから
６文字め）までを、ワーキングエリア８ｈに記憶した並
べ替え候補の「会場」に並び替える（Ｓ３０）。ここで
は、第１候補の５文字め〜６文字めが、「会場」だった
ので、並び替え前と変わらない。Next, the value of the counter j is increased by "1" to "j = 2" (S28), and "de", which is the character next to the character of "venue" in the text data, is the first candidate to the first. It is determined whether or not the third candidate exists in the (i + j) th character, that is, the seventh character (S26). Here, since it is not found (S26: NO), the processing shifts to the determination processing of S29. S2
In the determination of 9, it is determined whether j is 2 or more. When j is 2 or more, the recognition candidate character matches the character in the text data stored in the text data storage area 8a with two or more characters (S29: YES), and thus the recognition result storage area 8f. Are arranged from the i-th character to the (i + j-1) -th character (the fifth to sixth characters in the above example) of the first candidate characters stored in the working area 8h stored in the working area 8h. Replace (S30). Here, the fifth and sixth characters of the first candidate are the “venues”, so they are the same as before sorting.

【００４７】その後、カウンタｉ＝ｉ＋ｊとする（Ｓ３
１）。即ち、上記の例では、「ｉ＝５＋２＝７」となる
（Ｓ３１）。その後、Ｓ２２に戻って、ｉ＝７に対応し
た７文字めの認識候補が認識結果記憶エリア８ｆに記憶
されているか否かを判断する（Ｓ２２）。７文字めの認
識候補が認識結果記憶エリア８ｆに記憶されていないの
で（Ｓ２２：ＮＯ）、認識候補の並べ替え処理を終了
し、図２に示すフローチャートのＳ１０に移行する。以
上の並び替え処理により、認識結果記憶エリア８ｆに記
憶された第１候補は、「セミナー会場」と並び替えられ
ている。Thereafter, the counter is set to i = i + j (S3
1). That is, in the above example, "i = 5 + 2 = 7" (S31). Thereafter, returning to S22, it is determined whether or not the recognition candidate of the seventh character corresponding to i = 7 is stored in the recognition result storage area 8f (S22). Since the recognition candidate for the seventh character is not stored in the recognition result storage area 8f (S22: NO), the process of rearranging the recognition candidates is terminated, and the process shifts to S10 in the flowchart shown in FIG. By the above rearrangement processing, the first candidates stored in the recognition result storage area 8f are rearranged as “seminar venue”.

【００４８】次に、Ｓ１０では、並び替えられた認識候
補の第１候補である「セミナー会場」を周知の機械翻訳
により英語に翻訳する（Ｓ１０）。すると、「Place of
seminar」との翻訳結果を得るので、ＲＡＭ８の翻訳結
果記憶エリア８ｄに記憶する。Next, in step S10, the first seminar candidate "seminar venue" which is rearranged is translated into English by a well-known machine translation (S10). Then, "Place of
Since the translation result of "seminar" is obtained, it is stored in the translation result storage area 8d of the RAM 8.

【００４９】次に、ＨＴＭＬファイルの合成処理を行う
（Ｓ１１）。このＳ１１の処理では、翻訳結果記憶エリ
ア８ｄに記憶されている翻訳されたテキストデータ「In
formation of seminar Today there is seminar in rig
ht place from 1:00」と、Ｓ５で解析された画像データ
の文字部以外の画像データ（本例では、地図部分）と、
画像データの文字部「セミナー会場」を翻訳した結果の
「Place of seminar」とを合成し、図７に示すように、
表示装置６のブラウザ画面の左側部分Ｅに「Informatio
n of seminar Today there is seminar in right place
from 1:00」と表示され、右側部分Ｆに地図と「Place
of seminar」の文字が表示されるＨＴＭＬの文書ファイ
ルを作成し、ＲＡＭ８の合成結果記憶エリア８ｇに記憶
し、処理を終了する。ここで、ＨＴＭＬファイルの合成
処理におけるテキストデータの配置や画像の配置は、Ｓ
１におけるＨＴＭＬファイルの解析時に読み取った翻訳
前のテキストデータの配置や画像の配置の情報を利用し
て、翻訳後のテキストデータの配置や画像の配置を行
う。Next, an HTML file synthesizing process is performed (S11). In the process of S11, the translated text data “In” stored in the translation result storage area 8d is read.
formation of seminar Today there is seminar in rig
ht place from 1:00 ", image data other than the character part of the image data analyzed in S5 (in this example, a map part),
The text part “seminar venue” of the image data is translated and “Place of seminar” is synthesized, and as shown in FIG. 7,
"Informatio" is displayed on the left side E of the browser screen of the display device 6.
n of seminar Today there is seminar in right place
from 1:00 "and a map and" Place
An HTML document file in which the letters "of seminar" are displayed is created and stored in the synthesis result storage area 8g of the RAM 8, and the process is terminated. Here, the arrangement of the text data and the arrangement of the images in the synthesis processing of the HTML file are represented by S
The arrangement of the text data after translation and the arrangement of the images are performed by using the information of the arrangement of the text data before translation and the arrangement of the images read at the time of analyzing the HTML file in 1.

【００５０】すると、表示装置６のブラウザ画面には、
図７に示すように、左側部Ｅに「Information of semin
ar Today there is seminar in right place from 1:0
0」と表示され、右側部Ｆに地図が表示され、その下に
「Place of seminar」と表示される。Then, the browser screen of the display device 6 displays
As shown in FIG. 7, "Information of semin"
ar Today there is seminar in right place from 1: 0
"0" is displayed, a map is displayed on the right side F, and "Place of seminar" is displayed below it.

【００５１】以上説明したように、上記の実施の形態の
文書処理装置１では、認識候補とテキストデータ内の文
字列を比較して、認識候補の並べ替えを行うので、文字
認識の精度を上げることができる。また、テキストデー
タ内の２文字以上が一致した場合に、候補の並べ替えを
行うので、間違って候補の並べ替えを行ってしまうこと
を防ぐことができる。As described above, in the document processing apparatus 1 of the above embodiment, the recognition candidates are compared with the character strings in the text data and the recognition candidates are rearranged, so that the accuracy of character recognition is improved. be able to. Further, when two or more characters in the text data match, the candidates are rearranged, so that it is possible to prevent the rearrangement of the candidates by mistake.

【００５２】次に、ＲＡＭ８の認識結果記憶エリア８ｆ
に記憶された認識候補と、テキストデータ記憶エリア８
ａに記憶されたテキストデータの比較及び候補の並び替
え処理（Ｓ９）の変形例を、図４及び図５に示すサブル
ーチンのフローチャートを参照して説明する。上記の実
施の形態では、２文字以上一致した場合に、無条件で候
補を並べ替えるようにしたが、この図４及び図５に示す
サブルーチンの処理では、認識候補の認識スコアも考慮
して、あまり認識スコアがよくない候補は、より多くの
文字列（例えば、４文字以上の文字列）が一致しない
と、認識候補の並べ替えを行わないようにしたものであ
る。Next, the recognition result storage area 8f of the RAM 8
Candidate and the text data storage area 8
A modified example of the comparison of the text data stored in a and the sorting of the candidates (S9) will be described with reference to the flowcharts of the subroutines shown in FIGS. In the above embodiment, when two or more characters match, the candidates are unconditionally rearranged. However, in the processing of the subroutine shown in FIGS. 4 and 5, the recognition score of the recognition candidate is also taken into consideration. Candidates with poor recognition scores do not sort recognition candidates unless more character strings (eg, four or more character strings) match.

【００５３】以下の説明では、何文字めの認識結果に注
目しているかを示すカウンタｉの値と、一致した文字数
を示すカウンタｊの値と、認識候補が認識スコアの悪い
単語を含むか否かを示す認識スコアフラグｓは、ＲＡＭ
８のワーキングエリア８ｈに記憶されるものとする。
尚、認識スコアフラグｓ＝０の場合には、認識候補が認
識スコアの悪い単語を含むこと意味し、認識スコアフラ
グｓ＝１の場合には、認識候補が認識スコアの悪い単語
を含まないこと意味する。In the following description, the value of a counter i indicating the number of characters to which the recognition result is focused, the value of a counter j indicating the number of matching characters, and whether a recognition candidate includes a word having a bad recognition score. The recognition score flag s indicating whether
8 in the working area 8h.
If the recognition score flag s = 0, it means that the recognition candidate includes a word with a bad recognition score, and if the recognition score flag s = 1, the recognition candidate does not include a word with a bad recognition score. means.

【００５４】まず、図４に示すように、カウンタｉ＝１
とし（Ｓ２１）、認識スコアフラグｓ＝１とする（Ｓ４
２）。次いで、カウンタｉ＝１に対応した１文字めの認
識候補が認識結果記憶エリア８ｆに記憶されているか否
かを判断する（Ｓ４３）。ここでは、認識結果記憶エリ
ア８ｆに第１候補「ヒミ＋ー会場」、第２候補「セ三ナ
＋合湯」、第３候補「ゼシフ÷全易」が記憶されてお
り、各々、１文字めの認識候補が「ヒ」、「セ」、
「ゼ」が存在するので（Ｓ４３：ＹＥＳ）、その第１候
補「ヒミ＋ー会場」、第２候補「セ三ナ＋合湯」、第３
候補「ゼシフ÷全易」の各々の１文字めの認識候補
「ヒ」、「セ」、「ゼ」がテキストデータ内にあるかど
うか検索する（Ｓ４４）。First, as shown in FIG.
(S21), and the recognition score flag s = 1 (S4
2). Next, it is determined whether or not the first character recognition candidate corresponding to the counter i = 1 is stored in the recognition result storage area 8f (S43). Here, the first candidate "Himi + -Venue", the second candidate "Sesuna + goyu", and the third candidate "Zesif @ Zenyai" are stored in the recognition result storage area 8f, each of which has one character. Recognition candidates are "hi", "se",
Since "ze" exists (S43: YES), the first candidate "Himi +-venue", the second candidate "Sesana + Goyu", the third candidate
A search is made to determine whether the first character recognition candidates "hi", "se", and "ze" of each of the candidates "zeshifu @ zenyaku" are present in the text data (S44).

【００５５】テキストデータ記憶エリア８ａに記憶され
ているテキストデータ「セミナーのお知らせ、本日、１
時より右会場でセミナーを行います。」内に、第２候補
の「セ三ナ＋合湯」の１文字めの「セ」と同じ文字であ
る「セ」があるので（Ｓ４４：ＹＥＳ）、ワーキングエ
リア８ｈに並び替え候補の１文字目として「セ」を記憶
する（Ｓ４５）。尚、テキストデータ「セミナーのお知
らせ、本日、１時より右会場でセミナーを行います。」
内の検索では、「セ」は、２つ見つかるが、最初に出て
きた文字を採用するものとする。また、Ｓ４４の判断で
否と判断された場合には（Ｓ４４：ＮＯ）、ｉの値を
「１」増加し（Ｓ４９）、Ｓ４３に戻る。The text data “seminar notice, today, 1
We will hold a seminar at the right venue from time to time. , There is "se" which is the same character as the first character "se" of the second candidate "sesana + goyu" (S44: YES), so that one of the rearrangement candidates is placed in the working area 8h. "C" is stored as the character (S45). In addition, the text data "Seminar announcement, seminar will be held at the right venue from 1:00 today."
In the search in, two "se" are found, but the character that first appears is adopted. Further, when it is determined that the result of S44 is NO (S44: NO), the value of i is increased by "1" (S49), and the process returns to S43.

【００５６】次に、カウンタｊ＝１として、テキストデ
ータ記憶エリア８ａに記憶されているテキストデータ
「セミナーのお知らせ、本日、１時より右会場でセミナ
ーを行います。」中で、Ｓ４４の処理で見つかった
「セ」と一致する第２候補の「セ三ナ＋合湯」の１文字
めの「セ」の認識スコアが５０未満か否かを判断する
（Ｓ４７）。認識スコアとは、画像データからなる文字
をオフライン認識等の周知の文字認識法により文字認識
を行ったときに、文字の基本形状の特徴データとの一致
を数値化したもので、「０〜１００」の間に正規化され
た数値である。そして、認識スコアが１００に近いほど
認識精度が良く、０に近いほど認識精度が悪いことを表
している。Next, assuming that the counter j = 1, in the text data stored in the text data storage area 8a, "Seminar announcement, today, a seminar will be held at the right venue from 1:00." It is determined whether the recognition score of the first character “se” of the second candidate “sesana + goyu” that matches the found “se” is less than 50 (S47). The recognition score is a value obtained by digitizing a match between the character of the basic shape of the character and the characteristic data when the character composed of the image data is subjected to character recognition by a known character recognition method such as off-line recognition. ] Are normalized numbers. The closer the recognition score is to 100, the better the recognition accuracy, and the closer to 0, the worse the recognition accuracy.

【００５７】例えば、Ｓ４４の処理で見つかった「セ」
と一致する第２候補の「セ三ナ＋合湯」の１文字めの
「セ」の認識スコアが４５の場合には、認識スコアが５
０未満で、「セ」の認識精度が悪いと判断されるので
（Ｓ４７：ＹＥＳ）、認識スコアフラグｓ＝０としてワ
ーキングエリア８ｈに記憶し（Ｓ４８）、図５に示すＳ
５０の判断処理に移行する。尚、認識スコアが５０以上
の場合には（Ｓ４７：ＮＯ）、図５に示すＳ５０の判断
処理に移行する。For example, "S" found in the process of S44
When the recognition score of the first character “se” of the second candidate “sesana + goyu” that matches with is 45, the recognition score is 5
If it is less than 0, it is determined that the recognition accuracy of “C” is poor (S47: YES), so that it is stored in the working area 8h as the recognition score flag s = 0 (S48), and S shown in FIG.
The process proceeds to a determination process of 50. If the recognition score is 50 or more (S47: NO), the process shifts to the determination process of S50 shown in FIG.

【００５８】次に、テキストデータ記憶エリア８ａに記
憶されているテキストデータ「セミナーのお知らせ、本
日、１時より右会場でセミナーを行います。」中で、Ｓ
４４の処理で見つかった「セ」の文字の次の文字である
「ミ」が第１候補「ヒミ＋ー会場」、第２候補「セ三ナ
＋合湯」、第３候補「ゼシフ÷全易」の（ｉ＋ｊ）文字
め、即ち、２文字めに存在するか否かを判断する（Ｓ５
０）。ここでは、第１候補「ヒミ＋ー会場」の２文字め
に、「ミ」が存在するので（Ｓ５０：ＹＥＳ）、ワーキ
ングエリア８ｈに並び替え候補の２文字目として「ミ」
を記憶する（Ｓ５１）。Next, in the text data stored in the text data storage area 8a, "Seminar announcement, a seminar will be held at the right venue from 1:00 today."
The character "mi" next to the character "se" found in the processing of 44 is the first candidate "Himi +-venue", the second candidate "sesana + goto", and the third candidate "zesif @ all" It is determined whether or not the character exists in the (i + j) -th character of “Easy”, that is, the second character (S5).
0). Here, since "mi" is present in the second character of the first candidate "Himi +-venue" (S50: YES), "mi" is selected as the second character of the rearrangement candidate in the working area 8h.
Is stored (S51).

【００５９】次いで、カウンタｊの値を「１」増加して
「ｊ＝２」とし（Ｓ５２）、Ｓ５０の処理で見つかった
「ミ」と一致する第１候補「ヒミ＋ー会場」の２文字め
の「ミ」の認識スコアが５０未満か否かを判断する（Ｓ
５３）。認識スコアが５０未満の場合には、認識スコア
フラグｓ＝０としてワーキングエリア８ｈに記憶し（Ｓ
５４）、Ｓ５０の判断処理に移行する。尚、認識スコア
が５０以上の場合にも（Ｓ５３：ＮＯ）、Ｓ５０の判断
処理に移行する。Next, the value of the counter j is incremented by “1” to “j = 2” (S52), and the two characters of the first candidate “Himi + —venue” that matches “Mi” found in the processing of S50. It is determined whether the recognition score of “mi” is less than 50 (S
53). If the recognition score is less than 50, the recognition score flag s = 0 is stored in the working area 8h (S
54), the process proceeds to the determination process of S50. When the recognition score is 50 or more (S53: NO), the process shifts to the determination process of S50.

【００６０】Ｓ５０の判断処理では、テキストデータ内
の「セミ」の文字の次の文字である「ナ」が第１候補
「ヒミ＋ー会場」、第２候補「セ三ナ＋合湯」、第３候
補「ゼシフ÷全易」の（ｉ＋ｊ）文字め、即ち、３文字
めに存在するか否かを判断する（Ｓ５０）。ここでは、
第２候補「セ三ナ＋合湯」の３文字めに、「ナ」が存在
するので（Ｓ５０：ＹＥＳ）、ワーキングエリア８ｈに
並び替え候補の３文字目として「ナ」を記憶する（Ｓ５
１）。In the determination processing of S50, the character "na" which is the character next to the character "semi" in the text data is the first candidate "Himi + -venue", the second candidate "Sesana + goyu", It is determined whether the third candidate "zeshifu @ zenyaku" exists in the (i + j) th character, that is, in the third character (S50). here,
Since "na" is present in the third character of the second candidate "sesana + goyu" (S50: YES), "na" is stored in the working area 8h as the third character of the rearrangement candidate (S5).
1).

【００６１】次いで、カウンタｊの値を「１」増加して
「ｊ＝３」とし（Ｓ５２）、上記同様に認識スコアの判
断を行い（Ｓ５３）、認識スコアが５０未満なら認識ス
コアフラグｓ＝０とし（Ｓ５４）、次に、テキストデー
タ内の「セミナ」の文字の次の文字である「ー」が第１
候補「ヒミ＋ー会場」、第２候補「セ三ナ＋合湯」、第
３候補「ゼシフ÷全易」の（ｉ＋ｊ）文字め、即ち、４
文字めに存在するか否かを判断する（Ｓ５０）。ここで
は、第１候補「ヒミ＋ー会場」の４文字めに、「ー」が
存在するので（Ｓ５０：ＹＥＳ）、ワーキングエリア８
ｈに並び替え候補の４文字目として「ー」を記憶する
（Ｓ５１）。Next, the value of the counter j is increased by "1" to "j = 3" (S52), and the recognition score is determined in the same manner as above (S53). If the recognition score is less than 50, the recognition score flag s = 0 (S54), and the next character "-" after the character "Semina" in the text data is the first character.
The (i + j) letters of the candidate "Himi + venue", the second candidate "Sesana + Goyu", and the third candidate "Zesif @ Zenyaku", that is, 4
It is determined whether or not the character exists (S50). Here, since “−” exists in the fourth character of the first candidate “Himi + − venue” (S50: YES), the working area 8
"-" is stored in h as the fourth character of the sorting candidate (S51).

【００６２】次いで、カウンタｊの値を「１」増加して
「ｊ＝４」とし（Ｓ５２）、上記同様に認識スコアの判
断を行い（Ｓ５３）、認識スコアが５０未満なら認識ス
コアフラグｓ＝０とし（Ｓ５４）、次に、テキストデー
タ内の「セミナー」の文字の次の文字である「の」が第
１候補「ヒミ＋ー会場」、第２候補「セ三ナ＋合湯」、
第３候補「ゼシフ÷全易」の（ｉ＋ｊ）文字め、即ち、
５文字めに存在するか否かを判断する（Ｓ５０）。ここ
では、見つからないので（Ｓ５０：ＮＯ）、Ｓ５５の判
断処理に移行する。Ｓ５５の判断では、認識スコアフラ
グｓが「０」か否かを判断する（Ｓ５５）。Next, the value of the counter j is increased by "1" to "j = 4" (S52), and the recognition score is determined in the same manner as above (S53). If the recognition score is less than 50, the recognition score flag s = 0 (S54), and the next character "no" in the text data is the first candidate "Himi +-venue", the second candidate "Semina + Goyu",
(I + j) character of the third candidate “Zesif @ Zeny”,
It is determined whether or not the fifth character exists (S50). Here, since it is not found (S50: NO), the processing shifts to the determination processing of S55. In the determination of S55, it is determined whether the recognition score flag s is “0” (S55).

【００６３】認識スコアフラグｓが「０」の場合には
（Ｓ５５：ＹＥＳ）、テキストデータ内の文字と一致し
た認識候補に認識精度が低いものが含まれているので、
カウンタｊが４以上の場合（Ｓ５６：ＹＥＳ）、即ち、
４文字以上、認識候補の文字とテキストデータ記憶エリ
ア８ａに記憶されているテキストデータ内の文字が一致
している場合のみ（Ｓ５６：ＹＥＳ）、認識結果記憶エ
リア８ｆに記憶した第１候補の文字のｉ文字めからｉ＋
ｊ−１文字め（上記の例では、１文字めから４文字め）
までを、ワーキングエリア８ｈに記憶した並べ替え候補
の「セミナー」と並び替えを行う（Ｓ５８）。When the recognition score flag s is “0” (S55: YES), recognition candidates having low recognition accuracy are included in the recognition candidates that match the characters in the text data.
When the counter j is 4 or more (S56: YES), that is,
Only when four or more characters of the recognition candidate match characters in the text data stored in the text data storage area 8a (S56: YES), the first candidate character stored in the recognition result storage area 8f From the i character to i +
j-1th character (in the above example, the first to fourth characters)
Are sorted with the sorting seminar "seminar" stored in the working area 8h (S58).

【００６４】認識スコアフラグｓが「１」の場合には
（Ｓ５５：ＮＯ）、テキストデータ内の文字と一致した
認識候補に認識精度が低いものが含まれていないので、
カウンタｊが２以上の場合（Ｓ５７：ＹＥＳ）、即ち、
２文字以上、認識候補の文字とテキストデータ記憶エリ
ア８ａに記憶されているテキストデータ内の文字が一致
していれば（Ｓ５７：ＹＥＳ）、認識結果記憶エリア８
ｆに記憶した第１候補の文字のｉ文字めからｉ＋ｊ−１
文字め（上記の例では、１文字めから４文字め）まで
を、ワーキングエリア８ｈに記憶した並べ替え候補の
「セミナー」と並び替えを行う（Ｓ５８）。If the recognition score flag s is “1” (S55: NO), since recognition candidates having low recognition accuracy are not included in the recognition candidates that match the characters in the text data,
When the counter j is 2 or more (S57: YES), that is,
If two or more characters of the recognition candidate match the characters in the text data stored in the text data storage area 8a (S57: YES), the recognition result storage area 8
From the i-th character of the first candidate character stored in f to i + j-1
The characters (the first to fourth characters in the above example) are rearranged with the rearrangement candidate “seminar” stored in the working area 8h (S58).

【００６５】その後、カウンタｉ＝ｉ＋ｊとする（Ｓ５
９）。即ち、上記の例では、「ｉ＝１＋４＝５」となる
（Ｓ５９）。その後、Ｓ４３に戻って、ｉ＝５に対応し
た５文字めの認識候補が認識結果記憶エリア８ｆに記憶
されているか否かを判断する（Ｓ４３）。５文字めの認
識候補が認識結果記憶エリア８ｆに記憶されているので
（Ｓ４３：ＹＥＳ）、認識結果記憶エリア８ｆに記憶さ
れている第１候補〜第３候補の５文字めの認識候補
「会」、「合」、「全」がテキストデータ内にあるかど
うか検索する（Ｓ４４）。Thereafter, the counter is set to i = i + j (S5
9). That is, in the above example, “i = 1 + 4 = 5” (S59). Thereafter, the process returns to S43, and it is determined whether or not the recognition candidate of the fifth character corresponding to i = 5 is stored in the recognition result storage area 8f (S43). Since the fifth character recognition candidate is stored in the recognition result storage area 8f (S43: YES), the fifth to fifth character recognition candidates of the first to third candidates stored in the recognition result storage area 8f are displayed. A search is made to see if "", "go", and "all" are in the text data (S44).

【００６６】テキストデータ記憶エリア８ａに記憶され
ているテキストデータ「セミナーのお知らせ、本日、１
時より右会場でセミナーを行います。」内の１７文字め
（読点は数えず）に、第１候補の「ヒミ＋ー会場」の５
文字め「会」の文字が見つかるので（Ｓ４４：ＹＥ
Ｓ）、ワーキングエリア８ｈに並び替え候補の５文字目
として「会」を記憶する（Ｓ４５）。The text data “seminar notice, today, 1
We will hold a seminar at the right venue from time to time. In the 17th character (without counting the readings) in the 5th
Since the character "kai" is found (S44: YE
S), "kai" is stored in the working area 8h as the fifth character of the sorting candidate (S45).

【００６７】次に、カウンタｊ＝１として（Ｓ４６）、
テキストデータ記憶エリア８ａに記憶されているテキス
トデータ「セミナーのお知らせ、本日、１時より右会場
でセミナーを行います。」中で、Ｓ４４の処理で見つか
った「会」と一致する第１候補「ヒミ＋ー会場」の５文
字めの「会」の認識スコアが５０未満か否かを判断する
（Ｓ４７）。例えば、Ｓ４４の処理で見つかった「会」
と一致する第１候補の１文字めの「会」の第１候補「ヒ
ミ＋ー会場」の５文字めの「会」の認識スコアが４８の
場合には、認識スコアが５０未満で、「会」の認識精度
が悪いと判断されるので（Ｓ４７：ＹＥＳ）、認識スコ
アフラグｓ＝０としてワーキングエリア８ｈに記憶し
（Ｓ４８）、図５に示すＳ５０の判断処理に移行する。
尚、認識スコアが５０以上の場合には（Ｓ４７：Ｎ
Ｏ）、図５に示すＳ５０の判断処理に移行する。Next, the counter j = 1 (S46),
In the text data stored in the text data storage area 8a, "Announcement of the seminar, a seminar will be held at the right venue from 1 o'clock today." It is determined whether or not the recognition score of the fifth character of "Himi + -venue" is less than 50 (S47). For example, "kai" found in the processing of S44
When the recognition score of the first character of the first character "kai" of the first character "kai" of the first candidate "Himi + Venue" that matches the recognition score is 48, the recognition score is less than 50, and " It is determined that the recognition accuracy of the event is low (S47: YES), so that the recognition score flag s = 0 is stored in the working area 8h (S48), and the process proceeds to the determination process of S50 shown in FIG.
If the recognition score is 50 or more (S47: N
O), the process proceeds to the determination process of S50 shown in FIG.

【００６８】次に、Ｓ５０では、テキストデータ記憶エ
リア８ａに記憶されているテキストデータ「セミナーの
お知らせ、本日、１時より右会場でセミナーを行いま
す。」中で、Ｓ４４の処理で見つかった「会」の文字の
次の文字である「場」が第１候補〜第３候補の（ｉ＋
ｊ）文字め、即ち、６文字めに存在するか否かを判断す
る（Ｓ５０）。ここでは、第１候補の６文字めに、
「場」が存在するので（Ｓ５０：ＹＥＳ）、ワーキング
エリア８ｈに並び替え候補の６文字目として「場」を記
憶する（Ｓ５１）。Next, in S50, in the text data stored in the text data storage area 8a, "Seminar announcement, a seminar will be held at the right venue from 1:00 today." The first character of the first to third candidates (i +
j) It is determined whether the character exists in the sixth character, that is, in the sixth character (S50). Here, for the sixth character of the first candidate,
Since "place" exists (S50: YES), "place" is stored in the working area 8h as the sixth character of the sorting candidate (S51).

【００６９】次いで、カウンタｊの値を「１」増加して
「ｊ＝２」とし（Ｓ５２）、上記同様に認識スコアの判
断を行い（Ｓ５３）、認識スコアが５０未満なら認識ス
コアフラグｓ＝０とし（Ｓ５４）、次に、テキストデー
タ内の「会場」の文字の次の文字である「で」が第１候
補〜第３候補の（ｉ＋ｊ）文字め、即ち、７文字めに存
在するか否かを判断する（Ｓ５０）。ここでは、見つか
らないので（Ｓ５０：ＮＯ）、Ｓ５５の判断処理に移行
する。Ｓ５５の判断では、認識スコアフラグｓが「０」
か否かを判断する（Ｓ５５）。Next, the value of the counter j is increased by "1" to "j = 2" (S52), and the recognition score is determined in the same manner as above (S53). If the recognition score is less than 50, the recognition score flag s = The value is set to 0 (S54), and next, "de", which is the next character of the character of "venue" in the text data, exists in the (i + j) -th character of the first to third candidates, that is, the seventh character. It is determined whether or not it is (S50). Here, since it is not found (S50: NO), the processing shifts to the determination processing of S55. In the determination of S55, the recognition score flag s is “0”
It is determined whether or not (S55).

【００７０】ここでは、認識スコアフラグｓが「０」に
なっているので（Ｓ５５：ＹＥＳ）、テキストデータ内
の文字と一致した認識候補に認識精度が低い「会」の文
字が含まれており、カウンタｊが２なので（Ｓ５６：Ｎ
Ｏ）、Ｓ５８の並び替えを行わず、Ｓ５９に移行し、カ
ウンタｉ＝ｉ＋ｊとする（Ｓ５９）。即ち、上記の例で
は、「ｉ＝５＋２＝７」となる（Ｓ５９）。その後、Ｓ
４３に戻って、ｉ＝７に対応した７文字めの認識候補が
認識結果記憶エリア８ｆに記憶されているか否かを判断
する（Ｓ４３）。７文字めの認識候補が認識結果記憶エ
リア８ｆに記憶されていないので（Ｓ４３：ＮＯ）、認
識候補の並べ替え処理を終了し、図２に示すフローチャ
ートのＳ１０に移行する。以上の並び替え処理により、
認識結果記憶エリア８ｆに記憶された第１候補は、「セ
ミナー」部分のみ並び替えられていることになる。Here, since the recognition score flag s is “0” (S55: YES), the recognition candidate that matches the character in the text data includes the character of “kai” with low recognition accuracy. Since the counter j is 2 (S56: N
O), the process proceeds to S59 without rearranging S58, and sets the counter i = i + j (S59). That is, in the above example, “i = 5 + 2 = 7” (S59). Then, S
Returning to 43, it is determined whether or not the recognition candidate for the seventh character corresponding to i = 7 is stored in the recognition result storage area 8f (S43). Since the recognition candidate for the seventh character is not stored in the recognition result storage area 8f (S43: NO), the process of rearranging the recognition candidates is terminated, and the process shifts to S10 of the flowchart shown in FIG. By the above sorting process,
In the first candidate stored in the recognition result storage area 8f, only the "seminar" portion is rearranged.

【００７１】以上説明したように、上記の変形例では、
認識候補とテキストデータ内の文字列を比較して、認識
候補の並べ替えを行うので、文字認識の精度を上げるこ
とができる。また、テキストデータ内の２文字以上が一
致した場合でも、認識時の認識スコアも考慮して、認識
スコアがよくない候補が含まれる場合は、４文字以上が
テキストデータと一致しないと並べ替えないようにして
いるので、間違って認識スコアの良くない候補に並べ替
えを行ってしまうことを防ぐことができる。As described above, in the above modification,
Since the recognition candidates are compared with the character strings in the text data and the recognition candidates are rearranged, the accuracy of character recognition can be improved. Also, even when two or more characters in the text data match, if a candidate with a poor recognition score is included in consideration of the recognition score at the time of recognition, sorting is not performed unless four or more characters match the text data. By doing so, it is possible to prevent erroneous sorting of candidates with poor recognition scores.

【００７２】尚、上記の実施の形態の文書処理装置１
は、各種の変形ができることは、言うまでもない。例え
ば、上記の実施の形態では、日本語のホームページから
英語に翻訳する例を挙げたが、逆に、英語のホームペー
ジから日本語に翻訳することも可能である。また、言語
は、日本語と英語に限られず、中国語、韓国語、ドイツ
語、フランス語、イタリア語、スペイン語等の各国語に
適用できることは言うまでもない。さらに、第１の実施
の形態では、２文字以上がテキストデータと一致しない
と並べ替えないようにしているが、必ずしも２文字に限
られず、３文字、４文字、５文字等の所定の文字数以上
がテキストデータと一致しないと並べ替を行わないよう
にしても良い。また、上記の変形例では、認識スコアが
よくない候補が含まれる場合は、４文字以上がテキスト
データと一致しないと並べ替えないようにしているが、
必ずしも４文字に限られず、３文字、５文字、６文字等
の所定の文字数以上がテキストデータと一致しないと並
べ替を行わないようにしても良い。The document processing apparatus 1 of the above embodiment
It goes without saying that various modifications can be made. For example, in the above-described embodiment, an example of translating from a Japanese homepage to English has been described, but it is also possible to translate from an English homepage to Japanese. Further, the language is not limited to Japanese and English, and it goes without saying that the invention can be applied to various languages such as Chinese, Korean, German, French, Italian, Spanish, and the like. Furthermore, in the first embodiment, the sorting is not performed unless two or more characters match the text data. However, the sorting is not necessarily limited to two characters and a predetermined number of characters such as three, four, five, etc. May not be sorted if does not match the text data. Further, in the above-described modified example, when a candidate having a poor recognition score is included, sorting is not performed unless four or more characters match the text data.
The number of characters is not necessarily limited to four, and the rearrangement may not be performed unless a predetermined number of characters such as three, five, or six characters matches the text data.

【００７３】[0073]

【発明の効果】以上説明した通り、請求項１に係る発明
の文書処理装置では、記憶手段は入力されたテキストデ
ータとそのテキストデータに付随する画像データとを記
憶し、分離手段は前記記憶手段に記憶された画像データ
から画像部分と文字部分とを分離し、文字認識手段は前
記分離手段が分離した文字部分を文字認識を行って文字
コードに変換し、候補比較手段が、前記文字認識手段の
認識結果の候補と前記記憶手段に記憶されたテキストデ
ータ内の文字列とを比較し、前記認識結果の候補と前記
テキストデータ内の文字列の一致を検出すると、候補並
べ替え手段は前記候補比較手段の比較結果に基づいて前
記文字認識手段の認識結果の候補を並べ替えることがで
きる。従って、認識候補の並べ替えにテキストデータ内
の文字列を参照するので、文字認識の精度を高めること
ができる。As described above, in the document processing apparatus according to the first aspect of the present invention, the storage means stores the input text data and the image data accompanying the text data, and the separation means stores the text data. Separating the image portion and the character portion from the image data stored in the storage device, the character recognition device performs character recognition on the character portion separated by the separation device to convert the character portion into a character code, and the candidate comparison device includes the character recognition device. Comparing the candidate of the recognition result with the character string in the text data stored in the storage unit, and detecting a match between the candidate of the recognition result and the character string in the text data, the candidate reordering unit sets the candidate The recognition result candidates of the character recognition means can be rearranged based on the comparison result of the comparison means. Therefore, since the character strings in the text data are referred to for rearranging the recognition candidates, the accuracy of character recognition can be improved.

【００７４】また、請求項２に係る発明の文書処理装置
では、請求項１に記載の文書処理装置の効果に加えて、
前記候補比較手段は、前記文字認識手段が認識した複数
の候補の中の文字の組み合わせと前記記憶手段に記憶さ
れたテキストデータ内の文字列とを比較し、前記候補比
較手段が前記複数の候補の中の文字の組み合わせと前記
テキストデータ内の連続した複数の文字列が一致するこ
とを検出した場合に、前記候補並べ替え手段は、当該連
続した複数の文字列を優先候補として前記文字認識手段
の認識結果の候補を並べ替えることができる。In the document processing apparatus according to the second aspect of the present invention, in addition to the effects of the document processing apparatus according to the first aspect,
The candidate comparing unit compares a combination of characters in a plurality of candidates recognized by the character recognizing unit with a character string in text data stored in the storage unit, and the candidate comparing unit compares the plurality of candidates. When it is detected that a combination of characters in the text data matches a plurality of continuous character strings in the text data, the candidate reordering unit sets the plurality of continuous character strings as priority candidates and sets the character recognition unit Can be sorted.

【００７５】また、請求項３に係る発明の文書処理装置
では、請求項１又は２に記載の文書処理装置の効果に加
えて、翻訳手段は、前記候補並べ替え手段により並べ替
えられた認識結果の候補の文字列を当該文字列の言語と
は異なる他の言語の文字列に翻訳することができる。According to a third aspect of the present invention, in addition to the effects of the first or second aspect, the translating unit may further include a recognition result rearranged by the candidate rearranging unit. Can be translated into a character string in another language different from the language of the character string.

【００７６】また、請求項４に係る発明の文書処理装置
では、請求項１乃至３の何れかに記載の文書処理装置の
効果に加えて、文書合成手段は前記翻訳手段の翻訳結果
と前記分離手段により分離された画像部分とを合成した
合成画像データを作成することができる。従って、翻訳
結果の中に異なる言語が混在しない合成画像データを得
ることができる。According to a fourth aspect of the present invention, in addition to the effects of the document processing apparatus according to any one of the first to third aspects, the document synthesizing means includes a translation result of the translation means and the separation result. Combined image data can be created by combining the image portion separated by the means. Therefore, it is possible to obtain composite image data in which different languages are not mixed in the translation result.

【００７７】また、請求項５に係る発明の文書処理装置
では、請求項１乃至４の何れかに記載の文書処理装置の
効果に加えて、表示手段は、前記文書合成手段が合成し
た合成画像データを表示することができる。従って、使
用者は、表示手段により、翻訳結果の中に異なる言語が
混在しない合成画像を見ることができる。According to a fifth aspect of the present invention, in addition to the effects of the document processing apparatus according to any one of the first to fourth aspects, the display means further comprises a synthetic image synthesized by the document synthesizing means. Data can be displayed. Therefore, the user can view the synthesized image in which different languages are not mixed in the translation result by the display means.

【００７８】また、請求項６に係る発明の文書処理装置
では、請求項１乃至５の何れかに記載の文書処理装置の
効果に加えて、前記入力されたテキストデータとそのテ
キストデータに付随する画像データとがＨＴＭＬで記述
されているので、文書の分析及び合成が容易にできる。According to a sixth aspect of the present invention, in addition to the effects of the document processing apparatus according to any one of the first to fifth aspects, the input text data and the text data are attached. Since the image data is described in HTML, it is possible to easily analyze and synthesize the document.

[Brief description of the drawings]

【図１】図１は、文書処理装置１の装置の概略構成を示
すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of a document processing apparatus 1;

【図２】図２は、文書処理装置１の主制御のフローチャ
ートである。FIG. 2 is a flowchart of main control of the document processing apparatus 1.

【図３】図３は、文書処理装置１の候補文字とテキスト
データの比較処理及び候補並び替えのサブルーチンのフ
ローチャートである。FIG. 3 is a flowchart of a subroutine for comparing a candidate character and text data and reordering candidates in the document processing apparatus 1.

【図４】図４は、候補並べ替え処理の変形例のフローチ
ャートである。FIG. 4 is a flowchart of a modification of the candidate reordering process.

【図５】図５は、候補並べ替え処理の変形例のフローチ
ャートである。FIG. 5 is a flowchart of a modification of the candidate reordering process.

【図６】図６は、文字認識された候補の文字の例示であ
る。FIG. 6 is an example of candidate characters recognized as characters;

【図７】図７は、翻訳結果が表示装置６のブラウザ画面
上に表示された例を示す図である。FIG. 7 is a diagram illustrating an example in which a translation result is displayed on a browser screen of a display device 6;

【図８】図８は、テキストデータと画像データとを有す
るＨＴＭＬファイルが、表示装置６のブラウザ画面上に
表示された例を示す図である。FIG. 8 is a diagram illustrating an example in which an HTML file having text data and image data is displayed on a browser screen of a display device 6;

【図９】図９は、従来の技術の翻訳結果が表示装置６の
ブラウザ画面上に表示された例を示す図である。FIG. 9 is a diagram illustrating an example in which a translation result according to a conventional technique is displayed on a browser screen of a display device 6;

【符号の説明】１文書処理装置２ＣＰＵ３Ｉ／Ｏインターフェース４キーボード５マウス６表示装置７ＲＯＭ７ａＨＴＭＬファイル解析プログラム記憶エリア７ｂレイアウト解析プログラム記憶エリア７ｃ文字切り出しプログラム記憶エリア７ｄ文字認識プログラム記憶エリア７ｅ文字認識辞書記憶エリア７ｆ候補比較プログラム記憶エリア７ｇ候補並べ替えプログラム記憶エリア７ｈ翻訳プログラム記憶エリア７ｉＨＴＭＬファイル合成プログラム記憶エリア８ＲＡＭ８ａテキストデータ記憶エリア８ｂ画像データ記憶エリア８ｃ解析結果記憶エリア８ｄ翻訳結果記憶エリア８ｅ文字切り出し結果記憶エリア８ｆ認識結果記憶エリア８ｇ合成結果記憶エリア８ｈワーキングエリア９ＣＤ−ＲＯＭドライブ１０ＬＡＮボード１１ケーブル１２ハードディスクドライブ[Description of Signs] 1 Document processing device 2 CPU 3 I / O interface 4 Keyboard 5 Mouse 6 Display device 7 ROM 7a HTML file analysis program storage area 7b Layout analysis program storage area 7c Character extraction program storage area 7d Character recognition program storage area 7e Character recognition dictionary storage area 7f Candidate comparison program storage area 7g Candidate sorting program storage area 7h Translation program storage area 7i HTML file synthesis program storage area 8 RAM 8a Text data storage area 8b Image data storage area 8c Analysis result storage area 8d Translation Result storage area 8e Character cutout result storage area 8f Recognition result storage area 8g Synthesis result storage area 8h Working area 9 CD-ROM drive 10 LAN board 11 Cable 12 Hard disk drive

Claims

[Claims]

A storage unit configured to store input text data and image data accompanying the text data; a separation unit configured to separate an image portion and a character portion from the image data stored in the storage device; A character recognition unit that performs character recognition on the character portion separated by the separation unit and converts the character portion into a character code; and compares a candidate for a recognition result of the character recognition unit with a character string in text data stored in the storage unit. A candidate comparing unit that detects a match between the candidate for the recognition result and a character string in the text data; and a candidate arrangement for rearranging the candidates for the recognition result of the character recognition unit based on the comparison result of the candidate comparing unit. A document processing apparatus comprising: a replacement unit.

2. The candidate comparing unit compares a combination of characters in a plurality of candidates recognized by the character recognizing unit with a character string in text data stored in the storage unit. Detects that a combination of characters in the plurality of candidates matches a plurality of consecutive character strings in the text data, the candidate rearranging unit sets the plurality of consecutive character strings as priority candidates. 2. The document processing apparatus according to claim 1, wherein the candidates of the recognition result of the character recognition unit are rearranged.

3. A translation unit for translating a character string of a candidate of the recognition result rearranged by the candidate rearranging unit into a character string of another language different from the language of the character string. The document processing device according to claim 1.

4. The apparatus according to claim 1, further comprising document synthesizing means for generating synthetic image data obtained by synthesizing a translation result of said translating means and an image portion separated by said separating means. A document processing device according to claim 1.

5. The document processing apparatus according to claim 1, further comprising display means for displaying the combined image data combined by said document combining means.

6. The document processing apparatus according to claim 1, wherein the input text data and image data accompanying the text data are described in HTML.