JPH10254871A

JPH10254871A - Document input method and its device

Info

Publication number: JPH10254871A
Application number: JP9058196A
Authority: JP
Inventors: Osamu Nakamura; 修中村; Kenji Ogura; 健司小倉; Teruo Akiyama; 照雄秋山; Masami Oguro; 雅己小黒
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1997-03-13
Filing date: 1997-03-13
Publication date: 1998-09-25

Abstract

PROBLEM TO BE SOLVED: To provide an efficient document input method and device by clearly indicating the current input position within the original and preventing erroneous selection of homophones. SOLUTION: Document original image data 501 is displayed on a screen 106a side by side with the document 502 entered by the document input person. Character recognition is performed on the characters in the document original, and the recognition result is collated with the character string just entered by the document input person to search for a matching character string. The position in the document original image data 501 corresponding to the matched character string is detected, and this position 503 is displayed within the data 501 on the screen, minimizing the eye movement of the document input person. In addition, for the kana-kanji conversion of the entered character string, one or more homophone candidates are output and collated with the character string of the character recognition result, and the homophone candidates are reordered and output from the most probable candidate downward, based on the content ratio of matching character candidates, to prevent erroneous selection of homophones.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a technique for realizing efficient document input by applying character recognition, which converts printed or handwritten character image data into character codes.

[0002]

[Prior Art] With the progress of office automation (OA), the work of entering documents as character code strings that an information processing system can process is carried out in many fields. A widely used method for this purpose is to type the text on a keyboard while referring to the original, using a word processor or similar tool, which has become the common means of document input.

[0003] In this case, input efficiency, expressed for example as the amount of character code that can be entered per unit time, is affected most strongly by the keying speed of the document input person, but the influence of the input environment, such as the legibility of the original, cannot be ignored. For example, when typing from a densely written original, the line of sight of the document input person travels back and forth frequently between the original and the word processor screen, and each time the person loses track of the current input position within the original, so input efficiency falls.

[0004] Furthermore, Japanese character strings are generally entered on a word processor or the like by way of kana-kanji conversion. In this case, what is typed on the keyboard is the reading (kana) of the character string written in the original. The existence of so-called homophones, in which different kanji strings share the same reading, is another cause of reduced input efficiency, alongside the loss of input position described above.

[0005] That is, the document input person must check whether the kanji-mixed character string (homophone) converted from the typed kana matches the kanji-mixed character string written in the original and, if the conversion is wrong, correct it to the right kanji-mixed character string. This increases both the number of times the line of sight travels between the word processor screen and the original and the number of keystrokes, so input efficiency falls.

[0006] Meanwhile, character recognition, which converts character image data into character codes, is regarded as a promising technique for improving the efficiency of document input. Character recognition is based on identifying which character code an input character image corresponds to from its similarity to character patterns learned in advance. Recognition performance is therefore stable for printed characters, whose shapes vary relatively little, and many products using this technology have been announced. For handwritten characters, however, and handwritten kanji in particular, recognition errors occur frequently when the writer's individual character shapes vary widely, and the benefit of applying character recognition is reduced.

[0007]

[Problems to be Solved by the Invention] As described above, conventional document input with a word processor or the like has the problem that, independently of the ability of the document input person, eye movement and homophone selection errors reduce input efficiency. Document input using character recognition has the problem that character recognition is not effective when the characters to be recognized are heavily deformed.

[0008] The present invention has been made in view of these circumstances. Its object is to eliminate the above problems of the prior art and to provide an efficient document input method and device by clearly indicating the input position within the original and by preventing homophone selection errors.

[0009]

[Means for Solving the Problems] To solve the above problems, the document input method according to the present invention is a document input method using character recognition technology, characterized by: displaying image data of an input target document on a screen; performing character recognition on the characters in the input target document; collating the character code string entered by a document input person with the character code string resulting from the character recognition to search for a matching character code string; detecting the position in the image data of the input target document that corresponds to the matched character code string; and displaying the detected position on the screen.

[0010] The method is further characterized in that kana-kanji conversion is performed on the character code string entered by the document input person to obtain one or more homophone candidates, the character code string resulting from the character recognition is collated with the homophone candidates, and the obtained homophone candidates are reordered from the most probable candidate downward based on the content ratio of matching character candidates.

[0011] Likewise, to solve the above problems, the document input device according to the present invention is a document input device using character recognition technology, characterized by comprising: means for displaying image data of an input target document on a screen; means for performing character recognition on the characters in the input target document; means for collating the character code string entered by a document input person with the character code string resulting from the character recognition; means for searching for a matching character code string as a result of the collation; means for detecting the position in the image data of the input target document that corresponds to the character code string matched by the search; and means for displaying the detected position on the screen.

[0012] The above document input device is further characterized by comprising: means for performing kana-kanji conversion on the character code string entered by the document input person to obtain one or more homophone candidates; means for collating the character code string resulting from the character recognition with the obtained homophone candidates; and means for reordering the obtained homophone candidates from the most probable candidate downward based on the content ratio of the character candidates found to match by the collation.

[0013] In the document input method and device according to the present invention, image data of the input target document is displayed on a screen, character recognition is performed on the characters in the input target document, the character code string entered by the document input person is collated with the character code string resulting from the character recognition to search for a matching character code string, the position in the document image data corresponding to the matched character code string is detected, and the detected position is displayed on the screen. The document input person can thus confirm the input position with minimal eye movement. In addition, for the character code string entered by the document input person, kana-kanji conversion is performed to obtain one or more homophone candidates, the character code string resulting from the character recognition is collated with the homophone candidates, and the homophone candidates are reordered from the most probable candidate downward based on the content ratio, so that candidates containing more matching character candidates are ranked higher. Homophone selection errors that occur during kana-kanji conversion can thereby be prevented even with character recognition technology at its current level. Together, these measures enable efficient document input in keeping with the ability of the document input person.

[0014]

[Embodiments of the Invention] Embodiments of the present invention are described below with reference to the drawings.

[0015] FIG. 1 is a processing block diagram of a document input method and device according to a first embodiment of the present invention. In FIG. 1, 101 is the document original image data to be input, 102 is character recognition means for converting character image data into character codes, 103 is the document input person, 104 is means for entering Japanese character strings, 105 is input position detecting means for detecting the input position within the original, 106 is means for displaying the document original image and the character code string entered by the document input person 103, and 107 is the file of the document already entered.

[0016] The operation of the processing blocks shown in FIG. 1 is described in detail below.

[0017] First, the character recognition means 102 performs character recognition on the document original image data 101. Various methods have been proposed for character recognition, and any of them can be used as the character recognition means in the present invention. For example, the character recognition means can be realized by executing, in turn, extraction of character regions, segmentation of single-character image data, and recognition of single characters. The character recognition means 102 outputs a character recognition result 108 containing the recognized character codes and the position information of each single-character image within the recognized document original image data.
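
As an illustration only, the character recognition result 108 described above could be represented as a list of per-character records, each holding the recognized character code and the position of the character image. The names `RecognizedChar`, `candidates`, and `box`, and the idea of keeping a ranked candidate list per character, are assumptions made for this sketch; the source does not specify a data layout.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical box convention: (left, top, right, bottom) in image pixels.
Box = Tuple[int, int, int, int]


@dataclass(frozen=True)
class RecognizedChar:
    """One entry of a hypothetical recognition result 108."""
    candidates: Tuple[str, ...]  # candidate characters, most likely first
    box: Box                     # where the character image sits in the original

    @property
    def best(self) -> str:
        """Top-ranked candidate for this character image."""
        return self.candidates[0]


def recognized_text(result: List[RecognizedChar]) -> str:
    """Join the top candidate of every character into one string."""
    return "".join(ch.best for ch in result)


# Two recognized characters, each with a plausible look-alike as second candidate.
result_108 = [
    RecognizedChar(("文", "丈"), (0, 0, 10, 10)),
    RecognizedChar(("書", "害"), (10, 0, 20, 10)),
]
```

Keeping the box next to each character is what later allows a matched string to be mapped back to a region of the original image.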

[0018] Next, the document input person 103 enters the document while referring to the document original image displayed by the document image / character string display means 106. The document is entered using the Japanese character string input means 104, for which any of various Japanese input means equipped with a kana-kanji conversion function can be used. That is, the Japanese character string input means 104 converts the kana character string typed by the document input person into a kanji-mixed character string. Various algorithms can be used for this conversion, such as the longest match method and the minimum clause count method, and a kana-kanji conversion library from a commercial software package, such as MS-IME, can also be used. The output of the Japanese character string input means 104 is the kanji-mixed character string 109 resulting from kana-kanji conversion. The unit of kana-kanji conversion depends on how the document input person operates the system; conversion is possible in various units such as a single kanji, a word, a clause, a phrase, or a sentence.
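
To make the longest match method mentioned above concrete, the following is a minimal sketch under stated assumptions. The function name `longest_match_convert` and the tiny dictionary are hypothetical, and the sketch ignores grammar and returns only one conversion; practical converters are considerably more elaborate.

```python
from typing import Dict


def longest_match_convert(kana: str, dictionary: Dict[str, str]) -> str:
    """Greedy longest-match conversion of a kana string.

    At each position the longest dictionary reading that matches is
    replaced by its kanji spelling; unmatched kana are copied through.
    """
    max_len = max(map(len, dictionary), default=1)
    out = []
    i = 0
    while i < len(kana):
        # Try the longest possible reading first, then shorter ones.
        for length in range(min(max_len, len(kana) - i), 0, -1):
            piece = kana[i:i + length]
            if piece in dictionary:
                out.append(dictionary[piece])
                i += length
                break
        else:
            # No dictionary entry starts here: keep the kana as is.
            out.append(kana[i])
            i += 1
    return "".join(out)


# Toy dictionary (illustrative only): reading -> kanji spelling.
toy_dictionary = {"ぶんしょ": "文書", "ぶん": "文", "にゅうりょく": "入力"}
converted = longest_match_convert("ぶんしょをにゅうりょく", toy_dictionary)
```

Here the reading "ぶんしょ" wins over the shorter "ぶん" because longer readings are tried first.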

[0019] Next, the input position detecting means 105 collates the character recognition result 108 output from the character recognition means 102 with the kanji-mixed character string 109 output from the Japanese character string input means 104, and detects the character position currently being entered by the document input person 103 by searching for the character recognition result that matches the kanji-mixed character string. The input position detecting means 105 outputs the detected input position as part of information 110. The document image / character string display means 106 displays on the screen the document original image data 101, the kanji-mixed character string 109 entered by the document input person 103, and the information 110 containing the character position in the document original image that is currently being entered. Various ways of indicating the current character position are conceivable; for example, the end of the portion of the document original image data corresponding to the most recently entered kanji-mixed character string can be displayed in reverse video.
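
The search described above can be sketched as follows, assuming the recognition result is available as a list of single recognized characters with one bounding box each. The function name `find_input_position`, the `search_from` parameter, and the exact-match policy are assumptions of this sketch, not details given in the source.

```python
from typing import List, Optional, Tuple

Box = Tuple[int, int, int, int]  # hypothetical (left, top, right, bottom)


def find_input_position(
    ocr_chars: List[str],
    ocr_boxes: List[Box],
    typed: str,
    search_from: int = 0,
) -> Optional[Tuple[int, Box]]:
    """Locate the just-entered string in the recognized text.

    Returns (index just past the match, box of the last matched character),
    or None when the string is empty or not found. Each element of
    ocr_chars must be exactly one character so indices line up with boxes.
    """
    if not typed:
        return None
    text = "".join(ocr_chars)
    start = text.find(typed, search_from)
    if start < 0:
        return None
    end = start + len(typed)
    return end, ocr_boxes[end - 1]


ocr_chars = list("本発明は文書入力")
ocr_boxes = [(i * 10, 0, i * 10 + 10, 10) for i in range(len(ocr_chars))]
position = find_input_position(ocr_chars, ocr_boxes, "文書")
```

Passing the returned index back in as `search_from` on the next call is one way to keep the search moving forward through the original as typing proceeds.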

[0020] With document input according to the present invention as described with reference to FIG. 1, the document input person enters the document while keeping the line of sight on the input position displayed on the document original image, so unnecessary eye movement can be avoided. The input position detecting means 105 and the document image / character string display means 106 mentioned in the description of FIG. 1 are described in detail later with reference to FIGS. 3 and 4, respectively.

[0021] FIG. 2 shows a second embodiment of the present invention. It is a processing block diagram of a document input method and device in which homophone rearranging means is added to the document input method and device of the first embodiment shown in FIG. 1. In FIG. 2, 201 is the added homophone rearranging means. The processing of the document input method and device of this embodiment is identical to that of the first embodiment shown in FIG. 1 except for the differences described below.

[0022] The first difference is that the Japanese character string input means 104 outputs several different kanji-mixed character strings for the same reading (kana). For example, for the reading "いとう", the method of FIG. 1 outputs only the kanji string 伊藤, whereas the Japanese character string input means 104 shown in FIG. 2 outputs multiple homophone candidates such as 伊藤, 伊東, and 井藤.

[0023] The second difference is that the homophone rearranging means 201 collates the homophone candidates 109 output from the Japanese character string input means 104 with the character recognition result (character candidates) 202 output from the character recognition means 102, and reorders the homophone candidates from the most probable candidate downward based on the content ratio of characters that match the character candidates. The output of the homophone rearranging means 201 can therefore be a set of highly reliable homophone candidates.
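
The reordering step can be sketched as below. The source does not define the content ratio precisely, so this sketch assumes it is the fraction of a candidate's characters that appear among the recognition candidates at the corresponding positions of the original, and that the aligned positions are already known. The names `content_ratio` and `reorder_homophones` are hypothetical.

```python
from typing import List, Sequence


def content_ratio(candidate: str, ocr_candidates: Sequence[str]) -> float:
    """Fraction of the candidate's characters found among the OCR candidates.

    ocr_candidates[i] is a string holding every character the recognizer
    proposed for the i-th character image at the current input position.
    """
    if not candidate:
        return 0.0
    hits = sum(1 for ch, cands in zip(candidate, ocr_candidates) if ch in cands)
    return hits / len(candidate)


def reorder_homophones(homophones: List[str], ocr_candidates: Sequence[str]) -> List[str]:
    """Sort homophones by descending content ratio.

    sorted() is stable, so ties keep the order produced by the kana-kanji
    converter.
    """
    return sorted(homophones, key=lambda word: -content_ratio(word, ocr_candidates))


# Homophones for the reading "いとう" and OCR candidates for two character images.
homophones = ["伊藤", "伊東", "井藤"]
ocr_candidates = ["伊尹", "東束"]
ranked = reorder_homophones(homophones, ocr_candidates)
```

With these inputs, 伊東 matches both character images, 伊藤 matches one, and 井藤 matches none, so 伊東 moves to the front even though the recognizer was uncertain about each character.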

[0024] As described above, adding the homophone rearranging means 201 makes it possible to output more probable homophone candidates and, at the same time, to detect the input position more accurately.

[0025] FIG. 3 is a diagram for explaining in detail the configuration and processing of the input position detecting means 105 in FIG. 1.

[0026] In FIG. 3, 301 is the character recognition result within the information 108 output from the character recognition means 102, and 302 is the character recognition position information within the same information 108. 303 is the input character string, which is the output of the Japanese character string input means 104 shown in FIG. 1 or the output of the homophone rearranging means 201 shown in FIG. 2. The processing unit of this input character string is the same as the unit entered with the Japanese character string input means 104 in FIG. 1 or FIG. 2; when kana-kanji conversion is used, for example, it is a word, a clause, a sentence, or the like. 304 is the position information of the input character string, which is likewise the output of the Japanese character string input means 104 shown in FIG. 1 or of the homophone rearranging means 201 shown in FIG. 2. This position information also relates to a character string of the unit entered with the Japanese character string input means 104 in FIG. 1 or FIG. 2, and indicates position in units of, for example, words, clauses, or sentences. 305 is character position matching means, which collates the character candidates 301 (the character recognition result) with the input character string 303 and searches for a matching character string. When this matching finds a combination of character candidates and input character string that agree, the character position matching means 305 outputs information 110, which combines the position information of the corresponding character image data in the document original image data with the position information of the input character string within the set of input document character strings.
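
One possible shape for the matching performed by means 305 and for its output information 110 is sketched below. It assumes that the recognizer supplies several candidates per character image and that a typed string matches when each of its characters is among the candidates at the aligned position. `PositionInfo`, `match_position`, and the field names are hypothetical.

```python
from typing import List, NamedTuple, Optional, Sequence, Tuple

Box = Tuple[int, int, int, int]  # hypothetical (left, top, right, bottom)


class PositionInfo(NamedTuple):
    """Hypothetical stand-in for information 110."""
    image_boxes: List[Box]  # positions of the matched characters in the original image
    typed_offset: int       # position of the string within the text entered so far


def match_position(
    ocr_candidates: Sequence[str],
    ocr_boxes: Sequence[Box],
    typed: str,
    typed_offset: int,
) -> Optional[PositionInfo]:
    """Find the first run of character images whose candidates cover `typed`."""
    n = len(typed)
    if n == 0:
        return None
    for start in range(len(ocr_candidates) - n + 1):
        window = ocr_candidates[start:start + n]
        if all(ch in cands for ch, cands in zip(typed, window)):
            return PositionInfo(list(ocr_boxes[start:start + n]), typed_offset)
    return None


# Candidates per character image; the third image is ambiguous (明 or 朋).
ocr_candidates = ["本木", "発", "明朋", "は"]
ocr_boxes = [(i * 10, 0, i * 10 + 10, 10) for i in range(len(ocr_candidates))]
info_110 = match_position(ocr_candidates, ocr_boxes, "発明", typed_offset=1)
```

Matching against candidate sets rather than only the top candidate tolerates some recognition errors, at the cost of occasionally accepting a wrong location.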

[0027] FIG. 4 is a diagram for explaining in detail the configuration and processing of the document image / character string display means 106 shown in FIGS. 1 and 2.

[0028] In FIG. 4, 401 is means for superimposing the document original image data 101 and position-indicating image data based on the current input character string position information 110. 402 is means for actually displaying the document original image data and the input character strings; a general display device can be used.

[0029] The processing of the document image / character string display means 106 is described below with reference to FIG. 4. First, the superimposing means 401 receives the document original image data together with two outputs of the character position matching means 305: position information A, the position of the character string just entered within the set of input character strings, and position information B, the position within the document original image data of the character recognition result that matched the character string just entered. Next, an image that marks the character string just entered, indicated by position information A, is composited. Various methods can be used for this, such as reversing, underlining, or boxing the target character string. Then an image that marks the region of the document original image data indicated by position information B is composited. As above, methods such as reversing, underlining, or boxing the image data of the target region can be used.
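
As a small illustration of the reverse-video marking mentioned above, the following sketch inverts a rectangular region of a bilevel image stored as a list of rows of 0 and 1. The function name `invert_region`, the image representation, and the box convention (right and bottom edges exclusive) are assumptions of this sketch.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), right/bottom exclusive


def invert_region(image: List[List[int]], box: Box) -> List[List[int]]:
    """Return a copy of a 0/1 image with the pixels inside `box` inverted.

    The box is clipped to the image, and the input image is left unchanged.
    """
    left, top, right, bottom = box
    out = [row[:] for row in image]
    for y in range(max(top, 0), min(bottom, len(out))):
        for x in range(max(left, 0), min(right, len(out[y]))):
            out[y][x] = 1 - out[y][x]
    return out


blank = [[0] * 4 for _ in range(3)]
marked = invert_region(blank, (1, 0, 3, 2))
```

Because inversion is its own inverse, applying the same call again to the marked image restores the original, which is convenient when the highlighted position moves on.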

[0030] FIG. 5 is a diagram for explaining an example of how the document original image and the input character string are displayed on the screen of the display device in the above embodiments.

[0031] In FIG. 5, 106a is the screen of the display device, 501 is the document original image, 502 is the input document, 503 is the input target area in the document original image 501, and 503 is the character string position in the input document immediately after input.

[0032] In this display example, the original is displayed as is, as image 501, on the left side of the screen 106a. The document typed by the document input person is displayed on the right side of the screen 106a. The entered character string is compared internally with the character recognition result of the document original image data, and the position in the original corresponding to the character string position 503 immediately after input is displayed in the document original image 501 as the input target area 503, in a form that is easy for the document input person to see. The document input person can thus confirm the input position with minimal eye movement. Although FIG. 5 shows the document original image and the entered document side by side, they may instead be arranged one above the other.

[0033] The first candidate of the character string produced by kana-kanji conversion is displayed at the character string position 502 immediately after input, and each press of the conversion key replaces the displayed string with the next candidate, from the second candidate onward. Alternatively, the second and subsequent candidates may be displayed in a predetermined window and substituted for the displayed string by moving the cursor with the conversion key or the like, or by specifying a number. In short, raising the likelihood that the first candidate is correct reduces the number of homophone selection operations and improves input efficiency, and homophone selection errors during kana-kanji conversion are prevented even with character recognition technology at its current level.
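
The two selection styles described above, stepping through candidates with the conversion key or picking one by number, can be sketched as follows. The class `CandidateSelector`, its method names, and the wrap-around after the last candidate are assumptions of this sketch rather than behavior stated in the source.

```python
from typing import List


class CandidateSelector:
    """Tracks which conversion candidate is currently shown."""

    def __init__(self, candidates: List[str]) -> None:
        if not candidates:
            raise ValueError("at least one candidate is required")
        self._candidates = list(candidates)
        self._index = 0  # the first candidate is shown initially

    @property
    def current(self) -> str:
        return self._candidates[self._index]

    def press_convert(self) -> str:
        """Advance to the next candidate (wrapping around, by assumption)."""
        self._index = (self._index + 1) % len(self._candidates)
        return self.current

    def choose(self, number: int) -> str:
        """Select a candidate by its 1-based number, as shown in a candidate window."""
        if not 1 <= number <= len(self._candidates):
            raise IndexError("candidate number out of range")
        self._index = number - 1
        return self.current


selector = CandidateSelector(["伊東", "伊藤", "井藤"])
first_shown = selector.current
```

When the reordered list already has the correct spelling first, the document input person accepts `first_shown` without pressing the conversion key at all, which is the efficiency gain the paragraph above describes.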

[0034]

[Effects of the Invention] As described in detail above, according to the present invention the document input person can confirm the input position with minimal eye movement, and homophone selection errors that occur during kana-kanji conversion can be prevented even with character recognition technology at its current level. The invention thus has the notable effect of enabling efficient document input in keeping with the ability of the document input person.

[Brief description of the drawings]

FIG. 1 is a processing block diagram for explaining the configuration and processing of a document input method and device according to a first embodiment of the present invention.

FIG. 2 is a processing block diagram for explaining the configuration and processing of a document input method and device according to a second embodiment of the present invention, in which homophone rearranging means is added to the configuration shown in FIG. 1.

FIG. 3 is a processing block diagram for explaining in detail the configuration and processing of the input position detecting means in the above embodiments.

FIG. 4 is a processing block diagram for explaining in detail the configuration and processing of the document image / character string display means in the above embodiments.

FIG. 5 is a diagram for explaining an example of how the document original image and the input character string are displayed on the screen of the display device in the above embodiments.

[Explanation of symbols]

101: Document original image data
102: Character recognition means
103: Document input person
104: Japanese character string input means
105: Input position detecting means
106: Document image / character string display means
201: Homophone rearranging means
202, 301: Character recognition result
302: Character recognition position information
303: Input character string
304: Position information of the input character string
305: Character position matching means
401: Display data superimposing means
402: Display means

Continued from the front page: (72) Inventor: Masami Oguro, c/o Nippon Telegraph and Telephone Corporation, 3-19-2 Nishi-Shinjuku, Shinjuku-ku, Tokyo

Claims

[Claims]

1. A document input method using character recognition technology, comprising: displaying image data of an input target document on a screen; performing character recognition on characters in the input target document; collating a character code string entered by a document input person with the character code string resulting from the character recognition to search for a matching character code string; detecting the position, in the image data of the input target document, that corresponds to the character code string matched by the search; and displaying the detected position on the screen.

2. The document input method according to claim 1, wherein kana-kanji conversion is performed on the character code string entered by the document input person to obtain one or more homophone candidates, the character code string resulting from the character recognition is collated with the homophone candidates, and the obtained homophone candidates are reordered from the most probable candidate downward based on the content ratio of matching character candidates.

3. A document input device using character recognition technology, comprising: means for displaying image data of an input target document on a screen; means for performing character recognition on characters in the input target document; means for collating a character code string entered by a document input person with the character code string resulting from the character recognition; means for searching for a matching character code string as a result of the collation; means for detecting the position, in the image data of the input target document, that corresponds to the character code string matched by the search; and means for displaying the detected position on the screen.

4. The document input device according to claim 3, further comprising: means for performing kana-kanji conversion on the character code string entered by the document input person to obtain one or more homophone candidates; means for collating the character code string resulting from the character recognition with the obtained homophone candidates; and means for reordering the obtained homophone candidates from the most probable candidate downward based on the content ratio of the character candidates found to match by the collation.