JPH09245121A

JPH09245121A - Character recognition device

Info

Publication number: JPH09245121A
Application number: JP8048624A
Authority: JP
Inventors: Taminori Tomita; 民則冨田; Shunichi Tano; 俊一田野; Hideki Sakao; 秀樹坂尾; Yasuharu Nanba; 康晴難波; Hirokazu Aoshima; 弘和青島
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1996-03-06
Filing date: 1996-03-06
Publication date: 1997-09-19

Abstract

PROBLEM TO BE SOLVED: To efficiently recognize a character by segmenting a series of strokes by the unit of a single character, recognizing them by each segmenting pattern to output a candidate character and comparing the recognizing result after character segmentation and each newest recognizing result with each other to detect a difference. SOLUTION: A character segmenting means 4 decides a character segmenting pattern by collecting inputted stroke data by each strokes of one character constitution. A comparing means 6 compares a past character segmenting pattern and a new character segmenting pattern to detect the difference and a character recognition means 7 recognizes stroke data for one character. A comparing means 9 compares a past candidate character and a new candidate character to detect a difference. A word recognizing means 10 decides a proper word candidate by collating a word obtained by combining the candidate characters with a word dictionary 11. A comparing means 13 compares a past candidate word and a new candidate word to detect a difference. A grammar recognition means 14 decides a suited word string and a deciding means 17 decides a proper recognizing result and output it.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は手書き文字を読み取
るための文字認識技術に関わり、特に次々とストローク
が入力されるオンライン手書き文字認識処理を効果的に
行う文字認識技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition technique for reading handwritten characters, and more particularly to a character recognition technique for effectively performing online handwritten character recognition processing in which strokes are successively input.

【０００２】[0002]

【従来の技術】従来のオンライン手書き文字認識装置
は、例えば図１２に示したようなものであった。その処
理の内容は、以下の通りである。まず、処理の第一段階
として手書きストロークがストローク入力手段１２０１
によって入力される。第二段階として文字切り出し手段
１２０２、例えば特公平６−９０７４１号公報に示され
るような方法により一文字分のストローク集合が文字と
して切り出される。次に、第三段階では、文字切り出し
されたストローク集合が、個別文字認識手段１２０３に
入力され文字として認識される。2. Description of the Related Art A conventional on-line handwritten character recognition device is, for example, as shown in FIG. The contents of the processing are as follows. First, as the first step of the process, the handwriting stroke is stroke input means 1201.
Is entered by As a second step, a character cutout unit 1202, for example, a method as disclosed in Japanese Patent Publication No. 6-90741, cuts out a stroke set for one character as a character. Next, in the third step, the character set strokes are input to the individual character recognition unit 1203 and recognized as characters.

【０００３】このとき、個々の文字認識処理の結果とし
ては複数の候補文字とその候補文字の確からしさを表す
認識評価値が得られる。この結果を利用して候補文字の
絞り込みが可能である。At this time, a plurality of candidate characters and a recognition evaluation value representing the certainty of the candidate characters are obtained as a result of the individual character recognition processing. Using this result, it is possible to narrow down the candidate characters.

【０００４】その後さらに認識率の向上のため必要に応
じて、第四段階として単語認識１２０４、第五段階とし
て文法認識１２０５等の言語的制約や背景知識を利用し
て候補文字を決定する後処理が行われるといったように
階層的に処理がおこなわれるものであった。Thereafter, in order to further improve the recognition rate, post-processing for determining a candidate character using linguistic restrictions such as word recognition 1204 as a fourth step and grammar recognition 1205 as a fifth step and background knowledge. Processing was performed in a hierarchical manner such as.

【０００５】[0005]

【発明が解決しようとする課題】しかし、上記従来のオ
ンライン手書き文字認識装置では、以下の問題点があ
る。入力した手書きストロークを予め決められた方法で
一文字分に切り出した後は、新たなストロークの追加入
力による文字切り出しの変更は不可能である。また、文
字切り出しの変化に伴う認識結果の動的な更新、変更は
困難であった。さらに一連の単語認識、文法認識等の後
処理は、部分的な認識結果の変化を反映する。このため
に、全ての候補文字再度入力して処理をやり直す必要が
あったため効率的ではなかった。However, the above-mentioned conventional online handwritten character recognition device has the following problems. After cutting out the input handwriting stroke into one character by a predetermined method, it is impossible to change the character cutout by additionally inputting a new stroke. Moreover, it is difficult to dynamically update or change the recognition result due to the change in the character segmentation. Further, a series of post-processing such as word recognition and grammar recognition reflects a partial change in the recognition result. For this reason, it was not efficient because it was necessary to input all candidate characters again and redo the processing.

【０００６】本発明は、上記の問題点を鑑み、新たに入
力されたストロークを動的に反映し、かつ後処理を含め
た認識を効率的に行うことが可能な文字認識装置を提供
することを目的としている。In view of the above problems, the present invention provides a character recognition device which can dynamically reflect a newly input stroke and efficiently perform recognition including post-processing. It is an object.

【０００７】[0007]

【課題を解決するための手段】本発明は、上記目的を達
成するために以下の構成を採用した。ストローク入力手
段と、入力された一連のストロークを一文字を構成する
単位に切り出す文字切り出し手段と、切り出された文字
切り出しパターン毎に認識を行い候補文字を出力する個
別文字認識手段と、文字切り出し、文字認識、単語認
識、文法認識等の後処理の各処理完了後の認識結果を一
時保存する手段と、一時保存された各処理結果から最適
な認識結果を求め出力する認識結果出力手段と、一時保
存された各段階の処理結果と最新の各段階の認識結果を
比較して差分を検出する手段を備えたことを特徴とする
ものである。The present invention has the following features to attain the object mentioned above. Stroke input means, character cutout means for cutting out a series of input strokes into units that form one character, individual character recognition means for recognizing each cut out character cutout pattern and outputting a candidate character, character cutout, character A means for temporarily storing the recognition result after completion of each processing of post-processing such as recognition, word recognition, grammar recognition, a recognition result output means for obtaining and outputting an optimum recognition result from each temporarily saved processing result, and a temporary storage The present invention is characterized by comprising means for comparing the processed result of each stage and the latest recognition result of each stage to detect a difference.

【０００８】[0008]

【発明の実施の形態】以下、図面を用いて本発明の実施
例を説明する。図１は本発明の文字認識装置の一実施例
を示す概略構成図である。同図に示す装置は、１〜１７
の構成要件から構成される。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a schematic configuration diagram showing an embodiment of a character recognition device of the present invention. The device shown in FIG.
It consists of the configuration requirements of.

【０００９】まず、ストローク入力手段１は、手書き文
字ストロークデータを入力する。ストローク記憶手段３
は、ストローク入力手段１から新規に入力された手書き
文字ストロークデータを逐次追加記憶し、過去に入力さ
れたストロークとともに記憶する。ストローク比較手段
２は、ストローク入力手段１で新規に入力されたストロ
ークとストローク記憶手段に記憶されたストロークとを
比較する。文字切り出し手段４は、ストローク入力手段
１で入力された一連のストロークデータを一文字を構成
するストローク毎にまとめ、文字切り出しパターンを決
定する。文字切り出しパターン記憶手段５は、文字切り
出し手段で決定された文字切り出しパターンを逐次記憶
する。切り出しパターン比較手段６は、文字切り出しパ
ターン記憶手段５から得られる過去の文字切り出しパタ
ーンと、文字切り出し手段４から得られる新規文字切り
出しパターンとを比較し差分を検出する。個別文字認識
手段７は、文字切り出しパターンとして切り出された一
文字分のストロークデータを認識して候補文字を出力す
る。First, the stroke input means 1 inputs handwritten character stroke data. Stroke storage means 3
Stores sequentially the handwritten character stroke data newly input from the stroke input means 1 and stores it together with the strokes input in the past. The stroke comparison means 2 compares the stroke newly input by the stroke input means 1 with the stroke stored in the stroke storage means. The character cutout unit 4 collects a series of stroke data input by the stroke input unit 1 for each stroke forming one character, and determines a character cutout pattern. The character cutout pattern storage means 5 sequentially stores the character cutout patterns determined by the character cutout means. The cut-out pattern comparison means 6 compares the past character cut-out pattern obtained from the character cut-out pattern storage means 5 with the new character cut-out pattern obtained from the character cut-out means 4 to detect a difference. The individual character recognition unit 7 recognizes the stroke data for one character cut out as a character cutout pattern and outputs a candidate character.

【００１０】候補文字記憶手段８は、個別文字認識手段
７で出力された候補文字を逐次記憶する。候補文字比較
手段９は、候補文字記憶手段８から得られる過去の候補
文字と、個別文字認識手段７から得られる新規候補文字
とを比較し差分を検出する。単語認識手段１０は、候補
文字比較手段での出力に基づいて、各候補文字を組み合
わせて構成される単語を単語辞書１１と照合して妥当と
思われる単語候補を決定する。単語認識手段１０で決定
された候補単語を逐次記憶する。候補単語比較手段１３
は、候補単語記憶手段１２から得られる過去の候補単語
と、単語認識手段１０から得られる新規候補単語とを比
較し差分を検出する。文法認識手段１４は、単語の接続
規則を記述した文法辞書１５と照合して接続が文法的に
妥当と思われる文法適合単語列を決定する。文法適合単
語列記憶手段１６は、文法適合単語列を逐次記憶する。
認識結果決定手段１７は、候補文字記憶手段、候補単語
記憶手段８、文法適合単語列記憶手段１６の内容から最
も妥当と予想される認識結果を決定し、認識結果として
出力する。The candidate character storage means 8 sequentially stores the candidate characters output by the individual character recognition means 7. The candidate character comparison means 9 compares the past candidate character obtained from the candidate character storage means 8 with the new candidate character obtained from the individual character recognition means 7 to detect a difference. The word recognizing means 10 determines a word candidate considered to be appropriate by collating a word formed by combining the candidate characters with the word dictionary 11 based on the output from the candidate character comparing means. The candidate words determined by the word recognition means 10 are sequentially stored. Candidate word comparison means 13
Detects the difference by comparing the past candidate word obtained from the candidate word storage means 12 and the new candidate word obtained from the word recognition means 10. The grammar recognition means 14 determines a grammatical matching word string in which the connection is considered to be grammatically appropriate by collating with the grammar dictionary 15 describing the connection rules of words. The grammatical matching word string storage unit 16 sequentially stores the grammar matching word string.
The recognition result determining means 17 determines the most appropriate recognition result from the contents of the candidate character storage means, the candidate word storage means 8 and the grammatical matching word string storage means 16 and outputs it as the recognition result.

【００１１】次に上記各手段の詳細を説明する。ストロ
ーク入力手段１は、ペンで示す座標を検知可能なタブレ
ットであり、ペンによる筆跡を文字ストロークデータ
（以後ストロークデータと呼ぶ）として検出するもので
ある。Next, the details of each of the above means will be described. The stroke input means 1 is a tablet capable of detecting coordinates indicated by a pen, and detects handwriting by the pen as character stroke data (hereinafter referred to as stroke data).

【００１２】ストローク比較手段３は、ストローク入力
手段１から新規に入力された最新ストロークデータと、
ストローク記憶手段３にある過去に入力されたストロー
クデータを参照し、最新ストロークから距離Ｌの範囲内
に存在するストロークを抽出する。そして最新のストロ
ークと抽出したストロークを文字切り出し手段４に出力
する。なお、Ｌは文字間隔、文字の大きさ等を考慮して
決定するが０〜無限大の値をとることが可能である。図
２にこの例を示す。「今日は」という文字が手書きで入
力され、ストロークが順に２０１〜２１１まで入力され
たとする。ここで、新たに最新ストローク２１１が入力
された時、距離Ｌの範囲に存在するストロークは２０
６、２０７、２０８、２０９、２１０ということにな
り、文字切り出し手段にはストローク２０６〜２１０と
最新ストロークである２１１が送られる。また、ストロ
ーク抽出終了後、最新ストロークはストローク記憶手段
３に過去のストロークに追加して記憶される。The stroke comparison means 3 has the latest stroke data newly input from the stroke input means 1, and
The stroke data input in the past in the stroke storage means 3 is referred to, and strokes existing within a distance L from the latest stroke are extracted. Then, the latest stroke and the extracted stroke are output to the character cutting means 4. Although L is determined in consideration of the character spacing, the character size, etc., it can take a value of 0 to infinity. This example is shown in FIG. It is assumed that the characters “today” are manually input and strokes 201 to 211 are sequentially input. Here, when the newest stroke 211 is newly input, there are 20 strokes existing in the range of the distance L.
6, 207, 208, 209, and 210, and strokes 206 to 210 and the latest stroke 211 are sent to the character cutting means. Also, after the stroke extraction is completed, the latest stroke is stored in the stroke storage means 3 in addition to the past stroke.

【００１３】また、ストロークの抽出方法はストローク
間距離に限らず、ストローク入力時間差や、ストローク
本数によっても良い。The method of extracting strokes is not limited to the distance between strokes, but may be the stroke input time difference or the number of strokes.

【００１４】図２の例では、最新ストロークが入力され
た時間からさかのぼってある一定時間、例えば１００ｍ
ｓｅｃの時間内に入力されたストロークを抽出すること
にすれば、２０６から２１０が該当することになり、直
前に入力された一定本数のストローク、例えば７本のス
トロークを抽出することにすれば、２０４から２１０が
該当ストロークとなる。また、最新ストロークに近い一
定本数のストローク、例えば３本のストロークを抽出す
ることにすれば、近い順に２１０、２０９、２０８が抽
出される。In the example of FIG. 2, a certain period of time, for example, 100 m, is traced back from the time when the latest stroke is input.
If the strokes input within the time of sec are to be extracted, 206 to 210 are applicable, and if a predetermined number of strokes input immediately before, for example, 7 strokes are extracted, The corresponding stroke is from 204 to 210. Further, if a certain number of strokes close to the latest stroke, for example, three strokes are extracted, 210, 209, and 208 are extracted in the order of closeness.

【００１５】文字切り出し手段４は、図４に示す手順で
一連のストロークデータを一文字を構成すると予想され
るストローク集合に切り出す。たとえば図３に示す手書
き文字「川崎」（ストロークデータ３０１〜３１４）の
ストローク列が入力されたとする。このストロークデー
タの切り出しを例に文字切り出し処理を考える。まず、
図５の３０１ａ〜３１４ａで示すように、ストローク列
入力後（４０１）、各ストロークの外接矩形をもとめる
（４０２）。The character cut-out means 4 cuts out a series of stroke data into a stroke set expected to constitute one character by the procedure shown in FIG. For example, it is assumed that the stroke sequence of the handwritten character "Kawasaki" (stroke data 301 to 314) shown in FIG. 3 is input. Let us consider character cutout processing by taking this cutout of stroke data as an example. First,
As shown by 301a to 314a in FIG. 5, after the stroke sequence is input (401), the circumscribed rectangle of each stroke is obtained (402).

【００１６】次に互いに重なりを持つ外接矩形同士を統
合し統合矩形を求める（４０３）。例では外接矩形３０
４ａ〜３０６ａ、３０７ａ〜３０９ａ、３１０ａ〜３１
４ａがそれぞれ統合され順に３０４ｂ、３０５ｂ、３０
６ｂが生成される。Next, circumscribing rectangles that overlap each other are integrated to obtain an integrated rectangle (403). In the example, circumscribed rectangle 30
4a to 306a, 307a to 309a, 310a to 31
4a are integrated respectively in order of 304b, 305b, 30
6b is generated.

【００１７】次に、垂直方向に重なる部分がある矩形を
統合する（４０４）。ただし、これは横書きの場合で、
縦書きの時は水平方向の重なりを検証する。例では３０
５ｂ、３０６ｂが統合され３０４ｃが生成される。Next, the rectangles having overlapping portions in the vertical direction are integrated (404). However, this is the case of horizontal writing,
When writing vertically, verify the horizontal overlap. 30 in the example
5b and 306b are integrated to generate 304c.

【００１８】次に、水平方向に分離された矩形３０１
ａ、３０２ａ、３０３ａ、３０４ｂ、３０５ｃについ
て、その間隔が閾値以下の場合は統合する（４０５）。
閾値の決定は図６に示すように行う。統合を検討する２
つの矩形、高さＨ1幅Ｗ1である左側の矩形１（６０
１）、高さＨ2幅Ｗ2である右側の矩形２（６０２）に関
して、その間隔Ｄ12が、条件（Ｄ12＋Ｗ1）＜（ａ・Ｈ
1）、を満たす時統合することにする。ただしａは文字
間隔等から決定される任意定数である。この処理を入力
された順に実行し矩形統合する。例えばａ＝１としたと
き、まず矩形１（６０１）と矩形２（６０２）は、（Ｄ
12＋Ｗ1）＜Ｈ1となるので統合され矩形４（６０４）が
生成される。次に矩形４（６０４）と矩形３（６０３）
について判定を行うが、（Ｄ23＋Ｗ4）＞Ｈ2となるので
統合しない。最後に、統合された矩形に含まれる全ての
ストロークを一文字分のデータとして認識出来るように
切り出しパターンを作成する。Next, a rectangle 301 separated in the horizontal direction
If the intervals of a, 302a, 303a, 304b, and 305c are less than or equal to the threshold, they are integrated (405).
The threshold is determined as shown in FIG. Consider integration 2
Left rectangle 1 (60 in height, H1 in width, W1 in width)
1), with respect to the right side rectangle 2 (602) having the height H2 and the width W2, the distance D12 is set to the condition (D12 + W1) <(a · H
1), will be integrated when satisfying. However, a is an arbitrary constant determined from the character spacing or the like. This process is executed in the order of input and rectangle integration is performed. For example, when a = 1, the rectangle 1 (601) and the rectangle 2 (602) are (D
Since 12 + W1) <H1, the rectangle 4 (604) is integrated and generated. Next, rectangle 4 (604) and rectangle 3 (603)
However, since (D23 + W4)> H2, it is not integrated. Finally, a cutout pattern is created so that all strokes included in the integrated rectangle can be recognized as data for one character.

【００１９】図７に切り出しパターンデータの例を示
す。切り出しパターンデータは、文字として切り出した
パターンのＩＤ、開始ストロークＩＤ、終了ストローク
ＩＤを情報として持つ。図７の例で文字切り出しＩＤ番
号１として与えられる切り出しパターンは、ストローク
３０１、３０２、３０３の３個のストロークから構成さ
れ、同様に文字切り出しパターン２はストローク３０４
〜３１４の１１個のストロークから構成されることを示
す。最後に、生成した切り出しパターンを切り出しパタ
ーン比較手段６に出力する（４０６）。FIG. 7 shows an example of cutout pattern data. The cut-out pattern data has the ID of the pattern cut out as a character, the start stroke ID, and the end stroke ID as information. In the example of FIG. 7, the cutout pattern given as the character cutout ID number 1 is composed of three strokes 301, 302, and 303. Similarly, the character cutout pattern 2 is stroke 304.
It shows that it is comprised from 11 strokes of -314. Finally, the generated cutout pattern is output to the cutout pattern comparison unit 6 (406).

【００２０】本実施例における切り出しパターンデータ
は一文字に含まれるストロークＩＤが連続していること
を想定して開始ストロークと終了ストロークのＩＤの情
報を有するが、一文字を構成する全てのストロークＩＤ
情報を有するデータであっても良い。なお、切り出し方
法は本実施例に限定されず、例えばストロークの輪郭追
跡によってストロークの統合を判定しても良い。The cut-out pattern data in this embodiment has information on the start stroke and end stroke IDs on the assumption that the stroke IDs contained in one character are continuous, but all stroke IDs that form one character.
It may be data having information. The cutout method is not limited to this embodiment, and stroke integration may be determined by stroke contour tracking, for example.

【００２１】切り出しパターン比較手段６は、文字切り
出し手段４で新規に切り出された最新文字切り出しパタ
ーンと、切り出しパターン記憶手段５に記憶されている
過去に切り出された文字切り出しパターンを比較し、過
去の文字切り出しパターンにない新しい文字切り出しパ
ターンが最新文字切り出しパターンのなかに存在した
ら、その切り出しパターンを出力する。図８にその例を
示す。過去に切り出された文字切り出しパターン８０
１、８０２、８０３、８０４が存在したとする。この
時、最新文字切り出しパターンとして、ストローク５か
ら８によって構成されている８０５、ストローク９から
１４によって構成されている８０６が切り出されたとす
ると、切り出しパターン比較手段６は切り出しパターン
８０６を候補文字認識手段７に出力する。切り出しパタ
ーン８０５は過去の切り出しパターン８０３と同じであ
るため出力されない。また、切り出しパターン比較手段
６の出力した文字切り出しパターンは文字切り出しパタ
ーン記憶手段５に既に記憶された切り出しパターンに追
加して記憶される。The cut-out pattern comparison means 6 compares the latest character cut-out pattern newly cut out by the character cut-out means 4 with the character cut-out pattern cut out in the past stored in the cut-out pattern storage means 5 to determine the past. If a new character cutout pattern that is not in the character cutout pattern exists in the latest character cutout pattern, the cutout pattern is output. FIG. 8 shows an example thereof. Character cutout pattern 80 cut out in the past
It is assumed that 1, 802, 803, and 804 exist. At this time, if 805 composed of strokes 5 to 8 and 806 composed of strokes 9 to 14 are cut out as the latest character cut-out pattern, the cut-out pattern comparison means 6 selects the cut-out pattern 806 as the candidate character recognition means. Output to 7. The cutout pattern 805 is the same as the cutout pattern 803 in the past and is not output. The character cutout pattern output by the cutout pattern comparison unit 6 is stored in addition to the cutout pattern already stored in the character cutout pattern storage unit 5.

【００２２】個別文字認識手段７は、切り出しパターン
比較手段６が出力する文字切り出しパターンを認識して
候補文字を出力する。手書き文字の認識方法は、例えば
認識辞書に格納されている辞書ストロークパターンと入
力ストロークパターンとの照合をパターンマッチング法
によって行う。パターンマッチング法では、辞書ストロ
ークパターンと入力ストロークパターンとのパターン間
距離を、対応するストローク間距離の総和として求め
る。したがってパターン間距離が小さいものほど、より
その文字に類似していると評価される。認識結果として
はパターン間距離が小さい順に、第１位候補から第ｎ位
候補を出力するものである。ただしｎは１以上の整数と
する。パターン間距離は認識評価値として候補文字とと
もに出力される。なお、本実施例における認識方法はパ
ターンマッチング法に限定されず、順位付けされた複数
の認識候補文字が出力されるものであれば、どのような
認識方法を用いてもよい。The individual character recognition means 7 recognizes the character cutout pattern output by the cutout pattern comparison means 6 and outputs a candidate character. As a method of recognizing handwritten characters, for example, a dictionary stroke pattern stored in a recognition dictionary is compared with an input stroke pattern by a pattern matching method. In the pattern matching method, the inter-pattern distance between the dictionary stroke pattern and the input stroke pattern is obtained as the sum of the corresponding inter-stroke distances. Therefore, the smaller the pattern-to-pattern distance, the more similar the character is evaluated. As a recognition result, the first to n-th candidates are output in the order of smaller inter-pattern distance. Here, n is an integer of 1 or more. The distance between patterns is output together with the candidate character as a recognition evaluation value. The recognition method in this embodiment is not limited to the pattern matching method, and any recognition method may be used as long as a plurality of ranked recognition candidate characters are output.

【００２３】候補文字は最新の認識候補文字データとし
て候補文字比較手段９に出力される。図９に認識候補文
字データの例をしめす。図に示すように認識候補文字デ
ータは候補毎に独自の番号である候補ＩＤ番号９０１
と、その候補文字の文字切り出しパターンの開始ストロ
ークＩＤ９０２、終了ストロークＩＤ９０３、候補文字
コード９０４、候補文字評価値９０５から構成される。
したがって、第ｎ位候補まで出力する場合は同じ開始ス
トロークＩＤ、終了ストロークＩＤを持ったｎ個の認識
候補文字データが存在することになる。図９ではｎ＝２
のため同じストローク範囲である候補が２つづつ存在す
る。The candidate character is output to the candidate character comparing means 9 as the latest recognition candidate character data. FIG. 9 shows an example of recognition candidate character data. As shown in the figure, the recognition candidate character data is a candidate ID number 901 that is a unique number for each candidate.
And a start stroke ID 902, an end stroke ID 903, a candidate character code 904, and a candidate character evaluation value 905 of the character cutout pattern of the candidate character.
Therefore, when outputting up to the nth candidate, there are n pieces of recognition candidate character data having the same start stroke ID and end stroke ID. In FIG. 9, n = 2
Therefore, there are two candidates each having the same stroke range.

【００２４】候補文字比較手段９は、個別文字認識手段
７の出力する最新認識候補文字データと候補文字記憶手
段８に記憶されている過去に認識された認識候補文字デ
ータを参照して、最新認識候補文字の直前に位置するｍ
文字分の認識候補文字を選択し、最新認識候補文字デー
タとともに単語認識手段１０に出力する。The candidate character comparing means 9 refers to the latest recognition candidate character data output from the individual character recognizing means 7 and the recognition candidate character data recognized in the past stored in the candidate character storing means 8 to refer to the latest recognition. M located just before the candidate character
The character recognition candidate characters are selected and output to the word recognition means 10 together with the latest recognition candidate character data.

【００２５】直前のｍ文字は認識候補文字データの開始
ストロークＩＤ及び終了ストロークＩＤを参照して探す
ことができる。なお、選択文字個数ｍは、単語辞書１１
に登録されている見出し単語の最長の文字数から１を引
いた数にしておく。その後、最新認識候補文字データは
候補文字記憶手段８に既に認識された認識候補文字デー
タに追加して記憶される。The immediately preceding m characters can be searched by referring to the start stroke ID and end stroke ID of the recognition candidate character data. Note that the number of selected characters m is the word dictionary 11
1 is subtracted from the longest number of characters of the headword registered in. After that, the latest recognition candidate character data is added to the recognition candidate character data already recognized and stored in the candidate character storage means 8.

【００２６】たとえば、図９の例で最新認識候補文字と
してＩＤ番号２６４４１の候補文字「識」と２６４４２
の候補文字「織」が存在し、残りの候補単語が候補文字
記憶手段に記憶されてたとする。この時単語辞書の最長
見出し単語長が仮に２だとすると、ｍ＝１となり、最新
認識候補文字に接続する候補文字は終了ストロークＩＤ
番号が２５である候補ＩＤ番号１０２５１の「語」、１
０２５２「認」、１８２５１「忍」、１８２５２「思」
の４つの候補文字が最新候補文字とともに単語認識手段
１０に出力される。For example, in the example of FIG. 9, the candidate characters "knowledge" and ID 26442 with ID number 26441 are the latest recognition candidate characters.
It is assumed that the candidate character "Ori" exists and the remaining candidate words are stored in the candidate character storage means. At this time, if the longest heading word length of the word dictionary is 2, m = 1, and the candidate character connected to the latest recognition candidate character is the end stroke ID.
"Word" of candidate ID number 10251 whose number is 25, 1
0252 "Accepted", 18251 "Shinobi", 18252 "Thought"
The four candidate characters are output to the word recognition means 10 together with the latest candidate character.

【００２７】単語認識手段１０は、候補文字比較手段９
が出力する候補文字と単語辞書１１に格納されている見
出し単語を照らし合わせて、少なくとも最新認識候補文
字を含み、かつ残りの認識候補文字の組み合わせと一致
する候補単語を検索する。The word recognition means 10 is a candidate character comparison means 9
By comparing the candidate characters output by the above with the headwords stored in the word dictionary 11, a candidate word that includes at least the latest recognition candidate character and that matches the remaining combination of recognition candidate characters is searched.

【００２８】図９の例では、「語識」「認識」「忍識」
「思識」「語織」「認織」「忍織」「思織」の中から
「認識」が候補単語として単語辞書１１に存在すること
になる。In the example of FIG. 9, "word knowledge", "recognition", and "intellect"
"Recognition" is present as a candidate word in the word dictionary 11 from among "thought", "word weave", "recognition weave", "ninori", and "shiori".

【００２９】候補単語は、最新認識候補単語データとし
て候補単語比較手段１３に出力される。図１０に認識候
補単語データの例を示す。図に示すように認識候補単語
データは候補毎に独自の番号となる候補ＩＤ番号１００
１、その単語を構成する開始ストロークＩＤ番号１００
２、及び終了ストロークＩＤ番号１００３、単語文字列
１００４、評価値１００５、から構成される。候補単語
の評価値は、構成する各候補文字の個別の評価値の和よ
りも良い評価が得られるように調整する。例えば、評価
値の値が小さい方がより良い評価となる場合には、定数
α（０＜α＜１）を乗するといった調整方法が考えられ
る。たとえば図１０の候補ＩＤ番号１０４４１の「認
識」はα＝0.8とすると、評価値は４４０になる。これ
は図９の単語候補１０２５２「認」と２６４４１「識」
の評価値を個別に加算した値５５０より良い評価値であ
り、単語として認識されたことによってその文字の評価
が向上することを反映している。The candidate word is output to the candidate word comparison means 13 as the latest recognition candidate word data. FIG. 10 shows an example of recognition candidate word data. As shown in the figure, the recognition candidate word data has a candidate ID number 100, which is a unique number for each candidate.
1. Start stroke ID number 100 that composes the word
2, end stroke ID number 1003, word character string 1004, and evaluation value 1005. The evaluation value of the candidate word is adjusted so that a better evaluation can be obtained than the sum of the individual evaluation values of each of the constituent candidate characters. For example, when the smaller the evaluation value is, the better the evaluation is, an adjustment method of multiplying by a constant α (0 <α <1) can be considered. For example, assuming that “recognition” of the candidate ID number 10441 in FIG. 10 is α = 0.8, the evaluation value is 440. This is the word candidates 10252 "Kan" and 26441 "Kin" in FIG.
It is a better evaluation value than the value 550 obtained by individually adding the evaluation values of, and reflects that the evaluation of the character is improved by being recognized as a word.

【００３０】候補単語比較手段１３は、単語認識手段１
０の出力する最新認識候補単語データと候補単語記憶手
段１２に記憶されている過去に認識された既認識候補単
語データを比較参照して、最新認識単語と連接している
既認識単語を検索する。最新認識候補単語データとそれ
に連接すると判明した既認識候補単語データは、文法認
識手段１４に出力される。The candidate word comparison means 13 is the word recognition means 1
The latest recognized candidate word data output by 0 and the previously recognized already recognized candidate word data stored in the candidate word storage unit 12 are compared and referenced to search for the already recognized word connected to the latest recognized word. . The latest recognition candidate word data and the already recognized candidate word data found to be connected to the latest recognition candidate word data are output to the grammar recognition means 14.

【００３１】文法認識手段１４は、候補単語比較手段１
３が出力した最新認識候補単語と既認識候補単語のう
ち、文法辞書１５に格納されている単語の接続規則に適
合するものを認識する。接続規則に適合する単語列は、
文法適合単語候補列として文法適合単語列記憶手段１６
で記憶される。文法適合単語候補列は、認識単語候補デ
ータと同様に、候補ＩＤ番号、その単語列を構成する開
始ストロークＩＤ番号、及び終了ストロークＩＤ番号、
単語列文字列、評価値、から構成される。候補単語列の
評価値は、構成する各候補単語の個別の評価値の和より
も良い評価が得られるように調整する。例えば、評価値
の値が小さい方がより良い評価となる場合には、定数β
（０＜β＜１）を乗するといった調整方法が考えられ
る。The grammar recognition means 14 is the candidate word comparison means 1
Of the latest recognition candidate word and the already recognized candidate word output by 3 which correspond to the connection rules of the words stored in the grammar dictionary 15 are recognized. The word string that matches the connection rule is
As a grammar matching word candidate string, a grammar matching word string storage unit 16
Is stored. Similar to the recognized word candidate data, the grammatical matching word candidate string has a candidate ID number, a start stroke ID number and an end stroke ID number that form the word string,
It consists of word string character string and evaluation value. The evaluation value of the candidate word string is adjusted so that a better evaluation can be obtained than the sum of the individual evaluation values of each of the constituent candidate words. For example, if a smaller evaluation value gives a better evaluation, the constant β
An adjustment method of multiplying (0 <β <1) can be considered.

【００３２】認識結果決定手段１７は、候補文字記憶手
段８、候補単語記憶手段１２、文法適合単語列記憶手段
１６を参照して、入力ストロークを全てカバーし、かつ
評価値に基づいて認識結果を出力する。例えば、評価値
の和が最小であるような認識結果を候補の中から選択
し、認識結果として出力する。その他、評価値の積を用
いてもよい。The recognition result determining means 17 refers to the candidate character storage means 8, the candidate word storage means 12, and the grammar matching word string storage means 16 to cover all the input strokes and to obtain the recognition result based on the evaluation value. Output. For example, a recognition result having the smallest sum of evaluation values is selected from the candidates and is output as the recognition result. Alternatively, the product of evaluation values may be used.

【００３３】図１１を用いて図３に示すストローク列が
順に入力される際の本実施例装置の実際の動作を説明す
る。既入力ストローク欄１１０１は既に入力されたスト
ロークを図示する。新規ストローク欄１１０２は新規に
入力されたストロークとそのＩＤ番号（図３と共通）
を、新規切り出しパターン欄１１０３には、新規に切り
出された切り出しパターンの開始ストロークＩＤと終了
ストロークＩＤを、候補文字欄１１０４には新規切り出
しパターンの認識結果（第２位候補まで）とその評価値
を、候補単語欄１１０５には候補単語として認識された
単語とその評価値を、認識結果欄１１０６にはその時点
までに入力されたストロークを認識した最終的な結果と
その評価値をそれぞれ表記する。以下順に説明する。The actual operation of the apparatus of this embodiment when the stroke sequences shown in FIG. 3 are sequentially input will be described with reference to FIG. The already-input stroke column 1101 illustrates strokes that have already been input. The new stroke column 1102 shows a newly input stroke and its ID number (common to FIG. 3).
In the new cut-out pattern field 1103, the start stroke ID and end stroke ID of the newly cut out cut-out pattern, and in the candidate character field 1104, the recognition result (up to the second-ranked candidate) of the new cut-out pattern and its evaluation value. In the candidate word column 1105, the word recognized as a candidate word and its evaluation value are described, and in the recognition result column 1106, the final result of recognizing the stroke input up to that point and its evaluation value are described. . The description will be made in the following order.

【００３４】１：ストローク３０１入力このとき既入力ストロークは存在しないため、ストロー
ク３０１だけで構成される切り出しパターンが新規に切
り出され認識される。認識結果は「ノ」。1: Input of Stroke 301 At this time, since there is no input stroke, a cutout pattern composed only of the stroke 301 is newly cut out and recognized. The recognition result is "No".

【００３５】２：ストローク３０２入力ストローク３０１と３０２は一つの文字として切り出さ
れ、新規切り出しパターンとなる。認識結果は現時点で
入力されたストロークをカバーする候補で最も評価値の
良い「イ」が選択される。2: Stroke 302 input The strokes 301 and 302 are cut out as one character to form a new cutout pattern. As the recognition result, “i” having the best evaluation value is selected from the candidates that cover the strokes input at the present time.

【００３６】３：ストローク３０３入力ストローク３０１から３０３は一つの文字として切り出
され、新規切り出しパターンとなる。この段階での認識
結果は「川」である。3: Input of Stroke 303 Strokes 301 to 303 are cut out as one character to form a new cutout pattern. The recognition result at this stage is “river”.

【００３７】４：ストローク３０４入力ここではストローク３０１〜３０３と３０４〜３０４で
構成される２つの切り出しパターンが切り出された。前
者はストローク３０３入力時に既に切り出された切り出
しパターンと同じなので、新規切り出しパターンはスト
ローク３０４で構成されるパターンのみである。したが
ってこの段階で文字認識処理される切り出しパターンは
後者だけである。その結果候補文字は「１」と「く」と
なった。切り出しパターンが連続する候補文字「川」
「Ｎ」との組み合わせの中に候補単語はなく、この段階
での認識結果は「川１」となる。4: Input of stroke 304 Here, two cutout patterns composed of strokes 301 to 303 and 304 to 304 are cut out. Since the former is the same as the cutout pattern already cut out when the stroke 303 is input, the new cutout pattern is only the pattern constituted by the stroke 304. Therefore, the latter is the only cut-out pattern that is subjected to character recognition processing at this stage. As a result, the candidate characters became "1" and "ku". Candidate character "Kawa" with continuous cutout patterns
There is no candidate word in the combination with “N”, and the recognition result at this stage is “kawa 1”.

【００３８】５：ストローク３０５入力ストローク３０１〜３０３と３０４〜３０５で切り出さ
れるが、前段階と同様に新規切り出しパターンはストロ
ーク３０４〜３０５で構成されるものである。その切り
出しパターンの認識候補文字は「ト」「人」となった。
候補単語はなく、この段階での認識結果は「川ト」とな
る。5: Input of Stroke 305 The strokes 301 to 303 and 304 to 305 are cut out, and the new cutout pattern is composed of the strokes 304 to 305 as in the previous step. The recognition candidate characters of the cutout pattern are "to" and "person".
There is no candidate word, and the recognition result at this stage is "Kawato".

【００３９】６：ストローク３０６入力ストローク３０１〜３０３と３０４〜３０６で切り出さ
れるが、前段階と同様に新規切り出しパターンはストロ
ーク３０４〜３０６で構成されるものである。その切り
出しパターンの認識候補文字は「山」「Ｎ」となった。
候補単語はなく、この段階での認識結果は「川山」とな
る。6: Stroke 306 input Although the strokes are cut out by the strokes 301 to 303 and 304 to 306, the new cutout pattern is composed of the strokes 304 to 306 as in the previous stage. The recognition candidate characters of the cutout pattern are "mountain" and "N".
There is no candidate word, and the recognition result at this stage is "Kawayama".

【００４０】７：ストローク３０７入力ストローク３０１〜３０３と３０４〜３０６と３０７で
切り出されるが、新規切り出しパターンはストローク３
０７で構成されるもののみである。その切り出しパター
ンの認識候補文字は「一（漢字）」「ー（長音記号）」
となった。候補単語はなく、この段階での認識結果は
「川山一」となる。7: Stroke 307 input The strokes 301 to 303 and 304 to 306 and 307 are cut out. The new cutout pattern is stroke 3.
07 are only configured. The recognition candidate characters for the cutout pattern are "I (kanji)" and "-(long-sound symbol)."
It became. There is no candidate word, and the recognition result at this stage is “Kawayama Hajime”.

【００４１】８：ストローク３０８入力ストローク３０１〜３０３と３０４〜３０６と３０７〜
３０８で切り出されるが、新規切り出しパターンはスト
ローク３０７〜３０８で構成されるもののみである。そ
の切り出しパターンの認識候補文字は「ナ」「＋」とな
った。候補単語はなく、この段階での認識結果は「川山
ナ」となる。8: Stroke 308 input Strokes 301 to 303 and 304 to 306 and 307 to
It is cut out at 308, but the new cutout pattern is only the one composed of strokes 307 to 308. The recognition candidate characters of the cutout pattern are "na" and "+". There is no candidate word, and the recognition result at this stage is "Kawayama Na".

【００４２】９：ストローク３０９入力ストローク３０１〜３０３と３０４〜３０６と３０７〜
３０９で切り出されるが、新規切り出しパターンはスト
ローク３０７〜３０９で構成されるもののみである。そ
の切り出しパターンの認識候補文字は「大」「丈」とな
った。候補単語はなく、この段階での認識結果は「川山
大」となる。9: Input of stroke 309 Strokes 301 to 303 and 304 to 306 and 307 to
Although it is cut out at 309, the new cut-out pattern is only the one composed of strokes 307 to 309. The recognition candidate characters of the cutout pattern are "large" and "length". There is no candidate word, and the recognition result at this stage is "Kawayama Univ."

【００４３】１０：ストローク３１０入力ストローク３０１〜３０３と３０４〜３１０で切り出さ
れるが、新規切り出しパターンはストローク３０４〜３
１０で構成されるもののみである。その切り出しパター
ンの認識候補文字は「性」「囲」となった。候補単語は
なく、この段階での認識結果は「川性」となる。10: Stroke 310 input The strokes 301 to 303 and 304 to 310 are cut out, and the new cutout pattern is strokes 304 to 3
Only those configured with 10. The recognition candidate characters of the cutout pattern are "sex" and "enclosure." There are no candidate words, and the recognition result at this stage is "river."

【００４４】１１：ストローク３１１入力ストローク３０１〜３０３と３０４〜３１１で切り出さ
れるが、新規切り出しパターンはストローク３０４〜３
１１で構成されるもののみである。その切り出しパター
ンの認識候補文字は「枠」「砕」となった。候補単語は
なく、この段階での認識結果は「川枠」となる。11: Input of stroke 311 The strokes 301 to 303 and 304 to 311 are cut out, but the new cutout pattern is strokes 304 to 3
11 is only configured. The recognition candidate characters of the cutout pattern are "frame" and "break". There are no candidate words, and the recognition result at this stage is "river frame."

【００４５】１２：ストローク３１２入力ストローク３０１〜３０３と３０４〜３１２で切り出さ
れるが、新規切り出しパターンはストローク３０４〜３
１２で構成されるもののみである。その切り出しパター
ンの認識候補文字は「峠」「特」となった。候補単語は
なく、この段階での認識結果は「川峠」となる。12: Stroke 312 input The strokes 301 to 303 and 304 to 312 are cut out, but the new cutout pattern is strokes 304 to 3
It is only composed of 12. The recognition candidate characters for the cutout pattern are "pass" and "special". There is no candidate word, and the recognition result at this stage is "Kawatoge".

【００４６】１３：ストローク３１３入力ストローク３０１〜３０３と３０４〜３１３で切り出さ
れるが、新規切り出しパターンはストローク３０４〜３
１３で構成されるもののみである。その切り出しパター
ンの認識候補文字は「峰」「崎」となった。候補単語
「川崎」が存在し、評価値は候補単語「川」と「崎」の
評価値を個別に加算した値２１０（＝６０＋１５０）に
定数αを乗した値となる。図の例ではα＝０．８として
計算したので評価値１６８となっている。この段階での
認識結果は、「川崎」が選ばれる。ちなみに単語辞書
「川崎」がない場合は「川峰」となる。13: Stroke 313 input: The strokes 301 to 303 and 304 to 313 are cut out, but the new cutout pattern is strokes 304 to 3
13 is the only one. The recognition candidate characters of the cutout pattern are "Mine" and "Saki". The candidate word “Kawasaki” exists, and the evaluation value is a value 210 (= 60 + 150) obtained by individually adding the evaluation values of the candidate words “kawa” and “saki” multiplied by a constant α. In the example of the figure, the evaluation value is 168 because the calculation is performed with α = 0.8. “Kawasaki” is selected as the recognition result at this stage. By the way, if there is no word dictionary "Kawasaki", it will be "Kawamine".

【００４７】１４：ストローク３１４入力ストローク３０１〜３０３と３０４〜３１４で切り出さ
れるが、新規切り出しパターンはストローク３０４〜３
１４で構成されるもののみである。その切り出しパター
ンの認識候補文字は「崎」「埼」となった。前段階と同
様に候補単語「川崎」が存在する。認識結果は「川崎」
となる。14: Stroke 314 input The strokes 301 to 303 and 304 to 314 are cut out, and the new cutout pattern is strokes 304 to 3
14 is the only one. The recognition candidate characters of the cutout pattern are "saki" and "saki". As in the previous step, the candidate word "Kawasaki" exists. The recognition result is "Kawasaki"
Becomes

【００４８】このように、新たに手書きストロークが入
力されることによって変化する文字の切り出しや、それ
に伴う認識結果の変化、変化した認識結果の単語認識等
の後処理への動的な反映を実現した。また、変化した認
識結果と、それに関係すると予想される部分のみを抜き
出して再処理することによって効率的な処理が実現され
た。As described above, a character that changes when a new handwritten stroke is input, a change in the recognition result accompanying it, and a dynamic reflection of the changed recognition result in post-processing such as word recognition are realized. did. In addition, efficient processing was realized by extracting only the changed recognition result and the part expected to be related to it and reprocessing.

【００４９】本発明による構成は、オンライン手書き文
字認識装置のみならず、断続的な入力を認識する装置全
般にも適用することが可能である。たとえば、連続音声
認識装置にも適用可能である。The structure according to the present invention can be applied not only to an on-line handwritten character recognition device but also to all devices for recognizing intermittent input. For example, it can be applied to a continuous voice recognition device.

【００５０】[0050]

【発明の効果】以上の実施例から明らかなように、本発
明の構成の文字認識装置を使用することにより、オンラ
イン手書き文字認識において、新たに手書きストローク
が入力されると、文字切り出し、個別文字認識、単語認
識、文法認識の各段階で、過去の認識結果との差分につ
いてのみ処理を行うことができる。このため、必要最小
限の処理により認識結果が更新可能となり、高速かつ効
率的な認識を行ううえで効果がある。As is apparent from the above embodiments, by using the character recognition device having the configuration of the present invention, when a new handwriting stroke is input in the online handwritten character recognition, the character segmentation and the individual character are input. At each stage of recognition, word recognition, and grammar recognition, only the difference from the past recognition result can be processed. Therefore, the recognition result can be updated with the minimum necessary processing, which is effective in performing high-speed and efficient recognition.

[Brief description of drawings]

【図１】本発明の文字認識装置の一実施例の構成図であ
る。FIG. 1 is a configuration diagram of an embodiment of a character recognition device of the present invention.

【図２】本発明の文字認識装置の実施例の文字切り出し
手段の文字切り出し方法の一例を説明するための図であ
る。FIG. 2 is a diagram for explaining an example of a character cutout method of a character cutout unit of the embodiment of the character recognition device of the present invention.

【図３】本発明の文字認識装置の実施例の文字切り出し
手段の文字切り出し方法の一例を説明するための図であ
る。FIG. 3 is a diagram for explaining an example of a character cutout method of a character cutout unit of the embodiment of the character recognition device of the present invention.

【図４】本発明の文字認識装置の実施例の文字切り出し
手段の文字切り出し方法の一例を説明するための図であ
る。FIG. 4 is a diagram for explaining an example of a character cutout method of a character cutout unit of the embodiment of the character recognition device of the present invention.

【図５】本発明の文字認識装置の実施例の文字切り出し
手段の文字切り出し方法の一例を説明するための図であ
る。FIG. 5 is a diagram for explaining an example of a character cutout method of the character cutout means of the embodiment of the character recognition device of the present invention.

【図６】本発明の文字認識装置の実施例の文字切り出し
手段の文字切り出し方法の一例を説明するための図であ
る。FIG. 6 is a diagram for explaining an example of a character cutout method of a character cutout unit of the embodiment of the character recognition device of the present invention.

【図７】本発明の文字認識装置の実施例の文字切り出し
パターンの一例示す図である。FIG. 7 is a diagram showing an example of a character cutout pattern of an embodiment of the character recognition device of the present invention.

【図８】本発明の文字認識装置の実施例の文字切り出し
パターン比較手段の処理を説明するための図である。FIG. 8 is a diagram for explaining the processing of the character cutout pattern comparison means of the embodiment of the character recognition device of the present invention.

【図９】本発明の文字認識装置の実施例の認識候補文字
データの一例を示す図である。FIG. 9 is a diagram showing an example of recognition candidate character data of an embodiment of the character recognition device of the present invention.

【図１０】本発明の文字認識装置の実施例の候補単語デ
ータの一例を示す図である。FIG. 10 is a diagram showing an example of candidate word data in the embodiment of the character recognition device of the present invention.

【図１１】本発明の文字認識装置の実施例の動作を説明
する図である。FIG. 11 is a diagram for explaining the operation of the embodiment of the character recognition device of the present invention.

【図１２】従来の文字認識装置の一例を示す構成図であ
る。FIG. 12 is a configuration diagram showing an example of a conventional character recognition device.

[Explanation of symbols]

１…ストローク入力手段、２…ストローク比較手段、３
…ストローク記憶手段、４…文字切り出し手段、５…文
字切り出しパターン記憶手段、６…文字切り出しパター
ン比較手段、７…個別文字認識手段、８…候補文字記憶
手段、９…候補文字比較手段、１０…単語認識手段、１
１…単語辞書、１２…候補単語記憶手段、１３…候補単
語比較手段、１４…文法認識手段、１５…文法辞書、１
６…文法適合単語列記憶手段、１７…認識結果決定手段1 ... Stroke input means, 2 ... Stroke comparison means, 3
... stroke storage means, 4 ... character cutout means, 5 ... character cutout pattern storage means, 6 ... character cutout pattern comparison means, 7 ... individual character recognition means, 8 ... candidate character storage means, 9 ... candidate character comparison means, 10 ... Word recognition means, 1
DESCRIPTION OF SYMBOLS 1 ... Word dictionary, 12 ... Candidate word storage means, 13 ... Candidate word comparison means, 14 ... Grammar recognition means, 15 ... Grammar dictionary, 1
6 ... Grammar compatible word string storage means, 17 ... Recognition result determination means

───────────────────────────────────────────────────── フロントページの続き (72)発明者難波康晴神奈川県川崎市麻生区王禅寺1099番地株式会社日立製作所システム開発研究所内 (72)発明者青島弘和神奈川県川崎市麻生区王禅寺1099番地株式会社日立製作所システム開発研究所内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Yasuharu Namba 1099, Ozenji, Aso-ku, Kawasaki-shi, Kanagawa Inside the Hitachi, Ltd. Systems Development Laboratory (72) Inventor Hirokazu Aoshima 1099, Ozen-ji, Aso-ku, Kawasaki, Kanagawa Hitachi Systems Development Laboratory

Claims

[Claims]

1. An input unit for inputting handwriting stroke data of handwritten characters, a first storage unit for storing the input handwriting stroke data, and a unit for storing the input handwriting stroke data and the storage unit in the past. Compare the stored handwriting stroke data,
First extracting means for extracting difference stroke data, cutout means for cutting out the input handwriting stroke data as a stroke set for each character using the extracted difference stroke data, and the cutout stroke set. A character recognition device comprising a character recognition means for recognizing a character.

2. The character recognition device according to claim 1, wherein the storage unit stores the stroke set cut out by the cutting unit, the stroke set cut out by the cutting unit, and the second set in the past. Second extraction means for comparing the stroke set stored in the storage means and extracting the difference, the character recognition means performing character recognition using the extraction result of the second extraction means. Character recognition device.

3. The character recognition device according to claim 1, wherein the character recognition unit stores the character recognized by the character recognition unit, and the character recognized by the character recognition unit and the character recognized in the past. A third recognizing means for comparing the characters stored in the second storing means and extracting a difference, and a word recognizing means for recognizing a word using the third extracting means and a word dictionary storing word information. A character recognition device having.

4. The character recognition device according to claim 3, wherein the word recognition means outputs one or more candidate words.

5. The character recognition device according to claim 1, wherein the character recognition means outputs one or more candidate characters.