JP7095450B2

JP7095450B2 - Information processing device, character recognition method, and character recognition program

Info

Publication number: JP7095450B2
Application number: JP2018136748A
Authority: JP
Inventors: 勝彦秋山
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2018-07-20
Filing date: 2018-07-20
Publication date: 2022-07-05
Anticipated expiration: 2038-07-20
Also published as: JP2020013460A

Description

本発明は、情報処理装置、文字認識方法、および文字認識プログラムに関する。 The present invention relates to an information processing device, a character recognition method, and a character recognition program.

従来、ユーザが手の指やスタイラスなどを利用して１つの入力欄に重ね書きした文字を認識する文字認識技術がある。 Conventionally, there is a character recognition technique in which a user recognizes characters overwritten in one input field by using a finger or a stylus.

先行技術としては、例えば、バッファ内の入力ストローク列に対する認識候補文字列を生成し、認識候補文字列ごとに文字列構造辞書を生成して入力ストローク列と照合するものがある。また、例えば、入力ストロークに基づいて、文字の推測に対応するノードから構成される分割グラフを生成し、分割グラフの各ノードに認識スコアを割り当て、認識スコアと言語モデルとに基づいて、入力ストロークの言語的意味を生成する技術がある。また、例えば、ユーザが複数の重なり合ったストロークを描くと、ストロークを自動的にセグメント化し、１つ以上の対応する文字に結合する技術がある。また、例えば、入力された手書き文字の各ストロークについて入力枠に対するストロークの開始・終了領域と方向とからストローク情報を求め、ストローク辞書に登録されたストローク情報と照合してストローク候補を検出する技術がある。また、例えば、ユーザが筆記中の字画がｉ番目の文字に帰属するのかｉ＋１番目の文字に帰属するのかを判断し、ユーザがｉ＋１番目の文字の筆記を開始すると、ｉ番目の文字の字画の表示色を自動的に暗転する技術がある。 As a prior art, for example, there is one that generates a recognition candidate character string for an input stroke string in a buffer, generates a character string structure dictionary for each recognition candidate character string, and collates it with the input stroke string. Also, for example, based on the input stroke, a split graph consisting of nodes corresponding to character guesses is generated, a recognition score is assigned to each node of the split graph, and the input stroke is based on the recognition score and the language model. There is a technique to generate the linguistic meaning of. Further, for example, there is a technique in which when a user draws a plurality of overlapping strokes, the strokes are automatically segmented and combined with one or more corresponding characters. Further, for example, a technique for obtaining stroke information from the start / end area and direction of a stroke for an input frame for each stroke of an input handwritten character and collating it with the stroke information registered in the stroke dictionary to detect a stroke candidate. be. Further, for example, when the user determines whether the stroke being written belongs to the i-th character or the i + 1-th character and the user starts writing the i + 1-th character, the stroke of the i-th character There is a technology that automatically darkens the display color.

特開２００２－２０３２０８号公報Japanese Unexamined Patent Publication No. 2002-203208 国際公開第２０１５／１５０９１１号International Publication No. 2015/150911 国際公開第２０１５／０９４５８７号International Publication No. 2015/094587 特開２００１－１８４４５８号公報Japanese Unexamined Patent Publication No. 2001-184458 国際公開第２０１２／０７５８２０号International Publication No. 2012/0752820

しかしながら、従来技術では、入力欄に重ね書きされる文字を正しく認識することができない場合がある。例えば、漢字、ひらがな、カタカナ、英数字、記号などの、認識する文字の種別が多くなるほど、入力欄に重ね書きされる文字を正しく認識することが難しくなる傾向がある。 However, in the prior art, it may not be possible to correctly recognize the characters overwritten in the input field. For example, the more types of characters to be recognized, such as Chinese characters, hiragana, katakana, alphanumericals, and symbols, the more difficult it tends to be to correctly recognize the characters overwritten in the input field.

１つの側面では、本発明は、文字の認識精度を向上することを目的とする。 In one aspect, the present invention aims to improve character recognition accuracy.

１つの実施態様によれば、入力欄に書き込まれた筆画の軌跡を示す筆画データを蓄積し、蓄積した前記筆画データを参照して、前記入力欄に書き込まれた、時系列順に連続する１以上の筆画を含む先行の筆画グループと、前記先行の筆画グループの直後に続く、時系列順に連続する１以上の筆画を含む後続の筆画グループとの特徴量に基づいて、前記先行の筆画グループと前記後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出し、蓄積した前記筆画データと、算出した前記妥当性スコアとに基づいて、前記入力欄に書き込まれた１以上の文字を認識する情報処理装置、文字認識方法、および文字認識プログラムが提案される。 According to one embodiment, stroke data indicating the trajectory of the stroke written in the input field is accumulated, and with reference to the accumulated stroke data, one or more consecutive strokes written in the input field are written in chronological order. The preceding stroke group and the said The input field is based on the stroke data accumulated by calculating the validity score indicating the strength of the validity that the division with the subsequent stroke group is the division between characters, and the calculated validity score. An information processing device, a character recognition method, and a character recognition program for recognizing one or more characters written in are proposed.

一態様によれば、文字の認識精度を向上することが可能になる。 According to one aspect, it becomes possible to improve the recognition accuracy of characters.

図１は、実施の形態にかかる文字認識方法の一実施例を示す説明図である。FIG. 1 is an explanatory diagram showing an embodiment of a character recognition method according to an embodiment. 図２は、情報処理装置１００のハードウェア構成例を示すブロック図である。FIG. 2 is a block diagram showing a hardware configuration example of the information processing apparatus 100. 図３は、辞書データベース３００の記憶内容の一例を示す説明図である。FIG. 3 is an explanatory diagram showing an example of the stored contents of the dictionary database 300. 図４は、文脈データベース４００の記憶内容の一例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of the stored contents of the context database 400. 図５は、候補データベース５００の記憶内容の一例を示す説明図である。FIG. 5 is an explanatory diagram showing an example of the stored contents of the candidate database 500. 図６は、情報処理装置１００の機能的構成例を示すブロック図である。FIG. 6 is a block diagram showing a functional configuration example of the information processing apparatus 100. 図７は、情報処理装置１００がニューラルネットワークを生成する一例を示す説明図（その１）である。FIG. 7 is an explanatory diagram (No. 1) showing an example in which the information processing apparatus 100 generates a neural network. 図８は、情報処理装置１００がニューラルネットワークを生成する一例を示す説明図（その２）である。FIG. 8 is an explanatory diagram (No. 2) showing an example in which the information processing apparatus 100 generates a neural network. 図９は、ニューラルネットワークに学習される性質の一例を示す説明図（その１）である。FIG. 9 is an explanatory diagram (No. 1) showing an example of the property learned by the neural network. 図１０は、ニューラルネットワークに学習される性質の一例を示す説明図（その２）である。FIG. 10 is an explanatory diagram (No. 2) showing an example of the property learned by the neural network. 図１１は、ニューラルネットワークに学習される性質の一例を示す説明図（その３）である。FIG. 11 is an explanatory diagram (No. 3) showing an example of the property learned by the neural network. 図１２は、情報処理装置１００が入力画面１２００を表示する一例を示す説明図である。FIG. 12 is an explanatory diagram showing an example in which the information processing apparatus 100 displays the input screen 1200. 図１３は、情報処理装置１００が文字を認識する一例を示す説明図（その１）である。FIG. 13 is an explanatory diagram (No. 1) showing an example in which the information processing apparatus 100 recognizes characters. 図１４は、情報処理装置１００が文字を認識する一例を示す説明図（その２）である。FIG. 14 is an explanatory diagram (No. 2) showing an example in which the information processing apparatus 100 recognizes characters. 図１５は、情報処理装置１００が文字を認識する一例を示す説明図（その３）である。FIG. 15 is an explanatory diagram (No. 3) showing an example in which the information processing apparatus 100 recognizes characters. 図１６は、情報処理装置１００が文字を認識した結果の一例を示す説明図である。FIG. 16 is an explanatory diagram showing an example of the result of the information processing apparatus 100 recognizing characters. 図１７は、全体処理手順の一例を示すフローチャートである。FIG. 17 is a flowchart showing an example of the overall processing procedure.

以下に、図面を参照して、本発明にかかる情報処理装置、文字認識方法、および文字認識プログラムの実施の形態を詳細に説明する。 Hereinafter, embodiments of the information processing apparatus, the character recognition method, and the character recognition program according to the present invention will be described in detail with reference to the drawings.

（実施の形態にかかる文字認識方法の一実施例）
図１は、実施の形態にかかる文字認識方法の一実施例を示す説明図である。図１において、情報処理装置１００は、ユーザにより１つの入力欄に重ね書きされた文字を認識するためのコンピュータである。情報処理装置１００は、例えば、ＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）、タブレット端末、スマートフォン、ウェアラブル端末などである。 (Example of character recognition method according to the embodiment)
FIG. 1 is an explanatory diagram showing an embodiment of a character recognition method according to an embodiment. In FIG. 1, the information processing apparatus 100 is a computer for recognizing characters overwritten in one input field by a user. The information processing device 100 is, for example, a PC (Personal Computer), a tablet terminal, a smartphone, a wearable terminal, or the like.

ここで、１つの入力欄に文字を重ね書きする方式は、他の方式に比べて有用な点が存在するため、１つの入力欄に文字を重ね書きする方式を利用することが好ましい場合がある。他の方式は、例えば、複数の入力欄が用意され、ユーザがそれぞれの入力欄に１文字ずつ文字を書き込むような方式である。また、他の方式は、例えば、ユーザが１つの入力欄に１文字書き込む都度、確定操作を行うような方式である。 Here, since the method of overwriting characters in one input field has more useful points than the other methods, it may be preferable to use the method of overwriting characters in one input field. .. Another method is, for example, a method in which a plurality of input fields are prepared and a user writes one character in each input field. Further, another method is, for example, a method in which a confirmation operation is performed each time a user writes one character in one input field.

他の方式に比べて有用な点は、例えば、入力欄を大きくしやすいため、ユーザが文字を書き込みやすくなるという点が考えられる。また、他の方式に比べて有用な点は、例えば、ユーザが１文字ずつ書き込む入力欄を変更しなくてもよく、ユーザが１文字ずつ確定操作を行わなくてもよいため、ユーザが文字を書き込みやすくなるという点が考えられる。 A useful point as compared with other methods is that, for example, it is easy to make the input field large, so that it is easy for the user to write characters. Further, a useful point compared to other methods is that, for example, the user does not have to change the input field for writing characters one by one, and the user does not have to perform the confirmation operation one character at a time, so that the user writes characters. It is possible that it will be easier to write.

しかしながら、１つの入力欄に文字を重ね書きする方式で、入力欄に書き込まれた文字を正しく認識することができない場合がある。例えば、漢字、ひらがな、カタカナ、英数字、記号などの、認識する文字の種別が多くなるほど、入力欄に重ね書きされる文字を正しく認識することが難しくなる傾向がある。 However, there are cases where the characters written in the input field cannot be correctly recognized by the method of overwriting the characters in one input field. For example, the more types of characters to be recognized, such as Chinese characters, hiragana, katakana, alphanumericals, and symbols, the more difficult it tends to be to correctly recognize the characters overwritten in the input field.

ここで、１つの入力欄に文字を重ね書きする方式では、１つの入力欄に連続して書き込まれた複数の筆画のうち、いずれの筆画の集まりが、１つの文字分の筆画の集まりであるかは、ユーザから明示されない。そして、認識する文字の種別が多くなるほど、１つの文字を形成する筆画の集まりと、連続する２以上の文字を形成する筆画の集まりとが類似してしまう確率が増加する傾向がある。結果として、１つの文字と、連続する２以上の文字とを区別して認識することが難しくなる。例えば、「名（漢字）」と「タロ（カタカナ）」とを区別することが難しくなる。また、例えば、「夕（漢字）」と「タ（カタカナ）」と「クー（カタカナ）」とを区別することが難しくなる。 Here, in the method of overwriting characters in one input field, a collection of strokes among a plurality of strokes written consecutively in one input field is a collection of strokes for one character. Is not specified by the user. As the number of types of characters to be recognized increases, the probability that a group of strokes forming one character and a group of strokes forming two or more consecutive characters tend to be similar increases. As a result, it becomes difficult to distinguish and recognize one character and two or more consecutive characters. For example, it becomes difficult to distinguish between "name (kanji)" and "taro (katakana)". Further, for example, it becomes difficult to distinguish between "Yu (Kanji)", "Ta (Katakana)", and "Ku (Katakana)".

これに対し、例えば、認識する文字の種別をユーザが切り替え可能にし、認識する文字の種別を制限して、重ね書きされる文字を精度よく認識しようとする場合が考えられる。この場合では、ユーザが１つの入力欄に多種の文字を含む文字列を書き込もうとする際、認識する文字の種別を切り替える操作を、ユーザが行うことになるため、ユーザの負担の増大化を招いてしまうという問題がある。 On the other hand, for example, it is conceivable that the user can switch the type of the character to be recognized, the type of the character to be recognized is limited, and the overwritten character is recognized with high accuracy. In this case, when the user tries to write a character string containing various characters in one input field, the user performs an operation of switching the type of the character to be recognized, which increases the burden on the user. There is a problem that it will be gone.

また、例えば、連続する２つの文字の組み合わせごとに、連続する２つの文字の一方の文字を書き込んだ後、他方の文字の書き込みに移行する際、入力欄上で書き込み位置がどこからどこに移動するかの正しい軌跡を示す辞書情報を用意しておく場合が考えられる。そして、１つの入力欄に書き込まれた複数の筆画から文字の候補を特定した後、辞書情報を参照し、いずれの候補が妥当であるかを判断して、文字を精度よく認識しようとする。 Also, for example, when one character of two consecutive characters is written for each combination of two consecutive characters and then the writing of the other character is started, where the writing position moves on the input field. It is conceivable to prepare dictionary information showing the correct trajectory of. Then, after identifying character candidates from a plurality of strokes written in one input field, the dictionary information is referred to to determine which candidate is appropriate, and an attempt is made to accurately recognize the character.

この場合では、認識する文字の種別が多くなるほど、文字の組み合わせの数が増大するため、膨大な文字の組み合わせについての辞書情報を用意することになるという問題がある。また、この場合では、連続する２つの文字が斜めに傾けて書き込まれると、入力欄上で書き込み位置が移動した実際の軌跡が、辞書情報が示す正しい軌跡と一致しないことがあり、重ね書きされる文字を正しく認識することができないという問題がある。 In this case, as the number of types of characters to be recognized increases, the number of character combinations increases, so that there is a problem that dictionary information about a huge number of character combinations is prepared. Further, in this case, if two consecutive characters are written at an angle, the actual locus of the movement of the writing position on the input field may not match the correct locus indicated by the dictionary information, and the characters are overwritten. There is a problem that the characters cannot be recognized correctly.

また、例えば、１つの入力欄に書き込まれた複数の筆画から文字の候補を特定した後、文脈上いずれの候補が妥当であるかを判断して、重ね書きされる文字を精度よく認識しようとする場合が考えられる。この場合では、特定した候補のうち、文脈上妥当である候補が複数存在すると、重ね書きされる文字を正しく認識することができないという問題がある。 Also, for example, after identifying character candidates from a plurality of strokes written in one input field, it is determined which candidate is appropriate in the context, and an attempt is made to accurately recognize the overwritten character. It is possible that you will. In this case, if there are a plurality of candidates that are valid in the context among the specified candidates, there is a problem that the overwritten characters cannot be recognized correctly.

そこで、本実施の形態では、認識する文字の種別を制限しなくても、ユーザによって１つの入力欄に重ね書きされる文字を精度よく認識可能にすることができる文字認識方法について説明する。 Therefore, in the present embodiment, a character recognition method capable of accurately recognizing characters overwritten in one input field by the user without limiting the types of characters to be recognized will be described.

図１の例では、情報処理装置１００は、文字が重ね書きされる１つの入力欄１１０を表示する。また、図１の例では、ユーザは、手書きで入力欄１１０に、５０音順のひらがなの各文字「あいう…」を重ね書きしていく場合を例に挙げる。 In the example of FIG. 1, the information processing apparatus 100 displays one input field 110 in which characters are overwritten. Further, in the example of FIG. 1, a case where the user handwrites each character "Ai ..." of hiragana in the order of the Japanese syllabary in the input field 110 is given as an example.

（１－１）情報処理装置１００は、入力欄１１０に書き込まれた筆画の軌跡を示す筆画データを蓄積する。情報処理装置１００は、例えば、入力欄１１０に筆画が書き込まれる都度、筆画を検出し、筆画の軌跡を示す筆画データを取得して蓄積する。図１の例では、情報処理装置１００は、具体的には、入力欄１１０に書き込まれた筆画１１１～１１５を検出し、筆画１１１～１１５の軌跡を示す筆画データを取得して蓄積する。 (1-1) The information processing apparatus 100 stores stroke data indicating the trajectory of the stroke written in the input field 110. For example, the information processing apparatus 100 detects a stroke each time a stroke is written in the input field 110, and acquires and stores stroke data indicating the trajectory of the stroke. In the example of FIG. 1, the information processing apparatus 100 specifically detects the strokes 111 to 115 written in the input field 110, and acquires and stores the stroke data showing the loci of the strokes 111 to 115.

（１－２）情報処理装置１００は、蓄積した筆画データを参照して、入力欄１１０に書き込まれた、先行の筆画グループと、後続の筆画グループとの特徴量に基づいて、先行の筆画グループと、後続の筆画グループとの区切りについての妥当性スコアを算出する。筆画グループは、時系列順に連続する１以上の筆画を含むグループである。後続の筆画グループは、先行の筆画グループの直後に続く筆画グループである。筆画グループの特徴量は、例えば、筆画グループに含まれる先頭の筆画および末尾の筆画の入力欄１１０での位置である。また、筆画グループの特徴量は、例えば、筆画グループ全体の入力欄１１０での位置であってもよい。また、筆画グループの特徴量は、例えば、筆画グループに含まれる筆画の数であってもよい。 (1-2) The information processing apparatus 100 refers to the accumulated stroke data, and based on the feature quantities of the preceding stroke group and the succeeding stroke group written in the input field 110, the preceding stroke group. And the validity score for the division with the subsequent stroke group is calculated. The stroke group is a group including one or more consecutive strokes in chronological order. The subsequent stroke group is the stroke group immediately following the preceding stroke group. The feature amount of the stroke group is, for example, the position in the input field 110 of the first stroke and the last stroke included in the stroke group. Further, the feature amount of the stroke group may be, for example, the position in the input field 110 of the entire stroke group. Further, the feature amount of the stroke group may be, for example, the number of strokes included in the stroke group.

妥当性スコアは、先行の筆画グループと後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示すスコアである。妥当性スコアは、例えば、先行の筆画グループと後続の筆画グループとの筆画の位置関係や画数などの特徴が、先行の筆画グループと後続の筆画グループとの区切りが正しい場合に対応する特徴を表している度合いを示す。情報処理装置１００は、例えば、いずれかの筆画間を区切りとし、先行の筆画グループと後続の筆画グループとのペアを１つ特定する。情報処理装置１００は、例えば、それぞれの筆画間を区切りとし、先行の筆画グループと後続の筆画グループとのペアを複数特定してもよい。 The validity score is a score indicating the strength of validity that the delimiter between the preceding stroke group and the subsequent stroke group is the delimiter between characters. The validity score represents, for example, the characteristics corresponding to the case where the characteristics such as the positional relationship and the number of strokes of the strokes between the preceding stroke group and the succeeding stroke group correspond to the case where the division between the preceding stroke group and the succeeding stroke group is correct. Indicates the degree of stroke. The information processing apparatus 100 specifies, for example, one pair of a preceding stroke group and a succeeding stroke group, with one of the strokes as a delimiter. The information processing apparatus 100 may specify, for example, a plurality of pairs of a preceding stroke group and a succeeding stroke group by separating each stroke.

図１の例では、情報処理装置１００は、具体的には、入力欄１１０に書き込まれた筆画１１１～１１５を、筆画１１１～１１３を含む先行の筆画グループＧ１と、筆画１１４，１１５を含む後続の筆画グループＧ２とにグループ化する。情報処理装置１００は、先行の筆画グループＧ１と、後続の筆画グループＧ２との特徴量を、所定のモデルＭｏｄに入力し、所定のモデルＭｏｄを用いて妥当性スコアを算出する。所定のモデルＭｏｄは、２つの筆画グループの特徴量を入力すると、２つの筆画グループの区切りの妥当性スコアを出力するモデルである。所定のモデルＭｏｄは、例えば、ニューラルネットワークを用いて実現される。 In the example of FIG. 1, the information processing apparatus 100 specifically uses the strokes 111 to 115 written in the input field 110, the preceding stroke group G1 including the strokes 111 to 113, and the subsequent strokes 114 and 115. Group with the stroke group G2. The information processing apparatus 100 inputs the feature amounts of the preceding stroke group G1 and the succeeding stroke group G2 into a predetermined model mod, and calculates the validity score using the predetermined model mod. The predetermined model Mod is a model that outputs the validity score of the division between the two stroke groups when the feature amounts of the two stroke groups are input. A given model mod is realized using, for example, a neural network.

（１－３）情報処理装置１００は、蓄積した筆画データと、算出した妥当性スコアとに基づいて、入力欄１１０に書き込まれた１以上の文字を認識する。図１の例では、情報処理装置１００は、具体的には、算出した妥当性スコアが一定以上であるため、先行の筆画グループＧ１に含まれる筆画１１１～１１３の合成結果の筆跡１２１から、文字「あ」を認識する。また、情報処理装置１００は、後続の筆画グループＧ２に含まれる筆画１１４，１１５の合成結果の筆跡１２２から、文字「い」を認識する。 (1-3) The information processing apparatus 100 recognizes one or more characters written in the input field 110 based on the accumulated stroke data and the calculated validity score. In the example of FIG. 1, since the information processing apparatus 100 specifically has a calculated validity score of a certain value or more, characters are obtained from the handwriting 121 of the composition result of the strokes 111 to 113 included in the preceding stroke group G1. Recognize "A". Further, the information processing apparatus 100 recognizes the character "i" from the handwriting 122 of the composition result of the strokes 114 and 115 included in the subsequent stroke group G2.

これにより、情報処理装置１００は、認識する文字の種別を制限しなくても、ユーザによって１つの入力欄に重ね書きされる文字を精度よく認識可能にすることができる。情報処理装置１００は、例えば、認識する文字の種別が増加しても、文字の区切りがユーザから明示されずとも、１つの文字を形成する筆画の集まりを精度よく特定することができ、重ね書きされる文字を精度よく認識可能にすることができる。 As a result, the information processing apparatus 100 can accurately recognize the characters overwritten in one input field by the user without limiting the types of characters to be recognized. For example, even if the types of characters to be recognized increase, the information processing apparatus 100 can accurately identify a group of strokes forming one character even if the character delimiter is not specified by the user, and overwriting. It is possible to make the characters to be recognized accurately and recognizable.

また、情報処理装置１００は、例えば、先行の筆画グループと後続の筆画グループとの特徴量を基に妥当性スコアを算出するため、１文字ずつの筆画の位置関係と、文字間の筆画の位置関係とを考慮可能にすることができる。このため、情報処理装置１００は、ユーザが文字を斜めに傾けて書き込む傾向があっても、重ね書きされる文字を精度よく認識可能にすることができる。 Further, the information processing apparatus 100 calculates the validity score based on the feature amount of the preceding stroke group and the succeeding stroke group, for example, the positional relationship of the strokes for each character and the position of the strokes between the characters. Relationships can be considered. Therefore, the information processing apparatus 100 can accurately recognize the overwritten characters even if the user tends to write the characters at an angle.

また、情報処理装置１００は、例えば、先行の筆画グループと後続の筆画グループとの特徴量を基に、所定のモデルＭｏｄを用いて、妥当性スコアを算出することができる。このため、情報処理装置１００は、２つの文字の組み合わせごとに、前方の文字から後方の文字の書き込みに移行する際、書き込み位置がどこからどこに移動するかの正しい軌跡を示す辞書情報を記憶する場合に比べて、メモリ使用量の低減化を図ることができる。 Further, the information processing apparatus 100 can calculate a validity score by using a predetermined model Mod based on, for example, the feature quantities of the preceding stroke group and the succeeding stroke group. Therefore, when the information processing apparatus 100 stores dictionary information indicating a correct trajectory of where the writing position moves when shifting from the writing of the front character to the writing of the rear character for each combination of two characters. Compared with the above, the memory usage can be reduced.

また、情報処理装置１００は、例えば、認識結果になりうる文字列の候補が複数あり、いずれの候補も文脈上妥当であっても、ユーザの書き方から、いずれの候補における文字間の区切りが妥当であるかを判断可能にすることができる。このため、情報処理装置１００は、重ね書きされる文字を精度よく認識可能にすることができる。 Further, in the information processing apparatus 100, for example, there are a plurality of character string candidates that can be recognition results, and even if any of the candidates is valid in the context, the delimiter between characters in any of the candidates is valid from the user's writing style. It can be made possible to judge whether it is. Therefore, the information processing apparatus 100 can accurately recognize the characters to be overwritten.

ここでは、算出した妥当性スコアが一定以上である場合について説明したが、これに限らない。例えば、情報処理装置１００は、算出した妥当性スコアが一定未満である場合に、先行グループと後続グループとの別のペアについて、同様に妥当性スコアを算出し、別のペアに基づいて文字を認識する場合があってもよい。 Here, the case where the calculated validity score is above a certain level has been described, but the present invention is not limited to this. For example, when the calculated validity score is less than a certain value, the information processing apparatus 100 similarly calculates the validity score for another pair of the preceding group and the succeeding group, and outputs characters based on the other pair. It may be recognized.

また、ここでは、情報処理装置１００が、先行の筆画グループと後続の筆画グループとを組み合わせたいずれか１つのペアについて妥当性スコアを算出し、妥当性スコアを基に文字を認識する場合について説明したが、これに限らない。例えば、情報処理装置１００が、先行の筆画グループと後続の筆画グループとを組み合わせて考え得るすべてのペアについて妥当性スコアを算出し、妥当性スコアを基に文字を認識する場合があってもよい。 Further, here, the case where the information processing apparatus 100 calculates the validity score for any one pair of the combination of the preceding stroke group and the succeeding stroke group and recognizes the character based on the validity score will be described. However, it is not limited to this. For example, the information processing apparatus 100 may calculate a validity score for all possible pairs of a preceding stroke group and a succeeding stroke group, and recognize characters based on the validity score. ..

また、例えば、情報処理装置１００は、妥当性スコアの他に、筆画グループに含まれる１以上の筆画の合成結果の筆跡が示す文字の候補の妥当性の強さを示す認識スコアを利用して、入力欄１１０に書き込まれた文字を認識する場合があってもよい。認識スコアが高いほど、筆画グループが形成する文字の候補が正しい確率が高く、筆画グループの区切りが正しい確率が高いことを示すことになる。具体的には、情報処理装置１００は、図３に後述する辞書データベースに基づいて、認識スコアを算出する。これによれば、情報処理装置１００は、さらに、重ね書きされる文字を精度よく認識可能にすることができる。 Further, for example, the information processing apparatus 100 uses, in addition to the validity score, a recognition score indicating the strength of the validity of the character candidate indicated by the handwriting of the composition result of one or more strokes included in the stroke group. , The character written in the input field 110 may be recognized. The higher the recognition score, the higher the probability that the character candidates formed by the stroke group are correct, and the higher the probability that the stroke group division is correct. Specifically, the information processing apparatus 100 calculates the recognition score based on the dictionary database described later in FIG. According to this, the information processing apparatus 100 can further make it possible to accurately recognize the characters to be overwritten.

また、例えば、情報処理装置１００は、妥当性スコアの他に、２つの筆画グループが形成する文字の組み合わせが、文脈に沿った組み合わせである妥当性の強さを示す文脈スコアを利用して、入力欄１１０に書き込まれた文字を認識する場合があってもよい。文脈スコアが高いほど、２つの筆画グループが形成する文字の組み合わせが正しい確率が高く、２つの筆画グループの区切りが正しい確率が高いことを示すことになる。具体的には、情報処理装置１００は、図４に後述する文脈データベースに基づいて、文脈スコアを算出する。これによれば、情報処理装置１００は、さらに、重ね書きされる文字を精度よく認識可能にすることができる。 Further, for example, the information processing apparatus 100 uses a context score indicating the strength of validity that the combination of characters formed by the two stroke groups is a combination in line with the context, in addition to the validity score. The character written in the input field 110 may be recognized. The higher the context score, the higher the probability that the combination of characters formed by the two stroke groups is correct, and the higher the probability that the delimiter between the two stroke groups is correct. Specifically, the information processing apparatus 100 calculates a context score based on the context database described later in FIG. According to this, the information processing apparatus 100 can further make it possible to accurately recognize the characters to be overwritten.

また、ここでは、情報処理装置１００が、入力欄１１０を表示し、入力欄１１０に書き込まれた筆画の軌跡を示す筆画データを取得して蓄積する場合について説明したが、これに限らない。例えば、情報処理装置１００とは異なる装置が、入力欄１１０を表示し、入力欄１１０に書き込まれた筆画の軌跡を示す筆画データを取得して情報処理装置１００に送信し、情報処理装置１００が、受信した筆画データを蓄積する場合があってもよい。 Further, although the case where the information processing apparatus 100 displays the input field 110 and acquires and stores the stroke data indicating the locus of the stroke written in the input field 110 has been described here, the present invention is not limited to this. For example, a device different from the information processing device 100 displays the input field 110, acquires the brush stroke data indicating the locus of the brush stroke written in the input field 110, and transmits it to the information processing device 100, and the information processing device 100 , The received brush stroke data may be accumulated.

（情報処理装置１００のハードウェア構成例）
次に、図２を用いて、情報処理装置１００のハードウェア構成例について説明する。 (Hardware configuration example of information processing device 100)
Next, a hardware configuration example of the information processing apparatus 100 will be described with reference to FIG.

図２は、情報処理装置１００のハードウェア構成例を示すブロック図である。図２において、情報処理装置１００は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）２０１と、メモリ２０２と、ネットワークＩ／Ｆ（Ｉｎｔｅｒｆａｃｅ）２０３と、記録媒体Ｉ／Ｆ２０４と、記録媒体２０５と、タッチパネル２０６とを有する。また、各構成部は、バス２００によってそれぞれ接続される。 FIG. 2 is a block diagram showing a hardware configuration example of the information processing apparatus 100. In FIG. 2, the information processing apparatus 100 includes a CPU (Central Processing Unit) 201, a memory 202, a network I / F (Interface) 203, a recording medium I / F 204, a recording medium 205, and a touch panel 206. .. Further, each component is connected by a bus 200.

ここで、ＣＰＵ２０１は、情報処理装置１００の全体の制御を司る。メモリ２０２は、例えば、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）およびフラッシュＲＯＭなどを有する。具体的には、例えば、フラッシュＲＯＭやＲＯＭが各種プログラムを記憶し、ＲＡＭがＣＰＵ２０１のワークエリアとして使用される。メモリ２０２に記憶されるプログラムは、ＣＰＵ２０１にロードされることで、コーディングされている処理をＣＰＵ２０１に実行させる。また、メモリ２０２は、例えば、図１に示したモデルＭｏｄを記憶してもよい。また、メモリ２０２は、例えば、図３～図５に後述する各種データベースを記憶してもよい。 Here, the CPU 201 controls the entire information processing apparatus 100. The memory 202 includes, for example, a ROM (Read Only Memory), a RAM (Random Access Memory), a flash ROM, and the like. Specifically, for example, a flash ROM or ROM stores various programs, and RAM is used as a work area of CPU 201. The program stored in the memory 202 is loaded into the CPU 201 to cause the CPU 201 to execute the coded process. Further, the memory 202 may store, for example, the model mod shown in FIG. Further, the memory 202 may store various databases described later in FIGS. 3 to 5, for example.

ネットワークＩ／Ｆ２０３は、通信回線を通じてネットワーク２１０に接続され、ネットワーク２１０を介して他のコンピュータに接続される。そして、ネットワークＩ／Ｆ２０３は、ネットワーク２１０と内部のインターフェースを司り、他のコンピュータからのデータの入出力を制御する。ネットワークＩ／Ｆ２０３は、例えば、モデムやＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）アダプタなどである。 The network I / F 203 is connected to the network 210 through a communication line, and is connected to another computer via the network 210. Then, the network I / F 203 controls the internal interface with the network 210 and controls the input / output of data from another computer. The network I / F 203 is, for example, a modem, a LAN (Local Area Network) adapter, or the like.

記録媒体Ｉ／Ｆ２０４は、ＣＰＵ２０１の制御に従って記録媒体２０５に対するデータのリード／ライトを制御する。記録媒体Ｉ／Ｆ２０４は、例えば、ディスクドライブ、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）ポートなどである。記録媒体２０５は、記録媒体Ｉ／Ｆ２０４の制御で書き込まれたデータを記憶する不揮発メモリである。記録媒体２０５は、例えば、ディスク、半導体メモリ、ＵＳＢメモリなどである。記録媒体２０５は、情報処理装置１００から着脱可能であってもよい。また、記録媒体２０５は、例えば、図１に示したモデルＭｏｄを記憶してもよい。また、記録媒体２０５は、例えば、図３～図５に後述する各種データベースを記憶してもよい。 The recording medium I / F 204 controls read / write of data to the recording medium 205 according to the control of the CPU 201. The recording medium I / F 204 is, for example, a disk drive, an SSD (Solid State Drive), a USB (Universal Serial Bus) port, or the like. The recording medium 205 is a non-volatile memory that stores data written under the control of the recording medium I / F 204. The recording medium 205 is, for example, a disk, a semiconductor memory, a USB memory, or the like. The recording medium 205 may be detachable from the information processing apparatus 100. Further, the recording medium 205 may store, for example, the model mod shown in FIG. Further, the recording medium 205 may store various databases described later in FIGS. 3 to 5, for example.

タッチパネル２０６は、ディスプレイを有する。ディスプレイは、例えば、文字が重ね書きされる入力欄を表示する。また、ディスプレイは、例えば、カーソル、アイコン、ツールボックス、文書、画像、または、機能情報などを表示してもよい。タッチパネル２０６は、タッチパネル２０６上のユーザの接触位置を検出する検出装置を有する。ユーザの接触位置は、例えば、ユーザの指の接触位置、または、ユーザが所持するスタイラスの接触位置などである。検出装置は、例えば、ディスプレイ上、または、ディスプレイ外周部に設けられる。検出装置は、例えば、抵抗膜方式、静電容量方式、超音波方式、光学方式、または、電磁誘導方式などを用いて、接触位置を検出する。タッチパネル２０６は、ユーザの接触位置に応じて、ディスプレイに表示された入力欄に書き込まれた筆画を検出し、筆画の軌跡を示す筆画データを取得する。また、タッチパネル２０６は、各種指示などの入力を受け付けてもよい。 The touch panel 206 has a display. The display displays, for example, an input field in which characters are overwritten. The display may also display, for example, a cursor, an icon, a toolbox, a document, an image, or functional information. The touch panel 206 has a detection device that detects a user's contact position on the touch panel 206. The contact position of the user is, for example, the contact position of the user's finger or the contact position of the stylus possessed by the user. The detection device is provided, for example, on the display or on the outer peripheral portion of the display. The detection device detects the contact position by using, for example, a resistance film method, a capacitance method, an ultrasonic method, an optical method, an electromagnetic induction method, or the like. The touch panel 206 detects the stroke drawn in the input field displayed on the display according to the contact position of the user, and acquires the stroke data indicating the trajectory of the stroke. Further, the touch panel 206 may accept inputs such as various instructions.

情報処理装置１００は、上述した構成部の他、例えば、キーボード、プリンタ、スキャナ、マイク、スピーカーなどを有してもよい。また、情報処理装置１００は、記録媒体Ｉ／Ｆ２０４や記録媒体２０５を複数有していてもよい。また、情報処理装置１００は、記録媒体Ｉ／Ｆ２０４や記録媒体２０５を有していなくてもよい。 The information processing apparatus 100 may include, for example, a keyboard, a printer, a scanner, a microphone, a speaker, and the like, in addition to the above-mentioned components. Further, the information processing apparatus 100 may have a plurality of recording media I / F 204 and recording media 205. Further, the information processing apparatus 100 does not have to have the recording medium I / F 204 or the recording medium 205.

また、情報処理装置１００は、タッチパネル２０６に代わり、入力欄を表示するディスプレイと、入力欄に筆画を書き込むことが可能なポインティングデバイスとを有し、入力欄に書き込まれた筆画を検出し、筆画データを取得してもよい。ポインティングデバイスは、例えば、マウスである。 Further, the information processing apparatus 100 has a display for displaying an input field and a pointing device capable of writing a stroke in the input field instead of the touch panel 206, detects the stroke written in the input field, and draws a stroke. You may get the data. The pointing device is, for example, a mouse.

（辞書データベース３００の記憶内容）
次に、図３を用いて、辞書データベース３００の記憶内容の一例について説明する。辞書データベース３００は、例えば、図２に示した情報処理装置１００のメモリ２０２や記録媒体２０５などの記憶領域により実現される。 (Stored contents of dictionary database 300)
Next, an example of the stored contents of the dictionary database 300 will be described with reference to FIG. The dictionary database 300 is realized, for example, by a storage area such as a memory 202 or a recording medium 205 of the information processing apparatus 100 shown in FIG.

図３は、辞書データベース３００の記憶内容の一例を示す説明図である。図３に示すように、辞書データベース３００は、文字と特徴とのフィールドを有する。辞書データベース３００は、文字ごとに各フィールドに情報を設定することにより、辞書情報がレコードとして記憶される。 FIG. 3 is an explanatory diagram showing an example of the stored contents of the dictionary database 300. As shown in FIG. 3, the dictionary database 300 has fields for characters and features. The dictionary database 300 stores dictionary information as a record by setting information in each field for each character.

文字のフィールドには、文字が設定される。文字のフィールドには、例えば、文字を示す文字コードが設定されてもよい。特徴のフィールドには、文字の特徴を示す特徴情報が設定される。文字の特徴は、例えば、文字全体が存在する入力欄での領域の特徴である。文字の特徴は、例えば、文字の筆画の方向の特徴、または、文字の筆画間での方向の変化の特徴などである。特徴情報は、例えば、文字を解析して得られる特徴量を示す。 Characters are set in the character field. For example, a character code indicating a character may be set in the character field. In the feature field, feature information indicating the feature of the character is set. Character characteristics are, for example, the characteristics of the area in the input field where the entire character is present. Character features are, for example, the characteristics of the direction of the stroke of the character, or the characteristics of the change in direction between the strokes of the character. The feature information indicates, for example, a feature amount obtained by analyzing characters.

（文脈データベース４００の記憶内容）
次に、図４を用いて、文脈データベース４００の記憶内容の一例について説明する。文脈データベース４００は、例えば、図２に示した情報処理装置１００のメモリ２０２や記録媒体２０５などの記憶領域により実現される。 (Memory contents of context database 400)
Next, an example of the stored contents of the context database 400 will be described with reference to FIG. The context database 400 is realized, for example, by a storage area such as a memory 202 or a recording medium 205 of the information processing apparatus 100 shown in FIG.

図４は、文脈データベース４００の記憶内容の一例を示す説明図である。図４に示すように、文脈データベース４００は、文字列と、確率とのフィールドを有する。文脈データベース４００は、文字列ごとに各フィールドに情報を設定することにより、文脈情報がレコードとして記憶される。 FIG. 4 is an explanatory diagram showing an example of the stored contents of the context database 400. As shown in FIG. 4, the context database 400 has fields for strings and probabilities. The context database 400 stores the context information as a record by setting information in each field for each character string.

文字列のフィールドには、文字列である２つの文字が設定される。文字列のフィールドには、例えば、文字列である２つの文字のそれぞれの文字を示す文字コードが設定されてもよい。確率のフィールドには、２つの文字の文脈上の連続しやすさを示す度合い情報が設定される。確率のフィールドには、例えば、２つの文字のバイグラム確率が設定される。バイグラム確率は、ある文字の隣にある文字が並ぶ確率である。文脈データベース４００は、２以上の文字を含む文字列について、語句や活用形、語幹などを利用して、２以上の文字の文脈上の連続しやすさを示す度合い情報を記憶してもよい。 Two characters, which are character strings, are set in the character string field. In the field of the character string, for example, a character code indicating each character of the two characters which is a character string may be set. In the probability field, degree information indicating the degree of continuity in the context of the two characters is set. In the probability field, for example, the bigram probability of two characters is set. The bigram probability is the probability that the characters next to a character will line up. The context database 400 may store degree information indicating the ease of continuity of two or more characters in the context by using words, phrases, inflected forms, stems, and the like for a character string containing two or more characters.

（候補データベース５００の記憶内容）
次に、図５を用いて、候補データベース５００の記憶内容の一例について説明する。候補データベース５００は、例えば、図２に示した情報処理装置１００のメモリ２０２や記録媒体２０５などの記憶領域により実現される。 (Memory contents of candidate database 500)
Next, an example of the stored contents of the candidate database 500 will be described with reference to FIG. The candidate database 500 is realized, for example, by a storage area such as a memory 202 or a recording medium 205 of the information processing apparatus 100 shown in FIG.

図５は、候補データベース５００の記憶内容の一例を示す説明図である。図５に示すように、候補データベース５００は、１画目～Ｍ画目のそれぞれの項目に対応付けて、１画～Ｎ画のフィールドを有する。ｊ画のフィールドは、１以上の認識候補のフィールドを有する。候補データベース５００は、筆画ごとに各フィールドに情報を設定することにより、候補情報がレコードとして記憶される。 FIG. 5 is an explanatory diagram showing an example of the stored contents of the candidate database 500. As shown in FIG. 5, the candidate database 500 has fields of 1st to Nth strokes in association with each item of the 1st to Mth strokes. The field of the j image has one or more recognition candidate fields. The candidate database 500 stores the candidate information as a record by setting information in each field for each stroke.

ｉ画目の項目は、入力欄に書き込まれたＭ画の筆画のうちｉ番目の筆画に対応する。ｉは、１，２，・・・，Ｍである。ｊ画のフィールドには、ｉ番目の筆画からｊ画分の筆画を合成した合成結果である筆跡が形成する文字の候補が設定される。ｊは、１，２，・・・，ｍｉｎ（Ｍ－ｉ＋１，Ｎ）である。ｉがＭに近づくにつれ、ｉ画目より後ろに存在する筆画の数が少なくなるため、ｊの最大値は小さくなる。Ｎは、例えば、一般的な文字の最大画数が設定される。 The item of the i-th stroke corresponds to the i-th stroke of the stroke strokes of the M stroke written in the input field. i is 1, 2, ..., M. In the field of the j-stroke, a candidate for a character formed by a handwriting, which is a composite result of synthesizing the strokes of the j-th stroke from the i-th stroke, is set. j is 1, 2, ..., Min (M-i + 1, N). As i approaches M, the number of strokes existing after the i-th stroke decreases, so that the maximum value of j becomes smaller. For N, for example, the maximum number of strokes of a general character is set.

認識候補のフィールドには、ｉ番目の筆画からｊ画分の筆画を合成した合成結果の筆跡が形成する文字の候補、および、ｉ番目の筆画からｊ画分の筆画を合成した合成結果の筆跡が形成する文字の候補の妥当性を示す認識スコアが設定される。 In the recognition candidate fields, there are character candidates formed by the handwriting of the composition result of synthesizing the strokes of j strokes from the i-th stroke, and the handwriting of the composition result of synthesizing the strokes of j strokes from the i-th stroke. A recognition score is set to indicate the validity of the candidate characters formed by.

（情報処理装置１００の機能的構成例）
次に、図６を用いて、情報処理装置１００の機能的構成例について説明する。 (Example of functional configuration of information processing device 100)
Next, an example of a functional configuration of the information processing apparatus 100 will be described with reference to FIG.

図６は、情報処理装置１００の機能的構成例を示すブロック図である。情報処理装置１００は、記憶部６００と、取得部６０１と、特定部６０２と、妥当性スコア算出部６０３と、認識スコア算出部６０４と、文脈スコア算出部６０５と、重畳文字認識部６０６と、出力部６０７と、学習部６０８とを含む。 FIG. 6 is a block diagram showing a functional configuration example of the information processing apparatus 100. The information processing apparatus 100 includes a storage unit 600, an acquisition unit 601, a specific unit 602, a validity score calculation unit 603, a recognition score calculation unit 604, a context score calculation unit 605, and a superposed character recognition unit 606. The output unit 607 and the learning unit 608 are included.

記憶部６００は、例えば、図２に示したメモリ２０２や記録媒体２０５などの記憶領域によって実現される。以下では、記憶部６００が、情報処理装置１００に含まれる場合について説明するが、これに限らない。例えば、記憶部６００が、情報処理装置１００とは異なる装置に含まれ、記憶部６００の記憶内容が情報処理装置１００から参照可能である場合があってもよい。 The storage unit 600 is realized by, for example, a storage area such as the memory 202 or the recording medium 205 shown in FIG. Hereinafter, the case where the storage unit 600 is included in the information processing device 100 will be described, but the present invention is not limited to this. For example, the storage unit 600 may be included in a device different from the information processing device 100, and the stored contents of the storage unit 600 may be visible from the information processing device 100.

取得部６０１～学習部６０８は、制御部の一例として機能する。取得部６０１～学習部６０８は、具体的には、例えば、図２に示したメモリ２０２や記録媒体２０５などの記憶領域に記憶されたプログラムをＣＰＵ２０１に実行させることにより、または、ネットワークＩ／Ｆ２０３により、その機能を実現する。各機能部の処理結果は、例えば、図２に示したメモリ２０２や記録媒体２０５などの記憶領域に記憶される。 The acquisition unit 601 to the learning unit 608 function as an example of the control unit. Specifically, the acquisition unit 601 to the learning unit 608 may cause the CPU 201 to execute a program stored in a storage area such as the memory 202 or the recording medium 205 shown in FIG. 2, or the network I / F 203. To realize the function. The processing result of each functional unit is stored in a storage area such as the memory 202 or the recording medium 205 shown in FIG. 2, for example.

記憶部６００は、各機能部の処理において参照され、または更新される各種情報を記憶する。記憶部６００は、例えば、筆画データを記憶してもよい。筆画データは、１つの入力欄に書き込まれた筆画の軌跡を示すデータである。入力欄は、文字が重ね書きされる領域である。また、記憶部６００は、例えば、筆画グループを記憶してもよい。筆画グループは、時系列順に連続する１以上の筆画を含むグループである。 The storage unit 600 stores various information referred to or updated in the processing of each functional unit. The storage unit 600 may store stroke data, for example. The stroke data is data indicating the trajectory of the stroke written in one input field. The input field is an area where characters are overwritten. Further, the storage unit 600 may store, for example, a stroke group. The stroke group is a group including one or more consecutive strokes in chronological order.

記憶部６００は、例えば、所定のモデルを記憶してもよい。所定のモデルは、２つの筆画グループの特徴量を入力すると、２つの筆画グループの区切りの妥当性スコアを出力するモデルである。所定のモデルは、例えば、ニューラルネットワークを用いて実現される。筆画グループの特徴量は、例えば、筆画グループに含まれる少なくともいずれかの筆画に基づく特徴量である。 The storage unit 600 may store a predetermined model, for example. The predetermined model is a model that outputs the validity score of the division between the two stroke groups when the feature quantities of the two stroke groups are input. A given model is realized, for example, using a neural network. The feature amount of the stroke group is, for example, a feature amount based on at least one stroke included in the stroke group.

筆画グループの特徴量は、例えば、筆画グループに含まれる先頭の筆画および末尾の筆画の少なくともいずれかの筆画の入力欄での位置である。また、筆画グループの特徴量は、例えば、筆画グループ全体の入力欄での位置であってもよい。また、筆画グループの特徴量は、例えば、筆画グループに含まれる筆画の数であってもよい。妥当性スコアは、２つの筆画グループの区切りが、文字間の区切りである妥当性の強さを示すスコアである。 The feature amount of the stroke group is, for example, the position in the input field of at least one of the first stroke and the last stroke included in the stroke group. Further, the feature amount of the stroke group may be, for example, a position in the input field of the entire stroke group. Further, the feature amount of the stroke group may be, for example, the number of strokes included in the stroke group. The validity score is a score indicating the strength of validity in which the delimiter between two stroke groups is a delimiter between characters.

記憶部６００は、例えば、妥当性スコアを記憶してもよい。記憶部６００は、例えば、筆画グループが形成する文字の候補および認識スコアを記憶してもよい。認識スコアは、例えば、筆画グループが形成する文字の候補の妥当性の強さを示すスコアである。認識スコアは、例えば、認識スコア算出部６０４によって算出される。認識スコアは、具体的には、最高値を１００、最低値を０として、文字としての確からしさが高いほど数値が高くなるようなスコアである。記憶部６００は、具体的には、図５に示した候補データベース５００を記憶してもよい。記憶部６００は、例えば、筆画グループが形成する文字の候補を生成し、認識スコアを算出するための情報を記憶してもよい。記憶部６００は、具体的には、図３に示した辞書データベース３００を記憶してもよい。 The storage unit 600 may store the validity score, for example. The storage unit 600 may store, for example, character candidates and recognition scores formed by the stroke group. The recognition score is, for example, a score indicating the strength of the validity of the character candidates formed by the stroke group. The recognition score is calculated by, for example, the recognition score calculation unit 604. Specifically, the recognition score is a score in which the maximum value is 100 and the minimum value is 0, and the higher the certainty as a character, the higher the numerical value. Specifically, the storage unit 600 may store the candidate database 500 shown in FIG. The storage unit 600 may, for example, generate a character candidate formed by a stroke group and store information for calculating a recognition score. Specifically, the storage unit 600 may store the dictionary database 300 shown in FIG.

記憶部６００は、例えば、２つの筆画グループが形成する文字の候補の組み合わせの文脈スコアを記憶してもよい。文脈スコアは、例えば、文字の候補の組み合わせが、文脈に沿った組み合わせである妥当性の強さを示すスコアである。文脈スコアは、例えば、文脈スコア算出部６０５によって算出される。文脈スコアは、具体的には、日本語であれば、文字の並びの日本語としての尤度が高いほど、数値が高くなるようなスコアである。例えば、「好き」と「女子き」という文字の並びがあれば、「好き」は「女子き」よりも日本語としての尤度が高いため、「好き」は「女子き」よりも算出される文脈スコアが高くなる。文脈スコアは、具体的には、バイグラム確率に基づいて算出される。記憶部６００は、例えば、文脈スコアを算出するための情報を記憶してもよい。記憶部６００は、具体的には、図４に示した文脈データベース４００を記憶してもよい。 The storage unit 600 may store, for example, a context score of a combination of character candidates formed by two stroke groups. The context score is, for example, a score indicating the strength of validity that the combination of character candidates is a combination in context. The context score is calculated by, for example, the context score calculation unit 605. Specifically, in the case of Japanese, the context score is such that the higher the likelihood of the character sequence as Japanese, the higher the numerical value. For example, if there is a sequence of letters "like" and "girl's ki", "like" is more likely to be Japanese than "girl's ki", so "like" is calculated more than "girl's ki". The context score is high. The context score is specifically calculated based on the bigram probability. The storage unit 600 may store information for calculating the context score, for example. Specifically, the storage unit 600 may store the context database 400 shown in FIG.

記憶部６００は、例えば、教師データを記憶してもよい。教師データは、例えば、ペアにされた２つの筆画グループと、ペアにされた２つの筆画グループの区切りが、文字間の区切りとして妥当であるか否かを示す回答データとを対応付けて表すデータである。教師データは、所定のモデルを学習するためのデータである。教師データは、具体的には、ペアにされた２つの筆画グループのそれぞれの筆画グループの特徴量と、回答データとを対応付けて表すデータであってもよい。教師データは、所定のモデルを学習するためのデータである。記憶部６００は、情報処理装置１００で所定のモデルを学習しない場合、教師データを記憶しなくてよい。 The storage unit 600 may store, for example, teacher data. The teacher data is, for example, data representing in association with two paired stroke groups and answer data indicating whether or not the delimiter between the two paired stroke groups is appropriate as a delimiter between characters. Is. Teacher data is data for learning a predetermined model. Specifically, the teacher data may be data representing the feature amount of each stroke group of the two paired stroke groups in association with the response data. Teacher data is data for learning a predetermined model. When the information processing apparatus 100 does not learn a predetermined model, the storage unit 600 does not have to store the teacher data.

取得部６０１は、各機能部の処理に用いられる各種情報を取得する。取得部６０１は、取得した各種情報を、記憶部６００に記憶し、または、各機能部に出力する。また、取得部６０１は、記憶部６００に記憶しておいた各種情報を、各機能部に出力してもよい。取得部６０１は、例えば、ユーザの操作入力に基づき、各種情報を取得する。取得部６０１は、例えば、情報処理装置１００とは異なる装置から、各種情報を受信してもよい。 The acquisition unit 601 acquires various information used for processing of each functional unit. The acquisition unit 601 stores various acquired information in the storage unit 600 or outputs it to each function unit. Further, the acquisition unit 601 may output various information stored in the storage unit 600 to each function unit. The acquisition unit 601 acquires various information based on, for example, a user's operation input. The acquisition unit 601 may receive various information from a device different from the information processing device 100, for example.

取得部６０１は、例えば、筆画データを蓄積する。取得部６０１は、具体的には、入力欄に筆画が書き込まれる都度、筆画を検出し、筆画の軌跡を示す筆画データを取得して記憶部６００に蓄積する。また、取得部６０１は、例えば、教師データを取得してもよい。また、取得部６０１は、例えば、情報処理装置１００で所定のモデルを学習しない場合、所定のモデルを取得してもよい。 The acquisition unit 601 stores, for example, stroke data. Specifically, the acquisition unit 601 detects the stroke stroke each time the stroke stroke is written in the input field, acquires the stroke stroke data indicating the trajectory of the stroke stroke, and stores it in the storage unit 600. Further, the acquisition unit 601 may acquire teacher data, for example. Further, the acquisition unit 601 may acquire a predetermined model, for example, when the information processing apparatus 100 does not learn the predetermined model.

特定部６０２は、蓄積した筆画データを参照して、入力欄に書き込まれた、複数の筆画グループを特定する。特定部６０２は、例えば、入力欄に書き込まれた、Ｍ画の筆画のうち、ｉ画目の筆画からｊ画分の筆画を含む筆画グループを特定する。ｉは、１，２，・・・，Ｍである。ｊは、１，２，・・・，ｍｉｎ（Ｍ－ｉ＋１，Ｎ）である。ｉがＭに近づくにつれ、ｉ画目より後ろに存在する筆画の数が少なくなるため、ｊの最大値は小さくなる。Ｎは、予め設定される。Ｎは、例えば、一般的な文字の最大画数が設定される。 The specifying unit 602 identifies a plurality of stroke groups written in the input field with reference to the accumulated stroke data. The specifying unit 602 specifies, for example, a stroke group including the strokes of j strokes from the stroke of the i-th stroke among the strokes of the M stroke written in the input field. i is 1, 2, ..., M. j is 1, 2, ..., Min (M-i + 1, N). As i approaches M, the number of strokes existing after the i-th stroke decreases, so that the maximum value of j becomes smaller. N is preset. For N, for example, the maximum number of strokes of a general character is set.

ここで、特定部６０２は、特定しうる筆画グループを複数またはすべて特定してもよいため、一部の筆画が重複して含まれる２以上の筆画グループを特定してもよい。これにより、特定部６０２は、文字を形成しうる筆画グループを特定することができ、妥当性スコア算出部６０３、認識スコア算出部６０４、および、文脈スコア算出部６０５の処理を実行可能にすることができる。 Here, since the specifying unit 602 may specify a plurality or all of the stroke groups that can be specified, two or more stroke groups in which some strokes are duplicated may be specified. As a result, the specific unit 602 can specify the stroke group that can form a character, and enables the processing of the validity score calculation unit 603, the recognition score calculation unit 604, and the context score calculation unit 605 to be executed. Can be done.

妥当性スコア算出部６０３は、蓄積した筆画データを参照して、入力欄に書き込まれた、先行の筆画グループと、後続の筆画グループとの特徴量に基づいて、先行の筆画グループと、後続の筆画グループとの区切りについての妥当性スコアを算出する。後続の筆画グループは、先行の筆画グループの直後に続く筆画グループである。妥当性スコア算出部６０３は、例えば、先行の筆画グループと、後続の筆画グループとの特徴量を、所定のモデルに入力することにより、妥当性スコアを算出する。これにより、妥当性スコア算出部６０３は、１つの文字を形成する筆画の集まりを特定する指標となる区切りの尤もらしさを取得することができる。 The validity score calculation unit 603 refers to the accumulated stroke data, and based on the feature quantities of the preceding stroke group and the succeeding stroke group written in the input field, the preceding stroke group and the succeeding stroke group Calculate the validity score for the division with the stroke group. The subsequent stroke group is the stroke group immediately following the preceding stroke group. The validity score calculation unit 603 calculates the validity score by inputting, for example, the feature amounts of the preceding stroke group and the subsequent stroke group into a predetermined model. As a result, the validity score calculation unit 603 can acquire the plausibility of the delimiter as an index for specifying the group of strokes forming one character.

妥当性スコア算出部６０３は、先行の筆画グループと、後続の筆画グループとを組み合わせた２つの筆画グループのペアごとに、ペアにされた２つの筆画グループの区切りの妥当性スコアを算出してもよい。ここで、妥当性スコア算出部６０３は、ペアにされた２つの筆画グループのそれぞれの筆画グループの特徴量に基づいて、ペアにされた２つの筆画グループの区切りの妥当性スコアを算出する。妥当性スコア算出部６０３は、例えば、ペアにされた２つの筆画グループのそれぞれの筆画グループの特徴量を、所定のモデルに入力することにより、妥当性スコアを算出する。これにより、妥当性スコア算出部６０３は、文字を形成する筆画の集まりの区切りを網羅的に検証することができ、１つの文字を形成する筆画の集まりを特定する指標となる区切りの尤もらしさを取得することができる。 Even if the validity score calculation unit 603 calculates the validity score of the division between the two paired stroke groups for each pair of two stroke groups in which the preceding stroke group and the subsequent stroke group are combined. good. Here, the validity score calculation unit 603 calculates the validity score of the division between the two paired stroke groups based on the feature amount of each stroke group of the two paired stroke groups. The validity score calculation unit 603 calculates the validity score by inputting, for example, the feature amount of each stroke group of the two paired stroke groups into a predetermined model. As a result, the validity score calculation unit 603 can comprehensively verify the delimiter of the group of strokes forming a character, and the plausibility of the delimiter as an index for specifying the group of strokes forming one character. Can be obtained.

妥当性スコア算出部６０３は、具体的には、先行の筆画グループに含まれる末尾の筆画と、後続の筆画グループに含まれる先頭の筆画と、後続の筆画グループに含まれる末尾の筆画との少なくともいずれかの筆画の入力欄での位置に基づいて、妥当性スコアを算出してもよい。妥当性スコア算出部６０３は、より具体的には、先行の筆画グループに含まれる末尾の筆画の入力欄での位置と、後続の筆画グループに含まれる先頭の筆画および末尾の筆画の入力欄での位置を、所定のモデルに入力することにより、妥当性スコアを算出する。これにより、妥当性スコア算出部６０３は、文字の区切りとしての正しさが表れやすい、筆画の特徴に基づいて、妥当性スコアを精度よく算出可能にすることができる。また、妥当性スコア算出部６０３は、先行の筆画グループに含まれるすべての筆画と、後続の筆画グループに含まれるすべての筆画とを考慮する場合に比べて、処理量の低減化を図ることができる。 Specifically, the validity score calculation unit 603 includes at least the last stroke included in the preceding stroke group, the first stroke included in the subsequent stroke group, and the last stroke included in the subsequent stroke group. The validity score may be calculated based on the position of any stroke in the input field. More specifically, the validity score calculation unit 603 is in the position in the input field of the last stroke included in the preceding stroke group and in the input field of the first stroke and the last stroke included in the subsequent stroke group. The validity score is calculated by inputting the position of. As a result, the validity score calculation unit 603 can accurately calculate the validity score based on the characteristics of the stroke, which tends to show the correctness as a character delimiter. Further, the validity score calculation unit 603 can reduce the processing amount as compared with the case where all the strokes included in the preceding stroke group and all the strokes included in the subsequent stroke group are taken into consideration. can.

妥当性スコア算出部６０３は、具体的には、さらに、先行の筆画グループに含まれる先頭の筆画の入力欄での位置に基づいて、妥当性スコアを算出してもよい。妥当性スコア算出部６０３は、より具体的には、さらに、先行の筆画グループに含まれる先頭の筆画の入力欄での位置を、所定のモデルに入力することにより、妥当性スコアを算出する。これにより、妥当性スコア算出部６０３は、妥当性スコアを算出する精度の向上を図ることができる。 Specifically, the validity score calculation unit 603 may further calculate the validity score based on the position in the input field of the first stroke included in the preceding stroke group. More specifically, the validity score calculation unit 603 calculates the validity score by inputting the position in the input field of the first stroke included in the preceding stroke group into a predetermined model. As a result, the validity score calculation unit 603 can improve the accuracy of calculating the validity score.

妥当性スコア算出部６０３は、具体的には、さらに、先行の筆画グループに含まれる筆画の数と、後続の筆画グループに含まれる筆画の数とに基づいて、妥当性スコアを算出してもよい。妥当性スコア算出部６０３は、より具体的には、さらに、先行の筆画グループに含まれる筆画の数と、後続の筆画グループに含まれる筆画の数とを、所定のモデルに入力することにより、妥当性スコアを算出する。これにより、妥当性スコア算出部６０３は、妥当性スコアを算出する精度の向上を図ることができる。 Specifically, the validity score calculation unit 603 may further calculate the validity score based on the number of strokes included in the preceding stroke group and the number of strokes included in the subsequent stroke group. good. More specifically, the validity score calculation unit 603 inputs the number of strokes included in the preceding stroke group and the number of strokes included in the subsequent stroke group into a predetermined model. Calculate the validity score. As a result, the validity score calculation unit 603 can improve the accuracy of calculating the validity score.

妥当性スコア算出部６０３は、具体的には、さらに、先行の筆画グループ全体の入力欄での位置と、後続の筆画グループ全体の入力欄での位置とに基づいて、妥当性スコアを算出してもよい。妥当性スコア算出部６０３は、より具体的には、さらに、先行の筆画グループ全体の入力欄での位置と、後続の筆画グループ全体の入力欄での位置とを、所定のモデルに入力することにより、妥当性スコアを算出する。これにより、妥当性スコア算出部６０３は、妥当性スコアを算出する精度の向上を図ることができる。 Specifically, the validity score calculation unit 603 calculates the validity score based on the position in the input field of the entire preceding stroke group and the position in the input field of the entire subsequent stroke group. You may. More specifically, the validity score calculation unit 603 inputs the position in the input field of the entire preceding stroke group and the position in the input field of the entire subsequent stroke group into a predetermined model. To calculate the validity score. As a result, the validity score calculation unit 603 can improve the accuracy of calculating the validity score.

認識スコア算出部６０４は、蓄積した筆画データを参照して、先行の筆画グループが形成する文字の候補を生成し、先行の筆画グループが形成する文字の候補の妥当性の強さを示す認識スコアを算出する。認識スコア算出部６０４は、例えば、辞書データベース３００を参照して、先行の筆画グループに含まれる１以上の筆画を合成した合成結果の筆跡の特徴量に対応する文字の候補を生成し、認識スコアを算出する。 The recognition score calculation unit 604 generates character candidates formed by the preceding stroke group with reference to the accumulated stroke data, and indicates the strength of the validity of the character candidates formed by the preceding stroke group. Is calculated. The recognition score calculation unit 604, for example, refers to the dictionary database 300, generates character candidates corresponding to the feature amount of the handwriting of the synthesis result obtained by synthesizing one or more strokes included in the preceding stroke group, and generates a recognition score. Is calculated.

また、認識スコア算出部６０４は、蓄積した筆画データを参照して、後続の筆画グループが形成する文字の候補を生成し、後続の筆画グループが形成する文字の候補の妥当性の強さを示す認識スコアを算出する。認識スコア算出部６０４は、例えば、辞書データベース３００を参照して、後続の筆画グループに含まれる１以上の筆画を合成した合成結果の筆跡の特徴量に対応する文字の候補を生成し、認識スコアを算出する。これにより、認識スコア算出部６０４は、認識スコアに基づいて、ユーザが書き込んだ文字がいずれの候補であるかを判断可能にすることができ、ユーザが書き込んだ文字を認識可能にすることができる。 Further, the recognition score calculation unit 604 generates character candidates formed by the subsequent stroke group by referring to the accumulated stroke data, and shows the validity of the character candidates formed by the subsequent stroke group. Calculate the recognition score. The recognition score calculation unit 604, for example, refers to the dictionary database 300, generates character candidates corresponding to the feature amount of the handwriting of the synthesis result obtained by synthesizing one or more strokes included in the subsequent stroke group, and generates a recognition score. Is calculated. Thereby, the recognition score calculation unit 604 can determine which candidate the character written by the user is based on the recognition score, and can make the character written by the user recognizable. ..

認識スコア算出部６０４は、ペアのうち、算出した妥当性スコアが条件を満たすペアごとに、条件を満たすペアにされた２つの筆画グループのそれぞれの筆画グループが形成する文字の候補を生成し、認識スコアを算出してもよい。条件は、例えば、妥当性スコアが一定以上であることである。認識スコア算出部６０４は、例えば、辞書データベース３００を参照して、それぞれの筆画グループに含まれる１以上の筆画を合成した合成結果の筆跡の特徴量に対応する文字の候補を生成し、認識スコアを算出する。これにより、認識スコア算出部６０４は、ユーザが書き込んだ文字を表す確率が相対的に低いペアについての処理を省略し、文字を認識する精度の低下を抑制しつつ、処理量の低減化を図ることができる。 The recognition score calculation unit 604 generates character candidates formed by each stroke group of the two stroke groups that satisfy the condition for each pair that satisfies the calculated validity score. The recognition score may be calculated. The condition is, for example, that the validity score is above a certain level. The recognition score calculation unit 604, for example, refers to the dictionary database 300, generates character candidates corresponding to the feature amount of the handwriting of the synthesis result obtained by synthesizing one or more strokes included in each stroke group, and generates a recognition score. Is calculated. As a result, the recognition score calculation unit 604 omits the processing for the pair having a relatively low probability of representing the character written by the user, suppresses the decrease in the accuracy of recognizing the character, and reduces the processing amount. be able to.

文脈スコア算出部６０５は、蓄積した筆画データを参照して、先行の筆画グループが形成する文字の候補と、後続の筆画グループが形成する文字の候補との組み合わせの文脈スコアを算出する。文脈スコア算出部６０５は、例えば、文脈データベース４００を参照して、先行の筆画グループが形成する文字の候補と、後続の筆画グループが形成する文字の候補との組み合わせに対応するバイグラム確率に基づいて、文脈スコアを算出する。これにより、文脈スコア算出部６０５は、文字の候補のうち、文脈上尤もらしい文字を特定可能にすることができ、文字を認識する精度の向上を図ることができる。 The context score calculation unit 605 calculates the context score of the combination of the character candidate formed by the preceding stroke group and the character candidate formed by the subsequent stroke group with reference to the accumulated stroke data. The context score calculation unit 605, for example, refers to the context database 400 and is based on the bigram probability corresponding to the combination of the character candidates formed by the preceding stroke group and the character candidates formed by the subsequent stroke group. , Calculate the context score. As a result, the context score calculation unit 605 can identify characters that are likely to be contextual among the character candidates, and can improve the accuracy of recognizing the characters.

文脈スコア算出部６０５は、ペアのうち、算出した妥当性スコアが条件を満たすペアごとに、条件を満たすペアにされた２つの筆画グループのそれぞれの筆画グループが形成する文字の候補の組み合わせの文脈スコアを算出してもよい。文脈スコア算出部６０５は、例えば、文脈データベース４００を参照して、それぞれの筆画グループが形成する文字の候補との組み合わせに対応するバイグラム確率に基づいて、文脈スコアを算出する。これにより、文脈スコア算出部６０５は、ユーザが書き込んだ文字を表す確率が相対的に低いペアについての処理を省略し、文字を認識する精度の低下を抑制しつつ、処理量の低減化を図ることができる。 The context score calculation unit 605 describes the context of the combination of character candidates formed by each stroke group of the two stroke groups that are paired to satisfy the condition for each pair that satisfies the calculated validity score. You may calculate the score. The context score calculation unit 605, for example, refers to the context database 400 and calculates the context score based on the bigram probability corresponding to the combination with the character candidates formed by each stroke group. As a result, the context score calculation unit 605 omits the processing for the pair having a relatively low probability of representing the character written by the user, suppresses the decrease in the accuracy of recognizing the character, and reduces the processing amount. be able to.

重畳文字認識部６０６は、蓄積した筆画データと、算出した妥当性スコアとに基づいて、入力欄に書き込まれた１以上の文字を認識する。重畳文字認識部６０６は、例えば、いずれかの２つの筆画グループについて算出した妥当性スコアが一定以上であれば、それぞれの筆画グループに含まれる１以上の筆画の合成結果の筆跡が示す文字を、入力欄に書き込まれた文字として認識する。これにより、重畳文字認識部６０６は、認識する文字の種別を制限せず、認識する文字の種別が増加しても、文字の区切りがユーザから明示されずとも、ユーザによって１つの入力欄に重ね書きされる文字を精度よく認識することができる。 The superimposed character recognition unit 606 recognizes one or more characters written in the input field based on the accumulated stroke data and the calculated validity score. For example, if the validity score calculated for any two stroke groups is a certain value or more, the superimposed character recognition unit 606 displays the characters indicated by the handwriting of the composition result of one or more strokes included in each stroke group. Recognize as characters written in the input field. As a result, the superimposed character recognition unit 606 does not limit the types of characters to be recognized, and even if the types of characters to be recognized increase or the character delimiters are not specified by the user, the superimposed character recognition unit 606 superimposes them on one input field by the user. The characters to be written can be recognized accurately.

重畳文字認識部６０６は、例えば、生成した候補および認識スコアと、算出した妥当性スコアとに基づいて、入力欄に書き込まれた１以上の文字を認識してもよい。重畳文字認識部６０６は、具体的には、認識スコアと妥当性スコアとに基づいて、２つの筆画グループのペアにおける文字の候補の組み合わせごとに総合スコアを算出する。そして、重畳文字認識部６０６は、総合スコアが最大の組み合わせである文字の候補を、入力欄に書き込まれた文字として認識する。ここで、重畳文字認識部６０６は、総合スコアが一定以上の組み合わせである文字の候補を、入力欄に書き込まれた文字の候補として認識してもよい。これにより、重畳文字認識部６０６は、認識スコアに基づき、ユーザによって１つの入力欄に重ね書きされる文字を、さらに精度よく認識することができる。 The superimposed character recognition unit 606 may recognize one or more characters written in the input field based on, for example, the generated candidate and recognition score and the calculated validity score. Specifically, the superimposed character recognition unit 606 calculates the total score for each combination of character candidates in the pair of two stroke groups based on the recognition score and the validity score. Then, the superimposed character recognition unit 606 recognizes the character candidate having the maximum total score as the character written in the input field. Here, the superimposed character recognition unit 606 may recognize a character candidate having a combination of a total score of a certain value or more as a character candidate written in the input field. As a result, the superimposed character recognition unit 606 can more accurately recognize the characters overwritten in one input field by the user based on the recognition score.

重畳文字認識部６０６は、例えば、算出した文脈スコアと、生成した候補および認識スコアと、算出した妥当性スコアとに基づいて、入力欄に書き込まれた１以上の文字を認識してもよい。重畳文字認識部６０６は、具体的には、文脈スコアと認識スコアと妥当性スコアとに基づいて、２つの筆画グループのペアにおける文字の候補の組み合わせごとに総合スコアを算出する。そして、重畳文字認識部６０６は、総合スコアが最大の組み合わせである文字の候補を、入力欄に書き込まれた文字として認識する。ここで、重畳文字認識部６０６は、総合スコアが一定以上の組み合わせである文字の候補を、入力欄に書き込まれた文字の候補として認識してもよい。これにより、重畳文字認識部６０６は、文脈スコアに基づき、ユーザによって１つの入力欄に重ね書きされる文字を、さらに精度よく認識することができる。 The superimposed character recognition unit 606 may recognize one or more characters written in the input field based on, for example, the calculated context score, the generated candidate and recognition score, and the calculated validity score. Specifically, the superimposed character recognition unit 606 calculates the total score for each combination of character candidates in the pair of two stroke groups based on the context score, the recognition score, and the validity score. Then, the superimposed character recognition unit 606 recognizes the character candidate having the maximum total score as the character written in the input field. Here, the superimposed character recognition unit 606 may recognize a character candidate having a combination of a total score of a certain value or more as a character candidate written in the input field. As a result, the superimposed character recognition unit 606 can more accurately recognize the characters overwritten in one input field by the user based on the context score.

出力部６０７は、各種情報を出力する。出力形式は、例えば、ディスプレイへの表示、プリンタへの印刷出力、ネットワークＩ／Ｆ２０３による外部装置への送信、または、メモリ２０２や記録媒体２０５などの記憶領域への記憶である。各種情報は、例えば、いずれかの機能部の処理結果である。出力部６０７は、例えば、重畳文字認識部６０６の認識結果を出力する。これにより、出力部６０７は、ユーザが書き込んだ文字を利用可能にすることができる。また、出力部６０７は、認識した文字をユーザが確認可能にすることができる。 The output unit 607 outputs various information. The output format is, for example, display on a display, print output to a printer, transmission to an external device by network I / F 203, or storage in a storage area such as a memory 202 or a recording medium 205. The various information is, for example, the processing result of any of the functional units. The output unit 607 outputs, for example, the recognition result of the superimposed character recognition unit 606. As a result, the output unit 607 can make the characters written by the user available. Further, the output unit 607 can enable the user to confirm the recognized character.

学習部６０８は、取得した教師データを利用して、ペアにされた２つの筆画グループのそれぞれの筆画グループの特徴量を入力すると、ペアにされた２つの筆画グループの区切りの妥当性スコアを出力する所定のモデルを学習する。学習部６０８は、学習モデルを記憶部６００に記憶する。学習部６０８は、例えば、教師データが示すペアにされた２つの筆画グループのそれぞれの筆画グループの特徴量と、２つの筆画グループの区切りの正しさを示す回答データとを、ニューラルネットワークに入力する。 The learning unit 608 uses the acquired teacher data to input the feature amount of each stroke group of the two paired stroke groups, and outputs the validity score of the division between the two paired stroke groups. Learn a given model. The learning unit 608 stores the learning model in the storage unit 600. The learning unit 608 inputs, for example, the feature amount of each stroke group of the two stroke groups paired as shown by the teacher data and the answer data indicating the correctness of the division between the two stroke groups into the neural network. ..

ここで、学習部６０８は、例えば、筆画グループの特徴量としては、筆画グループに含まれる少なくともいずれかの筆画に基づく特徴量を利用する。学習部６０８は、具体的には、筆画グループに含まれる先頭または末尾の筆画の特徴量を利用する。特徴量は、例えば、入力欄での位置である。また、学習部６０８は、具体的には、筆画グループに含まれるすべての筆画の特徴量、または、筆画グループに含まれる筆画の数などを利用してもよい。これにより、学習部６０８は、教師データを利用して、妥当性スコアを精度よく算出可能なモデルを学習することができ、モデルの生成にかかる作業負担の低減化を図ることができる。 Here, the learning unit 608 uses, for example, as the feature amount of the stroke group, the feature amount based on at least one stroke included in the stroke group. Specifically, the learning unit 608 uses the feature amount of the first or last stroke included in the stroke group. The feature amount is, for example, a position in an input field. In addition, the learning unit 608 may specifically use the feature quantities of all the strokes included in the stroke group, the number of strokes included in the stroke group, and the like. As a result, the learning unit 608 can learn a model capable of accurately calculating the validity score by using the teacher data, and can reduce the work load required for generating the model.

ここでは、情報処理装置１００が、学習部６０８を有する場合について説明したが、これに限らない。例えば、情報処理装置１００が、学習部６０８を有さない場合があってもよい。この場合、情報処理装置１００は、自装置で所定のモデルを学習せず、所定のモデルを学習する他の装置から、所定のモデルを取得する。 Here, the case where the information processing apparatus 100 has the learning unit 608 has been described, but the present invention is not limited to this. For example, the information processing apparatus 100 may not have the learning unit 608. In this case, the information processing apparatus 100 does not learn a predetermined model by its own device, but acquires a predetermined model from another device that learns the predetermined model.

（情報処理装置１００の動作例）
次に、図７～図１６を用いて、情報処理装置１００の動作例について説明する。まず、図７および図８を用いて、情報処理装置１００がニューラルネットワークを生成する一例について説明する。 (Operation example of information processing device 100)
Next, an operation example of the information processing apparatus 100 will be described with reference to FIGS. 7 to 16. First, an example in which the information processing apparatus 100 generates a neural network will be described with reference to FIGS. 7 and 8.

図７および図８は、情報処理装置１００がニューラルネットワークを生成する一例を示す説明図である。図７において、情報処理装置１００は、例えば、正解を示す教師データをニューラルネットワークに学習させる。図７の例では、教師データは、入力欄７００に書き込まれた筆画７０１～７０９を、筆画７０１～７０４を含む先行の筆画グループと、筆画７０５～７０９を含む後続の筆画グループとに区切る場合に対応する。 7 and 8 are explanatory views showing an example in which the information processing apparatus 100 generates a neural network. In FIG. 7, the information processing apparatus 100 causes a neural network to learn, for example, teacher data indicating a correct answer. In the example of FIG. 7, when the teacher data divides the strokes 701 to 709 written in the input field 700 into a preceding stroke group including the strokes 701 to 704 and a subsequent stroke group including the strokes 705 to 709. handle.

教師データは、先行の筆画グループの先頭の筆画、末尾の筆画、筆画グループ全体の位置を示す位置データＡを含む。また、教師データは、後続の筆画グループの先頭の筆画、末尾の筆画、筆画グループ全体の位置を示す位置データＢを含む。教師データは、先行の筆画グループに含まれる筆画の数を示す画数データＣを含む。また、教師データは、後続の筆画グループに含まれる筆画の数を示す画数データＤを含む。 The teacher data includes the first stroke of the preceding stroke group, the last stroke, and the position data A indicating the position of the entire stroke group. Further, the teacher data includes the first stroke of the subsequent stroke group, the last stroke, and the position data B indicating the position of the entire stroke group. The teacher data includes stroke number data C indicating the number of strokes included in the preceding stroke group. Further, the teacher data includes stroke number data D indicating the number of strokes included in the subsequent stroke group.

また、先行の筆画グループに含まれる筆画７０１～７０４を合成した合成結果の筆跡７１１は正しく文字「文」を形成し、かつ、後続の筆画グループに含まれる筆画７０５～７０９を合成した合成結果の筆跡７１２は正しく文字「字」を形成する。このため、教師データは、例えば、先行の筆画グループと後続の筆画グループとの区切りが、文字の区切りとして正しいことを示す回答データＥを含む。 In addition, the handwriting 711 of the composition result of synthesizing the strokes 701 to 704 included in the preceding stroke group correctly forms the character "sentence", and the composition result of synthesizing the strokes 705 to 709 included in the subsequent stroke group. The stroke 712 correctly forms the character "character". Therefore, the teacher data includes, for example, the answer data E indicating that the delimiter between the preceding stroke group and the subsequent stroke group is correct as the character delimiter.

これにより、情報処理装置１００は、ニューラルネットワークに、２つの筆画グループの区切りが文字間の区切りとして正しい場合に、２つの筆画グループに含まれる筆画の特徴がどのような状態になるかを学習させることができる。次に、図８の説明に移行する。 As a result, the information processing apparatus 100 causes the neural network to learn what the characteristics of the strokes included in the two stroke groups will be when the division between the two stroke groups is correct as the division between characters. be able to. Next, the description shifts to FIG.

図８において、情報処理装置１００は、例えば、不正解を示す教師データをニューラルネットワークに学習させる。図８の例では、教師データは、入力欄７００に書き込まれた筆画７０１～７０９を、筆画７０１～７０５を含む先行の筆画グループと、筆画７０６～７０９を含む後続の筆画グループとに区切る場合に対応する。 In FIG. 8, the information processing apparatus 100 causes a neural network to learn, for example, teacher data indicating an incorrect answer. In the example of FIG. 8, when the teacher data divides the strokes 701 to 709 written in the input field 700 into a preceding stroke group including the strokes 701 to 705 and a subsequent stroke group including the strokes 706 to 709. handle.

また、先行の筆画グループに含まれる筆画７０１～７０５を合成した合成結果の筆跡８１１は文字「文」を形成せず、かつ、後続の筆画グループに含まれる筆画７０６～７０９を合成した合成結果の筆跡８１２は文字「字」を形成しない。このため、教師データは、例えば、先行の筆画グループと後続の筆画グループとの区切りが、文字の区切りとして正しくないことを示す回答データＥを含む。 In addition, the handwriting 811 of the composition result of synthesizing the strokes 701 to 705 included in the preceding stroke group does not form a character "sentence", and the composition result of synthesizing the strokes 706 to 709 included in the subsequent stroke group. The stroke 812 does not form the character "character". Therefore, the teacher data includes, for example, the answer data E indicating that the delimiter between the preceding stroke group and the subsequent stroke group is not correct as the character delimiter.

これにより、情報処理装置１００は、ニューラルネットワークに、２つの筆画グループの区切りが文字間の区切りとして誤っている場合に、２つの筆画グループに含まれる筆画の特徴がどのような状態になるかを学習させることができる。このように、情報処理装置１００は、ニューラルネットワークに、２つの筆画グループに含まれる筆画の特徴と、２つの筆画グループの区切りが文字間の区切りとして正しいか否かの関係についての各種性質を学習させることができる。 As a result, the information processing apparatus 100 tells the neural network what the characteristics of the strokes included in the two stroke groups will be when the division between the two stroke groups is incorrect as the division between characters. Can be learned. In this way, the information processing apparatus 100 learns from the neural network various properties regarding the characteristics of the strokes included in the two stroke groups and whether or not the division between the two stroke groups is correct as the division between characters. Can be made to.

そして、情報処理装置１００は、ニューラルネットワークが、２つの筆画グループの特徴量が入力されると、２つの筆画グループの区切りが文字の区切りとして尤もらしいほど１に近い値になるように、０～１の範囲の値を出力可能にすることができる。ここで、０～１の範囲の値は、例えば、そのまま、妥当性スコアとして利用される。０～１の範囲の値は、例えば、一定値αを乗算してから、妥当性スコアとして利用されてもよい。一定値αは、予め、文字認識のテストを実施することにより設定される。 Then, in the information processing apparatus 100, when the feature amounts of the two stroke groups are input to the neural network, the division between the two stroke groups becomes a value that is plausibly close to 1 as the division of characters. Values in the range of 1 can be output. Here, a value in the range of 0 to 1 is used as it is, for example, as a validity score. A value in the range of 0 to 1 may be used as a validity score after being multiplied by a constant value α, for example. The constant value α is set in advance by performing a character recognition test.

したがって、情報処理装置１００は、文字の区切りとして正しい場合の他に、文字の区切りとして誤っている場合についても、ニューラルネットワークに反映した上で、妥当性スコアを算出可能にすることができる。このため、情報処理装置１００は、認識する文字の種別を制限しない場合であっても、１つの文字を形成する筆画の集まりを精度よく特定可能にする基準を得ることができる。 Therefore, the information processing apparatus 100 can calculate the validity score after reflecting it in the neural network not only when it is correct as a character delimiter but also when it is incorrect as a character delimiter. Therefore, the information processing apparatus 100 can obtain a standard that enables accurate identification of a set of strokes forming one character even when the type of the character to be recognized is not limited.

ここでは、教師データが、先行の筆画グループの先頭の筆画、末尾の筆画、筆画グループ全体の位置を示す位置データＡと、後続の筆画グループの先頭の筆画、末尾の筆画、筆画グループ全体の位置を示す位置データＢとを含む場合について説明したが、これに限らない。例えば、教師データが、先行の筆画グループのすべての筆画、筆画グループ全体の位置を示す位置データＡと、後続の筆画グループのすべての筆画、筆画グループ全体の位置を示す位置データＢとを含む場合があってもよい。 Here, the teacher data is the position data A indicating the position of the first stroke, the last stroke, and the entire stroke group of the preceding stroke group, and the first stroke, the last stroke, and the position of the entire stroke group of the subsequent stroke group. The case where the position data B indicating the above is included has been described, but the present invention is not limited to this. For example, when the teacher data includes position data A indicating the positions of all strokes in the preceding stroke group and the entire stroke group, and position data B indicating the positions of all strokes in the subsequent stroke group and the entire stroke group. There may be.

次に、図９～図１１を用いて、ニューラルネットワークに学習される、２つの筆画グループに含まれる筆画の特徴と、２つの筆画グループの区切りが文字間の区切りとして正しいか否かの関係についての性質の一例について説明する。 Next, using FIGS. 9 to 11, the characteristics of the strokes included in the two stroke groups learned by the neural network and the relationship between whether or not the division between the two stroke groups is correct as the division between characters. An example of the property of is described.

図９～図１１は、ニューラルネットワークに学習される性質の一例を示す説明図である。図９において、入力欄９００に、文字９１１と文字９１２とが続けて書き込まれる場合を例に挙げる。 9 to 11 are explanatory views showing an example of the property learned by the neural network. In FIG. 9, a case where a character 911 and a character 912 are continuously written in the input field 900 will be taken as an example.

この場合、文字９１１は、入力欄９００内で、左側から書き始められて右側で書き終えられる傾向があり、上側から書き始められて下側で書き終えられる傾向がある。そして、文字９１２を書き込む際、同様に、文字９１２は、入力欄９００内で、左側から書き始められて右側で書き終えられる傾向があり、上側から書き始められて下側で書き終えられる傾向がある。 In this case, the character 911 tends to start writing from the left side and finish writing on the right side in the input field 900, and tends to start writing from the upper side and finish writing on the lower side. Then, when writing the character 912, similarly, in the input field 900, the character 912 tends to start writing from the left side and finish writing on the right side, and tends to start writing from the upper side and finish writing on the lower side. be.

このため、２つの筆画グループのそれぞれの筆画グループを合成した合成結果の筆跡の組み合わせが示す文字の書き方が、図９に示すような文字の書き方であれば、区切りが正しい確率が高くなるという性質が、ニューラルネットワークに学習される傾向がある。次に、図１０の説明に移行する。 For this reason, if the writing style of the characters indicated by the combination of the handwritings of the composite result obtained by synthesizing the stroke group of each of the two stroke groups is the writing style of the characters as shown in FIG. 9, the probability that the delimiter is correct is high. However, it tends to be learned by neural networks. Next, the description proceeds to FIG.

図１０において、入力欄１０００に、文字１０１０が書き込まれる場合、および、文字列１０３０が書き込まれる場合を例に挙げる。文字１０１０は、「名（漢字）」である。文字列１０３０は、「タロ（カタカナ）」である。 In FIG. 10, a case where a character 1010 is written and a case where a character string 1030 is written are given as an example in the input field 1000. The character 1010 is a "name (kanji)". The character string 1030 is "taro (katakana)".

この場合、文字１０１０を、誤って部分１０２１と部分１０２２とに区切ると、部分１０２１と部分１０２２とは入力欄１０００内で偏った位置に存在する傾向がある。一方で、文字列１０３０を、正しく文字１０４１と文字１０４２とに区切れば、文字１０４１と文字１０４２とは、入力欄１０００内で中央に存在する傾向がある。 In this case, if the character 1010 is erroneously divided into a portion 1021 and a portion 1022, the portion 1021 and the portion 1022 tend to exist at biased positions in the input field 1000. On the other hand, if the character string 1030 is correctly divided into the character 1041 and the character 1042, the character 1041 and the character 1042 tend to exist in the center in the input field 1000.

このため、２つの筆画グループのそれぞれの筆画グループ全体の位置が、入力欄１０００の中央に存在すれば、区切りが正しい確率が高くなるという性質が、ニューラルネットワークに学習される傾向がある。次に、図１１の説明に移行する。 Therefore, if the position of the entire stroke group of each of the two stroke groups exists in the center of the input field 1000, the property that the probability that the delimiter is correct increases tends to be learned by the neural network. Next, the description proceeds to FIG.

図１１において、入力欄１１００に、筆画群１１１１が書き込まれる場合、および、筆画群１１１２が書き込まれる場合を例に挙げる。ここで、筆画群１１１１のように、入力欄１１００の下側に存在する３画以下の筆画群は、句読点や３点リーダなどである可能性がある。一方で、入力欄１１００の下側に存在しても４画以上の筆画群は、３画以下である場合に比べて、文字の一部である可能性が高い。 In FIG. 11, a case where the stroke group 1111 is written in the input field 1100 and a case where the stroke group 1112 is written are given as an example. Here, the stroke group having three or less strokes existing below the input field 1100, such as the stroke group 1111, may be a punctuation mark, a three-point reader, or the like. On the other hand, even if it exists under the input field 1100, the stroke group of 4 or more strokes is more likely to be a part of the character than the case of 3 strokes or less.

また、筆画群１１１２のように、上側に存在する３画以上の筆画群は、下側に存在する場合に比べて、文字の一部である可能性が高い。一方で、上側に存在しても２画以下の筆画群は、引用符号である可能性がある。このため、２つの筆画グループのそれぞれの筆画グループ全体の位置、および、筆画グループに含まれる筆画の数が、区切りの正しさと関係するという性質が、ニューラルネットワークに学習される傾向がある。例えば、筆画グループ全体の位置に偏りがある場合、筆画グループに含まれる筆画の数が多いほど、区切りが正しい確率が低くなるという性質が、ニューラルネットワークに学習される傾向がある。 Further, the stroke group having three or more strokes existing on the upper side, such as the stroke group 1112, is more likely to be a part of the character than the case where the stroke group exists on the lower side. On the other hand, a group of strokes having two or less strokes even if they exist on the upper side may be quotation marks. Therefore, there is a tendency for the neural network to learn the property that the position of the entire stroke group of each of the two stroke groups and the number of strokes included in the stroke group are related to the correctness of the delimiter. For example, when the position of the entire stroke group is biased, the neural network tends to learn that the larger the number of strokes included in the stroke group, the lower the probability that the delimiter is correct.

次に、図１２を用いて、情報処理装置１００が入力画面１２００を表示する一例について説明する。 Next, an example in which the information processing apparatus 100 displays the input screen 1200 will be described with reference to FIG.

図１２は、情報処理装置１００が入力画面１２００を表示する一例を示す説明図である。図１２において、情報処理装置１００は、入力画面１２００を表示する。入力画面１２００は、ユーザが文字を重ね書きする入力欄１２０１と、書き込まれた文字の認識結果を示す表示欄１２０２とを含む。これにより、情報処理装置１００は、ユーザからの文字の書き込みを受け付ける。 FIG. 12 is an explanatory diagram showing an example in which the information processing apparatus 100 displays the input screen 1200. In FIG. 12, the information processing apparatus 100 displays an input screen 1200. The input screen 1200 includes an input field 1201 in which characters are overwritten by the user, and a display field 1202 showing a recognition result of the written characters. As a result, the information processing apparatus 100 receives the writing of characters from the user.

次に、図１３～図１５を用いて、情報処理装置１００が文字を認識する一例について説明する。 Next, an example in which the information processing apparatus 100 recognizes characters will be described with reference to FIGS. 13 to 15.

図１３～図１５は、情報処理装置１００が文字を認識する一例を示す説明図である。情報処理装置１００は、入力欄１２０１に筆画が書き込まれる都度、書き込まれた筆画を検出し、筆画の軌跡を示す筆画データを取得する。情報処理装置１００は、例えば、タッチパネル２０６から、時系列のポインティング座標を含む座標列情報を、筆画データとして取得する。座標列情報は、例えば、（ｘ，ｙ）のようなポインティング座標を示す座標情報の配列である。座標情報は、例えば、３０ｍ秒などの周期ごとに１点ずつ得られる。座標列情報は、例えば、タッチパネル２０６に指やスタイラスが押下されてから、離されるまでの軌跡を示す１以上の座標情報を含む。 13 to 15 are explanatory views showing an example in which the information processing apparatus 100 recognizes characters. Each time the stroke is written in the input field 1201, the information processing apparatus 100 detects the written stroke and acquires the stroke data indicating the trajectory of the stroke. The information processing apparatus 100 acquires, for example, coordinate sequence information including time-series pointing coordinates from the touch panel 206 as stroke data. The coordinate sequence information is, for example, an array of coordinate information indicating pointing coordinates such as (x, y). Coordinate information is obtained one point at a cycle of, for example, 30 ms. The coordinate sequence information includes, for example, one or more coordinate information indicating a locus from when a finger or a stylus is pressed on the touch panel 206 until the stylus is released.

図１３～図１５の例では、情報処理装置１００が、入力欄１２０１に書き込まれた筆画１３０１～１３０５などの軌跡を示す筆画データを順に取得する場合において、筆画１３０５の軌跡を示す筆画データを取得した時点を例に挙げる。 In the example of FIGS. 13 to 15, when the information processing apparatus 100 sequentially acquires the stroke data indicating the locus of the strokes 1301 to 1305 written in the input field 1201, the stroke data indicating the locus of the stroke 1305 is acquired. Take the time when it was done as an example.

ここで、情報処理装置１００は、特定しうる筆画グループをすべて特定し、それぞれの筆画グループが示す筆跡を特定する。情報処理装置１００は、筆画１３０１を含む筆画グループを特定し、筆画１３０１が示す筆跡１３１０を特定する。また、情報処理装置１００は、筆画１３０１，１３０２を含む筆画グループを特定し、筆画１３０１，１３０２が示す筆跡１３２０を特定する。 Here, the information processing apparatus 100 specifies all the stroke groups that can be specified, and specifies the handwriting indicated by each stroke group. The information processing apparatus 100 identifies a stroke group including the stroke 1301, and identifies the handwriting 1310 indicated by the stroke 1301. Further, the information processing apparatus 100 identifies a stroke group including strokes 1301, 1302, and identifies a stroke 1320 indicated by strokes 1301, 1302.

また、情報処理装置１００は、筆画１３０１～１３０３を含む筆画グループを特定し、筆画１３０１～１３０３が示す筆跡１３３０を特定する。また、情報処理装置１００は、筆画１３０１～１３０４を含む筆画グループを特定し、筆画１３０１～１３０４が示す筆跡１３４０を特定する。また、情報処理装置１００は、筆画１３０１～１３０５を含む筆画グループを特定し、筆画１３０１～１３０５が示す筆跡１３５０を特定する。 Further, the information processing apparatus 100 identifies a stroke group including strokes 1301 to 1303, and identifies a stroke 1330 indicated by strokes 1301 to 1303. Further, the information processing apparatus 100 identifies a stroke group including strokes 1301 to 1304, and identifies a stroke 1340 indicated by strokes 1301 to 1304. Further, the information processing apparatus 100 identifies a stroke group including strokes 1301 to 1305, and identifies a stroke 1350 indicated by strokes 1301 to 1305.

また、情報処理装置１００は、筆画１３０２を含む筆画グループを特定し、筆画１３０２が示す筆跡１３６０を特定する。また、情報処理装置１００は、筆画１３０２，１３０３を含む筆画グループを特定し、筆画１３０２，１３０３が示す筆跡１３７０を特定する。また、情報処理装置１００は、同様に、他の筆画グループを特定し、他の筆跡を特定する。次に、図１４の説明に移行する。 Further, the information processing apparatus 100 identifies a stroke group including the stroke 1302, and identifies the handwriting 1360 indicated by the stroke 1302. Further, the information processing apparatus 100 identifies a stroke group including strokes 1302 and 1303, and identifies a stroke 1370 indicated by strokes 1302 and 1303. Further, the information processing apparatus 100 similarly identifies another stroke group and identifies another handwriting. Next, the description proceeds to FIG.

図１４において、情報処理装置１００は、辞書データベース３００を参照して、特定した筆跡が示す文字の候補を特定して認識スコアを算出し、候補データベース５００に記憶する。情報処理装置１００は、例えば、筆跡１３１０が示す文字の候補として「─（罫線）」、「一（漢字）」、「－（記号）」などを特定して認識スコアを算出し、候補データベース５００の１画目の項目の１画のフィールドに設定する。 In FIG. 14, the information processing apparatus 100 refers to the dictionary database 300, identifies the character candidates indicated by the specified handwriting, calculates the recognition score, and stores the recognition score in the candidate database 500. The information processing apparatus 100 identifies, for example, "-(ruled line)", "one (kanji)", "-(symbol)", etc. as character candidates indicated by the handwriting 1310, calculates a recognition score, and calculates the recognition score, and calculates the recognition score, and the candidate database 500. Set in the field of the 1st stroke of the item of the 1st stroke of.

また、情報処理装置１００は、例えば、筆跡１３２０が示す文字の候補として「＋（記号）」、「ナ（カタカナ）」、「十（漢字）」などを特定して認識スコアを算出し、候補データベース５００の１画目の項目の２画のフィールドに設定する。また、情報処理装置１００は、同様に、他の筆跡が示す文字の候補および認識スコアを、候補データベース５００に記憶する。次に、図１５の説明に移行する。 Further, the information processing apparatus 100 identifies, for example, "+ (symbol)", "na (katakana)", "ten (kanji)", etc. as character candidates indicated by the handwriting 1320, calculates a recognition score, and is a candidate. It is set in the field of the second stroke of the item of the first stroke of the database 500. Similarly, the information processing apparatus 100 stores character candidates and recognition scores indicated by other handwritings in the candidate database 500. Next, the process proceeds to the description of FIG.

図１５において、情報処理装置１００は、文脈データベース４００を参照して、筆跡が示す文字の候補の組み合わせごとに、文脈スコアを算出する。組み合わせは、例えば、筆跡１３１０が示す文字の候補と、筆跡１３６０が示す文字の候補の組み合わせである。組み合わせは、例えば、筆跡１３６０が示す文字の候補と、筆跡１５００が示す文字の候補の組み合わせである。組み合わせは、例えば、筆跡１３２０が示す文字の候補と、筆跡１５００が示す文字の候補の組み合わせである。 In FIG. 15, the information processing apparatus 100 refers to the context database 400 and calculates a context score for each combination of character candidates indicated by the handwriting. The combination is, for example, a combination of a character candidate indicated by the handwriting 1310 and a character candidate indicated by the handwriting 1360. The combination is, for example, a combination of a character candidate indicated by the handwriting 1360 and a character candidate indicated by the handwriting 1500. The combination is, for example, a combination of a character candidate indicated by the handwriting 1320 and a character candidate indicated by the handwriting 1500.

情報処理装置１００は、先頭の筆画から末尾の筆画までで形成される連続する１以上の筆跡が示す文字の候補を並べた候補列の、隣接する候補間のバイグラム確率を乗算して、一定値Ｃを乗算して、候補列全体での文脈スコアを算出する。そして、情報処理装置１００は、候補列全体での文脈スコアを、候補列に含まれる文字の候補の組み合わせごとの文脈スコアとして利用する。情報処理装置１００は、筆跡が示す文字の候補の組み合わせのバイグラム確率を、そのまま文脈スコアとして利用してもよい。 The information processing apparatus 100 multiplies the bigram probabilities between adjacent candidates in a candidate sequence in which character candidates indicated by one or more continuous handwritings formed from the first stroke to the last stroke are arranged, and a constant value is obtained. Multiply by C to calculate the context score for the entire candidate sequence. Then, the information processing apparatus 100 uses the context score of the entire candidate sequence as the context score for each combination of character candidates included in the candidate sequence. The information processing apparatus 100 may use the bigram probability of the combination of character candidates indicated by the handwriting as it is as the context score.

また、情報処理装置１００は、ニューラルネットワークを利用して、筆跡の元である筆画グループのペアごとに、妥当性スコアを算出する。情報処理装置１００は、ペアにされた筆画グループの特徴量を、ニューラルネットワークに入力することにより、妥当性スコアを算出する。 Further, the information processing apparatus 100 uses a neural network to calculate a validity score for each pair of stroke groups that are the source of handwriting. The information processing apparatus 100 calculates the validity score by inputting the feature amount of the paired stroke group into the neural network.

そして、情報処理装置１００は、筆跡が示す文字の候補の組み合わせごとの認識スコアおよび文脈スコアと、筆跡の元である筆画グループのペアの妥当性スコアとに基づいて、筆跡が示す文字の候補の組み合わせごとの総合スコアを算出する。総合スコアは、例えば、認識スコアと、文脈スコアと、妥当性スコアとの合計である。総合スコアは、総合的に、いずれの筆画グループが正しく区切られた筆画グループであるかを示し、いずれの候補が書き込まれた文字として相応しい候補であるかを示すことになる。 Then, the information processing apparatus 100 determines the character candidates indicated by the handwriting based on the recognition score and the context score for each combination of the character candidates indicated by the handwriting and the validity score of the pair of the stroke group that is the source of the handwriting. Calculate the total score for each combination. The overall score is, for example, the sum of the recognition score, the context score, and the validity score. Overall, the overall score indicates which stroke group is a correctly delimited stroke group, and which candidate is a suitable candidate for the written character.

情報処理装置１００は、総合スコアが最大である文字の候補の組み合わせを、入力欄１２０１に書き込まれた文字として認識する。これにより、情報処理装置１００は、入力欄１２０１に書き込まれた文字を、リアルタイムに、精度よく認識することができる。 The information processing apparatus 100 recognizes the combination of character candidates having the maximum total score as the characters written in the input field 1201. As a result, the information processing apparatus 100 can accurately recognize the characters written in the input field 1201 in real time.

ここで、情報処理装置１００は、図１３～図１５に示す処理を実行する場合、ビタービアルゴリズムなどの効率的な最適化アルゴリズムを用いることが好ましい。これによれば、情報処理装置１００は、必ずしもすべての組み合わせについて各種スコアを算出しなくてもよく、処理量の低減化を図ることができる。 Here, when the information processing apparatus 100 executes the processes shown in FIGS. 13 to 15, it is preferable to use an efficient optimization algorithm such as a bitterbi algorithm. According to this, the information processing apparatus 100 does not necessarily have to calculate various scores for all combinations, and can reduce the amount of processing.

ここで、情報処理装置１００は、妥当性スコアに基づいて、一部の認識スコアおよび文脈スコアの算出を省略してもよい。これによれば、情報処理装置１００は、ユーザが書き込んだ文字を表す確率が相対的に低いペアについての処理を省略し、文字を認識する精度の低下を抑制しつつ、処理量の低減化を図ることができる。 Here, the information processing apparatus 100 may omit the calculation of a part of the recognition score and the context score based on the validity score. According to this, the information processing apparatus 100 omits the processing for the pair having a relatively low probability of representing the character written by the user, suppresses the decrease in the accuracy of recognizing the character, and reduces the processing amount. Can be planned.

ここでは、情報処理装置１００が、筆画グループのペアに対応する２つの筆跡が示す文字の候補の組み合わせごとに文脈スコアを算出する場合について説明したが、これに限らない。例えば、情報処理装置１００が、最初の筆画から最後の筆画までのうち、時系列に連続する複数の筆跡が示す文字の候補の組み合わせごとに、文脈スコアを算出する場合があってもよい。 Here, the case where the information processing apparatus 100 calculates the context score for each combination of the character candidates indicated by the two handwritings corresponding to the pair of stroke groups has been described, but the present invention is not limited to this. For example, the information processing apparatus 100 may calculate a context score for each combination of character candidates indicated by a plurality of consecutive handwritings in a time series from the first stroke to the last stroke.

次に、図１６を用いて、情報処理装置１００が文字を認識した結果の一例について説明する。 Next, an example of the result of the information processing apparatus 100 recognizing characters will be described with reference to FIG.

図１６は、情報処理装置１００が文字を認識した結果の一例を示す説明図である。例えば、入力欄１２０１に「名」という字が書き込まれる場合と、入力欄１２０１に「タロ」という字が重ね書きされる場合との筆跡は類似する傾向がある。ここで、「名」という字を書いた場合、「タ」と「ロ」の領域は重なっており、領域が重なるか否かを考慮するだけでは、「タロ」という字と区別することが難しい。また、文脈上も、「名は右側です」と「タロは右側です」とのように、いずれの字と認識しても妥当である場合がある。 FIG. 16 is an explanatory diagram showing an example of the result of the information processing apparatus 100 recognizing characters. For example, the handwriting tends to be similar between the case where the character "name" is written in the input field 1201 and the case where the character "taro" is overwritten in the input field 1201. Here, when the character "name" is written, the areas of "ta" and "ro" overlap, and it is difficult to distinguish them from the character "taro" simply by considering whether or not the areas overlap. .. Also, in context, it may be appropriate to recognize either character, such as "name is on the right side" and "taro is on the right side".

このため、従来では、「名」という字が書き込まれた場合、「名」という字を途中で誤って区切り、「名」という字の筆跡は「タロ（カタカナ）」として尤もらしいと誤って判断されることがある。このため、「名」という字が、誤って「タロ（カタカナ）」と認識されてしまうことがある。 For this reason, in the past, when the character "name" was written, the character "name" was mistakenly separated in the middle, and the handwriting of the character "name" was mistakenly judged to be plausible as "taro (katakana)". May be done. Therefore, the character "name" may be mistakenly recognized as "taro (katakana)".

これに対し、情報処理装置１００は、ニューラルネットワークに、誤って文字を区切った場合についても反映させ、「タ」と「ロ」の重なりが浅い場合は「タ」と「ロ」の間の区切りが間違っていると学習させることができる。一方で、情報処理装置１００は、ニューラルネットワークに、正しく文字を区切った場合についても反映させ、「タ」と「ロ」が重ね書きされ、「タ」と「ロ」の重なりが深い場合は、「タ」と「ロ」の区切りは正しいと学習させることができる。 On the other hand, the information processing apparatus 100 reflects the case where characters are erroneously separated in the neural network, and when the overlap between "ta" and "ro" is shallow, the separation between "ta" and "ro". Can be learned if is wrong. On the other hand, the information processing apparatus 100 reflects the case where the characters are correctly separated in the neural network, and when "ta" and "ro" are overwritten and the overlap between "ta" and "ro" is deep, the information processing device 100 reflects the case. It can be learned that the delimiter between "ta" and "b" is correct.

これにより、情報処理装置１００は、例えば、筆跡１６０１のように、「名」という字が書き込まれた場合であれば、妥当性スコアに基づき、筆跡１６０１を途中で区切らないことが好ましいと判断して、「名（漢字）」を認識することができる。一方で、情報処理装置１００は、例えば、筆跡１６０２のように、「タロ」という字が書き込まれた場合であれば、筆跡１６０２を２文字分に区切ることが好ましいと判断して、「タロ（カタカナ）」を認識することができる。このように、情報処理装置１００は、ニューラルネットワークに、誤って文字を区切った場合についても反映させており、精度よく文字を認識することができる。 As a result, the information processing apparatus 100 determines that it is preferable not to divide the handwriting 1601 in the middle based on the validity score when the character "name" is written, for example, as in the handwriting 1601. And can recognize "name (kanji)". On the other hand, when the character "Taro" is written, for example, the information processing apparatus 100 determines that it is preferable to divide the handwriting 1602 into two characters, and "Taro (Taro). Katakana) ”can be recognized. As described above, the information processing apparatus 100 reflects the case where the characters are erroneously separated in the neural network, and can recognize the characters with high accuracy.

また、認識する文字の種別を制限せずに、日本語の入力に対応する場合、認識する文字の数が７０００以上になることがある。このため、従来では、文字の組み合わせごとに、入力欄上で書き込み位置がどこからどこに移動するかの正しい軌跡を示す辞書情報を用意しようとすると、辞書情報が膨大になってしまう。これに対し、情報処理装置１００は、ニューラルネットワークを利用するため、数キロバイト程度にメモリ使用量を抑制することができる。また、情報処理装置１００は、ニューラルネットワークを利用し、１文字ずつの筆画の位置関係と、文字間の筆画の位置関係とを総合的に考慮して、妥当性スコアを算出することができる。このため、情報処理装置１００は、ユーザが文字を斜めに傾けて書き込む傾向があっても、重ね書きされる文字を精度よく認識可能にすることができる。 Further, when Japanese input is supported without limiting the types of characters to be recognized, the number of characters to be recognized may be 7,000 or more. For this reason, conventionally, if it is attempted to prepare dictionary information indicating a correct trajectory of where the writing position moves on the input field for each combination of characters, the dictionary information becomes enormous. On the other hand, since the information processing apparatus 100 uses a neural network, the memory usage can be suppressed to about several kilobytes. Further, the information processing apparatus 100 can calculate the validity score by comprehensively considering the positional relationship of the strokes of each character and the positional relationship of the strokes between the characters by using the neural network. Therefore, the information processing apparatus 100 can accurately recognize the overwritten characters even if the user tends to write the characters at an angle.

ここでは、情報処理装置１００が、入力欄１２０１に筆画が書き込まれる都度、図１３～図１５に示す処理を実行する場合について説明したが、これに限らない。例えば、情報処理装置１００が、筆画を検出してから数ミリ秒後までの間に、次の筆画が書き込まれない場合に、図１３～図１５に示す処理を実行する場合があってもよい。 Here, the case where the information processing apparatus 100 executes the processes shown in FIGS. 13 to 15 each time a stroke is written in the input field 1201 has been described, but the present invention is not limited to this. For example, the information processing apparatus 100 may execute the processes shown in FIGS. 13 to 15 when the next stroke is not written within a few milliseconds after the stroke is detected. ..

ここでは、入力欄１２０１に文字が重ね書きされる場合について説明したが、これに限らない。例えば、入力欄１２０１に文字が並べて書き込まれる場合があってもよい。この場合、情報処理装置１００は、ニューラルネットワークに、正しく文字を区切る場合と、誤って文字を区切る場合とを学習させる。具体的には、情報処理装置１００は、入力欄１２０１に「タロ」という字が近づけて書き込まれた場合に、「タロ」を２文字に区切ることが正しいことを、ニューラルネットワークに学習させる。また、情報処理装置１００は、入力欄１２０１に「名」という字が書き込まれた場合に、「名」を区切らないことが正しいことを、ニューラルネットワークに学習させる。 Here, the case where characters are overwritten in the input field 1201 has been described, but the present invention is not limited to this. For example, characters may be written side by side in the input field 1201. In this case, the information processing apparatus 100 causes the neural network to learn the case where the characters are correctly separated and the case where the characters are erroneously separated. Specifically, the information processing apparatus 100 causes the neural network to learn that it is correct to divide the "Taro" into two characters when the character "Taro" is written close to the input field 1201. Further, the information processing apparatus 100 causes the neural network to learn that it is correct not to separate the "names" when the character "name" is written in the input field 1201.

（全体処理手順）
次に、図１７を用いて、情報処理装置１００が実行する、全体処理手順の一例について説明する。全体処理は、例えば、図２に示したＣＰＵ２０１と、メモリ２０２や記録媒体２０５などの記憶領域と、ネットワークＩ／Ｆ２０３とによって実現される。 (Overall processing procedure)
Next, an example of the overall processing procedure executed by the information processing apparatus 100 will be described with reference to FIG. The entire processing is realized, for example, by the CPU 201 shown in FIG. 2, a storage area such as a memory 202 or a recording medium 205, and a network I / F 203.

図１７は、全体処理手順の一例を示すフローチャートである。図１７において、情報処理装置１００は、筆画データを取得する（ステップＳ１７０１）。そして、情報処理装置１００は、取得した筆画データを蓄積する（ステップＳ１７０２）。 FIG. 17 is a flowchart showing an example of the overall processing procedure. In FIG. 17, the information processing apparatus 100 acquires stroke data (step S1701). Then, the information processing apparatus 100 accumulates the acquired stroke data (step S1702).

次に、情報処理装置１００は、蓄積した筆画データを参照して、時系列順に連続する１以上の筆画を含む筆画グループを、すべて特定する（ステップＳ１７０３）。そして、情報処理装置１００は、特定した筆画グループごとに文字認識を実施し、それぞれの筆画グループに含まれる１以上の筆画の合成結果である筆跡が示す文字の候補および認識スコアを取得する（ステップＳ１７０４）。 Next, the information processing apparatus 100 refers to the accumulated stroke data and specifies all stroke groups including one or more consecutive strokes in chronological order (step S1703). Then, the information processing apparatus 100 performs character recognition for each of the specified stroke groups, and acquires character candidates and recognition scores indicated by the handwriting, which is the result of synthesizing one or more strokes included in each stroke group (step). S1704).

次に、情報処理装置１００は、筆画グループの組み合わせごとに、筆画グループの組み合わせに対応する文字の候補の組み合わせの文脈スコアを算出する（ステップＳ１７０５）。そして、情報処理装置１００は、ニューラルネットワークを利用して、筆画グループの組み合わせごとの妥当性スコアを算出する（ステップＳ１７０６）。 Next, the information processing apparatus 100 calculates the context score of the combination of character candidates corresponding to the combination of the stroke groups for each combination of the stroke groups (step S1705). Then, the information processing apparatus 100 uses a neural network to calculate the validity score for each combination of stroke groups (step S1706).

次に、情報処理装置１００は、筆画グループの組み合わせに対応する文字の候補の組み合わせごとに、認識スコアと文脈スコアと妥当性スコアとに基づく総合スコアを算出する（ステップＳ１７０７）。そして、情報処理装置１００は、総合スコアが最大の文字の候補の組み合わせを出力する（ステップＳ１７０８）。その後、情報処理装置１００は、ステップＳ１７０１の処理に戻る。 Next, the information processing apparatus 100 calculates a total score based on the recognition score, the context score, and the validity score for each combination of character candidates corresponding to the combination of stroke groups (step S1707). Then, the information processing apparatus 100 outputs a combination of character candidates having the maximum total score (step S1708). After that, the information processing apparatus 100 returns to the process of step S1701.

ここで、情報処理装置１００は、図１７の一部ステップの処理の順序を入れ替えて実行してもよい。例えば、情報処理装置１００は、ステップＳ１７０６の処理を、ステップＳ１７０４，Ｓ１７０５の処理より先に実行してもよい。また、情報処理装置１００は、図１７の一部ステップの処理を省略してもよい。例えば、情報処理装置１００は、妥当性スコアが一定以下である筆画グループの組み合わせについては、ステップＳ１７０４，Ｓ１７０５の処理を省略してもよい。 Here, the information processing apparatus 100 may execute the processing in the partial steps of FIG. 17 in a different order. For example, the information processing apparatus 100 may execute the process of step S1706 before the process of steps S1704 and S1705. Further, the information processing apparatus 100 may omit the processing of a part of the steps in FIG. For example, the information processing apparatus 100 may omit the processing of steps S1704 and S1705 for a combination of stroke groups having a validity score of a certain value or less.

以上説明したように、情報処理装置１００によれば、文字が重ね書きされる１つの入力欄に書き込まれた筆画の軌跡を示す筆画データを蓄積することができる。情報処理装置１００によれば、蓄積した筆画データを参照して、入力欄に書き込まれた、先行の筆画グループと、後続の筆画グループとの特徴量に基づいて、先行の筆画グループと後続の筆画グループとの区切りの妥当性スコアを算出することができる。情報処理装置１００によれば、蓄積した筆画データと、算出した妥当性スコアとに基づいて、入力欄に書き込まれた１以上の文字を認識することができる。これにより、情報処理装置１００は、認識する文字の種別が増加しても、文字の区切りがユーザから明示されずとも、１つの文字を形成する筆画の集まりを精度よく特定することができ、重ね書きされる文字を精度よく認識可能にすることができる。 As described above, according to the information processing apparatus 100, it is possible to store stroke data indicating the locus of strokes written in one input field in which characters are overwritten. According to the information processing apparatus 100, the preceding stroke group and the succeeding stroke group are based on the feature quantities of the preceding stroke group and the succeeding stroke group written in the input field with reference to the accumulated stroke data. The validity score of the division with the group can be calculated. According to the information processing apparatus 100, one or more characters written in the input field can be recognized based on the accumulated stroke data and the calculated validity score. As a result, the information processing apparatus 100 can accurately identify a group of strokes forming one character even if the types of characters to be recognized increase and the character delimiter is not specified by the user. The characters to be written can be recognized with high accuracy.

情報処理装置１００によれば、蓄積した筆画データを参照して、先行の筆画グループが形成する文字の候補を生成し、先行の筆画グループが形成する文字の候補の認識スコアを算出することができる。情報処理装置１００によれば、蓄積した筆画データを参照して、後続の筆画グループが形成する文字の候補を生成し、後続の筆画グループが形成する文字の候補の認識スコアを算出することができる。情報処理装置１００によれば、生成した候補および算出した認識スコアと、算出した妥当性スコアとに基づいて、入力欄に書き込まれた１以上の文字を認識することができる。これにより、情報処理装置１００は、認識スコアを利用して、重ね書きされる文字をさらに精度よく認識可能にすることができる。 According to the information processing apparatus 100, the accumulated stroke data can be referred to to generate character candidates formed by the preceding stroke group, and the recognition score of the character candidates formed by the preceding stroke group can be calculated. .. According to the information processing apparatus 100, the accumulated stroke data can be referred to to generate character candidates formed by the subsequent stroke group, and the recognition score of the character candidates formed by the subsequent stroke group can be calculated. .. According to the information processing apparatus 100, one or more characters written in the input field can be recognized based on the generated candidate, the calculated recognition score, and the calculated validity score. As a result, the information processing apparatus 100 can use the recognition score to more accurately recognize the characters to be overwritten.

情報処理装置１００によれば、蓄積した筆画データを参照して、先行の筆画グループが形成する文字の候補と、後続の筆画グループが形成する文字の候補との組み合わせの文脈スコアを算出することができる。情報処理装置１００によれば、算出した文脈スコアと、生成した候補および算出した認識スコアと、算出した妥当性スコアとに基づいて、入力欄に書き込まれた１以上の文字を認識することができる。これにより、情報処理装置１００は、文脈スコアを利用して、重ね書きされる文字をさらに精度よく認識可能にすることができる。 According to the information processing apparatus 100, the context score of the combination of the character candidate formed by the preceding stroke group and the character candidate formed by the subsequent stroke group can be calculated with reference to the accumulated stroke data. can. According to the information processing apparatus 100, one or more characters written in the input field can be recognized based on the calculated context score, the generated candidate, the calculated recognition score, and the calculated validity score. .. As a result, the information processing apparatus 100 can use the context score to more accurately recognize the overwritten characters.

情報処理装置１００によれば、入力欄に書き込まれた、先行の筆画グループと後続の筆画グループとを組み合わせた２つの筆画グループのペアごとに、ペアにされた２つの筆画グループの区切りの妥当性スコアを算出することができる。これにより、情報処理装置１００は、文字を形成する筆画の集まりの区切りを網羅的に検証することができ、１つの文字を形成する筆画の集まりを精度よく特定することができ、重ね書きされる文字を精度よく認識可能にすることができる。 According to the information processing apparatus 100, the validity of the division between the two paired stroke groups for each pair of the two stroke groups in which the preceding stroke group and the subsequent stroke group are combined, which is written in the input field. The score can be calculated. As a result, the information processing apparatus 100 can comprehensively verify the division of the group of strokes forming a character, can accurately identify the group of strokes forming one character, and is overwritten. Characters can be recognized accurately.

情報処理装置１００によれば、ペアのうち、算出した妥当性スコアが条件を満たすペアごとに、認識スコアを算出することができる。情報処理装置１００によれば、ペアのうち、算出した妥当性スコアが条件を満たすペアごとに、文脈スコアを算出することができる。これにより、情報処理装置１００は、ユーザが書き込んだ文字を表す確率が相対的に低いペアについての処理を省略し、文字を認識する精度の低下を抑制しつつ、処理量の低減化を図ることができる。 According to the information processing apparatus 100, the recognition score can be calculated for each pair that satisfies the condition of the calculated validity score. According to the information processing apparatus 100, the context score can be calculated for each pair that satisfies the calculated validity score among the pairs. As a result, the information processing apparatus 100 omits the processing for the pair having a relatively low probability of representing the character written by the user, suppresses the decrease in the accuracy of recognizing the character, and reduces the processing amount. Can be done.

情報処理装置１００によれば、先行の筆画グループに含まれる末尾の筆画の入力欄での位置と、後続の筆画グループに含まれる先頭の筆画および末尾の筆画の入力欄での位置とに基づいて、妥当性スコアを算出することができる。これにより、情報処理装置１００は、先行の筆画グループに含まれるすべての筆画と、後続の筆画グループに含まれるすべての筆画とを考慮する場合に比べて、処理量の低減化を図ることができる。 According to the information processing apparatus 100, based on the position in the input field of the last stroke included in the preceding stroke group and the position in the input field of the first stroke and the last stroke included in the subsequent stroke group. , The validity score can be calculated. As a result, the information processing apparatus 100 can reduce the processing amount as compared with the case where all the strokes included in the preceding stroke group and all the strokes included in the subsequent stroke group are taken into consideration. ..

情報処理装置１００によれば、さらに、先行の筆画グループに含まれる先頭の筆画に基づいて、妥当性スコアを算出することができる。これにより、情報処理装置１００は、妥当性スコアを算出する精度の向上を図ることができる。 Further, according to the information processing apparatus 100, the validity score can be calculated based on the first stroke included in the preceding stroke group. As a result, the information processing apparatus 100 can improve the accuracy of calculating the validity score.

情報処理装置１００によれば、さらに、先行の筆画グループに含まれる筆画の数と、後続の筆画グループに含まれる筆画の数とに基づいて、妥当性スコアを算出することができる。これにより、情報処理装置１００は、妥当性スコアを算出する精度の向上を図ることができる。 Further, according to the information processing apparatus 100, the validity score can be calculated based on the number of strokes included in the preceding stroke group and the number of strokes included in the subsequent stroke group. As a result, the information processing apparatus 100 can improve the accuracy of calculating the validity score.

情報処理装置１００によれば、さらに、先行の筆画グループ全体の入力欄での位置と、後続の筆画グループ全体の入力欄での位置とに基づいて、妥当性スコアを算出することができる。これにより、情報処理装置１００は、妥当性スコアを算出する精度の向上を図ることができる。 Further, according to the information processing apparatus 100, the validity score can be calculated based on the position in the input field of the entire preceding stroke group and the position in the input field of the entire subsequent stroke group. As a result, the information processing apparatus 100 can improve the accuracy of calculating the validity score.

情報処理装置１００によれば、ペアにされた２つの筆画グループと、ペアにされた２つの筆画グループの区切りが文字間の区切りとして妥当であるか否かとを対応付けて表す教師データを取得することができる。情報処理装置１００によれば、取得した教師データを利用して、ペアにされた２つの筆画グループのそれぞれの筆画グループの特徴量を入力すると、ペアにされた２つの筆画グループの区切りの妥当性スコアを出力するモデルを学習することができる。情報処理装置１００によれば、学習したモデルを利用して、先行の筆画グループと、後続の筆画グループとの特徴量に基づいて、先行の筆画グループと後続の筆画グループとの区切りの妥当性スコアを算出することができる。これにより、情報処理装置１００は、文字の区切りの性質を学習したニューラルネットワークを利用して、妥当性スコアを精度よく算出可能にすることができる。 According to the information processing apparatus 100, the teacher data representing whether or not the delimiter between the two paired stroke groups and the paired stroke group is appropriate as the delimiter between characters is acquired. be able to. According to the information processing apparatus 100, when the feature amount of each stroke group of the two paired stroke groups is input by using the acquired teacher data, the validity of the division between the two paired stroke groups is valid. You can learn a model that outputs scores. According to the information processing apparatus 100, the validity score of the division between the preceding stroke group and the succeeding stroke group is based on the features of the preceding stroke group and the succeeding stroke group by using the learned model. Can be calculated. As a result, the information processing apparatus 100 can accurately calculate the validity score by using the neural network that has learned the property of character delimiter.

なお、本実施の形態で説明した文字認識方法は、予め用意されたプログラムをパーソナル・コンピュータやワークステーション等のコンピュータで実行することにより実現することができる。本実施の形態で説明した文字認識プログラムは、ハードディスク、フレキシブルディスク、ＣＤ－ＲＯＭ、ＭＯ、ＤＶＤ等のコンピュータで読み取り可能な記録媒体に記録され、コンピュータによって記録媒体から読み出されることによって実行される。また、本実施の形態で説明した文字認識プログラムは、インターネット等のネットワークを介して配布してもよい。 The character recognition method described in this embodiment can be realized by executing a program prepared in advance on a computer such as a personal computer or a workstation. The character recognition program described in this embodiment is recorded on a computer-readable recording medium such as a hard disk, flexible disk, CD-ROM, MO, or DVD, and is executed by being read from the recording medium by the computer. Further, the character recognition program described in this embodiment may be distributed via a network such as the Internet.

上述した実施の形態に関し、さらに以下の付記を開示する。 The following additional notes are further disclosed with respect to the above-described embodiment.

（付記１）入力欄に書き込まれた筆画の軌跡を示す筆画データを蓄積し、
蓄積した前記筆画データを参照して、前記入力欄に書き込まれた、時系列順に連続する１以上の筆画を含む先行の筆画グループと、前記先行の筆画グループの直後に続く、時系列順に連続する１以上の筆画を含む後続の筆画グループとの特徴量に基づいて、前記先行の筆画グループと前記後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出し、
蓄積した前記筆画データと、算出した前記妥当性スコアとに基づいて、前記入力欄に書き込まれた１以上の文字を認識する、
制御部を有することを特徴とする情報処理装置。 (Appendix 1) Accumulate stroke data showing the trajectory of the stroke written in the input field.
With reference to the accumulated stroke data, the preceding stroke group containing one or more consecutive strokes written in the input field in chronological order and the preceding stroke group immediately following the preceding stroke group are continuous in chronological order. A validity score indicating the strength of validity that the division between the preceding stroke group and the subsequent stroke group is a division between characters based on the feature amount of the subsequent stroke group including one or more strokes. Is calculated,
Recognizes one or more characters written in the input field based on the accumulated stroke data and the calculated validity score.
An information processing device characterized by having a control unit.

（付記２）前記入力欄は、文字が重ね書きされる領域である、ことを特徴とする付記１に記載の情報処理装置。 (Supplementary Note 2) The information processing apparatus according to Supplementary Note 1, wherein the input field is an area in which characters are overwritten.

（付記３）前記制御部は、
蓄積した前記筆画データを参照して、前記先行の筆画グループが形成する文字の候補および前記先行の筆画グループが形成する文字の候補の妥当性の強さを示す認識スコアと、前記後続の筆画グループが形成する文字の候補および前記後続の筆画グループが形成する文字の候補の妥当性の強さを示す認識スコアとを生成し、
生成した前記候補および前記認識スコアと、算出した前記妥当性スコアとに基づいて、前記入力欄に書き込まれた１以上の文字を認識する、ことを特徴とする付記１または２に記載の情報処理装置。 (Appendix 3) The control unit is
With reference to the accumulated stroke data, a recognition score indicating the strength of the validity of the character candidates formed by the preceding stroke group and the character candidates formed by the preceding stroke group, and the subsequent stroke group. Generates a recognition score that indicates the strength of the validity of the character candidates formed by and the character candidates formed by the subsequent stroke group.
The information processing according to Appendix 1 or 2, wherein one or more characters written in the input field are recognized based on the generated candidate and the recognition score and the calculated validity score. Device.

（付記４）前記制御部は、
蓄積した前記筆画データを参照して、前記先行の筆画グループが形成する文字の候補と、前記後続の筆画グループが形成する文字の候補との組み合わせが、文脈に沿った組み合わせである妥当性の強さを示す文脈スコアを算出し、
算出した前記文脈スコアと、生成した前記候補および前記認識スコアと、算出した前記妥当性スコアとに基づいて、前記入力欄に書き込まれた１以上の文字を認識する、ことを特徴とする付記３に記載の情報処理装置。 (Appendix 4) The control unit is
With reference to the accumulated stroke data, the combination of the character candidate formed by the preceding stroke group and the character candidate formed by the subsequent stroke group has a strong validity as a contextual combination. Calculate the context score to indicate
Addendum 3 characterized in that one or more characters written in the input field are recognized based on the calculated context score, the generated candidate and the recognition score, and the calculated validity score. The information processing device described in.

（付記５）前記制御部は、
前記入力欄に書き込まれた、時系列順に連続する１以上の筆画を含む先行の筆画グループと、前記先行の筆画グループの直後に続く、時系列順に連続する１以上の筆画を含む後続の筆画グループとを組み合わせた２つの筆画グループのペアごとに、前記ペアにされた２つの筆画グループのそれぞれの筆画グループの特徴量に基づいて、前記ペアにされた２つの筆画グループの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出する、ことを特徴とする付記４に記載の情報処理装置。 (Appendix 5) The control unit is
A preceding stroke group containing one or more consecutive strokes in chronological order written in the input field, and a subsequent stroke group containing one or more consecutive strokes in chronological order immediately following the preceding stroke group. For each pair of two stroke groups in combination with, the division of the two stroke groups in the pair is between characters based on the feature amount of each stroke group of the two stroke groups in the pair. The information processing apparatus according to Appendix 4, wherein a validity score indicating the strength of validity, which is a delimiter, is calculated.

（付記６）前記制御部は、
前記ペアのうち、算出した前記妥当性スコアが条件を満たすペアごとに、前記条件を満たすペアにされた２つの筆画グループのそれぞれの筆画グループが形成する文字の候補および前記条件を満たすペアにされた２つの筆画グループのそれぞれの筆画グループが形成する文字の候補の妥当性を示す認識スコアを生成し、
前記ペアのうち、前記条件を満たすペアごとに、前記条件を満たすペアにされた２つの筆画グループのそれぞれの筆画グループが形成する文字の候補の組み合わせが、文脈に沿った組み合わせである妥当性の強さを示す文脈スコアを算出する、ことを特徴とする付記５に記載の情報処理装置。 (Appendix 6) The control unit is
Of the pairs, for each pair whose validity score is satisfied, the character candidates formed by each stroke group of the two stroke groups made into the pair satisfying the above condition and the pair satisfying the above condition are set. Generates a recognition score that indicates the validity of the character candidates formed by each of the two stroke groups.
Of the pairs, for each pair that satisfies the conditions, the combination of character candidates formed by each stroke group of the two stroke groups that satisfy the conditions is a contextual combination. The information processing apparatus according to Appendix 5, wherein a context score indicating strength is calculated.

（付記７）前記制御部は、
前記先行の筆画グループに含まれる末尾の筆画と、前記後続の筆画グループに含まれる先頭の筆画と、前記後続の筆画グループに含まれる末尾の筆画との少なくともいずれかの筆画の前記入力欄での位置に基づいて、前記先行の筆画グループと前記後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出する、ことを特徴とする付記１～６のいずれか一つに記載の情報処理装置。 (Appendix 7) The control unit is
In the input field of at least one of the last stroke included in the preceding stroke group, the first stroke included in the subsequent stroke group, and the last stroke included in the subsequent stroke group. Addendum 1 to 6 characterized in that the division between the preceding stroke group and the subsequent stroke group calculates a validity score indicating the strength of validity as a division between characters based on the position. The information processing apparatus according to any one of the above.

（付記８）前記制御部は、
さらに、前記先行の筆画グループに含まれる先頭の筆画の前記入力欄での位置に基づいて、前記先行の筆画グループと前記後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出する、ことを特徴とする付記７に記載の情報処理装置。 (Appendix 8) The control unit is
Further, based on the position of the first stroke included in the preceding stroke group in the input field, the division between the preceding stroke group and the subsequent stroke group has a strong validity as a division between characters. The information processing apparatus according to Appendix 7, wherein a validity score indicating the above is calculated.

（付記９）前記制御部は、
さらに、前記先行の筆画グループに含まれる筆画の数と、前記後続の筆画グループに含まれる筆画の数とに基づいて、前記先行の筆画グループと前記後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出する、ことを特徴とする付記７または８に記載の情報処理装置。 (Appendix 9) The control unit is
Further, based on the number of strokes included in the preceding stroke group and the number of strokes included in the succeeding stroke group, the division between the preceding stroke group and the succeeding stroke group is between characters. The information processing apparatus according to Appendix 7 or 8, wherein a validity score indicating the strength of validity as a delimiter is calculated.

（付記１０）前記制御部は、
さらに、前記先行の筆画グループ全体の前記入力欄での位置と、前記後続の筆画グループ全体の前記入力欄での位置とに基づいて、前記先行の筆画グループと前記後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出する、ことを特徴とする付記７～９のいずれか一つに記載の情報処理装置。 (Appendix 10) The control unit is
Further, based on the position of the entire preceding stroke group in the input field and the position of the entire subsequent stroke group in the input field, the division between the preceding stroke group and the subsequent stroke group is established. , The information processing apparatus according to any one of Supplementary note 7 to 9, wherein a validity score indicating the strength of validity, which is a delimiter between characters, is calculated.

（付記１１）前記制御部は、
ペアにされた２つの筆画グループと、前記ペアにされた２つの筆画グループの区切りが文字間の区切りとして妥当であるか否かとを対応付けて表す教師データを取得し、
取得した前記教師データを利用して、ペアにされた２つの筆画グループのそれぞれの筆画グループの特徴量を入力すると、前記ペアにされた２つの筆画グループの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを出力するモデルを学習し、
学習した前記モデルを利用して、前記先行の筆画グループと、前記後続の筆画グループとの特徴量に基づいて、前記先行の筆画グループと前記後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出する、ことを特徴とする付記１～１０のいずれか一つに記載の情報処理装置。 (Appendix 11) The control unit is
Acquire teacher data showing the correspondence between the two paired stroke groups and whether or not the delimiter between the two paired stroke groups is appropriate as a delimiter between characters.
When the feature amount of each stroke group of the two paired stroke groups is input by using the acquired teacher data, the division between the two paired stroke groups is appropriate as the division between characters. Learn a model that outputs a validity score that indicates sexual strength,
Using the learned model, the division between the preceding stroke group and the succeeding stroke group is a division between characters based on the feature amount of the preceding stroke group and the succeeding stroke group. The information processing apparatus according to any one of Supplementary note 1 to 10, wherein a validity score indicating the strength of a certain validity is calculated.

（付記１２）コンピュータが、
入力欄に書き込まれた筆画の軌跡を示す筆画データを蓄積し、
蓄積した前記筆画データを参照して、前記入力欄に書き込まれた、時系列順に連続する１以上の筆画を含む先行の筆画グループと、前記先行の筆画グループの直後に続く、時系列順に連続する１以上の筆画を含む後続の筆画グループとの特徴量に基づいて、前記先行の筆画グループと前記後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出し、
蓄積した前記筆画データと、算出した前記妥当性スコアとに基づいて、前記入力欄に書き込まれた１以上の文字を認識する、
処理を実行することを特徴とする文字認識方法。 (Appendix 12) The computer
Accumulates stroke data showing the trajectory of the stroke written in the input field,
With reference to the accumulated stroke data, the preceding stroke group containing one or more consecutive strokes written in the input field in chronological order and the preceding stroke group immediately following the preceding stroke group are continuous in chronological order. A validity score indicating the strength of validity that the division between the preceding stroke group and the subsequent stroke group is a division between characters based on the feature amount of the subsequent stroke group including one or more strokes. Is calculated,
Recognizes one or more characters written in the input field based on the accumulated stroke data and the calculated validity score.
A character recognition method characterized by performing processing.

（付記１３）コンピュータに、
入力欄に書き込まれた筆画の軌跡を示す筆画データを蓄積し、
蓄積した前記筆画データを参照して、前記入力欄に書き込まれた、時系列順に連続する１以上の筆画を含む先行の筆画グループと、前記先行の筆画グループの直後に続く、時系列順に連続する１以上の筆画を含む後続の筆画グループとの特徴量に基づいて、前記先行の筆画グループと前記後続の筆画グループとの区切りが、文字間の区切りである妥当性の強さを示す妥当性スコアを算出し、
蓄積した前記筆画データと、算出した前記妥当性スコアとに基づいて、前記入力欄に書き込まれた１以上の文字を認識する、
処理を実行させることを特徴とする文字認識プログラム。 (Appendix 13) To the computer
Accumulates stroke data showing the trajectory of the stroke written in the input field,
With reference to the accumulated stroke data, the preceding stroke group containing one or more consecutive strokes written in the input field in chronological order and the preceding stroke group immediately following the preceding stroke group are continuous in chronological order. A validity score indicating the strength of validity that the division between the preceding stroke group and the subsequent stroke group is a division between characters based on the feature amount of the subsequent stroke group including one or more strokes. Is calculated,
Recognizes one or more characters written in the input field based on the accumulated stroke data and the calculated validity score.
A character recognition program characterized by executing processing.

１００情報処理装置
１１０，７００，９００，１０００，１１００，１２０１入力欄
１１１～１１５，７０１～７０９，１３０１～１３０５筆画
１２１，１２２，７１１，７１２，８１１，８１２，１３１０，１３２０，１３３０，１３４０，１３５０，１３６０，１３７０，１５００，１６０１，１６０２筆跡
２００バス
２０１ＣＰＵ
２０２メモリ
２０３ネットワークＩ／Ｆ
２０４記録媒体Ｉ／Ｆ
２０５記録媒体
２０６タッチパネル
２１０ネットワーク
３００辞書データベース
４００文脈データベース
５００候補データベース
６００記憶部
６０１取得部
６０２特定部
６０３妥当性スコア算出部
６０４認識スコア算出部
６０５文脈スコア算出部
６０６重畳文字認識部
６０７出力部
６０８学習部
９１１，９１２，１０１０，１０４１，１０４２文字
１０２１，１０２２部分
１０３０文字列
１１１１，１１１２筆画群
１２００入力画面
１２０２表示欄 100 Information processing unit 110, 700, 900, 1000, 1100, 1201 Input fields 111 to 115, 701 to 709, 1301 to 1305 Brush drawing 121, 122, 711, 712, 811, 812, 1310, 1320, 1330, 1340, 1350 , 1360, 1370, 1500, 1601, 1602 Handwriting 200 Bus 201 CPU
202 Memory 203 Network I / F
204 Recording medium I / F
205 Recording medium 206 Touch panel 210 Network 300 Dictionary database 400 Contextual database 500 Candidate database 600 Storage unit 601 Acquisition unit 602 Specific unit 603 Validity score calculation unit 604 Recognition score calculation unit 605 Contextual score calculation unit 606 Superimposed character recognition unit 607 Output unit 608 Learning unit 911, 912, 1010, 1041, 1042 characters 1021, 1022 part 1030 character string 1111, 1112 brush drawing group 1200 input screen 1202 display field

Claims

Accumulates stroke data showing the trajectory of the stroke written in the input field,
With reference to the accumulated stroke data, the preceding stroke group containing one or more consecutive strokes written in the input field in chronological order and the preceding stroke group immediately following the preceding stroke group are continuous in chronological order. For each pair of two stroke groups combined with a subsequent stroke group containing one or more strokes, the pair was made into the pair based on the feature quantity of each stroke group of the two stroke groups made into the pair. Calculate the validity score, which indicates the strength of the validity that the break of one stroke group is the break between characters.
Of the pairs, for each pair whose validity score is satisfied, the character candidates formed by each stroke group of the two stroke groups made into the pair satisfying the above condition and the pair satisfying the above condition are set. Generates a recognition score that indicates the validity of the character candidates formed by each of the two stroke groups.
Recognizes one or more characters written in the input field based on the generated candidate and the recognition score and the calculated validity score.
An information processing device characterized by having a control unit.

The information processing apparatus according to claim 1, wherein the input field is an area in which characters are overwritten.

The control unit
Of the pairs, for each pair that satisfies the conditions, the combination of character candidates formed by each stroke group of the two stroke groups that satisfy the conditions is a contextual combination. Calculate a context score that indicates strength,
A claim characterized in that one or more characters written in the input field are recognized based on the calculated context score, the generated candidate and the recognition score, and the calculated validity score. The information processing apparatus according to 1 or 2.

The control unit
For each pair, the last stroke included in the preceding stroke group of the two paired stroke groups and the first stroke included in the subsequent stroke group of the two paired stroke groups. Two of the paired strokes based on the position in the input field of at least one of the strokes and the last stroke in the subsequent stroke group of the two paired stroke groups. The information processing apparatus according to any one of claims 1 to 3, wherein the stroke group delimiter calculates a validity score indicating the strength of the validity of the delimiter between characters.

The control unit
For each pair, further, the two stroke groups made into the pair based on the position of the first stroke included in the preceding stroke group in the input field among the two stroke groups made into the pair. The information processing apparatus according to claim 4, wherein the delimiter calculates a validity score indicating the strength of validity as a delimiter between characters.

The information processing apparatus according to any one of claims 1 to 5, wherein the feature amount of each of the stroke groups includes the position of the entire stroke group.

The information processing apparatus according to any one of claims 1 to 6, wherein the feature amount of each stroke group includes the number of strokes of each stroke group.

The control unit
Acquire teacher data showing the correspondence between the two paired stroke groups and whether or not the delimiter between the two paired stroke groups is appropriate as a delimiter between characters.
When the feature amount of each stroke group of the two paired stroke groups is input by using the acquired teacher data, the delimiter between the two paired stroke groups is appropriate as the delimiter between characters. Learn a model that outputs a validity score that indicates sexual strength,
Using the learned model, for each pair of two stroke groups in which the preceding stroke group and the subsequent stroke group are combined, the characteristics of each stroke group of the two stroke groups paired. Based on the quantity, the validity score indicating the strength of the validity that the separation of the two stroke groups paired is the separation between characters is calculated.
The information processing apparatus according to any one of claims 1 to 7.

The computer
Accumulates stroke data showing the trajectory of the stroke written in the input field,
With reference to the accumulated stroke data, the preceding stroke group containing one or more consecutive strokes written in the input field in chronological order and the preceding stroke group immediately following the preceding stroke group are continuous in chronological order. For each pair of two stroke groups combined with a subsequent stroke group containing one or more strokes, the pair was made into the pair based on the feature quantity of each stroke group of the two stroke groups made into the pair. Calculate the validity score, which indicates the strength of the validity that the break of one stroke group is the break between characters.
Of the pairs, for each pair whose validity score is satisfied, the character candidates formed by each stroke group of the two stroke groups made into the pair satisfying the above condition and the pair satisfying the above condition are set. Generates a recognition score that indicates the validity of the character candidates formed by each of the two stroke groups.
Recognizes one or more characters written in the input field based on the generated candidate and the recognition score and the calculated validity score.
A character recognition method characterized by performing processing.

On the computer
Accumulates stroke data showing the trajectory of the stroke written in the input field,
With reference to the accumulated stroke data, the preceding stroke group containing one or more consecutive strokes written in the input field in chronological order and the preceding stroke group immediately following the preceding stroke group are continuous in chronological order. For each pair of two stroke groups combined with a subsequent stroke group containing one or more strokes, the pair was made into the pair based on the feature quantity of each stroke group of the two stroke groups made into the pair. Calculate the validity score, which indicates the strength of the validity that the break of one stroke group is the break between characters.
Of the pairs, for each pair whose validity score is satisfied, the character candidates formed by each stroke group of the two stroke groups made into the pair satisfying the above condition and the pair satisfying the above condition are set. Generates a recognition score that indicates the validity of the character candidates formed by each of the two stroke groups.
Recognizes one or more characters written in the input field based on the generated candidate and the recognition score and the calculated validity score.
A character recognition program characterized by executing processing.