JP4107659B2 - Handwritten font output system

Info

Publication number: JP4107659B2
Application number: JP2003321251A
Authority: JP
Inventors: 真人中島; 慎吾安藤; 万貴加藤
Original assignee: Keio University
Current assignee: Keio University
Priority date: 2003-09-12
Filing date: 2003-09-12
Publication date: 2008-06-25
Anticipated expiration: 2023-09-12
Also published as: JP2005091444A

Description

The present invention relates to a handwriting-style character font output system that automatically generates handwriting-style character fonts reflecting each person's individuality, for use in word processors and similar applications.

Patent Document 1 already proposes a method that extracts and quantifies a writer's writing habits from the user's handwritten characters supplied by offline input, and generates an original handwritten character font for that user reflecting those habits (hereinafter, "character font with individuality"). That method, however, assumes that when the character font with individuality is output, fixed-width characters are placed at equal intervals.

Patent Document 1: JP 2003-58142 A

However, handwritten characters written with a writing instrument such as a pen, ballpoint pen, or pencil differ considerably from writer to writer in character width (the width of the character itself) and in how space is left between characters.

FIG. 3 shows examples of handwritten text: four people, A to D, were asked to write the sentence fragment 「な影響を与えています」 ("... is having an influence") naturally. Both the character widths and the spacing between characters do in fact differ from person to person, which suggests that these differences are a major element of what makes text look handwritten. Consequently, even with a handwriting-style character font, outputting fixed-width characters at equal intervals on a grid makes the result look conspicuously stiff and leaves it unsatisfying as an imitation of handwriting.

In view of the above problem, an object of the present invention is to provide a handwriting-style character font output system, program, and recording medium that enable character segmentation capable of extracting information on the spacing between adjacent handwritten characters.

The handwriting-style character font output system of the present invention comprises: a first memory that stores m standard character fonts (m: a natural number); a second memory that stores a sequence of n character fonts input as an image (n: a natural number, 2 ≤ n < m); inclination correction means for correcting the inclination of the input character fonts stored in the second memory; character segmentation means for reading out from the first memory the character fonts corresponding to the inclination-corrected character fonts, comparing the two so as to cut each character font out of the inclination-corrected input sequence and extract the spacing between adjacent characters, and quantifying and registering the extracted spacing for each combination of adjacent character types; displacement information extraction means for comparing each character font cut out by the character segmentation means with the corresponding standard character font stored in the first memory, extracting the displacements of corresponding points, and extracting statistical information on them; displacement information adding means for modifying and outputting the character fonts stored in the first memory in accordance with the output of the displacement information extraction means; and inter-character space information adding means for outputting the character fonts output by the displacement information adding means at a character spacing that accords with the spacing registered by the character segmentation means.

In another aspect, the handwriting-style character font output system of the present invention comprises: a first memory that stores m standard character fonts (m: a natural number); a second memory that stores a sequence of n character fonts input as an image (n: a natural number, 2 ≤ n < m); inclination correction means for correcting the inclination of the input character fonts stored in the second memory; character segmentation means for reading out from the first memory the character fonts corresponding to the inclination-corrected character fonts, comparing the two so as to cut each character font out of the inclination-corrected input sequence and extract the spacing between adjacent characters, and quantifying and registering the extracted spacing for each combination of the projection complexities of adjacent characters; displacement information extraction means for comparing each character font cut out by the character segmentation means with the corresponding standard character font stored in the first memory, extracting the displacements of corresponding points, and extracting statistical information on them; displacement information adding means for modifying and outputting the character fonts stored in the first memory in accordance with the output of the displacement information extraction means; and inter-character space information adding means for outputting the character fonts output by the displacement information adding means at a character spacing that accords with the spacing registered by the character segmentation means.

According to the present invention, handwritten characters can be cut out accurately even from a character string written at a slant, so inter-character space information can be extracted and handwriting-style characters can be output with the inter-character spacing that matches the user's individuality.

Preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings.

FIG. 1 is a block diagram showing the configuration of a handwriting-style character font output system according to one embodiment of the present invention. The handwriting-style character font output system 1 consists of a control unit 11 that controls the entire system; an input unit 12 comprising a keyboard for entering characters, an image scanner for inputting character fonts, and the like; a RAM (Random Access Memory) 13 used as a work area; a ROM (Read Only Memory) 14 that permanently stores the control program, predetermined constants, and the like; a display unit 15, which is a computer display; and a printing unit 16 that prints character fonts on paper. These components are connected to one another. They are basically the same as the components of an ordinary computer, and general-purpose parts can be used.

FIG. 2 is a functional block diagram showing the functions of the handwriting-style character font output system according to this embodiment. The functions fall broadly into two groups: an individuality information extraction and registration function 2, and a function 3 for generating character fonts with individuality information.

In the individuality information extraction and registration function 2, a predetermined text consisting of multiple characters (hereinafter, "registered characters") that the user has written on ruled paper is input to the computer with an image scanner. The registered character fonts are compared with the standard character fonts to cut out the registered characters; the spacing between adjacent registered characters, the geometric displacement of each registered character font relative to the standard character font, and the size of the registered characters are analyzed; and the resulting statistical information is registered as feature quantities representing the user's individuality (hereinafter, "individuality information").

First, in the standard character font group storage function 21, several kinds of pen-written regular-script (kaisho) character fonts (hereinafter, "standard character fonts") are installed in the computer in advance for use in cutting out the registered characters, extracting the user's individuality information, and creating character fonts with individuality information. The more kinds the better, and it is desirable that they cover as many variations as possible in how cursively the characters are written. Examples include 千葉ペン字体, ペン行楷書体, 白州ペン楷書体, 創英ペン字体, セイビシロガネ, and ペン楷書体.

Next, in the handwritten character input and storage function 22, the user is asked to write a text of about 100 to 200 characters on ruled paper, which is then input to the computer as an image and stored. The text to be written is predetermined. The ruled paper is a letter sheet with red ruled lines and register marks (+ marks) (see FIG. 4); red is only an example, and any color other than monochrome or bluish colors will do. The user may print out a sheet template bundled with the application software and use that. When writing the sample used for individuality extraction, the user should preferably place the sheet with the register mark (+) at the top and write the designated text at about 10 characters per line (about 100 to 200 characters in total), using a black or blue ballpoint pen, fountain pen, or pencil.

About 10 characters per line is appropriate for reliable character segmentation (described in detail later). With more characters per line, any error in the comparison between the projection of the registered characters and the projection of the standard font may propagate and grow from one character to the next, so increasing the number of characters is not advisable. On the other hand, it is also important that the text the user writes preserve the flow of writing. The number of characters per line therefore cannot be made too small either, or the user could not write smoothly and naturally. Hence "about 10 characters" per line. This removes the constraint of having to write one character per box (writing character by character disrupts the flow of writing and produces stiff results) and yields a text (a group of characters) that preserves the continuity between characters and the writer's own handwriting, including character widths and inter-character spacing.

The text written in this way is input to the computer with an image scanner and stored. Because the sheet carries register marks and its ruled lines are in a different color from the characters the user writes, these two pieces of information make the processing robust to changes in the orientation of the sheet. That is, however the sheet is placed on the scanner surface, its orientation can be determined automatically, and character data can be acquired independently of the sheet orientation.

The row or column extraction function 23 extracts rows from a group of characters written horizontally, or columns from a group written vertically. The horizontally written case is described here as an example.

First, as preprocessing, the entire acquired image is binarized. Because the characters written on the sheet are arranged along the red ruled lines, the red ruled lines are erased and a projection is then taken in the direction of the ruled lines; the rows are cut out according to the peaks and valleys of the resulting projection image.
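The preprocessing step could look roughly like the following minimal sketch. The source does not say how the red ruled lines are detected or which binarization threshold is used, so the function name `binarize_without_red_lines` and the parameters `ink_threshold` and `red_margin` are assumptions made only for illustration.

```python
import numpy as np

def binarize_without_red_lines(rgb, ink_threshold=128, red_margin=60):
    """Return a boolean ink mask (True = ink) from an RGB scan.

    Pixels whose red channel exceeds both green and blue by `red_margin`
    are treated as ruled-line pixels and dropped before thresholding.
    Both threshold values are illustrative assumptions, not from the source.
    """
    rgb = np.asarray(rgb, dtype=np.int32)
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    is_red_line = (r - g > red_margin) & (r - b > red_margin)
    gray = (r + g + b) // 3
    return (gray < ink_threshold) & ~is_red_line
```

Because the test is on the dominance of the red channel, black, blue, and pencil-gray ink survive while the ruled lines are erased, which matches the reason the sheet uses a line color different from the ink.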

FIG. 5 shows an example of the projection image for rows. The left side shows an example of the acquired image of the designated text written by the user on ruled paper, and the right side shows its projection in the row direction. Rows are extracted by cutting at the "valleys" of the projection image.
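Cutting rows at the valleys of the projection can be sketched as follows. This is a simplified illustration that treats a valley as a run of scan lines with less than `min_ink` ink pixels; the function name and that threshold are assumptions, and the source does not specify how valleys are detected.

```python
import numpy as np

def extract_rows(ink, min_ink=1):
    """Split a boolean ink mask into text rows.

    Returns (top, bottom) index pairs, bottom exclusive, one per run of
    consecutive scan lines whose ink count is at least `min_ink`.
    """
    profile = ink.sum(axis=1)          # projection along the ruled-line direction
    occupied = profile >= min_ink
    # Pad with False so that every run has a rising and a falling edge.
    padded = np.concatenate(([False], occupied, [False]))
    edges = np.flatnonzero(padded[1:] != padded[:-1])
    return [(int(top), int(bottom)) for top, bottom in zip(edges[0::2], edges[1::2])]
```

For vertically written text the same function can be applied to the transposed mask (`ink.T`) to obtain columns instead of rows.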

The standard character font selection function 24 corrects the inclination of each character string extracted by the row or column extraction function 23, compares it with the character strings of the corresponding characters in each of the several standard character fonts installed in the computer in advance by the standard character font group storage function 21, and selects the standard character font that is, overall, most similar to the registered character fonts.

First, for the character string of each extracted row, the inclination of the characters is detected string by string and the inclination of the string is corrected. Some users write slanted character strings, and detecting and correcting the slant allows character segmentation to be performed on such strings as well. Two methods of detecting and correcting the inclination are proposed: (a) one based on the variance of the vertical projection of the character string, and (b) one based on the stroke directions at the black pixels of the character string.

Method (a) is described first. A vertical projection is created for the character string of each extracted row (see FIG. 6). If the string is not slanted, the projection values vary widely (large variance); if it is slanted, they vary little (small variance). Taking as the reference a character string written upright, that is, perpendicular to the ruled lines, the slant of a string written by a user can be assumed to lie within ±45° of the reference at most. The variance of the vertical projection is therefore calculated for every direction within ±45° of the reference; the direction giving the largest variance is the slant of the string, and the string is corrected by slanting it in the opposite direction by the detected amount.
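Method (a) can be sketched as a search over candidate angles. The sketch below assumes that the slant is modeled as a horizontal shear of the rows and that candidates are tried in 1° steps; the source fixes only the ±45° range and the maximum-variance criterion. The names `shear_rows`, `projection_variance`, and `estimate_slant` are hypothetical.

```python
import numpy as np

def shear_rows(ink, angle_deg):
    """Shift every row horizontally by -(y - yc) * tan(angle).

    Returns a canvas widened by the image height on both sides so that no
    pixel falls outside it for |angle| <= 45 degrees.
    """
    h, w = ink.shape
    ys, xs = np.nonzero(ink)
    shift = (ys - (h - 1) / 2.0) * np.tan(np.radians(angle_deg))
    cols = np.rint(xs - shift).astype(int) + h
    out = np.zeros((h, w + 2 * h), dtype=bool)
    out[ys, cols] = True
    return out

def projection_variance(ink, angle_deg):
    """Variance of the vertical projection after un-shearing by angle_deg."""
    return shear_rows(ink, angle_deg).sum(axis=0).var()

def estimate_slant(ink, max_angle=45.0, step=1.0):
    """Return the candidate angle whose vertical projection has the largest variance."""
    angles = np.arange(-max_angle, max_angle + step, step)
    scores = [projection_variance(ink, a) for a in angles]
    return float(angles[int(np.argmax(scores))])
```

With this convention, `shear_rows(ink, estimate_slant(ink))` gives the corrected string, and the returned angle is the value that would be stored for restoring the slant later.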

Method (b) is described next. For the character string of each extracted row, the direction of the stroke (the line segment making up the character) is calculated at every black pixel of the string, and a histogram of stroke directions is created. When characters are written upright, strokes perpendicular to the ruled lines are the most numerous, so the histogram is statistically expected to peak in the direction perpendicular to the ruled lines. The direction at which the histogram for the user's string peaks is therefore taken as the stroke direction of that string. With the direction perpendicular to the ruled lines as the reference, the deviation of the string's stroke direction from the reference is detected as the slant of its strokes, and the string is corrected by slanting it in the opposite direction by the detected amount.
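A minimal sketch of method (b) follows. The source does not state how the stroke direction at a pixel is computed; here it is assumed to be the direction perpendicular to the local intensity gradient, accumulated into a histogram weighted by gradient magnitude. The function name and the 1° bin width are likewise assumptions.

```python
import numpy as np

def dominant_stroke_direction(ink, bins=180):
    """Return the most frequent stroke direction in degrees, in [0, 180).

    0 means a stroke running along the ruled line, 90 a stroke perpendicular to it.
    The returned value is the lower edge of the winning histogram bin.
    """
    img = ink.astype(float)
    gy, gx = np.gradient(img)              # derivatives along rows and columns
    magnitude = np.hypot(gx, gy)
    # A stroke runs perpendicular to the local intensity gradient.
    direction = (np.degrees(np.arctan2(gy, gx)) + 90.0) % 180.0
    hist, edges = np.histogram(direction, bins=bins, range=(0.0, 180.0),
                               weights=magnitude)
    return float(edges[int(np.argmax(hist))])
```

Under this convention the slant to be corrected is the difference between the returned direction and the 90° reference.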

The inclination detected by processes (a) and (b) is stored. Through the above processing, the character string written by the user becomes a string aligned perpendicular to the ruled lines.

Next, all the registered character strings thus aligned perpendicular to the ruled lines are compared with the corresponding character strings in each of the several standard character fonts installed in the computer in advance by the standard character font group storage function 21, and the standard character font that is, overall, most similar to the registered character fonts is selected.

The character segmentation function 25 cuts each character out of a character string by searching for corresponding points between the vertical projection of each inclination-corrected character string and the vertical projection of the corresponding character string in the standard character font.

The vertical projection of the character string in the selected standard character font is calculated, and corresponding points are searched for between it and the vertical projection of the inclination-corrected registered character string written by the user. The method used is minimization of the energy functional E(DX).

$$E(DX) = P(DX) + \lambda\, S(DX) \qquad (1)$$

$$P(DX) = \sum_{x} \bigl( f(x + DX(x)) - g(x) \bigr)^{2} \qquad (2)$$

$$S(DX) = \sum_{x} \left( \frac{\partial DX(x)}{\partial x} \right)^{2} \qquad (3)$$

where
P(DX): correspondence error between the projections when the displacement is applied,
S(DX): smoothness of the correspondence,
λ: weighting parameter,
f(x): vertical projection of the character string written by the user,
g(x): vertical projection of the standard character font,
DX(x): displacement in the x direction at position x.

By iteratively solving the Euler equation that minimizes E(DX) in equation (1), the displacement DX(x) at each position x of the projection is obtained (see FIG. 7). The standard font stored in advance already carries the information of which span of the projection corresponds to each character, so, based on that information and the displacement DX(x) measured at position x, the range occupied by each registered character, that is, its character width, can be extracted. The spacing between adjacent characters, that is, the inter-character space information, can therefore be extracted as well (see FIG. 8). This is nothing other than character segmentation. The inter-character space information extracted in this way is quantified as the writer's individuality and stored.
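The minimization of equations (1) to (3) can be illustrated with plain gradient descent on a discretized E(DX). This is a stand-in for the iterative Euler-equation solution mentioned in the text, not the numerical scheme of the source, which is not specified. The function names, the values of `lam`, `step`, and `iters`, and the assumption that both projections have the same length are all illustrative.

```python
import numpy as np

def energy(dx, f, g, lam):
    """Discretized E(DX) = P(DX) + lam * S(DX), following equations (1) to (3)."""
    x = np.arange(len(g), dtype=float)
    data = np.sum((np.interp(x + dx, x, f) - g) ** 2)   # P(DX)
    smooth = np.sum(np.diff(dx) ** 2)                   # S(DX), forward differences
    return data + lam * smooth

def match_projections(f, g, lam=0.5, step=0.1, iters=500):
    """Estimate DX(x) such that f(x + DX(x)) approximates g(x)."""
    f = np.asarray(f, dtype=float)
    g = np.asarray(g, dtype=float)
    x = np.arange(len(g), dtype=float)
    df = np.gradient(f)                    # slope of the handwritten projection
    dx = np.zeros_like(g)
    for _ in range(iters):
        residual = np.interp(x + dx, x, f) - g
        slope = np.interp(x + dx, x, df)
        # Discrete second derivative of DX with free (Neumann) ends.
        lap = np.zeros_like(dx)
        lap[1:-1] = dx[2:] - 2.0 * dx[1:-1] + dx[:-2]
        lap[0] = dx[1] - dx[0]
        lap[-1] = dx[-2] - dx[-1]
        grad = 2.0 * residual * slope - 2.0 * lam * lap
        dx -= step * grad
    return dx
```

Since f(x + DX(x)) is matched to g(x), a character boundary known at position b in the standard-font projection corresponds to position b + DX(b) in the handwritten projection; character widths and the gaps between neighbouring characters follow from those mapped boundaries.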

When this segmentation method was actually applied to a large number of handwritten characters, segmentation succeeded at a rate of 95% or more (see FIG. 9), which shows that the method can segment handwritten text into individual characters with a high success rate. Vertically written text can be segmented character by character in the same way by applying the same processing.

The size information extraction function 26 first restores the original inclination of each registered character, using the string inclination stored by the standard character font selection function 24. It then compares the size of each registered character font with that of the corresponding character in the standard character font selected by the standard character font selection function 24, extracts the difference in size (vertical and horizontal), quantifies it as the writer's individuality, and stores it. The size difference may be stored as an average over all registered character fonts, or averaged and stored separately for kanji and for alphanumeric and kana characters.

The displacement information extraction function 27 compares each registered character font with the corresponding character in the standard character font selected by the standard character font selection function 24, extracts by two-dimensional corresponding-point search the geometric displacement (which has both direction and magnitude) of the registered character font relative to the standard character font at each position in the image, quantifies it as the writer's individuality, and stores it.

The function 3 for generating character fonts with individuality information generates and outputs a character font with individuality information for any character by adding the individuality information extracted by the individuality information extraction and registration function 2 to the standard character font.

First, the standard character font calling function 31 calls the standard character font selected in the individuality information extraction and registration function 2. The font used therefore differs from user to user.

Next, the keyboard input function 32 takes the character codes entered on the keyboard of the input unit 12 and temporarily stores in memory the image data of the corresponding characters in the user-specific standard character font, that is, the one called by the standard character font calling function 31 from among the multiple standard character fonts installed in the computer in advance. Like other character fonts such as Mincho and Gothic, each standard character font contains data for all characters.

The displacement information adding function 33 adds displacement information to each character of the standard character font held in memory. That is, it applies to the standard character font a displacement equal to the average geometric displacement at each position in the image, as registered for that user by the displacement information extraction function 27.

The size information adding function 34 determines the size of the standard character font to which displacement information has been added, and outputs it. That is, it rescales the displaced standard character font by a factor that accords with the average character size registered for that user by the size information extraction function 26.

The inter-character space information adding function 35 determines the inter-character spacing for the standard character font to which displacement information has been added and whose size has been set, and outputs it. That is, it outputs each character separated by the inter-character space registered for that user by the character segmentation function 25.

The user-specific inter-character space information can be added in the following ways.
(1) The average of all the inter-character space values extracted by the character segmentation function 25 is registered as the user-specific inter-character space information and is output by the inter-character space information adding function 35.
(2) The average of all the inter-character space values extracted by the character segmentation function 25 is registered, and the inter-character space information adding function 35 outputs it with random variation applied.
(3) The character segmentation function 25 quantifies and registers the individuality of the inter-character spacing for each combination of adjacent character types, such as hiragana, kanji, and punctuation marks: for example, between hiragana and hiragana, between hiragana and kanji, and between hiragana and a punctuation mark. The inter-character space information adding function 35 then adds the inter-character space for each such combination of adjacent character types.
(4) The character segmentation function 25 quantifies and registers the individuality of the inter-character spacing according to the complexity of the vertical projection (for example, the magnitude of its variance). For example, the projection complexity is divided into large, medium, and small, and the inter-character space is quantified and registered for each combination of the projection complexities of adjacent characters: between large and large, between large and medium, between large and small, and so on. The inter-character space information adding function 35 then adds the inter-character space for each such combination.
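Method (3) above, with the random variation of method (2) as an optional extra, might be sketched as follows. The character classes, the Unicode ranges used to detect them, the function names, and the uniform jitter model are all assumptions for illustration; the source only says that spacing is registered per combination of adjacent character types and that random variation may be applied.

```python
import random
from collections import defaultdict

def char_type(ch):
    """Classify a character into a coarse type (illustrative Unicode ranges)."""
    code = ord(ch)
    if ch in "、。，．":
        return "punct"
    if 0x3040 <= code <= 0x309F:
        return "hiragana"
    if 0x30A0 <= code <= 0x30FF:
        return "katakana"
    if 0x4E00 <= code <= 0x9FFF:
        return "kanji"
    return "other"

def register_spacing(text, gaps):
    """Average the measured gaps (len(text) - 1 values) per adjacent type pair."""
    sums = defaultdict(lambda: [0.0, 0])
    for left, right, gap in zip(text, text[1:], gaps):
        entry = sums[(char_type(left), char_type(right))]
        entry[0] += gap
        entry[1] += 1
    return {pair: total / count for pair, (total, count) in sums.items()}

def spacing_for(text, table, default, jitter=0.0, rng=None):
    """Look up a gap for each adjacent pair in `text`, with optional random jitter."""
    rng = rng or random.Random()
    gaps = []
    for left, right in zip(text, text[1:]):
        base = table.get((char_type(left), char_type(right)), default)
        gaps.append(base + rng.uniform(-jitter, jitter))
    return gaps
```

For method (4), the same table structure would work with `char_type` replaced by a function that maps each character to a projection-complexity class (large, medium, or small).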

The individualized character font output function 36 displays the character font with individuality, to which the displacement information, size information, and inter-character space information have been added, on the display unit 15, or prints it with the printing unit 16.

Note that the present invention is not limited to the above embodiment.

It is desirable to add both the displacement information and the size information as the individuality information, but only one of them may be added.

It is desirable that users be able to control, according to their preference, how much of their own individuality information is added.
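One way such control could work is sketched below in Python. The description does not specify a mechanism, so the linear scaling factor `amount`, the function name `apply_individuality`, and the representation of a glyph as a list of (x, y) points with one displacement per point are all assumptions made for illustration.

```python
def apply_individuality(standard_points, displacements, amount=1.0):
    """Shift each standard-font point by a fraction of its extracted displacement.

    amount = 0.0 reproduces the standard font and amount = 1.0 applies the
    full extracted displacement (hypothetical linear control).
    """
    if len(standard_points) != len(displacements):
        raise ValueError("each point needs exactly one displacement")
    return [
        (x + amount * dx, y + amount * dy)
        for (x, y), (dx, dy) in zip(standard_points, displacements)
    ]
```

For example, `apply_individuality(points, displacements, amount=0.5)` moves each point halfway toward its handwritten position.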

The standard character font is preferably a proportional font.

In the above embodiment, the displacement information extraction function was described as one that extracts the geometric displacement by a two-dimensional corresponding point search. However, any method may be used as long as it extracts the displacement of corresponding points of the registered character font relative to the standard character font.

The handwritten-style character font output system of the present invention can also be realized as a program that causes a computer to function as this handwritten-style character font output system. This program may be stored on a computer-readable recording medium.

The recording medium on which this program is recorded may be the ROM 14 itself shown in FIG. 1, or it may be a CD-ROM or the like that is read by inserting it into a program reading device, such as a CD-ROM drive, provided as an external storage device.

The recording medium may also be a magnetic tape, a cassette tape, a flexible disk, a hard disk, an MO, MD, DVD, or the like, or a semiconductor memory.

FIG. 1 is a block diagram showing the configuration of a handwritten-style character font output system according to one embodiment of the present invention.
FIG. 2 is a functional block diagram showing the functions of the handwritten-style character font output system according to this embodiment.
FIG. 3 is a diagram showing an example of handwritten text.
FIG. 4 is a diagram showing an example of the notepaper used in this embodiment.
FIG. 5 is a diagram showing an example of the projection image of a line.
FIG. 6 is a diagram showing an example of a vertical projection image.
FIG. 7 is a diagram explaining the corresponding point search for character projections.
FIG. 8 is a diagram showing the relationship between character width information and inter-character space information.
FIG. 9 is a diagram showing the success rate of character segmentation.

Claims

A handwritten-style character font output system comprising:
a first memory for storing m (m: natural number) standard character fonts;
a second memory for storing a series of n (n: natural number, 2 ≦ n < m) character fonts input as images;
inclination correcting means for correcting the inclination of the input character fonts stored in the second memory;
character segmentation means for reading out from the first memory the character fonts corresponding to the character fonts whose inclination has been corrected by the inclination correcting means, comparing them, extracting character intervals, and quantifying and registering the extracted character intervals for each combination of adjacent character types;
displacement information extracting means for comparing each character font cut out by the character segmentation means with the corresponding standard character font stored in the first memory, extracting the displacement of corresponding points, and extracting statistical information on that displacement;
displacement information adding means for changing and outputting the character fonts stored in the first memory in accordance with the output of the displacement information extracting means; and
character space information adding means for outputting the character fonts output by the displacement information adding means at character intervals corresponding to the character intervals registered by the character segmentation means.

A handwritten-style character font output system comprising:
a first memory for storing m (m: natural number) standard character fonts;
a second memory for storing a series of n (n: natural number, 2 ≦ n < m) character fonts input as images;
inclination correcting means for correcting the inclination of the input character fonts stored in the second memory;
character segmentation means for reading out from the first memory the character fonts corresponding to the character fonts whose inclination has been corrected by the inclination correcting means, comparing them, extracting character intervals, and quantifying and registering the extracted character intervals for each combination of the projection complexities of adjacent characters;
displacement information extracting means for comparing each character font cut out by the character segmentation means with the corresponding standard character font stored in the first memory, extracting the displacement of corresponding points, and extracting statistical information on that displacement;
displacement information adding means for changing and outputting the character fonts stored in the first memory in accordance with the output of the displacement information extracting means; and
character space information adding means for outputting the character fonts output by the displacement information adding means at character intervals corresponding to the character intervals registered by the character segmentation means.