JPH07129717A

JPH07129717A - Online character segmenting device

Info

Publication number: JPH07129717A
Application number: JP5273687A
Authority: JP
Inventors: Kenji Okano; 健治岡野; Koji Matsumoto; 浩司松本; Hidesato Ichii; 英里一井
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1993-11-01
Filing date: 1993-11-01
Publication date: 1995-05-19

Abstract

PURPOSE:To provide an online character segmenting device reduced in character segmentation error of separate characters, characters with voiced sound symbols, and characters with semi-voiced sound symbols. CONSTITUTION:The online character segmenting device is provided with a voiced sound symbol discriminating means which discriminates whether two continuous strokes inputted by writing are a voiced sound symbol or not and discriminates two strokes as one segment of the voiced sound symbol in the case of the voiced sound symbol, a voiced sound symbol/semi-voiced sound symbol coupling means 5 which couples a voiced sound symbol or a semi-voiced sound symbol to the just preceding fundamental segment, and a KANA (Japanese syllabary) separate character coupling means 6 which couples two continuous fundamental segments in the case of two fundamental segments constituting a KANA separate character, and characters with voiced sound symbols, characters with semi-voiced sound symbols, KANA separate characters, and characters easy to separate at the time of writing are segmented.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、筆記された文字列のデ
ータから各文字のデータをオンラインで切出すオンライ
ン文字切出し装置に関し、特にオンライン文字認識装置
に適応して好適なものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an online character slicing device for slicing online each character data from written character string data, and is particularly suitable for an online character recognition device.

【０００２】[0002]

【従来の技術】オンライン文字認識装置においては、例
えば、タブレットやスタイラスペンでなる座標入力装置
を用いて入力者が筆記した文字列のデータから各文字の
データをオンラインで切出すことが行なわれる。当然に
この切出し精度も文字認識率に影響を与え、正確に行な
われることが求められている。筆記された文字列のデー
タから各文字のデータを切出す方法としては、従来、文
字枠（枡目）を表示してその中に筆記させることで切出
しを行なう方法が一般的であった（『特開昭５５―８３
９７０号公報』）。このような切出し方法の他にも、例
えば文字を構成する偏（へん）や旁（つくり）などの要
素毎に行なう文字認識を利用してその組み合わせの妥当
性等から文字を切出す文字認識処理を利用した方法
（『特開昭６１―２９９８２号公報』）が既に提案され
ている。2. Description of the Related Art In an online character recognition device, data of each character is cut online from a character string data written by an input person using a coordinate input device such as a tablet or a stylus pen. Naturally, this cutting accuracy also affects the character recognition rate, and it is required to be performed accurately. As a method of cutting out the data of each character from the data of a written character string, conventionally, a method of displaying a character frame (mesh) and allowing the user to write in it has been generally used. JP-A-55-83
970 publication "). In addition to such a cutout method, for example, character recognition processing that cuts out characters based on the validity of the combination by using character recognition that is performed for each element such as bias and structure that composes a character. A method utilizing "(Japanese Patent Application Laid-Open No. 61-29982") has already been proposed.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、上記の
方式では、文字幅情報を用いて切り出しを行なっている
ため、筆記された文字の文字幅が所定の範囲から外れて
しまった場合、誤って切出しを行なってしまうという問
題点があった。また、一般的な日本語の文章には平仮名
や片仮名が多く含まれおり、「は」，「ほ」などの分離
文字や「が」，「ぎ」などの濁点文字、「ぱ」，「ぴ」
などの半濁点文字が正しく切り出せない場合があった。However, in the above method, since the character width information is used for clipping, when the character width of the written character is out of the predetermined range, the character width is erroneously cut out. There was a problem that it did. In addition, a lot of hiragana and katakana are included in general Japanese sentences. Separation characters such as “ha” and “ho”, dakuten characters such as “ga” and “gi”, and “pa” and “pi”. "
In some cases, the semi-voiced characters such as "" could not be cut out correctly.

【０００４】本発明は、以上述べた従来技術の問題点を
解決し、濁点文字、半濁点文字、仮名分離文字及び筆記
時分離しやすい文字に対し文字切出し誤りが少ないオン
ライン文字切出し装置を提供することを目的とする。The present invention solves the above-mentioned problems of the prior art, and provides an online character segmentation device with less character segmentation errors for dakuten characters, semi-dakuten characters, kana separated characters, and characters that are easily separated during writing. The purpose is to

【０００５】[0005]

【課題を解決するための手段】前記問題点を解決するた
めに、本発明においては、オンライン文字切出し装置に
おいて、文字を筆記する時の入力文字の位置情報を座標
データで出力する座標入力装置と、座標入力装置から出
力された座標データに基づいて、入力文字が筆記される
のに伴い逐次ストロークを抽出しコード化するストロー
クコード化手段と、２つの連続するストロークが「濁
点」であるかを判定し、「濁点」の場合には２つのスト
ロークを濁点として１つのセグメントと判定する濁点判
定手段と、各々のストロークと前記濁点判定手段で出力
されたセグメントの重なりを検出し、重なりあっている
ものを１つのセグメントとして出力する基本セグメント
分割手段と、基本セグメントが濁点もしくは半濁点のみ
で構成されている場合に、直前の基本セグメントと結合
する濁点・半濁点結合手段と、連続する２つの基本セグ
メントが仮名分離文字の場合には２つの基本セグメント
を結合する仮名分離文字結合手段とを設けたものであ
る。In order to solve the above problems, the present invention provides a coordinate input device for outputting position information of an input character when writing a character as coordinate data in an online character cutting device. , Based on the coordinate data output from the coordinate input device, stroke coding means for sequentially extracting and coding strokes as the input character is written, and whether two consecutive strokes are "dakuten" In the case of “dakuten”, two strokes are treated as a dakuten and one segment is judged as one segment, and the overlap of each stroke and the segment output by the dakuten judging means is detected and overlapped. When the basic segment is composed of only the dakuten or the semi-dakuten, and the basic segment dividing means that outputs an item as one segment In addition, there is provided a dakuten / semi-dakuten connecting means for connecting the immediately preceding basic segment and a kana separated character connecting means for connecting two basic segments when two consecutive basic segments are kana separated characters. .

【０００６】[0006]

【作用】本発明では以上のようにオンライン文字切出し
装置を構成したもので各手段は以下のとおり作用する。
座標入力手段は、入力者が筆記した文字の位置情報を逐
次座標データで出力する。ストロークコード化手段は、
前記座標データよりストロークを抽出しコード化を行な
う。コード化は、例えば１ストローク分の入力データ列
の特徴点に基づき、各セグメントのサイン、セグメント
の角度、セグメント間の回転角度によりストロークを分
類し行う。特徴点は、例えば入力データ列のデータ間の
ｘ，ｙ方向のサインを求めサインの状態の変化点を特徴
点とする等の方法により抽出する。ここでサインとは、
入力データ間でＸ方向、Ｙ方向それぞれの座標値の差が
正か負か０かを示すものである。濁点判定手段は、２つ
の連続するストロークがともに「上の方にある小さなス
トローク」である場合に２つのストロークは濁点である
と判定し２つのストロークを１つのセグメントとして出
力する。基本セグメント分割手段は、各ストローク及び
濁点判定手段で出力されたセグメントをＸ軸上に投影
し、重なりあったストロークおよびセグメントを１つの
基本セグメントとし濁点・半濁点結合手段に出力する。
濁点・半濁点結合手段は、基本セグメントが濁点及び半
濁点のみで構成されている場合に直前の基本セグメント
と結合する。半濁点であるための条件は基本セグメント
のストローク数が１で、ストロークが小さく丸い場合で
ある。ストロークが丸いかの判定はストロークコードに
より容易に判定することができる。仮名分離文字結合手
段は、２つの連続する基本セグメントが仮名分離文字で
あるための条件を満たすかを判定する。仮名分離文字と
は「け」，「に」，「は」，「ほ」，「ハ」等の平仮名
・片仮名の分離文字である。仮名分離文字であるための
条件はストローク数及びストロークコード等により記述
されている。仮名分離文字であるための条件を満たした
場合には２つの基本セグメントを結合する。以上のよう
な構成により、日本語文章に頻繁に出現し、かつ筆記時
分離しやすい濁点文字，半濁点文字，仮名分離文字を切
り出すことができるAccording to the present invention, the online character cutting device is constructed as described above, and each means operates as follows.
The coordinate input means successively outputs position information of characters written by the input person as coordinate data. The stroke coding means is
A stroke is extracted from the coordinate data and coded. The encoding is performed by classifying strokes according to the signature of each segment, the angle of the segment, and the rotation angle between the segments, for example, based on the characteristic points of the input data string for one stroke. The characteristic points are extracted by, for example, a method of obtaining a sign in the x and y directions between the data of the input data string and using a change point of the state of the sign as a characteristic point. The sign here is
It indicates whether the difference between the coordinate values in the X direction and the Y direction between the input data is positive, negative, or zero. The dakuten judging means judges that the two strokes are dakuten and outputs the two strokes as one segment when the two consecutive strokes are both “small strokes on the upper side”. The basic segment dividing means projects the segments output by each stroke and the dakuten judging means on the X-axis, and outputs the overlapping strokes and segments as one basic segment to the dakuten / semi-dakuten combining means.
The dakuten / semi-dakuten combining means connects with the immediately preceding basic segment when the basic segment is composed of only the dakuten and the semi-dakuten. The condition for the semi-voiced point is that the number of strokes of the basic segment is 1, and the stroke is small and round. Whether the stroke is round or not can be easily determined by the stroke code. The kana separated character combining means determines whether two consecutive basic segments satisfy the condition for being a kana separated character. Kana separation characters are separation characters for hiragana and katakana such as "ke", "ni", "ha", "ho", and "ha". The condition for being a kana separated character is described by the number of strokes, stroke code, and the like. Two basic segments are combined when the condition for being a kana separation character is satisfied. With the above configuration, it is possible to cut out dakuten characters, semi-dakuten characters, and kana separated characters that frequently appear in Japanese sentences and are easy to separate during writing.

【０００７】[0007]

【実施例】以下、本発明の実施例について図面を参照し
ながら詳細に説明する。図１は本発明の実施例のオンラ
イン文字切出し装置の構成を示す図である。同図に示す
ように、本オンライン文字切出し装置は、座標入力装置
１、ストロークコード化手段２、濁点判定手段３、基本
セグメント分割手段４、濁点・半濁点結合手段５、仮名
分離文字結合手段６、基本セグメント出力制御手段７、
文字切出し手段８から構成される。なお図中９は図示し
ないオンライン文字認識装置への出力端子を示す。図２
は図１の装置の動作の概略を示すフローチャートであ
り、ストロークコード化手段２はステップ１０１を、濁
点判定手段３はステップ１０２〜１０４を、基本セグメ
ント分割手段４はステップ１０５を、濁点・半濁点結合
手段５はステップ１０６〜１０８を、仮名分離文字結合
手段６はステップ１０９〜１１１を、基本セグメント出
力制御手段７はステップ１１２を実行する。図３は連続
する２つの基本セグメントが仮名分離文字であるための
条件の一例を示す図である。以下、本実施例の装置の動
作を各手段ごとに説明する。Embodiments of the present invention will now be described in detail with reference to the drawings. FIG. 1 is a diagram showing a configuration of an online character cutting device according to an embodiment of the present invention. As shown in the figure, this online character cutting device includes a coordinate input device 1, a stroke coding means 2, a dakuten judging means 3, a basic segment dividing means 4, a dakuten / semi-dakuten combining means 5, and a kana separated character combining means 6. , Basic segment output control means 7,
It is composed of the character cutting means 8. In the figure, 9 indicates an output terminal to an online character recognition device (not shown). Figure 2
2 is a flow chart showing the outline of the operation of the apparatus of FIG. 1, in which the stroke coding means 2 carries out step 101, the dakuten judging means 3 carries out steps 102 to 104, the basic segment dividing means 4 carries out step 105, and the dakuten / semi-dakuten. The combining means 5 executes steps 106 to 108, the kana separated character combining means 6 executes steps 109 to 111, and the basic segment output control means 7 executes step 112. FIG. 3 is a diagram showing an example of a condition for two consecutive basic segments being a kana separated character. The operation of the apparatus of this embodiment will be described below for each means.

【０００８】先ず、座標入力装置１は文字を入力するも
のであり、例えばタブレット、マウス、ライトペン、タ
ッチパネルなどで実現される。座標入力装置１は文字が
入力されるとデータ列｛(xi,yi),i=1,2,…,nj｝jを抽出
し、ストロークコード化手段２に送る。ストロークコー
ド化手段２は前記データ列よりストロークを抽出し、ス
トローク毎の特徴点情報に基づき、各ストロークをその
形状により、あらかじめ用意された数種類のコードに割
り当てる。特徴点は、例えば入力データ列のｘ，ｙ方向
それぞれのサインの変化点を特徴点とする等の方法によ
り抽出する。コード化としては例えば各セグメントの
Ｘ，Ｙサイン、セグメントの角度、セグメント間の回転
角度により分類し行なう。このコード化されたデータは
濁点判定手段３、濁点・半濁点結合手段５及び仮名分離
文字結合処理６で使用する。First, the coordinate input device 1 is for inputting characters, and is realized by, for example, a tablet, a mouse, a light pen, a touch panel, or the like. When a character is input, the coordinate input device 1 extracts a data string {(xi, yi), i = 1,2, ..., nj} j and sends it to the stroke encoding means 2. The stroke coding means 2 extracts strokes from the data string and assigns each stroke to several kinds of codes prepared in advance according to the shape thereof based on the characteristic point information for each stroke. The characteristic points are extracted by, for example, using the change points of the signs in the x and y directions of the input data string as the characteristic points. The coding is performed, for example, by classifying the X and Y signs of each segment, the angle of the segment, and the rotation angle between the segments. This coded data is used by the dakuten judging means 3, the dakuten / semi-dakuten combining means 5, and the kana separated character combining process 6.

【０００９】濁点判定手段３は、２つの連続するストロ
ークがともに「上の方にある小さなストローク」である
場合に、２つのストロークを結合し１つのセグメントと
する。図４は濁点判定の説明のための図である。「上の
方にある小さなストローク」という条件は、図４におい
て、ストロークの横幅をXh、ストロークの縦方向の幅を
Yh、ストロークの高さをHeight、１行の高さをLineheig
ht 、α,βを１以下の定数とすると、 Xh < α × Lineheight （１） Yh < α × Lineheight （２） Height > β × Lineheight （３）（１）〜（３）式の全ての条件を満たした場合である。
α，βは筆記者や筆記条件によって適宜設定する。When the two consecutive strokes are both "small strokes on the upper side", the dakuten judging means 3 combines the two strokes into one segment. FIG. 4 is a diagram for explaining the dakuten determination. The condition of "a small stroke in the upper part" is that the horizontal width of the stroke is Xh and the vertical width of the stroke is
Yh, stroke height is Height, and line height is Lineheig
If ht, α, β are constants of 1 or less, Xh <α × Lineheight (1) Yh <α × Lineheight (2) Height> β × Lineheight (3) All conditions of the expressions (1) to (3) This is the case when it is satisfied.
α and β are set appropriately depending on the writer and the writing conditions.

【００１０】基本セグメント分割手段４は、各ストロー
ク及び濁点判定手段３で出力されたセグメントをＸ軸上
に投影し、重なりあったストロークおよびセグメントを
１つの基本セグメントとし濁点・半濁点結合手段５に出
力する。図５に基本セグメント分割手段４により分割さ
れた基本セグメントを示す。The basic segment dividing means 4 projects the segments output from each stroke and the dakuten judging means 3 on the X-axis, and makes the overlapping strokes and segments one basic segment to the dakuten / semi-dakuten combining means 5. Output. FIG. 5 shows basic segments divided by the basic segment dividing means 4.

【００１１】次にセグメント分割を行なう方法を詳細に
説明する。先ず、ストローク及び濁点判定手段３で出力
されたセグメント毎にＸ方向の最大値、最小値を求め
る。それぞれの値をXMAXi,XMINi（iは筆記された順番を
しめす；i=1〜SMAX:SMAXは濁点判定手段３で結合されて
いないストロークの数と濁点判定手段３で出力されたセ
グメントの数の和である）。最初にストローク及び濁点
判定手段３で出力されたセグメントをそれぞれ１つのセ
グメントと仮定する。 k番目のセグメントの処理を行な
う場合、そのセグメントのＸ方向の最小値（XMINk ）
が、それ以前のセグメント（それまでの処理結果が反映
される）のＸ方向の最大値（XMAXi ;i < k）を下回らな
いか（XMINk <= XMAXi）をチェックする。iは1から順に
変化させ最初に条件を満たしたものを採用する。もし、
条件を満たした場合には、第iセグメントから第kセグメ
ントまでを一つのセグメントとする。以上の操作をkを1
からSMAXまで変化させて順次処理することにより基本セ
グメントの分割を行なうことができる。分割して生成さ
れた基本セグメントは濁点、半濁点結合手段５に出力さ
れる。Next, a method of performing segment division will be described in detail. First, the maximum value and the minimum value in the X direction are obtained for each segment output by the stroke and cloud point determination means 3. The respective values are XMAXi, XMINi (i indicates the order in which they are written; i = 1 to SMAX: SMAX is the number of strokes that are not combined by the dakuten judging means 3 and the number of segments output by the dakuten judging means 3. Is the sum). First, it is assumed that the segments output by the stroke and dull point determination means 3 are each one segment. When processing the kth segment, the minimum value in the X direction of that segment (XMINk)
Does not fall below the maximum value (XMAXi; i <k) in the X direction of the previous segment (which reflects the processing results up to that point) (XMINk <= XMAXi). i is changed in order from 1, and the one that satisfies the condition first is adopted. if,
When the condition is satisfied, the i-th segment to the k-th segment are regarded as one segment. The above operation is k 1
It is possible to divide the basic segment by changing from SMAX to SMAX and processing sequentially. The basic segment generated by the division is output to the dakuten / semi-dakuten combining means 5.

【００１２】濁点・半濁点結合手段５は基本セグメント
が濁点及び半濁点のみで構成されている場合に直前の基
本セグメントと結合する。基本セグメントが独立した濁
点である条件は、基本セグメントにはストロークが２つ
しか存在せず、かつ濁点判定処理３で濁点と判定されて
いることである。基本セグメントが独立した半濁点であ
るための条件は、基本セグメントのストローク数が１
で、含まれるストロークのストロークコードが丸を示す
コードでなければならない。The dakuten / semi-dakuten combining means 5 joins the immediately preceding basic segment when the basic segment is composed of only the dakuten and the semi-dakuten. The condition that the basic segment is an independent dakuten is that there are only two strokes in the basic segment and that the dakuten judgment processing 3 determines that the stroke is a dakuten. The condition that the basic segment is an independent semi-voiced point is that the number of strokes in the basic segment is 1
, The stroke code of the included stroke must be a code indicating a circle.

【００１３】仮名分離文字結合手段６は、２つの連続す
る基本セグメントが仮名分離文字であるための条件を満
たすかを判定する。仮名分離文字であるための条件は、
例えば図３に示すように、それぞれの基本セグメントの
ストローク数及び含まれるストロークのストロークコー
ド等により記述されている。ただし、基本ストロークの
ストローク数及びストロークコードは濁点、半濁点を除
いたものである。図６に示すように、「じ」が１つの基
本セグメントである場合、ストローク数は「１」でスト
ロークコードは「し」である。２つの基本セグメントが
仮名分離文字であるための条件を全て満たした場合には
２つの基本セグメントを結合する。The kana separated character combining means 6 determines whether or not two consecutive basic segments satisfy the condition for being a kana separated character. The conditions for being a kana separator are:
For example, as shown in FIG. 3, it is described by the stroke number of each basic segment, the stroke code of the included stroke, and the like. However, the stroke number and stroke code of the basic stroke do not include dakuten and semi-dakuten. As shown in FIG. 6, when "ji" is one basic segment, the number of strokes is "1" and the stroke code is "shi". When all the conditions for the two basic segments to be kana separation characters are satisfied, the two basic segments are combined.

【００１４】基本セグメント出力制御手段７は、基本セ
グメントが１文字として確定されている場合（濁点文
字、半濁点文字、仮名分離文字）には、切出し手段をス
キップして出力端子９に基本セグメントを出力する。基
本セグメントが未処理の場合、連続した未処理の基本セ
グメントを１つにまとめて文字切出し手段８に出力す
る。When the basic segment is determined as one character (voiced character, semi-voiced character, kana separated character), the basic segment output control means 7 skips the cutting means and outputs the basic segment to the output terminal 9. Output. When the basic segments are unprocessed, the continuous unprocessed basic segments are collected into one and output to the character cutting means 8.

【００１５】文字切出し手段８は基本セグメント出力制
御手段７より送られてきた基本セグメント群を結合する
ことにより文字切出しを行なう。文字切出しとしては、
例えば、連続する２つの基本セグメントを結合した場合
の横幅が、あるしきい値以下である場合にのみ２つの基
本セグメントを結合し、条件を満たさない場合は１つの
基本セグメントを１文字として確定する。以上の操作を
入力された全ての基本セグメントに対して行なう。また
本発明では上記の様な方法を用いたが、文字切出し手
段６は既存のあらゆる文字切出しを用いることができ
る。例えばペンアップ時間を検出して切出しを行なう方
式や文字認識を用いる方式など様々な手段を用いること
が可能である。The character cutting-out means 8 performs character cutting-out by combining the basic segment groups sent from the basic segment output control means 7. For character cutout,
For example, two basic segments are combined only when the width when two consecutive basic segments are combined is equal to or less than a certain threshold value, and when the condition is not satisfied, one basic segment is determined as one character. . The above operation is performed for all input basic segments. Further, in the present invention, the above method is used, but the character cutting means 6 can use any existing character cutting. For example, it is possible to use various means such as a method of detecting the pen-up time to perform clipping and a method of using character recognition.

【００１６】[0016]

【発明の効果】以上、詳細に説明したように、本発明に
よれば、上記のように濁点判定手段、濁点・半濁点結合
手段、仮名分離文字結合手段を設けたことにより、日本
語文章に頻繁に出現し、かつ筆記時分離しやすい文字、
濁点文字、半濁点文字、仮名分離文字を切出すことが可
能になる。As described above in detail, according to the present invention, by providing the dakuten judging means, the dakuten / semi-dakuten combining means, and the kana separated character combining means as described above, Characters that appear frequently and are easy to separate during writing,
It is possible to cut out dakuten characters, semi-dakuten characters, and kana separated characters.

[Brief description of drawings]

【図１】実施例の構成を示す図である。FIG. 1 is a diagram showing a configuration of an example.

【図２】実施例の動作を示すフローチャートである。FIG. 2 is a flowchart showing the operation of the embodiment.

【図３】仮名分離文字結合手段における仮名分離文字で
あるための条件を示す図である。FIG. 3 is a diagram showing conditions for being a kana separated character in a kana separated character combining means.

【図４】濁点判定手段における濁点判定を説明するため
の図である。FIG. 4 is a diagram for explaining a dakuten determination performed by a dakuten determination means.

【図５】基本セグメント分割手段により分割された基本
セグメントを示す図である。FIG. 5 is a diagram showing basic segments divided by a basic segment dividing unit.

【図６】仮名分離文字結合処理におけるストローク数及
びストロークコードの求め方を示す図である。FIG. 6 is a diagram showing how to determine a stroke number and a stroke code in a kana separated character combination process.

[Explanation of sign]

１…座標入力装置、２…ストロークコード化手段、
３…濁点判定手段、４…基本セグメント分割手段、５
…濁点・半濁点結合手段、６…仮名分離文字結合手段
、７…基本セグメント出力制御手段、８…文字切出
し手段。1 ... Coordinate input device, 2 ... Stroke coding means,
3 ... dakuten determination means, 4 ... basic segment dividing means, 5
... voiced sound / semi-voiced sound combining means, 6 ... kana separated character combining means, 7 ... basic segment output control means, 8 ... character cutting means.

Claims

[Claims]

1. A coordinate input device that outputs position information of an input character when writing a character as coordinate data, and based on the coordinate data output from the coordinate input device,
Stroke coding means for sequentially extracting and coding strokes as input characters are written, and determining whether two consecutive strokes are "voiced points". If "voiced points", the two strokes are voiced points. And a basic segment dividing means for detecting the overlap between each stroke and the segments output by the dakuten determination means, and outputting the overlapping ones as one segment, When the basic segment is composed of only dakuten or semi-dakuten, the dakuten that is combined with the immediately preceding basic segment
An online character segmentation device comprising a semi-voiced point combining means and a kana separated character combining means for connecting two basic segments when two consecutive basic segments are kana separated characters.

2. The cloud point determination means sets the conditions for the cloud point determination as horizontal width of stroke, vertical width of stroke,
The online character cutting device according to claim 1, wherein the stroke is determined from the height of the stroke and the height of one line.

3. The kana separated character combining means determines the condition for being a kana separated character from the stroke code and stroke number of the stroke excluding the dakuten and semi-dakuten in the basic segment. Item 1. The online character cutting device according to item 1.