JPH04342089A - Character input procedding method - Google Patents

Character input procedding method

Info

Publication number
JPH04342089A
JPH04342089A JP3114309A JP11430991A JPH04342089A JP H04342089 A JPH04342089 A JP H04342089A JP 3114309 A JP3114309 A JP 3114309A JP 11430991 A JP11430991 A JP 11430991A JP H04342089 A JPH04342089 A JP H04342089A
Authority
JP
Japan
Prior art keywords
character
line
range
handwritten
characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP3114309A
Other languages
Japanese (ja)
Inventor
Yoshiharu Kawabe
河部 芳晴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Ltd
Original Assignee
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Ltd filed Critical Hitachi Ltd
Priority to JP3114309A priority Critical patent/JPH04342089A/en
Publication of JPH04342089A publication Critical patent/JPH04342089A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Input (AREA)

Abstract

PURPOSE:To efficiently segment the handwritten sentence from a handwritten document by funding a line head position coordinate and a line range to determine a line position, and subsequently, finding a character head position in each line and a character range to determine a character position, and executing the character conversion and the character recognition. CONSTITUTION:A processing part 3 determines, first of all, a line position by finding a line head position coordinate and a line range from input data of a handwritten sentence (character) read by an OCR 1. Subsequently, the processing part 3 determines a character position by finding a character head position coordinate in each line and a line range. That is, the processing part 3 finds a start position and a range of the head character of a certain line, first of all, a first line, executes it with regard to all characters of this line and finds its one character range. The processing part 3 executes the character conversion (expanding or compression processing) to these dot image patterns in each character range, executes the character recognition with regard to image data obtained in such a way, and also, repeats the processing until that time with regard to each line. In such a way, the sentence can be segmented efficiently.

Description

【発明の詳細な説明】[Detailed description of the invention]

【0001】0001

【産業上の利用分野】本発明は文字入力処理方法に関し
、特に手書き文書からの手書き文章の切出しを効率的に
実行可能とする文字入力処理方法に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character input processing method, and more particularly to a character input processing method that makes it possible to efficiently extract handwritten text from a handwritten document.

【0002】0002

【従来の技術】従来、この種の文字入力処理装置におけ
る文字切出し方法としては、例えば、実開昭55−13
8362号公報,特開昭56−44972号公報に開示
されている方法が知られている。これらは、いずれも活
字ないしは印刷された文字を対象として、所定幅の行エ
リア内を検索して、各活字ないしは印刷された文字を切
出すという技術を示しているものである。
2. Description of the Related Art Conventionally, as a character extraction method in this type of character input processing device, for example,
Methods disclosed in Japanese Patent Application Laid-open No. 8362 and Japanese Patent Application Laid-Open No. 56-44972 are known. All of these techniques indicate a technique for searching within a line area of a predetermined width for typed or printed characters and cutting out each typed or printed character.

【0003】0003

【発明が解決しようとする課題】上記従来技術は、手書
き文書からの手書き文章の切出しについては配慮されて
いないものである。これは、手書き文書からの手書き文
章の切出しについては、文字の大きさや行間隔等、すべ
てが変動幅が大きく、これを自動的に行うことは極めて
困難であったからである。本発明は上記事情に鑑みてな
されたもので、その目的とするところは、従来の技術に
おける上述の如き問題を解消し、手書き文書からの手書
き文章の切出しを効率的に実行可能とする文字入力処理
方法を提供することにある。
Problems to be Solved by the Invention The above-mentioned prior art does not take into consideration the extraction of handwritten sentences from handwritten documents. This is because when extracting handwritten text from a handwritten document, everything such as character size and line spacing varies widely, and it is extremely difficult to do this automatically. The present invention has been made in view of the above circumstances, and its purpose is to solve the above-mentioned problems in the conventional technology and to provide a character input that enables efficient extraction of handwritten sentences from handwritten documents. The purpose is to provide a processing method.

【0004】0004

【課題を解決するための手段】本発明の上記目的は、O
CR(光学的文字読取り装置),キーボード,ディスプ
レイを有し、手書き文章(文字)を入力する文字入力処
理装置において、前記手書き文章(文字)の入力データ
から、まず、行先頭位置座標と行範囲を求めて行位置を
決定し、次に各行内の文字先頭位置と文字範囲を求めて
文字位置を決定し、得られたイメージデータについて文
字変換を行った後、文字認識を行うことを特徴とする文
字入力処理方法によって達成される。
[Means for Solving the Problems] The above object of the present invention is to
In a character input processing device that has a CR (optical character reader), a keyboard, and a display and inputs handwritten text (characters), first, from the input data of the handwritten text (characters), line start position coordinates and line range are determined. The line position is determined by determining the line position, then the character position is determined by determining the character start position and character range within each line, character conversion is performed on the obtained image data, and then character recognition is performed. This is achieved by a character input processing method.

【0005】[0005]

【作用】本発明に係る文字入力処理方法においては、手
書き文章(文字)の入力データから、まず、行先頭位置
座標と行範囲を求めて行位置を決定し、次に各行内の文
字先頭位置と文字範囲を求めて文字位置を決定し、得ら
れたイメージデータについて文字変換を行った後、文字
認識を行うようにしたので、どのような手書き文字から
も、自動的に文字入力を行うことが可能になる。
[Operation] In the character input processing method according to the present invention, from the input data of handwritten text (characters), the line position is determined by first finding the line start position coordinates and line range, and then the character start position within each line. The system determines the character position by determining the character range, performs character conversion on the obtained image data, and then performs character recognition, making it possible to automatically input characters from any handwritten character. becomes possible.

【0006】[0006]

【実施例】以下、本発明の実施例を図面に基づいて詳細
に説明する。図2は、本発明の一実施例を示す文字入力
処理装置の構成図であり、図中、1は手書き文章(文字
)を読み取るためのOCR(光学的文字読取り装置)、
2はOCR1により読み取った文字の変更,訂正を行う
ためのキーボード、3は後述する文字認識,活字変換等
の処理を行う処理部、4は上述のOCR1からのイメー
ジデータをバッファリングする入力バッファ、5は文字
認識,活字変換等の処理を行った後の文章を表示する表
示部、6は同プリンタを示している。図1は、処理部3
による、CR1により読み取った手書き文章(文字)の
行範囲および1文字範囲を判定する処理のフローチャー
ト、図3は、同処理の概念図である。CR1により読み
取った手書き文章(文字)は、図3では、横書きの原稿
を例として挙げているが、「0」と「1」のドットイメ
ージパターンとして入力バッファ4に入力される(ステ
ップ11)。次に、入力バッファ4に入力されたドット
イメージパターンのデータについて、処理部3により、
1行分の範囲の判定および1文字範囲の判定が行われる
(ステップ12,13)。
Embodiments Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. FIG. 2 is a block diagram of a character input processing device showing an embodiment of the present invention. In the figure, 1 is an OCR (optical character reading device) for reading handwritten text (characters);
2 is a keyboard for changing and correcting characters read by the OCR 1; 3 is a processing unit that performs processing such as character recognition and type conversion, which will be described later; 4 is an input buffer for buffering image data from the OCR 1; Reference numeral 5 indicates a display section that displays text after processing such as character recognition and type conversion, and 6 indicates the same printer. FIG. 1 shows the processing unit 3
FIG. 3 is a conceptual diagram of the process of determining the line range and single character range of handwritten text (characters) read by CR1. The handwritten text (characters) read by the CR1 is input to the input buffer 4 as a dot image pattern of "0" and "1", although FIG. 3 shows an example of a horizontally written document (step 11). Next, regarding the data of the dot image pattern input to the input buffer 4, the processing unit 3 performs the following:
The range for one line and the range for one character are determined (steps 12 and 13).

【0007】上述の1行分の範囲および1文字範囲の判
定は、以下に示す如く行われる。すなわち、まず、図3
に示す如く、X方向,Y方向の基準点(X0,Y0)か
ら、Y座標を+1ずつずらしながら、各位置において、
X座標X0からXnまでの、データの論理和が「1」と
なるY座標Yaを求める。なお、上記Xnは、用紙のサ
イズによって予め定められた位置である。Yaが1行目
文字範囲のスタート点になる。次に、1行目文字範囲の
最終Y座標を求める。これは、先程求めた1行目文字範
囲の最初のY座標(Ya)から、Y座標を+1ずつ移動
しながら、X座標X0からXnまでのデータの論理和が
連続する「0」となる Y座標の始まりを求めることに
より、Ybが定まる。以上の処理により、文章の 最初
の1行目の行範囲が決められる。次に、上述の如き処理
により決定された1行目の先頭文字のスタート位置およ
び範囲を求める。 先頭文字のスタート位置は、先程、行範囲として求めた
Y座標YaからYbまでの、データの論理和が連続して
「1」となるX座標の始まり、ここでは、Xbを求める
ことになる。また、文字範囲の最終X座標を求めるため
、先程求めた XbからX座標を+1ずつ移動しながら
、Y座標YaからYbまでの、データの論理和が連続し
て「0」になるX座標の始まり Xcを求めれば良い。 以上の処理により、1行目の先頭文字の位置および範囲
が求まる。 以下、同様の動作を繰り返し、1行目の全文字の1文字
範囲を決める。
The determination of the range for one line and the range for one character described above is performed as shown below. That is, first, Figure 3
As shown in , at each position, while shifting the Y coordinate by +1 from the reference point (X0, Y0) in the X direction and Y direction,
Find the Y coordinate Ya at which the logical sum of the data from the X coordinates X0 to Xn is "1". Note that the above Xn is a position predetermined depending on the size of the paper. Ya becomes the starting point of the first line character range. Next, find the final Y coordinate of the first line character range. This means that while moving the Y coordinate by +1 from the first Y coordinate (Ya) of the first line character range found earlier, the logical sum of the data from X coordinates X0 to Xn becomes a continuous "0" Y Yb is determined by finding the start of the coordinates. Through the above processing, the line range for the first line of the text is determined. Next, the start position and range of the first character in the first line determined by the process described above are determined. The start position of the first character is the start of the X coordinate where the logical sum of the data from the Y coordinate Ya to Yb found earlier as the line range continuously becomes "1"; here, Xb is found. Also, in order to find the final X coordinate of the character range, while moving the X coordinate by +1 from Xb found earlier, find the Just find the beginning Xc. Through the above processing, the position and range of the first character in the first line are determined. Thereafter, the same operation is repeated to determine a one-character range of all the characters in the first line.

【0008】上述の動作中、同一行内で前文字の最終座
標から次文字の先頭座標までの間隔が規定値以上となっ
た場合はこれをスペースとみなし、次の文字の先頭座標
を前にずらし、スペスの削除を行う。手書き文字は各文
字の大きさが異なるため、1文字範囲内のドットイメー
ジパターンを、所定の枠決めされた大きさ(例えば、2
4×24ドットイメージパターン)に伸長あるいは圧縮
処理を行い、その後、文字認識を行う(ステップ14,
15)。2行目以降についても、1行目と同様に、行範
囲の判定,文字の先頭位置および範囲の判定,イメージ
データの伸長・圧縮処理等(これらの処理を合せて「文
字変換」という)を行った後、得られたイメードデータ
について文字認識を行い、最終文字行までこれを繰り返
す(ステップ16,17)。なお、行間隔についても、
行内の空きスペース削除と同様に、前の行の最終座標か
ら次の行の先頭座標が規定値以上である場合には、行の
先頭座標を移動し、未記入の行を削除する。
During the above operation, if the interval from the final coordinate of the previous character to the first coordinate of the next character in the same line exceeds a specified value, this is regarded as a space, and the first coordinate of the next character is shifted forward. , delete the space. Since the size of each handwritten character is different, the dot image pattern within one character range is divided into a predetermined frame size (for example, 2
4x24 dot image pattern) is expanded or compressed, and then character recognition is performed (step 14,
15). For the second and subsequent lines, in the same way as the first line, the line range determination, character start position and range determination, image data expansion/compression processing, etc. (these processes are collectively referred to as "character conversion") are performed. After this, character recognition is performed on the obtained image data, and this is repeated until the final character line (steps 16 and 17). Regarding line spacing,
Similar to deleting empty space within a line, if the starting coordinate of the next line from the final coordinate of the previous line is greater than or equal to a specified value, the starting coordinate of the line is moved and the blank line is deleted.

【0009】次に、文章のフォーマッティングを行う。 行の最終文字以降に余分なスペースがあり、かつ、最終
文字が句点でない場合は、次の行の文字を規定桁数分だ
け、前段の行に移動する。なお、最終文字が句点である
場合は、次の行の文字を1桁分だけ移動する。この動作
を繰り返すことにより、文章のフォーマッティングを行
う。また、上述の文字変換処理において、入力文字が不
鮮明または煩雑で処理できなかった文字については、表
示部5への表示の際、イメージデータのままで表示する
とか、ブリンク表示することにより、訂正をやり易くす
ると良い。上述の実施例によれば、手書き原稿を自動的
に定形文章フォーマット化することが可能になる。なお
、上記実施例は本発明の一例を示したものであり、本発
明はこれに限定されるべきものではないことは言うまで
もない。
Next, the text is formatted. If there is an extra space after the last character on a line, and the last character is not a period, move the next line's characters by the specified number of digits to the previous line. Note that if the last character is a period, the character on the next line is moved by one digit. By repeating this operation, the text is formatted. In addition, in the above-mentioned character conversion process, for characters that cannot be processed because the input characters are unclear or complicated, corrections can be made by displaying them as image data as they are or by displaying them in a blinking manner when displaying them on the display section 5. It's good to make it easier. According to the embodiment described above, it is possible to automatically convert a handwritten manuscript into a fixed form sentence format. It should be noted that the above-mentioned embodiment shows an example of the present invention, and it goes without saying that the present invention should not be limited thereto.

【0010】0010

【発明の効果】以上、詳細に説明した如く、本発明によ
れば、手書き文書からの手書き文章の切出しを効率的に
実行可能とする文字入力処理方法を実現できるという顕
著な効果を奏するものである。
[Effects of the Invention] As described above in detail, the present invention has the remarkable effect of realizing a character input processing method that makes it possible to efficiently extract handwritten sentences from handwritten documents. be.

【0011】[0011]

【図面の簡単な説明】[Brief explanation of the drawing]

【図1】本発明の一実施例である文字入力処理装置の処
理部3による処理のフローチャートである。
FIG. 1 is a flowchart of processing by a processing unit 3 of a character input processing device that is an embodiment of the present invention.

【図2】実施例の文字入力処理装置の構成図である。FIG. 2 is a configuration diagram of a character input processing device according to an embodiment.

【図3】実施例の処理の概念図である。FIG. 3 is a conceptual diagram of processing in the embodiment.

【符号の説明】[Explanation of symbols]

1:OCR、2:キーボード、3:処理部、4:入力バ
ッファ、5:表示部、6:プリンタ、11〜17:処理
ステップ。
1: OCR, 2: Keyboard, 3: Processing unit, 4: Input buffer, 5: Display unit, 6: Printer, 11 to 17: Processing steps.

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】  OCR(光学的文字読取り装置),キ
ーボード,ディスプレイを有し、手書き文章(文字)を
入力する文字入力処理装置において、前記手書き文章(
文字)の入力データから、まず、行先頭位置座標と行範
囲を求めて行位置を決定し、次に各行内の文字先頭位置
と文字範囲を求めて文字位置を決定し、得られたイメー
ジデータについて文字変換を行った後、文字認識を行う
ことを特徴とする文字入力処理方法。
1. A character input processing device for inputting handwritten text (characters), which includes an OCR (optical character reader), a keyboard, and a display, wherein the handwritten text (characters) is
From the input data of (characters), first determine the line position by finding the line start position coordinates and line range, then determine the character position by finding the character start position and character range within each line, and the resulting image data A character input processing method characterized in that character recognition is performed after character conversion is performed.
JP3114309A 1991-05-20 1991-05-20 Character input procedding method Pending JPH04342089A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP3114309A JPH04342089A (en) 1991-05-20 1991-05-20 Character input procedding method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP3114309A JPH04342089A (en) 1991-05-20 1991-05-20 Character input procedding method

Publications (1)

Publication Number Publication Date
JPH04342089A true JPH04342089A (en) 1992-11-27

Family

ID=14634645

Family Applications (1)

Application Number Title Priority Date Filing Date
JP3114309A Pending JPH04342089A (en) 1991-05-20 1991-05-20 Character input procedding method

Country Status (1)

Country Link
JP (1) JPH04342089A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6798546B1 (en) 1999-03-05 2004-09-28 Fujitsu Limited Scanning document reader and memory medium storing scanner driver program
CN114241090A (en) * 2021-12-31 2022-03-25 广州朗国电子科技股份有限公司 OCR-based electronic whiteboard straight line drawing method, system, equipment and medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6798546B1 (en) 1999-03-05 2004-09-28 Fujitsu Limited Scanning document reader and memory medium storing scanner driver program
CN114241090A (en) * 2021-12-31 2022-03-25 广州朗国电子科技股份有限公司 OCR-based electronic whiteboard straight line drawing method, system, equipment and medium

Similar Documents

Publication Publication Date Title
KR100209455B1 (en) Character generating method and apparatus thereof
JPH09106272A (en) Character pattern generating device
JP2000118054A (en) Printer and method for reducing amount of consumption of toner
JP2002203207A (en) Character recognizing method and program, and recording medium
JP3062382B2 (en) Image processing apparatus and method
JPH04342089A (en) Character input procedding method
EP0488128B1 (en) Printing apparatus
JP3034140B2 (en) Character generation method and device
JPH07121734A (en) Character image editing device
JPS62211692A (en) Preparation system for character of varied size
JP2618528B2 (en) Character drawing device, character output correction method, and character output device
JP2977230B2 (en) Character extraction method
JPH06175639A (en) Method and device for character generation
KR920008909B1 (en) Slant printing method for dot metrix printer
JPH0581318A (en) Digital translating device
JPH0812547B2 (en) Pattern reduction / enlargement processing device
JPS62212167A (en) Synthesized character forming system
JP2968617B2 (en) CAD equipment
JPH08123904A (en) Character style classification processing device
JPS6327990A (en) Character recognizing method
JP2898041B2 (en) Character pattern creation device
JP2740506B2 (en) Image recognition method
JPS62211691A (en) Character synthesization system
JPH0916147A (en) Processor and method for character processing
JPH08153092A (en) Document processor