JPS62231389A - Character recognizing device - Google Patents

Character recognizing device

Info

Publication number
JPS62231389A
JPS62231389A JP61074572A JP7457286A JPS62231389A JP S62231389 A JPS62231389 A JP S62231389A JP 61074572 A JP61074572 A JP 61074572A JP 7457286 A JP7457286 A JP 7457286A JP S62231389 A JPS62231389 A JP S62231389A
Authority
JP
Japan
Prior art keywords
character
width
pattern
information
code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP61074572A
Other languages
Japanese (ja)
Inventor
Hiroshi Matsumura
松村 博
Shuichi Toyoda
豊田 修一
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sanyo Electric Co Ltd
Original Assignee
Sanyo Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sanyo Electric Co Ltd filed Critical Sanyo Electric Co Ltd
Priority to JP61074572A priority Critical patent/JPS62231389A/en
Publication of JPS62231389A publication Critical patent/JPS62231389A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Input (AREA)
  • Character Discrimination (AREA)

Abstract

PURPOSE:To obtain a character code on which a bit of en information is added, and to simplify the operation of an operator, by providing an en identifying part which outputs the bit of en information representing that an input character is an en character. CONSTITUTION:A character measuring part 3 scans an input form 1, and converts it to an image pattern, then stores it on an image memory 4. A character segmenting part 5 segments an image pattern 6 of MXN corresponding to one character frame, from the memory 4, and segments a character pattern 7 by a circumscription detecting process. Next, it is detected whether the pattern 7 exists only at the left half or the right half of the pattern 6 by an en identifying part 8, based on X1 and X2 of an X-axis, and the en information, or a bit of en information is outputted. A code output part 14 converts an inputted character number to the character code using a code memory 15, and adds the en information, or the en information obtained at the identifying part 8, then stores it on a candidate memory 16, and also, outputs it to a control part 17.

Description

【発明の詳細な説明】 (り産業上の利用分野 本発明は、1文字に対応する文字記入用の文字枠が予め
複数記録きれた用紙を入力用紙として用いる文字認識装
置に係り、特に、半角文字の処理に関する。
DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a character recognition device that uses, as an input sheet, a sheet on which a plurality of character frames for writing characters corresponding to one character have been recorded in advance. Concerning character processing.

(口〉従来の技術 特願昭59−195316号に示されているように、オ
フライン方式の文字認識装置においては、通常、1文字
に対応する文字記入用の文字枠が予め複数記録された用
紙が入力用紙として使用され、認識部では、文字枠に記
入された入力文字の特徴パターンが先ず抽出きれ、この
パターンと辞書の標準特徴パターンとが照合され、類似
度の大きい候補文字の文字コードが得られる。
(Example) As shown in the prior art Japanese Patent Application No. 195316/1983, in an offline character recognition device, a sheet of paper with a plurality of character frames for writing characters corresponding to one character recorded in advance is usually used. is used as an input form, and the recognition unit first extracts the feature pattern of the input character written in the character box, and then compares this pattern with the standard feature pattern in the dictionary to determine the character code of the candidate character with a high degree of similarity. can get.

(ハ)発明が解決しようとする問題点 従来の技術においては、認識結果として候補文字の文字
コードが得られるだけであったので、文字認識装置をワ
ードプロセッサに接続して文書を作成する場合、半角文
字を入力しようとすれば、その文字を入力用紙に記入す
るだけでは実現できず、認識結果が得られた後、オペレ
ータのキー操作により、候補文字の文字コードに半角属
性を付加しなければならなかった。
(c) Problems to be solved by the invention In the conventional technology, only the character code of the candidate character was obtained as a recognition result, so when creating a document by connecting a character recognition device to a word processor, half-width When trying to input a character, it cannot be achieved by simply writing the character on an input form; after the recognition result is obtained, the operator must use keys to add half-width attributes to the character code of the candidate character. There wasn't.

(ニ)問題点を解決するだめの手段 本発明は、入力文字が文字枠の左又は右の特定範囲内に
のみ記入されていることを検出して、入力文字が半角文
字であることを示す半角情報を出力する半角識別部を設
けたものである。
(d) Means to solve the problem The present invention detects that input characters are written only within a specific range on the left or right side of a character frame, and indicates that the input characters are half-width characters. This is provided with a half-width recognition section that outputs half-width information.

(ホ)作用 本発明では、入力用紙の文字枠において、例えば、左半
分又は右半分というような特定範囲内に、文字を記入す
るだけで、半角情報が付加された文字コードが得られる
(E) Function According to the present invention, a character code to which half-width information is added can be obtained by simply writing characters within a specific range, such as the left half or the right half, of the character frame of the input form.

(へ)実施例 第1図は本発明の実施例を示すブロック図、第2図は入
力用紙の具体例を示す図であり、入力用紙(1)には、
文字記入用として1文字毎に所定の大きさの方形文字枠
(2)がドロップアウトカラーで印刷きれている。
(F) Embodiment FIG. 1 is a block diagram showing an embodiment of the present invention, and FIG. 2 is a diagram showing a specific example of input paper.
A rectangular character frame (2) of a predetermined size is printed in a dropout color for each character for writing characters.

文字観測部(3)は、入力用紙(1〉を走査し、COD
イメージセンサ−等の光電交換素子を用いて、少なくと
も1行分の文字行を2値のいずれかをとる画像パターン
に変換し、画像メモリ(4)に記憶する。文字切出し部
(5)は、先ず、第3図に示すように、画像メモリ(4
)から1つの文字枠(2)に対応するMXNドツトの実
線で囲まれた画像パターン(6)を切出し、次に、外接
枠検出処理により、波線で囲まれた実際の文字パターン
(7)を切出す、この際、MXNドツトの画像パターン
において、各辺より中心方向に触手をのばし、最初に文
字部分に出合ったところでその辺の外接枠を検出してお
り、従って、文字切出し時には、各外接枠の座標xr 
The character observation unit (3) scans the input paper (1) and reads the COD
Using a photoelectric exchange element such as an image sensor, at least one line of characters is converted into an image pattern having one of two values, and the pattern is stored in an image memory (4). As shown in FIG.
), cut out an image pattern (6) of MXN dots surrounded by solid lines corresponding to one character frame (2), and then use circumscribing frame detection processing to extract the actual character pattern (7) surrounded by wavy lines. At this time, in the image pattern of MXN dots, the tentacles extend toward the center from each side, and when the character part is first encountered, the circumscribed frame of that side is detected. Therefore, when cutting out the character, each circumscribed frame is detected. frame coordinates xr
.

x2及びyI+72が検出される。x2 and yI+72 are detected.

そこで、本発明では、半角識別部(8)で、この工座標
χ1及び工、に基づき、文字パターン(7)が画像パタ
ーン(6)の左半分あるいは右半分のみに存在するか否
かを検出している。つまり、工、≦M/2ならば、右半
分は空白であり、文字パターンは左半分にしか存在せず
、工、≧M/2ならば、左半分は空白であり、文字パタ
ーンは右半分にしか存在しないはずであるので、文字パ
ターンが右半分又は左半分にしか存在しないときに半角
情報を出力し、文字パターンが左半分及び右半分にも存
在するときは全角情報を出力するようにしている。
Therefore, in the present invention, the half-width recognition unit (8) detects whether the character pattern (7) exists only in the left half or the right half of the image pattern (6) based on the work coordinates χ1 and work. are doing. In other words, if ≦M/2, the right half is blank and the character pattern exists only in the left half, and if ≧M/2, the left half is blank and the character pattern is in the right half. Therefore, if the character pattern exists only in the right or left half, output half-width information, and if the character pattern exists in both the left and right halves, output full-width information. ing.

ところで、切出された文字パターン(7)は、認識部(
9)の正規化部(10)で大きさ及び位置に関する正規
化が施され、特徴抽出部(11)でその特徴パターンが
抽出される0次に照合部(12)で、辞書(13)に格
納されている全角及び半角に共通な標準特徴パターンと
照合され、最も類似度の大きい候補文字の文字番号が出
力される。
By the way, the extracted character pattern (7) is recognized by the recognition unit (
The normalization unit (10) in 9) normalizes the size and position, and the feature extraction unit (11) extracts the feature pattern. It is compared with the stored standard characteristic pattern common to full-width and half-width characters, and the character number of the candidate character with the highest degree of similarity is output.

コード出力部(14)は、コードメモリ(15〉ヲ用い
て入力詐れた文字番号を文字フードに変換し、この文字
コードに半角識別部(8)で得られる半角情報又は全角
情報を付加して、候補メモIJ(16)に記憶すると共
に制御部(17)へ送出する。
The code output unit (14) uses the code memory (15>) to convert the incorrectly input character number into a character code, and adds half-width or full-width information obtained by the half-width identification unit (8) to this character code. Then, it is stored in the candidate memo IJ (16) and sent to the control unit (17).

(18)は、制御部(17) 、テキスト(19) 、
キャラクタジェネレータ(20) 、表示部(21)を
備えたワードプロセッサ等の文書作成装置であり、コー
ド出力部(14)からの情報はテキスト(19)に記憶
される。
(18) is a control part (17), a text (19),
It is a document creation device such as a word processor that is equipped with a character generator (20) and a display section (21), and information from a code output section (14) is stored in text (19).

そして、テキスト(19)に記憶きれた情報が全角情報
と文字コードの場合は、制御部(17)はその文字コー
ドに対応する全角ドツトパターンをキャラクタジェネレ
ータ(20)から読出して、そのまま表示部(21)へ
送出し、候補文字の全角表示を行ない、半角情報と文字
コードの場合は、制御部(17)は、その文字コードに
対応する全角ドツトパターンを読出し、これを半角のド
ツトパターンに変換して表示部(21)へ送出し、候補
文字の半角表示を行なう。
If the information stored in the text (19) is full-width information and a character code, the control section (17) reads out the full-width dot pattern corresponding to the character code from the character generator (20) and directly displays it on the display section ( 21) and displays the candidate character in full-width. In the case of half-width information and a character code, the control unit (17) reads out the full-width dot pattern corresponding to the character code and converts it into a half-width dot pattern. The candidate characters are then sent to the display unit (21) to display half-width candidate characters.

尚、半角文字に対して全角文字と異なる文字コードを割
り当てるときは、コード出力部(14)で文字番号と半
角情報により半角文字コードを作成し、作成したコード
を候補メモリ(16)に記憶しても良く、この場合、予
めキャラクタジェネレータ(20〉に半角文字コードに
対応する半角ドツトパターンを記憶しておけば、直接半
角ドツトパターンを読出すことにより半角表示が可能と
なる。
When assigning a character code different from a full-width character to a half-width character, a half-width character code is created using the character number and half-width information in the code output section (14), and the created code is stored in the candidate memory (16). In this case, if half-width dot patterns corresponding to half-width character codes are stored in advance in the character generator (20), half-width display can be achieved by directly reading out the half-width dot patterns.

ところで、一般に、半角文字は全ての文字に存在するわ
けではなく、例えば、漢字等では半角文字は用いられな
い。
By the way, half-width characters generally do not exist in all characters; for example, half-width characters are not used in kanji and the like.

そこで、コード出力部(14)において、認識結果とし
ての文字番号が、半角となり得る文字か否かを判定し、
なり得ない文字のときは、半角情報を無視するようにす
れば、より一層正確な半角属性の付加が可能となる。具
体的には、半角となり得る文字を登録したテーブルを設
けて、認識結果をこのテーブルと照合するか、もしくは
、文字番号が字種毎に所定の範囲に固めて割り付けであ
ることを利用して、認識結果としての文字番号が所定の
範囲内にあるか否かを検出すれば良い。
Therefore, the code output unit (14) determines whether the character number as a recognition result is a character that can be half-width.
If the half-width information is ignored for characters that cannot be used, even more accurate half-width attributes can be added. Specifically, you can set up a table that registers characters that can be half-width and check the recognition results against this table, or you can use the fact that character numbers are assigned within a predetermined range for each character type. , it is sufficient to detect whether the character number as a recognition result is within a predetermined range.

入力用紙の全ての文字枠(2)に、第2図(22)(2
3)で示すように、特定範囲を示す波線や点をドロップ
アウトカラーで印刷しておけば、半角文字の記入が楽に
なる。
All character frames (2) on the input form are marked with (22) (2) in Figure 2.
As shown in 3), if you print wavy lines or dots indicating a specific range in dropout color, it will be easier to write in half-width characters.

(ト)発明の効果 本発明によれば、入力用紙の文字枠に文字を記入する際
、左又は右の特定範囲内に記入するだけで半角属性を付
加した文字コードが得られるので、オペレータの操作が
簡略化され、非常に便利になる。
(g) Effects of the Invention According to the present invention, when entering characters in the character frame of an input form, a character code with a half-width attribute added can be obtained by simply writing within a specific range on the left or right. Operation is simplified and extremely convenient.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の実施例を示すブロック図、第2図は入
力用紙の具体例を示す説明図、第3図は文字パターンの
切出しを説明するための説明図である。 (1)・・・入力用紙、 (8)・・・半角識別部、 
(9)・・・認識部、 (16)・・・候補メモリ、 
(19)・・・テキスト、 (21)・・・表示部。 7一 第2図 第3図 ゝ−−−\−−−ノ′
FIG. 1 is a block diagram showing an embodiment of the present invention, FIG. 2 is an explanatory diagram showing a specific example of an input sheet, and FIG. 3 is an explanatory diagram for explaining cutting out of a character pattern. (1)...Input paper, (8)...Half-width identification section,
(9)... Recognition unit, (16)... Candidate memory,
(19)...Text, (21)...Display section. 71 Figure 2 Figure 3 ゝ---\---ノ'

Claims (1)

【特許請求の範囲】[Claims] (1)1文字に対応する文字記入用の文字枠が予め複数
記録された用紙を入力用紙として用いる文字認識装置に
おいて、入力文字が前記文字枠の左又は右の特定範囲内
にのみ記入されていることを検出して、前記入力文字が
半角文字であることを示す半角情報を出力する半角識別
部を設けたことを特徴とする文字認識装置。
(1) In a character recognition device that uses a sheet of paper on which multiple character frames for character entry corresponding to one character are recorded in advance as an input sheet, the input character is written only within a specific range to the left or right of the character frame. 1. A character recognition device comprising: a half-width identification unit that detects that the input character is a half-width character and outputs half-width information indicating that the input character is a half-width character.
JP61074572A 1986-04-01 1986-04-01 Character recognizing device Pending JPS62231389A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP61074572A JPS62231389A (en) 1986-04-01 1986-04-01 Character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP61074572A JPS62231389A (en) 1986-04-01 1986-04-01 Character recognizing device

Publications (1)

Publication Number Publication Date
JPS62231389A true JPS62231389A (en) 1987-10-09

Family

ID=13551046

Family Applications (1)

Application Number Title Priority Date Filing Date
JP61074572A Pending JPS62231389A (en) 1986-04-01 1986-04-01 Character recognizing device

Country Status (1)

Country Link
JP (1) JPS62231389A (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS58100108U (en) * 1981-12-28 1983-07-07 富士重工業株式会社 Ventilation structure when automobile door closes

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS58100108U (en) * 1981-12-28 1983-07-07 富士重工業株式会社 Ventilation structure when automobile door closes

Similar Documents

Publication Publication Date Title
JP2713622B2 (en) Tabular document reader
US20020141660A1 (en) Document scanner, system and method
EP3477547B1 (en) Optical character recognition systems and methods
JPS6285394A (en) Character recognizing device
JPS62231389A (en) Character recognizing device
JPS6316795B2 (en)
JPH0388085A (en) Optical character reader
JPH0728935A (en) Document image processor
JP2877380B2 (en) Optical character reader
JPS6227887A (en) Character type separating system
JPH06111057A (en) Optical character reader
JP2978801B2 (en) Character input method for handwritten character recognition
JP2925270B2 (en) Character reader
JP2823350B2 (en) Multimedia input device
JPS62295192A (en) Optical character image reader
JP2784004B2 (en) Character recognition device
JPS62217389A (en) Handwritten character reader
JPS60201486A (en) Handwritten document reading method
JPH0576666B2 (en)
JPS62295191A (en) Optical character image reader
KR20050045291A (en) Data processing of text by selective scanning and color comparison
JPS6182275A (en) Automatic translating device
JPH01201789A (en) Character reader
JPH03103996A (en) Optical character reader
JPS5851390A (en) Font character recognizing device