JPS588382A - Character reader - Google Patents

Character reader

Info

Publication number
JPS588382A
Authority
JP
Japan
Prior art keywords
reading
character
mode
stored
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP56105399A
Other languages
Japanese (ja)
Inventor
Masaru Tokita
土岐田 勝
Hideki Sakata
英樹 坂田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1981-07-06
Filing date
1981-07-06
Publication date
1983-01-18
Application filed by Fujitsu Ltd
Priority to JP56105399A
Publication of JPS588382A
Current legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00: Arrangements for image or video recognition or understanding
    • G06V10/98: Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)

Abstract

PURPOSE: To make the reading history of a file traceable and to make misreadings easy to investigate, by providing first and second reading modes: in one mode all reading information is stored in a log file, and in the other mode only the reading information corresponding to unreadable characters is stored. CONSTITUTION: A recording medium 1 is scanned by a reading section 2, and the resulting scan pattern is stored in a reading information buffer 9. When one line of the pattern is complete, a recognition section 3 operates, compares the pattern with reference patterns stored in advance, and stores the code of each matching character pattern in a read character code buffer 4; when a character is illegible, a code indicating an unreadable character is stored instead. Next, an unreadable character processing section 5 shows the illegible character on a display 6. Depending on the reading mode set on the mode selector 10, all reading information is stored in the log file (log floppy 12) in the first mode, and only the reading information for unreadable characters is stored there in the second mode.

Description

DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a character reader, and in particular to a character reader provided with a logging file in which image information of the read characters is stored.

In recent years, character readers have been developed and widely used as devices for inputting characters written on cards and forms into computers. By scanning the characters on a form, reading information is created as a pattern of "0"s and "1"s, and the character recognition section compares this reading information with character information stored in advance to detect the corresponding character codes. The unit of processing is normally one line of the form. If the line contains a character that cannot be read, that is, cannot be recognized, its reading information (the image information of the character) is shown on a display connected to the character reader, and an operator judges it by eye and enters the character from a keyboard or similar device. Once processing of one line is finished, the stored character information, which is the image information of the characters, is discarded, and the character information of the next line is stored. The recognized character codes are, of course, output as the reading result.

However, in a character reader of this kind, once the form data has been read, it is not possible to investigate afterwards why a character could not be read (an unreadable character) or was read incorrectly (a misread character). Even characters written by the same person may be unreadable on some days. Moreover, a character whose shape appears further from the standard character may be read reliably, while one that appears closer to the standard character is rejected or misread, and the cause cannot be pursued. Furthermore, it has not been possible to compile statistics on which kinds of characters tend to be unreadable or misread.

The object of the present invention is to eliminate the above drawbacks. This object is achieved by a character reader that reads characters on a recording medium and recognizes the characters based on the reading information, in which at least two reading modes are provided: in the first mode, all of the reading information is stored in a log file; in the second mode, only the reading information corresponding to characters that could not be recognized is stored in the log file; and the mode can be selected arbitrarily. The invention is described in detail below with reference to the drawings.

FIG. 1 shows an embodiment of the present invention, and FIG. 2 shows a screen display. In the figures, 1 is a form, 2 is a reading section, 3 is a recognition section, 4 is a read character code buffer that holds the recognition results of the recognition section 3, 5 is an unreadable character processing section, 6 is a display, 7 is a data output section, 8 is an output floppy to which the read character codes are output, 9 is a reading information buffer that stores the reading information output by the scanner that read the characters, 10 is a mode select section, and 11 is a log output section. When the mode select section 10 indicates mode 1, the log output section 11 outputs the entire contents of the reading information buffer 9 to a log floppy 12; when it indicates mode 2 and unreadable characters are present, it outputs the reading information for those characters to the log floppy 12.

The operation is described next. The form 1 is scanned by a scanner provided in the reading section 2, and the scan result for, say, one line is stored in the reading information buffer 9 as a pattern of "0"s and "1"s. At the end of the line, the recognition section 3 operates, compares the pattern sequentially with reference character patterns stored in advance, and stores the character code corresponding to each matching pattern in the read character code buffer 4.

If an unreadable character is encountered at this point, a code indicating an unreadable character is stored instead.
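The recognition step can be illustrated with a minimal sketch, assuming each character arrives as a small grid of "0"/"1" pixels and that matching means comparing it against every stored reference pattern. The names `recognize_line`, `REJECT`, and the `max_distance` tolerance are hypothetical and do not come from the patent; with the default `max_distance=0` only exact matches are accepted, which is the closest reading of the text.

```python
from typing import Dict, List, Tuple

# One character image: a tuple of rows, each row a string of "0"/"1" pixels.
Pattern = Tuple[str, ...]

REJECT = "?"  # hypothetical code that marks an unreadable character


def hamming(a: Pattern, b: Pattern) -> int:
    """Count differing pixels between two equally sized binary patterns."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def recognize_line(patterns: List[Pattern],
                   references: Dict[str, Pattern],
                   max_distance: int = 0) -> List[str]:
    """Return one code per pattern, or REJECT when no reference matches.

    max_distance is an assumed tolerance; 0 means exact match only.
    """
    codes = []
    for pattern in patterns:
        best_code, best_dist = REJECT, max_distance + 1
        # Compare sequentially against every stored reference pattern.
        for code, ref in references.items():
            dist = hamming(pattern, ref)
            if dist < best_dist:
                best_code, best_dist = code, dist
        codes.append(best_code)
    return codes
```

The returned list plays the role of the read character code buffer 4: recognized positions hold a character code and unrecognized positions hold the reject code.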

When the recognition section 3 has finished processing the entire contents of the reading information buffer 9, the unreadable character processing section 5 operates. It retrieves from the reading information buffer 9 the reading information corresponding to each unreadable-character code in the read character code buffer 4 and shows it on the display 6. In the embodiment, when an unreadable character exists, the contents of the read character code buffer are shown on the display 6, and the reading information corresponding to the character indicated by the cursor 21 is shown in a partial area 22 of the display screen (displayed character pattern 23). When a character is then specified from the keyboard or a similar device, the code corresponding to that character is stored in the read character code buffer 4. The contents of the read character code buffer 4 are then stored on the output floppy 8 by the data output section 7.
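The operator correction step might be sketched as below. This is an illustration under assumptions, not the patent's implementation: `resolve_rejects`, the `ask_operator` callback (standing in for the display and keyboard), and the `"?"` reject code are all hypothetical names. The sketch also records which positions were rejected before they are overwritten, an assumed detail that lets later logging still identify them.

```python
from typing import Callable, List, Sequence, Tuple


def resolve_rejects(codes: Sequence[str],
                    images: Sequence[object],
                    ask_operator: Callable[[int, object], str],
                    reject: str = "?") -> Tuple[List[str], List[bool]]:
    """Replace reject codes with operator-entered codes.

    Returns the corrected code list and a parallel list of flags that
    remember which positions were originally unreadable.
    """
    if len(codes) != len(images):
        raise ValueError("codes and images must have the same length")
    resolved = list(codes)
    rejected = [code == reject for code in codes]
    for index, was_rejected in enumerate(rejected):
        if was_rejected:
            # Show the character image and take the operator's answer.
            resolved[index] = ask_operator(index, images[index])
    return resolved, rejected
```

In a real device the callback would block on keyboard input; here any function that maps a position and image to a character code will do.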

The log output section 11 operates in synchronization with the data output section 7. If the mode select section 10 indicates mode 1, it outputs the contents of the reading information buffer 9 to the log floppy 12. If the mode select section 10 indicates mode 2, it refers to the read character code buffer 4 and outputs only the reading information for unreadable characters to the log floppy 12.
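The two logging modes can be summarized in a short sketch. It assumes the character images of one line and a parallel list of "was unreadable" flags are available; `Mode`, `select_log_records`, and the record layout (position, image) are hypothetical choices made for illustration only.

```python
from enum import Enum
from typing import List, Sequence, Tuple


class Mode(Enum):
    LOG_ALL = 1       # mode 1: log every character image
    LOG_REJECTS = 2   # mode 2: log only images of unreadable characters


def select_log_records(mode: Mode,
                       images: Sequence[object],
                       rejected: Sequence[bool]) -> List[Tuple[int, object]]:
    """Return the (position, image) records to write to the log medium."""
    if len(images) != len(rejected):
        raise ValueError("images and rejected must have the same length")
    if mode is Mode.LOG_ALL:
        return list(enumerate(images))
    # Mode 2: keep only positions flagged as unreadable.
    return [(i, image)
            for i, (image, was_rejected) in enumerate(zip(images, rejected))
            if was_rejected]
```

Keeping the selection separate from the actual write means the same function can feed a floppy, a disk, or a tape writer.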

Therefore, according to the present invention, setting the mode select section 10 to mode 1 causes all reading information, that is, the images of all read characters, to be stored on the log floppy. Even if a misread character is discovered after a computer has processed the reading results of the character reader, the character image corresponding to the misread character remains on the log floppy, so the cause of the misreading can easily be investigated. Setting the mode select section 10 to mode 2 causes only the unreadable characters to be stored on the log floppy 12, so analyzing its contents makes it easy to learn which characters tend to be unreadable.
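As one possible way to analyze a mode 2 log, the sketch below tallies how often each character was rejected. It assumes each log record pairs the image with the character the operator eventually keyed in; the patent does not say the log holds that character, so this record layout and the name `reject_tendency` are purely hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def reject_tendency(records: Iterable[Tuple[str, object]]) -> List[Tuple[str, int]]:
    """Count rejects per operator-entered character, most frequent first.

    Each record is assumed to be (entered_character, image).
    """
    counts = Counter(character for character, _image in records)
    return counts.most_common()
```

A result such as a high count for one digit would point to the reference pattern or writing style worth examining first.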

In the example described above, a log floppy is used as the medium to which the reading information is output, but a magnetic disk device or magnetic tape device may of course be used instead. Furthermore, the medium mounted in such a device need not be replaced each time; the same medium may be used cyclically.
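Cyclic use of a single medium behaves like a fixed-capacity ring buffer in which the oldest records are overwritten once the medium is full. The following is a minimal in-memory sketch of that idea; `CyclicLog` and its capacity measured in records are assumptions made for illustration, not details from the patent.

```python
from collections import deque
from typing import Deque, List


class CyclicLog:
    """Fixed-capacity log that overwrites its oldest record when full."""

    def __init__(self, capacity: int) -> None:
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        # deque with maxlen drops the oldest item automatically on append.
        self._records: Deque[bytes] = deque(maxlen=capacity)

    def append(self, record: bytes) -> None:
        self._records.append(record)

    def records(self) -> List[bytes]:
        """Return the surviving records, oldest first."""
        return list(self._records)
```

With a capacity of three records, writing five leaves only the last three available for later investigation.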

Of course, the output floppy need not be a separate device; it may be the same device as the medium to which the reading information is output.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows an embodiment of the present invention, and FIG. 2 shows the display screen of the display. In the figures, 1 is a recording medium, 2 is a reading section, 3 is a recognition section, 5 is an unreadable character processing section, 6 is a display, 8 is a data output floppy, 9 is a reading information buffer, 10 is a mode select section, and 12 is a log floppy.

Claims (1)

A character reader that reads characters on a recording medium and recognizes the characters based on the resulting reading information, characterized in that at least two reading modes are provided; in the first mode, all of the reading information is stored in a log file; in the second mode, only the reading information corresponding to characters that could not be recognized as a result of the recognition is stored in the log file; and the mode can be selected arbitrarily.

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP56105399A JPS588382A (en) 1981-07-06 1981-07-06 Character reader

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP56105399A JPS588382A (en) 1981-07-06 1981-07-06 Character reader

Publications (1)

Publication Number Publication Date
JPS588382A true JPS588382A (en) 1983-01-18

Family

ID=14406548

Family Applications (1)

Application Number Title Priority Date Filing Date
JP56105399A Pending JPS588382A (en) 1981-07-06 1981-07-06 Character reader

Country Status (1)

Country Link
JP (1) JPS588382A (en)
