JPS6394384A - System for deciding direction of character row - Google Patents
System for deciding direction of character row
- Publication number
- JPS6394384A
- Authority
- JP
- Japan
- Prior art keywords
- character
- blocks
- block
- decide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Character Input (AREA)
- Character Discrimination (AREA)
Abstract
Description
[Detailed Description of the Invention]
[Industrial Application Field]
The present invention relates to a character line direction determination method for an address reading and sorting machine, which reads addresses and similar information written on mail items directly with an OCR (optical reader) and sorts the mail accordingly.
In general, some printing is already present on a sheet of paper, and the writer orients the sheet according to that printing before writing characters on it.
Printed matter also carries other information, such as page numbers, in addition to the text that needs to be read.
Conventional OCR systems that read lines of text assume that the orientation of the paper matches the orientation of the characters. They therefore detect certain features on the paper to determine its orientation, and infer the character orientation from that. For example, overseas address readers determine the character orientation by detecting the position of the stamp. Japanese postal code reading and sorting machines make no vertical-writing determination, because postal codes are always written horizontally.
Unlike text written in alphabetic scripts, lines of Japanese and similar text may be written either vertically or horizontally, so it is difficult to tell in which direction a given line image was written unless a rule prescribes it. For example, on Japanese mail, the same line image can have three possible orientations, as shown in FIGS. 3(a) to 3(c). Detecting the stamp position alone, as overseas address readers do, is therefore not enough to determine the character orientation.
With current character recognition technology, characters are difficult to recognize unless their orientation is known. OCR that reads character lines must therefore determine the character orientation first, but, as described above, it has been difficult to determine the orientation without reading the characters.
An object of the present invention is to solve this problem and provide a character line direction determination method that makes it easy to determine the orientation of characters.
The present invention is a character line direction determination method for an optical reading device that optically reads printed or handwritten characters, characterized by comprising: means for inputting an image of a paper surface and dividing the image into blocks according to its constituent elements; means for extracting features such as the position, shape, and size of each block; and means for determining the character line orientation based on the block features and the paper shape.
Next, the present invention will be explained in detail with reference to the drawings.
FIG. 1 is a flowchart explaining one embodiment of the present invention, and FIGS. 2(a) and 2(b) are plan views of a mail item (a sealed letter) used in this embodiment and of its block-level image. The sealed letter shown in FIG. 2(a) is represented as a set of blocks, as shown in FIG. 2(b). Block 1 (the stamp) and block 2 (the postal code frame) are detected by the automatic postal code reading and sorting machine. For block 3 (the recipient's address and name) and block 4 (the sender's address and name), the sizes and positions of the blocks show that the larger block 3 is the recipient's address. However, it is not known which of FIGS. 3(a) to 3(c) the character orientation of this recipient address corresponds to.
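The size comparison between blocks 3 and 4 can be illustrated with a short Python sketch. This is not taken from the patent: the `Block` type, its fields, and the use of bounding-box area as the only criterion are assumptions made for illustration (the text also mentions positional relationships, which are omitted here).

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Block:
    """Hypothetical bounding box of one block on the mail image."""
    x: float       # left edge
    y: float       # top edge
    width: float
    height: float

    @property
    def area(self) -> float:
        return self.width * self.height


def split_recipient_sender(a: Block, b: Block) -> Tuple[Block, Block]:
    """Return (recipient, sender), treating the larger block as the recipient."""
    return (a, b) if a.area >= b.area else (b, a)


# Two made-up address blocks: a small one and a large one.
small = Block(x=10, y=150, width=60, height=20)
large = Block(x=30, y=40, width=50, height=90)
recipient, sender = split_recipient_sender(small, large)
```

Note that this step only labels the blocks; it says nothing yet about the direction in which the recipient block was written.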
In this embodiment, an image of the entire paper surface is input in step 10, and all items written on it are divided into blocks (step 11). In step 12, feature information such as the position, shape, and size of every block is extracted. Based on the relationships among these features and on the paper shape, the relationships between blocks are evaluated in step 13, and the character orientation is determined in step 14.
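Steps 10 to 12 can be sketched in Python as follows. The patent does not say how blocks are formed, so the connected-component grouping with 4-connectivity, the function names, and the feature dictionary are all assumptions chosen to keep the example small; steps 13 and 14 (relation evaluation and the orientation decision) are not covered here.

```python
from collections import deque


def extract_blocks(image):
    """Step 11 (sketch): group touching ink pixels into bounding boxes.

    `image` is a list of rows of 0/1 values. Returns a list of
    (left, top, right, bottom) boxes with inclusive coordinates.
    """
    h, w = len(image), len(image[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y0 in range(h):
        for x0 in range(w):
            if not image[y0][x0] or seen[y0][x0]:
                continue
            # Breadth-first search over one connected group of ink pixels.
            seen[y0][x0] = True
            queue = deque([(x0, y0)])
            left = right = x0
            top = bottom = y0
            while queue:
                x, y = queue.popleft()
                left, right = min(left, x), max(right, x)
                top, bottom = min(top, y), max(bottom, y)
                for nx, ny in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
                    if 0 <= nx < w and 0 <= ny < h and image[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((left, top, right, bottom))
    return boxes


def block_features(box):
    """Step 12 (sketch): position, shape, and size of one block."""
    left, top, right, bottom = box
    width, height = right - left + 1, bottom - top + 1
    return {
        "center": ((left + right) / 2, (top + bottom) / 2),  # position
        "aspect": width / height,                            # shape
        "area": width * height,                              # size
    }


# Step 10 stand-in: a tiny hand-made binary image instead of a scanner.
page = [
    [1, 1, 0, 0, 0],
    [0, 0, 0, 0, 1],
    [0, 0, 0, 0, 1],
    [0, 0, 0, 0, 1],
]
boxes = extract_blocks(page)
features = [block_features(b) for b in boxes]
```

A real sorter would merge nearby components into text blocks before extracting features; that merging is left out of this sketch.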
For example, on Japanese mail, the block information can be used to detect the stamp, the postal code frame, the sender's address and name, the recipient's address and name, underlines, enclosing frames, and so on, and the character orientation can be determined from their positional and shape relationships. When an underline and its position are detected, with the stamp at the top, an underline running vertically along the left side corresponds to the orientation of FIG. 3(c), and one along the right edge corresponds to FIG. 3(b).
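The underline rule can be written as a small Python function. This is only a sketch: it assumes the image has already been rotated so that the stamp is at the top, and the function name, the parameters, and the `edge_fraction` threshold are hypothetical values not given in the patent.

```python
from typing import Optional


def orientation_from_underline(
    line_x: float,
    line_width: float,
    line_height: float,
    page_width: float,
    edge_fraction: float = 0.25,  # assumed width of the "side" zones
) -> Optional[str]:
    """Map a detected underline to FIG. 3(b) or 3(c), or None if undecided.

    Coordinates assume the stamp is at the top of the image.
    """
    if line_height <= line_width:
        return None  # the rule only covers vertically running underlines
    center_x = line_x + line_width / 2
    if center_x < edge_fraction * page_width:
        return "fig3c"  # underline along the left side
    if center_x > (1 - edge_fraction) * page_width:
        return "fig3b"  # underline along the right edge
    return None


left_case = orientation_from_underline(5, 1, 80, page_width=100)
right_case = orientation_from_underline(95, 1, 80, page_width=100)
```

Returning `None` for the undecided cases leaves room for the other block relationships to settle the orientation.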
This embodiment can also determine the character orientation from the position and size of sender block 4. When sender block 4 lies at the end of the mail item opposite the stamp and postal code blocks and its width is close to the width of the mail item, recipient block 3 can be judged to be written vertically (corresponding to FIG. 3(a)). A survey of 6,000 mail items found no exceptions.
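The sender-block rule can be expressed as a simple predicate. The Python sketch below assumes the stamp and postal code blocks are at the top of the image, so the "opposite end" is the bottom; the function name and both thresholds (`min_width_ratio`, `far_end_fraction`) are hypothetical, since the patent gives no numeric values.

```python
def recipient_is_vertical(
    sender_top: float,
    sender_height: float,
    sender_width: float,
    page_width: float,
    page_height: float,
    min_width_ratio: float = 0.7,   # assumed meaning of "close to the mail width"
    far_end_fraction: float = 0.5,  # assumed boundary of the "opposite end"
) -> bool:
    """Return True when the recipient block can be judged as vertical writing.

    Assumes the stamp and postal code blocks are at the top of the image.
    """
    center_y = sender_top + sender_height / 2
    at_far_end = center_y > far_end_fraction * page_height
    spans_width = sender_width / page_width >= min_width_ratio
    return at_far_end and spans_width


# A wide sender block near the bottom edge of a 100 x 235 envelope image.
wide_bottom = recipient_is_vertical(200, 20, 90, page_width=100, page_height=235)
# The same position, but a narrow sender block.
narrow_bottom = recipient_is_vertical(200, 20, 40, page_width=100, page_height=235)
```

When the predicate is false, the rule simply does not apply, and the orientation has to come from other cues such as the underline position.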
As described above, according to the present invention, the character orientation is known from the relationships between blocks, which reduces the rejects and misreads caused by reading characters upside down or sideways. Processing is also faster than with a method that reads the characters in all three directions and picks the most plausible result.
Furthermore, because address recognition involves a limited number of distinct characters and words, using this information is expected to improve recognition performance, and once the character orientation is known, it is also possible to determine which end of a character line is its beginning.
As explained above, the present invention improves character recognition performance by determining the character orientation automatically, and it makes it easy to determine the orientation of the paper and of the characters, which has been difficult to achieve with overseas address readers and the like.
FIG. 1 is a flowchart showing the operating procedure of one embodiment of the present invention; FIGS. 2(a) and 2(b) are front views showing an example of a mail item and its block image; and FIGS. 3(a) to 3(c) are front views of mail items showing each orientation of characters written on Japanese mail.
1: stamp block; 2: postal code block; 3: recipient address block; 4: sender address block; 11 to 14: operation steps.
Claims (1)
A character line direction determination method for an optical reading device that optically reads printed or handwritten characters, characterized by comprising: means for inputting an image of a paper surface and dividing the image into blocks according to its constituent elements; means for extracting features such as the position, shape, and size of each block; and means for determining the orientation of character lines based on the features of the blocks and the paper shape.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP61240621A JPS6394384A (en) | 1986-10-08 | 1986-10-08 | System for deciding direction of character row |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP61240621A JPS6394384A (en) | 1986-10-08 | 1986-10-08 | System for deciding direction of character row |
Publications (1)
Publication Number | Publication Date |
---|---|
JPS6394384A true JPS6394384A (en) | 1988-04-25 |
Family ID: 17062216
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP61240621A Pending JPS6394384A (en) | 1986-10-08 | 1986-10-08 | System for deciding direction of character row |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPS6394384A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH08202814A (en) * | 1995-01-26 | 1996-08-09 | Nec Corp | Mail address reader and its method |
US5610995A (en) * | 1995-06-06 | 1997-03-11 | United Parcel Service Of America, Inc. | Method and apparatus for compressing images containing optical symbols |
US6470091B2 (en) * | 1998-02-10 | 2002-10-22 | Hitachi, Ltd. | Address reader, sorting machine such as a mail thing and character string recognition method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Aradhye | A generic method for determining up/down orientation of text in roman and non-roman scripts | |
JP2575539B2 (en) | How to locate and identify money fields on documents | |
US6782144B2 (en) | Document scanner, system and method | |
JPH11226513A (en) | Mail address reader and mail address classifier | |
JP2006263512A (en) | Address recognition device | |
US20010022854A1 (en) | Recognition apparatus and recognition method | |
JPS6394384A (en) | System for deciding direction of character row | |
JP3268552B2 (en) | Area extraction method, destination area extraction method, destination area extraction apparatus, and image processing apparatus | |
JP3162552B2 (en) | Mail address recognition device and address recognition method | |
JP2000210624A (en) | Mail address recognition device | |
KR100240627B1 (en) | Structure of recognition code and its recognition method and code controller structure | |
JP3105918B2 (en) | Character recognition device and character recognition method | |
JP2977219B2 (en) | Mail address reading device | |
JP3015448B2 (en) | Address area detection device | |
JP3160347B2 (en) | Mail address reading device | |
JP3724906B2 (en) | Character direction determination device | |
JPH07117984B2 (en) | Optical character reader | |
JPH08155397A (en) | Postal matter classifying device and bar code printer | |
US8428303B2 (en) | Postal indicium detection method and postal indicium detection apparatus | |
JP2616995B2 (en) | Character recognition device | |
JPH11238095A (en) | Mail address reader | |
JPH11207265A (en) | Information processing device and mail processing device | |
JPS5942354B2 (en) | Delivery classification method | |
JP3476936B2 (en) | Character line direction determination device | |
JPH0793474A (en) | Character area detector |