JPS604373A - Pattern information compressing system - Google Patents
Pattern information compressing systemInfo
- Publication number
- JPS604373A JPS604373A JP58110922A JP11092283A JPS604373A JP S604373 A JPS604373 A JP S604373A JP 58110922 A JP58110922 A JP 58110922A JP 11092283 A JP11092283 A JP 11092283A JP S604373 A JPS604373 A JP S604373A
- Authority
- JP
- Japan
- Prior art keywords
- character
- information
- code
- pattern
- section
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/41—Bandwidth or redundancy reduction
- H04N1/411—Bandwidth or redundancy reduction for the transmission or storage or reproduction of two-tone pictures, e.g. black and white pictures
- H04N1/4115—Bandwidth or redundancy reduction for the transmission or storage or reproduction of two-tone pictures, e.g. black and white pictures involving the recognition of specific patterns, e.g. by symbol matching
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Document Processing Apparatus (AREA)
Abstract
Description
【発明の詳細な説明】
〔発明の利用分野〕
本発明はパターン圧縮方式に係り、特にワードプロセッ
サによって作成された文書の情報圧縮に好適なパターン
圧縮方式に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Application of the Invention] The present invention relates to a pattern compression method, and more particularly to a pattern compression method suitable for compressing information on documents created by a word processor.
従来パターンを圧縮する方法として知られている方法の
一つにランレングス符号化法が有る。この方法は二次元
パターンを一次元で順次走査し、パターンの連続する長
さを符号化するもので比較的簡単なことから広く用いら
れている。しかし消点があった。One of the conventional methods for compressing patterns is run-length encoding. This method sequentially scans a two-dimensional pattern in one dimension and encodes the continuous length of the pattern, and is widely used because it is relatively simple. But there was a vanishing point.
本発明の目的は上述した欠点を除去するためになされた
もので、圧縮率を■程度迄高めて二次元パターンを圧縮
することの出来るパターン圧縮方式を提供することに有
る。The purpose of the present invention was to eliminate the above-mentioned drawbacks, and it is an object of the present invention to provide a pattern compression method capable of compressing a two-dimensional pattern by increasing the compression rate to approximately 2.
この目的を達成するため本発明においては、二次元パタ
ーンの認識結果と位置情報をコード化して情報圧縮をお
こなう点に特徴がある。To achieve this objective, the present invention is characterized in that the two-dimensional pattern recognition results and position information are encoded to perform information compression.
以下、本発明の一実施例を第1図によシ説明する。 An embodiment of the present invention will be explained below with reference to FIG.
第1図はワードプロセッサ等で作成された文書の一例で
ある。このような文書に対して本発明では以下の手順で
情報圧縮を行なう。すなわち文字を切シ出しこれ全認識
し、コード化する。さらに切シ出した文字の位置情報や
、文字の大きさ、等の情報を認識結果に付加して一文字
の圧縮情報とする。これを文書全域に亘って行なう。FIG. 1 is an example of a document created using a word processor or the like. The present invention performs information compression on such documents using the following procedure. In other words, characters are cut out, all of them are recognized, and then coded. Furthermore, information such as the position information of the cut characters and the size of the characters is added to the recognition result to create compressed information for one character. Do this over the entire document.
第2図は本発明の一実施例である。第2図において文書
1の情報はスキャナ2によって電気信号に変換する。切
り出し部3は文字部分全検出し文字パターンを認識部4
に送′るとともに、切り出した文字の位置、大きさの検
出を行なう。認識部4の認識結果コードと、切り出し部
3で検出した二次元平面に対する文字の情報(位置、大
きさ等)全コード化部5でコード化して一文字分のコー
ド情報を得、こ扛を例えば記憶装置6に格納する。FIG. 2 shows an embodiment of the present invention. In FIG. 2, information on a document 1 is converted into an electrical signal by a scanner 2. The extraction unit 3 detects all character parts and recognizes the character pattern by the recognition unit 4.
At the same time, the position and size of the cut out characters are detected. The recognition result code of the recognition unit 4 and character information (position, size, etc.) on the two-dimensional plane detected by the cutting unit 3 are all encoded in the encoding unit 5 to obtain code information for one character, and this code is converted into, for example, It is stored in the storage device 6.
このように文字コードと二次元平面の情報を持つことに
より、二次元パターンを浄書して再生することが出来る
。すなわち文字コードと大きさの情報全文字パターン発
生器によシ文字パターンに復元し、文字の位置情報から
二次元平面上の位置を決めることによってもとの文vを
再生することが出来る。By having the character code and information on the two-dimensional plane in this way, it is possible to print and reproduce the two-dimensional pattern. That is, the original sentence v can be reproduced by restoring the character code and size information to a character pattern using a character pattern generator, and determining the position on the two-dimensional plane from the character position information.
次に具体的な処理について第1図を例に説明する。いま
文書11上に書かれている文字列「文書1」において文
字「文」を切シ出し位置情報(例えば重心の位置)xl
、ytと大きさ情報(例えは倍9文字)Mが得られたと
するとコード化部にはxIylMが送られる。また4か
らは1文」のコードCBが5に送らfl、5の出力には
例えば第3図のようなコード出力が得られる。Next, specific processing will be explained using FIG. 1 as an example. Cut out the character "sentence" in the character string "document 1" currently written on document 11 and find the position information (for example, the position of the center of gravity) xl
, yt and size information (for example, double 9 characters) M are obtained, xIylM is sent to the encoding unit. Further, from 4, the code CB of "1 sentence" is sent to 5, fl, and the output of 5 produces a code output as shown in FIG. 3, for example.
文字の位置情報は例えば重心等を用いると文字の徨類に
よって重心の位置が異なるのでCのままでは再生した文
字列が変動してしまう。したがってxt、ytに対し文
字ピッチ、行ピッチで正規化して再生する必要が有る。If the position information of the character is, for example, the center of gravity, the position of the center of gravity will differ depending on the character type, so if C is left as is, the reproduced character string will fluctuate. Therefore, it is necessary to normalize xt and yt by character pitch and line pitch before reproduction.
第4図は正規化のための説明図である。第4図において
Xlは文字列の中心、ΔX++ ΔX2はXlからの重
ノbのずれである。いま、X、=X、+ΔXl、x24
X1−に1であるから
に+に文字の行方向の位置とすれば良い。FIG. 4 is an explanatory diagram for normalization. In FIG. 4, Xl is the center of the character string, and ΔX++ ΔX2 is the deviation of the double knot b from Xl. Now, X, =X, +ΔXl, x24
Since X1- is 1, the position of the character in the line direction can be set to +.
本発明によれば従来のランレングス等の圧縮手法に比べ
約1桁圧縮率の向上が計れる。例えば−文字の大きさ1
64X64メツシユで表わした時、る程度である。しか
し本発明では一文誉を2000X2000メツシユ、文
字種12000とした時に位置情報のためにX方向に1
1ビツト、Y方向に11ビツト、文字種に11ビツト合
計33ビット上少ない情報で表わせる。According to the present invention, the compression ratio can be improved by about one order of magnitude compared to conventional compression methods such as run length. For example - font size 1
When expressed as a 64x64 mesh, it is about . However, in the present invention, when one sentence is 2000 x 2000 meshes and 12000 character types, 1 in the X direction for position information.
1 bit, 11 bits in the Y direction, and 11 bits for the character type, a total of 33 bits or less information.
さらに文字コードで格納しであるために再生ノくターン
の劣化が無く、又、文書の検索に該当コードを用いれば
容易にこれを行なうことが出来る。Furthermore, since the information is stored as a character code, there is no deterioration during playback, and documents can be easily searched by using the corresponding code.
第1図は本発明の理解のための説明図、第2図は本発明
の一実施例の構成図、第3図、第4図は本発明を説明す
るだめの図である。FIG. 1 is an explanatory diagram for understanding the present invention, FIG. 2 is a configuration diagram of an embodiment of the present invention, and FIGS. 3 and 4 are diagrams for explaining the present invention.
Claims (1)
元パターンの平面上の情報を付加してコード化すること
によハエ次元パターンの情報圧mk行なうことを特徴と
したパターン情報圧縮方式。1. A pattern information compression method characterized in that a two-dimensional pattern is recognized, and the information pressure mk of a fly-dimensional pattern is obtained by adding plane information of the two-dimensional pattern to the recognition result code and encoding it.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP58110922A JPS604373A (en) | 1983-06-22 | 1983-06-22 | Pattern information compressing system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP58110922A JPS604373A (en) | 1983-06-22 | 1983-06-22 | Pattern information compressing system |
Publications (1)
Publication Number | Publication Date |
---|---|
JPS604373A true JPS604373A (en) | 1985-01-10 |
Family
ID=14548023
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP58110922A Pending JPS604373A (en) | 1983-06-22 | 1983-06-22 | Pattern information compressing system |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPS604373A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002314821A (en) * | 2001-04-18 | 2002-10-25 | Ricoh Co Ltd | Image compression method, image expansion method, image compression device, and image expansion device |
JP2003333341A (en) * | 2002-04-25 | 2003-11-21 | Microsoft Corp | Compression of bi-level image with explicit representation of ink cluster |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS56100565A (en) * | 1980-01-16 | 1981-08-12 | Toshiba Corp | Facsimile device |
-
1983
- 1983-06-22 JP JP58110922A patent/JPS604373A/en active Pending
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS56100565A (en) * | 1980-01-16 | 1981-08-12 | Toshiba Corp | Facsimile device |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002314821A (en) * | 2001-04-18 | 2002-10-25 | Ricoh Co Ltd | Image compression method, image expansion method, image compression device, and image expansion device |
JP2003333341A (en) * | 2002-04-25 | 2003-11-21 | Microsoft Corp | Compression of bi-level image with explicit representation of ink cluster |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5793903A (en) | Multimedia rendering marker and method | |
JP4152789B2 (en) | Clustering system and clustering method | |
US7317838B2 (en) | Compression of bi-level images with explicit representation of ink clusters | |
EP0093244A3 (en) | Apparatus and method for transmitting compressed data | |
KR960005588A (en) | Image data editing device | |
JPH022349B2 (en) | ||
JP2811175B2 (en) | Orthogonal transform coding method for image data | |
JP2745301B2 (en) | Orthogonal transform coding method for image data | |
Mei et al. | Learn a compression for objection detection-vae with a bridge | |
JPS604373A (en) | Pattern information compressing system | |
WO2005046213A1 (en) | Document image encoding/decoding | |
KR100717026B1 (en) | Apparatus and method for binary image compression | |
JP3144824B2 (en) | Image playback device | |
JP2795150B2 (en) | Moving picture reproducing apparatus and encoding / decoding system | |
JPH04105178A (en) | Document picture processor | |
Cai et al. | Lossless image compression with tree coding of magnitude levels | |
JP2745300B2 (en) | Orthogonal transform coding method for image data | |
Ye | Text image compression based on pattern matching | |
JPS6212280A (en) | Combination system for image data | |
JPH03157064A (en) | Picture information compression transfer system | |
JPH01268350A (en) | Encoding device | |
JPS60126782A (en) | Character recognizing device | |
JPH04106670A (en) | Document picture processor | |
JPH02254858A (en) | Picture storage system for facsimile equipment | |
JPH02299379A (en) | Vector quantizer |