JPS604373A - Pattern information compressing system - Google Patents

Pattern information compressing system

Info

Publication number
JPS604373A
JPS604373A JP58110922A JP11092283A JPS604373A JP S604373 A JPS604373 A JP S604373A JP 58110922 A JP58110922 A JP 58110922A JP 11092283 A JP11092283 A JP 11092283A JP S604373 A JPS604373 A JP S604373A
Authority
JP
Japan
Prior art keywords
character
information
code
pattern
section
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP58110922A
Other languages
Japanese (ja)
Inventor
Hirohide Endo
遠藤 裕英
Kunihiro Okada
邦弘 岡田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Ltd
Original Assignee
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Ltd
Priority to JP58110922A
Publication of JPS604373A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/41 Bandwidth or redundancy reduction
    • H04N1/411 Bandwidth or redundancy reduction for the transmission or storage or reproduction of two-tone pictures, e.g. black and white pictures
    • H04N1/4115 Bandwidth or redundancy reduction for the transmission or storage or reproduction of two-tone pictures, e.g. black and white pictures involving the recognition of specific patterns, e.g. by symbol matching

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Document Processing Apparatus (AREA)

Abstract

PURPOSE: To improve the pattern compression ratio to about 1/100 when compressing a two-dimensional pattern, by adding positional information to the recognition result of the two-dimensional pattern and encoding the combination. CONSTITUTION: The information of a document 1 is converted into an electric signal by a scanner 2, and the electric signal is sent to a cutting section 3. The cutting section 3 detects the character parts in the electric signal and sends them to a character pattern recognizing section 4; at the same time, it supplies positional information on the cut-out characters, such as position and size, to an encoding section 5. The recognition result code from the recognizing section 4 is also sent to the encoding section 5, which encodes it together with the positional information to obtain the code information for one character. The code information is given to a storage device 6. Because a character code is held together with information on the two-dimensional plane, the two-dimensional pattern can be reproduced in the form of a fair copy.

Description

DETAILED DESCRIPTION OF THE INVENTION

[Field of Application of the Invention]

The present invention relates to a pattern compression method, and more particularly to a pattern compression method suitable for compressing the information in documents created by a word processor.

[Background of the Invention]

One of the known conventional methods for compressing patterns is run-length encoding. This method scans a two-dimensional pattern sequentially in one dimension and encodes the lengths of the consecutive runs in the pattern; because it is relatively simple, it is widely used. However, it had drawbacks.
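As an illustration of the conventional technique described above, the following is a minimal sketch of run-length encoding for one scan line of a two-tone image. It is not taken from the patent: the function names and the choice of (pixel value, run length) pairs are assumptions made for this example, and practical facsimile coders use more elaborate code tables.

```python
from typing import List, Tuple


def run_length_encode(scan_line: List[int]) -> List[Tuple[int, int]]:
    """Encode one scan line of 0/1 pixels as (pixel_value, run_length) pairs."""
    runs: List[Tuple[int, int]] = []
    for pixel in scan_line:
        if runs and runs[-1][0] == pixel:
            # Same value as the current run: extend it by one pixel.
            runs[-1] = (pixel, runs[-1][1] + 1)
        else:
            # Value changed (or first pixel): start a new run.
            runs.append((pixel, 1))
    return runs


def run_length_decode(runs: List[Tuple[int, int]]) -> List[int]:
    """Expand (pixel_value, run_length) pairs back into the scan line."""
    line: List[int] = []
    for value, length in runs:
        line.extend([value] * length)
    return line


example_line = [0, 0, 0, 1, 1, 0, 1, 1, 1, 1]
example_runs = run_length_encode(example_line)
```

A two-dimensional pattern would be handled by applying this to each scan line in turn, which is why the method sees only one-dimensional structure.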

[Purpose of the Invention]

The present invention was made to eliminate the drawbacks described above. Its object is to provide a pattern compression method that can compress a two-dimensional pattern with the compression ratio raised to about 1/100.

[Summary of the Invention]

To achieve this object, the present invention is characterized in that information compression is performed by encoding the recognition result of a two-dimensional pattern together with its position information.

[Embodiments of the Invention]

An embodiment of the present invention will be described below with reference to FIG. 1.

FIG. 1 is an example of a document created with a word processor or the like. For such a document, the present invention compresses the information by the following procedure. Each character is cut out, recognized, and encoded. Information such as the position and size of the cut-out character is then added to the recognition result to form the compressed information for one character. This is done over the entire document.

FIG. 2 shows an embodiment of the present invention. In FIG. 2, the information on a document 1 is converted into an electrical signal by a scanner 2. The cutting unit 3 detects the character parts and sends the character patterns to the recognition unit 4, and also detects the position and size of each cut-out character. The recognition result code from the recognition unit 4 and the character information relative to the two-dimensional plane (position, size, etc.) detected by the cutting unit 3 are encoded by the encoding unit 5 to obtain the code information for one character, which is stored in, for example, the storage device 6.
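The data flow through the cutting, recognition, and encoding stages can be sketched roughly as follows. This is only an illustration under stated assumptions: `Segment`, `CharRecord`, `encode_document`, and `toy_recognize` are hypothetical names, the field layout is invented for the example, and the recognizer is a stand-in lookup table rather than an actual character recognizer.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Pattern = Tuple[str, ...]  # toy bitmap: one string per pixel row


@dataclass(frozen=True)
class Segment:
    """Output of the cutting stage for one character (hypothetical layout)."""
    x: int            # position on the page, e.g. center of gravity
    y: int
    size: int         # size information
    pattern: Pattern  # cut-out character pattern passed to the recognizer


@dataclass(frozen=True)
class CharRecord:
    """Code information for one character, as formed by the encoding stage."""
    x: int
    y: int
    size: int
    code: int         # recognition result code


def encode_document(segments: List[Segment],
                    recognize: Callable[[Pattern], int]) -> List[CharRecord]:
    """Attach position and size to each recognition result code."""
    return [CharRecord(s.x, s.y, s.size, recognize(s.pattern)) for s in segments]


# Stand-in recognizer: exact lookup in a tiny table (assumption for the example).
KNOWN_PATTERNS = {("#.", "##"): 0x41, ("##", ".#"): 0x42}


def toy_recognize(pattern: Pattern) -> int:
    return KNOWN_PATTERNS[pattern]


segments = [
    Segment(x=10, y=20, size=1, pattern=("#.", "##")),
    Segment(x=50, y=20, size=2, pattern=("##", ".#")),
]
records = encode_document(segments, toy_recognize)
```

Each `CharRecord` replaces the full bitmap of a character with a handful of small integers, which is where the compression comes from.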

Because the character code is held together with the information on the two-dimensional plane, the two-dimensional pattern can be reproduced as a fair copy. That is, the character code and size information are restored to a character pattern by a character pattern generator, and the position on the two-dimensional plane is determined from the character's position information, so that the original document can be reproduced.
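The reproduction step can be sketched in the same spirit. The example below assumes a toy two-glyph font in place of the character pattern generator, treats the stored position as the top-left corner of the glyph, and models size as an integer magnification; none of these details come from the patent, and `TOY_FONT`, `generate_pattern`, and `reproduce` are hypothetical names.

```python
from typing import Dict, List, Tuple

Glyph = List[str]

# Stand-in for the character pattern generator's font (assumption).
TOY_FONT: Dict[int, Glyph] = {
    1: ["##", "#."],
    2: [".#", "##"],
}


def generate_pattern(code: int, size: int) -> Glyph:
    """Restore a character pattern from its code, magnified `size` times."""
    scaled: Glyph = []
    for row in TOY_FONT[code]:
        wide = "".join(ch * size for ch in row)  # stretch horizontally
        scaled.extend([wide] * size)             # stretch vertically
    return scaled


def reproduce(records: List[Tuple[int, int, int, int]],
              width: int, height: int) -> List[str]:
    """Place each (code, x, y, size) record on a blank page of '.' pixels."""
    page = [["."] * width for _ in range(height)]
    for code, x, y, size in records:
        for dy, row in enumerate(generate_pattern(code, size)):
            for dx, ch in enumerate(row):
                inside = 0 <= y + dy < height and 0 <= x + dx < width
                if ch == "#" and inside:
                    page[y + dy][x + dx] = "#"
    return ["".join(row) for row in page]


page = reproduce([(1, 0, 0, 1), (2, 3, 0, 2)], width=8, height=4)
```

Because the glyphs come from the generator rather than from the scanned pixels, the reproduced page is a clean copy rather than a replica of the scan.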

Next, the specific processing is described using FIG. 1 as an example. Suppose that, in the character string 「文書1」 ("Document 1") written on document 11, the character 「文」 is cut out and its position information x1, y1 (for example, the position of the center of gravity) and size information M (for example, a double-size character) are obtained. Then x1, y1, and M are sent to the encoding unit. The code CB for 「文」 is also sent from the recognition unit 4 to the encoding unit 5, and the output of the encoding unit 5 is a code output such as that shown in FIG. 3.

If, for example, the center of gravity is used as the character position information, its position differs depending on the type of character, so if it is used as is, the reproduced character string will be uneven. It is therefore necessary to normalize x1 and y1 by the character pitch and the line pitch before reproduction.
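One way to picture this normalization is to snap each measured center of gravity to the center of the character cell it falls in. The sketch below assumes a regular grid with a known origin, character pitch, and line pitch; the function name `normalize_position` and the cell-center convention are assumptions for illustration, not the patent's stated procedure.

```python
import math
from typing import Tuple


def normalize_position(x: float, y: float,
                       char_pitch: float, line_pitch: float,
                       origin: Tuple[float, float] = (0.0, 0.0)
                       ) -> Tuple[int, int, float, float]:
    """Snap a measured center of gravity to its character cell.

    Returns (column, row, cell_center_x, cell_center_y).
    """
    col = math.floor((x - origin[0]) / char_pitch)
    row = math.floor((y - origin[1]) / line_pitch)
    center_x = origin[0] + (col + 0.5) * char_pitch
    center_y = origin[1] + (row + 0.5) * line_pitch
    return col, row, center_x, center_y


# Two different characters in the same cell have slightly different
# centers of gravity, but both normalize to the same position.
first = normalize_position(58.0, 31.0, char_pitch=40, line_pitch=60)
second = normalize_position(63.5, 27.0, char_pitch=40, line_pitch=60)
```

After normalization, characters on the same line share one vertical position and are evenly spaced horizontally, so the reproduced text does not wander with the shape of each character.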

FIG. 4 is an explanatory diagram of the normalization. In FIG. 4, X1 is the center of the character string, and ΔX1 and ΔX2 are the deviations of the center of gravity from X1. Since a detected position is X1 offset by such a deviation (for example, x1 = X1 + ΔX1), X1 may be taken as the position of the character in the line direction.

[Effect of the Invention]

According to the present invention, the compression ratio can be improved by about one order of magnitude compared with conventional compression techniques such as run-length encoding. For example, when the size of one character is represented by a 64×64 mesh, it is about [...]. In the present invention, by contrast, when one document is a 2000×2000 mesh and there are 2000 character types, a character can be represented with as little as 33 bits in total: 11 bits for the position in the X direction, 11 bits for the Y direction, and 11 bits for the character type.
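The 33-bit figure can be checked with a short calculation. The sketch below assumes a 2000×2000 page and about 2000 character types, each of which fits in 11 bits because 2^11 = 2048; the packing order (X, then Y, then character code) and the names `bits_needed`, `pack`, and `unpack` are assumptions for illustration. Size information is left out, as it is in the 33-bit total.

```python
def bits_needed(n_values: int) -> int:
    """Smallest number of bits that can distinguish n_values values."""
    return max(1, (n_values - 1).bit_length())


X_BITS = bits_needed(2000)     # X position on a 2000-wide mesh
Y_BITS = bits_needed(2000)     # Y position on a 2000-high mesh
CODE_BITS = bits_needed(2000)  # about 2000 character types
TOTAL_BITS = X_BITS + Y_BITS + CODE_BITS


def pack(x: int, y: int, code: int) -> int:
    """Pack one character's position and code into a single integer."""
    if not (0 <= x < 2000 and 0 <= y < 2000 and 0 <= code < 2000):
        raise ValueError("value out of range")
    return (x << (Y_BITS + CODE_BITS)) | (y << CODE_BITS) | code


def unpack(word: int):
    """Recover (x, y, code) from a packed integer."""
    code = word & ((1 << CODE_BITS) - 1)
    y = (word >> CODE_BITS) & ((1 << Y_BITS) - 1)
    x = word >> (Y_BITS + CODE_BITS)
    return x, y, code


RAW_BITS = 64 * 64  # uncompressed two-tone bitmap of one 64x64 character
word = pack(1999, 1234, 1500)
```

Here `RAW_BITS` is 4096, so one character shrinks by a factor of roughly 124 relative to its raw bitmap, which is consistent with the "about 1/100" figure in the abstract.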

Furthermore, because the information is stored as character codes, there is no degradation of the reproduced pattern, and documents can easily be searched by using the relevant codes.

[Brief Description of the Drawings]

FIG. 1 is an explanatory diagram for understanding the present invention, FIG. 2 is a block diagram of an embodiment of the present invention, and FIGS. 3 and 4 are diagrams for explaining the present invention.

Claims (1)

[Claims]

1. A pattern information compression method characterized in that a two-dimensional pattern is recognized, and information compression of the two-dimensional pattern is performed by adding information on the plane of the two-dimensional pattern to the recognition result code and encoding the result.
JP58110922A 1983-06-22 1983-06-22 Pattern information compressing system Pending JPS604373A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58110922A JPS604373A (en) 1983-06-22 1983-06-22 Pattern information compressing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58110922A JPS604373A (en) 1983-06-22 1983-06-22 Pattern information compressing system

Publications (1)

Publication Number Publication Date
JPS604373A true JPS604373A (en) 1985-01-10

Family

ID=14548023

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58110922A Pending JPS604373A (en) 1983-06-22 1983-06-22 Pattern information compressing system

Country Status (1)

Country Link
JP (1) JPS604373A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002314821A (en) * 2001-04-18 2002-10-25 Ricoh Co Ltd Image compression method, image expansion method, image compression device, and image expansion device
JP2003333341A (en) * 2002-04-25 2003-11-21 Microsoft Corp Compression of bi-level image with explicit representation of ink cluster

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS56100565A (en) * 1980-01-16 1981-08-12 Toshiba Corp Facsimile device

Similar Documents

Publication Publication Date Title
US5793903A (en) Multimedia rendering marker and method
JP4152789B2 (en) Clustering system and clustering method
US7317838B2 (en) Compression of bi-level images with explicit representation of ink clusters
EP0093244A3 (en) Apparatus and method for transmitting compressed data
KR960005588A (en) Image data editing device
JPH022349B2 (en)
JP2811175B2 (en) Orthogonal transform coding method for image data
JP2745301B2 (en) Orthogonal transform coding method for image data
Mei et al. Learn a compression for objection detection-vae with a bridge
JPS604373A (en) Pattern information compressing system
WO2005046213A1 (en) Document image encoding/decoding
KR100717026B1 (en) Apparatus and method for binary image compression
JP3144824B2 (en) Image playback device
JP2795150B2 (en) Moving picture reproducing apparatus and encoding / decoding system
JPH04105178A (en) Document picture processor
Cai et al. Lossless image compression with tree coding of magnitude levels
JP2745300B2 (en) Orthogonal transform coding method for image data
Ye Text image compression based on pattern matching
JPS6212280A (en) Combination system for image data
JPH03157064A (en) Picture information compression transfer system
JPH01268350A (en) Encoding device
JPS60126782A (en) Character recognizing device
JPH04106670A (en) Document picture processor
JPH02254858A (en) Picture storage system for facsimile equipment
JPH02299379A (en) Vector quantizer