WO2002062050A2 - Compound document image compression using multi-region two layer format - Google Patents

Compound document image compression using multi-region two layer format Download PDF

Info

Publication number
WO2002062050A2
WO2002062050A2 PCT/US2002/002060 US0202060W WO02062050A2 WO 2002062050 A2 WO2002062050 A2 WO 2002062050A2 US 0202060 W US0202060 W US 0202060W WO 02062050 A2 WO02062050 A2 WO 02062050A2
Authority
WO
WIPO (PCT)
Prior art keywords
text
layer
color
layers
document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2002/002060
Other languages
English (en)
French (fr)
Other versions
WO2002062050A3 (en
Inventor
Jian Fan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HP Inc
Original Assignee
Hewlett Packard Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hewlett Packard Co filed Critical Hewlett Packard Co
Priority to GB0317737A priority Critical patent/GB2390257B/en
Priority to JP2002562074A priority patent/JP4463476B2/ja
Priority to DE10295968T priority patent/DE10295968T5/de
Publication of WO2002062050A2 publication Critical patent/WO2002062050A2/en
Publication of WO2002062050A3 publication Critical patent/WO2002062050A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/41Bandwidth or redundancy reduction

Definitions

  • the present invention relates to an apparatus and method for compressing
  • Formatting a document as a PDF file means that the document can be
  • Color information is identified for the text in the text layers. Color information may
  • the two layers can be compressed and
  • FIG. 2 is a diagram conceptually illustrating text layers and bodies of text each having a uniform color in the text layers;
  • FIG. 3 is a diagram illustrating an exemplary text layer of FIG. 2 along with a
  • FIG. 5 is a flow chart of a method for execution by the computer system for formatting a document using a two layer format.
  • Embodiments consistent with the present invention divide an image into, for
  • Each region is separated into two layers, a layer of text within the region and a layer of non-text information. Both layers have the same size as the
  • bit value “ 1 " means that the pixel is a text pixel and bit value "0" means the pixel is not a text pixel; different values can alternatively be used.
  • the color of the text can be represented by, for
  • the non-text layer is represented by, for example, a two-dimensional matrix that uses
  • R, G, B values is known in the art and includes the use of three
  • a first byte specifies the value of the color red for
  • a second byte specifies the value of the color green for the pixel
  • Each byte specifies the value of the color blue for the pixel.
  • Each byte having eight bits,
  • FIG. 1 is a diagram conceptually illustrating a two layer format for a document
  • regions 24, 26, and 28 based upon colors of bodies of text. All of the text in region 24, in this example, has the same color.
  • the regions each define a physical space within the document
  • Region 24 has a
  • region 26 has a text layer 16 and a non-text layer
  • region 28 has a text layer 20 and a non-text layer 22.
  • Each text layer is a text layer 22 and a non-text layer 22.
  • text layer represents the image or non-text information within, for example, the same
  • Regions 24, 26, and 28 are shown conceptually in FIG. 1 and the illustrated layers are not necessarily
  • Bodies of text having the same color are used to define the physical space for each region. This feature maintains the color information for the
  • FIG. 2 is a diagram
  • Document 30 includes various regions 31, 32, 33, 34, and 35
  • FIG. 3 is a diagram illustrating the relation between
  • An exemplary text layer 36 has a corresponding non-text
  • the other text layers can have different dimensions
  • non-text layers can exist in one of the regions or span multiple regions. Color values
  • System 40 can include
  • Network 54 represents any type
  • wireline or wireless network can be used, for example, to transmit formatted
  • Computer system 40 typically includes a memory 52, a processor 42, an input device 50, a display device 44, a printer 48, a
  • Memory 52 may include random access memory (RAM) or similar types of RAM
  • memory may store one or more applications for execution by processor 42.
  • Secondary storage device 56 may include a hard disk drive, floppy disk drive, CD-
  • Processor 42 may execute instructions stored in ROM drive, or other types of non- volatile data storage.
  • Processor 42 may execute instructions stored in ROM drive, or other types of non- volatile data storage.
  • computer system 40 such as a keyboard, key pad, cursor-
  • Scanner 46 may include any device for converting a hard copy of information into an
  • Computer system 40 can also include output devices such as
  • computer-readable media may include instructions for controlling computer system 40
  • a document can be scanned into memory 52 using
  • a non-text layer is then created by, for example, excluding all text pixels
  • This segmentation can be accomplished by a number of
  • a first region contains the first
  • region information is updated to include the new character. If the new character is not
  • the color information can be specified
  • the layers in the regions can be compressed (step 68). Since
  • lossless compression is the G4 compression method, and an example of
  • JPEG Joint Photographic Experts Group
  • the compressed layers can be output to a file (step 70).
  • the file can be stored,
  • An XObject is created for the text layer

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)
  • Image Processing (AREA)
  • Facsimile Image Signal Circuits (AREA)
  • Color Image Communication Systems (AREA)
  • Silver Salt Photography Or Processing Solution Therefor (AREA)
PCT/US2002/002060 2001-01-31 2002-01-23 Compound document image compression using multi-region two layer format Ceased WO2002062050A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
GB0317737A GB2390257B (en) 2001-01-31 2002-01-23 Compound document image compression using multi-region two layer format
JP2002562074A JP4463476B2 (ja) 2001-01-31 2002-01-23 複数領域2レイヤフォーマットを用いた複合文書画像圧縮
DE10295968T DE10295968T5 (de) 2001-01-31 2002-01-23 Verbunddokumentbildkompression unter Verwendung eines Mehrfachregion-Zweischichtformats

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/774,074 US7145676B2 (en) 2001-01-31 2001-01-31 Compound document image compression using multi-region two layer format
US09/774,074 2001-01-31

Publications (2)

Publication Number Publication Date
WO2002062050A2 true WO2002062050A2 (en) 2002-08-08
WO2002062050A3 WO2002062050A3 (en) 2003-01-23

Family

ID=25100169

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2002/002060 Ceased WO2002062050A2 (en) 2001-01-31 2002-01-23 Compound document image compression using multi-region two layer format

Country Status (6)

Country Link
US (1) US7145676B2 (https=)
JP (1) JP4463476B2 (https=)
DE (1) DE10295968T5 (https=)
GB (1) GB2390257B (https=)
TW (1) TW522737B (https=)
WO (1) WO2002062050A2 (https=)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7272180B2 (en) 2002-10-01 2007-09-18 Avocent Corporation Video compression system
US7336839B2 (en) 2004-06-25 2008-02-26 Avocent Corporation Digital video compression command priority
US7457461B2 (en) 2004-06-25 2008-11-25 Avocent Corporation Video compression noise immunity
US7782961B2 (en) 2006-04-28 2010-08-24 Avocent Corporation DVC delta commands
US8718147B2 (en) 2006-02-17 2014-05-06 Avocent Huntsville Corporation Video compression algorithm
US9424215B2 (en) 2006-08-10 2016-08-23 Avocent Huntsville Corporation USB based virtualized media system
US9560371B2 (en) 2003-07-30 2017-01-31 Avocent Corporation Video compression system

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7600183B2 (en) * 2000-06-16 2009-10-06 Olive Software Inc. System and method for data publication through web pages
JP2003046790A (ja) * 2001-07-30 2003-02-14 Fuji Photo Film Co Ltd 色修正指示装置
US7346215B2 (en) * 2001-12-31 2008-03-18 Transpacific Ip, Ltd. Apparatus and method for capturing a document
US7343046B2 (en) * 2004-02-12 2008-03-11 Xerox Corporation Systems and methods for organizing image data into regions
US6992686B2 (en) * 2004-06-14 2006-01-31 Xerox Corporation System and method for dynamic control of file size
US7542164B2 (en) * 2004-07-14 2009-06-02 Xerox Corporation Common exchange format architecture for color printing in a multi-function system
US7594169B2 (en) * 2005-08-18 2009-09-22 Adobe Systems Incorporated Compressing, and extracting a value from, a page descriptor format file
US7913160B2 (en) * 2006-02-16 2011-03-22 Xerox Corporation Document versioning based on layer content
DE102006010763A1 (de) * 2006-03-08 2007-09-13 Netviewer Gmbh Hybrides Bildkompressionsverfahren
US7483186B2 (en) * 2006-07-03 2009-01-27 Xerox Corporation Pitch to pitch online gray balance calibration
US7903873B2 (en) * 2007-09-13 2011-03-08 Microsoft Corporation Textual image coding
JP5523047B2 (ja) * 2008-10-20 2014-06-18 キヤノン株式会社 電子文書生成方法及び電子文書生成装置
US9069731B2 (en) * 2009-12-29 2015-06-30 Olive Software Inc. System and method for providing online versions of print-medium publications
JP5847062B2 (ja) 2012-11-27 2016-01-20 京セラドキュメントソリューションズ株式会社 画像処理装置
JP5847063B2 (ja) 2012-11-27 2016-01-20 京セラドキュメントソリューションズ株式会社 画像処理装置
WO2016142969A1 (ja) 2015-03-12 2016-09-15 ルネサスエレクトロニクス株式会社 データ処理装置、データ処理システム及びその方法

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5280367A (en) * 1991-05-28 1994-01-18 Hewlett-Packard Company Automatic separation of text from background in scanned images of complex documents
US5243414A (en) * 1991-07-29 1993-09-07 Tektronix, Inc. Color processing system
DE69332344T2 (de) 1992-07-20 2003-06-05 Canon K.K., Tokio/Tokyo Bildverarbeitungsgerät und Bildübertragungsgerät
US5991515A (en) * 1992-11-10 1999-11-23 Adobe Systems Incorporated Method and apparatus for compressing and decompressing data prior to display
US5638498A (en) 1992-11-10 1997-06-10 Adobe Systems Incorporated Method and apparatus for reducing storage requirements for display data
US5706096A (en) 1994-04-25 1998-01-06 Ricoh Company, Ltd. Optimum line density determining method and system
US5552898A (en) 1994-07-06 1996-09-03 Agfa-Gevaert Lossy and lossless compression in raster image processor
US5778092A (en) 1996-12-20 1998-07-07 Xerox Corporation Method and apparatus for compressing color or gray scale documents
US5982937A (en) 1996-12-24 1999-11-09 Electronics For Imaging, Inc. Apparatus and method for hybrid compression of raster data
US6400844B1 (en) 1998-12-02 2002-06-04 Xerox Corporation Method and apparatus for segmenting data to create mixed raster content planes
US6809741B1 (en) * 1999-06-09 2004-10-26 International Business Machines Corporation Automatic color contrast adjuster
US6856428B1 (en) * 1999-06-10 2005-02-15 Electronics For Imaging, Inc. Black text printing from page description languages

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7738553B2 (en) 2002-10-01 2010-06-15 Avocent Corporation Video compression system
US7809058B2 (en) 2002-10-01 2010-10-05 Avocent Corporation Video compression system
US7515633B2 (en) 2002-10-01 2009-04-07 Avocent Corporation Video compression system
US7515632B2 (en) 2002-10-01 2009-04-07 Avocent Corporation Video compression system
US7542509B2 (en) 2002-10-01 2009-06-02 Avocent Corporation Video compression system
US7720146B2 (en) 2002-10-01 2010-05-18 Avocent Corporation Video compression system
US7321623B2 (en) 2002-10-01 2008-01-22 Avocent Corporation Video compression system
US7272180B2 (en) 2002-10-01 2007-09-18 Avocent Corporation Video compression system
US8385429B2 (en) 2002-10-01 2013-02-26 Avocent Corporation Video compression encoder
US9560371B2 (en) 2003-07-30 2017-01-31 Avocent Corporation Video compression system
US7336839B2 (en) 2004-06-25 2008-02-26 Avocent Corporation Digital video compression command priority
US7457461B2 (en) 2004-06-25 2008-11-25 Avocent Corporation Video compression noise immunity
US8805096B2 (en) 2004-06-25 2014-08-12 Avocent Corporation Video compression noise immunity
US8718147B2 (en) 2006-02-17 2014-05-06 Avocent Huntsville Corporation Video compression algorithm
US8660194B2 (en) 2006-04-28 2014-02-25 Avocent Corporation DVC delta commands
US7782961B2 (en) 2006-04-28 2010-08-24 Avocent Corporation DVC delta commands
US9424215B2 (en) 2006-08-10 2016-08-23 Avocent Huntsville Corporation USB based virtualized media system

Also Published As

Publication number Publication date
JP2005500709A (ja) 2005-01-06
GB0317737D0 (en) 2003-09-03
GB2390257B (en) 2005-06-29
US20020101609A1 (en) 2002-08-01
JP4463476B2 (ja) 2010-05-19
DE10295968T5 (de) 2004-04-22
US7145676B2 (en) 2006-12-05
GB2390257A (en) 2003-12-31
WO2002062050A3 (en) 2003-01-23
TW522737B (en) 2003-03-01

Similar Documents

Publication Publication Date Title
US7145676B2 (en) Compound document image compression using multi-region two layer format
JP3063957B2 (ja) 画像処理装置
US10136128B2 (en) Cell-based compression with edge detection
CN102577345B (zh) 图像处理设备及其处理方法
JP3615399B2 (ja) 画像処理装置および画像処理方法
US8503036B2 (en) System and method of improving image quality in digital image scanning and printing by reducing noise in output image data
JP2006005939A (ja) スキャンされた文書用のセグメント化に基づくハイブリッド圧縮機構
JP4861711B2 (ja) 画像処理装置、画像圧縮方法、画像圧縮プログラム及び記録媒体
JP2005020227A (ja) 画像圧縮装置
JP2003163801A (ja) 画像処理装置および画像処理方法、画像処理プログラム、記憶媒体
US20090303505A1 (en) Subtractive color method, subtractive color processing apparatus, image forming apparatus, and computer-readable storage medium for computer program
US20040109182A1 (en) System for processing monochrome and full-color digital image data
CN1984219A (zh) 图像处理设备和图像处理方法
US8971647B2 (en) Image compression apparatus, image compression method, and storage medium
CN106488074B (zh) 图像处理装置以及电子文件生成方法
US7307760B2 (en) Raster image path architecture
CN105847620B (zh) 用于对数字图像的属性平面进行压缩的方法及打印设备
US7190837B2 (en) Compression of mixed raster content (MRC) image data
US6272251B1 (en) Fully automatic pasting of images into compressed pre-collated documents
JP3899872B2 (ja) 画像処理装置、画像処理方法ならびに画像処理プログラムおよびこれを記録したコンピュータ読み取り可能な記録媒体
JP3960210B2 (ja) 画像処理装置
JP2020043461A (ja) 画像処理装置と画像処理方法、及びプログラム
JP3346051B2 (ja) 画像処理装置
Triantaphillidou et al. Digital image file formats
JP2000227848A (ja) 画像処理装置

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW

ENP Entry into the national phase

Ref document number: 0317737

Country of ref document: GB

Kind code of ref document: A

Free format text: PCT FILING DATE = 20020123

AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW

WWE Wipo information: entry into national phase

Ref document number: 2002562074

Country of ref document: JP

Ref document number: 0317737.5

Country of ref document: GB

RET De translation (de og part 6b)

Ref document number: 10295968

Country of ref document: DE

Date of ref document: 20040422

Kind code of ref document: P

WWE Wipo information: entry into national phase

Ref document number: 10295968

Country of ref document: DE

REG Reference to national code

Ref country code: DE

Ref legal event code: 8607