WO2000013144A1 - Graphical display system and method - Google Patents
Graphical display system and method Download PDFInfo
- Publication number
- WO2000013144A1 WO2000013144A1 PCT/US1999/019820 US9919820W WO0013144A1 WO 2000013144 A1 WO2000013144 A1 WO 2000013144A1 US 9919820 W US9919820 W US 9919820W WO 0013144 A1 WO0013144 A1 WO 0013144A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frequency component
- spatial frequency
- image
- compressing
- high spatial
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
Definitions
- the present invention is directed generally to a method and system of graphically displaying data with a high spatial frequency on a background with a low spatial frequency and, more particularly, to a system and method for graphically displaying text on a color background.
- a user-oriented approach to the graphical display of books, and especially antique books, has fundamentally different requirements from other approaches to digital libraries.
- the problem of displaying antique books is not at all similar to the problem faced by a librarian.
- a librarian must be interested in gaining access to the book, not in the reader's experience of reading it.
- It is also not at all similar to the problem faced by the archivist who wishes to have views preserved for scholarly research.
- the approach is that of a publisher, or re-publisher, who wishes to preserve, as best as possible, the intent of the original publisher both in page layout and in gaining audience.
- a graphical display system and method which presents the pages of a book as they actually appear with discolored pages, pictures and print, in full color, thus approximating the experience of reading the actual book.
- a graphical display system and method which preserves high-resolution, archival quality scans that are universally accessible on the Internet and in electronic form.
- a graphical display system and method which allows the reader of the book to trade off different grades of viewing fidelity against page display speed while reading the graphics.
- the present invention is directed to a computer-implemented method of preparing an image for graphical display.
- the image has a high spatial frequency component and a low spatial frequency component.
- the method includes receiving a scanned representation of the image and extracting the high spatial frequency component and the low spatial frequency component from the scanned representation of the image.
- the method also includes compressing the high spatial frequency component using a spatially lossless compression technique and compressing the low spatial frequency component using a spatially lossy compression technique.
- the present invention represents a substantial advance over prior graphical display systems and methods.
- the present invention has the advantage that it presents the pages of a book as they actually appear, with discolored pages, pictures and print, in full color, thus approximating the experience of reading the actual book.
- the present invention also has the advantage that it preserves high-resolution, archival quality scans that are universally accessible on the Internet and in electronic form.
- the present invention has the further advantage that it allows the reader of the book to trade off different grades of viewing fidelity against page display speed while reading the book.
- FIG. 1 is a diagram illustrating a graphical display system
- FIGS. 2 A - 2B are diagrams illustrating the flow through the system illustrated in
- FIG. 1 DETAILED DESCRIPTION OF THE INVENTION
- FIG. 1 illustrates a graphical display system 10.
- the system 10 includes a computer 12, which executes the various modules comprising the software portion of the system 10.
- the computer 12 may be any type of computing system suitable to execute the modules such as, for example, an IBM compatible PC, Apple Macintosh, a mainframe computer, a workstation, a personal decision aid (PDA), or an application specific integrated circuit (ASIC).
- IBM compatible PC Apple Macintosh
- mainframe computer a mainframe computer
- workstation a workstation
- PDA personal decision aid
- ASIC application specific integrated circuit
- a document scanner 14 scans an image from a document 15 and transmits a scanned representation of the image to the computer 12 via a communications link 16.
- the scanner 14 can be any type of scanner suitable such as, for example, any type of document scanner manufactured by Hewlett-Packard, Fujitsu, or Agfa that is capable of 300 DPI resolution with 8 bits of red, green, and blue.
- the communications link 16 can be any type of link suitable such as, for example, hardwired RCA-type cables or BNC-type cables, or a wireless RF connection.
- An extract module 18, a text module 20, a background module 22, and a merge module 24 are resident on and executable by the computer 12. The extract module 18 extracts high and low spatial frequency components from the scanned representation of the image.
- the text module 20 processes the high spatial frequency component of the scanned representation of the image and compresses it using a lossless compression technique such as, for example, graphics interchange format (GIF) compression.
- the background module 22 processes the low spatial frequency component of the scanned representation of the image and compresses it using a lossy compression technique such as, for example, joint photographic expert group (JPEG) compression.
- the merge module 24 merges the compressed low spatial frequency component and the compressed high frequency spatial components into a form for display on such as, for example, a video monitor on the computer
- FIGS. 2 A and 2B are diagrams illustrating the flow through the system 10 of FIG. 1.
- the document 15 is scanned, or digitally photographed, at a resolution of, for example, at least 600 dots per inch (about 40 per millimeter) in full 24 bit color to produce, for example, a raw RGB representation 28.
- a resolution of, for example, at least 600 dots per inch (about 40 per millimeter) in full 24 bit color to produce, for example, a raw RGB representation 28.
- Such a high resolution is oftentimes necessary because even old book printing technologies were capable of generating fine detail. Modern printing technologies, such as Offset and Gravure commonly yield detail that would require scanning at 2400 dots per inch.
- 600 dots per inch is usually sufficient for letterpress with carved plate engravings common to most books that are out of copyright, detail on a U.S. dollar bill, Intaglio technology that is almost two centuries old, goes as fine as .001 inch.
- Print imperfections are commonly manifest in excessively fine detail. Such imperfections, when present in old books, are often smaller than can be intentionally produced through the printing processes. However, a 600 dots per inch image capture for letterpress books typically provides enough visual accuracy to render print imperfections in a satisfactory, if not completely accurate, fashion. This same principle, that some non-focal aspects of the image can be successfully approximated while others need to be highly accurate, also holds for the non-printed components of the image.
- the high spatial frequency (text and illustrations 32) and low spatial frequency (background 44) components of the raw RGB are extracted.
- High spatial frequency fine detail
- the background, paper, and defect portions of the book are typically characterized by no need for high spatial frequency. Because print is typically black, monochrome, duochrome, or some variant thereof, and the paper and any defects are rich in color, the background color subtleties are preserved without preserving background detail. Step 30 thus gives an "improved view" quality to the scanned image.
- Step 30 is slightly counterintuitive because the print is not extracted from the image and the result processed as background. Instead, the print is extracted from the image, and a background is independently generated by blanking print from the image. Independent parameters for the two operations are computed based on generally accepted image processing principles. Examples of the parameters are the pixel thresholds t, for extracting the text and illustrations 32 and t 2 for extracting the background 44, where t, > t 2 . The reason for the bifurcation is that the independence gives great control over the eventual appearance of the rendering. Furthermore, because the display resolution is several times (more than two times) less than the scan resolution, aliasing artifacts that would be normal to such independent processing will tend to vanish at display resolution (typically 72 dots per inch).
- the text and illustrations 32 are enhanced by, for example, the application of a first Laplacian addition.
- a Laplacian addition, or image sharpening is a standard signal processing technique that simultaneously smoothes and makes edges sharper. Laplacian addition techniques are described in Hall, "Computer Image Processing and Recognition", pp. 394 et seq., which is incorporated herein by reference.
- Other edge enhancement techniques such as, for example unsharp masking and difference of Gaussians, can be used to enhance the text and illustrations 32.
- the enhanced text and illustrations are averaged down to display resolution, using an averaging filter.
- a second Laplacian addition is applied to the text and illustrations 32 at display resolution.
- the first enhancement at step 34 tends to cause detail to be preserved through the averaging process and the second enhancement 38 helps remove the "defocussed" look that is common for averaging techniques.
- the enhanced and reduced text and illustrations 32 are compressed using a lossless technique such as, for example, graphics interchange format (GIF).
- GIF graphics interchange format
- the result of the compression step, a GIF file 42 can be saved as a "transparent GIF" file.
- GIF format has the advantage that it is a supported default format for most web browsers and does not require any "plugins” or other code downloads to a web client.
- GIF compression nevertheless achieves excellent digital compression of the original image. For example, for an original page image over twenty megabytes, the resultant processed image using the teachings of the present invention, both print and background, is approximately 50 kilobytes (400 to 1 compression is achieved by this technique).
- the print GIF is actually a transparent GIF in the browser.
- the transparent part shows through the background when displayed.
- the background 44 is processed such that it is converted into a true HTML "background" type.
- the nature of the processing of the background 44 is fundamentally different from the nature of the processing for the text and illustrations 32. Because there is not enough background data in a region to give a result that does not appear overly dark, the text and illustration areas of the background 44 are blanked at step 46. The blanking is performed through an interpolation process on the text and illustration areas, which computes a color value based on the distances and colors of the other, non-text pixels in the vicinity. Any number of interpolation strategies will work because the objective is to preserve background value in a subsequent averaging step needed for display resolution. For example, the pixel values of the background 44 can be lightened by subtracting black.
- a repeating pattern of background pixel values may be generated from common light background pixel values. Because few of the pattern pixels appear in the final rendering of the background 44, a wide range of pattern techniques that preserve gradual color variation will work such as, for example, tiling a circularly grated square tile of, for example, a 16 X 16 pixel square.
- a low pass filtering process is performed to remove fine detail to improve compression performance after an averaging step 50, which reduces the background 44 to display resolution.
- the background 44 is compressed using a lossy compression technique such as, for example, the joint photographic expert group (JPEG) standard.
- JPEG joint photographic expert group
- the background 44 can be compressed to three alternative "quality" settings (as defined in the JPEG standard) having to do with preserving detail spatially and in subtlety in color.
- the JPEG standard is supported by most web browsers and tends to yield better compression of continuous tone (low spatial frequency) image data.
- JPEG is a common HTML "background" MIME type.
- the compressed text and illustrations 32 and the compressed background 44 are merged for display by, for example, a web browser by aligning the text and illustrations 32 and the background 44 and overlaying the text and illustrations 32 on the compressed background 44.
- a user can select the quality by inputting a quality selection 58 to vary the fidelity of the view of the original page.
- the quality selection 58 selects, for example, one of the three JPEG background compressions for display.
- the different JPEG quality levels can represent, for example, the number of Discrete Cosine Transform (DCT) parameters permitted to represent a given block such as, for example, a l6 X 16 or an 8 X 8 area of pixels. The fewer the number of DCT parameters, the lower the fidelity of the display but the more compact the display and thus, a quicker time to display.
- DCT Discrete Cosine Transform
- the compressed text and illustrations and the compressed background images are aligned.
- the current HTML specification does not allow the explicit alignment of a background image with an overlaying transparent GIF image.
- the http client or the user must be prompted to input the browser and the platform that is being employed in viewing the displayed image.
- the text and illustrations 32 are overlayed on the background 44 at step 56.
- the GIF compression format provides for assigning an 8-bit pixel value as transparent.
- the JPEG compression standard provides that all pixel values be displayed.
- the text and illustrations 32 is defined as the foreground image over the background 44 to ensure that the text and illustrations 32 pixel values replace the background 44 pixel values except where the text and illustrations 32 pixel values are defined as transparent.
- the result of the display step 56 can be stored in a memory device or on a storage device such as, for example, a floppy disk or a compact disc.
- the result of the display step 56 can also be processed by standard optical character recognition techniques for creating full text indices of scanned images.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Compression Of Band Width Or Redundancy In Fax (AREA)
- Image Analysis (AREA)
- Processing Or Creating Images (AREA)
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP99942542A EP1025548A1 (en) | 1998-08-31 | 1999-08-27 | Graphical display system and method |
JP2000568058A JP2002523845A (en) | 1998-08-31 | 1999-08-27 | Graphic display system and method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14402198A | 1998-08-31 | 1998-08-31 | |
US09/144,021 | 1998-08-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2000013144A1 true WO2000013144A1 (en) | 2000-03-09 |
Family
ID=22506721
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1999/019820 WO2000013144A1 (en) | 1998-08-31 | 1999-08-27 | Graphical display system and method |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP1025548A1 (en) |
JP (1) | JP2002523845A (en) |
WO (1) | WO2000013144A1 (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0411232A2 (en) * | 1989-08-04 | 1991-02-06 | International Business Machines Corporation | Method for high-quality compression by binary text images |
-
1999
- 1999-08-27 EP EP99942542A patent/EP1025548A1/en not_active Withdrawn
- 1999-08-27 JP JP2000568058A patent/JP2002523845A/en active Pending
- 1999-08-27 WO PCT/US1999/019820 patent/WO2000013144A1/en not_active Application Discontinuation
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0411232A2 (en) * | 1989-08-04 | 1991-02-06 | International Business Machines Corporation | Method for high-quality compression by binary text images |
Non-Patent Citations (1)
Title |
---|
LAVAGETTO F ET AL: "Model-based analysis of color maps for high compression coding", PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING. PROGRESS IN IMAGE ANALYSIS AND PROCESSING III, PROCEEDINGS OF 7TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, MONOPOLI, ITALY, 20-22 SEPT. 1993, 1994, Singapore, World Scientific, Singapore, pages 421 - 426, XP000856172, ISBN: 981-02-1552-5 * |
Also Published As
Publication number | Publication date |
---|---|
EP1025548A1 (en) | 2000-08-09 |
JP2002523845A (en) | 2002-07-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7894683B2 (en) | Reformatting binary image data to generate smaller compressed image data size | |
US7684085B2 (en) | Methods and apparatus for reconstructing digitized images | |
US7324247B2 (en) | Image processing apparatus, image processing program and storage medium storing the program | |
US7116446B2 (en) | Restoration and enhancement of scanned document images | |
US7433535B2 (en) | Enhancing text-like edges in digital images | |
US6628833B1 (en) | Image processing apparatus, image processing method, and recording medium with image processing program to process image according to input image | |
US7557963B2 (en) | Label aided copy enhancement | |
US20080181491A1 (en) | Color to grayscale conversion method and apparatus | |
JP4115460B2 (en) | Image processing apparatus and method, and computer program and recording medium | |
US8565531B2 (en) | Edge detection for mixed raster content (MRC) images for improved compression and image quality | |
EP1320988A2 (en) | Method including lossless compression of luminance channel and lossy compression of chrominance channels | |
KR20080001675A (en) | Image processing apparatus and image processing method | |
EP1103918B1 (en) | Image enhancement on JPEG compressed image data | |
US6924909B2 (en) | High-speed scanner having image processing for improving the color reproduction and visual appearance thereof | |
JP4226484B2 (en) | Method and system for improving weak selector signal of mixed raster signal | |
JP3899872B2 (en) | Image processing apparatus, image processing method, image processing program, and computer-readable recording medium recording the same | |
WO2000013144A1 (en) | Graphical display system and method | |
EP1039416A2 (en) | A process for reducing effects of truncation in digital images | |
JP3017249B2 (en) | Image reproduction device | |
JP2000184219A (en) | Color printing system and color printer | |
JP3017248B2 (en) | Image reproduction device | |
JP2015049631A (en) | Image processing apparatus, image forming apparatus, image processing method, program, and recording medium | |
JP3294871B2 (en) | Halftone dot area extraction device | |
Feng et al. | Image rendering for digital fax | |
JPH08163365A (en) | Picture processor |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): JP |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
ENP | Entry into the national phase |
Ref country code: JP Ref document number: 2000 568058 Kind code of ref document: A Format of ref document f/p: F |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 1999942542 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 1999942542 Country of ref document: EP |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 1999942542 Country of ref document: EP |
|
REF | Corresponds to |
Ref document number: 10082412 Country of ref document: DE Date of ref document: 20020711 Format of ref document f/p: P |