US20100067023A1 - Image processing apparatus, image processing method and program

Info

Publication number
US20100067023A1
Authority
US
United States
Prior art keywords
image
character
high resolution
resolution
region
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/554,766
Other languages
English (en)
Inventor
Naoki Ito
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc
Assigned to CANON KABUSHIKI KAISHA. Assignment of assignors interest (see document for details). Assignors: ITO, NAOKI
Publication of US20100067023A1
Priority to US14/296,290 (US10477063B2)

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/41 Bandwidth or redundancy reduction
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06K GRAPHICAL DATA READING; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K15/00 Arrangements for producing a permanent visual presentation of the output data, e.g. computer output printers
    • G06K15/02 Arrangements for producing a permanent visual presentation of the output data, e.g. computer output printers using printers
    • G06K15/18 Conditioning data for presenting it to the physical printing elements
    • G06K15/1867 Post-processing of the composed and rasterized print image
    • G06K15/1872 Image enhancement
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/413 Classification of content, e.g. text, photographs or tables
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/414 Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/387 Composing, repositioning or otherwise geometrically modifying originals
    • H04N1/3871 Composing, repositioning or otherwise geometrically modifying originals the composed originals being of different kinds, e.g. low- and high-resolution originals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/40 Picture signal circuits
    • H04N1/409 Edge or detail enhancement; Noise or error suppression
    • H04N1/4092 Edge or detail enhancement

Definitions

  • The present invention relates to an image processing apparatus that performs a character vectorization process on a character object recognized by an object recognition process when generating vectorized data from an input image.
  • Japanese Patent Laid-Open No. 2002-142128 discloses an image processing apparatus configured to have the following construction as an image processing apparatus capable of generating a color document image with high quality.
  • An input image is converted into an image with low resolution.
  • the converted image with the low resolution is used to generate a subtractive color image, and neighboring pixels allotted to the same representative color from the subtractive color image are integrated to extract a FCR (Flat Color Region) candidate region.
  • An image having higher resolution than the subtractive color image is used to once more determine the region candidate.
  • a boundary of the FCR is detected to determine a final FCR, and a representative color of the determined FCR is selected to perform a coloring process for replacing a color close to a pure color for the pure color.
  • the FCR is drawn over (combined to) the input image to perform reproduction of the image.
  • An image processing apparatus in the present invention comprises a resolution converting unit configured to generate an image with low resolution by performing resolution conversion to a compressed image with high resolution, an object dividing unit configured to obtain information of a character object with low resolution by performing object division to the image with the low resolution, a decompressing unit configured to decompress the compressed image with the high resolution, and a character contour generating unit configured to generate an image showing a character contour with high resolution by using the information of the character object with the low resolution obtained by the object dividing unit and the image with the high resolution decompressed by the decompressing unit.
  • FIG. 1 is a block diagram showing a construction of an image processing system according to embodiments in the present invention
  • FIG. 2 is a block diagram showing a construction of MFP
  • FIG. 3 is a flow chart showing the process flow according to a first embodiment
  • FIG. 4 is a schematic diagram illustrating the processes according to the first embodiment
  • FIG. 5 is a schematic diagram illustrating the character contour generating process according to the first embodiment
  • FIG. 6 is a flow chart showing the process flow according to a second embodiment
  • FIG. 7 is a flow chart showing the flow of the partial decompressing process according to the second embodiment.
  • FIGS. 8A to 8C are diagrams explaining the flow of the partial decompressing process according to the second embodiment.
  • FIG. 9 is a flow chart showing the flow of a one-character decompressing process according to a third embodiment.
  • FIG. 10 is a diagram explaining the flow of the one-character decompressing process according to the third embodiment.
  • FIG. 11 is a flow chart showing the flow of a decompressing process according to a fourth embodiment
  • FIG. 12 is a flow chart showing the flow of a decompressing process according to a fifth embodiment
  • FIG. 13 is a flow chart showing image reading-in from a scanner
  • FIG. 14 shows diagrams of object division region information
  • FIG. 15 shows diagrams of attribute block information and input file information
  • FIG. 16 is a diagram showing a document which is an object of a document structure tree generating process
  • FIG. 17 is a diagram showing a document structure tree generated by the document structure tree generating process
  • FIG. 18 shows diagrams, each showing an example of a format of data vectorized by a function approximate process.
  • FIG. 19 is a diagram showing an effect obtained by an application of the present embodiment.
  • FIG. 1 is a block diagram showing a construction of an image processing system according to embodiments in the present invention.
  • the image processing system is configured of an office 10 and an office 20 each provided with various devices.
  • the office 10 is provided with a LAN 107 built therein, the office 20 is provided with a LAN 108 built therein, and the two LANs are connected with each other through the Internet 104.
  • the office 10 is provided with a multifunction peripheral (hereinafter referred to as MFP) 100, a management PC 101 for controlling the MFP 100, a local PC 102, a proxy server 103, a data base 105, and a document management server 106. These devices are connected by the LAN 107.
  • the office 20 is provided with a local PC 102 , a proxy server 103 , a data base 105 , and a document management server 106 . These devices are connected by the LAN 108 .
  • the LAN 107 and the LAN 108 are connected respectively to the proxy servers 103 and are connected through the respective proxy servers 103 to the Internet 104 .
  • the MFP 100 performs a part of image processing to an input image read from a document and transmits the image-processed image data through the LAN 109 to the management PC 101 .
  • the MFP 100 interprets PDL (Page Description Language) data received through the LAN 107 from the local PC 102 or a general PC (not shown) and prints out the result. Further, the MFP 100 transmits the image data read from the document through the LAN 107 to the local PC 102 or the general PC.
  • the management PC 101 is a computer including an image storage unit, an image processing unit, a displaying unit, an input unit (not shown) and the like.
  • the management PC 101 performs the image process in cooperation with the MFP 100 . It should be noted that in the present embodiment, the management PC 101 performs a registration process to be described hereinafter and the like to the data base 105 , but the MFP 100 itself may be configured to perform the registration process.
  • FIG. 2 is a block diagram showing a construction of the MFP 100 .
  • the MFP 100 is provided with an image reading unit 110 , a storage unit (box) 111 , a printing unit 112 , an input unit 113 , network I/F units 114 and 117 , and a data processing unit 115 .
  • the image reading unit 110 is configured of a scanner having an auto document feeder.
  • the image reading unit 110 emits light on images in a bundle of documents or in one document and forms an image of the reflected light on a solid-state image pickup device.
  • the solid-state image pickup device generates an image signal with a predetermined resolution (for example, 600 dpi) and a predetermined luminance level (for example, 8 bits). This image signal is formed of image data made of raster data.
  • the data processing unit 115 performs the image process for copying to the image data received from the image reading unit 110 and converts the processed image data into print data, which are outputted to the printing unit 112 .
  • the data processing unit 115 stores the print data corresponding to one page in the storage unit 111 once and thereafter, sequentially outputs the print data to the printing unit 112 .
  • the printing unit 112 prints an image or the like on a print paper based upon the print data.
  • the data processing unit 115 interprets and processes a PDL language received through the network I/F 114 from the local PC 102 or the other general PC (not shown) to generate print data, which are outputted to the printing unit 112 .
  • the storage unit 111 has a function for storing data obtained by rendering the image data received from the image reading unit 110 or the PDL data received from the local PC 102.
  • the storage unit 111 is controlled directly by the management PC 101 through the network I/F 117 and the LAN 109 .
  • the input unit 113 is provided with operational interfaces, such as a touch panel and a key board, and receives an operational input from a user.
  • the MFP 100 is also operable by operating a key board, a pointing device or the like which is provided in the management PC 101 .
  • a displaying unit 116 displays a status, image data and the like in the MFP 100 .
  • FIG. 3 is a flow chart showing the flow of the vectorization process according to the first embodiment.
  • the MFP 100 inputs an image using the image reading unit 110 .
  • the MFP 100 performs an image process to the input image in accordance therewith.
  • the image process is, for example, a known process such as color conversion or a filtering process.
  • the MFP 100 performs a compressing process to the image-processed image data to generate compressed image data.
  • An example of the compressing method is JPEG.
  • the MFP 100 spools the compressed image data in the storage unit 111 .
  • the storage unit 111 is, for example, a memory or a hard disc.
  • the MFP 100 performs a decompressing process to the spooled compressed image data.
  • the decompressing process to the compressed image is performed by JPEG. That is, at step S 305 , a decompressing method corresponding to the compressing method used at step S 303 is used.
  • the MFP 100 performs a resolution converting process for reducing resolution of the decompressed image data. For example, the image data with the resolution of 600 dpi are converted into the image data with the resolution of 300 dpi.
  • the MFP 100 performs an object dividing process (region dividing process) to the image data after the resolution conversion is made.
  • object kind includes a character object, a photo object, a graphic (drawing, line drawing or table) object, and a background object.
  • At step S 312, the MFP 100 decompresses the image data compressed at step S 303.
  • the MFP 100 performs a character contour generating process to the character object.
  • the MFP 100 refers to a portion in the image data with the high resolution corresponding to the character object to generate a contour of the character.
  • the contour of the character is generated based upon the character object divided at step S 307 and the image data decompressed at step S 312 . That is, the MFP 100 generates the contour of the character using the character object divided from the image data with the low resolution and the image data with the high resolution after the decompressing. A detail of this character contour generating process will be described later.
  • the MFP 100 performs a function approximate process to the character contour generated at step S 308 , which is converted into path data (vector data).
  • the MFP 100 formats the path data to a general format.
  • Examples of the general format include PDF, XPS, SVG, and XML. It should be noted that the MFP 100 may store the formatted data in the storage unit 111 or transmit it through the network to an external device such as a PC.
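The formatting step can be illustrated with SVG, one of the general formats named above. The following is a minimal sketch, not the method of the patent: it assumes the character contour has already been reduced to a closed polygon, and it emits straight line segments only (the function approximate process itself, which may fit curves, is not shown). The function names `contour_to_svg_path` and `wrap_svg` are hypothetical.

```python
def contour_to_svg_path(points):
    """Format a closed polygon, given as (x, y) tuples, as an SVG path 'd' string.

    Only M (moveto), L (lineto) and Z (closepath) commands are produced;
    curve fitting is outside the scope of this sketch.
    """
    if not points:
        return ""
    head = "M {} {}".format(*points[0])
    tail = " ".join("L {} {}".format(x, y) for x, y in points[1:])
    return (head + " " + tail).strip() + " Z"


def wrap_svg(path_d, width, height):
    """Wrap one path in a minimal SVG document sized in pixels."""
    return (
        '<svg xmlns="http://www.w3.org/2000/svg" '
        'width="{w}" height="{h}" viewBox="0 0 {w} {h}">'
        '<path d="{d}" fill="black"/></svg>'
    ).format(w=width, h=height, d=path_d)


if __name__ == "__main__":
    d = contour_to_svg_path([(0, 0), (4, 0), (4, 4)])
    print(wrap_svg(d, 8, 8))
```

Because the path is resolution independent, the same data can be rendered at any output size, which is the reason for converting the contour into vector data in the first place.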
  • FIG. 4 is a diagram showing the aforementioned vectorization process.
  • the MFP 100 scans a document ( 411 ), performs an image process to the scanned image ( 412 ), and performs an image compression to the image-processed image data ( 413 ).
  • the compressing method is JPEG.
  • the MFP 100 spools the compressed image data in the storage unit 111 ( 414 ), reads out the spooled compressed data for decompression ( 415 ), and generates image data with resolution of 600 dpi and the gradation number of 24 bits.
  • the MFP 100 performs resolution conversion for converting the generated image data with the resolution of 600 dpi and the gradation number of 24 bits ( 402 ) into an image with low resolution ( 401 ) to generate image data with resolution of 300 dpi and the gradation number of 24 bits. It should be noted that the above resolution or the above gradation number is used simply as an example, and the present invention is not limited thereto.
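The text does not state how the resolution conversion is carried out. As a minimal sketch, assuming a single 8-bit channel and simple averaging of each 2 × 2 block (both assumptions; a 24-bit image would apply the same step to each channel), halving 600 dpi to 300 dpi could look as follows. The name `halve_resolution` is hypothetical.

```python
def halve_resolution(image):
    """Downscale an image (list of rows of ints) by averaging 2x2 blocks.

    A trailing odd row or column is dropped. The averaging filter is an
    assumption of this sketch; the source only says the resolution is reduced.
    """
    height, width = len(image), len(image[0])
    out = []
    for y in range(0, height - 1, 2):
        row = []
        for x in range(0, width - 1, 2):
            total = (image[y][x] + image[y][x + 1]
                     + image[y + 1][x] + image[y + 1][x + 1])
            row.append(total // 4)
        out.append(row)
    return out


if __name__ == "__main__":
    print(halve_resolution([[0, 4, 8, 8], [4, 8, 8, 8]]))
```

The output has one quarter of the pixels of the input, which is why the object division performed on it needs less memory and time than on the full-resolution image.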
  • the MFP 100 performs an object division to the image data ( 403 ) with the resolution of 300 dpi and the gradation number of 24 bits ( 404 ) to find object division region information and character information ( 405 and 406 ).
  • the object division region information serves to show in which region in the image and what kind of an object (object kind such as a character object) exists. That is, the object division region information ( 405 ) is the information showing position information of a rectangular region and an attribute of an object (object kind) contained in the rectangular region, and can be determined using a known technology.
  • the character information ( 406 ) is the information showing a character object in the region where the character object exists.
  • the character information ( 406 ) is the information showing a region of the character attribute by binary image data (that is, image of 300 dpi and one bit), and can be obtained, for example, by binarizing the region of the character attribute.
  • the character information is a binary image with low resolution in the character region where a pixel of the character is expressed as one and a pixel of the background which is not the character is expressed as zero.
  • the MFP 100 uses the object division region information, the character information, and the image data with the resolution of 600 dpi and the gradation number of 24 bits which is the image with the high resolution before the resolution conversion is made (after the decompression is performed) to perform a character contour generating process to be described later ( 407 ).
  • by the character contour generating process ( 407 ), character contour image data with resolution of 600 dpi and the gradation number of one bit can be obtained ( 408 ).
  • the MFP 100 performs a function approximate process to the character contour image data 408 to convert it into path data ( 409 ) and formats the path data (that is, vector data) ( 410 ).
  • FIG. 5 is a diagram showing the character contour generating process ( 407 ).
  • Numeral 501 shows an image with low resolution including object division region information and character information. The resolution is set as 300 dpi.
  • Numeral 502 shows a luminance image having resolution of 600 dpi and the gradation number of 8 bits generated from the image with the high resolution having resolution of 600 dpi and the gradation number of 24 bits.
  • the generation of the luminance image is realized by a known method. For example, the process for converting a color image (RGB image) of 24 bits into a luminance image of 8 bits is to be performed using a given conversion expression.
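The "given conversion expression" is not specified in the text. One commonly used choice is the ITU-R BT.601 luma weighting, which is assumed here purely for illustration; the function names are hypothetical.

```python
def rgb_to_luminance(r, g, b):
    """Convert one 24-bit RGB pixel to an 8-bit luminance value.

    Uses BT.601 weights (0.299, 0.587, 0.114) in integer arithmetic with
    rounding. The choice of weights is an assumption of this sketch.
    """
    return (299 * r + 587 * g + 114 * b + 500) // 1000


def to_luminance_image(rgb_image):
    """Apply the conversion to an image given as rows of (r, g, b) tuples."""
    return [[rgb_to_luminance(*pixel) for pixel in row] for row in rgb_image]


if __name__ == "__main__":
    print(to_luminance_image([[(255, 255, 255), (255, 0, 0), (0, 0, 0)]]))
```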
  • Numeral 504 shows one region in the image 501 with resolution of 300 dpi.
  • a position of a region of the character attribute is specified based upon region information 405
  • the region is an image 504 with resolution of 300 dpi and the gradation number of one bit which is formed by taking out a portion of a region corresponding to the specified position of the region of the character from character information 406 .
  • the image 504 is a binary image, and pixels painted in black are, as shown in numeral 503 , pixels for forming a character in the image 504 .
  • One square in the image shown in numeral 504 expresses one pixel in an image with low resolution.
  • Numeral 505 shows a region in the image with high resolution having resolution of 600 dpi and the gradation number of 8 bits corresponding to the region of the image 504 .
  • One square in the image in numeral 505 expresses one pixel in the image with the high resolution.
  • the image 505 is a luminance image (multi-value image) and in most cases, a pixel in gray exists in the boundary between a pixel forming a character and a pixel forming the background.
  • Numeral 506 shows a character image having resolution of 600 dpi and the gradation number of one bit generated using the image 504 with the low resolution and the image 505 with the high resolution. That is, the image 506 is a binary image with the high resolution.
  • a method of generating the character image 506 is as follows.
  • the scan is carried out on the low-resolution image 504 having resolution of 300 dpi in raster sequence.
  • the one pixel is divided into pixels of 2 × 2 (four pixels) to be converted into the image with resolution of 600 dpi, setting a pixel value of the each pixel as white (0).
  • a threshold value calculating process in a portion of the pixel is performed using data of resolution of 600 dpi and the gradation number of 8 bits (multi-value image with high resolution).
  • the data of the resolution of 600 dpi and the gradation number of 8 bits are binarized into 0 and 1 using the calculated threshold value.
  • an average value of pixel values of pixels constituting the character object existing in the 8 pixels surrounding the scan-target pixel and an average value of pixel values of pixels not constituting the character object existing in the 8 pixels are found from the image of the resolution of 600 dpi and the gradation number of 8 bits.
  • the respective average values are added, which is divided by two to find a threshold value.
  • the pixel value is compared with the threshold value in the image of the resolution of 600 dpi and the gradation number of 8 bits.
  • the pixel in which the pixel value exceeds the threshold value is set as black (1) and the pixel in which the pixel value is below the threshold value is set as white (0).
  • the character image of the resolution of 600 dpi and the gradation number of one bit (that is, a binary image with high resolution) can be generated.
  • a threshold value is adaptively determined using the image of the resolution of 600 dpi and the gradation number of 8 bits to perform the binarization process. That is, since only a portion where a possibility that the character image exists is high is set as an object of the binarization process, a binary image showing a character contour with high resolution can be efficiently generated.
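The generating method described above can be sketched as follows. Several details are assumptions of this sketch rather than statements of the source: the averages are taken over the 3 × 3 low-resolution window (the 8 surrounding pixels plus the scan-target pixel), a block is processed whenever that window contains a character pixel, a window containing only character pixels yields an all-black block, and the high-resolution values are treated as "larger means darker" because the text sets pixels above the threshold to black. All names are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)


def block_values(high, ly, lx):
    """Four high-resolution values covered by low-resolution pixel (ly, lx)."""
    return [high[2 * ly + dy][2 * lx + dx] for dy in (0, 1) for dx in (0, 1)]


def generate_character_image(low_bin, high):
    """Build a high-resolution binary image from a low-resolution binary
    character image and a high-resolution multi-value image (twice the size).
    """
    rows, cols = len(low_bin), len(low_bin[0])
    # Every low-resolution pixel becomes a 2x2 block initialised to white (0).
    out = [[0] * (2 * cols) for _ in range(2 * rows)]
    for ly in range(rows):              # raster-order scan
        for lx in range(cols):
            char_vals, bg_vals = [], []
            for ny in range(max(0, ly - 1), min(rows, ly + 2)):
                for nx in range(max(0, lx - 1), min(cols, lx + 2)):
                    target = char_vals if low_bin[ny][nx] else bg_vals
                    target.extend(block_values(high, ny, nx))
            if not char_vals:
                continue                # no character nearby: stays white
            for dy in (0, 1):
                for dx in (0, 1):
                    y, x = 2 * ly + dy, 2 * lx + dx
                    if bg_vals:
                        # Threshold is the midpoint of the two averages.
                        threshold = (mean(char_vals) + mean(bg_vals)) / 2
                        out[y][x] = 1 if high[y][x] > threshold else 0
                    else:
                        out[y][x] = 1   # window is entirely character
    return out


if __name__ == "__main__":
    low = [[0, 1]]
    high = [[0, 120, 200, 200],
            [0, 0, 90, 200]]
    for row in generate_character_image(low, high):
        print(row)
```

In the example, the low-resolution image marks only the right pixel as character, yet the output marks one dark sub-pixel on the left as black and one light sub-pixel on the right as white. This is the kind of edge refinement that the high-resolution image contributes.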
  • the image 504 is formed as the image 506. It should be noted that there are cases where, as with an outline character on a colored background, the pixels surrounding the character are colored and the character itself is shown in white.
  • in such a case, whether the character is an outline character on a colored background is determined from the color of the surroundings and the color of the character, and the character image with the resolution of 600 dpi and the gradation number of one bit is generated based upon this determination.
  • It is possible to use a known method for determining an outline character on a colored background. For example, at the time of generating the image of the resolution of 600 dpi and the gradation number of one bit, the total number of black pixels is compared with the total number of white pixels, and whichever kind is more numerous is determined to be the background pixels. In this case, when the black pixels are determined to be the background pixels, white and black in the region may be reversed so that the white pixels form the background.
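The black/white counting rule for outline characters can be sketched as follows. The function name `normalize_polarity` is hypothetical, and a tie is treated here as "no reversal", which the text does not specify.

```python
def normalize_polarity(binary):
    """Return a copy of a binary region in which the background is white (0).

    The more numerous pixel value is taken to be the background. If black (1)
    pixels are the majority, the region is reversed.
    """
    black = sum(sum(row) for row in binary)
    total = sum(len(row) for row in binary)
    white = total - black
    if black > white:
        return [[1 - p for p in row] for row in binary]
    return [row[:] for row in binary]


if __name__ == "__main__":
    # White character pixel on a dark background: the region is reversed.
    print(normalize_polarity([[1, 1, 1], [1, 0, 1]]))
```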
  • the image with the reduced resolution is used to perform the region division, and in regard to the character region, pixels where a character exists are specified in the image with the low resolution and in regard to the specified pixels, the image with the high resolution is used to generate a character contour. Therefore, the pixels of the character are specified in a small memory and at a high speed, making it possible to extract the character contour with high accuracy. For example, in a case of performing the vectorization process by extracting the character contour using only the image having resolution of 300 dpi, the generated vector data are formed to be an image as shown in FIG. 19 (1) by rendering.
  • an image is read in.
  • the read image is already bit map image data.
  • the image processing depending on a scanner is performed to the bit map image data.
  • the image process depending on the scanner is, for example, a color process or a filtering process.
  • application data produced using application software in the PC 102 are converted into PDL by a print driver in the PC 102 , which are transmitted to the MFP 100 .
  • PDL is, for example, LIPS or Postscript.
  • the MFP 100 which has received the PDL generates a display list based upon the PDL using a built-in interpreter. Next, MFP 100 performs rendering to the display list to generate the bit map image data.
  • the MFP 100 performs the processes (processes subsequent to 413 in FIG. 4 ) subsequent to step S 303 ( FIG. 3 ) using the bit map image data generated by either one of the above two methods as a process object.
  • FIG. 18 shows diagrams, each showing an example of the format of the data vectorized in the function approximate process at step S 309 shown in FIG. 3.
  • an SVG (Scalable Vector Graphics)
  • the format is not limited thereto.
  • FIG. 18 for its explanation, the notation of the object is surrounded by a frame.
  • a frame 1801 an image attribute is shown, and object division region information (information showing a position and an object attribute) and bit map information (for example, JPEG data) in regard to a region of the image object (photo object) are shown.
  • information of a text object (information of an OCR result of a character object) is shown.
  • a configuration of the character object corresponding to the text object in the frame 1802 is shown as a vector object.
  • a graphic object line art such as a table object or a line drawing object
  • the process similar to the vector data converting process of the character object as explained in FIGS. 3 to 5 is performed, thereby making it possible to convert the graphic object into the vector object.
  • FIG. 14 shows diagrams, each showing the object division region information ( 405 in FIG. 4 ) obtained by the object dividing process (step S 307 in FIG. 3 and 404 in FIG. 4 ).
  • the MFP 100 divides an input image 1401 into rectangular blocks for each attribute.
  • examples of the attribute of the rectangular block include a character, a photo, and a graphic (drawing, line drawing, a table and the like).
  • the image data ( 403 ) stored in the RAM are binarized into white data and black data to extract a black-pixel bundle (connected black pixels) surrounded by a contour made of black pixels. Further, a size of the extracted black-pixel bundle is calculated. In a case where the size of the black-pixel bundle is more than a predetermined value, a white-pixel bundle inside the black-pixel bundle is extracted to find the contour. Further, a size of the extracted white-pixel bundle is calculated. In a case where the size of the white-pixel bundle is more than a predetermined value, a black-pixel bundle inside the white-pixel bundle is extracted to find the contour.
  • the size of the pixel bundle is an area of the pixel bundle, for example.
  • a rectangular block circumscribing the pixel bundle obtained in this way is found, and an attribute of the rectangular block is determined based upon the size and the shape of the rectangular block. For example, a rectangular block whose horizontal to vertical ratio is close to one and whose size is within a constant range (a range of sizes that can be regarded as a character) is determined as a character-corresponding block, which is highly likely to be a character object region rectangular block.
  • a new rectangular block is generated by putting these character-corresponding blocks together to determine the new rectangular block as a character object region rectangular block. That is, since the neighboring character-corresponding blocks are put together, the character object region rectangular block is a region of a unit of a character line or a character paragraph. Further, a flat black-pixel bundle such as a longitudinal bundle or a horizontal bundle is determined as a line drawing region rectangular block such as a separator. In a case where the black-pixel bundle is larger than a constant value and square-shaped white-pixel bundles are arranged regularly inside the black-pixel bundle, the block is determined as a table region rectangular block. The line region rectangular block and the table region rectangular block are treated as graphic region rectangular blocks.
  • a black-pixel bundle in an indeterminate form other than the above blocks is determined as a photo region rectangular block.
  • the attribute is categorized as a character, a photo, and a graphic, but the kind of the attribute is not limited to those and may be categorized in more detail.
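The attribute rules above can be condensed into a small decision function. This is a sketch under stated assumptions: every numeric threshold (`char_min`, `char_max`, `flat_ratio`, `table_min`, and the 0.5 to 2.0 aspect range) is hypothetical, since the text speaks only of a ratio "close to one", a "constant range" and a "constant value". The detection of regularly arranged white cells is passed in as a flag rather than computed, and the return values reuse the codes 1 (character), 2 (photo) and 3 (graphic) used for the block information.

```python
CHARACTER, PHOTO, GRAPHIC = 1, 2, 3


def classify_block(width, height, has_regular_cells=False,
                   char_min=8, char_max=64, flat_ratio=10.0, table_min=200):
    """Assign an attribute to a circumscribed rectangular block.

    width and height are positive pixel sizes. All thresholds are
    illustrative defaults, not values taken from the source.
    """
    ratio = width / height
    # Roughly square and character-sized: character-corresponding block.
    if 0.5 <= ratio <= 2.0 and char_min <= max(width, height) <= char_max:
        return CHARACTER
    # Very flat (long and thin) bundle: line drawing such as a separator.
    if ratio >= flat_ratio or ratio <= 1 / flat_ratio:
        return GRAPHIC
    # Large bundle containing regularly arranged white cells: table.
    if has_regular_cells and min(width, height) >= table_min:
        return GRAPHIC
    # Anything else is treated as an indeterminate form: photo.
    return PHOTO


if __name__ == "__main__":
    print(classify_block(20, 24))                            # character
    print(classify_block(600, 4))                            # separator line
    print(classify_block(400, 300, has_regular_cells=True))  # table
    print(classify_block(400, 300))                          # photo
```

Merging neighboring character-corresponding blocks into line or paragraph blocks is a separate step and is not shown here.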
  • the block information includes an attribute, a coordinate X, a coordinate Y, a width W, a height H and OCR information of each block.
  • Numerals of 1 to 3 are allotted to the attribute.
  • Numeral 1 shows a character object region rectangular block
  • numeral 2 shows a photo region rectangular block
  • numeral 3 shows a graphic region rectangular block.
  • the coordinate X and the coordinate Y show X and Y coordinates (coordinates of an upper left angle) at a starting point of each rectangular block in the image.
  • the width W and the height H show a width of the rectangular block in the X coordinate direction and a height of the rectangular block in the Y coordinate direction.
  • the OCR information shows presence/absence of pointer information in the input image.
  • the block information for each rectangular block is used for vectorization in a specific region.
  • the specific region is the character region. That is, the character object region is extracted, which is used in the character contour generating process 407 at the later stage. Further, a relative position relationship at the time of combining the specific region with the other region can be specified by the block information, and therefore, it is possible to combine the vectorization region with the raster data region without damaging a layout of the input image.
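The block information described above maps naturally onto a small record type. The following sketch assumes field names of its own (`attribute`, `x`, `y`, `width`, `height`, `has_ocr`); only the meaning of each field and the attribute codes 1 to 3 come from the text.

```python
from dataclasses import dataclass

CHARACTER, PHOTO, GRAPHIC = 1, 2, 3


@dataclass(frozen=True)
class BlockInfo:
    attribute: int   # 1 = character, 2 = photo, 3 = graphic
    x: int           # X coordinate of the upper left corner
    y: int           # Y coordinate of the upper left corner
    width: int       # extent in the X direction
    height: int      # extent in the Y direction
    has_ocr: bool    # presence/absence of OCR pointer information


def character_blocks(blocks):
    """Select the blocks handed on to the character contour generating process."""
    return [b for b in blocks if b.attribute == CHARACTER]


if __name__ == "__main__":
    page = [
        BlockInfo(CHARACTER, 40, 30, 500, 60, True),
        BlockInfo(PHOTO, 40, 120, 300, 200, False),
        BlockInfo(GRAPHIC, 360, 120, 180, 200, False),
    ]
    for block in character_blocks(page):
        print(block)
```

Because each record keeps its position and size, vectorized character regions can later be placed back among the raster regions without disturbing the layout.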
  • FIG. 16 is a diagram showing a document as an object of a document structure tree generating process.
  • rectangular blocks T 1 and T 2 are lined up laterally at the uppermost position of the document.
  • a lateral separator S 1 exists under the rectangular blocks T 1 and T 2 and rectangular blocks T 3 , T 4 , T 5 , T 6 , and T 7 exist under the lateral separator S 1 .
  • the rectangular blocks T 3 , T 4 , and T 5 are longitudinally lined up from upward to downward at a left half in a region under the lateral separator S 1 and the rectangular blocks T 6 and T 7 are lined up upward and downward at a right half in a region under the lateral separator S 1 .
  • FIG. 17 is a diagram illustrating the above processed result in a document structure tree.
  • an input image V 0 has groups H 1 and H 2 and the separator S 1 at the uppermost layer, and the rectangular blocks T 1 and T 2 in the second layer belong to the group H 1 .
  • Groups V 1 and V 2 at the second layer belong to the group H 2
  • the rectangular blocks T 3 , T 4 , and T 5 at the third layer belong to the group V 1
  • the rectangular blocks T 6 and T 7 at the third layer belong to the group V 2 .
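The document structure tree described for FIG. 17 can be written down as a nested structure. The node names come from the text; the tuple representation, the helper names and the order of the children of V 0 (H 1, then S 1, then H 2, following the top-to-bottom layout described for FIG. 16) are assumptions of this sketch.

```python
# Each node is (name, children); leaves have an empty child list.
TREE = ("V0", [
    ("H1", [("T1", []), ("T2", [])]),
    ("S1", []),
    ("H2", [
        ("V1", [("T3", []), ("T4", []), ("T5", [])]),
        ("V2", [("T6", []), ("T7", [])]),
    ]),
])


def leaves(node):
    """List the leaf blocks of the tree in depth-first order."""
    name, children = node
    if not children:
        return [name]
    result = []
    for child in children:
        result.extend(leaves(child))
    return result


def depth(node):
    """Number of layers below and including this node."""
    _, children = node
    return 1 + max((depth(child) for child in children), default=0)


if __name__ == "__main__":
    print(leaves(TREE))
    print(depth(TREE))
```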
  • a second embodiment differs from the first embodiment in the decompressing process performed on an input image as the pre-process to the character contour generating process, and shares the apparatus construction and the other processes with the first embodiment. Therefore, the decompressing process, which differs from that of the first embodiment, will be mainly explained.
  • FIG. 6 is a flow chart showing the process flow according to the second embodiment. Since the processes of steps S 601 to S 610 in FIG. 6 are similar to the processes of steps S 301 to S 310 in FIG. 3, respectively, only the process at step S 612 will be explained in detail in the following.
  • the MFP 100 performs a partial decompressing process for partially decompressing each of the several character object regions in the page using the object division region information obtained by the object dividing process at step S 607 .
  • the processes at steps S 608 and S 609 are performed in sequence each time a partially decompressed image is obtained. Performing the decompression one object at a time makes it possible to carry out the decompressing process and the vectorization process with a small memory capacity.
  • FIG. 7 is a flow chart showing the process flow in the character vectorization process using the object division region information according to the second embodiment.
  • the processes performed in FIG. 7 are the processes corresponding to steps S 612 , S 608 , and S 609 .
  • the MFP 100 obtains the object division region information and obtains one character object region from the obtained object division region information.
  • at step S 703 the MFP 100 partially decompresses an input image corresponding to the one character object region.
  • the object division region information is the information with resolution of 300 dpi
  • the coordinates that result from converting the resolution of 300 dpi in the information into the resolution of 600 dpi are calculated.
  • the minimum unit of the compression is, for example, 8 pixels × 8 pixels in a case of JPEG
  • the MFP 100 is adapted for decompressing a region of the compressed image corresponding to the obtained character object region.
  • the method of the partial decompression differs depending on a compressing method to be used. However, it is possible to realize the partial decompression by using a known compression technology and a known partial decompressing method. In this way, it is possible to obtain the high-resolution image with resolution of 600 dpi corresponding to the one character object region.
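The coordinate conversion and the alignment to the compression unit can be sketched as follows. This is an illustration under stated assumptions, not the method of the embodiment: the function names are hypothetical, boxes are assumed to be `(left, top, right, bottom)` in pixels with exclusive right and bottom edges, and the 8-pixel unit is taken from the JPEG example in the text (the `block` parameter can be changed where the compression unit is larger).

```python
def ceil_div(a: int, b: int) -> int:
    """Integer division rounded up, for non-negative a and positive b."""
    return -(-a // b)


def region_to_decode(box_300dpi, src_dpi=300, dst_dpi=600, block=8):
    """Map a 300 dpi box to the block-aligned 600 dpi region that covers it."""
    left, top, right, bottom = box_300dpi

    # Scale to the target resolution: round the start down and the
    # exclusive end up so that the scaled box never loses pixels.
    x0 = left * dst_dpi // src_dpi
    y0 = top * dst_dpi // src_dpi
    x1 = ceil_div(right * dst_dpi, src_dpi)
    y1 = ceil_div(bottom * dst_dpi, src_dpi)

    # Snap outward to the compression-unit grid, since only whole
    # units can be decompressed.
    x0 -= x0 % block
    y0 -= y0 % block
    x1 = ceil_div(x1, block) * block
    y1 = ceil_div(y1, block) * block
    return (x0, y0, x1, y1)


# A character object region reported at 300 dpi (hypothetical values).
print(region_to_decode((10, 21, 53, 30)))  # (16, 40, 112, 64)
```

In the example, the box doubles to (20, 42, 106, 60) at 600 dpi and is then widened to the nearest multiples of 8 on every side.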
  • the MFP 100 performs the character contour generating process.
  • the character contour generating process is the same as the process explained in the first embodiment.
  • the MFP 100 performs a function approximate process to the character contour generated at step S 704 to generate path-processed vector data.
  • the MFP 100 determines whether or not the above process is already performed in all the character object regions. When the above process is completed in all the character object regions, the vectorization process is completed.
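The per-region loop of FIG. 7 can be sketched as below. The three callables are hypothetical placeholders for the partial decompression, the character contour generating process, and the function approximate process; the source does not define their interfaces, so this only illustrates the control flow in which one region at a time is held in memory.

```python
def vectorize_character_regions(regions, decompress_region,
                                generate_contour, approximate):
    """Process character object regions one at a time and collect vector data."""
    vectors = []
    for region in regions:
        # Partially decompress only the image data for this region.
        image = decompress_region(region)
        # Generate the character contour from the high-resolution image.
        contour = generate_contour(image)
        # Approximate the contour with functions to obtain path data.
        vectors.append(approximate(contour))
        # Drop the decompressed pixels before handling the next region.
        del image
    return vectors


# Usage with stand-in functions (purely illustrative).
result = vectorize_character_regions(
    ["region-1", "region-2"],
    decompress_region=lambda r: r + ":pixels",
    generate_contour=lambda img: img + ":contour",
    approximate=lambda c: c + ":path",
)
print(result)
```

The loop ends when every region has been handled, which corresponds to the completion check described above.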
  • FIG. 8 is a set of diagrams explaining a case where the object division region information with resolution of 300 dpi is converted into coordinates of positions corresponding to an image with resolution of 600 dpi to perform a partial decompressing process (step S 812 ).
  • FIG. 8A is a diagram showing a compressed image 801 with resolution of 600 dpi.
  • FIG. 8B is a diagram showing the object division region information as an image 802 .
  • the TEXTs 808 to 812 in the image 802 each express a character object region.
  • coordinates of the compressed image 801 corresponding to the coordinates of the character object region 808 are found in order to specify the region of the compressed image containing those coordinates, and the compressed image in that region is decompressed.
  • numeral 813 (ABCD) in FIG. 8C shows the result of the partial decompressing process to the region 803 of the compressed image (600 dpi) corresponding to the character object region 808 in FIG. 8B .
  • a third embodiment differs from the second embodiment in the decompressing process performed on an input image as the pre-process to the character contour generating process, and shares the apparatus construction and the other processes with the second embodiment.
  • the partial decompression is performed for each character region (each character line or each paragraph), but in the third embodiment, the partial decompression is performed for each one character in the character region. Therefore, the decompressing process which is different from the second embodiment will be explained.
  • the second embodiment performs the partial decompression to the one character object region.
  • the one character object region (paragraph or character line)
  • the memory capacity necessary for the process may become insufficient, but the third embodiment can cope with such a case.
  • FIG. 9 is a flow chart showing the process flow of the character vectorization process using object division region information according to the third embodiment.
  • the MFP 100 obtains object division region information and one-character information for cutting out one character from one character object region.
  • Information for cutting out the one character from the one character object region, that is, coordinates of the one character can be obtained from the object division region information.
  • the MFP 100 partially decompresses an input image corresponding to the coordinates.
  • the object division region information is the information with resolution of 300 dpi
  • the coordinates that result from converting the resolution of 300 dpi into the resolution of 600 dpi are calculated.
  • the minimum unit of the compression is, for example, 8 pixels × 8 pixels in a case of JPEG
  • the MFP 100 is adapted for decompressing a region of the compressed image in which the calculated character object region is contained.
  • the method of the partial decompression differs depending on a compressing method to be used. However, it is possible to realize the partial decompression by using a known compression technology and a known partial decompressing method. In this way, it is possible to obtain the image with resolution of 600 dpi corresponding to one character.
  • the character element pixel and the image with 600 dpi can be obtained from the object division.
  • the MFP 100 performs the character contour generating process.
  • the character contour generating process is the same as the process explained in the first embodiment.
  • the MFP 100 performs a function approximate process to the character contour generated at step S 904 to generate path-processed vector data.
  • the MFP 100 determines whether or not the above process is already performed in all the character objects.
  • FIG. 10 is a set of diagrams explaining the utility of partially performing the decompressing process for each character (one-character decompression).
  • Numeral 1001 shows an image JPEG-compressed in a compressing unit of 8 pixels × 8 pixels.
  • Numeral 1002 shows a compressing unit of JPEG compression.
  • a frame surrounding each character shows character-cutting information 1003 .
  • the character-cutting information 1003 shows a character determined as one character in the object dividing process. Since the decompression is performed in a compressing unit of JPEG, when the character “A” is to be decompressed, a part of the character “B” adjacent to the right side of “A” is also decompressed, as shown by numeral 1004 .
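The effect shown by numeral 1004 follows from the block grid: a character box that does not end on a block boundary pulls in the whole block it touches, including pixels that belong to the neighbouring character. A minimal sketch, assuming hypothetical character boxes given as `(left, top, right, bottom)` with exclusive right and bottom edges and an 8-pixel compression unit:

```python
def covering_blocks(box, block=8):
    """Return the set of (row, column) block indices that a box touches."""
    left, top, right, bottom = box
    first_col, last_col = left // block, -(-right // block)
    first_row, last_row = top // block, -(-bottom // block)
    return {(row, col)
            for row in range(first_row, last_row)
            for col in range(first_col, last_col)}


# Hypothetical boxes for two adjacent characters on one line.
box_a = (0, 0, 12, 16)    # "A" ends inside block column 1
box_b = (13, 0, 25, 16)   # "B" starts inside the same block column

shared = covering_blocks(box_a) & covering_blocks(box_b)
print(sorted(shared))  # [(0, 1), (1, 1)]
```

Decompressing "A" here requires block column 1, which spans pixel columns 8 to 15 and therefore also contains the first three pixel columns of "B".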
  • a fourth embodiment differs from the third embodiment in the decompressing process performed on an input image as the pre-process to the character contour generating process, and shares the apparatus construction and the other processes with the third embodiment. Therefore, the decompressing process, which differs from that of the third embodiment, will be explained.
  • the third embodiment performs the partial decompressing process to the one character.
  • the processing time may increase, but the fourth embodiment can cope with such a case.
  • FIG. 11 is a flow chart showing the process flow of the character vectorization process using object division region information according to the fourth embodiment.
  • the MFP 100 obtains object division region information and information for cutting out one character from one character object region (region of a character line or a paragraph). Further, the MFP 100 calculates the size of the character object region from the object division region information to calculate a memory capacity necessary for the region process (memory capacity calculation).
  • the MFP 100 compares a usable memory capacity set in advance with the memory capacity necessary for the region process calculated at step S 1102. As a result of the comparison, in a case where the memory capacity necessary for the region process is smaller than the given usable memory capacity, it is determined that the region process is possible, and the process goes to step S 1104. Otherwise, the process goes to step S 1107.
  • the MFP 100 performs the decompressing process to one character object region (region of a character line or a paragraph).
  • the MFP 100 performs a character contour generating process (first character contour generating process).
  • the MFP 100 performs the decompressing process to the characters in the character object region one by one (one-character decompression).
  • the one-character decompression is as described above.
  • the MFP 100 performs a character contour generating process (second character contour generating process).
  • the MFP 100 performs a function approximate process to the character contour generated at step S 1105 or the character contour generated at step S 1108 to generate path-processed vector data.
  • the MFP 100 determines whether or not the above process is already performed in all the character objects.
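The switching between region decompression and one-character decompression in FIG. 11 can be sketched as a single comparison. The formula for the required memory (width × height × bytes per pixel) and the value of 3 bytes per pixel are assumptions made for illustration; the source only states that the required capacity is calculated from the size of the character object region.

```python
def required_bytes(box, bytes_per_pixel=3):
    """Estimate the memory needed to hold a decompressed box (assumed formula)."""
    left, top, right, bottom = box
    return (right - left) * (bottom - top) * bytes_per_pixel


def choose_decompression(region_box, usable_bytes, bytes_per_pixel=3):
    """Pick the region path or the one-character path by required memory."""
    if required_bytes(region_box, bytes_per_pixel) < usable_bytes:
        # The whole character object region fits: decompress it at once.
        return "region"
    # Otherwise fall back to decompressing the characters one by one.
    return "per_character"


line_box = (0, 0, 1000, 100)  # hypothetical character line at 600 dpi
print(choose_decompression(line_box, usable_bytes=400_000))  # region
print(choose_decompression(line_box, usable_bytes=300_000))  # per_character
```

The second call falls back to one-character decompression because the text requires the needed capacity to be smaller than the usable capacity, and here the two are equal.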
  • a fifth embodiment differs from the fourth embodiment in the decompressing process performed on an input image as the pre-process to the character contour generating process, and shares the apparatus construction and the other processes with the fourth embodiment. Therefore, the decompressing process, which differs from that of the fourth embodiment, will be explained.
  • the fourth embodiment performs the partial decompressing process to one character.
  • a method of performing the decompressing process to the character object region as it is and a method of performing the decompressing process to the characters in the character object region one character by one character are switched depending on a size of the one character object region (that is, a required memory capacity).
  • FIG. 12 is a flow chart showing the process flow of the character vectorization process using object division region information according to the fifth embodiment.
  • the MFP 100 obtains object division region information and information for cutting out one character from one character object region. Further, the MFP 100 calculates a size of the character object region from the object division region information to calculate a memory capacity necessary for the region process (memory capacity calculation).
  • the MFP 100 compares a usable memory capacity set in advance with the memory capacity necessary for the region process calculated at step S 1202. As a result of the comparison, in a case where the memory capacity necessary for the region process is smaller than the given usable memory capacity, it is determined that the region process is possible, and the process goes to step S 1204. Otherwise, the process goes to step S 1207.
  • at step S 1204 the MFP 100 performs the decompressing process to the one character object region.
  • the MFP 100 performs a character contour generating process (first character contour generating process).
  • the MFP 100 performs a function approximate process to the character contour generated at step S 1205 to generate path-processed vector data.
  • the MFP 100 divides the one character object region into region process-possible regions.
  • a size of the region process-possible region is calculated from a given usable memory capacity, and the division of the region is made based upon the calculated size of the region.
  • the division is designed to be made to contain plural characters.
  • at step S 1208 the MFP 100 partially decompresses the region divided at step S 1207 .
  • the MFP 100 performs a character contour generating process (second character contour generating process).
  • the MFP 100 performs a function approximate process to the character contour generated at step S 1209 to generate path-processed vector data.
  • the MFP 100 determines whether or not the above process is already performed in all the divided regions.
  • the MFP 100 determines whether or not the above process is performed to the character object region.
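The division into region process-possible regions can be sketched as a greedy grouping of consecutive characters. This is one possible reading under stated assumptions, not the division rule of the embodiment: the function names are hypothetical, character boxes are assumed to be given in reading order as `(left, top, right, bottom)`, and the required memory is assumed to be the bounding-box area times the bytes per pixel.

```python
def bounding_box(boxes):
    """Smallest box that contains every box in the list."""
    return (min(b[0] for b in boxes), min(b[1] for b in boxes),
            max(b[2] for b in boxes), max(b[3] for b in boxes))


def box_bytes(box, bytes_per_pixel=1):
    left, top, right, bottom = box
    return (right - left) * (bottom - top) * bytes_per_pixel


def split_into_processable_groups(char_boxes, usable_bytes, bytes_per_pixel=1):
    """Group consecutive characters so that each group fits in memory."""
    groups, current = [], []
    for box in char_boxes:
        candidate = current + [box]
        fits = box_bytes(bounding_box(candidate), bytes_per_pixel) < usable_bytes
        if current and not fits:
            # Adding this character would exceed the budget: close the group.
            groups.append(current)
            current = [box]
        else:
            # Note: a single character larger than the budget is still
            # emitted as its own group; handling that case is left open.
            current = candidate
    if current:
        groups.append(current)
    return groups


# Four hypothetical 10 x 10 characters on one line, with a 250-byte budget.
chars = [(0, 0, 10, 10), (10, 0, 20, 10), (20, 0, 30, 10), (30, 0, 40, 10)]
for group in split_into_processable_groups(chars, usable_bytes=250):
    print(bounding_box(group))
```

With these values each divided region holds two characters, which matches the intent that a divided region contain plural characters while staying within the usable memory capacity.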
  • FIG. 19 shows an example of an improvement in image quality.
  • aspects of the present invention can also be realized by a computer of a system or apparatus (or devices such as a CPU or MPU) that reads out and executes a program recorded on a memory device to perform the functions of the above-described embodiment(s), and by a method, the steps of which are performed by a computer of a system or apparatus by, for example, reading out and executing a program recorded on a memory device to perform the functions of the above-described embodiment(s).
  • the program is provided to the computer for example via a network or from a recording medium of various types serving as the memory device (e.g., computer-readable medium).

US12/554,766 2008-09-18 2009-09-04 Image processing apparatus, image processing method and program Abandoned US20100067023A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/296,290 US10477063B2 (en) 2008-09-18 2014-06-04 Character detection and binarization

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2008-239804 2008-09-18
JP2008239804A JP5137759B2 (ja) 2008-09-18 2008-09-18 Image processing apparatus

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US14/296,290 Continuation US10477063B2 (en) 2008-09-18 2014-06-04 Character detection and binarization

Publications (1)

Publication Number Publication Date
US20100067023A1 true US20100067023A1 (en) 2010-03-18

Family

ID=42006946

Family Applications (2)

Application Number Title Priority Date Filing Date
US12/554,766 Abandoned US20100067023A1 (en) 2008-09-18 2009-09-04 Image processing apparatus, image processing method and program
US14/296,290 Active US10477063B2 (en) 2008-09-18 2014-06-04 Character detection and binarization

Country Status (2)

Country Link
US (2) US20100067023A1 (ja)
JP (1) JP5137759B2 (ja)

Also Published As

Publication number Publication date
JP2010074540A (ja) 2010-04-02
US10477063B2 (en) 2019-11-12
JP5137759B2 (ja) 2013-02-06
US20140285824A1 (en) 2014-09-25

Legal Events

Date Code Title Description
AS Assignment

Owner name: CANON KABUSHIKI KAISHA,JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ITO, NAOKI;REEL/FRAME:023685/0730

Effective date: 20090826

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION