WO2011002524A1 - Non-product image identification - Google Patents
Non-product image identification Download PDFInfo
- Publication number
- WO2011002524A1 WO2011002524A1 PCT/US2010/001898 US2010001898W WO2011002524A1 WO 2011002524 A1 WO2011002524 A1 WO 2011002524A1 US 2010001898 W US2010001898 W US 2010001898W WO 2011002524 A1 WO2011002524 A1 WO 2011002524A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- source image
- color
- product
- source
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/103—Static body considered as a whole, e.g. static pedestrian or occupant recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/56—Extraction of image or video features relating to colour
Definitions
- the present application relates to the field of image processing and in particular to image recognition.
- a real photograph of a product should ideally be used as a picture of the product because the real photograph shows the appearance and property of the product intuitively.
- Some vendors often upload irrelevant pictures such as an advertisement for the vendor, a sale record, a picture that includes a user manual, etc., which do not accurately reflect the properties of the product itself.
- Such pictures hinder the efforts of data mining by the website owners and prevent effective categorization of products based on their appearances.
- some website owners may wish to classify images unrelated to the actual products differently from images depicting products. Techniques for identifying and distinguishing different types of images are needed. BRIEF DESCRIPTION OF THE DRAWINGS
- FIGS. IA and IB illustrate examples of non-product images.
- FIG. 2 is a flowchart illustrating an embodiment of an image recognition process.
- FIG. 3 is a flowchart illustrating an embodiment of a process for determining whether a source image is a non-product image.
- FIG. 4 is a diagram illustrating an example of a source image divided into blocks for determining whether the image is a non-product image.
- FIG. 5 is a flowchart illustrating an embodiment of a process for determining whether a source image is non-product based on its image blocks.
- FIG. 6 is a block diagram illustrating an embodiment of a system configured to recognize non-product images.
- FIG. 7 is a block diagram illustrating another embodiment of a system configured to recognize non-product images.
- the invention can be implemented in numerous ways, including as a process; an apparatus; a system; a composition of matter; a computer program product embodied on a computer readable storage medium; and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor.
- these implementations, or any other form that the invention may take, may be referred to as techniques.
- the order of the steps of disclosed processes may be altered within the scope of the invention.
- a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task.
- the term 'processor' refers to one or more devices, circuits, and/or processing cores configured to process data, such as computer program instructions.
- FIGS. IA and IB illustrate examples of non-product images found on an electronic commerce website.
- FIG. IA illustrates an image displaying a store policy and
- FIG. IB illustrates a sizing chart image for measuring clothes. These pictures do not depict the product for sale and therefore are considered as non-product images if included along with information for the product. In some existing systems, non-product images are recognized manually and removed.
- an automated image analysis technique for identifying non-product images is disclosed.
- one or more color features of the image such as a main color of the image or main colors of sub-blocks of the image, are obtained and used to determine whether the image is a non-product image.
- one or more rules are applied to the color features to make the determination.
- FIG. 2 is a flowchart illustrating an embodiment of an image recognition process.
- Process 200 may be performed on a system such as system 600 of FIG. 6.
- the process can be described in a general context of computer executable instructions, e.g., a program module, executed by a computer.
- the program module includes a routine, program, object, component, data structure, etc., which executes a specific task or embodies a specific abstract type of data.
- the process can also be implemented in a distributed computing environment in which a task is executed by a remote processing device connected via a communication network.
- the program module can be located in a local or remote computer storage medium including a storage device.
- color characteristic values of pixels in a source image are obtained. It is assumed that the source image is a color image rather than a binary or gray scale image. In some embodiments, the color characteristic values are obtained by reading from the source image file. In some embodiments, the color characteristic values are received from a sender such as an image server used to store the source image file. In some embodiments, the source image is optionally converted to be recognized into a uniform format and scaled into a uniform size before the color characteristic values are acquired.
- a color model is a mathematical model describing the way colors can be represented as tuples of numbers, typically as three or four components.
- RGB and CMYK are examples of commonly used color models. Adding a mapping function between the color model and a certain reference color space results in a definite "footprint" within the reference color space.
- the three-dimensional coordinate axes correspond to three independent color parameters so that each color has a corresponding space position and vice versa.
- a point in the space corresponds to a specific color. For instance, when colors are displayed on a computer monitor, they are usually defined in the RGB (red, green, blue) color space.
- RGB-based color model is discussed extensively in the following discussion for purposes of illustration. Other color models can be used as well. For example, colors black and white are typically represented using 0 and 1, respectively, in a binary image comprising black and white pixels only; and the color characteristic value of a pixel in a grey scale image is presented with a grey scale value. Values of the black and white colors and gray scale values can be regarded as special cases of a RGB value with a conversion relationship with the RGB value. Moreover, for a color image, its color can be represented variously as derived from RGB, e.g., a value of YCrCb (Brightness/Hue/Saturation), etc.
- a main color of the source image is determined based at least in part on the color characteristic values. In some embodiments, for each color characteristic value, the number of pixels corresponding to the color characteristic value is counted. The results are compared to determine the main color of the source image.
- a main color refers to a color with the highest frequency of occurring in the image. In other words, it is the color that corresponds to the color characteristic value that has the highest pixel count.
- the source image is a non-product image based at least in part on the magnitude of the color characteristic value corresponding to the main color of the source image.
- multiple rules are applied to make the determination. Examples of the rules are described in greater detail below.
- the non-product image may be handled in various ways.
- the non-product image may be automatically deleted from the product information with which the image is associated, or a warning may be sent to the user who uploaded the non-product image.
- confirmation by a human such as a system administrator or by an additional image processing system is needed before the non-product image is deleted.
- non-product images are classified into a different image category than product images depicting the product.
- FIG. 3 is a flowchart illustrating an embodiment of a process for determining whether a source image is a non-product image.
- Process 300 may be used to implement step 203 of FIG. 2.
- the process applies several rules to make determination of whether a source image is a non-product image.
- a general rule is applied for making a preliminary determination based on the determined main color of the source image.
- a color source image with a main color other than black or white can be considered as a product image, or conversely, a color source image with a main color of black or white will have a great probability of being a non-product image.
- a color is deemed to be black if it has a color characteristic value that is below a threshold A, and a color is deemed to be white if its color characteristic value that is above a threshold B.
- a set of colors with a color characteristic value between A and B are referred collectively to as "other colors”.
- a and B take on specific values depending upon how to quantize a color space may vary depending on implementation.
- the source image is deemed to be a product image. Otherwise, additional rules are applied. Specifically, at 306, the source image is divided into several image blocks and main colors of the respective image blocks are determined according to the number of pixels corresponding to respective color characteristic values in the respective image blocks.
- whether the source image is a non-product image is determined based at least in part on the main colors of the respective image blocks.
- the determination process of a main color of an image block is similar to that for determining the main color of the source image as a whole.
- determination can further be performed according to the number of image blocks with a specific main color and/or the relative position of an image block with a specific main color in the source image.
- the source image is RGB based and there are 8 different levels of intensity for each of the three primary colors of R (Red), G (Green) and B (Blue).
- the color characteristic value ranges from RGBOOO to RGB777 and the entire color space is quantized into 512 colors.
- RGB235 represents an intensity value 2 for red color, an intensity value 3 for green color, and an intensity value 5 for blue color.
- RGBOOO represents pure black
- RGBOO 1 , RGBO 10, RGBO 11 , RGB 100, RGB 101 and RGB 110 represent dark black
- RGB666, RGB667, RGB676 and RGB766 represent dark white; more particularly, RGB 666 also represents grey white;
- RGB777, RGB677, RGB767 and RGB776 represent bright white
- the present embodiment introduces the concept of a valid color for more reasonable and accurate recognition. If the ratio of the number of pixels with non-gray color to the number of all the pixels is above a threshold, the color can be regarded as a valid color.
- Valid colors are primarily used for measuring the degree to which an image is colorful.
- gray color is defined as a color with equal values of R, G, and B.
- the pure black (RGBOOO), grey white (RGB666) and pure white (RGB777) defined as above also fall into the category of grey color.
- a color with equal values of R, G, and B is typically a dark color and therefore the present embodiment also takes into account such a category of color.
- the threshold is set for the purpose of neglecting some colors seldom occurring in the image and experimental data demonstrates that the threshold can be set as approximately 5/1000. Other threshold values may be used.
- FIG. 4 is a diagram illustrating an example of a source image divided into blocks for determining whether the image is a non-product image.
- the image is divided into 9 image blocks according to a 3x3 grid layout. This division ensures that colors on the sides, corners, and the center of the source image are evaluated separately and avoids overly complicated determination rule to improve recognition speed. Other divisions are possible in other embodiments.
- FIG. 5 is a flowchart illustrating an embodiment of a process for determining whether a source image is a non-product image based on its image blocks.
- Process 500 may be used to implement 308 of FIG. 3.
- the main color of the source image is determined to be pure black, it is possibly unrelated to the product and it is further determined whether the main colors of all nine image blocks are black, at 506. If so, the image is deemed to be a non-product image at 508; otherwise, it is deemed to be a product image at 510.
- the main color of the source image is determined to be pure white
- block 4 is bright white, and at least one of the blocks 1, 4 and 7 is pure white. If so, the image is deemed to be a non-product image at 510; otherwise, it is a product image at 508.
- Conditions a-d include: (a) the number of bright white blocks is 9, the number of pure white blocks is 3, and block 4 is not pure white; (b) the number of pure white blocks is more than 7, block 4 is pure white, and blocks 1, 3, 5 and 7 are not grey white; (c) the number of valid colors is 1, the number of pure white blocks is 3, and blocks 1 and 7 are grey white; (d) blocks 1, 3, 4 and 5 are all grey white, and the block 0 is not pure white.
- FIG. IA illustrates an image with a main color of pure white and fewer than five valid colors.
- the color characteristic values of respective image blocks are illustrated below:
- FIG. IB illustrates an image with a main color of pure white and fewer than five valid colors.
- the color characteristic values of respective image blocks are illustrated below:
- process 500 it is determined at 514 that the number of white blocks is 9. It is further determined at 518 whether the number of grey white blocks is 0, the number of pure white blocks is greater than 6 and the number of bright white blocks is greater than 7. In this case the answer is yes, thus it is determined at 520 whether block 4 is bright white and at least one of blocks 1, 4, and 7 is pure white. These conditions are also true, thus the image is deemed to be non-product.
- the foregoing solution applies a digital image analysis technique to firstly extract a color feature of the image and then determine whether the image is a non-product image in combination with a predetermined determination rule to automatically distinguish a non-product image from a product image among pictures of a product so that the system can process them differently.
- a digital image analysis technique to firstly extract a color feature of the image and then determine whether the image is a non-product image in combination with a predetermined determination rule to automatically distinguish a non-product image from a product image among pictures of a product so that the system can process them differently.
- the determination rule described in the present embodiment is merely a specific rule derived from real data, those skilled in the art can define various determination rules according to different application demands and the application will not be limited in this respect.
- the technical solution according to the application can be applied to the stage of uploading a picture from the user and upon detection of the picture uploaded from the user being a non-product image, the system can reject to accept the picture or feed a message back to the user uploading the non-product image and prompt him to re-upload it to thereby ensure validity of the picture in the system.
- the foregoing solution can also be applied prior to data mining to reduce influence upon the data mining by precluding a non-product image. Also the recognized non-product image can be cleared to save a storage space of the system.
- FIG. 6 is a block diagram illustrating an embodiment of a system configured to recognize non-product images.
- System 600 may be implemented using one or more computing devices such as a personal computer, a server computer, a handheld or portable device, a flat panel device, a multi-processor system, a microprocessor based system, a set- top box, a programmable consumer electronic device, a network PC, a minicomputer, a large- scale computer, a special purpose device, a distributed computing environment including any of the foregoing systems or devices, or other hardware/software/firmware combination that includes one or more processors, and memory coupled to the processors and configured to provide the processors with instructions.
- computing devices such as a personal computer, a server computer, a handheld or portable device, a flat panel device, a multi-processor system, a microprocessor based system, a set- top box, a programmable consumer electronic device, a network PC, a minicomputer, a large- scale computer, a special purpose device,
- system 600 includes a feature value acquisition unit
- a source image main color determination unit 820 adapted to determine a main color of the source image.
- the main color determination unit is configured to count the number of the pixels corresponding to the respective color characteristic values and make the determination based on the counting result.
- a first recognition unit 830 adapted to recognize whether the source image is a non-product image based on the color characteristic value of main color of the source image.
- FIG. 7 is a block diagram illustrating another embodiment of a system configured to recognize non-product images. Similar to system 600, system 700 also includes feature value acquisition unit 810, main color determination unit 820, and first recognition unit 830. It further includes an image conversion unit 800 adapted to convert the source image into a preset format before the characteristic value acquisition unit 810 acquires the characteristic values and an image block processing unit 840 adapted to divide the source image into several image blocks and to determine main colors of the respective image blocks. In addition it includes a second recognition unit 850 adapted to recognize whether the source image is a non-product image according to the main colors of the respective image blocks.
- the units described above can be implemented as software components executing on one or more general purpose processors, as hardware such as programmable logic devices and/or Application Specific Integrated Circuits designed to perform certain functions or a combination thereof.
- the units can be embodied by a form of software products which can be stored in a nonvolatile storage medium (such as optical disk, flash storage device, mobile hard disk, etc.), including a number of instructions for making a computer device (such as personal computers, servers, network equipments, etc.) implement the methods described in the embodiments of the present invention.
- the units may be implemented on a single device or distributed across multiple devices. The functions of the units may be merged into one another or further split into multiple sub-units.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
- Facsimile Image Signal Circuits (AREA)
- Color Image Communication Systems (AREA)
Abstract
An image recognition technique includes obtaining color characteristic values of pixels in a source image; determining a main color of the source image based at least in part on the color characteristic values; and determining whether the source image is a non- product image based at least in part on the main color of the source image.
Description
NON-PRODUCT IMAGE IDENTIFICATION CROSS REFERENCE TO OTHER APPLICATIONS
[0001] This application claims priority to People's Republic of China Patent
Application No. 200910149552.8 entitled IMAGE RECOGNITION METHOD AND DEVICE filed July 2, 2009 which is incorporated herein by reference for all purposes.
FIELD OF THE INVENTION
[0002] The present application relates to the field of image processing and in particular to image recognition.
BACKGROUND OF THE INVENTION
[0003] The development of multimedia technologies has enriched information presentation in software applications. Many image processing techniques have been developed for computer applications and especially for the Internet.
[0004] Taking an electronic commerce application as an example, existing electronic commerce systems generally support the function of attaching a picture to a product. A vendor publishing his product offerings over the network can upload pictures of the product in addition to textual descriptions. An image can exhibit a product more intuitively than mere text and, in most cases, the picture of a product can also be an important criterion based upon which a buyer confirms authenticity of information on the product.
[0005] For a majority of categories of products, a real photograph of a product should ideally be used as a picture of the product because the real photograph shows the appearance and property of the product intuitively. Some vendors, however, often upload irrelevant pictures such as an advertisement for the vendor, a sale record, a picture that includes a user manual, etc., which do not accurately reflect the properties of the product itself. Furthermore, such pictures hinder the efforts of data mining by the website owners and prevent effective categorization of products based on their appearances. Moreover, some website owners may wish to classify images unrelated to the actual products differently from images depicting products. Techniques for identifying and distinguishing different types of images are needed.
BRIEF DESCRIPTION OF THE DRAWINGS
[0006] Various embodiments of the invention are disclosed in the following detailed description and the accompanying drawings.
[0007] FIGS. IA and IB illustrate examples of non-product images.
[0008] FIG. 2 is a flowchart illustrating an embodiment of an image recognition process.
[0009] FIG. 3 is a flowchart illustrating an embodiment of a process for determining whether a source image is a non-product image.
[0010] FIG. 4 is a diagram illustrating an example of a source image divided into blocks for determining whether the image is a non-product image.
[0011] FIG. 5 is a flowchart illustrating an embodiment of a process for determining whether a source image is non-product based on its image blocks.
[0012] FIG. 6 is a block diagram illustrating an embodiment of a system configured to recognize non-product images.
[0013] FIG. 7 is a block diagram illustrating another embodiment of a system configured to recognize non-product images.
DETAILED DESCRIPTION [0014] The invention can be implemented in numerous ways, including as a process; an apparatus; a system; a composition of matter; a computer program product embodied on a computer readable storage medium; and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention. Unless stated otherwise, a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task. As used herein, the term
'processor' refers to one or more devices, circuits, and/or processing cores configured to process data, such as computer program instructions.
[0015] A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.
[0016] FIGS. IA and IB illustrate examples of non-product images found on an electronic commerce website. FIG. IA illustrates an image displaying a store policy and FIG. IB illustrates a sizing chart image for measuring clothes. These pictures do not depict the product for sale and therefore are considered as non-product images if included along with information for the product. In some existing systems, non-product images are recognized manually and removed.
[0017] In the following specification, an automated image analysis technique for identifying non-product images is disclosed. In some embodiments, one or more color features of the image, such as a main color of the image or main colors of sub-blocks of the image, are obtained and used to determine whether the image is a non-product image. In some embodiments, one or more rules are applied to the color features to make the determination.
[0018] FIG. 2 is a flowchart illustrating an embodiment of an image recognition process. Process 200 may be performed on a system such as system 600 of FIG. 6. The process can be described in a general context of computer executable instructions, e.g., a program module, executed by a computer. Generally, the program module includes a routine, program, object, component, data structure, etc., which executes a specific task or embodies a specific abstract type of data. The process can also be implemented in a distributed
computing environment in which a task is executed by a remote processing device connected via a communication network. In the distributed computing environment, the program module can be located in a local or remote computer storage medium including a storage device.
[0019] At 201, color characteristic values of pixels in a source image are obtained. It is assumed that the source image is a color image rather than a binary or gray scale image. In some embodiments, the color characteristic values are obtained by reading from the source image file. In some embodiments, the color characteristic values are received from a sender such as an image server used to store the source image file. In some embodiments, the source image is optionally converted to be recognized into a uniform format and scaled into a uniform size before the color characteristic values are acquired.
[0020] As used herein, a color model is a mathematical model describing the way colors can be represented as tuples of numbers, typically as three or four components. RGB and CMYK are examples of commonly used color models. Adding a mapping function between the color model and a certain reference color space results in a definite "footprint" within the reference color space. In a three-dimensional color space, the three-dimensional coordinate axes correspond to three independent color parameters so that each color has a corresponding space position and vice versa. A point in the space corresponds to a specific color. For instance, when colors are displayed on a computer monitor, they are usually defined in the RGB (red, green, blue) color space. This is another way of making nearly the same colors (limited by the reproduction medium, such as the phosphor (CRT) or filters and backlight (LCD)), and red, green, blue can be considered as the X, Y, and Z axes. Another way of making the same colors is to use their Hue (X axis), their Saturation (Y axis), and their brightness Value (Z axis). This is called the HSV color space. Other color space examples include YUV, YCbCr, etc. Many color spaces can be represented as three- dimensional (X, Y, Z) values in this manner.
[0021] RGB-based color model is discussed extensively in the following discussion for purposes of illustration. Other color models can be used as well. For example, colors black and white are typically represented using 0 and 1, respectively, in a binary image comprising black and white pixels only; and the color characteristic value of a pixel in a grey scale image is presented with a grey scale value. Values of the black and white colors and gray scale values can be regarded as special cases of a RGB value with a conversion
relationship with the RGB value. Moreover, for a color image, its color can be represented variously as derived from RGB, e.g., a value of YCrCb (Brightness/Hue/Saturation), etc.
[0022] At 202, a main color of the source image is determined based at least in part on the color characteristic values. In some embodiments, for each color characteristic value, the number of pixels corresponding to the color characteristic value is counted. The results are compared to determine the main color of the source image. As used herein, a main color refers to a color with the highest frequency of occurring in the image. In other words, it is the color that corresponds to the color characteristic value that has the highest pixel count.
[0023] At 203, it is determined whether the source image is a non-product image based at least in part on the magnitude of the color characteristic value corresponding to the main color of the source image. In some embodiments, multiple rules are applied to make the determination. Examples of the rules are described in greater detail below.
[0024] The non-product image may be handled in various ways. For example, the non-product image may be automatically deleted from the product information with which the image is associated, or a warning may be sent to the user who uploaded the non-product image. In some embodiments, confirmation by a human (such as a system administrator) or by an additional image processing system is needed before the non-product image is deleted. In some embodiments, non-product images are classified into a different image category than product images depicting the product.
[0025] FIG. 3 is a flowchart illustrating an embodiment of a process for determining whether a source image is a non-product image. Process 300 may be used to implement step 203 of FIG. 2. In the example shown, the process applies several rules to make determination of whether a source image is a non-product image.
[0026] Initially, a general rule is applied for making a preliminary determination based on the determined main color of the source image. Generally, a color source image with a main color other than black or white can be considered as a product image, or conversely, a color source image with a main color of black or white will have a great probability of being a non-product image. Thus, at 302, it is determined whether the main color is black or white.
[0027] In some embodiments, a color is deemed to be black if it has a color characteristic value that is below a threshold A, and a color is deemed to be white if its color characteristic value that is above a threshold B. A set of colors with a color characteristic value between A and B are referred collectively to as "other colors". A and B take on specific values depending upon how to quantize a color space may vary depending on implementation.
[0028] If the main color is neither black nor white, at 304, the source image is deemed to be a product image. Otherwise, additional rules are applied. Specifically, at 306, the source image is divided into several image blocks and main colors of the respective image blocks are determined according to the number of pixels corresponding to respective color characteristic values in the respective image blocks.
[0029] At 308, whether the source image is a non-product image is determined based at least in part on the main colors of the respective image blocks. The determination process of a main color of an image block is similar to that for determining the main color of the source image as a whole. By dividing an image into several blocks and determining the non- product image status based on the distribution of colors in the image blocks can more accurately determine whether an image is a non-product image.
[0030] For example, when the main color of the source image is black or white, determination can further be performed according to the number of image blocks with a specific main color and/or the relative position of an image block with a specific main color in the source image.
[0031] In this example, the source image is RGB based and there are 8 different levels of intensity for each of the three primary colors of R (Red), G (Green) and B (Blue). Thus, the color characteristic value ranges from RGBOOO to RGB777 and the entire color space is quantized into 512 colors. For example, RGB235 represents an intensity value 2 for red color, an intensity value 3 for green color, and an intensity value 5 for blue color.
[0032] In this example, some specific colors are defined as follows according to the foregoing method for quantizing a color space for later convenient descriptions: a color with each of R, G, and B values less than or equal to 1 is defined as black; a color with each of R, G and B values greater than or equal to 6 is defined as white. Black and white are further specified as follows:
[0033] RGBOOO represents pure black;
[0034] RGBOO 1 , RGBO 10, RGBO 11 , RGB 100, RGB 101 and RGB 110 represent dark black;
[0035] RGB666, RGB667, RGB676 and RGB766 represent dark white; more particularly, RGB 666 also represents grey white;
[0036] RGB777, RGB677, RGB767 and RGB776 represent bright white; and RGB
777 also represents pure white.
[0037] Based on characteristics of the human visual system, the present embodiment introduces the concept of a valid color for more reasonable and accurate recognition. If the ratio of the number of pixels with non-gray color to the number of all the pixels is above a threshold, the color can be regarded as a valid color. Valid colors are primarily used for measuring the degree to which an image is colorful. As used herein, gray color is defined as a color with equal values of R, G, and B. As can be seen, the pure black (RGBOOO), grey white (RGB666) and pure white (RGB777) defined as above also fall into the category of grey color. In practice, a color with equal values of R, G, and B is typically a dark color and therefore the present embodiment also takes into account such a category of color. The threshold is set for the purpose of neglecting some colors seldom occurring in the image and experimental data demonstrates that the threshold can be set as approximately 5/1000. Other threshold values may be used.
[0038] FIG. 4 is a diagram illustrating an example of a source image divided into blocks for determining whether the image is a non-product image. In this example, the image is divided into 9 image blocks according to a 3x3 grid layout. This division ensures that colors on the sides, corners, and the center of the source image are evaluated separately and avoids overly complicated determination rule to improve recognition speed. Other divisions are possible in other embodiments.
[0039] In this example, the source image is divided into 9 image blocks which are denoted sequentially with "a block 0", "a block 1", "a block 8" from the left to the right and the top to the bottom as illustrated in Fig.4 for convenient descriptions. Main colors of the respective image blocks are determined respectively.
[0040] FIG. 5 is a flowchart illustrating an embodiment of a process for determining whether a source image is a non-product image based on its image blocks. Process 500 may be used to implement 308 of FIG. 3.
[0041] At 502, it is determined whether the main color of the source image is pure black, pure white, or something else.
[0042] If, at 530, it is determined that the main color of the source image is neither pure black nor pure white, then it is a product image.
[0043] If, at 504, the main color of the source image is determined to be pure black, it is possibly unrelated to the product and it is further determined whether the main colors of all nine image blocks are black, at 506. If so, the image is deemed to be a non-product image at 508; otherwise, it is deemed to be a product image at 510.
[0044] If, at 516, the main color of the source image is determined to be pure white, it is further determined, at 518, whether all the main colors of the nine image blocks are white. If no, the image is deemed to be a product image at 508. Otherwise, at 518, it is determined whether the number of grey-white blocks is 0, the number of pure white blocks is greater than 6 and the number of bright white blocks is greater than 7. If so, control is transferred to 520; otherwise, control is transferred to 522.
[0045] At 520, it is determined whether block 4 is bright white, and at least one of the blocks 1, 4 and 7 is pure white. If so, the image is deemed to be a non-product image at 510; otherwise, it is a product image at 508.
[0046] At 522, it is determined whether the number of dark white blocks is 9 and the number of grey white blocks is greater than 6. If so, the image is a non-product image;
otherwise, further determination is performed. At 524, it is determined whether the number of valid colors is less than 5, and if yes, the image is a product image; otherwise, it is determined, at 526, whether any of the following conditions (a-d) is satisfied. If so, it is a non-product image; otherwise, it is a product image. Conditions a-d include: (a) the number of bright white blocks is 9, the number of pure white blocks is 3, and block 4 is not pure white; (b) the number of pure white blocks is more than 7, block 4 is pure white, and blocks 1, 3, 5 and 7 are not grey white; (c) the number of valid colors is 1, the number of pure white
blocks is 3, and blocks 1 and 7 are grey white; (d) blocks 1, 3, 4 and 5 are all grey white, and the block 0 is not pure white.
[0047] Process 500 will be described below in connection with the images shown in
FIGS. IA and IB.
[0048] FIG. IA illustrates an image with a main color of pure white and fewer than five valid colors. When divided according to the scheme shown in FIG. 4, the color characteristic values of respective image blocks are illustrated below:
666 666 666
666 666 666
666 666 666
[0049] According to process 500, at 516, the number of white blocks is 9. Thus, at
518, it is determined whether the number of grey white blocks is 0, the number of pure white blocks is greater than 6, and the number of bright white blocks is greater than 7. Since the answer is no, it is determined whether the number of dark white blocks is 9 and whether the number of grey white blocks is greater than 6. In this case, the condition is true, thus the image is determined to be a non-product image.
[0050] FIG. IB illustrates an image with a main color of pure white and fewer than five valid colors. When divided according to the scheme shown in FIG. 4, the color characteristic values of respective image blocks are illustrated below:
777 777 777
777 777 777
777 777 777
[0051] According to process 500, it is determined at 514 that the number of white blocks is 9. It is further determined at 518 whether the number of grey white blocks is 0, the number of pure white blocks is greater than 6 and the number of bright white blocks is greater than 7. In this case the answer is yes, thus it is determined at 520 whether block 4 is
bright white and at least one of blocks 1, 4, and 7 is pure white. These conditions are also true, thus the image is deemed to be non-product.
[0052] The foregoing solution applies a digital image analysis technique to firstly extract a color feature of the image and then determine whether the image is a non-product image in combination with a predetermined determination rule to automatically distinguish a non-product image from a product image among pictures of a product so that the system can process them differently. Of course, the determination rule described in the present embodiment is merely a specific rule derived from real data, those skilled in the art can define various determination rules according to different application demands and the application will not be limited in this respect.
[0053] The technical solution according to the application can be applied to the stage of uploading a picture from the user and upon detection of the picture uploaded from the user being a non-product image, the system can reject to accept the picture or feed a message back to the user uploading the non-product image and prompt him to re-upload it to thereby ensure validity of the picture in the system. The foregoing solution can also be applied prior to data mining to reduce influence upon the data mining by precluding a non-product image. Also the recognized non-product image can be cleared to save a storage space of the system.
[0054] FIG. 6 is a block diagram illustrating an embodiment of a system configured to recognize non-product images. System 600 may be implemented using one or more computing devices such as a personal computer, a server computer, a handheld or portable device, a flat panel device, a multi-processor system, a microprocessor based system, a set- top box, a programmable consumer electronic device, a network PC, a minicomputer, a large- scale computer, a special purpose device, a distributed computing environment including any of the foregoing systems or devices, or other hardware/software/firmware combination that includes one or more processors, and memory coupled to the processors and configured to provide the processors with instructions.
[0055] In the example shown, system 600 includes a feature value acquisition unit
810 adapted to obtain color characteristic values of respective pixels in a source image. It further includes a source image main color determination unit 820 adapted to determine a main color of the source image. In some embodiments, the main color determination unit is configured to count the number of the pixels corresponding to the respective color characteristic values and make the determination based on the counting result. Also included
is a first recognition unit 830 adapted to recognize whether the source image is a non-product image based on the color characteristic value of main color of the source image.
[0056] FIG. 7 is a block diagram illustrating another embodiment of a system configured to recognize non-product images. Similar to system 600, system 700 also includes feature value acquisition unit 810, main color determination unit 820, and first recognition unit 830. It further includes an image conversion unit 800 adapted to convert the source image into a preset format before the characteristic value acquisition unit 810 acquires the characteristic values and an image block processing unit 840 adapted to divide the source image into several image blocks and to determine main colors of the respective image blocks. In addition it includes a second recognition unit 850 adapted to recognize whether the source image is a non-product image according to the main colors of the respective image blocks.
[0057] The units described above can be implemented as software components executing on one or more general purpose processors, as hardware such as programmable logic devices and/or Application Specific Integrated Circuits designed to perform certain functions or a combination thereof. In some embodiments, the units can be embodied by a form of software products which can be stored in a nonvolatile storage medium (such as optical disk, flash storage device, mobile hard disk, etc.), including a number of instructions for making a computer device (such as personal computers, servers, network equipments, etc.) implement the methods described in the embodiments of the present invention. The units may be implemented on a single device or distributed across multiple devices. The functions of the units may be merged into one another or further split into multiple sub-units.
[0058] Although the foregoing embodiments have been described in some detail for purposes of clarity of understanding, the invention is not limited to the details provided. There are many alternative ways of implementing the invention. The disclosed embodiments are illustrative and not restrictive.
[0059] WHAT IS CLAIMED IS:
Claims
1. An image recognition method, comprising:
obtaining color characteristic values of pixels in a source image;
determining a main color of the source image based at least in part on the color characteristic values; and
determining whether the source image is a non-product image based at least in part on the main color of the source image.
2. The method of Claim 1, wherein determining the main color of the source image includes determining number of pixels corresponding to the color characteristic values of pixels in the source image and identifying a color that corresponds to the color characteristic value that has the highest pixel count.
3. The method of Claim 1, wherein determining whether the source image is a non- product image based at least in part on the main color of the source image includes:
performing a preliminary determination of whether the source image is likely to be a non-product image based at least in part on the main color of the source image;
in the event that it is determined that the source image is likely to be a non-product image:
dividing the source image into a plurality of image blocks;
determining main colors of the respective image blocks; and
determining whether the source image is a non-product image based at least in part on the main colors of the respective image blocks.
4. The method of Claim 3, wherein determining whether the source image is a non- product image based at least in part on the main colors of the respective image blocks includes determining whether the source image is a non-product image based at least in part on number of image blocks with a specific main color.
5. The method of Claim 3, wherein determining whether the source image is a non- product image based at least in part on the main colors of the respective image blocks includes determining whether the source image is a non-product image based at least in part on a relative position of an image block with a specific main color in the source image.
6. The method of Claim 3, wherein dividing the source image into a plurality of image blocks comprises dividing the source image into nine image blocks in a 3x3 layout.
7. The method of Claim 1, further comprising converting the source image according to a preset format; and wherein obtaining the color characteristic values of the pixels in the source image comprises acquiring the color characteristic values of the pixels in the converted format.
8. The method of Claim 1, wherein the color characteristic values are RGB values.
9. The method of Claim 1, wherein determining whether the source image is a non- product image based at least in part on the main color of the source image includes:
determining whether the main color is black or white; and
in the event that the main color is black or white, recognizing the source image as a product image.
10. The method of Claim 9, wherein in the event that the main color is black or white, the method further comprises dividing the source image into nine image blocks of a 3x3 layout.
11. The method of Claim 10, wherein in the event that the main color is black or white, the method further comprises recognizing whether the source image is a non-product image according to the number of image blocks with a specific main color.
12. The method of Claim 10, wherein in the event that the main color is black or white, the method further comprises recognizing whether the source image is a non-product image according to a relative position of an image block with a specific main color in the source image.
13. An image recognition system comprising:
one or more processors configured to:
obtain color characteristic values of pixels in a source image; determine a main color of the source image based at least in part on the color characteristic values; and
determine whether the source image is a non-product image based at least in part on the main color of the source image; and
one or more memories coupled to the one or more processors, configured to provide the one or more processors with instructions.
14. The system of Claim 13, wherein determining the main color of the source image includes determining number of pixels corresponding to the color characteristic values of pixels in the source image and identifying a color that corresponds to the color characteristic value that has the highest pixel count.
15. The system of Claim 13, wherein determining whether the source image is a non- product image based at least in part on the main color of the source image includes:
performing a preliminary determination of whether the source image is likely to be a non-product image based at least in part on the main color of the source image;
in the event that it is determined that the source image is likely to be a non-product image:
dividing the source image into a plurality of image blocks;
determining main colors of the respective image blocks; and determining whether the source image is a non-product image based at least in part on the main colors of the respective image blocks.
16. The system of Claim 15, wherein determining whether the source image is a non- product image based at least in part on the main colors of the respective image blocks includes determining whether the source image is a non-product image based at least in part on number of image blocks with a specific main color.
17. The system of Claim 15, wherein determining whether the source image is a non- product image based at least in part on the main colors of the respective image blocks includes determining whether the source image is a non-product image based at least in part on a relative position of an image block with a specific main color in the source image.
18. The system of Claim 15, wherein dividing the source image into a plurality of image blocks comprises dividing the source image into nine image blocks in a 3x3 layout.
19. The system of Claim 13, wherein the one or more processors are further configured to:
convert the source image according to a preset format; and wherein obtaining the color characteristic values of the pixels in the source image comprises acquiring the color characteristic values of the pixels in the converted format.
20. The system of Claim 13, wherein determining whether the source image is a non- product image based at least in part on the main color of the source image include:
determining whether the main color is black or white, and
in the event that the main color is black or white, recognizing the source image as a product image.
21. The method of Claim 9, wherein in the event that the main color is black or white, the method further comprises dividing the source image into nine image blocks of a 3x3 layout.
22. The method of Claim 10, wherein in the event that the main color is black or white, the method further comprises recognizing whether the source image is a non-product image according to the number of image blocks with a specific main color.
23. The method of Claim 10, wherein in the event that the main color is black or white, the method further comprises recognizing whether the source image is a non-product image according to a relative position of an image block with a specific main color in the source image.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012518528A JP5409910B2 (en) | 2009-07-02 | 2010-07-01 | Non-product image identification |
EP10794500.8A EP2449506A4 (en) | 2009-07-02 | 2010-07-01 | Non-product image identification |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009101495528A CN101599122B (en) | 2009-07-02 | 2009-07-02 | Image identification method and device |
CN200910149552.8 | 2009-07-02 | ||
US12/803,599 US8515164B2 (en) | 2009-07-02 | 2010-06-30 | Non-product image identification |
US12/803,599 | 2010-06-30 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2011002524A1 true WO2011002524A1 (en) | 2011-01-06 |
Family
ID=41420563
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2010/001898 WO2011002524A1 (en) | 2009-07-02 | 2010-07-01 | Non-product image identification |
Country Status (6)
Country | Link |
---|---|
US (1) | US8515164B2 (en) |
EP (1) | EP2449506A4 (en) |
JP (1) | JP5409910B2 (en) |
CN (1) | CN101599122B (en) |
HK (1) | HK1137552A1 (en) |
WO (1) | WO2011002524A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107103330A (en) * | 2017-03-31 | 2017-08-29 | 深圳市浩远智能科技有限公司 | A kind of LED status recognition methods and device |
US9818204B2 (en) | 2014-09-26 | 2017-11-14 | Capitalbio Corporation | Method for monitoring, identification, and/or detection using a camera based on a color feature |
US10049464B2 (en) | 2014-09-26 | 2018-08-14 | Capitalbio Corporation | Method for identifying a unit using a camera |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104298982B (en) * | 2013-07-16 | 2019-03-08 | 深圳市腾讯计算机系统有限公司 | A kind of character recognition method and device |
CN103927513A (en) * | 2014-03-26 | 2014-07-16 | 广州品唯软件有限公司 | Method and device for identifying Logo |
CN104200219B (en) * | 2014-08-20 | 2017-12-08 | 深圳供电局有限公司 | Automatic identification method and device for switch position indication of transformer substation disconnecting link position |
WO2016101767A1 (en) * | 2014-12-24 | 2016-06-30 | 北京奇虎科技有限公司 | Picture cropping method and device and image detecting method and device |
CN110490250A (en) * | 2019-08-19 | 2019-11-22 | 广州虎牙科技有限公司 | A kind of acquisition methods and device of artificial intelligence training set |
US20220351496A1 (en) * | 2019-12-24 | 2022-11-03 | Intel Corporation | Image content classification |
CN115018565A (en) * | 2022-08-08 | 2022-09-06 | 长沙朗源电子科技有限公司 | Advertisement media image identification method, system, equipment and readable storage medium |
CN116013190A (en) * | 2022-11-03 | 2023-04-25 | 深圳创维-Rgb电子有限公司 | Color bar picture detection method and device, display equipment and readable storage medium |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040170318A1 (en) * | 2003-02-28 | 2004-09-02 | Eastman Kodak Company | Method for detecting color objects in digital images |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2872285B2 (en) | 1989-08-02 | 1999-03-17 | キヤノン株式会社 | Image processing apparatus and image processing method |
JPH0366516A (en) | 1989-08-04 | 1991-03-22 | Ryobi Ltd | Hand fret saw device |
JP3100391B2 (en) | 1990-08-15 | 2000-10-16 | 株式会社リコー | Color image area separation device |
JP3258122B2 (en) * | 1993-03-31 | 2002-02-18 | 株式会社東芝 | Image processing device |
US5493386A (en) | 1995-01-03 | 1996-02-20 | Eastman Kodak Company | Multi-toner image forming apparatus and method having pattern recognition |
US6108098A (en) | 1995-12-28 | 2000-08-22 | Canon Kabushiki Kaisha | Image processing apparatus and method |
JP3222091B2 (en) | 1997-05-27 | 2001-10-22 | シャープ株式会社 | Image processing apparatus and medium storing image processing apparatus control program |
JPH11296672A (en) * | 1998-04-07 | 1999-10-29 | Dainippon Screen Mfg Co Ltd | Image color information extraction method |
CN1290312C (en) | 1998-06-23 | 2006-12-13 | 夏普公司 | Image processing device and its method for removing and reading strik-through produced by double side or overlaped master cope |
CN100428278C (en) * | 1999-02-05 | 2008-10-22 | 三星电子株式会社 | Color image processing method and apparatus thereof |
US6778697B1 (en) * | 1999-02-05 | 2004-08-17 | Samsung Electronics Co., Ltd. | Color image processing method and apparatus thereof |
US7016532B2 (en) * | 2000-11-06 | 2006-03-21 | Evryx Technologies | Image capture and identification system and process |
JP3626679B2 (en) * | 2000-12-06 | 2005-03-09 | 株式会社ガーラ | A method for discriminating nude images by computer image processing |
JP3970052B2 (en) | 2002-02-27 | 2007-09-05 | キヤノン株式会社 | Image processing device |
CN100565523C (en) | 2007-04-05 | 2009-12-02 | 中国科学院自动化研究所 | A kind of filtering sensitive web page method and system based on multiple Classifiers Combination |
CN100568283C (en) | 2007-12-07 | 2009-12-09 | 北京搜狗科技发展有限公司 | A kind of picture dominant hue analytical approach and device thereof |
-
2009
- 2009-07-02 CN CN2009101495528A patent/CN101599122B/en not_active Expired - Fee Related
-
2010
- 2010-05-25 HK HK10105088.0A patent/HK1137552A1/en not_active IP Right Cessation
- 2010-06-30 US US12/803,599 patent/US8515164B2/en not_active Expired - Fee Related
- 2010-07-01 WO PCT/US2010/001898 patent/WO2011002524A1/en active Application Filing
- 2010-07-01 JP JP2012518528A patent/JP5409910B2/en not_active Expired - Fee Related
- 2010-07-01 EP EP10794500.8A patent/EP2449506A4/en not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040170318A1 (en) * | 2003-02-28 | 2004-09-02 | Eastman Kodak Company | Method for detecting color objects in digital images |
Non-Patent Citations (2)
Title |
---|
See also references of EP2449506A4 * |
SWAIN ET AL.: "Color Indexing.", INTERNATIONAL JOURNAL OF COMPUTER VISION, vol. 7, no. 1, 1991, pages 11 - 32, XP008150018 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9818204B2 (en) | 2014-09-26 | 2017-11-14 | Capitalbio Corporation | Method for monitoring, identification, and/or detection using a camera based on a color feature |
US10049464B2 (en) | 2014-09-26 | 2018-08-14 | Capitalbio Corporation | Method for identifying a unit using a camera |
US10885673B2 (en) | 2014-09-26 | 2021-01-05 | Capitalbio Corporation | Method for identifying a unit using a camera |
CN107103330A (en) * | 2017-03-31 | 2017-08-29 | 深圳市浩远智能科技有限公司 | A kind of LED status recognition methods and device |
Also Published As
Publication number | Publication date |
---|---|
CN101599122B (en) | 2013-06-19 |
CN101599122A (en) | 2009-12-09 |
HK1137552A1 (en) | 2010-07-30 |
US8515164B2 (en) | 2013-08-20 |
EP2449506A1 (en) | 2012-05-09 |
EP2449506A4 (en) | 2017-03-15 |
US20110002535A1 (en) | 2011-01-06 |
JP5409910B2 (en) | 2014-02-05 |
JP2012532377A (en) | 2012-12-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8515164B2 (en) | Non-product image identification | |
CN113160257B (en) | Image data labeling method, device, electronic equipment and storage medium | |
Lee et al. | Toward a no-reference image quality assessment using statistics of perceptual color descriptors | |
US20210174135A1 (en) | Method of matching image and apparatus thereof, device, medium and program product | |
US8346022B2 (en) | System and method for generating an intrinsic image using tone mapping and log chromaticity | |
CN108961183B (en) | Image processing method, terminal device and computer-readable storage medium | |
CN106203461B (en) | Image processing method and device | |
CN112419214A (en) | Method and device for generating labeled image, readable storage medium and terminal equipment | |
CN113391779B (en) | Parameter adjusting method, device and equipment for paper-like screen | |
CN111626967A (en) | Image enhancement method, image enhancement device, computer device and readable storage medium | |
US10395373B1 (en) | Image feature detection | |
CN111291778B (en) | Training method of depth classification model, exposure anomaly detection method and device | |
CN109961015A (en) | Image-recognizing method, device, equipment and storage medium | |
CN117237637A (en) | Image signal processing system and method | |
CN115374517A (en) | Testing method and device for wiring software, electronic equipment and storage medium | |
CN112989924B (en) | Target detection method, target detection device and terminal equipment | |
CN114463168A (en) | Data desensitization processing method and device and electronic equipment | |
US10713792B1 (en) | System and apparatus for image processing | |
CN105843972A (en) | Method and device for comparing product attribute information | |
CN112201117B (en) | Logic board identification method and device and terminal equipment | |
CN113474786A (en) | Electronic purchase order identification method and device and terminal equipment | |
US11238595B2 (en) | System to prepare images for presentation | |
CN111275725B (en) | Method and device for determining color temperature and tone of image, storage medium and terminal | |
CN113469297B (en) | Image tampering detection method, device, equipment and computer readable storage medium | |
US20220245819A1 (en) | Method for processing images, electronic device, and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10794500 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2012518528 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010794500 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |