WO2017088249A1 - 特征提取方法及装置 - Google Patents
特征提取方法及装置 Download PDFInfo
- Publication number
- WO2017088249A1 WO2017088249A1 PCT/CN2015/099310 CN2015099310W WO2017088249A1 WO 2017088249 A1 WO2017088249 A1 WO 2017088249A1 CN 2015099310 W CN2015099310 W CN 2015099310W WO 2017088249 A1 WO2017088249 A1 WO 2017088249A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- blocks
- hog
- frequency domain
- cells
- Prior art date
Links
- 238000000605 extraction Methods 0.000 title claims abstract description 58
- 238000004364 calculation method Methods 0.000 claims abstract description 5
- 239000011159 matrix material Substances 0.000 claims description 41
- 238000000034 method Methods 0.000 claims description 27
- 239000013598 vector Substances 0.000 claims description 24
- 238000006243 chemical reaction Methods 0.000 claims description 6
- 238000001514 detection method Methods 0.000 abstract description 14
- 230000000694 effects Effects 0.000 abstract description 2
- 230000009466 transformation Effects 0.000 description 15
- 238000003909 pattern recognition Methods 0.000 description 14
- 238000010586 diagram Methods 0.000 description 11
- 238000004891 communication Methods 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 4
- 230000005236 sound signal Effects 0.000 description 4
- 238000000638 solvent extraction Methods 0.000 description 3
- 230000001133 acceleration Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/50—Extraction of image or video features by performing operations within image blocks; by using histograms, e.g. histogram of oriented gradients [HoG]; by summing image-intensity values; Projection analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/10—Image enhancement or restoration by non-spatial domain filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/40—Image enhancement or restoration by the use of histogram techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/42—Global feature extraction by analysis of the whole pattern, e.g. using frequency domain transformations or autocorrelation
- G06V10/431—Frequency domain transformation; Autocorrelation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20021—Dividing image into blocks, subimages or windows
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20048—Transform domain processing
- G06T2207/20052—Discrete cosine transform [DCT]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20048—Transform domain processing
- G06T2207/20056—Discrete and fast Fourier transform, [DFT, FFT]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
- G06T2207/30024—Cell structures in vitro; Tissue sections in vitro
Definitions
- the present disclosure relates to the field of image processing technologies, and in particular, to a feature extraction method and apparatus.
- Image detection and recognition is an important research area in computer vision.
- the most common method used in image detection and recognition technology is to detect and identify images by extracting certain features in the image.
- an image is detected and identified by extracting a HOG (Histogram of Oriented Gradient) feature of an image.
- the HOG feature extraction method is as follows: calculating a gradient of each pixel in the image; dividing the image into a plurality of cells, each cell including a plurality of pixels, and forming n blocks for each adjacent n cells; counting each unit The gradient histogram of all the pixels in the cell, and then the HOG feature of each block is obtained according to the gradient histogram of all the cells in each block; the HOG feature of all the blocks in the statistical image obtains the HOG feature of the image.
- the present disclosure provides a feature extraction method and apparatus.
- the technical solution is as follows:
- a feature extraction method comprising:
- a direction gradient histogram HOG feature of the image in the frequency domain is extracted.
- converting each cell from a spatial domain to a frequency domain comprises: performing a discrete cosine transform DCT on each cell.
- each cell is converted from a spatial domain to a frequency domain, including:
- Each cell is subjected to a discrete Fourier transform DFT.
- the direction gradient histogram HOG feature of the image in the frequency domain is extracted, including:
- the HOG feature of each block in the frequency domain is counted to obtain the HOG feature of the image.
- the HOG feature of each block in the image is obtained, and the HOG feature of the image is obtained, including:
- the HOG features of each block in the image are concatenated into a matrix to obtain the HOG features of the image, and each column of the matrix is a HOG feature of one block.
- the HOG feature of each block in the image is obtained, and the HOG feature of the image is obtained, including:
- the HOG feature of the image is obtained based on the adjusted HOG characteristics of each block and the corresponding position of each block in the image.
- the method further includes:
- the image is normalized to obtain an image of a predetermined size.
- a feature extraction apparatus comprising:
- a partitioning module configured to divide an image into a plurality of blocks, each block comprising a plurality of cells
- a conversion module configured to convert each cell from a spatial domain to a frequency domain
- An extraction module configured to extract a direction gradient histogram HOG feature of the image in the frequency domain.
- the conversion module is configured to perform a discrete cosine transform DCT on each cell.
- the conversion module is configured to perform a discrete Fourier transform DFT for each cell.
- the extraction module includes:
- a calculation submodule configured to calculate a gradient size and a gradient direction of each cell in the frequency domain to obtain a descriptor for each cell
- a first statistic sub-module configured to count each descriptor in each block in the frequency domain to obtain an HOG feature of each block
- the second statistic sub-module is configured to count HOG features of the image in each block of the frequency domain to obtain an HOG feature of the image.
- the second statistic sub-module is configured to concatenate HOG features of each block in the image into a matrix to obtain HOG features of the image, and each column of the matrix is a HOG feature of one block.
- the second statistic submodule includes:
- the feature extraction sub-module obtains the HOG feature of the image according to the adjusted HOG feature of each block and the corresponding position of each block in the image.
- the device further includes:
- the processing module is configured to normalize the image to obtain an image of a predetermined size.
- a feature extraction apparatus comprising:
- a memory for storing processor executable instructions
- processor is configured to:
- a direction gradient histogram HOG feature of the image in the frequency domain is extracted.
- each block By dividing the image into several blocks, each block includes several cells; converting each cell from the spatial domain to the frequency domain; extracting the directional gradient histogram HOG features of the image in the frequency domain; solving the HOG feature
- the extraction process is directly calculated for the spatial domain of the image, resulting in a low detection rate and accuracy in pattern recognition; the HOG feature of extracting images in the frequency domain is achieved, and the detection rate in pattern recognition is improved. The effect of accuracy.
- FIG. 1 is a flowchart of a feature extraction method according to an exemplary embodiment
- FIG. 2A is a flowchart of a feature extraction method according to another exemplary embodiment
- 2B is a schematic diagram of image division according to an exemplary embodiment
- 2C is a schematic diagram of an image division according to another exemplary embodiment
- 2D is a schematic diagram of a statistical intra-block HOG feature, according to an exemplary embodiment
- FIG. 3A is a flowchart of a feature extraction method according to an exemplary embodiment
- FIG. 3B is a schematic diagram of a statistical image HOG feature, according to an exemplary embodiment
- FIG. 4 is a block diagram of a feature extraction apparatus according to an exemplary embodiment
- FIG. 5 is a block diagram of a feature extraction apparatus according to another exemplary embodiment
- FIG. 6 is a block diagram of a sub-module of a feature extraction device, according to an exemplary embodiment
- FIG. 7 is a block diagram of a feature extraction device, according to another exemplary embodiment.
- FIG. 1 is a flowchart of a feature extraction method according to an exemplary embodiment. As shown in FIG. 1 , the embodiment is illustrated by using the method in hardware for pattern recognition. The method may include the following steps.
- step 102 the image is divided into blocks, each block comprising a number of cells.
- each cell is converted from a spatial domain to a frequency domain.
- step 106 the HOG features of the image in the frequency domain are extracted.
- the HOG features of the image are extracted in the frequency domain.
- the feature extraction method divides an image into a plurality of blocks, each block includes a plurality of cells; converts each cell from a spatial domain to a frequency domain; and extracts an image in Directional gradient histogram HOG feature in the frequency domain; solved the problem of directly calculating the spatial domain of the image in the HOG feature extraction process, resulting in low detection rate and accuracy in pattern recognition; reaching the frequency domain Extracting the HOG features of the image improves the detection rate and accuracy in pattern recognition.
- FIG. 2A is a flowchart of a feature extraction method according to another exemplary embodiment. As shown in FIG. 2A, the embodiment is illustrated by using the method in hardware for pattern recognition. The method may include the following steps. :
- step 201 the image is normalized to obtain an image of a predetermined size.
- the terminal Before the feature extraction of the image, the terminal first normalizes the image, and processes the images of different sizes into images of a predetermined size to facilitate unified processing of the image.
- step 202 the image is divided into blocks, each block comprising a number of cells.
- the dividing, by the terminal, the normalized image includes: dividing the image into a plurality of blocks, and dividing each block into a plurality of cells.
- the terminal divides the normalized image into: dividing the image into a plurality of cells, and then combining the connected cells into a block, where each block includes a plurality of cells, for example: Two adjacent two adjacent rows of cells form a block.
- the order of dividing the block and dividing the cell is not specifically limited, and the cell may be divided into cells before being divided, or the cells may be first divided into blocks.
- whether there is an overlapping area between the block and the block in which the image is divided is not specifically limited, and there may be an overlapping area between the block and the block, and there may be no overlapping area.
- step 203 a DCT transform is performed on each cell.
- the DCT Discrete Cosine Transform
- Each cell in the image is DCT transformed to convert the image from the spatial domain to the frequency domain.
- the terminal performs DFT transform (Discrete Fourier Transform) on each cell.
- DFT transform Discrete Fourier Transform
- the terminal performs DFT transformation on each cell in the image to convert the image from the spatial domain to the frequency domain.
- step 204 the gradient magnitude and gradient direction of each cell in the frequency domain are calculated to obtain a descriptor for each cell.
- the terminal uses a gradient operator to calculate a lateral gradient and a vertical gradient of each pixel in each cell after DCT transformation or DFT transformation.
- An exemplary, commonly used gradient operator is shown in Table 1 below:
- any gradient operator in Table 1 may be selected, or other gradient operators may be selected.
- the selection of the gradient operator is not specified. limited.
- ⁇ (x, y) is the gradient direction of the pixel (x, y)
- m(x, y) is the gradient size of the pixel (x, y).
- the gradient direction ⁇ (x, y) ranges from -90 degrees to 90 degrees, and the gradient direction ⁇ (x, y) is equally divided into z parts, and all pixels in each cell are weighted by m (x, y). Each of the divisions in the gradient direction is counted, and finally each cell gets a z-dimensional vector, that is, the descriptor corresponding to each cell is obtained.
- the gradient direction ⁇ (x, y) is equally divided into 9 parts, and each corresponding angle is 20 degrees, and all pixels in each cell are counted in each 20 degrees according to the weight m(x, y). Finally, a 9-dimensional vector is obtained for each cell.
- the number of divisions of the gradient direction is not specifically limited.
- step 205 individual descriptors within each block in the frequency domain are counted to obtain HOG features for each block.
- the terminal counts the descriptors calculated in each cell included in each block to obtain the HOG feature of each block.
- the terminal may concatenate the descriptors corresponding to the respective cells, such that the HOG feature of each block is a vector, and the dimension of the vector is a unit included in the block.
- the grid corresponds to the k dimension of the descriptor dimension.
- the descriptors in each cell are 9-dimensional vectors.
- Each block contains 4 cells, and the 9-dimensional descriptors in the 4 cells are concatenated to form a 36-dimensional vector.
- the vector of the dimension is taken as the HOG feature of the corresponding block.
- step 206 the HOG features of the respective blocks in the frequency domain are counted to obtain the HOG features of the image.
- the terminal counts the HOG features of each block to obtain the HOG feature of the image. Place each block in the image
- the HOG features are concatenated into a matrix to obtain the HOG features of the image, and each column of the matrix is a HOG feature of a block.
- K i HOG features are connected in series to form a matrix 25, and K 1 is placed in the first column 26 of the matrix in series, and K 2 is Placed in the second column 27 of the matrix in series, and so on. As shown in Figure 2D.
- the feature extraction method divides an image into a plurality of blocks, each block includes a plurality of cells; performs DCT transformation or DFT transformation on each cell; and calculates a frequency domain.
- the gradient size and gradient direction of each cell, the descriptor of each cell is obtained; each descriptor in each block in the statistical frequency domain is obtained, and the HOG feature of each block is obtained; the statistical image is in each block of the frequency domain.
- the HOG feature obtains the HOG feature of the image; it solves the problem that the HOG feature extraction process is directly calculated for the spatial domain of the image, resulting in a low detection rate and accuracy in the pattern recognition; the image is extracted in the frequency domain.
- the HOG feature improves the detection rate and accuracy in pattern recognition.
- Step 206 in the process of obtaining the HOG features of the images in the HOG features of the individual blocks in the statistical image, they may be arranged in the form of corresponding positions in the image.
- Step 206 can be replaced with the following steps 206a and 206b, as shown in FIG. 3A:
- the HOG feature of each block is an L*1 dimensional vector obtained by concatenating the descriptors corresponding to the respective cells, and the L*1 dimensional vector is adjusted to a matrix of M*N, that is, the terminal sets the L* in each block.
- the 1D vector is adjusted to the corresponding matrix according to the included cells, and each column of the corresponding matrix is a descriptor of a cell; then the descriptor of each cell is adjusted according to the corresponding pixel, and the matrix obtained after adjustment is obtained.
- Each column is a HOG feature corresponding to a pixel of a corresponding column in the corresponding block.
- step 206b the HOG feature of the image is obtained based on the adjusted HOG characteristics of each block and the corresponding position of each block in the image.
- the HOG features of the corresponding pixel locations in the image are obtained based on the adjusted HOG features of each block and the corresponding locations of each block in the image.
- the K i HOG features are adjusted to a matrix of M*N, and the matrix 31 adjusted by K 1 is placed in the first block 32. the corresponding position in the image, the K matrix 33 corresponding to the second adjustment in the position of the second block 34 in the image, and so on, and finally a corresponding position on the matrix MN last block MN in an image.
- Figure 3B shows that
- the feature extraction method adjusts the HOG feature of each block in the image from the initial L*1 dimensional vector to the M*N matrix, and each block includes M*N pixels.
- L M*N; according to the adjusted HOG feature of each block and the corresponding position of each block in the image, the HOG feature of the image is obtained; so that the HOG feature of the extracted image corresponds to each block in the image.
- the corresponding position can better highlight the features of each block in the image.
- FIG. 4 is a block diagram of a feature extraction apparatus according to an exemplary embodiment. As shown in FIG. 4, the feature extraction apparatus includes, but is not limited to:
- a partitioning module 420 is configured to divide the image into a number of blocks, each block comprising a number of cells.
- a transformation module 440 is configured to convert each cell from a spatial domain to a frequency domain.
- the transformation module 440 converts each cell to convert the image from the spatial domain to the frequency domain.
- An extraction module 460 is configured to extract a direction gradient histogram HOG feature of the image in the frequency domain.
- the feature extraction apparatus divides an image into a plurality of blocks, each block includes a plurality of cells; each cell is converted from a spatial domain to a frequency domain;
- FIG. 5 is a block diagram of a feature extraction apparatus according to another exemplary embodiment. As shown in FIG. 5, the feature extraction apparatus includes, but is not limited to:
- the processing module 410 is configured to normalize the image to obtain an image of a predetermined size.
- the processing module 410 Before performing feature extraction on the image, the processing module 410 normalizes the image, and processes images of different sizes into images of a predetermined size to facilitate uniform processing of the image.
- a partitioning module 420 is configured to divide the image into a number of blocks, each block comprising a number of cells.
- dividing the normalized image by the dividing module 420 includes dividing the image into a plurality of blocks, and dividing each block into a plurality of cells.
- the dividing module 420 divides the normalized image into: dividing the image into a plurality of cells, and then combining the connected cells into a block, where each block includes a plurality of cells, such as : Combine two adjacent two cells arranged in a field to form a block.
- the dividing module 420 does not specifically define the order of dividing the block and dividing the cell in the image dividing process, and may first divide the block and then divide the cell, or first divide the cells into blocks.
- the dividing module 420 does not specifically define whether there is an overlapping area between the block and the block divided by the image, and there may be an overlapping area between the block and the block, or there may be no overlapping area.
- a transformation module 440 is configured to perform a discrete cosine transform DCT on each cell.
- the DCT Discrete Cosine Transform
- the transformation module 440 performs DCT transformation on each cell in the image to convert the image from the spatial domain to the frequency domain.
- the transformation module 440 is configured to perform a DFT transform (Discrete Fourier Transform) on each of the cells.
- DFT transform Discrete Fourier Transform
- the transformation module 440 performs a DFT transformation on each of the cells in the image to convert the image from the spatial domain to the frequency domain.
- An extraction module 460 is configured to extract a direction gradient histogram HOG feature of the image in the frequency domain.
- the extraction module 460 can include the following sub-modules:
- the calculation sub-module 461 is configured to calculate the gradient size and gradient direction of each cell in the frequency domain to obtain a descriptor for each cell.
- the calculation sub-module 461 calculates a lateral gradient and a vertical gradient of each pixel in each cell after DCT transformation or DFT transformation using a gradient operator.
- the selection of the gradient operator in this embodiment is not specifically limited.
- ⁇ (x, y) is the gradient direction of the pixel (x, y)
- m(x, y) is the gradient size of the pixel (x, y).
- the gradient direction ⁇ (x, y) ranges from -90 degrees to 90 degrees, and the gradient direction ⁇ (x, y) is equally divided into z parts, and all pixels in each cell are weighted by m (x, y). Each of the divisions in the gradient direction is counted, and finally each cell gets a z-dimensional vector, that is, the descriptor corresponding to each cell is obtained.
- the number of divisions of the gradient direction is not specifically limited.
- the first statistic sub-module 462 is configured to count individual descriptors within each block in the frequency domain to obtain HOG features for each block.
- the first statistic sub-module 462 performs statistics on the descriptors calculated in each cell included in each block to obtain the HOG feature of each block.
- the first statistic sub-module 462 may concatenate the descriptors corresponding to the respective cells such that the HOG feature of each block is a vector, and the dimension of the vector is The block contains k times the number of descriptors corresponding to the cell.
- the second statistic sub-module 463 is configured to count the HOG features of the respective blocks in the frequency domain of the image to obtain the HOG features of the image.
- the second statistic sub-module 463 counts the HOG features of each block to obtain the HOG features of the image.
- the second statistic sub-module 463 is configured to concatenate the HOG features of each block in the image into a matrix to obtain an HOG feature of the image, and each column of the matrix is a HOG feature of one block.
- the feature extraction device divides an image into several Blocks, each block comprising several cells; performing DCT transform or DFT transform on each cell; calculating the gradient size and gradient direction of each cell in the frequency domain to obtain a descriptor for each cell; statistical frequency
- Each descriptor in each block in the domain obtains the HOG feature of each block; the HOG feature of each block of the statistical image in the frequency domain is obtained, and the HOG feature of the image is obtained; and the space for the image in the HOG feature extraction process is solved.
- the domain is directly calculated, which leads to the problem of low detection rate and accuracy in pattern recognition; the HOG feature of extracting images in the frequency domain is achieved, and the detection rate and accuracy in pattern recognition are improved.
- the second statistic sub-module 463 can include the following sub-modules, as shown in FIG. 6:
- the HOG feature of each block is an L*1 dimensional vector obtained by concatenating the descriptors corresponding to the respective cells, and the adjustment submodule 610 adjusts the L*1 dimensional vector to a matrix of M*N, that is, in each block
- the L*1 dimensional vector is adjusted to the corresponding matrix according to the included cells, and each column of the corresponding matrix is a descriptor of a cell; then the descriptor of each cell is adjusted according to the corresponding pixel, and then adjusted.
- Each column of the resulting matrix is a HOG feature corresponding to a pixel of a corresponding column in the corresponding block.
- the feature extraction sub-module 620 is configured to obtain an HOG feature of the image according to the adjusted HOG feature of each block and the corresponding position of each block in the image.
- the feature extraction sub-module 620 obtains the HOG feature of the corresponding pixel location in the image according to the adjusted HOG feature of each block and the corresponding position of each block in the image.
- the feature extraction apparatus adjusts the HOG feature of each block in the image from the initial L*1 dimensional vector to the matrix of M*N, and each block includes M*N pixels.
- L M*N; according to the adjusted HOG feature of each block and the corresponding position of each block in the image, the HOG feature of the image is obtained; so that the HOG feature of the extracted image corresponds to each block in the image.
- the corresponding position can better highlight the features of each block in the image.
- An exemplary embodiment of the present disclosure provides a feature extraction device capable of implementing the feature extraction method provided by the present disclosure, the feature extraction device including: a processor, a memory for storing processor executable instructions;
- processor is configured to:
- a direction gradient histogram HOG feature of the image in the frequency domain is extracted.
- FIG. 7 is a block diagram of a feature extraction device, according to an exemplary embodiment.
- device 700 can be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a gaming console, a tablet device, a medical device, a fitness device, a personal digital assistant, and the like.
- apparatus 700 can include one or more of the following components: processing component 702, memory 704, power component 706, multimedia component 708, audio component 710, input/output (I/O) interface 712, sensor component 714, and Communication component 716.
- Processing component 702 typically controls the overall operation of device 700, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
- Processing component 702 can include one or more processors 718 to execute instructions to perform all or part of the steps of the methods described above.
- processing component 702 can include one or more modules to facilitate interaction between component 702 and other components.
- processing component 702 can include a multimedia module to facilitate interaction between multimedia component 708 and processing component 702.
- Memory 704 is configured to store various types of data to support operation at device 700. Examples of such data include instructions for any application or method operating on device 700, contact data, phone book data, messages, pictures, videos, and the like. Memory 704 can be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Disk or Optical Disk.
- SRAM static random access memory
- EEPROM electrically erasable programmable read only memory
- EPROM erasable Programmable Read Only Memory
- PROM Programmable Read Only Memory
- ROM Read Only Memory
- Magnetic Memory Flash Memory
- Disk Disk or Optical Disk.
- Power component 706 provides power to various components of device 700.
- Power component 706 can include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for device 700.
- the multimedia component 708 includes a screen between the device 700 and the user that provides an output interface.
- the screen can include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen can be implemented as a touch screen to receive input signals from the user.
- the touch panel includes one or more touch sensors to sense touches, slides, and gestures on the touch panel. The touch sensor can sense not only the boundaries of the touch or sliding action, but also the duration and pressure associated with the touch or slide operation.
- the multimedia component 708 includes a front camera and/or a rear camera. When the device 700 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
- the audio component 710 is configured to output and/or input an audio signal.
- audio component 710 includes a microphone (MIC) that is configured to receive an external audio signal when device 700 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode.
- the received audio signal may be further stored in memory 704 or transmitted via communication component 716.
- audio component 710 also includes a speaker for outputting an audio signal.
- the I/O interface 712 provides an interface between the processing component 702 and the peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
- Sensor assembly 714 includes one or more sensors for providing device 700 with various aspects of status assessment.
- sensor component 714 can detect an open/closed state of device 700, relative positioning of components, such as a display and a keypad of device 700, and sensor component 714 can also detect a change in position of device 700 or a component of device 700, user The presence or absence of contact with device 700, device 700 orientation or acceleration/deceleration and temperature variation of device 700.
- Sensor assembly 714 can include a proximity sensor configured to detect the presence of nearby objects without any physical contact.
- Sensor component 714 can also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
- the sensor component 714 can also include an acceleration sensor, a gyro sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
- Communication component 716 is configured to facilitate wired or wireless communication between device 700 and other devices.
- the device 700 can access a wireless network based on a communication standard, such as Wi-Fi, 2G or 3G, or a combination thereof.
- communication component 716 receives broadcast signals or broadcast associated information from an external broadcast management system via a broadcast channel.
- communication component 716 Also included is a Near Field Communication (NFC) module to facilitate short range communication.
- NFC Near Field Communication
- the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
- RFID radio frequency identification
- IrDA infrared data association
- UWB ultra-wideband
- Bluetooth Bluetooth
- apparatus 700 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation for performing the feature extraction method described above.
- ASICs application specific integrated circuits
- DSPs digital signal processors
- DSPDs digital signal processing devices
- PLDs programmable logic devices
- FPGA field programmable A gate array
- controller microcontroller, microprocessor or other electronic component implementation for performing the feature extraction method described above.
- non-transitory computer readable storage medium comprising instructions, such as a memory 704 comprising instructions executable by processor 718 of apparatus 700 to perform the feature extraction method described above.
- the non-transitory computer readable storage medium can be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, and an optical data storage device.
Abstract
Description
Claims (15)
- 一种特征提取方法,其特征在于,所述方法包括:将图像划分为若干个块,每个所述块包括若干个单元格;将每个所述单元格从空间域转化为频率域;提取所述图像在所述频率域中的方向梯度直方图HOG特征。
- 根据权利要求1所述的方法,其特征在于,所述将每个所述单元格从空间域转化为频率域,包括:对每个所述单元格进行离散余弦变换DCT。
- 根据权利要求1所述的方法,其特征在于,所述将每个所述单元格从空间域转化为频率域,包括:对每个所述单元格进行离散傅里叶变换DFT。
- 根据权利要求2所述的方法,其特征在于,所述提取所述图像在所述频率域中的方向梯度直方图HOG特征,包括:计算所述频率域中每个所述单元格的梯度大小和梯度方向,得到每个所述单元格的描述子;统计所述频率域中每个所述块内的各个所述描述子,得到每个所述块的HOG特征;统计所述图像在所述频率域中各个所述块的HOG特征,得到所述图像的HOG特征。
- 根据权利要求4所述的方法,其特征在于,所述统计所述图像中各个所述块的HOG特征,得到所述图像的HOG特征,包括:将所述图像中各个所述块的HOG特征串联成一个矩阵,得到所述图像的HOG特征,所述矩阵的每一列为一个所述块的HOG特征。
- 根据权利要求4所述的方法,其特征在于,所述统计所述图像中各个所述块的HOG特征,得到所述图像的HOG特征,包括:将所述图像中每个所述块的HOG特征由初始的L*1维向量调整为M*N的 矩阵,每个所述块包括M*N个像素,L=M*N;根据每个所述块的调整后的所述HOG特征和每个所述块在所述图像中的对应位置,得到所述图像的HOG特征。
- 根据权利要求1至6任一所述的方法,其特征在于,所述方法,还包括:将所述图像进行归一化处理,得到预定尺寸大小的所述图像。
- 一种特征提取装置,其特征在于,所述装置包括:划分模块,被配置为将图像划分为若干个块,每个所述块包括若干个单元格;转化模块,被配置为将每个所述单元格从空间域转化为频率域;提取模块,被配置为提取所述图像在所述频率域中的方向梯度直方图HOG特征。
- 根据权利要求8所述的装置,其特征在于,所述转化模块,被配置为对每个所述单元格进行离散余弦变换DCT。
- 根据权利要求8所述的装置,其特征在于,所述转化模块,被配置为对每个所述单元格进行离散傅里叶变换DFT。
- 根据权利要求9所述的装置,其特征在于,所述提取模块,包括:计算子模块,被配置为计算所述频率域中每个所述单元格的梯度大小和梯度方向,得到每个所述单元格的描述子;第一统计子模块,被配置为统计所述频率域中每个所述块内的各个所述描述子,得到每个所述块的HOG特征;第二统计子模块,被配置为统计所述图像在所述频率域中各个所述块的HOG特征,得到所述图像的HOG特征。
- 根据权利要求11所述的装置,其特征在于,所述第二统计子模块,被配置为将所述图像中各个所述块的HOG特征串联成一个矩阵,得到所述图像的 HOG特征,所述矩阵的每一列为一个所述块的HOG特征。
- 根据权利要求11所述的装置,其特征在于,所述第二统计子模块,包括:调整子模块,被配置为将所述图像中每个所述块的HOG特征由初始的L*1维向量调整为M*N的矩阵,每个所述块包括M*N个像素,L=M*N;特征提取子模块,被配置为根据每个所述块的调整后的所述HOG特征和每个所述块在所述图像中的对应位置,得到所述图像的HOG特征。
- 根据权利要求8至13任一所述的装置,其特征在于,所述装置,还包括:处理模块,被配置为将所述图像进行归一化处理,得到预定尺寸大小的所述图像。
- 一种特征提取装置,其特征在于,所述装置包括:处理器;用于存储所述处理器可执行指令的存储器;其中,所述处理器被配置为:将图像划分为若干个块,每个所述块包括若干个单元格;将每个所述单元格从空间域转化为频率域;提取所述图像在所述频率域中的方向梯度直方图HOG特征。
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2017552215A JP6378453B2 (ja) | 2015-11-25 | 2015-12-29 | 特徴抽出方法及び装置 |
KR1020167005590A KR101754046B1 (ko) | 2015-11-25 | 2015-12-29 | 특징 추출 방법 및 장치 |
MX2016003738A MX2016003738A (es) | 2015-11-25 | 2015-12-29 | Metodo y dispositivo para extraer una caracteristica. |
RU2016110721A RU2632578C2 (ru) | 2015-11-25 | 2015-12-29 | Способ и устройство выделения характеристики |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510827886.1A CN105654093B (zh) | 2015-11-25 | 2015-11-25 | 特征提取方法及装置 |
CN201510827886.1 | 2015-11-25 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2017088249A1 true WO2017088249A1 (zh) | 2017-06-01 |
Family
ID=56482145
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2015/099310 WO2017088249A1 (zh) | 2015-11-25 | 2015-12-29 | 特征提取方法及装置 |
Country Status (8)
Country | Link |
---|---|
US (1) | US10297015B2 (zh) |
EP (1) | EP3173976A1 (zh) |
JP (1) | JP6378453B2 (zh) |
KR (1) | KR101754046B1 (zh) |
CN (1) | CN105654093B (zh) |
MX (1) | MX2016003738A (zh) |
RU (1) | RU2632578C2 (zh) |
WO (1) | WO2017088249A1 (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105654093B (zh) | 2015-11-25 | 2018-09-25 | 小米科技有限责任公司 | 特征提取方法及装置 |
CN107451583A (zh) * | 2017-08-03 | 2017-12-08 | 四川长虹电器股份有限公司 | 票据图像特征提取的方法 |
CN107633226B (zh) * | 2017-09-19 | 2021-12-24 | 北京师范大学珠海分校 | 一种人体动作跟踪特征处理方法 |
CN107832667A (zh) * | 2017-10-11 | 2018-03-23 | 哈尔滨理工大学 | 一种基于深度学习的人脸识别方法 |
CN107516094A (zh) * | 2017-10-12 | 2017-12-26 | 北京军秀咨询有限公司 | 一种基于人脸图像处理的人才测评方法 |
CN111243088A (zh) * | 2020-01-08 | 2020-06-05 | 长春工程学院 | 工程地质勘察中的真三维航空遥感地质解译方法及系统 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102254185A (zh) * | 2011-07-21 | 2011-11-23 | 西安电子科技大学 | 基于对比度敏感函数的背景杂波量化方法 |
US20120099790A1 (en) * | 2010-10-20 | 2012-04-26 | Electronics And Telecommunications Research Institute | Object detection device and system |
US20140169663A1 (en) * | 2012-12-19 | 2014-06-19 | Futurewei Technologies, Inc. | System and Method for Video Detection and Tracking |
CN103903238A (zh) * | 2014-03-21 | 2014-07-02 | 西安理工大学 | 图像特征的显著结构和相关结构融合方法 |
CN105046224A (zh) * | 2015-07-16 | 2015-11-11 | 东华大学 | 基于分块自适应加权梯度方向直方图特征的人脸识别方法 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0757077A (ja) | 1993-08-13 | 1995-03-03 | Ricoh Co Ltd | 画像処理装置 |
JPH1056570A (ja) * | 1996-08-12 | 1998-02-24 | Fuji Xerox Co Ltd | 画像処理装置 |
US8625861B2 (en) * | 2008-05-15 | 2014-01-07 | International Business Machines Corporation | Fingerprint representation using gradient histograms |
RU2461017C1 (ru) * | 2011-04-15 | 2012-09-10 | Государственное образовательное учреждение высшего профессионального образования "Военная академия войсковой противовоздушной обороны Вооруженных Сил Российской Федерации" имени Маршала Советского Союза А.М. Василевского | Способ обнаружения точечных тепловых объектов на сложном атмосферном фоне |
ITVI20120041A1 (it) * | 2012-02-22 | 2013-08-23 | St Microelectronics Srl | Rilevazione di caratteristiche di un'immagine |
AU2013312495B2 (en) | 2012-09-05 | 2019-03-21 | Element, Inc. | Biometric authentication in connection with camera-equipped devices |
KR101407070B1 (ko) | 2012-09-28 | 2014-06-12 | 한국전자통신연구원 | 영상기반 사람 검출을 위한 특징 추출 방법 및 장치 |
TWI475495B (zh) | 2013-02-04 | 2015-03-01 | Wistron Corp | 圖像的識別方法、電子裝置與電腦程式產品 |
CN104268528B (zh) * | 2014-09-28 | 2017-10-17 | 中智科创机器人有限公司 | 一种人群聚集区域检测方法和装置 |
CN104866865B (zh) * | 2015-05-11 | 2018-03-16 | 西南交通大学 | 一种基于dhog和离散余弦变换的接触网平衡线故障检测方法 |
CN105654093B (zh) * | 2015-11-25 | 2018-09-25 | 小米科技有限责任公司 | 特征提取方法及装置 |
-
2015
- 2015-11-25 CN CN201510827886.1A patent/CN105654093B/zh active Active
- 2015-12-29 RU RU2016110721A patent/RU2632578C2/ru active
- 2015-12-29 KR KR1020167005590A patent/KR101754046B1/ko active IP Right Grant
- 2015-12-29 MX MX2016003738A patent/MX2016003738A/es active IP Right Grant
- 2015-12-29 JP JP2017552215A patent/JP6378453B2/ja active Active
- 2015-12-29 WO PCT/CN2015/099310 patent/WO2017088249A1/zh active Application Filing
-
2016
- 2016-11-17 EP EP16199236.7A patent/EP3173976A1/en not_active Ceased
- 2016-11-23 US US15/359,816 patent/US10297015B2/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120099790A1 (en) * | 2010-10-20 | 2012-04-26 | Electronics And Telecommunications Research Institute | Object detection device and system |
CN102254185A (zh) * | 2011-07-21 | 2011-11-23 | 西安电子科技大学 | 基于对比度敏感函数的背景杂波量化方法 |
US20140169663A1 (en) * | 2012-12-19 | 2014-06-19 | Futurewei Technologies, Inc. | System and Method for Video Detection and Tracking |
CN103903238A (zh) * | 2014-03-21 | 2014-07-02 | 西安理工大学 | 图像特征的显著结构和相关结构融合方法 |
CN105046224A (zh) * | 2015-07-16 | 2015-11-11 | 东华大学 | 基于分块自适应加权梯度方向直方图特征的人脸识别方法 |
Also Published As
Publication number | Publication date |
---|---|
MX2016003738A (es) | 2018-06-22 |
RU2016110721A (ru) | 2017-09-28 |
US10297015B2 (en) | 2019-05-21 |
RU2632578C2 (ru) | 2017-10-06 |
CN105654093B (zh) | 2018-09-25 |
KR101754046B1 (ko) | 2017-07-04 |
US20170148147A1 (en) | 2017-05-25 |
JP2018504729A (ja) | 2018-02-15 |
JP6378453B2 (ja) | 2018-08-22 |
CN105654093A (zh) | 2016-06-08 |
KR20170074214A (ko) | 2017-06-29 |
EP3173976A1 (en) | 2017-05-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2017088249A1 (zh) | 特征提取方法及装置 | |
WO2017088266A1 (zh) | 图片处理方法及装置 | |
EP3032821B1 (en) | Method and device for shooting a picture | |
US9959484B2 (en) | Method and apparatus for generating image filter | |
WO2017088250A1 (zh) | 特征提取方法及装置 | |
WO2021051949A1 (zh) | 一种图像处理方法及装置、电子设备和存储介质 | |
TWI702544B (zh) | 圖像處理方法、電子設備和電腦可讀儲存介質 | |
WO2017215224A1 (zh) | 指纹录入提示方法和装置 | |
WO2017020476A1 (zh) | 关联用户的确定方法及装置 | |
WO2017143776A1 (zh) | 图片类型的识别方法及装置 | |
WO2015196715A1 (zh) | 图像重定位方法、装置及终端 | |
WO2017000491A1 (zh) | 获取虹膜图像的方法、装置及红膜识别设备 | |
CN109034150B (zh) | 图像处理方法及装置 | |
WO2017088248A1 (zh) | 特征提取方法及装置 | |
WO2017088259A1 (zh) | 屏幕保护方法及装置 | |
US9665925B2 (en) | Method and terminal device for retargeting images | |
EP3088941A1 (en) | Liquid crystal display, method and device for measuring pressure | |
CN111985280B (zh) | 图像处理方法及装置 | |
US20140301651A1 (en) | Method and device for performing spatial filtering process on image |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 2017552215 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 20167005590 Country of ref document: KR Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2016/003738 Country of ref document: MX |
|
ENP | Entry into the national phase |
Ref document number: 2016110721 Country of ref document: RU Kind code of ref document: A |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15909160 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15909160 Country of ref document: EP Kind code of ref document: A1 |