WO2019095997A1 - Image recognition method and device, computer device and computer-readable storage medium - Google Patents

Image recognition method and device, computer device and computer-readable storage medium Download PDF

Info

Publication number
WO2019095997A1
WO2019095997A1 PCT/CN2018/112759 CN2018112759W WO2019095997A1 WO 2019095997 A1 WO2019095997 A1 WO 2019095997A1 CN 2018112759 W CN2018112759 W CN 2018112759W WO 2019095997 A1 WO2019095997 A1 WO 2019095997A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
database
region
query
query image
Prior art date
Application number
PCT/CN2018/112759
Other languages
French (fr)
Chinese (zh)
Inventor
杨茜
牟永强
Original Assignee
深圳云天励飞技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳云天励飞技术有限公司 filed Critical 深圳云天励飞技术有限公司
Publication of WO2019095997A1 publication Critical patent/WO2019095997A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/56Extraction of image or video features relating to colour

Definitions

  • a first aspect of the present application provides an image recognition method, the method comprising:
  • Whether the query image matches the database image is determined according to the cluster center of each region of the query image and the database image.
  • the query image and the database image are character images
  • the zoning of the query image and the database image includes:
  • the query image and the database image are respectively divided into upper and lower areas according to the character image in the query image and the database image, wherein the upper area corresponds to the upper body of the character, and the lower area corresponds to the lower body of the character.
  • determining whether the query image matches the database image according to the cluster center of each region of the query image and the database image includes:
  • the Mahalanobis distance of the cluster center of each region corresponding to the database image is calculated.
  • Figure 2 is a logarithmic versus RGB coordinate distribution of an image.
  • the image recognition method specifically includes the following steps:
  • the query image is an image that needs to be identified or matched
  • the database image is an image in a pre-established image library.
  • the image recognition method compares the query image with the database image to determine whether the query image matches the database image to confirm whether the content in the query image is consistent with the content in the database image. For example, when performing pedestrian recognition, the pedestrian image captured by the camera on the road is a query image, and the portrait library image of the traffic management system is a database image, and whether the pedestrian image and the portrait library image match are determined according to the similarity coefficient between the pedestrian image and the portrait library image.
  • the person in the pedestrian image is considered to be a character in the portrait library image; otherwise, if it does not match, the person in the non-portrait library image in the pedestrian image is considered to be able to image the pedestrian and another portrait library image. Identify.
  • the image recognition method can be applied to various fields such as video surveillance, product detection, medical diagnosis, and the like.
  • the present invention can be utilized for pedestrian identification, driver identification, vehicle identification, and the like.
  • clustering pixel points of the upper region R1 and the lower region R2 of the query image to obtain a cluster center (x 1 , y 1 ) of the upper region R1 of the query image and a cluster center (x 2 of the lower region R2 of the query image) y 2 ); clustering the pixel points of the upper region R1' and the lower region R2' of the database image to obtain the cluster center (x 1 ', y 1 ') and the lower region R2' of the upper region R1' of the query image Clustering center (x 2 ', y 2 ').
  • the distance between the query center and the cluster center of each region corresponding to the database image may be calculated, and whether the query image matches the database image is determined according to the distance between the query image and the cluster center of each region corresponding to the database image.
  • the Euclidean distance of the cluster center of each region corresponding to the database image of the query image may be calculated, and the query image is determined according to the Euclidean distance of the cluster center of each region corresponding to the query image and the database image. Whether the database images match.
  • the query image includes a cluster center (x1, y1) of the upper region R1 and a cluster center (x2, y2) of the lower region R2
  • the database image includes a cluster center (x1', y1') of the upper region R1'
  • the cluster center (x2', y2') of the lower region R2' is calculated as the cluster center (x1, y1) of the upper region R1 of the query image and the cluster center (x1', y1 of the upper region R1' of the database image.
  • Euclidean distance And the Euclidean distance between the cluster center (x2, y2) of the lower region R2 of the query image and the cluster center (x2', y2') of the lower region R2' of the database image
  • the Manhattan distance (ie, the absolute value distance) of the cluster center of each region corresponding to the database image of the query image may be calculated, and the query image is determined according to the Manhattan distance of the cluster center of each region corresponding to the query image and the database image. Whether the database images match.
  • the query image includes a cluster center (x1, y1) of the upper region R1 and a cluster center (x2, y2) of the lower region R2, and the database image includes a cluster center (x1', y1') of the upper region R1' and
  • the cluster center (x2', y2') of the lower region R2' is calculated as the cluster center (x1, y1) of the upper region R1 of the query image and the cluster center (x1', y1 of the upper region R1' of the database image.
  • other distances such as Mahalanobis distances, of the cluster center of each region of the query image corresponding to the database image may be calculated.
  • the distance between the query image and the cluster center of each region corresponding to the database image may be preset, and whether the query image matches the database image is determined according to the operation result.
  • an average distance (eg, (d 1 +d 2 )/2) of the cluster center of each region of the query image corresponding to the database image may be calculated, and each region corresponding to the database image is determined by the query image. Whether the average distance of the cluster center is less than or equal to the preset distance, and if the average distance of the cluster center of each region corresponding to the database image is less than or equal to the preset distance, the query image is judged to match the database image; otherwise If the average distance of the cluster center of each region corresponding to the query image and the database image is greater than the preset distance, it is determined that the query image does not match the database image.
  • the logarithm of the different poses and shooting angles is similar to the RGB coordinate distribution, so the robustness to the pose and the shooting angle is good, and the attitude and the shooting angle are invariant.
  • the logarithmic relative RGB coordinates are the normalized coordinates of the red component Ri and the green component Gi versus the green component Gi, which reduces the influence of illumination on the recognition, and thus is more robust to illumination and has illumination intensity invariance.
  • the logarithmic relative RGB coordinates reduce the three-dimensional coordinates of each pixel to two dimensions, which reduces the computational complexity and improves the recognition speed.
  • the pixel points are clustered according to the logarithmic relative RGB coordinate sub-region, and the pixel points of each region are transformed into a cluster center, which reduces the influence of the accidental error (ie, eliminates the influence of the stray point) and improves The robustness and anti-interference ability of the identification.
  • whether the query image and the database image match are determined according to the cluster center of each region of the query image and the database image, and the calculation amount of the calculation data of the cluster center is small, the operation complexity is low, and the matching result can be quickly obtained. Therefore, the image recognition method of the first embodiment can realize image recognition with fast high robustness and high anti-interference.
  • the area dividing unit 301 is configured to perform area division on the query image and the database image.
  • the query image is an image that needs to be identified or matched
  • the database image is an image in a pre-established image library.
  • the image recognition method compares the query image with the database image to determine whether the query image matches the database image to confirm whether the content in the query image is consistent with the content in the database image. For example, when performing pedestrian recognition, the pedestrian image captured by the camera on the road is a query image, and the portrait library image of the traffic management system is a database image, and whether the pedestrian image and the portrait library image match are determined according to the similarity coefficient between the pedestrian image and the portrait library image.
  • the image recognition method can be applied to various fields such as video surveillance, product detection, medical diagnosis, and the like.
  • the present invention can be utilized for pedestrian identification, driver identification, vehicle identification, and the like.
  • the same division method is adopted.
  • the query image and the database image are each divided into two upper and lower regions or two left and right regions.
  • the image recognition method is used for character recognition (for example, pedestrian recognition), and the query image and the database image are character images, and the query image and the database image may be divided into upper and lower regions according to the character shapes in the image. .
  • the upper area corresponds to the upper body of the character
  • the lower area corresponds to the lower body of the character.
  • the query image is divided into an upper region R1 and a lower region R2
  • the database image is divided into an upper region R1' and a lower region R2'.
  • the characters in the image are upright characters, since the proportions of the upright characters are roughly similar but the postures and actions are different, the division of the upper and lower regions according to the shape of the characters in the image is more robust.
  • the most colorful character costume is usually a jacket, so the character image is divided into upper and lower regions.
  • the query image and the database image may be respectively divided into two regions, and the query image and the database image may each be divided into more than two regions, for example, divided into three regions or four regions.
  • the matching unit 304 is configured to determine whether the query image and the database image match according to the query center and the cluster center of each region of the database image.
  • the sum of the distances of the cluster centers of each region of the query image corresponding to the database image may be calculated, and the cluster center of each region corresponding to the database image of the query image is determined. Whether the sum of the distances is less than or equal to the preset distance, and if the sum of the distances of the cluster centers of the respective regions corresponding to the database image is less than or equal to the preset distance, the query image is judged to match the database image; otherwise, if If the sum of the distances of the cluster centers of the regions corresponding to the database image corresponding to the database image is greater than the preset distance, it is determined that the query image does not match the database image.
  • an average distance (eg, (d 1 +d 2 )/2) of the cluster center of each region of the query image corresponding to the database image may be calculated, and each region corresponding to the database image is determined by the query image. Whether the average distance of the cluster center is less than or equal to the preset distance, and if the average distance of the cluster center of each region corresponding to the database image is less than or equal to the preset distance, the query image is judged to match the database image; otherwise If the average distance of the cluster center of each region corresponding to the query image and the database image is greater than the preset distance, it is determined that the query image does not match the database image.
  • the pixel points are clustered according to the logarithmic relative RGB coordinate sub-region, and the pixel points of each region are transformed into a cluster center, which reduces the influence of the accidental error (ie, eliminates the influence of the spurious point) and improves The robustness and anti-interference ability of the identification.
  • the query image and the database image match are determined according to the cluster center of each region of the query image and the database image, and the calculation amount of the calculation data of the cluster center is small, the operation complexity is low, and the matching result can be quickly obtained. Therefore, the image recognition method of the second embodiment can realize image recognition with fast high robustness and high anti-interference.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Probability & Statistics with Applications (AREA)
  • Databases & Information Systems (AREA)
  • Image Analysis (AREA)

Abstract

An image recognition method. The method comprises: performing region division on a query image and a database image; calculating logarithmic relative RGB coordinates of each pixel point in each region of the query image and the database image; clustering pixels points in each region of the query image and the database image according to the logarithmic relative RGB coordinates of each pixel point in each region of the query image and the database image to obtain a clustering center of each region of the query image and the database image; and determining, according to the clustering center of each region of the query image and the database image, whether the query image matches the database image. Also provided is an image recognition device, a computer device and a readable storage medium. By means of the present invention, image recognition of high robustness and high anti-interference performance can be quickly realized.

Description

图像识别方法及装置、计算机装置和计算机可读存储介质Image recognition method and device, computer device and computer readable storage medium
本申请要求于2017年11月15日提交中国专利局,申请号为201711132067.0、发明名称为“图像识别方法及装置、计算机装置和计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。The present application claims priority to Chinese Patent Application No. 201711132067.0, entitled "Image Recognition Method and Apparatus, Computer Apparatus, and Computer Readable Storage Medium", which is filed on November 15, 2017. This is incorporated herein by reference.
技术领域Technical field
本发明涉及计算机视觉技术领域,具体涉及一种图像识别方法及装置、计算机装置和计算机可读存储介质。The present invention relates to the field of computer vision technology, and in particular, to an image recognition method and apparatus, a computer apparatus, and a computer readable storage medium.
背景技术Background technique
现有的图像识别方法大多存在一定的缺点。举例来说,基于多区域、多特征的识别方法,通常需要组成维数较高的特征向量再进行训练和预测,计算复杂度较高且准确率和鲁棒性易受到影响。使用RGB色彩空间会受到光照、拍摄条件的影响从而造成同一被测物在不同拍摄条件下提取的特征不同,从而影响识别准确性。而基于色彩空间直方图在计算相似性时易受到干扰,且无论是直方图相交法还是距离计算法,其准确率都不是很高,且在多区域计算中复杂度较高。Most existing image recognition methods have certain disadvantages. For example, based on multi-region and multi-feature recognition methods, it is usually necessary to form feature vectors with higher dimensionality and then perform training and prediction. The computational complexity is high and the accuracy and robustness are easily affected. The use of the RGB color space is affected by illumination and shooting conditions, resulting in different features extracted by the same object under different shooting conditions, thereby affecting the recognition accuracy. However, the histogram based on color space is susceptible to interference when calculating similarity, and the accuracy of histogram intersection method or distance calculation method is not very high, and the complexity is high in multi-region calculation.
发明内容Summary of the invention
鉴于以上内容,有必要提出一种图像识别方法及装置、计算机装置和计算机可读存储介质,其可以实现快速高鲁棒性高抗干扰性的图像识别。In view of the above, it is necessary to provide an image recognition method and apparatus, a computer apparatus, and a computer readable storage medium, which can realize image recognition with fast, high robustness and high anti-interference.
本申请的第一方面提供一种图像识别方法,所述方法包括:A first aspect of the present application provides an image recognition method, the method comprising:
对查询图像与数据库图像进行区域划分;Area division of the query image and the database image;
计算查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标;Calculating a logarithmic relative RGB coordinate of each pixel of each region of the query image and the database image;
根据查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标 对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像与数据库图像的每个区域的聚类中心;Clustering pixel points in each region of the query image and the database image according to the logarithm of each pixel of each region of the query image and the database image, and obtaining each region of the query image and the database image Cluster center
根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配。Whether the query image matches the database image is determined according to the cluster center of each region of the query image and the database image.
另一种可能的实现方式中,所述查询图像与数据库图像是人物图像,所述对查询图像与数据库图像进行区域划分包括:In another possible implementation manner, the query image and the database image are character images, and the zoning of the query image and the database image includes:
按照查询图像与数据库图像中的人物形体将查询图像与数据库图像各自划分为上下两个区域,其中上区域对应人物的上半身,下区域对应人物的下半身。The query image and the database image are respectively divided into upper and lower areas according to the character image in the query image and the database image, wherein the upper area corresponds to the upper body of the character, and the lower area corresponds to the lower body of the character.
另一种可能的实现方式中,所述根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配包括:In another possible implementation manner, determining whether the query image matches the database image according to the cluster center of each region of the query image and the database image includes:
计算查询图像与数据库图像对应的每个区域的聚类中心的距离;Calculating a distance of a cluster center of each region of the query image corresponding to the database image;
根据查询图像与数据库图像对应的每个区域的聚类中心的距离确定查询图像与数据库图像是否匹配。Whether the query image matches the database image is determined according to the distance between the query image and the cluster center of each region corresponding to the database image.
另一种可能的实现方式中,所述计算查询图像与数据库图像对应的每个区域的聚类中心的距离包括:In another possible implementation manner, the distance between the calculation query image and the cluster center of each region corresponding to the database image includes:
计算查询图像与数据库图像对应的每个区域的聚类中心的欧氏距离;或者Calculating the Euclidean distance of the cluster center of each region of the query image corresponding to the database image; or
计算查询图像与数据库图像对应的每个区域的聚类中心的曼哈顿距离;或者Calculating the Manhattan distance of the cluster center of each region of the query image corresponding to the database image; or
计算查询图像与数据库图像对应的每个区域的聚类中心的马氏距离。The Mahalanobis distance of the cluster center of each region corresponding to the database image is calculated.
本申请的第二方面提供一种图像识别装置,所述装置包括:A second aspect of the present application provides an image recognition apparatus, the apparatus comprising:
区域划分单元,用于对查询图像与数据库图像进行区域划分;a region dividing unit, configured to perform area division on the query image and the database image;
坐标计算单元,用于计算查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标;a coordinate calculation unit, configured to calculate a logarithmic relative RGB coordinate of each pixel of each region of the query image and the database image;
聚类单元,用于根据查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像与数据库图像的每个区域的聚类中心;a clustering unit, configured to cluster pixel points in each region of the query image and the database image according to a logarithm of each pixel of each region of the query image and the database image to obtain a query image and a database The cluster center of each region of the image;
匹配单元,用于根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配。And a matching unit, configured to determine, according to the query center and the cluster center of each region of the database image, whether the query image and the database image match.
另一种可能的实现方式中,所述查询图像与数据库图像包括人物图像,所述区域划分单元具体用于:In another possible implementation manner, the query image and the database image include a character image, and the area dividing unit is specifically configured to:
按照查询图像与数据库图像中的人物形体将查询图像与数据库图像各自划分为上下两个区域,其中上区域对应人物的上半身,下区域对应人物的下半身。The query image and the database image are respectively divided into upper and lower areas according to the character image in the query image and the database image, wherein the upper area corresponds to the upper body of the character, and the lower area corresponds to the lower body of the character.
另一种可能的实现方式中,所述匹配单元具体用于:In another possible implementation manner, the matching unit is specifically configured to:
计算查询图像与数据库图像对应的每个区域的聚类中心的距离;Calculating a distance of a cluster center of each region of the query image corresponding to the database image;
根据查询图像与数据库图像对应的每个区域的聚类中心的距离确定查询图像与数据库图像是否匹配。Whether the query image matches the database image is determined according to the distance between the query image and the cluster center of each region corresponding to the database image.
另一种可能的实现方式中,所述匹配单元计算查询图像与数据库图像对应的每个区域的聚类中心的距离具体包括:In another possible implementation manner, the matching unit calculates a distance between the cluster center of each region corresponding to the database image corresponding to the database image, and specifically includes:
计算查询图像与数据库图像对应的每个区域的聚类中心的欧氏距离;或者Calculating the Euclidean distance of the cluster center of each region of the query image corresponding to the database image; or
计算查询图像与数据库图像对应的每个区域的聚类中心的曼哈顿距离;或者Calculating the Manhattan distance of the cluster center of each region of the query image corresponding to the database image; or
计算查询图像与数据库图像对应的每个区域的聚类中心的马氏距离。The Mahalanobis distance of the cluster center of each region corresponding to the database image is calculated.
本申请的第三方面提供一种计算机装置,所述计算机装置包括处理器,所述处理器用于执行存储器中存储的计算机程序时实现所述图像识别方法。A third aspect of the present application provides a computer apparatus including a processor that implements the image recognition method when executing a computer program stored in a memory.
本申请的第四方面提供一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现所述图像识别方法。A fourth aspect of the present application provides a computer readable storage medium having stored thereon a computer program that, when executed by a processor, implements the image recognition method.
本发明对查询图像与数据库图像进行区域划分;计算查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标;根据查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像与数据库图像的每个区域的聚类中心;根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配。本发明利用对数相对RGB坐标进行图像识别,不同姿态和拍摄角度 得到的对数相对RGB坐标分布相似,因而对姿态和拍摄角度的鲁棒性较好,具有姿态和拍摄角度不变性。对数相对RGB坐标是红色分量Ri、绿色分量Gi对绿色分量为Gi的归一化坐标,减小了光照对识别的影响,因而对光照的鲁棒性较好,具有光照强度不变性。对数相对RGB坐标将每个像素点的三维坐标减少到二维,减小了运算复杂度,提高识别速度。实施例一根据对数相对RGB坐标分区域对像素点进行聚类,将一个区域的像素点变为一个聚类中心,减小了偶然误差的影响(即消除杂散点的影响),提高了识别的鲁棒性和抗干扰能力。此外,本发明根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配,对于聚类中心的计算数据运算量小,运算复杂度低,可以快速得到匹配结果。因此,本发明可以实现快速高鲁棒性高抗干扰性的图像识别。The invention divides the query image and the database image into regions; calculates the logarithmic relative RGB coordinates of each pixel of each region of the query image and the database image; according to each pixel of each region of the query image and the database image Logarithmic relative RGB coordinates cluster the pixel in each area of the query image and the database image to obtain the cluster center of each region of the query image and the database image; according to each region of the query image and the database image The class center determines if the query image matches the database image. The invention utilizes logarithmic relative RGB coordinates for image recognition, and the logarithm of the different poses and shooting angles is similar to the RGB coordinate distribution, so that the posture and the shooting angle are robust, and the posture and the shooting angle are invariant. The logarithmic relative RGB coordinates are the normalized coordinates of the red component Ri and the green component Gi versus the green component Gi, which reduces the influence of illumination on the recognition, and thus is more robust to illumination and has illumination intensity invariance. The logarithmic relative RGB coordinates reduce the three-dimensional coordinates of each pixel to two dimensions, which reduces the computational complexity and improves the recognition speed. In the first embodiment, the pixel points are clustered according to the logarithmic relative RGB coordinate sub-region, and the pixel points of one region are changed into a cluster center, which reduces the influence of the accidental error (ie, eliminates the influence of the stray point), and improves the Identification robustness and immunity to interference. In addition, the present invention determines whether the query image and the database image match according to the clustering center of each region of the query image and the database image, and the calculation amount for the cluster center is small, the computational complexity is low, and the matching result can be quickly obtained. Therefore, the present invention can realize image recognition with fast high robustness and high anti-interference.
附图说明DRAWINGS
图1是本发明实施例一提供的图像识别方法的流程图。FIG. 1 is a flowchart of an image recognition method according to Embodiment 1 of the present invention.
图2是图像的对数相对RGB坐标分布图。Figure 2 is a logarithmic versus RGB coordinate distribution of an image.
图3是本发明实施例二提供的图像识别装置的结构图。FIG. 3 is a structural diagram of an image recognition apparatus according to Embodiment 2 of the present invention.
图4是本发明实施例三提供的计算机装置的示意图。4 is a schematic diagram of a computer device according to Embodiment 3 of the present invention.
具体实施方式Detailed ways
为了能够更清楚地理解本发明的上述目的、特征和优点,下面结合附图和具体实施例对本发明进行详细描述。需要说明的是,在不冲突的情况下,本申请的实施例及实施例中的特征可以相互组合。The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments. It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict.
在下面的描述中阐述了很多具体细节以便于充分理解本发明,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In the following description, numerous specific details are set forth in the description All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本发明。All technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs, unless otherwise defined. The terminology used in the description of the present invention is for the purpose of describing particular embodiments and is not intended to limit the invention.
优选地,本发明的图像识别方法应用在一个或者多个计算机装置中。所述计算机装置是一种能够按照事先设定或存储的指令,自动进行数值计算和/或信息处理的设备,其硬件包括但不限于微处理器、专用集成电路(Application Specific Integrated Circuit,ASIC)、可编程门阵列(Field-Programmable Gate Array,FPGA)、数字处理器(Digital Signal Processor,DSP)、嵌入式设备等。Preferably, the image recognition method of the present invention is applied in one or more computer devices. The computer device is a device capable of automatically performing numerical calculation and/or information processing according to an instruction set or stored in advance, and the hardware thereof includes but is not limited to a microprocessor and an application specific integrated circuit (ASIC). , Field-Programmable Gate Array (FPGA), Digital Signal Processor (DSP), embedded devices, etc.
所述计算机装置可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。所述计算机装置可以与用户通过键盘、鼠标、遥控器、触摸板或声控设备等方式进行人机交互。The computer device may be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server. The computer device can perform human-computer interaction with the user through a keyboard, a mouse, a remote controller, a touch panel, or a voice control device.
实施例一 Embodiment 1
图1是本发明实施例一提供的图像识别方法的流程图。所述图像识别方法应用于计算机装置。FIG. 1 is a flowchart of an image recognition method according to Embodiment 1 of the present invention. The image recognition method is applied to a computer device.
如图1所示,所述图像识别方法具体包括以下步骤:As shown in FIG. 1, the image recognition method specifically includes the following steps:
101:对查询图像与数据库图像进行区域划分。101: Perform area division on the query image and the database image.
查询图像是需要识别或匹配的图像,数据库图像是预先建立的图像库中的图像。所述图像识别方法是将查询图像与数据库图像进行比较,确定查询图像与数据库图像是否匹配,以确认查询图像中的内容与数据库图像中的内容是否一致。例如,当进行行人识别时,道路上摄像头拍摄到的行人图像为查询图像,交管系统的人像库图像为数据库图像,根据行人图像与人像库图像的相似系数判断行人图像与人像库图像是否匹配。若匹配,则认为行人图像中的人物为人像库图像中的人物;否则,若不匹配,则认为行人图像中的人物非人像库图像中的人物,可以对行人图像和另一张人像库图像进行识别。The query image is an image that needs to be identified or matched, and the database image is an image in a pre-established image library. The image recognition method compares the query image with the database image to determine whether the query image matches the database image to confirm whether the content in the query image is consistent with the content in the database image. For example, when performing pedestrian recognition, the pedestrian image captured by the camera on the road is a query image, and the portrait library image of the traffic management system is a database image, and whether the pedestrian image and the portrait library image match are determined according to the similarity coefficient between the pedestrian image and the portrait library image. If it matches, the person in the pedestrian image is considered to be a character in the portrait library image; otherwise, if it does not match, the person in the non-portrait library image in the pedestrian image is considered to be able to image the pedestrian and another portrait library image. Identify.
数据库图像通常与特定信息(例如个人身份信息)相关联。根据匹配结果, 可以获得查询图像的相关信息(例如个人身份信息)。例如,当进行行人识别时,若行人图像与人像库图像是否匹配,则将人像库图像对应的个人身份信息作为行人图像中人物的个人身份信息。Database images are typically associated with specific information, such as personally identifiable information. According to the matching result, related information (for example, personal identification information) of the query image can be obtained. For example, when performing pedestrian recognition, if the pedestrian image matches the portrait library image, the personal identification information corresponding to the portrait library image is taken as the personal identification information of the person in the pedestrian image.
所述图像识别方法可以应用于各个领域,如视频监控、产品检测、医学诊断等。例如,在交通监控中,可以利用本发明进行行人识别、司机识别、车辆识别等。The image recognition method can be applied to various fields such as video surveillance, product detection, medical diagnosis, and the like. For example, in traffic monitoring, the present invention can be utilized for pedestrian identification, driver identification, vehicle identification, and the like.
对查询图像与数据库图像进行区域划分时,采用相同的划分方法。例如,将查询图像和数据库图像各自划分为上下两个区域或者左右两个区域。When arranging the query image and the database image, the same division method is adopted. For example, the query image and the database image are each divided into two upper and lower regions or two left and right regions.
在本实施例中,所述图像识别方法用于人物识别(例如行人识别),查询图像与数据库图像是人物图像,可以将查询图像与数据库图像按照图像中的人物形体各自划分为上下两个区域。上区域对应人物的上半身,下区域对应人物的下半身。例如,将查询图像划分为上区域R1和下区域R2,将数据库图像划分为上区域R1′和下区域R2′。当图像中的人物为直立人物时,由于直立人物的比例大致类似但姿态和动作不同,根据图像中人物的形体进行上下区域的划分会有更高的鲁棒性。同时,最具颜色特征的人物服装通常为上衣下衣,因此将人物图像划分为上下两个区域。In this embodiment, the image recognition method is used for character recognition (for example, pedestrian recognition), and the query image and the database image are character images, and the query image and the database image may be divided into upper and lower regions according to the character shapes in the image. . The upper area corresponds to the upper body of the character, and the lower area corresponds to the lower body of the character. For example, the query image is divided into an upper region R1 and a lower region R2, and the database image is divided into an upper region R1' and a lower region R2'. When the characters in the image are upright characters, since the proportions of the upright characters are roughly similar but the postures and actions are different, the division of the upper and lower regions according to the shape of the characters in the image is more robust. At the same time, the most colorful character costume is usually a jacket, so the character image is divided into upper and lower regions.
在对人物图像进行两个区域的划分时,可以根据经验值确定划分的位置,例如按照人体上下身的黄金比例进行划分。或者,可以识别人物图像中人物的上装与下装的分界处,从该分界处进行划分。When the two images of the person image are divided, the position of the division can be determined according to the empirical value, for example, according to the golden ratio of the upper and lower body of the human body. Alternatively, the boundary between the top and bottom of the character in the character image can be identified, and the division is performed from the boundary.
可以理解,可以以其他方式对查询图像与数据库图像进行区域划分。例如,可以采用金字塔模型对查询图像与数据库图像进行区域划分。It can be understood that the query image and the database image can be divided into regions in other ways. For example, the pyramid model can be used to divide the query image and the database image into regions.
可以将查询图像与数据库图像各自划分为两个区域,也可以将查询图像与数据库图像各自划分为多于两个区域,例如各自划分为三个区域或四个区域。The query image and the database image may be respectively divided into two regions, and the query image and the database image may each be divided into more than two regions, for example, divided into three regions or four regions.
102:计算查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标。102: Calculate a logarithmic relative RGB coordinate of each pixel of each region of the query image and the database image.
在本实施例中,红色分量为Ri、绿色分量为Gi、蓝色分量为Bi的像素点i的对数相对RGB坐标为(xi,yi),其中
Figure PCTCN2018112759-appb-000001
Figure PCTCN2018112759-appb-000002
可以取以e为底的对数,即
Figure PCTCN2018112759-appb-000003
Figure PCTCN2018112759-appb-000004
或者,可以取以其他值为底的对数,例如取以10为底的对数。
In this embodiment, the logarithm of the pixel point i of the red component is Ri, the green component is Gi, and the blue component is Bi is (xi, yi), wherein
Figure PCTCN2018112759-appb-000001
Figure PCTCN2018112759-appb-000002
You can take the logarithm of e, that is,
Figure PCTCN2018112759-appb-000003
Figure PCTCN2018112759-appb-000004
Alternatively, you can take a logarithm with other values, such as a base 10 logarithm.
Figure PCTCN2018112759-appb-000005
为横轴,
Figure PCTCN2018112759-appb-000006
为纵轴,可以得到查询图像与数据库图像的对数相对RGB坐标分布图。当本发明图像识别方法用于人物识别时,若人物图像中人物上下身服装颜色差异较大,则人物图像的上区域(对应人物的上半身)的像素点对应的对数相对RGB坐标与人物图像的下区域(对应人物的下半身)的对数相对RGB坐标往往分布在两个不同区域,因而通常会得到两个中心的坐标簇。
Take
Figure PCTCN2018112759-appb-000005
For the horizontal axis,
Figure PCTCN2018112759-appb-000006
For the vertical axis, a logarithmic relative RGB coordinate distribution map of the query image and the database image can be obtained. When the image recognition method of the present invention is used for person recognition, if the color difference between the upper and lower body clothing of the character image is large, the logarithm of the upper part of the character image (corresponding to the upper body of the character) corresponds to the RGB coordinate and the character image. The logarithmic relative RGB coordinates of the lower region (corresponding to the lower body of the character) are often distributed in two different regions, so that two central coordinate clusters are usually obtained.
图2是图像的对数相对RGB坐标分布图。图2中,图像划分为R1与R2两个区域(例如查询图像划分为上区域R1和下区域R2),其中,20是区域R1的像素点的对数相对RGB坐标分布,21是区域R2的像素点的对数相对RGB坐标分布。Figure 2 is a logarithmic versus RGB coordinate distribution of an image. In FIG. 2, the image is divided into two regions R1 and R2 (for example, the query image is divided into an upper region R1 and a lower region R2), where 20 is the logarithm of the pixel of the region R1 relative to the RGB coordinate distribution, and 21 is the region R2. The logarithm of the pixel is relative to the RGB coordinate distribution.
使用RGB色彩空间会受到光照、姿态、拍摄角度的影响,从而造成同一被测物在不同条件下提取的特征不同,影响识别准确性。本发明利用对数相对RGB坐标进行图像识别,不同姿态和拍摄角度得到的对数相对RGB坐标分布非常相似,因而对姿态和拍摄角度的鲁棒性较好,具有姿态和拍摄角度不变性。对数相对RGB坐标是红色分量Ri、绿色分量Gi对绿色分量为Gi的归一化坐标,减小了光照和图像质量对识别的影响,具有光照强度不变性。同时,对数相对RGB坐标以0为中心左右对称,具有较好的对称性和平衡性。此外,对数相对RGB坐标将每个像素点的三维坐标减少到二维,减小了运算复杂度,并使得其分布可以用二维图像来表示,为进一步的聚类提供了可能。The use of the RGB color space is affected by the illumination, attitude, and shooting angle, resulting in different features extracted by the same object under different conditions, affecting the recognition accuracy. The invention utilizes logarithmic relative RGB coordinates for image recognition, and the logarithmic relative RGB coordinate distribution obtained by different poses and shooting angles is very similar, so the robustness to the attitude and the shooting angle is good, and the attitude and the shooting angle are invariant. The logarithmic relative RGB coordinates are the normalized coordinates of the red component Ri and the green component Gi versus the green component Gi, which reduces the influence of illumination and image quality on the recognition, and has the invariance of illumination intensity. At the same time, the logarithmic relative RGB coordinates are symmetrical about 0, which has good symmetry and balance. In addition, the logarithmic relative RGB coordinates reduce the three-dimensional coordinates of each pixel to two-dimensional, which reduces the computational complexity and makes its distribution can be represented by two-dimensional images, which provides a possibility for further clustering.
103:根据查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像 与数据库图像的每个区域的聚类中心。103: Clustering pixel points in each region of the query image and the database image according to the logarithm of each pixel of each region of the query image and the database image, and obtaining each of the query image and the database image. The clustering center of the area.
例如,对查询图像的上区域R1和下区域R2的像素点进行聚类,得到查询图像的上区域R1的聚类中心(x 1,y 1)和下区域R2的聚类中心(x 2,y 2);对数据库图像的上区域R1′和下区域R2′的像素点进行聚类,得到查询图像的上区域R1′的聚类中心(x 1′,y 1′)和下区域R2′的聚类中心(x 2′,y 2′)。 For example, clustering pixel points of the upper region R1 and the lower region R2 of the query image to obtain a cluster center (x 1 , y 1 ) of the upper region R1 of the query image and a cluster center (x 2 of the lower region R2 of the query image) y 2 ); clustering the pixel points of the upper region R1' and the lower region R2' of the database image to obtain the cluster center (x 1 ', y 1 ') and the lower region R2' of the upper region R1' of the query image Clustering center (x 2 ', y 2 ').
参阅图2所示,根据区域R1的每个像素点的对数相对RGB坐标对区域R1的像素点进行聚类,得到区域R1的聚类中心22;根据区域R2的每个像素点的对数相对RGB坐标对区域R2的像素点进行聚类,得到区域R2的聚类中心23。Referring to FIG. 2, the pixel points of the region R1 are clustered according to the logarithm of each pixel of the region R1 with respect to the RGB coordinates to obtain the cluster center 22 of the region R1; the logarithm of each pixel according to the region R2 The pixel points of the region R2 are clustered with respect to the RGB coordinates to obtain the cluster center 23 of the region R2.
可以使用GMM(Gaussian Mixture Model,高斯混合模型)或K-Means算法对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像和数据库图像的每个区域的聚类中心。例如,使用聚类中心数为2的高斯混合模型GMM或K-Means算法得到查询图像的上区域R1的聚类中心(x 1,y 1)和下区域R2的聚类中心(x 2,y 2),得到数据库图像的上区域R1′的聚类中心(x 1′,y 1′)和下区域R2′的聚类中心(x 2′,y 2′)。 The GMM (Gaussian Mixture Model) or the K-Means algorithm may be used to cluster the pixel points in each region of the query image and the database image to obtain a cluster center of each region of the query image and the database image. For example, using a Gaussian mixture model GMM or K-Means algorithm with a clustering center number of 2, the cluster center (x 1 , y 1 ) of the upper region R1 of the query image and the cluster center of the lower region R2 (x 2 , y) are obtained. 2 ), the cluster center (x 1 ', y 1 ') of the upper region R1' of the database image and the cluster center (x 2 ', y 2 ') of the lower region R2' are obtained.
还可以使用其他的聚类算法对查询图像与数据库图像的每个区域内的像素点进行聚类。例如,使用DBSCAN(Density-Based Spatial Clustering of Applications with Noise,具有噪声的基于密度的聚类方法)算法对查询图像与数据库图像的每个区域内的像素点进行聚类。Other clustering algorithms can also be used to cluster the query image with pixels in each region of the database image. For example, a DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm is used to cluster pixel points in each region of the query image and the database image.
根据对数相对RGB坐标分区域对像素点进行聚类,可以减小了偶然误差的影响(即消除杂散点的影响),提高了识别的鲁棒性和抗干扰能力。Clustering pixel points according to the logarithmic relative RGB coordinate sub-region can reduce the influence of accidental error (ie, eliminate the influence of spurious points), and improve the robustness and anti-interference ability of the recognition.
104:根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配。104: Determine whether the query image and the database image match according to the cluster center of each region of the query image and the database image.
例如,当进行行人识别时,根据摄像头拍摄到的行人图像与交管系统的人像库图像的相似系数判断行人图像与人像库图像是否匹配。若匹配,则认为行人图像中的人物为人像库图像中的人物;否则,若不匹配,则认为行人图像中 的人物非人像库图像中的人物,可以对行人图像和另一张人像库图像进行识别。For example, when performing pedestrian recognition, it is determined whether the pedestrian image and the portrait library image match based on the similarity coefficient of the pedestrian image captured by the camera and the portrait library image of the traffic control system. If it matches, the person in the pedestrian image is considered to be a character in the portrait library image; otherwise, if it does not match, the person in the non-portrait library image in the pedestrian image is considered to be able to image the pedestrian and another portrait library image. Identify.
可以计算查询图像与数据库图像对应的每个区域的聚类中心的距离,根据查询图像与数据库图像对应的每个区域的聚类中心的距离确定查询图像与数据库图像是否匹配。The distance between the query center and the cluster center of each region corresponding to the database image may be calculated, and whether the query image matches the database image is determined according to the distance between the query image and the cluster center of each region corresponding to the database image.
在本实施例中,可以计算查询图像与数据库图像对应的每个区域的聚类中心的欧氏距离,根据查询图像与数据库图像对应的每个区域的聚类中心的欧氏距离确定查询图像与数据库图像是否匹配。例如,查询图像包括上区域R1的聚类中心(x1,y1)和下区域R2的聚类中心(x2,y2),数据库图像包括上区域R1′的聚类中心(x1′,y1′)和下区域R2′的聚类中心(x2′,y2′),计算得到查询图像的上区域R1的聚类中心(x1,y1)与数据库图像的上区域R1′的聚类中心(x1′,y1′)的欧氏距离
Figure PCTCN2018112759-appb-000007
以及查询图像的下区域R2的聚类中心(x2,y2)与数据库图像的下区域R2′的聚类中心(x2′,y2′)的欧氏距离
Figure PCTCN2018112759-appb-000008
In this embodiment, the Euclidean distance of the cluster center of each region corresponding to the database image of the query image may be calculated, and the query image is determined according to the Euclidean distance of the cluster center of each region corresponding to the query image and the database image. Whether the database images match. For example, the query image includes a cluster center (x1, y1) of the upper region R1 and a cluster center (x2, y2) of the lower region R2, and the database image includes a cluster center (x1', y1') of the upper region R1' and The cluster center (x2', y2') of the lower region R2' is calculated as the cluster center (x1, y1) of the upper region R1 of the query image and the cluster center (x1', y1 of the upper region R1' of the database image. Euclidean distance
Figure PCTCN2018112759-appb-000007
And the Euclidean distance between the cluster center (x2, y2) of the lower region R2 of the query image and the cluster center (x2', y2') of the lower region R2' of the database image
Figure PCTCN2018112759-appb-000008
或者,可以计算查询图像与数据库图像对应的每个区域的聚类中心的曼哈顿距离(即绝对值距离),根据查询图像与数据库图像对应的每个区域的聚类中心的曼哈顿距离确定查询图像与数据库图像是否匹配。例如,查询图像包括上区域R1的聚类中心(x1,y1)和下区域R2的聚类中心(x2,y2),数据库图像包括上区域R1′的聚类中心(x1′,y1′)和下区域R2′的聚类中心(x2′,y2′),计算得到查询图像的上区域R1的聚类中心(x1,y1)与数据库图像的上区域R1′的聚类中心(x1′,y1′)的曼哈顿距离d 1=|x 1-x′ 1|+|y 1-y′1|,以及查询图像的下区域R2的聚类中心(x2,y2)与数据库图像的下区域R2′的聚类中心(x2′,y2′)的曼哈顿距离d 2=|x 2-x′ 2|+|y 2-y′ 2|。 Alternatively, the Manhattan distance (ie, the absolute value distance) of the cluster center of each region corresponding to the database image of the query image may be calculated, and the query image is determined according to the Manhattan distance of the cluster center of each region corresponding to the query image and the database image. Whether the database images match. For example, the query image includes a cluster center (x1, y1) of the upper region R1 and a cluster center (x2, y2) of the lower region R2, and the database image includes a cluster center (x1', y1') of the upper region R1' and The cluster center (x2', y2') of the lower region R2' is calculated as the cluster center (x1, y1) of the upper region R1 of the query image and the cluster center (x1', y1 of the upper region R1' of the database image. ') Manhattan distance d 1 =|x 1 -x' 1 |+|y 1 -y'1|, and the cluster center (x2,y2) of the lower region R2 of the query image and the lower region R2' of the database image The Manhattan distance d 2 =|x 2 -x' 2 |+|y 2 -y' 2 | of the clustering center (x2', y2').
其他的实施例中,可以计算查询图像与数据库图像对应的每个区域的聚类中心的其他距离,例如马氏距离。In other embodiments, other distances, such as Mahalanobis distances, of the cluster center of each region of the query image corresponding to the database image may be calculated.
可以对查询图像与数据库图像对应的每个区域的聚类中心的距离进行预设运算,根据运算结果确定查询图像与数据库图像是否匹配。The distance between the query image and the cluster center of each region corresponding to the database image may be preset, and whether the query image matches the database image is determined according to the operation result.
在一个实施例中,可以计算查询图像与数据库图像对应的每个区域的聚类中心的距离之和(例如d 1+d 2),判断查询图像与数据库图像对应的每个区域的聚类中心的距离之和是否小于或等于预设距离,若查询图像与数据库图像对应的每个区域的聚类中心的距离之和小于或等于预设距离,则判断查询图像与数据库图像匹配;否则,若查询图像与数据库图像对应的每个区域的聚类中心的距离之和大于预设距离,则判断查询图像与数据库图像不匹配。 In one embodiment, the sum of the distances of the cluster centers of each region of the query image corresponding to the database image (eg, d 1 +d 2 ) may be calculated, and the cluster center of each region corresponding to the database image of the query image is determined. Whether the sum of the distances is less than or equal to the preset distance, and if the sum of the distances of the cluster centers of the respective regions corresponding to the database image is less than or equal to the preset distance, the query image is judged to match the database image; otherwise, if If the sum of the distances of the cluster centers of the regions corresponding to the database image corresponding to the database image is greater than the preset distance, it is determined that the query image does not match the database image.
或者,可以判断查询图像与数据库图像对应的每个区域的聚类中心的距离之和是否小于查询图像与其他的数据库图像对应的每个区域的聚类中心的距离之和,若查询图像与数据库图像对应的每个区域的聚类中心的距离之和小于查询图像与其他的数据库图像对应的每个区域的聚类中心的距离之和,则判断查询图像与数据库图像匹配;否则,若查询图像与数据库图像的每个区域对应的聚类中心的距离之和不小于查询图像与其他的数据库图像对应的每个区域的聚类中心的距离之和,则判断查询图像与数据库图像不匹配。Alternatively, it can be determined whether the sum of the distances of the cluster centers of each region corresponding to the query image and the database image is smaller than the sum of the distances between the query image and the cluster center of each region corresponding to the other database images, if the query image and the database The sum of the distances of the cluster centers of each region corresponding to the image is smaller than the sum of the distances between the query images and the cluster centers of each region corresponding to the other database images, and then the query image is judged to match the database image; otherwise, if the image is queried The sum of the distances of the cluster centers corresponding to each region of the database image is not less than the sum of the distances of the cluster centers of the regions corresponding to the query image and the other database images, and then the query image is judged to be mismatched with the database image.
在另一实施例中,可以计算查询图像与数据库图像对应的每个区域的聚类中心的平均距离(例如(d 1+d 2)/2),判断查询图像与数据库图像对应的每个区域的聚类中心的平均距离是否小于或等于预设距离,若查询图像与数据库图像对应的每个区域的聚类中心的平均距离小于或等于预设距离,则判断查询图像与数据库图像匹配;否则,若查询图像与数据库图像对应的每个区域的聚类中心的平均距离大于预设距离,则判断查询图像与数据库图像不匹配。 In another embodiment, an average distance (eg, (d 1 +d 2 )/2) of the cluster center of each region of the query image corresponding to the database image may be calculated, and each region corresponding to the database image is determined by the query image. Whether the average distance of the cluster center is less than or equal to the preset distance, and if the average distance of the cluster center of each region corresponding to the database image is less than or equal to the preset distance, the query image is judged to match the database image; otherwise If the average distance of the cluster center of each region corresponding to the query image and the database image is greater than the preset distance, it is determined that the query image does not match the database image.
或者,可以判断查询图像与数据库图像对应的每个区域的聚类中心的平均距离是否小于查询图像与其他的数据库图像对应的每个区域的聚类中心的平均距离,若查询图像与数据库图像对应的每个区域的聚类中心的平均距离小于查询图像与其他的数据库图像对应的每个区域的聚类中心的平均距离,则判断查询图像与数据库图像匹配;否则,若查询图像与数据库图像对应的每个区域的聚类中心的平均距离不小于查询图像与其他的数据库图像对应的每个区域的聚类中心的平均距离,则判断查询图像与数据库图像不匹配。Alternatively, it can be determined whether the average distance of the cluster center of each region corresponding to the query image and the database image is smaller than the average distance of the cluster center of each region corresponding to the query image and other database images, if the query image corresponds to the database image The average distance of the cluster center of each region is smaller than the average distance of the cluster center of each region corresponding to the query image and other database images, and then the query image is judged to match the database image; otherwise, if the query image corresponds to the database image The average distance of the cluster center of each region is not less than the average distance of the cluster center of each region of the query image corresponding to other database images, and then the query image is judged to not match the database image.
实施例一的图像识别方法对查询图像与数据库图像进行区域划分;计算查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标;根据查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像与数据库图像的每个区域的聚类中心;根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配。实施例一利用对数相对RGB坐标进行图像识别,不同姿态和拍摄角度得到的对数相对RGB坐标分布相似,因而对姿态和拍摄角度的鲁棒性较好,具有姿态和拍摄角度不变性。对数相对RGB坐标是红色分量Ri、绿色分量Gi对绿色分量为Gi的归一化坐标,减小了光照对识别的影响,因而对光照的鲁棒性较好,具有光照强度不变性。对数相对RGB坐标将每个像素点的三维坐标减少到二维,减小了运算复杂度,提高识别速度。实施例一根据对数相对RGB坐标分区域对像素点进行聚类,将每个区域的像素点变换为一个聚类中心,减小了偶然误差的影响(即消除杂散点的影响),提高了识别的鲁棒性和抗干扰能力。此外,实施例一根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配,对于聚类中心的计算数据运算量小,运算复杂度低,可以快速得到匹配结果。因此,实施例一的图像识别方法可以实现快速高鲁棒性高抗干扰性的图像识别。The image recognition method of the first embodiment performs area division on the query image and the database image; calculates the logarithm relative RGB coordinates of each pixel of each region of the query image and the database image; according to each region of the query image and the database image The logarithm of each pixel is compared with the RGB coordinates to cluster the pixel in each region of the query image and the database image to obtain a cluster center of each region of the query image and the database image; according to the query image and the database image The cluster center of each region determines whether the query image matches the database image. In the first embodiment, the logarithmic relative RGB coordinates are used for image recognition. The logarithm of the different poses and shooting angles is similar to the RGB coordinate distribution, so the robustness to the pose and the shooting angle is good, and the attitude and the shooting angle are invariant. The logarithmic relative RGB coordinates are the normalized coordinates of the red component Ri and the green component Gi versus the green component Gi, which reduces the influence of illumination on the recognition, and thus is more robust to illumination and has illumination intensity invariance. The logarithmic relative RGB coordinates reduce the three-dimensional coordinates of each pixel to two dimensions, which reduces the computational complexity and improves the recognition speed. In the first embodiment, the pixel points are clustered according to the logarithmic relative RGB coordinate sub-region, and the pixel points of each region are transformed into a cluster center, which reduces the influence of the accidental error (ie, eliminates the influence of the stray point) and improves The robustness and anti-interference ability of the identification. In addition, in the first embodiment, whether the query image and the database image match are determined according to the cluster center of each region of the query image and the database image, and the calculation amount of the calculation data of the cluster center is small, the operation complexity is low, and the matching result can be quickly obtained. Therefore, the image recognition method of the first embodiment can realize image recognition with fast high robustness and high anti-interference.
实施例二 Embodiment 2
图3为本发明实施例二提供的图像识别装置的结构图。如图3所示,所述图像识别装置10可以包括:区域划分单元301、坐标计算单元302、聚类单元303、匹配单元304。FIG. 3 is a structural diagram of an image recognition apparatus according to Embodiment 2 of the present invention. As shown in FIG. 3, the image recognition apparatus 10 may include an area division unit 301, a coordinate calculation unit 302, a clustering unit 303, and a matching unit 304.
区域划分单元301,用于对查询图像与数据库图像进行区域划分。The area dividing unit 301 is configured to perform area division on the query image and the database image.
查询图像是需要识别或匹配的图像,数据库图像是预先建立的图像库中的图像。所述图像识别方法是将查询图像与数据库图像进行比较,确定查询图像与数据库图像是否匹配,以确认查询图像中的内容与数据库图像中的内容是否 一致。例如,当进行行人识别时,道路上摄像头拍摄到的行人图像为查询图像,交管系统的人像库图像为数据库图像,根据行人图像与人像库图像的相似系数判断行人图像与人像库图像是否匹配。若匹配,则认为行人图像中的人物为人像库图像中的人物;否则,若不匹配,则认为行人图像中的人物非人像库图像中的人物,可以对行人图像和另一张人像库图像进行识别。The query image is an image that needs to be identified or matched, and the database image is an image in a pre-established image library. The image recognition method compares the query image with the database image to determine whether the query image matches the database image to confirm whether the content in the query image is consistent with the content in the database image. For example, when performing pedestrian recognition, the pedestrian image captured by the camera on the road is a query image, and the portrait library image of the traffic management system is a database image, and whether the pedestrian image and the portrait library image match are determined according to the similarity coefficient between the pedestrian image and the portrait library image. If it matches, the person in the pedestrian image is considered to be a character in the portrait library image; otherwise, if it does not match, the person in the non-portrait library image in the pedestrian image is considered to be able to image the pedestrian and another portrait library image. Identify.
数据库图像通常与特定信息(例如个人身份信息)相关联。根据匹配结果,可以获得查询图像的相关信息(例如个人身份信息)。例如,当进行行人识别时,若行人图像与人像库图像是否匹配,则将人像库图像对应的个人身份信息作为行人图像中人物的个人身份信息。Database images are typically associated with specific information, such as personally identifiable information. Based on the matching result, related information (for example, personal identification information) of the query image can be obtained. For example, when performing pedestrian recognition, if the pedestrian image matches the portrait library image, the personal identification information corresponding to the portrait library image is taken as the personal identification information of the person in the pedestrian image.
所述图像识别方法可以应用于各个领域,如视频监控、产品检测、医学诊断等。例如,在交通监控中,可以利用本发明进行行人识别、司机识别、车辆识别等。The image recognition method can be applied to various fields such as video surveillance, product detection, medical diagnosis, and the like. For example, in traffic monitoring, the present invention can be utilized for pedestrian identification, driver identification, vehicle identification, and the like.
对查询图像与数据库图像进行区域划分时,采用相同的划分方法。例如,将查询图像和数据库图像各自划分为上下两个区域或者左右两个区域。When arranging the query image and the database image, the same division method is adopted. For example, the query image and the database image are each divided into two upper and lower regions or two left and right regions.
在本实施例中,所述图像识别方法用于人物识别(例如行人识别),查询图像与数据库图像是人物图像,可以将查询图像与数据库图像按照图像中的人物形体各自划分为上下两个区域。上区域对应人物的上半身,下区域对应人物的下半身。例如,将查询图像划分为上区域R1和下区域R2,将数据库图像划分为上区域R1′和下区域R2′。当图像中的人物为直立人物时,由于直立人物的比例大致类似但姿态和动作不同,根据图像中人物的形体进行上下区域的划分会有更高的鲁棒性。同时,最具颜色特征的人物服装通常为上衣下衣,因此将人物图像划分为上下两个区域。In this embodiment, the image recognition method is used for character recognition (for example, pedestrian recognition), and the query image and the database image are character images, and the query image and the database image may be divided into upper and lower regions according to the character shapes in the image. . The upper area corresponds to the upper body of the character, and the lower area corresponds to the lower body of the character. For example, the query image is divided into an upper region R1 and a lower region R2, and the database image is divided into an upper region R1' and a lower region R2'. When the characters in the image are upright characters, since the proportions of the upright characters are roughly similar but the postures and actions are different, the division of the upper and lower regions according to the shape of the characters in the image is more robust. At the same time, the most colorful character costume is usually a jacket, so the character image is divided into upper and lower regions.
在对人物图像进行两个区域的划分时,可以根据经验值确定划分的位置,例如按照人体上下身的黄金比例进行划分。或者,可以识别人物图像中人物的上装与下装的分界处,从该分界处进行划分。When the two images of the person image are divided, the position of the division can be determined according to the empirical value, for example, according to the golden ratio of the upper and lower body of the human body. Alternatively, the boundary between the top and bottom of the character in the character image can be identified, and the division is performed from the boundary.
可以理解,可以以其他方式对查询图像与数据库图像进行区域划分。例如,可以采用金字塔模型对查询图像与数据库图像进行区域划分。It can be understood that the query image and the database image can be divided into regions in other ways. For example, the pyramid model can be used to divide the query image and the database image into regions.
可以将查询图像与数据库图像各自划分为两个区域,也可以将查询图像与数据库图像各自划分为多于两个区域,例如各自划分为三个区域或四个区域。The query image and the database image may be respectively divided into two regions, and the query image and the database image may each be divided into more than two regions, for example, divided into three regions or four regions.
坐标计算单元302,用于计算查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标。The coordinate calculation unit 302 is configured to calculate a logarithmic relative RGB coordinate of each pixel of each region of the query image and the database image.
在本实施例中,红色分量为Ri、绿色分量为Gi、蓝色分量为Bi的像素点i的对数相对RGB坐标为(xi,yi),其中
Figure PCTCN2018112759-appb-000009
Figure PCTCN2018112759-appb-000010
可以取以e为底的对数,即
Figure PCTCN2018112759-appb-000011
Figure PCTCN2018112759-appb-000012
或者,可以取以其他值为底的对数,例如取以10为底的对数。
In this embodiment, the logarithm of the pixel point i of the red component is Ri, the green component is Gi, and the blue component is Bi is (xi, yi), wherein
Figure PCTCN2018112759-appb-000009
Figure PCTCN2018112759-appb-000010
You can take the logarithm of e, that is,
Figure PCTCN2018112759-appb-000011
Figure PCTCN2018112759-appb-000012
Alternatively, you can take a logarithm with other values, such as a base 10 logarithm.
Figure PCTCN2018112759-appb-000013
为横轴,
Figure PCTCN2018112759-appb-000014
为纵轴,可以得到查询图像与数据库图像的对数相对RGB坐标分布图。当本发明图像识别方法用于人物识别时,若人物图像中人物上下身服装颜色差异较大,则人物图像的上区域(对应人物的上半身)的像素点对应的对数相对RGB坐标与人物图像的下区域(对应人物的下半身)的对数相对RGB坐标往往分布在两个不同区域,因而通常会得到两个中心的坐标簇。
Take
Figure PCTCN2018112759-appb-000013
For the horizontal axis,
Figure PCTCN2018112759-appb-000014
For the vertical axis, a logarithmic relative RGB coordinate distribution map of the query image and the database image can be obtained. When the image recognition method of the present invention is used for person recognition, if the color difference between the upper and lower body clothing of the character image is large, the logarithm of the upper part of the character image (corresponding to the upper body of the character) corresponds to the RGB coordinate and the character image. The logarithmic relative RGB coordinates of the lower region (corresponding to the lower body of the character) are often distributed in two different regions, so that two central coordinate clusters are usually obtained.
图2是图像的对数相对RGB坐标分布图。图2中,图像划分为R1与R2两个区域(例如查询图像划分为上区域R1和下区域R2),其中,20是区域R1的像素点的对数相对RGB坐标分布,21是区域R2的像素点的对数相对RGB坐标分布。Figure 2 is a logarithmic versus RGB coordinate distribution of an image. In FIG. 2, the image is divided into two regions R1 and R2 (for example, the query image is divided into an upper region R1 and a lower region R2), where 20 is the logarithm of the pixel of the region R1 relative to the RGB coordinate distribution, and 21 is the region R2. The logarithm of the pixel is relative to the RGB coordinate distribution.
虽然颜色特征为最具分辨力的特征,但是对照明环境、相机拍摄和空间分布的鲁棒性较差。使用RGB色彩空间会受到光照、拍摄条件的影响,从而造成同一被测物在不同拍摄条件下提取的特征不同,影响识别准确性。本发明利用对数相对RGB坐标进行图像识别,不同姿态和拍摄角度得到的对数相对RGB坐标分布非常相似,因而对姿态和角度的鲁棒性较好,具有拍摄角度不变性。对 数相对RGB坐标是红色分量Ri、绿色分量Gi对绿色分量为Gi的归一化坐标,减小了光照和图像质量对识别的影响,具有一定的光照强度不变性。同时,对数相对RGB坐标以0为中心左右对称,具有较好的对称性和平衡性。此外,对数相对RGB坐标将每个像素点的三维坐标减少到二维,减小了运算复杂度,并使得其分布可以用二维图像来表示,因此不需要使用常用的直方图统计图,为进一步的聚类提供了可能。Although the color feature is the most resolving feature, it is less robust to lighting environments, camera shooting, and spatial distribution. The use of the RGB color space is affected by illumination and shooting conditions, resulting in different features extracted by the same object under different shooting conditions, affecting the recognition accuracy. The invention utilizes logarithmic relative RGB coordinates for image recognition, and the logarithmic relative RGB coordinate distribution obtained by different poses and shooting angles is very similar, so the robustness to the attitude and the angle is better, and the shooting angle is invariant. The logarithmic relative RGB coordinates are the normalized coordinates of the red component Ri and the green component Gi versus the green component Gi, which reduces the influence of illumination and image quality on the recognition, and has a certain intensity of light intensity. At the same time, the logarithmic relative RGB coordinates are symmetrical about 0, which has good symmetry and balance. In addition, the logarithmic relative RGB coordinates reduce the three-dimensional coordinates of each pixel to two dimensions, which reduces the computational complexity and allows its distribution to be represented by a two-dimensional image, so there is no need to use common histogram statistics. It provides the possibility for further clustering.
聚类单元303,用于根据查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像与数据库图像的每个区域的聚类中心。The clustering unit 303 is configured to cluster the pixel in each region of the query image and the database image according to the logarithm of the pixel and the RGB coordinate of each pixel of each region of the database image to obtain a query image and The cluster center of each region of the database image.
例如,对查询图像的上区域R1和下区域R2的像素点进行聚类,得到查询图像的上区域R1的聚类中心(x1,y1)和下区域R2的聚类中心(x2,y2);对数据库图像的上区域R1′和下区域R2′的像素点进行聚类,得到查询图像的上区域R1′的聚类中心(x1′,y 1′)和下区域R2′的聚类中心(x 2′,y 2′)。 For example, clustering pixel points of the upper region R1 and the lower region R2 of the query image to obtain a cluster center (x1, y1) of the upper region R1 of the query image and a cluster center (x2, y2) of the lower region R2; Clustering the pixel points of the upper region R1' and the lower region R2' of the database image to obtain the cluster center (x1', y 1 ') of the upper region R1' of the query image and the cluster center of the lower region R2' ( x 2 ', y 2 ').
参阅图2所示,根据区域R1的每个像素点的对数相对RGB坐标对区域R1的像素点进行聚类,得到区域R1的聚类中心22;根据区域R2的每个像素点的对数相对RGB坐标对区域R2的像素点进行聚类,得到区域R2的聚类中心23。Referring to FIG. 2, the pixel points of the region R1 are clustered according to the logarithm of each pixel of the region R1 with respect to the RGB coordinates to obtain the cluster center 22 of the region R1; the logarithm of each pixel according to the region R2 The pixel points of the region R2 are clustered with respect to the RGB coordinates to obtain the cluster center 23 of the region R2.
可以使用GMM(Gaussian Mixture Model,高斯混合模型)或K-Means算法对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像和数据库图像的每个区域的聚类中心。例如,使用聚类中心数为2的高斯混合模型GMM或K-Means算法得到查询图像的上区域R1的聚类中心(x 1,y 1)和下区域R2的聚类中心(x 2,y 2),得到数据库图像的上区域R1′的聚类中心(x 1′,y 1′)和下区域R2′的聚类中心(x 2′,y 2′)。 The GMM (Gaussian Mixture Model) or the K-Means algorithm may be used to cluster the pixel points in each region of the query image and the database image to obtain a cluster center of each region of the query image and the database image. For example, using a Gaussian mixture model GMM or K-Means algorithm with a clustering center number of 2, the cluster center (x 1 , y 1 ) of the upper region R1 of the query image and the cluster center of the lower region R2 (x 2 , y) are obtained. 2 ), the cluster center (x 1 ', y 1 ') of the upper region R1' of the database image and the cluster center (x 2 ', y 2 ') of the lower region R2' are obtained.
还可以使用其他的聚类算法对查询图像与数据库图像的每个区域内的像素点进行聚类。例如,使用DBSCAN(Density-Based Spatial Clustering of Applications with Noise,具有噪声的基于密度的聚类方法)算法对查询图像与数据库图像的 每个区域内的像素点进行聚类。Other clustering algorithms can also be used to cluster the query image with pixels in each region of the database image. For example, a DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm is used to cluster pixel points in each region of the query image and the database image.
根据对数相对RGB坐标分区域对像素点进行聚类,可以减小了偶然误差的影响(即消除杂散点的影响),提高了识别的鲁棒性和抗干扰能力。Clustering pixel points according to the logarithmic relative RGB coordinate sub-region can reduce the influence of accidental error (ie, eliminate the influence of spurious points), and improve the robustness and anti-interference ability of the recognition.
匹配单元304,用于根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配。The matching unit 304 is configured to determine whether the query image and the database image match according to the query center and the cluster center of each region of the database image.
例如,当进行行人识别时,根据摄像头拍摄到的行人图像与交管系统的人像库图像的相似系数判断行人图像与人像库图像是否匹配。若匹配,则认为行人图像中的人物为人像库图像中的人物;否则,若不匹配,则认为行人图像中的人物非人像库图像中的人物,可以对行人图像和另一张人像库图像进行识别。For example, when performing pedestrian recognition, it is determined whether the pedestrian image and the portrait library image match based on the similarity coefficient of the pedestrian image captured by the camera and the portrait library image of the traffic control system. If it matches, the person in the pedestrian image is considered to be a character in the portrait library image; otherwise, if it does not match, the person in the non-portrait library image in the pedestrian image is considered to be able to image the pedestrian and another portrait library image. Identify.
可以计算查询图像与数据库图像对应的每个区域的聚类中心的距离,根据查询图像与数据库图像对应的每个区域的聚类中心的距离确定查询图像与数据库图像是否匹配。The distance between the query center and the cluster center of each region corresponding to the database image may be calculated, and whether the query image matches the database image is determined according to the distance between the query image and the cluster center of each region corresponding to the database image.
在本实施例中,可以计算查询图像与数据库图像对应的每个区域的聚类中心的欧氏距离,根据查询图像与数据库图像对应的每个区域的聚类中心的欧氏距离确定查询图像与数据库图像是否匹配。例如,查询图像包括上区域R1的聚类中心(x1,y1)和下区域R2的聚类中心(x2,y2),数据库图像包括上区域R1′的聚类中心(x1′,y1′)和下区域R2′的聚类中心(x2′,y2′),计算得到查询图像的上区域R1的聚类中心(x1,y1)与数据库图像的上区域R1′的聚类中心(x1′,y1′)的欧氏距离
Figure PCTCN2018112759-appb-000015
以及查询图像的下区域R2的聚类中心(x2,y2)与数据库图像的下区域R2′的聚类中心(x2′,y2′)的欧氏距离
Figure PCTCN2018112759-appb-000016
In this embodiment, the Euclidean distance of the cluster center of each region corresponding to the database image of the query image may be calculated, and the query image is determined according to the Euclidean distance of the cluster center of each region corresponding to the query image and the database image. Whether the database images match. For example, the query image includes a cluster center (x1, y1) of the upper region R1 and a cluster center (x2, y2) of the lower region R2, and the database image includes a cluster center (x1', y1') of the upper region R1' and The cluster center (x2', y2') of the lower region R2' is calculated as the cluster center (x1, y1) of the upper region R1 of the query image and the cluster center (x1', y1 of the upper region R1' of the database image. Euclidean distance
Figure PCTCN2018112759-appb-000015
And the Euclidean distance between the cluster center (x2, y2) of the lower region R2 of the query image and the cluster center (x2', y2') of the lower region R2' of the database image
Figure PCTCN2018112759-appb-000016
或者,可以计算查询图像与数据库图像对应的每个区域的聚类中心的曼哈顿距离(即绝对值距离),根据查询图像与数据库图像对应的每个区域的聚类中心的曼哈顿距离确定查询图像与数据库图像是否匹配。例如,查询图像包括上区域R1的聚类中心(x1,y1)和下区域R2的聚类中心(x2,y2),数据库 图像包括上区域R1′的聚类中心(x1′,y1′)和下区域R2′的聚类中心(x2′,y2′),计算得到查询图像的上区域R1的聚类中心(x1,y1)与数据库图像的上区域R1′的聚类中心(x1′,y1′)的曼哈顿距离d 1=|x 1-x′ 1|+|y 1-y′ 1|,以及查询图像的下区域R2的聚类中心(x2,y2)与数据库图像的下区域R2′的聚类中心(x2′,y2′)的曼哈顿距离d 2=|x 2-x′ 2|+|y 2-y′ 2|。 Alternatively, the Manhattan distance (ie, the absolute value distance) of the cluster center of each region corresponding to the database image of the query image may be calculated, and the query image is determined according to the Manhattan distance of the cluster center of each region corresponding to the query image and the database image. Whether the database images match. For example, the query image includes a cluster center (x1, y1) of the upper region R1 and a cluster center (x2, y2) of the lower region R2, and the database image includes a cluster center (x1', y1') of the upper region R1' and The cluster center (x2', y2') of the lower region R2' is calculated as the cluster center (x1, y1) of the upper region R1 of the query image and the cluster center (x1', y1 of the upper region R1' of the database image. ') Manhattan distance d 1 =|x 1 -x' 1 |+|y 1 -y' 1 |, and the cluster center (x2,y2) of the lower region R2 of the query image and the lower region R2' of the database image The Manhattan distance d 2 =|x 2 -x' 2 |+|y 2 -y' 2 | of the clustering center (x2', y2').
其他的实施例中,可以计算查询图像与数据库图像对应的每个区域的聚类中心的其他距离,例如马氏距离。In other embodiments, other distances, such as Mahalanobis distances, of the cluster center of each region of the query image corresponding to the database image may be calculated.
可以对查询图像与数据库图像对应的每个区域的聚类中心的距离进行预设运算,根据运算结果确定查询图像与数据库图像是否匹配。The distance between the query image and the cluster center of each region corresponding to the database image may be preset, and whether the query image matches the database image is determined according to the operation result.
在一个实施例中,可以计算查询图像与数据库图像对应的每个区域的聚类中心的距离之和(例如d 1+d 2),判断查询图像与数据库图像对应的每个区域的聚类中心的距离之和是否小于或等于预设距离,若查询图像与数据库图像对应的每个区域的聚类中心的距离之和小于或等于预设距离,则判断查询图像与数据库图像匹配;否则,若查询图像与数据库图像对应的每个区域的聚类中心的距离之和大于预设距离,则判断查询图像与数据库图像不匹配。 In one embodiment, the sum of the distances of the cluster centers of each region of the query image corresponding to the database image (eg, d 1 +d 2 ) may be calculated, and the cluster center of each region corresponding to the database image of the query image is determined. Whether the sum of the distances is less than or equal to the preset distance, and if the sum of the distances of the cluster centers of the respective regions corresponding to the database image is less than or equal to the preset distance, the query image is judged to match the database image; otherwise, if If the sum of the distances of the cluster centers of the regions corresponding to the database image corresponding to the database image is greater than the preset distance, it is determined that the query image does not match the database image.
或者,可以判断查询图像与数据库图像对应的每个区域的聚类中心的距离之和是否小于查询图像与其他的数据库图像对应的每个区域的聚类中心的距离之和,若查询图像与数据库图像对应的每个区域的聚类中心的距离之和小于查询图像与其他的数据库图像对应的每个区域的聚类中心的距离之和,则判断查询图像与数据库图像匹配;否则,若查询图像与数据库图像的每个区域对应的聚类中心的距离之和不小于查询图像与其他的数据库图像对应的每个区域的聚类中心的距离之和,则判断查询图像与数据库图像不匹配。Alternatively, it can be determined whether the sum of the distances of the cluster centers of each region corresponding to the query image and the database image is smaller than the sum of the distances between the query image and the cluster center of each region corresponding to the other database images, if the query image and the database The sum of the distances of the cluster centers of each region corresponding to the image is smaller than the sum of the distances between the query images and the cluster centers of each region corresponding to the other database images, and then the query image is judged to match the database image; otherwise, if the image is queried The sum of the distances of the cluster centers corresponding to each region of the database image is not less than the sum of the distances of the cluster centers of the regions corresponding to the query image and the other database images, and then the query image is judged to be mismatched with the database image.
在另一实施例中,可以计算查询图像与数据库图像对应的每个区域的聚类中心的平均距离(例如(d 1+d 2)/2),判断查询图像与数据库图像对应的每个区域的聚类中心的平均距离是否小于或等于预设距离,若查询图像与数据库图 像对应的每个区域的聚类中心的平均距离小于或等于预设距离,则判断查询图像与数据库图像匹配;否则,若查询图像与数据库图像对应的每个区域的聚类中心的平均距离大于预设距离,则判断查询图像与数据库图像不匹配。 In another embodiment, an average distance (eg, (d 1 +d 2 )/2) of the cluster center of each region of the query image corresponding to the database image may be calculated, and each region corresponding to the database image is determined by the query image. Whether the average distance of the cluster center is less than or equal to the preset distance, and if the average distance of the cluster center of each region corresponding to the database image is less than or equal to the preset distance, the query image is judged to match the database image; otherwise If the average distance of the cluster center of each region corresponding to the query image and the database image is greater than the preset distance, it is determined that the query image does not match the database image.
或者,可以判断查询图像与数据库图像对应的每个区域的聚类中心的平均距离是否小于查询图像与其他的数据库图像对应的每个区域的聚类中心的平均距离,若查询图像与数据库图像对应的每个区域的聚类中心的平均距离小于查询图像与其他的数据库图像对应的每个区域的聚类中心的平均距离,则判断查询图像与数据库图像匹配;否则,若查询图像与数据库图像对应的每个区域的聚类中心的平均距离不小于查询图像与其他的数据库图像对应的每个区域的聚类中心的平均距离,则判断查询图像与数据库图像不匹配。Alternatively, it can be determined whether the average distance of the cluster center of each region corresponding to the query image and the database image is smaller than the average distance of the cluster center of each region corresponding to the query image and other database images, if the query image corresponds to the database image The average distance of the cluster center of each region is smaller than the average distance of the cluster center of each region corresponding to the query image and other database images, and then the query image is judged to match the database image; otherwise, if the query image corresponds to the database image The average distance of the cluster center of each region is not less than the average distance of the cluster center of each region of the query image corresponding to other database images, and then the query image is judged to not match the database image.
实施例二的图像识别装置对查询图像与数据库图像进行区域划分;计算查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标;根据查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像与数据库图像的每个区域的聚类中心;根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配。实施例二利用对数相对RGB坐标进行图像识别,不同姿态和拍摄角度得到的对数相对RGB坐标分布相似,因而对姿态和拍摄角度的鲁棒性较好,具有姿态和拍摄角度不变性。对数相对RGB坐标是红色分量Ri、绿色分量Gi对绿色分量为Gi的归一化坐标,减小了光照对识别的影响,因而对光照的鲁棒性较好,具有光照强度不变性。对数相对RGB坐标将每个像素点的三维坐标减少到二维,减小了运算复杂度,提高识别速度。实施例二根据对数相对RGB坐标分区域对像素点进行聚类,将每个区域的像素点变换为一个聚类中心,减小了偶然误差的影响(即消除杂散点的影响),提高了识别的鲁棒性和抗干扰能力。此外,实施例一根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配,对于聚类中心的计算数据运算量小,运算复杂度低,可以快速得到匹配结果。因此,实施例二的图像识别方法 可以实现快速高鲁棒性高抗干扰性的图像识别。The image recognition apparatus of the second embodiment performs area division on the query image and the database image; calculates a logarithmic relative RGB coordinate of each pixel of each area of the query image and the database image; and according to the query image and each area of the database image The logarithm of each pixel is compared with the RGB coordinates to cluster the pixel in each region of the query image and the database image to obtain a cluster center of each region of the query image and the database image; according to the query image and the database image The cluster center of each region determines whether the query image matches the database image. In the second embodiment, the logarithmic relative RGB coordinates are used for image recognition. The logarithm of the different poses and shooting angles is similar to the RGB coordinate distribution, so the robustness to the pose and the shooting angle is better, and the attitude and shooting angle are invariant. The logarithmic relative RGB coordinates are the normalized coordinates of the red component Ri and the green component Gi versus the green component Gi, which reduces the influence of illumination on the recognition, and thus is more robust to illumination and has illumination intensity invariance. The logarithmic relative RGB coordinates reduce the three-dimensional coordinates of each pixel to two dimensions, which reduces the computational complexity and improves the recognition speed. In the second embodiment, the pixel points are clustered according to the logarithmic relative RGB coordinate sub-region, and the pixel points of each region are transformed into a cluster center, which reduces the influence of the accidental error (ie, eliminates the influence of the spurious point) and improves The robustness and anti-interference ability of the identification. In addition, in the first embodiment, whether the query image and the database image match are determined according to the cluster center of each region of the query image and the database image, and the calculation amount of the calculation data of the cluster center is small, the operation complexity is low, and the matching result can be quickly obtained. Therefore, the image recognition method of the second embodiment can realize image recognition with fast high robustness and high anti-interference.
实施例三 Embodiment 3
图4为本发明实施例三提供的计算机装置的示意图。所述计算机装置1包括存储器20、处理器30以及存储在所述存储器20中并可在所述处理器30上运行的计算机程序40,例如图像识别程序。所述处理器30执行所述计算机程序40时实现上述图像识别方法实施例中的步骤,例如图1所示的步骤101~104。或者,所述处理器30执行所述计算机程序40时实现上述装置实施例中各模块/单元的功能,例如图3中的单元301~304。4 is a schematic diagram of a computer device according to Embodiment 3 of the present invention. The computer device 1 includes a memory 20, a processor 30, and a computer program 40, such as an image recognition program, stored in the memory 20 and executable on the processor 30. When the processor 30 executes the computer program 40, the steps in the foregoing image recognition method embodiment are implemented, for example, steps 101-104 shown in FIG. Alternatively, when the processor 30 executes the computer program 40, the functions of the modules/units in the above device embodiments are implemented, such as the units 301-304 in FIG.
示例性的,所述计算机程序40可以被分割成一个或多个模块/单元,所述一个或者多个模块/单元被存储在所述存储器20中,并由所述处理器30执行,以完成本发明。所述一个或多个模块/单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述所述计算机程序40在所述计算机装置1中的执行过程。例如,所述计算机程序40可以被分割成图3中的区域划分单元301、坐标计算单元302、聚类单元303、匹配单元304,各单元具体功能参见实施例二。Illustratively, the computer program 40 can be partitioned into one or more modules/units that are stored in the memory 20 and executed by the processor 30 to complete this invention. The one or more modules/units may be a series of computer program instruction segments capable of performing a particular function for describing the execution of the computer program 40 in the computer device 1. For example, the computer program 40 may be divided into the area dividing unit 301, the coordinate calculating unit 302, the clustering unit 303, and the matching unit 304 in FIG. 3, and the specific functions of each unit are as follows.
所述计算机装置1可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。本领域技术人员可以理解,所述示意图6仅仅是计算机装置1的示例,并不构成对计算机装置1的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如所述计算机装置1还可以包括输入输出设备、网络接入设备、总线等。The computer device 1 may be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server. It will be understood by those skilled in the art that the schematic diagram 6 is merely an example of the computer device 1 and does not constitute a limitation of the computer device 1. It may include more or less components than those illustrated, or may combine some components, or different. The components, such as the computer device 1, may also include input and output devices, network access devices, buses, and the like.
所称处理器30可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器 30也可以是任何常规的处理器等,所述处理器30是所述计算机装置1的控制中心,利用各种接口和线路连接整个计算机装置1的各个部分。The processor 30 may be a central processing unit (CPU), or may be other general-purpose processors, a digital signal processor (DSP), an application specific integrated circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, etc. The general purpose processor may be a microprocessor or the processor 30 may be any conventional processor or the like, and the processor 30 is a control center of the computer device 1, and connects the entire computer device 1 by using various interfaces and lines. Various parts.
所述存储器20可用于存储所述计算机程序40和/或模块/单元,所述处理器30通过运行或执行存储在所述存储器20内的计算机程序和/或模块/单元,以及调用存储在存储器20内的数据,实现所述计算机装置1的各种功能。所述存储器20可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序(比如声音播放功能、图像播放功能等)等;存储数据区可存储根据计算机装置1的使用所创建的数据(比如音频数据、电话本等)等。此外,存储器20可以包括高速随机存取存储器,还可以包括非易失性存储器,例如硬盘、内存、插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)、至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。The memory 20 can be used to store the computer program 40 and/or modules/units by running or executing computer programs and/or modules/units stored in the memory 20, and by calling in memory. The data within 20 implements various functions of the computer device 1. The memory 20 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may be Data (such as audio data, phone book, etc.) created according to the use of the computer device 1 is stored. In addition, the memory 20 may include a high-speed random access memory, and may also include a non-volatile memory such as a hard disk, a memory, a plug-in hard disk, a smart memory card (SMC), and a secure digital (Secure Digital, SD). Card, flash card, at least one disk storage device, flash device, or other volatile solid state storage device.
所述计算机装置1集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明实现上述实施例方法中的全部或部分流程,也可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一计算机可读存储介质中,该计算机程序在被处理器执行时,可实现上述各个方法实施例的步骤。其中,所述计算机程序包括计算机程序代码,所述计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,所述计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。The modules/units integrated by the computer device 1 can be stored in a computer readable storage medium if implemented in the form of a software functional unit and sold or used as a stand-alone product. Based on such understanding, the present invention implements all or part of the processes in the foregoing embodiments, and may also be completed by a computer program to instruct related hardware. The computer program may be stored in a computer readable storage medium. The steps of the various method embodiments described above may be implemented when the program is executed by the processor. Wherein, the computer program comprises computer program code, which may be in the form of source code, object code form, executable file or some intermediate form. The computer readable medium may include any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM). , random access memory (RAM, Random Access Memory), electrical carrier signals, telecommunications signals, and software distribution media. It should be noted that the content contained in the computer readable medium may be appropriately increased or decreased according to the requirements of legislation and patent practice in a jurisdiction, for example, in some jurisdictions, according to legislation and patent practice, computer readable media Does not include electrical carrier signals and telecommunication signals.
在本发明所提供的几个实施例中,应该理解到,所揭露的计算机装置和方法,可以通过其它的方式实现。例如,以上所描述的计算机装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。In the several embodiments provided by the present invention, it should be understood that the disclosed computer apparatus and method may be implemented in other manners. For example, the computer device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division, and the actual implementation may have another division manner.
另外,在本发明各个实施例中的各功能单元可以集成在相同处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在相同单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能模块的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated in the same processing unit, or each unit may exist physically separately, or two or more units may be integrated in the same unit. The above integrated unit can be implemented in the form of hardware or in the form of hardware plus software function modules.
对于本领域技术人员而言,显然本发明不限于上述示范性实施例的细节,而且在不背离本发明的精神或基本特征的情况下,能够以其他的具体形式实现本发明。因此,无论从哪一点来看,均应将实施例看作是示范性的,而且是非限制性的,本发明的范围由所附权利要求而不是上述说明限定,因此旨在将落在权利要求的等同要件的含义和范围内的所有变化涵括在本发明内。不应将权利要求中的任何附图标记视为限制所涉及的权利要求。此外,显然“包括”一词不排除其他单元或步骤,单数不排除复数。计算机装置权利要求中陈述的多个单元或计算机装置也可以由同一个单元或计算机装置通过软件或者硬件来实现。第一,第二等词语用来表示名称,而并不表示任何特定的顺序。It is apparent to those skilled in the art that the present invention is not limited to the details of the above-described exemplary embodiments, and the present invention can be embodied in other specific forms without departing from the spirit or essential characteristics of the invention. Therefore, the present embodiments are to be considered as illustrative and not restrictive, and the scope of the invention is defined by the appended claims instead All changes in the meaning and scope of equivalent elements are included in the present invention. Any reference signs in the claims should not be construed as limiting the claim. In addition, it is to be understood that the word "comprising" does not exclude other elements or steps. A plurality of units or computer devices recited in the computer device claims can also be implemented by the same unit or computer device in software or hardware. The first, second, etc. words are used to denote names and do not denote any particular order.
最后应说明的是,以上实施例仅用以说明本发明的技术方案而非限制,尽管参照较佳实施例对本发明进行了详细说明,本领域的普通技术人员应当理解,可以对本发明的技术方案进行修改或等同替换,而不脱离本发明技术方案的精神和范围。It should be noted that the above embodiments are only for explaining the technical solutions of the present invention and are not intended to be limiting, and the present invention will be described in detail with reference to the preferred embodiments. Modifications or equivalents are made without departing from the spirit and scope of the invention.

Claims (10)

  1. 一种图像识别方法,其特征在于,所述方法包括:An image recognition method, the method comprising:
    对查询图像与数据库图像进行区域划分;Area division of the query image and the database image;
    计算查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标;Calculating a logarithmic relative RGB coordinate of each pixel of each region of the query image and the database image;
    根据查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像与数据库图像的每个区域的聚类中心;Clustering pixel points in each region of the query image and the database image according to the logarithm of each pixel of each region of the query image and the database image, and obtaining each region of the query image and the database image Cluster center
    根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配。Whether the query image matches the database image is determined according to the cluster center of each region of the query image and the database image.
  2. 如权利要求1所述的方法,其特征在于,所述查询图像与数据库图像是人物图像,所述对查询图像与数据库图像进行区域划分包括:The method according to claim 1, wherein the query image and the database image are character images, and the zoning of the query image and the database image comprises:
    按照查询图像与数据库图像中的人物形体将查询图像与数据库图像各自划分为上下两个区域,其中上区域对应人物的上半身,下区域对应人物的下半身。The query image and the database image are respectively divided into upper and lower areas according to the character image in the query image and the database image, wherein the upper area corresponds to the upper body of the character, and the lower area corresponds to the lower body of the character.
  3. 如权利要求1所述的方法,其特征在于,所述根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配包括:The method according to claim 1, wherein the determining whether the query image matches the database image according to the cluster center of each region of the query image and the database image comprises:
    计算查询图像与数据库图像对应的每个区域的聚类中心的距离;Calculating a distance of a cluster center of each region of the query image corresponding to the database image;
    根据查询图像与数据库图像对应的每个区域的聚类中心的距离确定查询图像与数据库图像是否匹配。Whether the query image matches the database image is determined according to the distance between the query image and the cluster center of each region corresponding to the database image.
  4. 如权利要求3所述的方法,其特征在于,所述计算查询图像与数据库图像对应的每个区域的聚类中心的距离包括:The method according to claim 3, wherein the calculating the distance between the query image and the cluster center of each region corresponding to the database image comprises:
    计算查询图像与数据库图像对应的每个区域的聚类中心的欧氏距离;或者Calculating the Euclidean distance of the cluster center of each region of the query image corresponding to the database image; or
    计算查询图像与数据库图像对应的每个区域的聚类中心的曼哈顿距离;或者Calculating the Manhattan distance of the cluster center of each region of the query image corresponding to the database image; or
    计算查询图像与数据库图像对应的每个区域的聚类中心的马氏距离。The Mahalanobis distance of the cluster center of each region corresponding to the database image is calculated.
  5. 一种图像识别装置,其特征在于,所述装置包括:An image recognition device, characterized in that the device comprises:
    区域划分单元,用于对查询图像与数据库图像进行区域划分;a region dividing unit, configured to perform area division on the query image and the database image;
    坐标计算单元,用于计算查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标;a coordinate calculation unit, configured to calculate a logarithmic relative RGB coordinate of each pixel of each region of the query image and the database image;
    聚类单元,用于根据查询图像与数据库图像的每个区域的每个像素点的对数相对RGB坐标对查询图像与数据库图像的每个区域内的像素点进行聚类,得到查询图像与数据库图像的每个区域的聚类中心;a clustering unit, configured to cluster pixel points in each region of the query image and the database image according to a logarithm of each pixel of each region of the query image and the database image to obtain a query image and a database The cluster center of each region of the image;
    匹配单元,用于根据查询图像与数据库图像的每个区域的聚类中心确定查询图像与数据库图像是否匹配。And a matching unit, configured to determine, according to the query center and the cluster center of each region of the database image, whether the query image and the database image match.
  6. 如权利要求5所述的装置,其特征在于,所述查询图像与数据库图像包括人物图像,所述区域划分单元具体用于:The device according to claim 5, wherein the query image and the database image comprise a person image, and the area dividing unit is specifically configured to:
    按照查询图像与数据库图像中的人物形体将查询图像与数据库图像各自划分为上下两个区域,其中上区域对应人物的上半身,下区域对应人物的下半身。The query image and the database image are respectively divided into upper and lower areas according to the character image in the query image and the database image, wherein the upper area corresponds to the upper body of the character, and the lower area corresponds to the lower body of the character.
  7. 如权利要求5所述的装置,其特征在于,所述匹配单元具体用于:The device according to claim 5, wherein the matching unit is specifically configured to:
    计算查询图像与数据库图像对应的每个区域的聚类中心的距离;Calculating a distance of a cluster center of each region of the query image corresponding to the database image;
    根据查询图像与数据库图像对应的每个区域的聚类中心的距离确定查询图像与数据库图像是否匹配。Whether the query image matches the database image is determined according to the distance between the query image and the cluster center of each region corresponding to the database image.
  8. 如权利要求7所述的装置,其特征在于,所述匹配单元计算查询图像与数据库图像对应的每个区域的聚类中心的距离具体包括:The device according to claim 7, wherein the matching unit calculates the distance between the cluster center of each region of the query image corresponding to the database image, and specifically includes:
    计算查询图像与数据库图像对应的每个区域的聚类中心的欧氏距离;或者Calculating the Euclidean distance of the cluster center of each region of the query image corresponding to the database image; or
    计算查询图像与数据库图像对应的每个区域的聚类中心的曼哈顿距离;或者Calculating the Manhattan distance of the cluster center of each region of the query image corresponding to the database image; or
    计算查询图像与数据库图像对应的每个区域的聚类中心的马氏距离。The Mahalanobis distance of the cluster center of each region corresponding to the database image is calculated.
  9. 一种计算机装置,其特征在于:所述计算机装置包括处理器,所述处理 器用于执行存储器中存储的计算机程序时实现如权利要求1-4中任一项所述图像识别方法。A computer apparatus, comprising: a processor, the processor for performing an image recognition method according to any one of claims 1 to 4 when the computer program is stored in a memory.
  10. 一种计算机可读存储介质,其上存储有计算机程序,其特征在于:所述计算机程序被处理器执行时实现如权利要求1-4中任一项所述图像识别方法。A computer readable storage medium having stored thereon a computer program, wherein the computer program is executed by a processor to implement the image recognition method according to any one of claims 1-4.
PCT/CN2018/112759 2017-11-15 2018-10-30 Image recognition method and device, computer device and computer-readable storage medium WO2019095997A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711132067.0A CN107871143B (en) 2017-11-15 2017-11-15 Image-recognizing method and device, computer installation and computer readable storage medium
CN201711132067.0 2017-11-15

Publications (1)

Publication Number Publication Date
WO2019095997A1 true WO2019095997A1 (en) 2019-05-23

Family

ID=61754080

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/112759 WO2019095997A1 (en) 2017-11-15 2018-10-30 Image recognition method and device, computer device and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN107871143B (en)
WO (1) WO2019095997A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111950629A (en) * 2020-08-11 2020-11-17 精英数智科技股份有限公司 Method, device and equipment for detecting confrontation sample
CN112329660A (en) * 2020-11-10 2021-02-05 浙江商汤科技开发有限公司 Scene recognition method and device, intelligent equipment and storage medium

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107895021B (en) * 2017-11-15 2019-12-17 深圳云天励飞技术有限公司 image recognition method and device, computer device and computer readable storage medium
CN107871143B (en) * 2017-11-15 2019-06-28 深圳云天励飞技术有限公司 Image-recognizing method and device, computer installation and computer readable storage medium
CN110689046A (en) * 2019-08-26 2020-01-14 深圳壹账通智能科技有限公司 Image recognition method, image recognition device, computer device, and storage medium
CN116340991B (en) * 2023-02-02 2023-11-07 魔萌动漫文化传播(深圳)有限公司 Big data management method and device for IP gallery material resources and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101794442A (en) * 2010-01-25 2010-08-04 哈尔滨工业大学 Calibration method for extracting illumination-insensitive information from visible images
US20160092736A1 (en) * 2014-09-30 2016-03-31 C/O Canon Kabushiki Kaisha System and method for object re-identification
CN105574515A (en) * 2016-01-15 2016-05-11 南京邮电大学 Pedestrian re-identification method in zero-lap vision field
CN107871143A (en) * 2017-11-15 2018-04-03 深圳云天励飞技术有限公司 Image-recognizing method and device, computer installation and computer-readable recording medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8913829B2 (en) * 2012-06-05 2014-12-16 Tandent Vision Science, Inc. Automatic processing scale estimation for use in an image process
CN105139042A (en) * 2015-09-08 2015-12-09 携程计算机技术(上海)有限公司 Image identification method and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101794442A (en) * 2010-01-25 2010-08-04 哈尔滨工业大学 Calibration method for extracting illumination-insensitive information from visible images
US20160092736A1 (en) * 2014-09-30 2016-03-31 C/O Canon Kabushiki Kaisha System and method for object re-identification
CN105574515A (en) * 2016-01-15 2016-05-11 南京邮电大学 Pedestrian re-identification method in zero-lap vision field
CN107871143A (en) * 2017-11-15 2018-04-03 深圳云天励飞技术有限公司 Image-recognizing method and device, computer installation and computer-readable recording medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111950629A (en) * 2020-08-11 2020-11-17 精英数智科技股份有限公司 Method, device and equipment for detecting confrontation sample
CN112329660A (en) * 2020-11-10 2021-02-05 浙江商汤科技开发有限公司 Scene recognition method and device, intelligent equipment and storage medium
CN112329660B (en) * 2020-11-10 2024-05-24 浙江商汤科技开发有限公司 Scene recognition method and device, intelligent equipment and storage medium

Also Published As

Publication number Publication date
CN107871143B (en) 2019-06-28
CN107871143A (en) 2018-04-03

Similar Documents

Publication Publication Date Title
WO2019095997A1 (en) Image recognition method and device, computer device and computer-readable storage medium
US11703951B1 (en) Gesture recognition systems
Sarma et al. Methods, databases and recent advancement of vision-based hand gesture recognition for hci systems: A review
WO2019095998A1 (en) Image recognition method and device, computer device and computer-readable storage medium
CN106682598B (en) Multi-pose face feature point detection method based on cascade regression
WO2019242416A1 (en) Video image processing method and apparatus, computer readable storage medium and electronic device
WO2022027912A1 (en) Face pose recognition method and apparatus, terminal device, and storage medium.
Ban et al. Face detection based on skin color likelihood
US20190108447A1 (en) Multifunction perceptrons in machine learning environments
US20220375236A1 (en) License plate identification method and system thereof
JP4217664B2 (en) Image processing method and image processing apparatus
CN102667810B (en) Face recognition in digital images
US20090180671A1 (en) Multi-view face recognition method and system
JP2020515983A (en) Target person search method and device, device, program product and medium
US9213897B2 (en) Image processing device and method
WO2021036309A1 (en) Image recognition method and apparatus, computer apparatus, and storage medium
WO2020087922A1 (en) Facial attribute identification method, device, computer device and storage medium
CN110706238B (en) Method and device for segmenting point cloud data, storage medium and electronic equipment
González-Ortega et al. Real-time hands, face and facial features detection and tracking: Application to cognitive rehabilitation tests monitoring
CN112651321A (en) File processing method and device and server
Nasri et al. A novel approach for dynamic hand gesture recognition using contour-based similarity images
Juang et al. Stereo-camera-based object detection using fuzzy color histograms and a fuzzy classifier with depth and shape estimations
WO2023279604A1 (en) Re-identification method, training method for target re-identification network and related product
CN111797862A (en) Task processing method and device, storage medium and electronic equipment
Cai et al. Robust facial expression recognition using RGB-D images and multichannel features

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18879388

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 14/08/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18879388

Country of ref document: EP

Kind code of ref document: A1