WO2021082118A1 - Person re-identification method and apparatus, and terminal and storage medium - Google Patents

Person re-identification method and apparatus, and terminal and storage medium Download PDF

Info

Publication number
WO2021082118A1
WO2021082118A1 PCT/CN2019/119860 CN2019119860W WO2021082118A1 WO 2021082118 A1 WO2021082118 A1 WO 2021082118A1 CN 2019119860 W CN2019119860 W CN 2019119860W WO 2021082118 A1 WO2021082118 A1 WO 2021082118A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
pedestrian
preset
identified
training
Prior art date
Application number
PCT/CN2019/119860
Other languages
French (fr)
Chinese (zh)
Inventor
李国法
黄莉莎
徐刚
谢恒�
赖伟鉴
陈耀昱
Original Assignee
深圳大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳大学 filed Critical 深圳大学
Publication of WO2021082118A1 publication Critical patent/WO2021082118A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/103Static body considered as a whole, e.g. static pedestrian or occupant recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Definitions

  • This application belongs to the field of computer technology, and in particular relates to a method, device, terminal, and storage medium for pedestrian re-identification.
  • Pedestrian re-identification also known as pedestrian re-identification
  • pedestrian re-identification is a technology that uses computer vision technology to determine whether there is a specific pedestrian in an image or video sequence.
  • network recognition models are often used.
  • pedestrians due to the differences between different camera equipment, pedestrians have both rigid and flexible characteristics, and the appearance is easily affected by wearing, scale, occlusion, posture and viewing angle, making the process of pedestrian re-recognition more difficult than the common face recognition process
  • How to improve the accuracy of pedestrian re-identification is a technical problem that needs to be solved urgently.
  • the embodiments of the present application provide a pedestrian re-identification method, device, terminal, and storage medium to improve the accuracy of pedestrian re-identification.
  • the first aspect of the embodiments of the present application provides a pedestrian re-identification method, including:
  • the re-identification result of the pedestrian to be identified is determined based on the target tag information.
  • a second aspect of the embodiments of the present application provides a pedestrian re-identification device, including:
  • An acquiring module configured to acquire a target image frame sequence from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
  • the first determining module is configured to identify the characteristic information of the pedestrian to be identified from the image information, and determine all tag information corresponding to the characteristic information;
  • the re-recognition module is used to perform pedestrian re-recognition on the target image frame sequence and all the tag information using the pre-trained pedestrian re-recognition model, and determine the target tag of the pedestrian to be recognized from all the tag information information;
  • the second determination module is configured to determine the re-identification result of the pedestrian to be identified based on the target tag information.
  • a third aspect of the embodiments of the present application provides a terminal, including a memory, a processor, and a computer program stored in the memory and running on the processor, wherein the processor executes the computer program When realizing the steps of the pedestrian re-identification method described in the first aspect of the above embodiment.
  • the fourth aspect of the embodiments of the present application provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and is characterized in that, when the computer program is executed by a processor, the implementation is as described in the first aspect of the above embodiment. Steps of the pedestrian re-identification method.
  • Fig. 1 is an implementation flowchart of the pedestrian re-identification method provided by the first embodiment of the present application
  • FIG. 2 is a flow chart of the specific implementation of S102 in Figure 1;
  • Fig. 3 is an implementation flowchart of the pedestrian re-identification method provided by the second embodiment of the present application.
  • FIG. 4 is a flowchart of the specific implementation of S304 in Figure 3;
  • Figure 5 is a specific implementation flow chart of S305 in Figure 3;
  • FIG. 6 is a schematic structural diagram of a pedestrian re-identification device provided by an embodiment of the present application.
  • FIG. 7 is a schematic structural diagram of a terminal provided by an embodiment of the present application.
  • the present invention provides a pedestrian re-identification method, which performs pedestrian re-identification based on a pedestrian re-identification network model of a novel loss function, and improves the cross-entropy loss function by increasing the loss of the wrong category, thereby enabling the machine learning model to obtain better classification performance .
  • FIG. 1 it is a flow chart of the implementation of the pedestrian re-identification method provided by the first embodiment of the present application.
  • This embodiment may be implemented by hardware or software of a pedestrian re-identification device.
  • the pedestrian re-identification device may be terminal. The details are as follows:
  • the pre-collected surveillance video stream is collected by a pre-determined surveillance device, such as a video stream collected by a surveillance device in a campus, the surveillance video stream includes consecutive image frames in a time sequence, and the video streams collected in different time periods include The image frame correspondingly contains different target information.
  • a target image frame sequence is acquired from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains the image information of the pedestrian to be identified, and the image information of the pedestrian to be identified includes the image information of the pedestrian to be identified. Recognize pedestrian's facial information, clothing information, body information, etc.
  • S102 Identify feature information of the pedestrian to be identified from the image information, and determine all tag information corresponding to the feature information.
  • the feature information of the pedestrian to be identified includes facial features such as skin condition, facial expression, facial features, appearance features such as clothes color, clothes texture, handbag, backpack, hat, etc., the image area occupied, and the relative position in the image.
  • Location features such as location.
  • the characteristic information has corresponding label information, and the label information is used to identify the characteristic information.
  • the characteristic information is a skin condition, and the corresponding label information is smooth or not smooth.
  • FIG. 2 it is a flowchart of the specific implementation of S102 in FIG. 1. It can be seen from Figure 2 that S102 includes:
  • S1021 Perform feature recognition on the pedestrian to be identified by using the feature information recognition model completed in advance to obtain the feature information of the pedestrian to be identified.
  • the pre-trained feature information recognition model may be a machine learning model with a recognition function, such as a neural network model.
  • the input of the neural network model is the pedestrian to be recognized, and the output is the feature information corresponding to the pedestrian to be recognized.
  • S1022 Calculate the probability value of the feature information belonging to each type of preset label information.
  • the pedestrian to be identified usually has multiple different characteristic information, such as skin condition, clothes color, etc., and different characteristic information corresponds to multiple preset label information, for example, the preset label information corresponding to the clothes color includes red and black. , Yellow, green, etc., and when the feature information is not obvious, multiple preset label information may appear in the recognition result.
  • the recognition result may correspond to two preset labels of black and gray. At this time, it is necessary to further calculate the probability value of the feature information belonging to each type of preset label information.
  • the probability value of the feature information belonging to each type of preset label information can be calculated by using a preset probability normalization formula; the preset probability normalization formula is:
  • p i represents the probability value of the feature information belonging to the i-th type of preset label information
  • K represents the total number of types of preset label information
  • the preset tag information corresponding to the feature information is determined by calculating the probability value that the feature information belongs to the preset tag information. Understandably, the method of calculating the probability value is not limited to using the aforementioned preset probability normalization formula, which is not specifically limited here.
  • the pre-trained pedestrian re-recognition model may be a machine learning model with a recognition function, and the input of the pedestrian re-recognition model is an image frame sequence containing the pedestrian to be recognized and all tags corresponding to the characteristic information of the pedestrian to be recognized Information, output as target tag information of the pedestrian to be identified.
  • the pedestrian to be identified belongs to tag information with the highest probability of each type of preset tag information.
  • S104 Determine a re-identification result of the pedestrian to be identified based on the target tag information.
  • the target tag information matches the preset tag information of the pedestrian to be identified, it is determined that the pedestrian to be identified is a predetermined specific pedestrian. If the tag information for identifying the pedestrian does not match, it is determined that the pedestrian to be identified is not a predetermined specific pedestrian.
  • the pedestrian recovery method includes: acquiring a target image frame sequence from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains the pedestrian to be identified Identify the feature information of the pedestrian to be identified from the image information, and determine all the tag information corresponding to the feature information; use the pre-trained pedestrian re-recognition model to separately perform the target image frame sequence Perform pedestrian re-identification with all the tag information, determine the target tag information of the pedestrian to be identified from all the tag information; determine the re-identification result of the pedestrian to be identified based on the target tag information.
  • FIG. 3 it is a flowchart of the implementation of the pedestrian re-identification method provided by the second embodiment of the present application. It can be seen from Fig. 3 that compared with the embodiment shown in Fig. 1, the specific implementation process of S301 ⁇ S302 and S308 ⁇ S309 is the same as the specific implementation process of S101 ⁇ S104 in this implementation. The difference is that S303 is also included before S308. ⁇ S307, where S307 and S308 are executed in parallel, and one of them can be executed. The specific implementation process of S303 ⁇ S307 is detailed as follows:
  • S303 Collect a first preset number of training samples, each of which includes an image of a pedestrian to be identified and all preset label information corresponding to the pedestrian to be identified.
  • S304 Use the training sample to train a pre-established machine learning model for training, to obtain a trained machine learning model.
  • S304 includes:
  • the machine learning model may be a deep learning model such as a neural network model, a logical classification model, and a random forest model. It is understandable that all the preset label information corresponding to each pedestrian to be identified are usually not completely the same. Performing preset label recognition through the machine learning model can quickly and accurately obtain that each pedestrian to be identified belongs to each category. The probability of the preset label information.
  • the preset label property with the highest probability corresponding to each pedestrian to be identified is used as a constraint condition for training the machine learning model, and by minimizing the loss function corresponding to the machine learning model, iteratively
  • the preset parameters of the machine learning model are used to improve the accuracy of the machine learning model's recognition of error labels.
  • S305 includes:
  • S3051 Input a second preset number of test samples into the machine learning model after training for analysis, and determine the rate of change of the loss function of the machine learning model after training.
  • the loss function of the pre-trained pedestrian re-identification model is:
  • N represents the total number of training samples
  • K represents the total number of categories of preset label information
  • p j represents the probability value of the current sample belonging to the j-th type of preset label information
  • y i is the true label information corresponding to the current sample
  • q i, j are the distribution ratios of p j
  • N sc represents the number of similar label information belonging to the current sample
  • is a coefficient that balances the real label information and the similar label information
  • Fig. 6 is a schematic structural diagram of a pedestrian re-identification device provided by an embodiment of the present application. It can be seen from FIG. 6 that the pedestrian re-identification device 6 provided in this embodiment includes: an acquisition module 601, a first determination module 602, a re-identification module 603, and a second determination module 604. among them,
  • the obtaining module 601 is configured to obtain a target image frame sequence from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
  • the first determining module 602 is configured to identify the characteristic information of the pedestrian to be identified from the image information, and determine all tag information corresponding to the characteristic information;
  • the re-recognition module 603 is configured to use the pre-trained pedestrian re-recognition model to perform pedestrian re-recognition on the target image frame sequence and all the tag information, and determine the target of the pedestrian to be recognized from all the tag information Label Information;
  • the second determining module 604 is configured to determine the re-identification result of the pedestrian to be identified based on the target tag information.
  • the first determining module 602 includes:
  • a recognition unit configured to use a pre-trained feature information recognition model to perform feature recognition on the pedestrian to be identified to obtain feature information of the pedestrian to be identified;
  • a calculation unit configured to calculate the probability value of the feature information belonging to each type of preset label information
  • the first determining unit is configured to: if the probability value of the feature information belonging to the first type of preset label information is greater than the probability value of belonging to the second type of preset label information, and the feature information belongs to the first type of preset If the probability value of the tag information is greater than the preset probability threshold, it is determined that the first type of preset tag information is the tag information corresponding to the feature information, and the second type of preset tag information is except for the first type of preset tag information. Set any type of preset label information except label information.
  • the calculation unit includes:
  • a preset probability normalization formula is used to calculate the probability value of the feature information belonging to each type of preset label information; the preset probability normalization formula is:
  • p i represents the probability value of the feature information belonging to the i-th type of preset label information
  • K represents the total number of types of preset label information
  • An acquisition module configured to collect a first preset number of training samples, each of the training samples includes an image of the pedestrian to be identified and all preset label information corresponding to the pedestrian to be identified;
  • the training module is used for training a pre-established machine learning model by using the training samples to obtain a machine learning model after training;
  • a test module which is used to perform a model accuracy test on the machine learning model after training
  • the first determination module is configured to determine that the machine learning model after the training is the pedestrian re-identification model if the accuracy test of the machine learning model after the training is passed;
  • the second determination module is used to increase the number of training samples if the accuracy test of the machine learning model after the training fails, and then return to execute the machine learning pre-established for training with the training samples
  • the model is trained to obtain the pedestrian re-identification model.
  • the training module includes:
  • the re-identification unit is configured to use the pre-established machine learning model to re-identify all the preset label information corresponding to each pedestrian to be identified, and obtain the probability that each pedestrian to be identified belongs to each type of preset label information, And determine the preset label information with the highest probability corresponding to each of the pedestrians to be identified;
  • An iterative unit configured to use the preset label information with the highest probability corresponding to each pedestrian to be identified as a constraint condition for training the machine learning model, and iterate the preset parameters of the machine learning model;
  • the second determining unit is configured to determine that the training of the machine learning model is completed if the rate of change of the loss function value corresponding to the machine learning model becomes stable, and the pedestrian re-identification model is obtained.
  • the loss function of the pre-trained pedestrian re-identification model is:
  • N represents the total number of training samples
  • K represents the total number of categories of preset label information
  • p j represents the probability value of the current sample belonging to the j-th type of preset label information
  • y i is the true label information corresponding to the current sample
  • q i, j are the distribution ratios of p j
  • N sc represents the number of similar label information belonging to the current sample
  • is a coefficient that balances the real label information and the similar label information
  • FIG. 7 is a schematic structural diagram of a terminal provided by an embodiment of the present application.
  • the terminal 7 of this embodiment includes a processor 70, a memory 71, and a computer program 72 stored in the memory 71 and running on the processor 70, such as a pedestrian re-identification program.
  • the processor 70 executes the computer program 72, the steps in the above-mentioned various pedestrian re-identification method embodiments are implemented, for example, steps 101 to 104 shown in FIG. 1.
  • the computer program 72 may be divided into one or more modules/units, and the one or more modules/units are stored in the memory 71 and executed by the processor 70 to complete the application.
  • the one or more modules/units may be a series of computer program instruction segments capable of completing specific functions, and the instruction segments are used to describe the execution process of the computer program 72 in the terminal 7.
  • the computer program 72 can be divided into an acquisition module, a first determination module, a re-identification module, and a second determination module (a module in a virtual device).
  • the specific functions of each module are as follows:
  • An acquiring module configured to acquire a target image frame sequence from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
  • the first determining module is configured to identify the characteristic information of the pedestrian to be identified from the image information, and determine all tag information corresponding to the characteristic information;
  • the re-recognition module is used to perform pedestrian re-recognition on the target image frame sequence and all the tag information using the pre-trained pedestrian re-recognition model, and determine the target tag of the pedestrian to be recognized from all the tag information information;
  • the second determination module is configured to determine the re-identification result of the pedestrian to be identified based on the target tag information.
  • the disclosed device/terminal device and method may be implemented in other ways.
  • the device/terminal device embodiments described above are only illustrative.
  • the division of the modules or units is only a logical function division, and there may be other divisions in actual implementation, such as multiple units.
  • components can be combined or integrated into another system, or some features can be omitted or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple communication units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
  • the integrated module/unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium.
  • the present application implements all or part of the processes in the above-mentioned embodiments and methods, and can also be completed by instructing relevant hardware through a computer program.
  • the computer program can be stored in a computer-readable storage medium. When the program is executed by the processor, it can implement the steps of the foregoing method embodiments. .
  • the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file, or some intermediate forms.
  • the computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, U disk, mobile hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read-Only Memory) , Random Access Memory (RAM, Random Access Memory), electrical carrier signal, telecommunications signal, and software distribution media, etc.
  • ROM Read-Only Memory
  • RAM Random Access Memory
  • electrical carrier signal telecommunications signal
  • software distribution media etc.
  • the content contained in the computer-readable medium can be appropriately added or deleted according to the requirements of the legislation and patent practice in the jurisdiction.
  • the computer-readable medium Does not include electrical carrier signals and telecommunication signals.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The present application is applicable to the technical field of computers. Provided is a person re-identification method, comprising: acquiring a target image frame sequence from a pre-collected monitoring video stream; identifying, from image information, feature information of a person to be identified, and determining all pieces of tag information corresponding to the feature information; respectively performing person re-identification on the target image frame sequence and all the pieces of tag information by means of a pre-trained person re-identification model, and determining, from all the pieces of tag information, target tag information of the person to be identified; and determining, on the basis of the target tag information, a re-identification result of the person to be identified. By means of identifying, from image information including a person to be identified, feature information of a person to be identified, determining all pieces of tag information corresponding to the feature information, and respectively performing person re-identification on the image information including the person to be identified and all the pieces of tag information by means of a pre-trained person re-identification model, the accuracy of person re-identification is improved.

Description

行人重识别的方法、装置、终端及存储介质Method, device, terminal and storage medium for pedestrian re-identification
本申请要求于2019年11月01日在中国专利局提交的、申请号为201911060337.0、发明名称为“行人重识别的方法、装置、终端及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed at the Chinese Patent Office on November 1, 2019, with the application number 201911060337.0, and the invention title "Methods, devices, terminals and storage media for pedestrian re-identification", and its entire contents Incorporated in this application by reference.
技术领域Technical field
本申请属于计算机技术领域,尤其涉及一种行人重识别的方法、装置、终端及存储介质。This application belongs to the field of computer technology, and in particular relates to a method, device, terminal, and storage medium for pedestrian re-identification.
背景技术Background technique
行人重识别(Person re-identification)也称行人再识别,是利用计算机视觉技术判断图像或者视频序列中是否存在特定行人的技术。目前,在行人重识别过程中,常借助于网络识别模型。但是,由于不同摄像设备之间的差异,同时行人兼具刚性和柔性的特性,外观易受穿着、尺度、遮挡、姿态和视角等影响,使得行人重识别的过程比常见的人脸识别过程难度更大,如何提高行人重识别的准确性是目前亟待解决的技术问题。Pedestrian re-identification (Person re-identification), also known as pedestrian re-identification, is a technology that uses computer vision technology to determine whether there is a specific pedestrian in an image or video sequence. At present, in the process of pedestrian re-identification, network recognition models are often used. However, due to the differences between different camera equipment, pedestrians have both rigid and flexible characteristics, and the appearance is easily affected by wearing, scale, occlusion, posture and viewing angle, making the process of pedestrian re-recognition more difficult than the common face recognition process How to improve the accuracy of pedestrian re-identification is a technical problem that needs to be solved urgently.
发明内容Summary of the invention
有鉴于此,本申请实施例提供了行人重识别的方法、装置、终端及存储介质,以提高行人重识别的准确性。In view of this, the embodiments of the present application provide a pedestrian re-identification method, device, terminal, and storage medium to improve the accuracy of pedestrian re-identification.
本申请实施例的第一方面提供了一种行人重识别的方法,包括:The first aspect of the embodiments of the present application provides a pedestrian re-identification method, including:
从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息;Acquiring a target image frame sequence from a pre-collected surveillance video stream, where the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息;Identifying characteristic information of the pedestrian to be identified from the image information, and determining all tag information corresponding to the characteristic information;
利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息;Using a pre-trained pedestrian re-recognition model to perform pedestrian re-recognition on the target image frame sequence and all the tag information, and determine the target tag information of the pedestrian to be identified from all the tag information;
基于所述目标标签信息确定所述待识别行人的重识别结果。The re-identification result of the pedestrian to be identified is determined based on the target tag information.
本申请实施例第二方面提供了一种行人重识别装置,包括:A second aspect of the embodiments of the present application provides a pedestrian re-identification device, including:
获取模块,用于从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息;An acquiring module, configured to acquire a target image frame sequence from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
第一确定模块,用于从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息;The first determining module is configured to identify the characteristic information of the pedestrian to be identified from the image information, and determine all tag information corresponding to the characteristic information;
重识别模块,用于利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息;The re-recognition module is used to perform pedestrian re-recognition on the target image frame sequence and all the tag information using the pre-trained pedestrian re-recognition model, and determine the target tag of the pedestrian to be recognized from all the tag information information;
第二确定模块,用于基于所述目标标签信息确定所述待识别行人的重识别结果。The second determination module is configured to determine the re-identification result of the pedestrian to be identified based on the target tag information.
本申请实施例第三方面提供一种终端,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现如上实施例第一方面所述行人重识别的方法的步骤。A third aspect of the embodiments of the present application provides a terminal, including a memory, a processor, and a computer program stored in the memory and running on the processor, wherein the processor executes the computer program When realizing the steps of the pedestrian re-identification method described in the first aspect of the above embodiment.
本申请实施例第四方面提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如上实施例第一方面所述行人重识别的方法的步骤。The fourth aspect of the embodiments of the present application provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and is characterized in that, when the computer program is executed by a processor, the implementation is as described in the first aspect of the above embodiment. Steps of the pedestrian re-identification method.
本申请的一个或多个实施例的细节在下面的附图和描述中提出。本申请的其它特征、目的和优点将从说明书、附图以及权利要求书变得明显。The details of one or more embodiments of the present application are set forth in the following drawings and description. Other features, purposes and advantages of this application will become apparent from the description, drawings and claims.
附图说明Description of the drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly describe the technical solutions in the embodiments of the present application, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only of the present application. For some embodiments, those of ordinary skill in the art can obtain other drawings based on these drawings without creative labor.
图1是本申请第一实施例提供的行人重识别的方法的实现流程图;Fig. 1 is an implementation flowchart of the pedestrian re-identification method provided by the first embodiment of the present application;
图2是图1中S102的具体实施流程图;Figure 2 is a flow chart of the specific implementation of S102 in Figure 1;
图3是本申请第二实施例提供的行人重识别的方法的实现流程图;Fig. 3 is an implementation flowchart of the pedestrian re-identification method provided by the second embodiment of the present application;
图4是图3中S304的具体实施流程图;Figure 4 is a flowchart of the specific implementation of S304 in Figure 3;
图5是图3中S305的具体实现流程图;Figure 5 is a specific implementation flow chart of S305 in Figure 3;
图6是本申请实施例提供的行人重识别装置的结构示意图;FIG. 6 is a schematic structural diagram of a pedestrian re-identification device provided by an embodiment of the present application;
图7是本申请实施例提供的终端的结构示意图。FIG. 7 is a schematic structural diagram of a terminal provided by an embodiment of the present application.
具体实施方式Detailed ways
以下描述中,为了说明而不是为了限定,提出了诸如特定系统结构、技术之类的具体细节,以便透彻理解本申请实施例。然而,本领域的技术人员应当清楚,在没有这些具体细节的其它实施例中也可以实现本申请。在其它情况中,省略对众所周知的系统、装置、电路以及方法的详细说明,以免不必要的细节妨碍本申请的描述。In the following description, for the purpose of illustration rather than limitation, specific details such as a specific system structure and technology are proposed for a thorough understanding of the embodiments of the present application. However, it should be clear to those skilled in the art that the present application can also be implemented in other embodiments without these specific details. In other cases, detailed descriptions of well-known systems, devices, circuits, and methods are omitted to avoid unnecessary details from obstructing the description of this application.
需要说明的是,随着机器(深度)学习的迅速发展和视频监控设备的日益普及,行人重识别在智能安防、智能监控等领域愈加受到重视。现有大多数行人重识别方法通过使用机器学习模型来识别图像帧中的行人是否为同一人,将行人重识别问题看做为一个多分类问题。而常见的用于分类的机器学习模型,其损失函数通常为交叉熵损失函数。由于交叉熵损失函数只通过计算训练样本与所属正确类别之间的损失来保证分类的正确性,而没有考虑误判的损失信息,导致分类结果存在一定的误差。因此如何将错误类别的损失融入损失函数中,降低误判发生的概率,提高行人重识别网络模型的性能,是亟待解决的问题。It should be noted that with the rapid development of machine (deep) learning and the increasing popularity of video surveillance equipment, pedestrian re-identification has been paid more and more attention in the fields of intelligent security and intelligent surveillance. Most existing pedestrian re-identification methods use machine learning models to identify whether the pedestrians in the image frame are the same person, and regard the pedestrian re-identification problem as a multi-classification problem. The loss function of common machine learning models used for classification is usually a cross-entropy loss function. Since the cross-entropy loss function only calculates the loss between the training sample and the correct category to ensure the correctness of the classification, and does not consider the loss information of the misjudgment, there is a certain error in the classification result. Therefore, how to integrate the loss of the wrong category into the loss function, reduce the probability of misjudgment, and improve the performance of the pedestrian re-identification network model is an urgent problem to be solved.
本发明提供一种行人重识别方法,其基于新型损失函数的行人重识别网络模型进行行人重识别,通过增加错误类别的损失来改进交叉熵损失函数,从而使得机器学习模型获得更好的分类性能。The present invention provides a pedestrian re-identification method, which performs pedestrian re-identification based on a pedestrian re-identification network model of a novel loss function, and improves the cross-entropy loss function by increasing the loss of the wrong category, thereby enabling the machine learning model to obtain better classification performance .
为了说明本申请所述的技术方案,下面通过具体实施例来进行说明。如图1所示,是本申请第一实施例提供的行人重识别的方法的实现流程图,本实施例可以由行人重识别的装置的硬件或者软件实现,所述行人重识别的装置可以是终端。详述如下:In order to illustrate the technical solution described in the present application, specific embodiments are used for description below. As shown in Figure 1, it is a flow chart of the implementation of the pedestrian re-identification method provided by the first embodiment of the present application. This embodiment may be implemented by hardware or software of a pedestrian re-identification device. The pedestrian re-identification device may be terminal. The details are as follows:
S101,从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息。S101. Obtain a target image frame sequence from a pre-collected surveillance video stream, where the target image frame in the target image frame sequence contains image information of a pedestrian to be identified.
所述预先采集的监控视频流由预先确定的监控设备采集,例如校园中的监控设备采集的视频流,所述监控视频流包括具有时间顺序的连续图像帧,不同时间段采集的视频流包括的图像帧对应包含不同的目标信息。在本实施例中,从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含有待识别行人的图像信息,所述待识别行人的图像信息包括待识别行人的脸部信息、穿衣信息、肢体信息等。The pre-collected surveillance video stream is collected by a pre-determined surveillance device, such as a video stream collected by a surveillance device in a campus, the surveillance video stream includes consecutive image frames in a time sequence, and the video streams collected in different time periods include The image frame correspondingly contains different target information. In this embodiment, a target image frame sequence is acquired from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains the image information of the pedestrian to be identified, and the image information of the pedestrian to be identified includes the image information of the pedestrian to be identified. Recognize pedestrian's facial information, clothing information, body information, etc.
S102,从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息。S102: Identify feature information of the pedestrian to be identified from the image information, and determine all tag information corresponding to the feature information.
所述待识别行人的特征信息包括如皮肤状态、面部表情、五官特征等面部特征,衣服颜色、衣服纹理、手提包、背包、帽子等外观特征,所占据的图像局域、在图像中的相对位置等位置特征。在本实施例中,所述特征信息具有对应的标签信息,所述标签信息用于标识所述特征信息,例如所述特征信息为皮肤状态,对应的标签信息为光滑或者不光滑等。The feature information of the pedestrian to be identified includes facial features such as skin condition, facial expression, facial features, appearance features such as clothes color, clothes texture, handbag, backpack, hat, etc., the image area occupied, and the relative position in the image. Location features such as location. In this embodiment, the characteristic information has corresponding label information, and the label information is used to identify the characteristic information. For example, the characteristic information is a skin condition, and the corresponding label information is smooth or not smooth.
在一种可选的实现方式中,如图2所示,是图1中S102的具体实施流程图。由图2可知,S102,包括:In an optional implementation manner, as shown in FIG. 2, it is a flowchart of the specific implementation of S102 in FIG. 1. It can be seen from Figure 2 that S102 includes:
S1021,利用预先训练完成的特征信息识别模型,对所述待识别行人进行特征识别,得到所述待识别行人的特征信息。S1021: Perform feature recognition on the pedestrian to be identified by using the feature information recognition model completed in advance to obtain the feature information of the pedestrian to be identified.
所述预先训练完成的特征信息识别模型可以是具有识别功能的机器学习模型,例如神经网络模型,所述神经网络模型的输入为待识别行人,输出为待识别行人对应的特征信息。The pre-trained feature information recognition model may be a machine learning model with a recognition function, such as a neural network model. The input of the neural network model is the pedestrian to be recognized, and the output is the feature information corresponding to the pedestrian to be recognized.
S1022,计算所述特征信息属于每类预设标签信息的概率值。S1022: Calculate the probability value of the feature information belonging to each type of preset label information.
可以理解地,通常待识别行人具有多个不同的特征信息,例如皮肤状况,衣服颜色等,且不同的特征信息对应多个预设标签信息,例如衣服颜色对应的预设标签信息有红、黑、黄、绿等,且当特征信息的特征不明显时,识别结果可能会出现多个预设标签信息,例如,当衣服颜色为黑时,识别结果可能对应为黑、灰两种预设标签信息,此时需要进一步计算所述特征信息属于每类预设标签信息的概率值。Understandably, the pedestrian to be identified usually has multiple different characteristic information, such as skin condition, clothes color, etc., and different characteristic information corresponds to multiple preset label information, for example, the preset label information corresponding to the clothes color includes red and black. , Yellow, green, etc., and when the feature information is not obvious, multiple preset label information may appear in the recognition result. For example, when the color of clothes is black, the recognition result may correspond to two preset labels of black and gray. At this time, it is necessary to further calculate the probability value of the feature information belonging to each type of preset label information.
具体地,可以通过利用预设的概率归一化公式计算所述特征信息属于每类预设标签信息的概率值;所述预设的概率归一化公式为:Specifically, the probability value of the feature information belonging to each type of preset label information can be calculated by using a preset probability normalization formula; the preset probability normalization formula is:
Figure PCTCN2019119860-appb-000001
Figure PCTCN2019119860-appb-000001
其中,p i表示所述特征信息属于第i类预设标签信息的概率值,K表示预设标签信息的类别总数,
Figure PCTCN2019119860-appb-000002
表示所述特征信息属于第i类预设标签信息的对数概率值。
Wherein, p i represents the probability value of the feature information belonging to the i-th type of preset label information, and K represents the total number of types of preset label information,
Figure PCTCN2019119860-appb-000002
It indicates that the feature information belongs to the log probability value of the i-th type of preset label information.
S1023,若所述特征信息属于第一类预设标签信息的概率值大于属于第二类预设标签信息的概率值,且所述特征信息属于所述第一类预设标签信息的概率值大于预设的概率阈值,则判定所述第一类预设标签信息为所述特征信息对应的标签信息,所述第二类预设标签信息为除所述第一类预设标签信息之外的任 一类预设标签信息。S1023: If the probability value of the feature information belonging to the first type of preset label information is greater than the probability value of belonging to the second type of preset tag information, and the probability value of the feature information belonging to the first type of preset label information is greater than A preset probability threshold, it is determined that the first type of preset label information is the label information corresponding to the feature information, and the second type of preset label information is other than the first type of preset label information Any type of preset label information.
在本实施例中,通过计算所述特征信息属于预设标签信息的概率值来确定所述特征信息对应的预设标签信息。可以理解地,计算概率值的方式不限于利用上述预设的概率归一化公式,具体在此不做限定。In this embodiment, the preset tag information corresponding to the feature information is determined by calculating the probability value that the feature information belongs to the preset tag information. Understandably, the method of calculating the probability value is not limited to using the aforementioned preset probability normalization formula, which is not specifically limited here.
S103,利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息。S103, using a pre-trained pedestrian re-recognition model to perform pedestrian re-identification on the target image frame sequence and all the tag information, and determine the target tag information of the pedestrian to be identified from all the tag information.
所述预先训练完成的行人重识别模型可以为具有识别功能的机器学习模型,所述行人重识别模型的输入为包含有待识别行人的图像帧序列和所述待识别行人的特征信息对应的所有标签信息,输出为所述待识别行人的目标标签信息。其中,所述待识别行人属于每类预设标签信息的概率最大的标签信息。The pre-trained pedestrian re-recognition model may be a machine learning model with a recognition function, and the input of the pedestrian re-recognition model is an image frame sequence containing the pedestrian to be recognized and all tags corresponding to the characteristic information of the pedestrian to be recognized Information, output as target tag information of the pedestrian to be identified. Wherein, the pedestrian to be identified belongs to tag information with the highest probability of each type of preset tag information.
S104,基于所述目标标签信息确定所述待识别行人的重识别结果。S104: Determine a re-identification result of the pedestrian to be identified based on the target tag information.
具体地,当所述目标标签信息与预设的所述待识别行人的标签信息匹配,则确定所述待识别行人是预先确定的特定行人,若所述目标标签信息与预设的所述待识别行人的标签信息不匹配,则判定所述待识别行人不是预先确定的特定行人。Specifically, when the target tag information matches the preset tag information of the pedestrian to be identified, it is determined that the pedestrian to be identified is a predetermined specific pedestrian. If the tag information for identifying the pedestrian does not match, it is determined that the pedestrian to be identified is not a predetermined specific pedestrian.
由上述分析可知,本申请实施例提供的行人重拾别的方法,包括:从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息;从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息;利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息;基于所述目标标签信息确定所述待识别行人的重识别结果。与现有技术相比,通过从包含待识别行人的图像信息中识别出待识别行人的特征信息,确定特征信息对应的所有标签信息;并利用预先训练完成的行人重识别模型分别对包含有待识别行人的图像信息和所有标签信息进行行人重识别,提高行人重识别的准确性。From the above analysis, it can be seen that the pedestrian recovery method provided by the embodiment of the present application includes: acquiring a target image frame sequence from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains the pedestrian to be identified Identify the feature information of the pedestrian to be identified from the image information, and determine all the tag information corresponding to the feature information; use the pre-trained pedestrian re-recognition model to separately perform the target image frame sequence Perform pedestrian re-identification with all the tag information, determine the target tag information of the pedestrian to be identified from all the tag information; determine the re-identification result of the pedestrian to be identified based on the target tag information. Compared with the prior art, by identifying the characteristic information of the pedestrian to be identified from the image information containing the pedestrian to be identified, all the tag information corresponding to the characteristic information is determined; Pedestrian image information and all tag information are re-identified to improve the accuracy of pedestrian re-identification.
如图3所示,是本申请第二实施例提供的行人重识别的方法的实现流程图。由图3可知,本实施与图图1所示实施例相比,S301~S302以及S308~S309的具体实施过程与S101~S104的具体实现过程相同,不同之处在于,在S308之前还包括S303~S307,其中,S307与S308为并列执行关系,可以择一执行。S303~S307的具体实施过程详述如下:As shown in FIG. 3, it is a flowchart of the implementation of the pedestrian re-identification method provided by the second embodiment of the present application. It can be seen from Fig. 3 that compared with the embodiment shown in Fig. 1, the specific implementation process of S301~S302 and S308~S309 is the same as the specific implementation process of S101~S104 in this implementation. The difference is that S303 is also included before S308. ~ S307, where S307 and S308 are executed in parallel, and one of them can be executed. The specific implementation process of S303~S307 is detailed as follows:
S303,采集第一预设数量的训练样本,每个所述训练样本包含待识别行人的图像以及所述待识别行人对应的所有预设标签信息。S303: Collect a first preset number of training samples, each of which includes an image of a pedestrian to be identified and all preset label information corresponding to the pedestrian to be identified.
S304,利用所述训练样本训练预先建立的机器学习模型进行训练,得到训练之后的机器学习模型。S304: Use the training sample to train a pre-established machine learning model for training, to obtain a trained machine learning model.
如图4所示,是图3中S304的具体实施流程图。由图4可知,S304包括:As shown in FIG. 4, it is a flowchart of the specific implementation of S304 in FIG. 3. It can be seen from Figure 4 that S304 includes:
S3041,使用所述预先建立的机器学习模型重识别每个所述待识别行人对应的所有预设标签信息,得到每个所述待识别行人属于每类预设标签信息的概率,并确定每个所述待识别行人对应的概率最大的预设标签信息。S3041. Use the pre-established machine learning model to re-identify all preset label information corresponding to each pedestrian to be identified, obtain the probability that each pedestrian to be identified belongs to each type of preset label information, and determine each The preset tag information with the highest probability corresponding to the pedestrian to be identified.
所述机器学习模型可以是神经网络模型、逻辑分类模型、随机森林模型等深度学习模型。可以理解地,每个所述待识别行人对应的所有预设标签信息通常不完全相同,通过所述机器学习模型进行预设标签识别,可以快速准确地得到每个所述待识别行人属于每类所述预设标签信息的概率。The machine learning model may be a deep learning model such as a neural network model, a logical classification model, and a random forest model. It is understandable that all the preset label information corresponding to each pedestrian to be identified are usually not completely the same. Performing preset label recognition through the machine learning model can quickly and accurately obtain that each pedestrian to be identified belongs to each category. The probability of the preset label information.
S3042,分别以每个所述待识别行人对应的所述概率最大的预设标签信息为训练所述机器学习模型的约束条件,迭代所述机器学习模型的预设参数。S3042, respectively using the preset label information with the highest probability corresponding to each pedestrian to be identified as a constraint condition for training the machine learning model, and iterating the preset parameters of the machine learning model.
可以理解地,分别以每个所述待识别行人对应的所述概率最大的预设标签性为训练所述机器学习模型的约束条件,通过最小化所述机器学习模型对应的损失函数,不断迭代所述机器学习模型的预设参数,来提高所述机器学习模型对错误标签识别的准确率。Understandably, the preset label property with the highest probability corresponding to each pedestrian to be identified is used as a constraint condition for training the machine learning model, and by minimizing the loss function corresponding to the machine learning model, iteratively The preset parameters of the machine learning model are used to improve the accuracy of the machine learning model's recognition of error labels.
S3043,若所述机器学习模型对应的损失函数值的变化率趋于稳定,则判定对所述机器学习模型的训练完成,得到所述行人重识别模型。S3043: If the rate of change of the loss function value corresponding to the machine learning model tends to be stable, it is determined that the training of the machine learning model is completed, and the pedestrian re-identification model is obtained.
S305,对训练之后的所述机器学习模型进行模型准确性测试。S305: Perform a model accuracy test on the machine learning model after training.
如图5所示,是图3中S305的具体实现流程图。由图5可知,S305包括:As shown in FIG. 5, it is a specific implementation flowchart of S305 in FIG. 3. It can be seen from Figure 5 that S305 includes:
S3051,将第二预设数量的测试样本输入训练之后的所述机器学习模型进行分析,确定训练之后的所述机器学习模型的损失函数的变化率。S3051: Input a second preset number of test samples into the machine learning model after training for analysis, and determine the rate of change of the loss function of the machine learning model after training.
S3052,若所述变化率小于或者等于预设的变化率阈值,则判定对训练之后的所述机器学习模型的测试通过。S3052: If the rate of change is less than or equal to a preset rate of change threshold, it is determined that the test of the machine learning model after training passes.
S3053,若所述变化率大于预设的变化率阈值,则判定对训练之后的所述机器学习模型的测试不通过。S3053: If the rate of change is greater than a preset rate of change threshold, it is determined that the test of the machine learning model after the training fails.
S306,若对所述训练之后的所述机器学习模型的准确性测试通过,则判定训练之后的所述机器学习模型为所述行人重识别模型。S306: If the accuracy test of the machine learning model after the training is passed, it is determined that the machine learning model after the training is the pedestrian re-identification model.
S307,若对所述训练之后的所述机器学习模型的准确性测试不通过,则增 加所述训练样本的数量,并返回执行利用所述训练样本训练预先建立的机器学习模型进行训练,得到所述行人重识别模型。S307: If the accuracy test of the machine learning model after the training fails, increase the number of training samples, and return to perform training using the training sample to train the pre-established machine learning model to obtain all The pedestrian re-identification model.
所述预先训练完成的所述行人重识别模型的损失函数为:The loss function of the pre-trained pedestrian re-identification model is:
Figure PCTCN2019119860-appb-000003
Figure PCTCN2019119860-appb-000003
其中,among them,
Figure PCTCN2019119860-appb-000004
Figure PCTCN2019119860-appb-000004
Figure PCTCN2019119860-appb-000005
Figure PCTCN2019119860-appb-000005
其中,N表示所述训练样本的总数,K表示预设标签信息的类别总数,p j表示当前样本属于第j类预设标签信息的概率值,y i为当前样本对应的真实标签信息,q i,j为p j的分布比例,N sc表示当前样本属于相似标签信息的数目,ε为平衡所述真实标签信息和所述相似标签信息的系数,
Figure PCTCN2019119860-appb-000006
表示当前样本对应的特征信息属于第j类预设标签信息的对数概率值,
Figure PCTCN2019119860-appb-000007
表示当前样本对应的特征信息属于第k类预设标签信息的对数概率值,
Figure PCTCN2019119860-appb-000008
表示输出概率大于预设概率阈值的前n个标签。
Where N represents the total number of training samples, K represents the total number of categories of preset label information, p j represents the probability value of the current sample belonging to the j-th type of preset label information, y i is the true label information corresponding to the current sample, q i, j are the distribution ratios of p j , N sc represents the number of similar label information belonging to the current sample, and ε is a coefficient that balances the real label information and the similar label information,
Figure PCTCN2019119860-appb-000006
Indicates that the feature information corresponding to the current sample belongs to the log probability value of the j-th type of preset label information,
Figure PCTCN2019119860-appb-000007
Indicates that the feature information corresponding to the current sample belongs to the log probability value of the k-th type of preset label information,
Figure PCTCN2019119860-appb-000008
Represents the first n tags whose output probability is greater than the preset probability threshold.
图6是本申请实施例提供的行人重识别装置的结构示意图。由图6可知,本实施例提供的行人重识别装置6包括:获取模块601、第一确定模块602、重识别模块603以及第二确定模块604。其中,Fig. 6 is a schematic structural diagram of a pedestrian re-identification device provided by an embodiment of the present application. It can be seen from FIG. 6 that the pedestrian re-identification device 6 provided in this embodiment includes: an acquisition module 601, a first determination module 602, a re-identification module 603, and a second determination module 604. among them,
获取模块601,用于从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息;The obtaining module 601 is configured to obtain a target image frame sequence from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
第一确定模块602,用于从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息;The first determining module 602 is configured to identify the characteristic information of the pedestrian to be identified from the image information, and determine all tag information corresponding to the characteristic information;
重识别模块603,用于利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息;The re-recognition module 603 is configured to use the pre-trained pedestrian re-recognition model to perform pedestrian re-recognition on the target image frame sequence and all the tag information, and determine the target of the pedestrian to be recognized from all the tag information Label Information;
第二确定模块604,用于基于所述目标标签信息确定所述待识别行人的重识别结果。The second determining module 604 is configured to determine the re-identification result of the pedestrian to be identified based on the target tag information.
在一种可选的实现方式中,所述第一确定模块602,包括:In an optional implementation manner, the first determining module 602 includes:
识别单元,用于利用预先训练完成的特征信息识别模型,对所述待识别行 人进行特征识别,得到所述待识别行人的特征信息;A recognition unit, configured to use a pre-trained feature information recognition model to perform feature recognition on the pedestrian to be identified to obtain feature information of the pedestrian to be identified;
计算单元,用于计算所述特征信息属于每类预设标签信息的概率值;A calculation unit, configured to calculate the probability value of the feature information belonging to each type of preset label information;
第一判定单元,用于在若所述特征信息属于第一类预设标签信息的概率值大于属于第二类预设标签信息的概率值,且所述特征信息属于所述第一类预设标签信息的概率值大于预设的概率阈值,则判定所述第一类预设标签信息为所述特征信息对应的标签信息,所述第二类预设标签信息为除所述第一类预设标签信息之外的任一类预设标签信息。The first determining unit is configured to: if the probability value of the feature information belonging to the first type of preset label information is greater than the probability value of belonging to the second type of preset label information, and the feature information belongs to the first type of preset If the probability value of the tag information is greater than the preset probability threshold, it is determined that the first type of preset tag information is the tag information corresponding to the feature information, and the second type of preset tag information is except for the first type of preset tag information. Set any type of preset label information except label information.
在一种可选的实现方式中,所述计算单元,包括:In an optional implementation manner, the calculation unit includes:
利用预设的概率归一化公式计算所述特征信息属于每类预设标签信息的概率值;所述预设的概率归一化公式为:A preset probability normalization formula is used to calculate the probability value of the feature information belonging to each type of preset label information; the preset probability normalization formula is:
Figure PCTCN2019119860-appb-000009
Figure PCTCN2019119860-appb-000009
其中,p i表示所述特征信息属于第i类预设标签信息的概率值,K表示预设标签信息的类别总数,
Figure PCTCN2019119860-appb-000010
表示所述特征信息属于第i类预设标签信息的对数概率值。
Wherein, p i represents the probability value of the feature information belonging to the i-th type of preset label information, and K represents the total number of types of preset label information,
Figure PCTCN2019119860-appb-000010
Indicates the log probability value of the feature information belonging to the i-th type of preset label information.
在一种可选的实现方式中,还包括:In an optional implementation manner, it further includes:
采集模块,用于采集第一预设数量的训练样本,每个所述训练样本包含待识别行人的图像以及所述待识别行人对应的所有预设标签信息;An acquisition module, configured to collect a first preset number of training samples, each of the training samples includes an image of the pedestrian to be identified and all preset label information corresponding to the pedestrian to be identified;
训练模块,用于利用所述训练样本训练预先建立的机器学习模型进行训练,得到训练之后的机器学习模型;The training module is used for training a pre-established machine learning model by using the training samples to obtain a machine learning model after training;
测试模块,用于对训练之后的所述机器学习模型进行模型准确性测试;A test module, which is used to perform a model accuracy test on the machine learning model after training;
第一判定模块,用于在若对所述训练之后的所述机器学习模型的准确性测试通过,则判定训练之后的所述机器学习模型为所述行人重识别模型;The first determination module is configured to determine that the machine learning model after the training is the pedestrian re-identification model if the accuracy test of the machine learning model after the training is passed;
第二判定模块,用于在若对所述训练之后的所述机器学习模型的准确性测试不通过,则增加所述训练样本的数量,并返回执行利用所述训练样本训练预先建立的机器学习模型进行训练,得到所述行人重识别模型。The second determination module is used to increase the number of training samples if the accuracy test of the machine learning model after the training fails, and then return to execute the machine learning pre-established for training with the training samples The model is trained to obtain the pedestrian re-identification model.
在一种可选的实现方式中,所述训练模块,包括:In an optional implementation manner, the training module includes:
重识别单元,用于使用所述预先建立的机器学习模型重识别每个所述待识别行人对应的所有预设标签信息,得到每个所述待识别行人属于每类预设标签信息的概率,并确定每个所述待识别行人对应的概率最大的预设标签信息;The re-identification unit is configured to use the pre-established machine learning model to re-identify all the preset label information corresponding to each pedestrian to be identified, and obtain the probability that each pedestrian to be identified belongs to each type of preset label information, And determine the preset label information with the highest probability corresponding to each of the pedestrians to be identified;
迭代单元,用于分别以每个所述待识别行人对应的所述概率最大的预设标签信息为训练所述机器学习模型的约束条件,迭代所述机器学习模型的预设参数;An iterative unit, configured to use the preset label information with the highest probability corresponding to each pedestrian to be identified as a constraint condition for training the machine learning model, and iterate the preset parameters of the machine learning model;
第二判定单元,用于在若所述机器学习模型对应的损失函数值的变化率趋于稳定,则判定对所述机器学习模型的训练完成,得到所述行人重识别模型。The second determining unit is configured to determine that the training of the machine learning model is completed if the rate of change of the loss function value corresponding to the machine learning model becomes stable, and the pedestrian re-identification model is obtained.
在一种可选的实现方式中,所述预先训练完成的所述行人重识别模型的损失函数为:In an optional implementation manner, the loss function of the pre-trained pedestrian re-identification model is:
Figure PCTCN2019119860-appb-000011
Figure PCTCN2019119860-appb-000011
其中,among them,
Figure PCTCN2019119860-appb-000012
Figure PCTCN2019119860-appb-000012
Figure PCTCN2019119860-appb-000013
Figure PCTCN2019119860-appb-000013
其中,N表示所述训练样本的总数,K表示预设标签信息的类别总数,p j表示当前样本属于第j类预设标签信息的概率值,y i为当前样本对应的真实标签信息,q i,j为p j的分布比例,N sc表示当前样本属于相似标签信息的数目,ε为平衡所述真实标签信息和所述相似标签信息的系数,
Figure PCTCN2019119860-appb-000014
表示当前样本对应的特征信息属于第j类预设标签信息的对数概率值,
Figure PCTCN2019119860-appb-000015
表示当前样本对应的特征信息属于第k类预设标签信息的对数概率值,
Figure PCTCN2019119860-appb-000016
表示输出概率大于预设概率阈值的前n个标签。
Where N represents the total number of training samples, K represents the total number of categories of preset label information, p j represents the probability value of the current sample belonging to the j-th type of preset label information, y i is the true label information corresponding to the current sample, q i, j are the distribution ratios of p j , N sc represents the number of similar label information belonging to the current sample, and ε is a coefficient that balances the real label information and the similar label information,
Figure PCTCN2019119860-appb-000014
Indicates that the feature information corresponding to the current sample belongs to the log probability value of the j-th type of preset label information,
Figure PCTCN2019119860-appb-000015
Indicates that the feature information corresponding to the current sample belongs to the log probability value of the k-th type of preset label information,
Figure PCTCN2019119860-appb-000016
Represents the first n tags whose output probability is greater than the preset probability threshold.
图7是本申请实施例提供的终端的结构示意图。如图7所示,该实施例的终端7包括:处理器70、存储器71以及存储在存储器71中并可在处理器70上运行的计算机程序72,例如行人重识别的程序。处理器70执行计算机程序72时实现上述各个行人重识别的方法实施例中的步骤,例如图1所示的步骤101至104。FIG. 7 is a schematic structural diagram of a terminal provided by an embodiment of the present application. As shown in FIG. 7, the terminal 7 of this embodiment includes a processor 70, a memory 71, and a computer program 72 stored in the memory 71 and running on the processor 70, such as a pedestrian re-identification program. When the processor 70 executes the computer program 72, the steps in the above-mentioned various pedestrian re-identification method embodiments are implemented, for example, steps 101 to 104 shown in FIG. 1.
示例性的,计算机程序72可以被分割成一个或多个模块/单元,所述一个或者多个模块/单元被存储在存储器71中,并由处理器70执行,以完成本申请。所述一个或多个模块/单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述计算机程序72在所述终端7中的执行过程。例如,计算机程序72可以被分割成获取模块、第一确定模块、重识别模块以及第二确定模 块(虚拟装置中的模块),各模块具体功能如下:Exemplarily, the computer program 72 may be divided into one or more modules/units, and the one or more modules/units are stored in the memory 71 and executed by the processor 70 to complete the application. The one or more modules/units may be a series of computer program instruction segments capable of completing specific functions, and the instruction segments are used to describe the execution process of the computer program 72 in the terminal 7. For example, the computer program 72 can be divided into an acquisition module, a first determination module, a re-identification module, and a second determination module (a module in a virtual device). The specific functions of each module are as follows:
获取模块,用于从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息;An acquiring module, configured to acquire a target image frame sequence from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
第一确定模块,用于从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息;The first determining module is configured to identify the characteristic information of the pedestrian to be identified from the image information, and determine all tag information corresponding to the characteristic information;
重识别模块,用于利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息;The re-recognition module is used to perform pedestrian re-recognition on the target image frame sequence and all the tag information using the pre-trained pedestrian re-recognition model, and determine the target tag of the pedestrian to be recognized from all the tag information information;
第二确定模块,用于基于所述目标标签信息确定所述待识别行人的重识别结果。The second determination module is configured to determine the re-identification result of the pedestrian to be identified based on the target tag information.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述系统中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and conciseness of description, only the division of the above functional units and modules is used as an example. In practical applications, the above functions can be allocated to different functional units and modules as needed. Module completion, that is, the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above. The functional units and modules in the embodiments can be integrated into one processing unit, or each unit can exist alone physically, or two or more units can be integrated into one unit. The above-mentioned integrated units can be hardware-based Formal realization can also be realized in the form of software functional units. In addition, the specific names of the functional units and modules are only used to facilitate distinguishing each other, and are not used to limit the protection scope of the present application. For the specific working process of the units and modules in the foregoing system, reference may be made to the corresponding process in the foregoing method embodiment, which will not be repeated here.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述或记载的部分,可以参见其它实施例的相关描述。In the above-mentioned embodiments, the description of each embodiment has its own emphasis. For parts that are not described in detail or recorded in an embodiment, reference may be made to related descriptions of other embodiments.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。A person of ordinary skill in the art may realize that the units and algorithm steps of the examples described in combination with the embodiments disclosed herein can be implemented by electronic hardware or a combination of computer software and electronic hardware. Whether these functions are performed by hardware or software depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered beyond the scope of this application.
在本申请所提供的实施例中,应该理解到,所揭露的装置/终端设备和方法,可以通过其它的方式实现。例如,以上所描述的装置/终端设备实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一 个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通讯连接可以是通过一些接口,装置或单元的间接耦合或通讯连接,可以是电性,机械或其它的形式。In the embodiments provided in this application, it should be understood that the disclosed device/terminal device and method may be implemented in other ways. For example, the device/terminal device embodiments described above are only illustrative. For example, the division of the modules or units is only a logical function division, and there may be other divisions in actual implementation, such as multiple units. Or components can be combined or integrated into another system, or some features can be omitted or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个通信单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple communication units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
所述集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请实现上述实施例方法中的全部或部分流程,也可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一计算机可读存储介质中,该计算机程序在被处理器执行时,可实现上述各个方法实施例的步骤。。其中,所述计算机程序包括计算机程序代码,所述计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,所述计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。If the integrated module/unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium. Based on this understanding, the present application implements all or part of the processes in the above-mentioned embodiments and methods, and can also be completed by instructing relevant hardware through a computer program. The computer program can be stored in a computer-readable storage medium. When the program is executed by the processor, it can implement the steps of the foregoing method embodiments. . Wherein, the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file, or some intermediate forms. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, U disk, mobile hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read-Only Memory) , Random Access Memory (RAM, Random Access Memory), electrical carrier signal, telecommunications signal, and software distribution media, etc. It should be noted that the content contained in the computer-readable medium can be appropriately added or deleted according to the requirements of the legislation and patent practice in the jurisdiction. For example, in some jurisdictions, according to the legislation and patent practice, the computer-readable medium Does not include electrical carrier signals and telecommunication signals.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to illustrate the technical solutions of the present application, not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that it can still implement the foregoing The technical solutions recorded in the examples are modified, or some of the technical features are equivalently replaced; these modifications or replacements do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the application, and should be included in Within the scope of protection of this application.

Claims (20)

  1. 一种行人重识别的方法,其特征在于,包括:A method for pedestrian re-identification, which is characterized in that it includes:
    从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息;Acquiring a target image frame sequence from a pre-collected surveillance video stream, where the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
    从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息;Identifying characteristic information of the pedestrian to be identified from the image information, and determining all tag information corresponding to the characteristic information;
    利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息;Using a pre-trained pedestrian re-recognition model to perform pedestrian re-recognition on the target image frame sequence and all the tag information, and determine the target tag information of the pedestrian to be identified from all the tag information;
    基于所述目标标签信息确定所述待识别行人的重识别结果。The re-identification result of the pedestrian to be identified is determined based on the target tag information.
  2. 如权利要求1所述的行人重识别的方法,其特征在于,所述从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的标签信息,包括:The method for pedestrian re-identification according to claim 1, wherein the identifying characteristic information of the pedestrian to be identified from the image information and determining the label information corresponding to the characteristic information comprises:
    利用预先训练完成的特征信息识别模型,对所述待识别行人进行特征识别,得到所述待识别行人的特征信息;Using the pre-trained feature information recognition model to perform feature recognition on the pedestrian to be identified to obtain the feature information of the pedestrian to be identified;
    计算所述特征信息属于每类预设标签信息的概率值;Calculating the probability value of the feature information belonging to each type of preset label information;
    若所述特征信息属于第一类预设标签信息的概率值大于属于第二类预设标签信息的概率值,且所述特征信息属于所述第一类预设标签信息的概率值大于预设的概率阈值,则判定所述第一类预设标签信息为所述特征信息对应的标签信息,所述第二类预设标签信息为除所述第一类预设标签信息之外的任一类预设标签信息。If the probability value of the feature information belonging to the first type of preset tag information is greater than the probability value of belonging to the second type of preset tag information, and the probability value of the feature information belonging to the first type of preset tag information is greater than the preset The probability threshold of the first type is determined to be the tag information corresponding to the feature information, and the second type of preset tag information is any one other than the first type of preset tag information. Class preset label information.
  3. 如权利要求2所述的行人重识别的方法,其特征在于,所述计算所述特征信息属于每类预设标签信息的概率值,包括:The method for pedestrian re-identification according to claim 2, wherein said calculating the probability value of said characteristic information belonging to each type of preset label information comprises:
    利用预设的概率归一化公式计算所述特征信息属于每类预设标签信息的概率值;所述预设的概率归一化公式为:A preset probability normalization formula is used to calculate the probability value of the feature information belonging to each type of preset label information; the preset probability normalization formula is:
    Figure PCTCN2019119860-appb-100001
    Figure PCTCN2019119860-appb-100001
    其中,p i表示所述特征信息属于第i类预设标签信息的概率值,K表示预设标签信息的类别总数,
    Figure PCTCN2019119860-appb-100002
    表示所述特征信息属于第i类预设标签信息的对数概 率值。
    Wherein, p i represents the probability value of the feature information belonging to the i-th type of preset label information, and K represents the total number of types of preset label information,
    Figure PCTCN2019119860-appb-100002
    Indicates the log probability value of the feature information belonging to the i-th type of preset label information.
  4. 如权利要求1所述的行人重识别的方法,其特征在于,在所述利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所述标签信息进行重识别,以完成对所述待识别行人的重识别之前,包括:The method for pedestrian re-identification according to claim 1, wherein the target image frame sequence and the label information are respectively re-identified in the pedestrian re-identification model completed by pre-training to complete the re-identification of the target image frame sequence and the label information. Before re-identification of pedestrians to be identified, including:
    采集第一预设数量的训练样本,每个所述训练样本包含待识别行人的图像以及所述待识别行人对应的所有预设标签信息;Collecting a first preset number of training samples, each of the training samples containing an image of a pedestrian to be identified and all preset label information corresponding to the pedestrian to be identified;
    利用所述训练样本训练预先建立的机器学习模型进行训练,得到训练之后的机器学习模型;Use the training samples to train a pre-established machine learning model for training, to obtain a trained machine learning model;
    对训练之后的所述机器学习模型进行模型准确性测试;Performing a model accuracy test on the machine learning model after training;
    若对所述训练之后的所述机器学习模型的准确性测试通过,则判定训练之后的所述机器学习模型为所述行人重识别模型;If the accuracy test of the machine learning model after the training is passed, it is determined that the machine learning model after the training is the pedestrian re-identification model;
    若对所述训练之后的所述机器学习模型的准确性测试不通过,则增加所述训练样本的数量,并返回执行利用所述训练样本训练预先建立的机器学习模型进行训练,得到所述行人重识别模型。If the accuracy test of the machine learning model after the training fails, increase the number of training samples, and return to perform training using the training sample training pre-established machine learning model to obtain the pedestrian Re-identify the model.
  5. 如权利要求4所述的行人重识别的方法,其特征在于,所述利用所述训练样本训练预先建立的机器学习模型,得到训练之后的机器学习模型,包括:The method for pedestrian re-identification according to claim 4, wherein the training a pre-established machine learning model using the training samples to obtain the trained machine learning model comprises:
    使用所述预先建立的机器学习模型重识别每个所述待识别行人对应的所有预设标签信息,得到每个所述待识别行人属于每类预设标签信息的概率,并确定每个所述待识别行人对应的概率最大的预设标签信息;Use the pre-established machine learning model to re-identify all the preset label information corresponding to each pedestrian to be identified, obtain the probability that each pedestrian to be identified belongs to each type of preset label information, and determine each The preset label information with the highest probability corresponding to the pedestrian to be identified;
    分别以每个所述待识别行人对应的所述概率最大的预设标签信息为训练所述机器学习模型的约束条件,迭代所述机器学习模型的预设参数;Respectively taking the preset label information with the highest probability corresponding to each of the pedestrians to be identified as the constraint conditions for training the machine learning model, and iterating the preset parameters of the machine learning model;
    若所述机器学习模型对应的损失函数值的变化率趋于稳定,则判定对所述机器学习模型的训练完成,得到所述行人重识别模型。If the rate of change of the loss function value corresponding to the machine learning model tends to be stable, it is determined that the training of the machine learning model is completed, and the pedestrian re-identification model is obtained.
  6. 如权利要求4所述的行人重识别的方法,其特征在于,所述预先训练完成的所述行人重识别模型的损失函数为:The method for pedestrian re-identification according to claim 4, wherein the loss function of the pedestrian re-identification model completed in the pre-training is:
    Figure PCTCN2019119860-appb-100003
    Figure PCTCN2019119860-appb-100003
    其中,among them,
    Figure PCTCN2019119860-appb-100004
    Figure PCTCN2019119860-appb-100004
    Figure PCTCN2019119860-appb-100005
    Figure PCTCN2019119860-appb-100005
    其中,N表示所述训练样本的总数,K表示预设标签信息的类别总数,p j表示当前样本属于第j类预设标签信息的概率值,y i为当前样本对应的真实标签信息,q i,j为p j的分布比例,N sc表示当前样本属于相似标签信息的数目,ε为平衡所述真实标签信息和所述相似标签信息的系数,
    Figure PCTCN2019119860-appb-100006
    表示当前样本对应的特征信息属于第j类预设标签信息的对数概率值,
    Figure PCTCN2019119860-appb-100007
    表示当前样本对应的特征信息属于第k类预设标签信息的对数概率值,
    Figure PCTCN2019119860-appb-100008
    表示输出概率大于预设概率阈值的前n个标签。
    Where N represents the total number of training samples, K represents the total number of categories of preset label information, p j represents the probability value of the current sample belonging to the j-th type of preset label information, y i is the true label information corresponding to the current sample, q i, j are the distribution ratios of p j , N sc represents the number of similar label information belonging to the current sample, and ε is a coefficient that balances the real label information and the similar label information,
    Figure PCTCN2019119860-appb-100006
    Indicates that the feature information corresponding to the current sample belongs to the log probability value of the j-th type of preset label information,
    Figure PCTCN2019119860-appb-100007
    Indicates that the feature information corresponding to the current sample belongs to the log probability value of the k-th type of preset label information,
    Figure PCTCN2019119860-appb-100008
    Represents the first n tags whose output probability is greater than the preset probability threshold.
  7. 如权利要求6所述的行人重识别的方法,其特征在于,所述对训练之后的机器学习模型进行模型准确性测试,包括:8. The method for pedestrian re-identification according to claim 6, wherein said performing a model accuracy test on the machine learning model after training comprises:
    将第二预设数量的测试样本输入训练之后的所述机器学习模型进行分析,确定训练之后的所述机器学习模型的损失函数的变化率;Input a second preset number of test samples into the machine learning model after training for analysis, and determine the rate of change of the loss function of the machine learning model after training;
    若所述变化率小于或者等于预设的变化率阈值,则判定对训练之后的所述机器学习模型的测试通过;If the change rate is less than or equal to the preset change rate threshold, it is determined that the test of the machine learning model after training passes;
    若所述变化率大于预设的变化率阈值,则判定对训练之后的所述机器学习模型的测试不通过。If the rate of change is greater than the preset rate of change threshold, it is determined that the test of the machine learning model after the training fails.
  8. 一种行人重识别装置,其特征在于,包括:A pedestrian re-identification device, which is characterized in that it comprises:
    获取模块,用于从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息;An acquiring module, configured to acquire a target image frame sequence from a pre-collected surveillance video stream, and the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
    第一确定模块,用于从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息;The first determining module is configured to identify the characteristic information of the pedestrian to be identified from the image information, and determine all tag information corresponding to the characteristic information;
    重识别模块,用于利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息;The re-recognition module is used to perform pedestrian re-recognition on the target image frame sequence and all the tag information using the pre-trained pedestrian re-recognition model, and determine the target tag of the pedestrian to be recognized from all the tag information information;
    第二确定模块,用于基于所述目标标签信息确定所述待识别行人的重识别结果。The second determination module is configured to determine the re-identification result of the pedestrian to be identified based on the target tag information.
  9. 如权利要求8所述的行人重识别装置,其特征在于,所述第一确定模块,包括:The pedestrian re-identification device according to claim 8, wherein the first determining module comprises:
    识别单元,用于利用预先训练完成的特征信息识别模型,对所述待识别行人进行特征识别,得到所述待识别行人的特征信息;A recognition unit, configured to use a feature information recognition model completed in advance to perform feature recognition on the pedestrian to be identified to obtain feature information of the pedestrian to be identified;
    计算单元,用于计算所述特征信息属于每类预设标签信息的概率值;A calculation unit, configured to calculate the probability value of the feature information belonging to each type of preset label information;
    第一判定单元,用于在若所述特征信息属于第一类预设标签信息的概率值大于属于第二类预设标签信息的概率值,且所述特征信息属于所述第一类预设标签信息的概率值大于预设的概率阈值,则判定所述第一类预设标签信息为所述特征信息对应的标签信息,所述第二类预设标签信息为除所述第一类预设标签信息之外的任一类预设标签信息。The first determining unit is configured to: if the probability value of the feature information belonging to the first type of preset label information is greater than the probability value of belonging to the second type of preset label information, and the feature information belongs to the first type of preset If the probability value of the tag information is greater than the preset probability threshold, it is determined that the first type of preset tag information is the tag information corresponding to the feature information, and the second type of preset tag information is except for the first type of preset tag information. Set any type of preset label information except label information.
  10. 如权利要求9所述的行人重识别装置,其特征在于,所述计算单元,包括:The pedestrian re-identification device according to claim 9, wherein the calculation unit comprises:
    利用预设的概率归一化公式计算所述特征信息属于每类预设标签信息的概率值;所述预设的概率归一化公式为:A preset probability normalization formula is used to calculate the probability value of the feature information belonging to each type of preset label information; the preset probability normalization formula is:
    Figure PCTCN2019119860-appb-100009
    Figure PCTCN2019119860-appb-100009
    其中,p i表示所述特征信息属于第i类预设标签信息的概率值,K表示预设标签信息的类别总数,
    Figure PCTCN2019119860-appb-100010
    表示所述特征信息属于第i类预设标签信息的对数概率值。
    Wherein, p i represents the probability value of the feature information belonging to the i-th type of preset label information, and K represents the total number of types of preset label information,
    Figure PCTCN2019119860-appb-100010
    Indicates the log probability value of the feature information belonging to the i-th type of preset label information.
  11. 一种终端,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现如下步骤:A terminal includes a memory, a processor, and a computer program stored in the memory and running on the processor, wherein the processor implements the following steps when the processor executes the computer program:
    从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息;Acquiring a target image frame sequence from a pre-collected surveillance video stream, where the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
    从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息;Identifying characteristic information of the pedestrian to be identified from the image information, and determining all tag information corresponding to the characteristic information;
    利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息;Using a pre-trained pedestrian re-recognition model to perform pedestrian re-recognition on the target image frame sequence and all the tag information, and determine the target tag information of the pedestrian to be identified from all the tag information;
    基于所述目标标签信息确定所述待识别行人的重识别结果。The re-identification result of the pedestrian to be identified is determined based on the target tag information.
  12. 如权利要求11所述的终端,其特征在于,所述从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的标签信息,包括:The terminal according to claim 11, wherein the identifying characteristic information of the pedestrian to be identified from the image information and determining the label information corresponding to the characteristic information comprises:
    利用预先训练完成的特征信息识别模型,对所述待识别行人进行特征识别,得到所述待识别行人的特征信息;Using the pre-trained feature information recognition model to perform feature recognition on the pedestrian to be identified to obtain the feature information of the pedestrian to be identified;
    计算所述特征信息属于每类预设标签信息的概率值;Calculating the probability value of the feature information belonging to each type of preset label information;
    若所述特征信息属于第一类预设标签信息的概率值大于属于第二类预设标签信息的概率值,且所述特征信息属于所述第一类预设标签信息的概率值大于预设的概率阈值,则判定所述第一类预设标签信息为所述特征信息对应的标签信息,所述第二类预设标签信息为除所述第一类预设标签信息之外的任一类预设标签信息。If the probability value of the feature information belonging to the first type of preset tag information is greater than the probability value of belonging to the second type of preset tag information, and the probability value of the feature information belonging to the first type of preset tag information is greater than the preset The probability threshold of the first type is determined to be the tag information corresponding to the feature information, and the second type of preset tag information is any one other than the first type of preset tag information. Class preset label information.
  13. 如权利要求12所述的终端,其特征在于,所述计算所述特征信息属于每类预设标签信息的概率值,包括:The terminal according to claim 12, wherein the calculating the probability value of the characteristic information belonging to each type of preset label information comprises:
    利用预设的概率归一化公式计算所述特征信息属于每类预设标签信息的概率值;所述预设的概率归一化公式为:A preset probability normalization formula is used to calculate the probability value of the feature information belonging to each type of preset label information; the preset probability normalization formula is:
    Figure PCTCN2019119860-appb-100011
    Figure PCTCN2019119860-appb-100011
    其中,p i表示所述特征信息属于第i类预设标签信息的概率值,K表示预设标签信息的类别总数,
    Figure PCTCN2019119860-appb-100012
    表示所述特征信息属于第i类预设标签信息的对数概率值。
    Wherein, p i represents the probability value of the feature information belonging to the i-th type of preset label information, and K represents the total number of types of preset label information,
    Figure PCTCN2019119860-appb-100012
    Indicates the log probability value of the feature information belonging to the i-th type of preset label information.
  14. 如权利要求11所述的终端,其特征在于,在所述利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所述标签信息进行重识别,以完成对所述待识别行人的重识别之前,包括:The terminal according to claim 11, wherein the target image frame sequence and the label information are respectively re-identified in the pedestrian re-recognition model completed using the pre-training to complete the re-identification of the pedestrian to be identified. Before re-identification, including:
    采集第一预设数量的训练样本,每个所述训练样本包含待识别行人的图像以及所述待识别行人对应的所有预设标签信息;Collecting a first preset number of training samples, each of the training samples containing an image of a pedestrian to be identified and all preset label information corresponding to the pedestrian to be identified;
    利用所述训练样本训练预先建立的机器学习模型进行训练,得到训练之后的机器学习模型;Use the training samples to train a pre-established machine learning model for training, to obtain a trained machine learning model;
    对训练之后的所述机器学习模型进行模型准确性测试;Performing a model accuracy test on the machine learning model after training;
    若对所述训练之后的所述机器学习模型的准确性测试通过,则判定训练之后的所述机器学习模型为所述行人重识别模型;If the accuracy test of the machine learning model after the training is passed, it is determined that the machine learning model after the training is the pedestrian re-identification model;
    若对所述训练之后的所述机器学习模型的准确性测试不通过,则增加所述训练样本的数量,并返回执行利用所述训练样本训练预先建立的机器学习模型进行训练,得到所述行人重识别模型。If the accuracy test of the machine learning model after the training fails, increase the number of training samples, and return to perform training using the training sample training pre-established machine learning model to obtain the pedestrian Re-identify the model.
  15. 如权利要求14所述的终端,其特征在于,所述利用所述训练样本训练预先建立的机器学习模型,得到训练之后的机器学习模型,包括:The terminal according to claim 14, wherein the training a pre-established machine learning model using the training samples to obtain a trained machine learning model comprises:
    使用所述预先建立的机器学习模型重识别每个所述待识别行人对应的所有预设标签信息,得到每个所述待识别行人属于每类预设标签信息的概率,并确定每个所述待识别行人对应的概率最大的预设标签信息;Use the pre-established machine learning model to re-identify all the preset label information corresponding to each pedestrian to be identified, obtain the probability that each pedestrian to be identified belongs to each type of preset label information, and determine each The preset label information with the highest probability corresponding to the pedestrian to be identified;
    分别以每个所述待识别行人对应的所述概率最大的预设标签信息为训练所述机器学习模型的约束条件,迭代所述机器学习模型的预设参数;Respectively taking the preset label information with the highest probability corresponding to each of the pedestrians to be identified as the constraint conditions for training the machine learning model, and iterating the preset parameters of the machine learning model;
    若所述机器学习模型对应的损失函数值的变化率趋于稳定,则判定对所述机器学习模型的训练完成,得到所述行人重识别模型。If the rate of change of the loss function value corresponding to the machine learning model tends to be stable, it is determined that the training of the machine learning model is completed, and the pedestrian re-identification model is obtained.
  16. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如下步骤:A computer-readable storage medium that stores a computer program, and is characterized in that, when the computer program is executed by a processor, the following steps are implemented:
    从预先采集的监控视频流中获取目标图像帧序列,所述目标图像帧序列中的目标图像帧中包含待识别行人的图像信息;Acquiring a target image frame sequence from a pre-collected surveillance video stream, where the target image frame in the target image frame sequence contains image information of a pedestrian to be identified;
    从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的所有标签信息;Identifying characteristic information of the pedestrian to be identified from the image information, and determining all tag information corresponding to the characteristic information;
    利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所有所述标签信息进行行人重识别,从所有所述标签信息中确定所述待识别行人的目标标签信息;Using a pre-trained pedestrian re-recognition model to perform pedestrian re-recognition on the target image frame sequence and all the tag information, and determine the target tag information of the pedestrian to be identified from all the tag information;
    基于所述目标标签信息确定所述待识别行人的重识别结果。The re-identification result of the pedestrian to be identified is determined based on the target tag information.
  17. 如权利要求16所述的计算机可读存储介质,其特征在于,所述从所述图像信息中识别出所述待识别行人的特征信息,并确定所述特征信息对应的标签信息,包括:15. The computer-readable storage medium of claim 16, wherein the identifying characteristic information of the pedestrian to be identified from the image information and determining the label information corresponding to the characteristic information comprises:
    利用预先训练完成的特征信息识别模型,对所述待识别行人进行特征识别,得到所述待识别行人的特征信息;Using the pre-trained feature information recognition model to perform feature recognition on the pedestrian to be identified to obtain the feature information of the pedestrian to be identified;
    计算所述特征信息属于每类预设标签信息的概率值;Calculating the probability value of the feature information belonging to each type of preset label information;
    若所述特征信息属于第一类预设标签信息的概率值大于属于第二类预设标签信息的概率值,且所述特征信息属于所述第一类预设标签信息的概率值大于预设的概率阈值,则判定所述第一类预设标签信息为所述特征信息对应的标签信息,所述第二类预设标签信息为除所述第一类预设标签信息之外的任一类预设标签信息。If the probability value of the feature information belonging to the first type of preset tag information is greater than the probability value of belonging to the second type of preset tag information, and the probability value of the feature information belonging to the first type of preset tag information is greater than the preset The probability threshold of the first type is determined to be the tag information corresponding to the feature information, and the second type of preset tag information is any one other than the first type of preset tag information. Class preset label information.
  18. 如权利要求17所述的计算机可读存储介质,其特征在于,所述计算所述特征信息属于每类预设标签信息的概率值,包括:18. The computer-readable storage medium of claim 17, wherein the calculating the probability value of the characteristic information belonging to each type of preset label information comprises:
    利用预设的概率归一化公式计算所述特征信息属于每类预设标签信息的概 率值;所述预设的概率归一化公式为:A preset probability normalization formula is used to calculate the probability value of the feature information belonging to each type of preset label information; the preset probability normalization formula is:
    Figure PCTCN2019119860-appb-100013
    Figure PCTCN2019119860-appb-100013
    其中,p i表示所述特征信息属于第i类预设标签信息的概率值,K表示预设标签信息的类别总数,
    Figure PCTCN2019119860-appb-100014
    表示所述特征信息属于第i类预设标签信息的对数概率值。
    Wherein, p i represents the probability value of the feature information belonging to the i-th type of preset label information, and K represents the total number of types of preset label information,
    Figure PCTCN2019119860-appb-100014
    It indicates that the feature information belongs to the log probability value of the i-th type of preset label information.
  19. 如权利要求16所述的计算机可读存储介质,其特征在于,在所述利用预先训练完成的行人重识别模型分别对所述目标图像帧序列和所述标签信息进行重识别,以完成对所述待识别行人的重识别之前,包括:16. The computer-readable storage medium of claim 16, wherein the target image frame sequence and the tag information are re-identified in the pedestrian re-identification model completed in advance to complete the re-identification of the target image frame sequence and the label information. Before re-identification of pedestrians to be identified, including:
    采集第一预设数量的训练样本,每个所述训练样本包含待识别行人的图像以及所述待识别行人对应的所有预设标签信息;Collecting a first preset number of training samples, each of the training samples containing an image of a pedestrian to be identified and all preset label information corresponding to the pedestrian to be identified;
    利用所述训练样本训练预先建立的机器学习模型进行训练,得到训练之后的机器学习模型;Use the training samples to train a pre-established machine learning model for training, to obtain a trained machine learning model;
    对训练之后的所述机器学习模型进行模型准确性测试;Performing a model accuracy test on the machine learning model after training;
    若对所述训练之后的所述机器学习模型的准确性测试通过,则判定训练之后的所述机器学习模型为所述行人重识别模型;If the accuracy test of the machine learning model after the training is passed, it is determined that the machine learning model after the training is the pedestrian re-identification model;
    若对所述训练之后的所述机器学习模型的准确性测试不通过,则增加所述训练样本的数量,并返回执行利用所述训练样本训练预先建立的机器学习模型进行训练,得到所述行人重识别模型。If the accuracy test of the machine learning model after the training fails, increase the number of training samples, and return to perform training using the training sample training pre-established machine learning model to obtain the pedestrian Re-identify the model.
  20. 如权利要求19所述的计算机可读存储介质,其特征在于,所述利用所述训练样本训练预先建立的机器学习模型,得到训练之后的机器学习模型,包括:19. The computer-readable storage medium of claim 19, wherein the training a pre-established machine learning model using the training samples to obtain the trained machine learning model comprises:
    使用所述预先建立的机器学习模型重识别每个所述待识别行人对应的所有预设标签信息,得到每个所述待识别行人属于每类预设标签信息的概率,并确定每个所述待识别行人对应的概率最大的预设标签信息;Use the pre-established machine learning model to re-identify all the preset label information corresponding to each pedestrian to be identified, obtain the probability that each pedestrian to be identified belongs to each type of preset label information, and determine each The preset label information with the highest probability corresponding to the pedestrian to be identified;
    分别以每个所述待识别行人对应的所述概率最大的预设标签信息为训练所述机器学习模型的约束条件,迭代所述机器学习模型的预设参数;Respectively taking the preset label information with the highest probability corresponding to each of the pedestrians to be identified as the constraint conditions for training the machine learning model, and iterating the preset parameters of the machine learning model;
    若所述机器学习模型对应的损失函数值的变化率趋于稳定,则判定对所述机器学习模型的训练完成,得到所述行人重识别模型。If the rate of change of the loss function value corresponding to the machine learning model tends to be stable, it is determined that the training of the machine learning model is completed, and the pedestrian re-identification model is obtained.
PCT/CN2019/119860 2019-11-01 2019-11-21 Person re-identification method and apparatus, and terminal and storage medium WO2021082118A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911060337.0A CN111027378B (en) 2019-11-01 2019-11-01 Pedestrian re-identification method, device, terminal and storage medium
CN201911060337.0 2019-11-01

Publications (1)

Publication Number Publication Date
WO2021082118A1 true WO2021082118A1 (en) 2021-05-06

Family

ID=70204935

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/119860 WO2021082118A1 (en) 2019-11-01 2019-11-21 Person re-identification method and apparatus, and terminal and storage medium

Country Status (2)

Country Link
CN (1) CN111027378B (en)
WO (1) WO2021082118A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113343810A (en) * 2021-05-28 2021-09-03 国家计算机网络与信息安全管理中心 Pedestrian re-recognition model training and recognition method and device based on time sequence diversity and correlation
CN113361603A (en) * 2021-06-04 2021-09-07 北京百度网讯科技有限公司 Training method, class recognition device, electronic device and storage medium
CN114612752A (en) * 2022-03-02 2022-06-10 捻果科技(深圳)有限公司 Method and system for intelligently reviewing recognition result of video analysis technology
CN114863224A (en) * 2022-07-05 2022-08-05 深圳比特微电子科技有限公司 Training method, image quality detection method, device and medium
CN115132278A (en) * 2022-05-27 2022-09-30 腾讯科技(深圳)有限公司 Method, device, equipment and storage medium for modifying antibody species

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111950591B (en) * 2020-07-09 2023-09-01 中国科学院深圳先进技术研究院 Model training method, interaction relation recognition device and electronic equipment
CN111860316B (en) * 2020-07-20 2024-03-19 上海汽车集团股份有限公司 Driving behavior recognition method, device and storage medium
CN113065495B (en) * 2021-04-13 2023-07-14 深圳技术大学 Image similarity calculation method, target object re-recognition method and system
CN113349105B (en) * 2021-06-01 2022-06-21 深圳市天和荣科技有限公司 Intelligent bird feeding method, electronic equipment, bird feeder and storage medium
CN113762425B (en) * 2021-11-09 2022-04-05 天津创启投资有限公司 Data screen display sliding door control system based on 5G intelligent space
CN114758485A (en) * 2022-04-21 2022-07-15 成都商汤科技有限公司 Alarm information processing method and device, computer equipment and storage medium
CN114821651B (en) * 2022-04-30 2023-11-10 苏州浪潮智能科技有限公司 Pedestrian re-recognition method, system, equipment and computer readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107301380A (en) * 2017-06-01 2017-10-27 华南理工大学 One kind is used for pedestrian in video monitoring scene and knows method for distinguishing again
US20180276499A1 (en) * 2017-03-24 2018-09-27 Disney Enterprises, Inc. One shot color calibrated metric learning for object re-identification
CN109784258A (en) * 2019-01-08 2019-05-21 华南理工大学 A kind of pedestrian's recognition methods again cut and merged based on Analysis On Multi-scale Features
CN110334687A (en) * 2019-07-16 2019-10-15 合肥工业大学 A kind of pedestrian retrieval Enhancement Method based on pedestrian detection, attribute study and pedestrian's identification

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109583297B (en) * 2018-10-25 2020-10-02 清华大学 Retina OCT volume data identification method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180276499A1 (en) * 2017-03-24 2018-09-27 Disney Enterprises, Inc. One shot color calibrated metric learning for object re-identification
CN107301380A (en) * 2017-06-01 2017-10-27 华南理工大学 One kind is used for pedestrian in video monitoring scene and knows method for distinguishing again
CN109784258A (en) * 2019-01-08 2019-05-21 华南理工大学 A kind of pedestrian's recognition methods again cut and merged based on Analysis On Multi-scale Features
CN110334687A (en) * 2019-07-16 2019-10-15 合肥工业大学 A kind of pedestrian retrieval Enhancement Method based on pedestrian detection, attribute study and pedestrian's identification

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
1 October 2018, article WU, YANCHENG: "Person Re-identification Based on Attribute Learning", pages: 1 - 79, XP055813131 *
WU, YANCHENG: "Person Re-Identification Using Attribute Priori Distribution", ACTA AUTOMATICA SINICA, KEXUE CHUBANSHE, BEIJING, CN, vol. 45, no. 5, 1 May 2019 (2019-05-01), CN, pages 953 - 964, XP055813123, ISSN: 0254-4156, DOI: 10.16383/j.aas.c170691 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113343810A (en) * 2021-05-28 2021-09-03 国家计算机网络与信息安全管理中心 Pedestrian re-recognition model training and recognition method and device based on time sequence diversity and correlation
CN113361603A (en) * 2021-06-04 2021-09-07 北京百度网讯科技有限公司 Training method, class recognition device, electronic device and storage medium
CN113361603B (en) * 2021-06-04 2024-05-10 北京百度网讯科技有限公司 Training method, category identification device, electronic device, and storage medium
CN114612752A (en) * 2022-03-02 2022-06-10 捻果科技(深圳)有限公司 Method and system for intelligently reviewing recognition result of video analysis technology
CN115132278A (en) * 2022-05-27 2022-09-30 腾讯科技(深圳)有限公司 Method, device, equipment and storage medium for modifying antibody species
CN115132278B (en) * 2022-05-27 2024-02-02 腾讯科技(深圳)有限公司 Antibody species transformation method, device, equipment and storage medium
CN114863224A (en) * 2022-07-05 2022-08-05 深圳比特微电子科技有限公司 Training method, image quality detection method, device and medium

Also Published As

Publication number Publication date
CN111027378B (en) 2023-08-18
CN111027378A (en) 2020-04-17

Similar Documents

Publication Publication Date Title
WO2021082118A1 (en) Person re-identification method and apparatus, and terminal and storage medium
WO2021077984A1 (en) Object recognition method and apparatus, electronic device, and readable storage medium
CN109583332B (en) Face recognition method, face recognition system, medium, and electronic device
US20210342643A1 (en) Method, apparatus, and electronic device for training place recognition model
WO2021082635A1 (en) Region of interest detection method and apparatus, readable storage medium and terminal device
CN110532970B (en) Age and gender attribute analysis method, system, equipment and medium for 2D images of human faces
US20180357891A1 (en) Method of processing passage record and device
CN108280477A (en) Method and apparatus for clustering image
CN109145766A (en) Model training method, device, recognition methods, electronic equipment and storage medium
CN104484658A (en) Face gender recognition method and device based on multi-channel convolution neural network
CN105447532A (en) Identity authentication method and device
CN112307886A (en) Pedestrian re-identification method and device
CN110647938B (en) Image processing method and related device
CN108596110A (en) Image-recognizing method and device, electronic equipment, storage medium
CN110489659A (en) Data matching method and device
CN111582342A (en) Image identification method, device, equipment and readable storage medium
CN111368768A (en) Human body key point-based employee gesture guidance detection method
CN111723749A (en) Method, system and equipment for identifying wearing of safety helmet
CN102722732B (en) Image set matching method based on data second order static modeling
CN115862055A (en) Pedestrian re-identification method and device based on comparison learning and confrontation training
CN113191478A (en) Training method, device and system of neural network model
CN111275058B (en) Safety helmet wearing and color identification method and device based on pedestrian re-identification
CN111753618A (en) Image recognition method and device, computer equipment and computer readable storage medium
CN111626212A (en) Method and device for identifying object in picture, storage medium and electronic device
CN108733784B (en) Teaching courseware recommendation method, device and equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19950634

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 27.09.2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19950634

Country of ref document: EP

Kind code of ref document: A1