CN110490058A - Training method, device, system and the computer-readable medium of pedestrian detection model - Google Patents

Training method, device, system and the computer-readable medium of pedestrian detection model Download PDF

Info

Publication number
CN110490058A
CN110490058A CN201910615436.4A CN201910615436A CN110490058A CN 110490058 A CN110490058 A CN 110490058A CN 201910615436 A CN201910615436 A CN 201910615436A CN 110490058 A CN110490058 A CN 110490058A
Authority
CN
China
Prior art keywords
detection block
training
error
pedestrian
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910615436.4A
Other languages
Chinese (zh)
Other versions
CN110490058B (en
Inventor
胡立
孙培泽
李伯勋
俞刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Maigewei Technology Co Ltd
Original Assignee
Beijing Maigewei Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Maigewei Technology Co Ltd filed Critical Beijing Maigewei Technology Co Ltd
Priority to CN201910615436.4A priority Critical patent/CN110490058B/en
Publication of CN110490058A publication Critical patent/CN110490058A/en
Application granted granted Critical
Publication of CN110490058B publication Critical patent/CN110490058B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition

Abstract

The present invention provides training method, device, system and the computer-readable mediums of a kind of pedestrian detection model, the training method includes: that training image is input to neural network, to generate the predictive information about the target object in the training image, the predictive information includes detection block position, detection block weight and detection block score, wherein, the detection block weight indicates the similitude of target object and background in the detection block, and similitude is higher, then detection block weight is lower;The first error in classification between the detection block score and score true value is calculated, and weighting classification error is calculated according to the detection block weight and first error in classification;Network parameter at least based on neural network described in the weighting classification error update.The detection block weight that the present invention reduces pedestrian sample similar with background automatically in the training process of pedestrian's detection model greatly improves the precision of pedestrian detection to reduce the adverse effect for easily obscuring sample to network parameter.

Description

Training method, device, system and the computer-readable medium of pedestrian detection model
Technical field
The present invention relates to pedestrian detection technology fields, relate more specifically to training method, the dress of a kind of pedestrian detection model It sets, system and computer-readable medium.
Background technique
Pedestrian detection has a wide range of applications in the fields such as security protection and automatic Pilot, and the purpose is to the handles from image or video The position of pedestrian, which searches out, to be come.Pedestrian detection is the basis of a lot of other visual tasks, for example, pedestrian identifies again, pedestrian tracking and Pedestrian's action recognition etc..Due to there is many and quite similar background foreign matter of pedestrian's appearance in pedestrian's scene, pedestrian is caused to examine Examining system will appear the situation of error detection in these scenes, reduce the accuracy of pedestrian detection.In security protection or automatically In Driving Scene, the testing result of mistake will lead to serious consequence, thus need more accurate detection system, with reduce with The similar foreign matter of pedestrian is to interference caused by pedestrian detection.
Summary of the invention
To solve the above-mentioned problems, the invention proposes a kind of training sides of pedestrian detection model based on weight self-regulation Case.The training program proposed by the present invention about pedestrian detection model is briefly described below, more details will be attached in subsequent combination Figure is described in a specific embodiment.
According to embodiments of the present invention on the one hand, a kind of training method of pedestrian detection model is provided, which comprises
Training image is input to neural network, to generate the prediction letter about the target object in the training image Breath, the predictive information includes detection block position, detection block weight and detection block score, wherein the detection block weight indicates The similitude of target object and background in the detection block, the similitude is higher, then the detection block weight is lower;
Calculate the first error in classification between the detection block score and score true value, and according to the detection block weight and First error in classification calculates weighting classification error;
Network parameter at least based on neural network described in the weighting classification error update.
In one embodiment, the weighting classification error is multiplying for the detection block weight and first error in classification Product.
In one embodiment, the method also includes: calculate second between the detection block weight and weight true value Error in classification;And the network parameter is updated based on second error in classification.
In one embodiment, the method also includes: calculate the position between the detection block position and position true value Error;And the network parameter is updated based on the location error.
In one embodiment, the predictive information for generating the training objective in the training image includes: based on institute It states neural network and feature extraction is carried out to the training image, to generate the characteristic pattern of the training image;According to the feature Figure generates the predictive information.
According to embodiments of the present invention on the other hand, a kind of training device of pedestrian detection model, pedestrian's inspection are provided Survey model training device include:
Prediction module, for training image to be input to neural network, to generate about the target in the training image The predictive information of object, the predictive information include detection block position, detection block weight and detection block score, wherein the inspection Surveying frame weight indicates the similitude of target object and background in the detection block, and the similitude is higher, then the detection block power It is again lower;
Error calculating module, for calculating the first error in classification between the detection block score and score true value, and root Weighting classification error is calculated according to the detection block weight and first error in classification;And
Training module, for being at least based on the weighting classification error update network parameter.
In one embodiment, the weighting classification error is multiplying for the detection block weight and first error in classification Product.
In one embodiment, described device further include: characteristic extracting module, for being based on the neural network to described Training image carries out feature extraction, to generate the characteristic pattern of the training image;Also, the prediction module is according to the feature Figure generates the predictive information.
According to embodiments of the present invention in another aspect, providing a kind of training system of pedestrian detection model, pedestrian's inspection The training system for surveying model includes storage device and processor, and the meter run by the processor is stored on the storage device Calculation machine program, the computer program execute pedestrian detection model described in any of the above embodiments when being run by the processor Training method.
Another aspect according to embodiments of the present invention, provides a kind of computer-readable medium, the computer-readable medium On be stored with computer program, the computer program executes the instruction of pedestrian detection model described in any of the above embodiments at runtime Practice method.
Training method, device, system and the computer-readable medium of the pedestrian detection model of the embodiment of the present invention are in pedestrian Automatically the detection block weight of pedestrian sample similar with background is reduced in the training process of detection model, so as to avoid easily obscuring Sample greatly improves the precision of pedestrian detection to the adverse effect of network parameter.
Detailed description of the invention
The embodiment of the present invention is described in more detail in conjunction with the accompanying drawings, the above and other purposes of the present invention, Feature and advantage will be apparent.Attached drawing is used to provide to further understand the embodiment of the present invention, and constitutes explanation A part of book, is used to explain the present invention together with the embodiment of the present invention, is not construed as limiting the invention.In the accompanying drawings, Identical reference label typically represents same parts or step.
Fig. 1 shows for realizing training method, device, system and the meter of pedestrian detection model according to an embodiment of the present invention The schematic block diagram of the exemplary electronic device of calculation machine readable medium;
Fig. 2 shows the schematic flow charts of the training method of pedestrian detection model according to an embodiment of the present invention;
Fig. 3 shows the frame diagram of the neural network of the training method of pedestrian detection model according to an embodiment of the present invention;
Fig. 4 shows the schematic block diagram of the training device of pedestrian detection model according to an embodiment of the present invention;And
Fig. 5 shows the schematic block diagram of the training system of pedestrian detection model according to an embodiment of the present invention.
Specific embodiment
In order to enable the object, technical solutions and advantages of the present invention become apparent, root is described in detail below with reference to accompanying drawings According to example embodiments of the present invention.Obviously, described embodiment is only a part of the embodiments of the present invention, rather than this hair Bright whole embodiments, it should be appreciated that the present invention is not limited by example embodiment described herein.Based on described in the present invention The embodiment of the present invention, those skilled in the art's obtained all other embodiment in the case where not making the creative labor It should all fall under the scope of the present invention.
Firstly, referring to Fig.1 come describe the training method of the pedestrian detection model for realizing the embodiment of the present invention, device, System and the exemplary electronic device of computer-readable medium 100.
As shown in Figure 1, electronic equipment 100 include one or more processors 102, it is one or more storage device 104, defeated Enter device 106, output device 108 and imaging sensor 110, these components pass through bus system 112 and/or other forms The interconnection of bindiny mechanism's (not shown).It should be noted that the component and structure of electronic equipment 100 shown in FIG. 1 are only exemplary, and Unrestricted, as needed, the electronic equipment also can have other assemblies and structure.
The processor 102 can be central processing unit (CPU) or have data-handling capacity and/or instruction execution The processing unit of the other forms of ability, and the other components that can control in the electronic equipment 100 are desired to execute Function.
The storage device 104 may include one or more computer program products, and the computer program product can To include various forms of computer readable storage mediums, such as volatile memory and/or nonvolatile memory.It is described easy The property lost memory for example may include random access memory (RAM) and/or cache memory (cache) etc..It is described non- Volatile memory for example may include read-only memory (ROM), hard disk, flash memory etc..In the computer readable storage medium On can store one or more computer program instructions, processor 102 can run described program instruction, to realize hereafter institute The client functionality (realized by processor) in the embodiment of the present invention stated and/or other desired functions.In the meter Can also store various application programs and various data in calculation machine readable storage medium storing program for executing, for example, the application program use and/or The various data etc. generated.
The input unit 106 can be the device that user is used to input instruction, and may include keyboard, mouse, wheat One or more of gram wind and touch screen etc..
The output device 108 can export various information (such as image or sound) to external (such as user), and It may include one or more of display, loudspeaker etc..
Described image sensor 110 can be shot the desired image of user (such as photo, video etc.), and will be captured Image be stored in the storage device 104 for other components use.
When note that the component and structure of electronic equipment shown in FIG. 1 100 are only exemplary, although electronics shown in fig. 1 Equipment 100 include multiple and different devices, but as needed, some of which device can not be it is necessary, therein one The quantity of a little devices can be more etc., and the present invention does not limit this.
Illustratively, the training method for realizing pedestrian detection model according to an embodiment of the present invention, recognition methods, dress It sets and the exemplary electronic device of processing equipment may be implemented as the intelligent terminals such as smart phone, tablet computer, computer.
In the following, reference Fig. 2 to be described to the training method 200 of pedestrian detection model according to an embodiment of the present invention.
As shown in Fig. 2, training image is input to neural network in step S210, the instruction in the training image is obtained Practice the predictive information of target, the predictive information includes detection block position, detection block weight and detection block score, wherein described Detection block weight indicates the similitude of target object and background in the detection block, and the similitude is higher, then the detection block Weight is lower.Conversely, similitude is lower, then the detection block weight is higher.
Wherein, the training image can be any candid photograph image comprising many pedestrians.Training set can be constructed in advance, Include several training images in the training set, all includes one or more pedestrians, the position of pedestrian on usual every width training image It sets and is marked in advance with bounding box, referred to as true frame.Based on the mark carried out in advance, each detection block can be obtained Position true value, score true value and weight true value.
The neural network includes but is not limited to convolutional neural networks, can be in existing various target detection nerve nets It is improved on the basis of network, such as the neural networks such as Faster R-CNN, RetinaNet, R-CNN, Fast R-CNN.
The neural network specifically includes feature extraction network with pedestrian detection network.Wherein, feature extraction network is used for The feature in original image is extracted, the characteristic pattern of original image is exported.Pedestrian detection network is used to carry out pedestrian based on characteristic pattern Detection, output test result.
Specifically, training image is inputted into neural network first, feature extraction is carried out, to generate the feature of a variety of scales Figure.Can by such as HOG (Histogram of Oriented Gradient, histograms of oriented gradients) feature extraction algorithm, LBP (Local Binary Pattern, local binary patterns) feature extraction algorithm, Haar-like feature extraction algorithm etc. is calculated Method carries out feature extraction processing to target image, obtains the characteristic pattern of target image.In practice, feature extraction network is not necessarily Need to rebuild completely, can also directly by some pre-training, the convolutional neural networks that are used for image classification task delete Part after being eventually used for the full articulamentum of classification output is as feature extraction network.The structure of feature extraction network and specific Feature extraction mode be not limited herein.
Then, it is based on the characteristic pattern, the predictive information about target object is determined, specifically includes the detection of target object Frame position, detection block weight and detection block score, wherein if there is pedestrian, detection block position indicates to surround this pedestrian's Rectangle frame, detection block score represent the probability in the detection block there are pedestrian.In one embodiment, detection block weight can also To be indicated by detection block score, referring specifically to hereafter.
As shown in figure 3, existing pedestrian detection model only exports the detection block position of each detection block in the training process With detection block score, and the embodiment of the present invention is while detection block position and the detection block score for exporting each detection block, also Detection block weight is exported, for identifying the probability in the detection block there are target object, namely indicates the target in the detection block With the similitude of background.
Wherein it is possible to the predictive information of target object is obtained using various feasible algorithm of target detection, it is finally defeated respectively Detection block position, detection block weight and detection block score out.Specifically, general detection model is pre- using two classifier difference Detection block position and detection block score are surveyed, for example, RPN network (Region Proposal Network, Area generation network) can Can predict object boundary and target fractional simultaneously in each position;In the present embodiment, other than detection block position, The probability in detection block there are pedestrian also is calculated separately using two the same or similar classifiers, and respectively as detection block point Several and detection block weight.
Specifically, larger in the presence of a possibility that obscuring if the similitude of target and background in detection block is higher, because There are the probability of target object is lower in this detection block;On the contrary, if the similitude of target and background in detection block compared with Low, then there are the probability of target object is higher in the detection block.Therefore, if the pedestrian of this detection block position and back Scape foreign matter is similar, then network is intended to export low probability, so that the detection block weight of the detection block at the position reduces, in It is that the subsequent weighting classification error measured is lower, that is, reduces the weight of this detection block when training, easily obscures sample to reduce This adverse effect to network parameter.
It is understood that detection block weight and detection block score are all used to indicate to deposit at position that the detection block is indicated In the probability of target object.Therefore in hands-on, similar or identical classifier can be used and export detection block power respectively Weight and detection block score, that is to say, that the probability one of the two classifiers output is another as detection as detection block score Frame weight.
In step S220, the first error in classification between the detection block score and score true value is calculated, and according to described Detection block weight and first error in classification calculate weighting classification error.
As described above, detection block weight represents the probability in the detection block there are pedestrian, the detection block is further indicated In pedestrian and background similitude, if the pedestrian of this position is similar with background foreign matter, network be intended to export low probability, So that the detection block weight of the detection block reduces.It include detection block weight and the first error in classification two in weighting classification error It is smaller to calculate resulting weighting classification error when detection block weight is lower for a factor, easily obscures sample to instruction to reduce Practice the influence of result.In one embodiment, the weighting classification error is that the detection block weight and first classification miss The product of difference.
It in addition to this, further include the second error in classification calculated between the detection block weight and weight true value, Yi Jiji Calculate the location error between the detection block position and position true value, when detection block weight by detection block score to indicate when, institute Stating weight true value can also be indicated by score true value;Second error in classification is the training error of detection block weight, location error For the training error of detection block position.
In step S230, it is at least based on the weighting classification error update network parameter.
Specifically, the network parameter of adjustable initial neural network, to minimize the value of weighting classification error as far as possible.Its In, network parameter may include weight and the number of iterations of each layer of neural network etc..It in addition to this, further include based on institute It states the second error in classification and location error updates the network parameter.
Due in detection block pedestrian and context similarity it is higher, then detection block weight is lower, and weighting classification error is opposite Smaller, then influence of the pedestrian sample at this to training result is smaller.
Specifically, can use by backpropagation (BP, back-propagation), stochastic gradient descent (SGD, Stochastic gradient descent) or gradient passback scheduling algorithm progress end-to-end (end-to-end) training, Lai Youhua Parameters in model.It is every handled a width training image after, it can be determined that whether meet trained termination condition, if meeting Condition then terminates to train, and network parameter at this time can be used as the parameter of trained pedestrian detection model.If being unsatisfactory for condition S210 is then back to continue to train.The condition that training terminates may include that the training image in training set has been used up, lost letter Number has been restrained etc..
The training method of pedestrian detection model according to an embodiment of the present invention is described above exemplarily.Illustratively, The training method of pedestrian detection model according to an embodiment of the present invention can with memory and processor unit or It is realized in person's system.
In addition, the training method of pedestrian detection model according to an embodiment of the present invention is deployed to intelligent hand in which can be convenient In the mobile devices such as machine, tablet computer, personal computer.Alternatively, the instruction of pedestrian detection model according to an embodiment of the present invention Server end (or cloud) can also be deployed in by practicing method.Alternatively, the instruction of pedestrian detection model according to an embodiment of the present invention Practicing method can also be deployed at server end (or cloud) and personal terminal with being distributed.
Based on above description, training method according to an embodiment of the present invention in the training process of pedestrian's detection model from The dynamic detection block weight for reducing pedestrian sample similar with background, to reduce the unfavorable shadow for easily obscuring sample to network parameter It rings, greatly improves the precision of pedestrian detection.
Show included by the training method for describing pedestrian detection model according to an embodiment of the present invention above exemplarily Example property steps flow chart.
The training device of the pedestrian detection model of another aspect of the present invention offer is described below with reference to Fig. 4.Fig. 4 shows root According to the schematic block diagram of the training device 400 of the pedestrian detection model of the embodiment of the present invention.
As shown in figure 4, the training device 300 of pedestrian detection model according to an embodiment of the present invention include prediction module 410, Error calculating module 420 and training module 430.The modules can execute pedestrian's inspection above in conjunction with Fig. 2 description respectively Survey each step/function of the training method of model.
Prediction module 410 is used to training image being input to neural network, obtains the training objective in the training image Predictive information, the predictive information includes detection block position, detection block weight and detection block score, wherein the detection block Weight indicates the similitude of target object and background in the detection block, and the similitude is higher, then the detection block weight is got over It is low.Conversely, similitude is lower, then the detection block weight is higher.
Wherein, the training image can be any candid photograph image comprising many pedestrians.Training set can be constructed in advance, Include several training images in the training set, all includes one or more pedestrians, the position of pedestrian on usual every width training image It sets and is marked in advance with bounding box, referred to as true frame.Based on the mark carried out in advance, the position of detection block can be obtained Set true value, score true value and weight true value.
The neural network includes but is not limited to convolutional neural networks, can be in existing various target detection nerve nets It is improved on the basis of network, such as the neural networks such as Faster R-CNN, RetinaNet, R-CNN, Fast R-CNN.
The neural network specifically includes feature extraction network with pedestrian detection network.Wherein, feature extraction network is used for The feature in original image is extracted, the characteristic pattern of original image is exported.Pedestrian detection network is used to carry out pedestrian based on characteristic pattern Detection, output test result.
Specifically, training image is inputted into neural network first, feature extraction is carried out, to generate the feature of a variety of scales Figure.Can by such as HOG (Histogram of Oriented Gradient, histograms of oriented gradients) feature extraction algorithm, LBP (Local Binary Pattern, local binary patterns) feature extraction algorithm, Haar-like feature extraction algorithm etc. mentions It takes algorithm to carry out feature extraction processing to target image, obtains the characteristic pattern of target image.In practice, feature extraction network is not Centainly need to rebuild completely, can also directly by some pre-training, be used for the convolutional neural networks of image classification task It deletes and is eventually used for the part after the full articulamentum of classification output as feature extraction network.The structure of feature extraction network and Specific feature extraction mode is not limited herein.
Then, it is based on the characteristic pattern, the predictive information about target object is determined, specifically includes the detection of target object Frame position, detection block weight and detection block score, wherein if there is pedestrian, detection block position indicates to surround this pedestrian's Rectangle frame, detection block score represent the probability in the detection block there are pedestrian.In one embodiment, detection block weight can also To be indicated by detection block score, referring specifically to hereafter.
As shown in figure 3, existing pedestrian detection model only exports the detection block position of each detection block in the training process With detection block score, and the embodiment of the present invention is while detection block position and the detection block score for exporting each detection block, also Detection block weight is exported, for identifying the probability in the detection block there are target object, namely indicates the target in the detection block With the similitude of background.
Wherein it is possible to the predictive information of target object is obtained using various feasible algorithm of target detection, it is finally defeated respectively Detection block position, detection block weight and detection block score out.Specifically, general detection model is pre- using two classifier difference Detection block position and detection block score are surveyed, for example, RPN network (Region Proposal Network, Area generation network) can Can predict object boundary and target fractional simultaneously in each position.In the present embodiment, other than detection block position, The probability in detection block there are pedestrian also is calculated separately using two the same or similar classifiers, and respectively as detection block point Several and detection block weight.
Specifically, larger in the presence of a possibility that obscuring if the similitude of target and background in detection block is higher, because There are the probability of target object is lower in this detection block;On the contrary, if the similitude of target and background in detection block compared with Low, then there are the probability of target object is higher in the detection block.Therefore, if the pedestrian of this detection block position and back Scape foreign matter is similar, then network is intended to export low probability, so that the detection block weight of the detection block at the position reduces, in It is that the subsequent weighting classification error measured is lower, that is, reduces the weight of this detection block when training, easily obscures sample to reduce This adverse effect to network parameter.
It is understood that detection block weight and detection block score are all used to indicate to deposit at position that the detection block is indicated In the probability of target object.Therefore in hands-on, similar or identical classifier can be used and export detection block power respectively Weight and detection block score, that is to say, that the probability one of the two classifiers output is another as detection as detection block score Frame weight.
Error calculating module 420 is used to calculate the first error in classification between the detection block score and score true value, and Weighting classification error is calculated according to the detection block weight and first error in classification.
As described above, detection block weight represents the probability in the detection block there are pedestrian, the detection block is further indicated In pedestrian and background similitude, if the pedestrian of this position is similar with background foreign matter, network be intended to export low probability, So that the detection block weight of the detection block reduces.It include detection block weight and the first error in classification two in weighting classification error It is smaller to calculate resulting weighting classification error when detection block weight is lower for a factor, easily obscures sample to instruction to reduce Practice the influence of result.In one embodiment, the weighting classification error is that the detection block weight and first classification miss The product of difference.
It in addition to this, further include the second error in classification calculated between the detection block weight and weight true value, Yi Jiji Calculate the location error between the detection block position and position true value, when detection block weight by detection block score to indicate when, institute Stating weight true value can also be indicated by score true value;Second error in classification is the training error of detection block weight, location error For the training error of detection block position.
Training module 430 is for being at least based on the weighting classification error update network parameter.
Specifically, the network parameter of adjustable initial neural network, to minimize the value of weighting classification error as far as possible.Its In, network parameter may include weight and the number of iterations of each layer of neural network etc..It in addition to this, further include based on institute It states the second error in classification and location error updates the network parameter.
Due in detection block pedestrian and context similarity it is higher, then detection block weight is lower, and weighting classification error is opposite Smaller, then influence of the pedestrian sample at this to training result is smaller.
Specifically, can use by backpropagation (BP, back-propagation), stochastic gradient descent (SGD, Stochastic gradient descent) or gradient passback scheduling algorithm progress end-to-end (end-to-end) training, Lai Youhua Parameters in model.It is every handled a width training image after, it can be determined that whether meet trained termination condition, if meeting Condition then terminates to train, and network parameter at this time can be used as the parameter of trained pedestrian detection model.If being unsatisfactory for condition Then continue to train.The condition that training terminates may include that the training image in training set has been used up, loss function has been restrained Etc..
The training device of pedestrian detection model according to an embodiment of the present invention is described above exemplarily.Illustratively, The training device of pedestrian detection model according to an embodiment of the present invention can with memory and processor unit or It is realized in person's system.
In addition, the training device of pedestrian detection model according to an embodiment of the present invention is deployed to intelligent hand in which can be convenient In the mobile devices such as machine, tablet computer, personal computer.Alternatively, the instruction of pedestrian detection model according to an embodiment of the present invention Server end (or cloud) can also be deployed in by practicing device.Alternatively, the instruction of pedestrian detection model according to an embodiment of the present invention Practicing device can also be deployed at server end (or cloud) and personal terminal with being distributed.
Based on above description, training device according to an embodiment of the present invention in the training process of pedestrian's detection model from The dynamic detection block weight for reducing pedestrian sample similar with background, to reduce the unfavorable shadow for easily obscuring sample to network parameter It rings, greatly improves the precision of pedestrian detection.
Fig. 5 shows the schematic block diagram of the training system 500 of pedestrian detection model according to an embodiment of the present invention.Pedestrian The training system 500 of detection model includes storage device 510 and processor 520.
Wherein, the training method that storage device 510 stores for realizing pedestrian detection model according to an embodiment of the present invention In corresponding steps program code.Program code of the processor 520 for being stored in Running storage device 510, to execute root According to the corresponding steps of the training method of the pedestrian detection model of the embodiment of the present invention, and for realizing according to embodiments of the present invention Pedestrian detection model training device in corresponding module.
In one embodiment, the training of pedestrian detection model is made when said program code is run by processor 520 System 500 executes following steps:
Training image is input to neural network, to generate the prediction letter about the target object in the training image Breath, the predictive information includes detection block position, detection block weight and detection block score, wherein the detection block weight indicates The similitude of target object and background in the detection block, the similitude is higher, then the detection block weight is lower;
Calculate the first error in classification between the detection block score and score true value, and according to the detection block weight and First error in classification calculates weighting classification error;
Network parameter at least based on neural network described in the weighting classification error update.
In one embodiment, the weighting classification error is multiplying for the detection block weight and first error in classification Product.
In one embodiment, the instruction of pedestrian detection model is also made when said program code is run by processor 520 Practice system 500 to execute: calculating the second error in classification between the detection block weight and weight true value;And it is based on described second Error in classification updates the network parameter.
In one embodiment, the instruction of pedestrian detection model is also made when said program code is run by processor 520 Practice system 500 to execute: calculating the location error between the detection block position and position true value;And it is based on the location error Update the network parameter.
In one embodiment, the predictive information for generating the training objective in the training image includes: based on institute It states neural network and feature extraction is carried out to the training image, to generate the characteristic pattern of the training image;According to the feature Figure generates the predictive information.
In addition, according to embodiments of the present invention, additionally providing a kind of storage medium, storing program on said storage Instruction, when described program instruction is run by computer or processor for executing the pedestrian detection model of the embodiment of the present invention The corresponding steps of training method, and for realizing the phase in the training device of pedestrian detection model according to an embodiment of the present invention Answer module.The storage medium for example may include the storage card of smart phone, the storage unit of tablet computer, personal computer Hard disk, read-only memory (ROM), Erasable Programmable Read Only Memory EPROM (EPROM), portable compact disc read-only memory (CD-ROM), any combination of USB storage or above-mentioned storage medium.The computer readable storage medium can be one Any combination of a or multiple computer readable storage mediums.
In one embodiment, the computer program instructions may be implemented real according to the present invention when being run by computer Each functional module of the training device of the pedestrian detection model of example is applied, and/or can be executed according to embodiments of the present invention Pedestrian detection model training method.
In one embodiment, the computer program instructions make computer or place when being run by computer or processor It manages device and executes following steps:
Training image is input to neural network, to generate the prediction letter about the target object in the training image Breath, the predictive information includes detection block position, detection block weight and detection block score, wherein the detection block weight indicates The similitude of target object and background in the detection block, the similitude is higher, then the detection block weight is lower;
Calculate the first error in classification between the detection block score and score true value, and according to the detection block weight and First error in classification calculates weighting classification error;
Network parameter at least based on neural network described in the weighting classification error update.
In one embodiment, the weighting classification error is multiplying for the detection block weight and first error in classification Product.
In one embodiment, the computer program instructions also make when being run by computer or processor computer or Processor executes: calculating the second error in classification between the detection block weight and weight true value;And it is based on described second point Network parameter described in class error update.
In one embodiment, the computer program instructions also make when being run by computer or processor computer or Processor executes: calculating the location error between the detection block position and position true value;And more based on the location error The new network parameter.
In one embodiment, the predictive information for generating the training objective in the training image includes: based on institute It states neural network and feature extraction is carried out to the training image, to generate the characteristic pattern of the training image;According to the feature Figure generates the predictive information.
Training method, device, system and the computer-readable medium of pedestrian detection model according to an embodiment of the present invention exist Automatically the detection block weight of pedestrian sample similar with background is reduced in the training process of pedestrian detection model, to reduce easily Sample is obscured to the adverse effect of network parameter, greatly improves the precision of pedestrian detection.
Although describing example embodiment by reference to attached drawing here, it should be understood that above example embodiment are only exemplary , and be not intended to limit the scope of the invention to this.Those of ordinary skill in the art can carry out various changes wherein And modification, it is made without departing from the scope of the present invention and spiritual.All such changes and modifications are intended to be included in appended claims Within required the scope of the present invention.
Those of ordinary skill in the art may be aware that list described in conjunction with the examples disclosed in the embodiments of the present disclosure Member and algorithm steps can be realized with the combination of electronic hardware or computer software and electronic hardware.These functions are actually It is implemented in hardware or software, the specific application and design constraint depending on technical solution.Professional technician Each specific application can be used different methods to achieve the described function, but this realization is it is not considered that exceed The scope of the present invention.
In several embodiments provided herein, it should be understood that disclosed device and method can pass through it Its mode is realized.For example, apparatus embodiments described above are merely indicative, for example, the division of the unit, only Only a kind of logical function partition, there may be another division manner in actual implementation, such as multiple units or components can be tied Another equipment is closed or is desirably integrated into, or some features can be ignored or not executed.
In the instructions provided here, numerous specific details are set forth.It is to be appreciated, however, that implementation of the invention Example can be practiced without these specific details.In some instances, well known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this specification.
Similarly, it should be understood that in order to simplify the present invention and help to understand one or more of the various inventive aspects, In To in the description of exemplary embodiment of the present invention, each feature of the invention be grouped together into sometimes single embodiment, figure, Or in descriptions thereof.However, the method for the invention should not be construed to reflect an intention that i.e. claimed The present invention claims features more more than feature expressly recited in each claim.More precisely, such as corresponding power As sharp claim reflects, inventive point is that the spy of all features less than some disclosed single embodiment can be used Sign is to solve corresponding technical problem.Therefore, it then follows thus claims of specific embodiment are expressly incorporated in this specific Embodiment, wherein each, the claims themselves are regarded as separate embodiments of the invention.
It will be understood to those skilled in the art that any combination pair can be used other than mutually exclusive between feature All features disclosed in this specification (including adjoint claim, abstract and attached drawing) and so disclosed any method Or all process or units of equipment are combined.Unless expressly stated otherwise, this specification (is wanted including adjoint right Ask, make a summary and attached drawing) disclosed in each feature can be replaced with an alternative feature that provides the same, equivalent, or similar purpose.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments In included certain features rather than other feature, but the combination of the feature of different embodiments mean it is of the invention Within the scope of and form different embodiments.For example, in detail in the claims, embodiment claimed it is one of any Can in any combination mode come using.
Various component embodiments of the invention can be implemented in hardware, or to run on one or more processors Software module realize, or be implemented in a combination thereof.It will be understood by those of skill in the art that can be used in practice Microprocessor or other suitable processors realize some or all function of some modules according to an embodiment of the present invention Energy.The present invention be also implemented as executing method as described herein some or all program of device (for example, Computer program and computer program product).It is such to realize that program of the invention can store on a computer-readable medium, Or it may be in the form of one or more signals.Such signal can be downloaded from an internet website to obtain, or It is provided on the carrier signal, or is provided in any other form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and ability Field technique personnel can be designed alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference symbol between parentheses should not be configured to limitations on claims.Word "comprising" does not exclude the presence of not Element or step listed in the claims.Word "a" or "an" located in front of the element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of several different elements and being come by means of properly programmed computer real It is existing.In the unit claims listing several devices, several in these devices can be through the same hardware branch To embody.The use of word first, second, and third does not indicate any sequence.These words can be explained and be run after fame Claim.
The above description is merely a specific embodiment or to the explanation of specific embodiment, protection of the invention Range is not limited thereto, and anyone skilled in the art in the technical scope disclosed by the present invention, can be easily Expect change or replacement, should be covered by the protection scope of the present invention.Protection scope of the present invention should be with claim Subject to protection scope.

Claims (10)

1. a kind of training method of pedestrian detection model, which is characterized in that the training method includes:
Training image is input to neural network, to generate the predictive information about the target object in the training image, institute Stating predictive information includes detection block position, detection block weight and detection block score, wherein the detection block weight indicates the inspection The similitude of target object and background in frame is surveyed, the similitude is higher, then the detection block weight is lower;
Calculate the first error in classification between the detection block score and score true value, and according to the detection block weight and described First error in classification calculates weighting classification error;
Network parameter at least based on neural network described in the weighting classification error update.
2. the training method of pedestrian detection model according to claim 1, which is characterized in that the weighting classification error is The product of the detection block weight and first error in classification.
3. the training method of pedestrian detection model according to claim 1, which is characterized in that further include:
Calculate the second error in classification between the detection block weight and weight true value;And
The network parameter is updated based on second error in classification.
4. the training method of pedestrian detection model according to claim 1, which is characterized in that further include:
Calculate the location error between the detection block position and position true value;And
The network parameter is updated based on the location error.
5. the training method of pedestrian detection model according to claim 1, which is characterized in that described to generate the training figure The predictive information of training objective as in includes:
Feature extraction is carried out to the training image based on the neural network, to generate the characteristic pattern of the training image;
The predictive information is generated according to the characteristic pattern.
6. a kind of training device of pedestrian detection model, which is characterized in that the training device of the pedestrian detection model includes:
Prediction module, for training image to be input to neural network, to generate about the target object in the training image Predictive information, the predictive information includes detection block position, detection block weight and detection block score, wherein the detection block Weight indicates the similitude of target object and background in the detection block, and the similitude is higher, then the detection block weight is got over It is low;
Error calculating module, for calculating the first error in classification between the detection block score and score true value, and according to institute It states detection block weight and first error in classification calculates weighting classification error;And
Training module, for the network parameter at least based on neural network described in the weighting classification error update.
7. the training device of pedestrian detection model according to claim 6, which is characterized in that the weighting classification error is The product of the detection block weight and first error in classification.
8. the training device of pedestrian detection model according to claim 6, which is characterized in that further include:
Characteristic extracting module, for carrying out feature extraction to the training image based on the neural network, to generate the instruction Practice the characteristic pattern of image;Also,
The prediction module generates the predictive information according to the characteristic pattern.
9. a kind of training system of pedestrian detection model, which is characterized in that the training system of the pedestrian detection model includes depositing Method for storing and processor are stored with the computer program run by the processor, the computer journey in the storage method Sequence executes the training side of the pedestrian detection model as described in any one of claim 1-5 when being run by the processor Method.
10. a kind of computer-readable medium, which is characterized in that be stored with computer program, institute on the computer-readable medium State the training method for the pedestrian detection model that computer program is executed at runtime as described in any one of claim 1-5.
CN201910615436.4A 2019-07-09 2019-07-09 Training method, device and system of pedestrian detection model and computer readable medium Active CN110490058B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910615436.4A CN110490058B (en) 2019-07-09 2019-07-09 Training method, device and system of pedestrian detection model and computer readable medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910615436.4A CN110490058B (en) 2019-07-09 2019-07-09 Training method, device and system of pedestrian detection model and computer readable medium

Publications (2)

Publication Number Publication Date
CN110490058A true CN110490058A (en) 2019-11-22
CN110490058B CN110490058B (en) 2022-07-26

Family

ID=68546867

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910615436.4A Active CN110490058B (en) 2019-07-09 2019-07-09 Training method, device and system of pedestrian detection model and computer readable medium

Country Status (1)

Country Link
CN (1) CN110490058B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111091091A (en) * 2019-12-16 2020-05-01 北京迈格威科技有限公司 Method, device and equipment for extracting target object re-identification features and storage medium
CN112863187A (en) * 2021-01-18 2021-05-28 北京百度网讯科技有限公司 Detection method of perception model, electronic equipment, road side equipment and cloud control platform
WO2022043741A1 (en) * 2020-08-25 2022-03-03 商汤国际私人有限公司 Network training method and apparatus, person re-identification method and apparatus, storage medium, and computer program

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106845430A (en) * 2017-02-06 2017-06-13 东华大学 Pedestrian detection and tracking based on acceleration region convolutional neural networks
CN107909027A (en) * 2017-11-14 2018-04-13 电子科技大学 It is a kind of that there is the quick human body target detection method for blocking processing
CN107944369A (en) * 2017-11-17 2018-04-20 大连大学 A kind of pedestrian detection method based on tandem zones generation network and enhancing random forest
CN108038409A (en) * 2017-10-27 2018-05-15 江西高创保安服务技术有限公司 A kind of pedestrian detection method
CN108446662A (en) * 2018-04-02 2018-08-24 电子科技大学 A kind of pedestrian detection method based on semantic segmentation information
CN108563977A (en) * 2017-12-18 2018-09-21 华南理工大学 A kind of the pedestrian's method for early warning and system of expressway entrance and exit
CN108875770A (en) * 2018-02-06 2018-11-23 北京迈格威科技有限公司 Mask method, device, system and the storage medium of pedestrian detection wrong report data
CN108875903A (en) * 2018-01-02 2018-11-23 北京迈格威科技有限公司 Method, apparatus, system and the computer storage medium of image detection
WO2019071739A1 (en) * 2017-10-13 2019-04-18 平安科技(深圳)有限公司 Face living body detection method and apparatus, readable storage medium and terminal device

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106845430A (en) * 2017-02-06 2017-06-13 东华大学 Pedestrian detection and tracking based on acceleration region convolutional neural networks
WO2019071739A1 (en) * 2017-10-13 2019-04-18 平安科技(深圳)有限公司 Face living body detection method and apparatus, readable storage medium and terminal device
CN108038409A (en) * 2017-10-27 2018-05-15 江西高创保安服务技术有限公司 A kind of pedestrian detection method
CN107909027A (en) * 2017-11-14 2018-04-13 电子科技大学 It is a kind of that there is the quick human body target detection method for blocking processing
CN107944369A (en) * 2017-11-17 2018-04-20 大连大学 A kind of pedestrian detection method based on tandem zones generation network and enhancing random forest
CN108563977A (en) * 2017-12-18 2018-09-21 华南理工大学 A kind of the pedestrian's method for early warning and system of expressway entrance and exit
CN108875903A (en) * 2018-01-02 2018-11-23 北京迈格威科技有限公司 Method, apparatus, system and the computer storage medium of image detection
CN108875770A (en) * 2018-02-06 2018-11-23 北京迈格威科技有限公司 Mask method, device, system and the storage medium of pedestrian detection wrong report data
CN108446662A (en) * 2018-04-02 2018-08-24 电子科技大学 A kind of pedestrian detection method based on semantic segmentation information

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
QIWEI PENG, ETC.: "Pedestrian Detection for Transformer Substation Based on Gaussian Mixture Model and YOLO", 《2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS》 *
刘建国等: "基于YOLOv2的行人检测方法研究", 《数字制造科学》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111091091A (en) * 2019-12-16 2020-05-01 北京迈格威科技有限公司 Method, device and equipment for extracting target object re-identification features and storage medium
WO2022043741A1 (en) * 2020-08-25 2022-03-03 商汤国际私人有限公司 Network training method and apparatus, person re-identification method and apparatus, storage medium, and computer program
CN112863187A (en) * 2021-01-18 2021-05-28 北京百度网讯科技有限公司 Detection method of perception model, electronic equipment, road side equipment and cloud control platform
CN112863187B (en) * 2021-01-18 2022-04-15 阿波罗智联(北京)科技有限公司 Detection method of perception model, electronic equipment, road side equipment and cloud control platform

Also Published As

Publication number Publication date
CN110490058B (en) 2022-07-26

Similar Documents

Publication Publication Date Title
CN108197532B (en) The method, apparatus and computer installation of recognition of face
CN108875932A (en) Image-recognizing method, device and system and storage medium
CN108875522A (en) Face cluster methods, devices and systems and storage medium
CN109146830A (en) For generating the method, apparatus, system and storage medium of training data
CN109670452A (en) Method for detecting human face, device, electronic equipment and Face datection model
CN108875766A (en) Method, apparatus, system and the computer storage medium of image procossing
CN109919211A (en) Commodity recognition method, device, system and computer-readable medium
CN110309706A (en) Face critical point detection method, apparatus, computer equipment and storage medium
CN108921159A (en) Method and apparatus for detecting the wear condition of safety cap
CN105938552A (en) Face recognition method capable of realizing base image automatic update and face recognition device
CN108229341A (en) Sorting technique and device, electronic equipment, computer storage media, program
CN108875546A (en) Face auth method, system and storage medium
CN106650662A (en) Target object occlusion detection method and target object occlusion detection device
CN110490058A (en) Training method, device, system and the computer-readable medium of pedestrian detection model
CN107918767B (en) Object detection method, device, electronic equipment and computer-readable medium
CN111522987A (en) Image auditing method and device and computer readable storage medium
CN110009614A (en) Method and apparatus for output information
CN108875731A (en) Target identification method, device, system and storage medium
CN108875767A (en) Method, apparatus, system and the computer storage medium of image recognition
CN106131405A (en) Image processing method and device and for its image capturing system
CN108875509A (en) Biopsy method, device and system and storage medium
CN108876764A (en) Render image acquiring method, device, system and storage medium
CN108875517A (en) Method for processing video frequency, device and system and storage medium
CN108229418A (en) Human body critical point detection method and apparatus, electronic equipment, storage medium and program
CN108875483A (en) Image detecting method, device, system and computer-readable medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant