CN109325547A - Non-motor vehicle image multi-tag classification method, system, equipment and storage medium - Google Patents
Non-motor vehicle image multi-tag classification method, system, equipment and storage medium Download PDFInfo
- Publication number
- CN109325547A CN109325547A CN201811240000.3A CN201811240000A CN109325547A CN 109325547 A CN109325547 A CN 109325547A CN 201811240000 A CN201811240000 A CN 201811240000A CN 109325547 A CN109325547 A CN 109325547A
- Authority
- CN
- China
- Prior art keywords
- motor vehicle
- network model
- sorter network
- vehicle image
- training
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Probability & Statistics with Applications (AREA)
- Image Analysis (AREA)
Abstract
The present invention provides a kind of non-motor vehicle image multi-tag classification method, system, equipment and storage mediums, the label of the non-motor vehicle image includes the classification results of multiple attributes, described method includes following steps: the non-motor vehicle image of test is inputted in trained sorter network model, the sorter network model include feature extraction layer and with the attribute multiple taxons correspondingly;The feature extraction layer of the sorter network model extracts the feature in test image;Multiple taxons of the sorter network model are respectively according to the classification results of each attribute of the feature calculation of extraction;The classification results of each attribute are merged, the label of the non-motor vehicle image as test.Present invention employs a sorter network models, and non-motor vehicle multiple attributive classification can be thus achieved, and training is convenient, and nicety of grading is high.
Description
Technical field
The present invention relates to technical field of image processing more particularly to a kind of non-motor vehicle image multi-tag classification method, it is
System, equipment and storage medium.
Background technique
Representative method of the convolutional neural networks as deep learning, can learn image characteristics extraction, in major part automatically
Computer Vision Task in have good effect.But in the application in certain fields, such as non-motor vehicle image domains, often
Need to obtain the output of class label under multiple attributes of target image.Multiple polytypic networks can be used in Normal practice
Complete the image recognition of multiple class labels, but this method will lead to algorithm time-consuming with the scale of categorical attribute in application and
Linearly increasing, efficiency is very low.
Summary of the invention
For the problems of the prior art, the purpose of the present invention is to provide a kind of non-motor vehicle image multi-tag classification sides
Non-motor vehicle multiple attributive classification can be thus achieved using a sorter network model in method, system, equipment and storage medium, training
Convenient, nicety of grading is high.
The embodiment of the present invention provides a kind of non-motor vehicle image multi-tag classification method, the label of the non-motor vehicle image
Classification results including multiple attributes, described method includes following steps:
The non-motor vehicle image of test is inputted in trained sorter network model, the sorter network model includes spy
Levy extract layer and with the attribute multiple taxons correspondingly;
The feature extraction layer of the sorter network model extracts the feature in the non-motor vehicle image of test;
Multiple taxons of the sorter network model are respectively according to the classification knot of each attribute of the feature calculation of extraction
Fruit;
The classification results of each attribute are merged, the label of the non-motor vehicle image as test.
Optionally, multiple taxons of the sorter network model are respectively according to each attribute of the feature calculation of extraction
Classification results include the following steps:
The non-motor vehicle image that multiple taxons of the sorter network model calculate separately the test belong to pair
The probability of each classification in the attribute answered, classification results of the maximum classification of select probability as corresponding attribute.
Optionally, the feature extraction layer includes an at least convolutional layer and at least a pond layer, the taxon are
Softmax layers.
Optionally, it is additionally provided with the first full articulamentum between the feature extraction layer and taxon and multiple branches connect entirely
Layer is connect, the multiple full articulamentum of branch and the taxon correspond, and the output of the feature extraction layer passes through described
Be connected to the full articulamentum of the multiple branch after first full articulamentum, the output of each full articulamentum of branch be connected to pair
The taxon answered.
Optionally, the output of the described first full articulamentum is by a dropout layers of one second full articulamentum of input, and described the
Two full articulamentums are connected to the full articulamentum of the multiple branch by a dropout layers.
Optionally, the sorter network model is trained using following steps:
Building includes the sorter network model of feature extraction layer and multiple taxons, the taxon and non-motor vehicle
The attribute of image corresponds;
Obtain training set, the training set include training image data and with number of tags corresponding to each training image
According to the label data includes the classification of image path and training image in each attribute;
The training set is inputted into the sorter network model and is iterated training, the penalty values of each taxon are added
Loss of the power summation as sorter network, repetitive exercise to model are restrained;
Save the sorter network model that training is completed.
Optionally, after the building includes feature extraction layer and the sorter network model of multiple taxons, further include
Following steps:
Trained weight file on ImageNet public data collection is obtained to propose the feature of the sorter network model of building
Layer and the first full articulamentum is taken to be initialized;
The full articulamentum of multiple branches and multiple taxons to the sorter network model of building carry out random initializtion.
Optionally, described that the training set input sorter network model is iterated training, include the following steps:
Batch size, initial learning rate and the maximum number of iterations of the setting training sorter network model;
Using sorter network model described in the training set repetitive exercise, after every repetitive exercise i times learning rate multiplied by k value,
Learning rate as successive iterations training, wherein i is the cycle times of default regularized learning algorithm rate, and k is the adjustment of preset learning rate
Coefficient, and k < 1;
After training reaches maximum number of iterations, judge whether the penalty values of the sorter network model are less than preset threshold;
If it is, repetitive exercise is completed;
Otherwise, continue using sorter network model described in the training set repetitive exercise, until the sorter network model
Penalty values are less than preset threshold.
The embodiment of the present invention also provides a kind of non-motor vehicle image multi-tag categorizing system, applied to the non-motor vehicle
Image multi-tag classification method, the system comprises:
Image input module, the non-motor vehicle image for that will test inputs in trained sorter network model, described
Sorter network model include feature extraction layer and with the attribute multiple taxons correspondingly;
Characteristic extracting module extracts the non-motor vehicle figure of test for the feature extraction layer using the sorter network model
Feature as in;
Image classification module, for multiple taxons using the sorter network model respectively according to the feature of extraction
Calculate the classification results of each attribute;
As a result output module, for the classification results of each attribute to be merged, the non-motor vehicle image as the test
Label.
The embodiment of the present invention also provides a kind of non-motor vehicle image multi-tag sorting device, comprising:
Processor;
Memory, wherein being stored with the executable instruction of the processor;
Wherein, the processor is configured to more to execute the non-motor vehicle image via the executable instruction is executed
The step of labeling method.
The embodiment of the present invention also provides a kind of computer readable storage medium, and for storing program, described program is performed
Described in Shi Shixian the step of non-motor vehicle image multi-tag classification method.
Non-motor vehicle image multi-tag classification method, system, equipment and storage medium provided by the present invention have following
Advantage:
The present invention extracts feature by deep learning and multiple taxons combine, using a sorter network model
To realize non-motor vehicle multiple attributive classification, training is convenient, and nicety of grading is high, to solve multiple using multiple in the prior art
The problem of the step of image classification model is cumbersome, inefficiency.
Detailed description of the invention
Upon reading the detailed description of non-limiting embodiments with reference to the following drawings, other feature of the invention,
Objects and advantages will become more apparent upon.
Fig. 1 is the flow chart of the non-motor vehicle image multi-tag classification method of one embodiment of the invention;
Fig. 2 is the dimension style exemplary diagram of the non-motor vehicle image of the training set of one embodiment of the invention;
Fig. 3 is the structural schematic diagram of the sorter network model of one embodiment of the invention;
Fig. 4 is the structural schematic diagram of the non-motor vehicle image multi-tag categorizing system of one embodiment of the invention;
Fig. 5 is the structural schematic diagram of the non-motor vehicle image multi-tag sorting device of one embodiment of the invention;
Fig. 6 is the structural schematic diagram of the computer storage medium of one embodiment of the invention.
Specific embodiment
Example embodiment is described more fully with reference to the drawings.However, example embodiment can be with a variety of shapes
Formula is implemented, and is not understood as limited to embodiment set forth herein;On the contrary, thesing embodiments are provided so that the present invention will
Fully and completely, and by the design of example embodiment comprehensively it is communicated to those skilled in the art.It is identical attached in figure
Icon note indicates same or similar structure, thus will omit repetition thereof.
As shown in Figure 1, the embodiment of the present invention provides a kind of non-motor vehicle image multi-tag classification method, the non-motor vehicle
The label of image includes the classification results of multiple attributes, and described method includes following steps:
S100: the non-motor vehicle image of test is inputted in trained sorter network model, the sorter network model
Including feature extraction layer and with the attribute multiple taxons correspondingly;
S200: the feature extraction layer of the sorter network model extracts the feature in the non-motor vehicle image of test;
S300: multiple taxons of the sorter network model are respectively according to point of each attribute of the feature calculation of extraction
Class result;
S400: the classification results of each attribute are merged, the label of the non-motor vehicle image as test.
Each attribute can be the preset attribute that mutex relation is not present to each other, such as multiple attributes may include moving
Power type is applicable in gender, vehicular applications, whether has parasols etc., accordingly, in power type attribute in the following, non-motor vehicle
Electric vehicle, motorcycle, bicycle etc. can be divided into, applicable gender can be divided into male's vehicle, Nv Xingche, and vehicular applications can be divided into
Take out vehicle, express delivery vehicle, inoperative vehicle etc., if having parasols that can be divided into has parasols and without parasols, one non-maneuver
The label of vehicle can be the combination of the classification results of multiple attributes, such as the label of a non-motor vehicle can be electric vehicle, female
Property vehicle, has parasols at inoperative vehicle.
Therefore, the present invention extracts feature by deep learning and multiple taxons combine, using a sorter network mould
Non-motor vehicle multiple attributive classification can be thus achieved in type, and training is convenient, and nicety of grading is high.
In this embodiment, multiple taxons of the sorter network model are each according to the feature calculation of extraction respectively
The classification results of attribute, include the following steps:
The non-motor vehicle image that multiple taxons of the sorter network model calculate separately the test belong to pair
The probability of each classification in the attribute answered, classification results of the maximum classification of select probability as corresponding attribute.Wherein, divide
Class unit can use softmax layer, it is softmax layers trained in the feature vector extracted by feature extraction layer of input, it is defeated
Result out is the vector of a T*1, and the value of T corresponds to the other number of attribute lower class, and each numerical tabular in the vector of T*1
What is shown is the probability that feature belongs to the next classification of the attribute, classification knot of the maximum classification of select probability as the taxon
Fruit, the classification of the selection is the classification that the non-motor vehicle image tested most likely belongs under the attribute, so as to accurate
The highest classification results of non-motor vehicle image accuracy tested.
In this embodiment, the feature extraction layer includes an at least convolutional layer and an at least pond layer, the grouping sheet
Member is softmax layers.It is additionally provided with the first full articulamentum between the feature extraction layer and taxon and multiple branches connect entirely
Layer is connect, the multiple full articulamentum of branch and the taxon correspond, and the output of the feature extraction layer passes through described
Be connected to the full articulamentum of the multiple branch after first full articulamentum, the output of each full articulamentum of branch be connected to pair
The taxon answered.Further, the output of the described first full articulamentum can be connected entirely by a dropout layers of input one second
Layer is connect, the second full articulamentum is connected to the full articulamentum of the multiple branch by a dropout layers.In convolutional neural networks
Every layer of convolutional layer is made of several convolution units, and the parameter of each convolution unit is to optimize to obtain by back-propagation algorithm
's.The purpose of convolution algorithm is to extract the different characteristic of input, and first layer convolutional layer may can only extract some rudimentary features
Such as edge, lines and angle level, the network of more layers can from low-level features the more complicated feature of iterative extraction.Pond layer
Sampling layer is made equally to be made of multiple characteristic faces after convolutional layer, corresponding one layer thereon of its each characteristic face
A characteristic face, the number of characteristic face will not be changed.Pond layer is intended to the resolution ratio by reducing characteristic face to be had
The feature of space-invariance.Pond layer plays the role of second extraction feature, its each neuron carries out local acceptance region
Pondization operation.Common pond method has maximum pondization to take the maximum point of local acceptance region intermediate value, mean value pondization i.e. to part
All values in acceptance region average, random pool etc., this example is mainly using maximum pond method.
Therefore, the reasonable cooperation of convolutional layer and pond layer can preferably extract the feature of the non-motor vehicle image of test,
To further increase the accuracy that taxon is classified.It will be further in the specific example shown in figure 2 and figure 3 below
Introduce each layer of concrete application.
In this embodiment, the sorter network model is trained using following steps:
Building includes the sorter network model of feature extraction layer and multiple taxons, the taxon and non-motor vehicle
The attribute of image corresponds;
Obtain training set, the training set include training image data and with number of tags corresponding to each training image
According to the label data includes the classification of image path and training image in each attribute;Training image data can be reality
What is be now collected has been carried out the non-motor vehicle image for classifying and knowing classification results, is each according to known classification results
The non-motor vehicle image of a training increases label, and trained non-motor vehicle image and label are added in training set together, with
Sorter network model is trained;
The training set is inputted into the sorter network model and is iterated training, the penalty values of each taxon are added
Loss of the power summation as sorter network, repetitive exercise to model are restrained;The damage of each taxon is allowed for due to losing
Mistake value, in an iterative process, the continuous reduction of total losses value, also just optimization improves the corresponding grouping sheet of each attribute simultaneously
The recognition accuracy of member;
The sorter network model that training is completed is saved, the sorter network model after convergence can be used for test of the invention
Non-motor vehicle image Classification and Identification, and can choose the accurate non-motor vehicle image of identification in use and be newly added to instruction
Practice and concentrates, training set of enriching constantly, and periodically re -training sorter network model, continue to optimize the identification of sorter network model
Effect, the accuracy rate of model identification to be continuously improved in use.
In this embodiment, after the sorter network model of the building including feature extraction layer and multiple taxons,
Further include following steps:
Trained weight file on ImageNet public data collection is obtained to propose the feature of the sorter network model of building
Layer and the first full articulamentum is taken to be initialized;Since ImageNet has had the convolutional layer of comparative maturity, pond layer and complete
It directly can be used to initialize sorter network model of the invention by the weight file of articulamentum, the embodiment, can
To greatly save the time of sorter network model training;
The full articulamentum of multiple branches and multiple taxons to the sorter network model of building carry out random initializtion, with
Machine initialization can initialize weight using normal distribution, however, the present invention is not limited thereto.
In this embodiment, described that the training set input sorter network model is iterated training, including such as
Lower step:
Batch size, initial learning rate and the maximum number of iterations of the setting training sorter network model;
Using sorter network model described in the training set repetitive exercise, after every repetitive exercise i times learning rate multiplied by k value,
Learning rate as successive iterations training, wherein i is the cycle times of default regularized learning algorithm rate, and k is the adjustment of preset learning rate
Coefficient, and k < 1;
During training pattern, in order to which the training speed of balance model has selected relatively suitable study with loss
Rate, but the loss of training set may decline and just no longer decline to a certain extent, at this time can be by suitably reducing learning rate
One step reduces loss, but the decline of learning rate can extend the training required time.Therefore, which uses learning rate gradually
The method of decaying finds new balance between training time and reduction loss, and appropriate by setting maximum number of iterations
The controlled training time;
After training reaches maximum number of iterations, judge whether the penalty values of the sorter network model are less than preset threshold;
If it is, illustrating that sorter network model has reached convergence, repetitive exercise is completed;
Otherwise, continue using sorter network model described in the training set repetitive exercise, until the sorter network model
Penalty values are less than preset threshold, that is, train until the convergence of sorter network model.
Below with reference to Fig. 2 and Fig. 3, non-motor vehicle image multi-tag of the invention is further described with a specific example
Classification method.In the specific example, the non-motor vehicle image multi-tag classification includes the following steps:
Step 1: non-motor vehicle multi-tag data set arranges, and step 1 is to correspond to the instruction of above-mentioned sorter network model
The step of training set is obtained during practicing;The process of specific steps one is as follows:
Step 1.1: obtaining largely trained non-motor vehicle image from reality scene, be picture number;
Step 1.2: the markup information of image being designed, is divided by multiple attributes, respectively with one-bit digital
Position is marked to indicate, every group of attribute includes a variety of attributes, is indicated respectively with the number of 1~N.Annotation formatting are as follows: [image path
Diameter] [the attribute class code of feature 1] [the attribute class code of feature 2] ..., such as include non-motor vehicle image shown in Fig. 2
Five attribute markup informations mask method;
Step 1.3: all non-motor vehicle images being labeled with the above mask method, to label information and image
Information is organized into corresponding data set respectively.Data set is LMDB format.
Step 2: non-motor vehicle multi-tag sorter network model construction, the sorter network model of building is as shown in figure 3, step
Had in rapid two for the building mode of five convolutional layers, three pond layers and the full articulamentum that are used in above-described embodiment
The building process of body introduction, concrete model is as follows:
Step 2.1: network is by input layer, five convolutional layers, three pond layers, 2+N articulamentums, N number of full when training
Softmax layers of composition, wherein the value of N is identified non-motor vehicle attribute number.Step 2.2:data input layer is followed by the first volume
Lamination and Relu activation primitive, and it is followed by first standardization processing layer in Relu activation primitive, connect first pond below herein
Change layer maximum value pond;
Step 2.3: pond is followed by the second convolutional layer, adds Relu activation primitive after the second convolution, and be followed by activation primitive
Second batch standardization processing layer, standardization processing are followed by the second pond layer, and the second pond layer uses maximum value pond;
In deep neural network, usually using one kind cry correct linear unit (Rectified linear unit,
Relu) as the activation primitive of neuron.By ReLU realize it is sparse after model can preferably excavate correlated characteristic, be fitted
Training data.
Step 2.4: continuously connecing three convolutional layers after the second pond layer, put in order and activate letter for third convolutional layer, Relu
Number, Volume Four lamination, Relu activation primitive, the 5th convolutional layer, Relu activation primitive;
Step 2.5: being followed by third pond layer in previous step, pond is followed by the first full articulamentum, and in the first full connection
Layer is followed by Relu activation primitive and dropout layers, then connects the second full articulamentum, the first full articulamentum and the second full articulamentum are set
It sets identical;Each node of full articulamentum is connected with upper one layer of all nodes, comprehensive for feature that front is extracted
Altogether.
Step 2.6: being followed by the full articulamentum of N number of branch in the second full articulamentum, the respectively full articulamentum 1 of branch, branch is complete
The full articulamentum n of articulamentum 2 ... branch, connect after each full articulamentum of branch again one it is softmax layers corresponding, as corresponding
Taxon, attribute classification output of the output of taxon as final every class characteristics of image, respectively taxon 1,
Taxon 2 ... taxon n.
In the present invention, after multiple convolutional layers and pond layer, 2 full articulamentums is connected to and N number of branch connects entirely
Layer.Each neuron in full articulamentum is connect entirely with all neurons of its preceding layer.Full articulamentum can integrate volume
With the local message of class discrimination in lamination or pond layer.In order to promote the network performance of convolutional neural networks, Quan Lian
The excitation function of each neuron of layer is connect using ReLU function.The output valve of each full articulamentum of branch is delivered to one
Softmax layers are classified.Softmax layers of algorithm can be understood as normalizing, and such as current picture classification has x kind, that process
Softmax layers of output is exactly the vector of x dimension.First value in vector is exactly the probability that current image belongs to the first kind
It is worth, second value in vector is exactly that the sum of vector that current image belongs to that this x of the probability value ... of the second class is tieed up is 1.
Step 3: the training of non-motor vehicle multi-tag sorter network, step 3 correspond to the instruction of above-mentioned sorter network model
The training set is inputted into the sorter network model in white silk and is iterated trained step, by each grouping sheet in training process
Loss of the penalty values weighted sum of member as sorter network, repetitive exercise to model are restrained, and the process of specific steps three is as follows:
Step 3.1: according to the LMDB data set arranged in step 1, first calculating the mean value file of training dataset, save
For the format of .binaryproto file, and refer to the position for determining binary system mean value file in training network;
Step 3.2: using the training method of finetune, using the trained weight on ImageNet public data collection
File initializes current network carry out portion hierarchical weight, and other layers are carried out with the mode of random initializtion, institute as above
It states, in this example, using weight file trained on ImageNet public data collection to convolutional layer, pond layer, first
Full articulamentum and the second full articulamentum initialization, and random initializtion is carried out to the full articulamentum of branch and taxon, it is random first
Beginningization can initialize weight using normal distribution, however, the present invention is not limited thereto;
Step 3.3: in batch size, initial learning rate and the maximum number of iterations of the setting training sorter network model
When, setting crowd size batch_size is 32, and initial learning rate is 0.001, and maximum number of iterations is 200000 times, using step
Mode carry out learning rate modification in the training process, learning rate is multiplied by 0.9 after every iteration 1000 times, using stochastic gradient descent
Algorithm training data sets 10000 preservation primary network models of every iteration;Stochastic gradient descent algorithm can be accelerated to update each
A layer of weight file and biased data, referring in each iterative process at random here, sample will be upset at random, upset
Parameter caused by can effectively reducing between sample updates cancellation problem.In most basic stochastic gradient descent algorithm, parameter
Each step is updated by subtracting its gradient, and large-scale machine learning task, stochastic gradient descent algorithm are showed
Performance is very considerable.
When training, training sample and label are inputted into the network of initialization, calculate the penalty values of each softmax layers of input
Weighted sum is as final loss, and each softmax layers of weight is 1/N, but invention is not limited thereto, specific weight value
Distribution, which can according to need, to be configured.By two steps of continuous propagated forward and backpropagation, repetition training makes
Losing in training process constantly reduces, until reaching maximum the number of iterations;
After training is completed every time, using the network model of preservation as the pre-training model of next iteration training, avoid
Occur the loss of model data in the training process, continue to train, after training reaches maximum the number of iterations, judges that loss is
It is no to reach preset threshold hereinafter, if it is, end training, otherwise continues training until loss reaches preset threshold hereinafter, instruction
Practicing convergence then terminates to train.
Step 4: non-motor vehicle multi-tag image classification, step 1 and step 3 are all non-motor vehicle multi-tag images
The preparation process of sorter network model, and step 4 corresponds to above-mentioned steps S100~step S400, i.e., using trained
The step of sorter network model classifies to image, specifically, the process of step 4 are as follows:
Step 4.1: the test data pre-processed being sent into trained sorter network model, extracts non-motor vehicle image
Feature corresponds to step S100 and S200;
Step 4.2: the non-motor vehicle characteristics of image extracted being sent into softmax layers, N number of feature is exported and belongs to specified genus
Property in each classification probability, take the classification of maximum probability in each attribute as the classification results of the attribute, by N group identification
Classification results merge into final output list of categories, that is, correspond to step S300 and S400.
Due to the structure for the sorter network model that the present invention uses, non-motor vehicle image can be disposably obtained in multiple categories
Property under classification results, and operated without individually constructing disaggregated model for each attribute, therefore by above-mentioned step, i.e.,
The quick multi-tag classification of non-motor vehicle image can be achieved.
As shown in figure 4, the embodiment of the present invention also provides a kind of non-motor vehicle image multi-tag categorizing system, it is applied to described
Non-motor vehicle image multi-tag classification method, the system comprises:
Image input module M100, the non-motor vehicle image for that will test input in trained sorter network model,
The sorter network model include feature extraction layer and with the attribute multiple taxons correspondingly;
Characteristic extracting module M200, for being extracted in test image using the feature extraction layer of the sorter network model
Feature;
Image classification module M300, for multiple taxons using the sorter network model respectively according to extraction
The classification results of each attribute of feature calculation;
As a result output module M400, for the classification results of each attribute to be merged, the non-motor vehicle as the test
The label of image.
Therefore, feature extraction layer and image classification module that the present invention passes through deep learning in characteristic extracting module M200
Multiple taxons combine in M300, and non-motor vehicle multiple attributive classification can be thus achieved using a sorter network model, training
Convenient, nicety of grading is high.
The embodiment of the present invention also provides a kind of non-motor vehicle image multi-tag sorting device, including processor;Memory,
In be stored with the executable instruction of the processor;Wherein, the processor is configured to next via the executable instruction is executed
The step of executing the non-motor vehicle image multi-tag classification method.
Person of ordinary skill in the field it is understood that various aspects of the invention can be implemented as system, method or
Program product.Therefore, various aspects of the invention can be embodied in the following forms, it may be assumed that complete hardware embodiment, complete
The embodiment combined in terms of full Software Implementation (including firmware, microcode etc.) or hardware and software, can unite here
Referred to as circuit, " module " or " system ".
The electronic equipment 600 of this embodiment according to the present invention is described referring to Fig. 5.The electronics that Fig. 5 is shown
Equipment 600 is only an example, should not function to the embodiment of the present invention and use scope bring any restrictions.
As shown in figure 5, electronic equipment 600 is showed in the form of universal computing device.The component of electronic equipment 600 can wrap
It includes but is not limited to: at least one processing unit 610, at least one storage unit 620, (including the storage of the different system components of connection
Unit 620 and processing unit 610) bus 630, display unit 640 etc..
Wherein, the storage unit is stored with program code, and said program code can be held by the processing unit 610
Row, so that the processing unit 610 executes described in this specification above-mentioned electronic prescription circulation processing method part according to this
The step of inventing various illustrative embodiments.For example, the processing unit 610 can execute step as shown in fig. 1.
The storage unit 620 may include the readable medium of volatile memory cell form, such as random access memory
Unit (RAM) 6201 and/or cache memory unit 6202 can further include read-only memory unit (ROM) 6203.
The storage unit 620 can also include program/practical work with one group of (at least one) program module 6205
Tool 6204, such program module 6205 includes but is not limited to: operating system, one or more application program, other programs
It may include the realization of network environment in module and program data, each of these examples or certain combination.
Bus 630 can be to indicate one of a few class bus structures or a variety of, including storage unit bus or storage
Cell controller, peripheral bus, graphics acceleration port, processing unit use any bus structures in a variety of bus structures
Local bus.
Electronic equipment 600 can also be with one or more external equipments 700 (such as keyboard, sensing equipment, bluetooth equipment
Deng) communication, can also be enabled a user to one or more equipment interact with the electronic equipment 600 communicate, and/or with make
Any equipment (such as the router, modulation /demodulation that the electronic equipment 600 can be communicated with one or more of the other calculating equipment
Device etc.) communication.This communication can be carried out by input/output (I/O) interface 650.Also, electronic equipment 600 can be with
By network adapter 660 and one or more network (such as local area network (LAN), wide area network (WAN) and/or public network,
Such as internet) communication.Network adapter 660 can be communicated by bus 630 with other modules of electronic equipment 600.It should
Understand, although not shown in the drawings, other hardware and/or software module can be used in conjunction with electronic equipment 600, including but unlimited
In: microcode, device driver, redundant processing unit, external disk drive array, RAID system, tape drive and number
According to backup storage system etc..
The embodiment of the present invention also provides a kind of computer readable storage medium, and for storing program, described program is performed
Described in Shi Shixian the step of non-motor vehicle image multi-tag classification method.In some possible embodiments, of the invention
Various aspects are also implemented as a kind of form of program product comprising program code, when described program product is set in terminal
When standby upper operation, said program code is for making the terminal device execute the above-mentioned electronic prescription circulation processing method of this specification
Described in part according to the present invention various illustrative embodiments the step of.
Refering to what is shown in Fig. 6, describing the program product for realizing the above method of embodiment according to the present invention
800, can using portable compact disc read only memory (CD-ROM) and including program code, and can in terminal device,
Such as it is run on PC.However, program product of the invention is without being limited thereto, in this document, readable storage medium storing program for executing can be with
To be any include or the tangible medium of storage program, the program can be commanded execution system, device or device use or
It is in connection.
Described program product can be using any combination of one or more readable mediums.Readable medium can be readable letter
Number medium or readable storage medium storing program for executing.Readable storage medium storing program for executing for example can be but be not limited to electricity, magnetic, optical, electromagnetic, infrared ray or
System, device or the device of semiconductor, or any above combination.The more specific example of readable storage medium storing program for executing is (non exhaustive
List) include: electrical connection with one or more conducting wires, portable disc, hard disk, random access memory (RAM), read-only
Memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disc read only memory
(CD-ROM), light storage device, magnetic memory device or above-mentioned any appropriate combination.
The computer readable storage medium may include in a base band or the data as the propagation of carrier wave a part are believed
Number, wherein carrying readable program code.The data-signal of this propagation can take various forms, including but not limited to electromagnetism
Signal, optical signal or above-mentioned any appropriate combination.Readable storage medium storing program for executing can also be any other than readable storage medium storing program for executing
Readable medium, the readable medium can send, propagate or transmit for by instruction execution system, device or device use or
Person's program in connection.The program code for including on readable storage medium storing program for executing can transmit with any suitable medium, packet
Include but be not limited to wireless, wired, optical cable, RF etc. or above-mentioned any appropriate combination.
The program for executing operation of the present invention can be write with any combination of one or more programming languages
Code, described program design language include object oriented program language-Java, C++ etc., further include conventional
Procedural programming language-such as " C " language or similar programming language.Program code can be fully in user
It calculates and executes in equipment, partly executes on a user device, being executed as an independent software package, partially in user's calculating
Upper side point is executed on a remote computing or is executed in remote computing device or server completely.It is being related to far
Journey calculates in the situation of equipment, and remote computing device can pass through the network of any kind, including local area network (LAN) or wide area network
(WAN), it is connected to user calculating equipment, or, it may be connected to external computing device (such as utilize ISP
To be connected by internet).
In conclusion compared with prior art, non-motor vehicle image multi-tag classification method provided by the present invention is
System, equipment and storage medium have the advantage that
The present invention extracts feature by deep learning and multiple taxons combine, using a sorter network model
To realize non-motor vehicle multiple attributive classification, training is convenient, and nicety of grading is high, to solve multiple using multiple in the prior art
The problem of the step of image classification model is cumbersome, inefficiency.
The above content is a further detailed description of the present invention in conjunction with specific preferred embodiments, and it cannot be said that
Specific implementation of the invention is only limited to these instructions.For those of ordinary skill in the art to which the present invention belongs, exist
Under the premise of not departing from present inventive concept, a number of simple deductions or replacements can also be made, all shall be regarded as belonging to of the invention
Protection scope.
Claims (11)
1. a kind of non-motor vehicle image multi-tag classification method, which is characterized in that the label of the non-motor vehicle image includes more
The classification results of a attribute, described method includes following steps:
The non-motor vehicle image of test is inputted in trained sorter network model, the sorter network model includes that feature mentions
Take layer and with the attribute multiple taxons correspondingly;
The feature extraction layer of the sorter network model extracts the feature in the non-motor vehicle image of test;
Multiple taxons of the sorter network model are respectively according to the classification results of each attribute of the feature calculation of extraction;
The classification results of each attribute are merged, the label of the non-motor vehicle image as test.
2. non-motor vehicle image multi-tag classification method according to claim 1, which is characterized in that the sorter network mould
Multiple taxons of type include the following steps: respectively according to the classification results of each attribute of the feature calculation of extraction
The non-motor vehicle image that multiple taxons of the sorter network model calculate separately the test belongs to corresponding
The probability of each classification in attribute, classification results of the maximum classification of select probability as corresponding attribute.
3. non-motor vehicle image multi-tag classification method according to claim 1, which is characterized in that the feature extraction layer
Including an at least convolutional layer and an at least pond layer, the taxon is softmax layers.
4. non-motor vehicle image multi-tag classification method according to claim 1, which is characterized in that the feature extraction layer
It is additionally provided with the full articulamentum of the first full articulamentum and multiple branches between taxon, the full articulamentum of the multiple branch and institute
Taxon one-to-one correspondence is stated, the output of the feature extraction layer is by being connected to the multiple point after the described first full articulamentum
The full articulamentum of branch, the output of each full articulamentum of branch are connected to corresponding taxon.
5. non-motor vehicle image multi-tag classification method according to claim 4, which is characterized in that the described first full connection
The output of layer passes through a dropout layers of connection by a dropout layers of one second full articulamentum of input, the second full articulamentum
To the full articulamentum of the multiple branch.
6. non-motor vehicle image multi-tag classification method according to claim 4, which is characterized in that the sorter network mould
Type is trained using following steps:
Building includes the sorter network model of feature extraction layer and multiple taxons, the taxon and non-motor vehicle image
Attribute correspond;
Obtain training set, the training set include training image data and with label data corresponding to each training image, institute
Stating label data includes the classification of image path and training image in each attribute;
The training set is inputted into the sorter network model and is iterated training, the penalty values weighting of each taxon is asked
With the loss as sorter network, repetitive exercise to model is restrained;
Save the sorter network model that training is completed.
7. non-motor vehicle image multi-tag classification method according to claim 6, which is characterized in that the building includes spy
Further include following steps after levying extract layer and the sorter network model of multiple taxons:
Feature extraction layer of the trained weight file to the sorter network model of building on acquisition ImageNet public data collection
It is initialized with the first full articulamentum;
The full articulamentum of multiple branches and multiple taxons to the sorter network model of building carry out random initializtion.
8. non-motor vehicle image multi-tag classification method according to claim 7, which is characterized in that described by the training
Collection inputs the sorter network model and is iterated training, includes the following steps:
Batch size, initial learning rate and the maximum number of iterations of the setting training sorter network model;
Using sorter network model described in the training set repetitive exercise, after every repetitive exercise i times learning rate multiplied by k value, as
The learning rate of successive iterations training, wherein i is the cycle times of default regularized learning algorithm rate, and k is preset learning rate adjustment system
Number, and k < 1;
After training reaches maximum number of iterations, judge whether the penalty values of the sorter network model are less than preset threshold;
If it is, repetitive exercise is completed;
Otherwise, continue using sorter network model described in the training set repetitive exercise, until the loss of the sorter network model
Value is less than preset threshold.
9. a kind of non-motor vehicle image multi-tag categorizing system, which is characterized in that be applied to any one of claims 1 to 8 institute
The non-motor vehicle image multi-tag classification method stated, the system comprises:
Image input module, the non-motor vehicle image for that will test input in trained sorter network model, the classification
Network model include feature extraction layer and with the attribute multiple taxons correspondingly;
Characteristic extracting module, for being extracted in the non-motor vehicle image of test using the feature extraction layer of the sorter network model
Feature;
Image classification module, for multiple taxons using the sorter network model respectively according to the feature calculation of extraction
The classification results of each attribute;
As a result output module, for the classification results of each attribute to be merged, the mark of the non-motor vehicle image as the test
Label.
10. a kind of non-motor vehicle image multi-tag sorting device characterized by comprising
Processor;
Memory, wherein being stored with the executable instruction of the processor;
Wherein, the processor is configured to come described in any one of perform claim requirement 1 to 8 via the execution executable instruction
Non-motor vehicle image multi-tag classification method the step of.
11. a kind of computer readable storage medium, for storing program, which is characterized in that described program is performed realization power
Benefit require any one of 1 to 8 described in non-motor vehicle image multi-tag classification method the step of.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811240000.3A CN109325547A (en) | 2018-10-23 | 2018-10-23 | Non-motor vehicle image multi-tag classification method, system, equipment and storage medium |
PCT/CN2019/111320 WO2020083073A1 (en) | 2018-10-23 | 2019-10-15 | Non-motorized vehicle image multi-label classification method, system, device and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811240000.3A CN109325547A (en) | 2018-10-23 | 2018-10-23 | Non-motor vehicle image multi-tag classification method, system, equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109325547A true CN109325547A (en) | 2019-02-12 |
Family
ID=65262678
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811240000.3A Pending CN109325547A (en) | 2018-10-23 | 2018-10-23 | Non-motor vehicle image multi-tag classification method, system, equipment and storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN109325547A (en) |
WO (1) | WO2020083073A1 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109947940A (en) * | 2019-02-15 | 2019-06-28 | 平安科技(深圳)有限公司 | File classification method, device, terminal and storage medium |
WO2020083073A1 (en) * | 2018-10-23 | 2020-04-30 | 苏州科达科技股份有限公司 | Non-motorized vehicle image multi-label classification method, system, device and storage medium |
CN111368931A (en) * | 2020-03-09 | 2020-07-03 | 第四范式(北京)技术有限公司 | Method and device for training image classification model, computer device and storage medium |
CN111737521A (en) * | 2020-08-04 | 2020-10-02 | 北京微播易科技股份有限公司 | Video classification method and device |
CN111783574A (en) * | 2020-06-17 | 2020-10-16 | 李利明 | Meal image recognition method and device and storage medium |
CN111898475A (en) * | 2020-07-10 | 2020-11-06 | 浙江大华技术股份有限公司 | Method and device for estimating state of non-motor vehicle, storage medium, and electronic device |
CN112115880A (en) * | 2020-09-21 | 2020-12-22 | 成都数之联科技有限公司 | Ship pollution monitoring method, system, device and medium based on multi-label learning |
CN112446439A (en) * | 2021-01-29 | 2021-03-05 | 魔视智能科技(上海)有限公司 | Inference method and system for deep learning model dynamic branch selection |
CN112598076A (en) * | 2020-12-29 | 2021-04-02 | 北京易华录信息技术股份有限公司 | Motor vehicle attribute identification method and system |
CN113313079A (en) * | 2021-07-16 | 2021-08-27 | 深圳市安软科技股份有限公司 | Training method and system of vehicle attribute recognition model and related equipment |
CN114429638A (en) * | 2022-04-06 | 2022-05-03 | 四川省大数据中心 | Construction drawing examination management system |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111651668B (en) * | 2020-05-06 | 2023-06-09 | 上海晶赞融宣科技有限公司 | User portrait label generation method and device, storage medium and terminal |
CN113688840B (en) * | 2020-05-19 | 2024-08-02 | 武汉Tcl集团工业研究院有限公司 | Image processing model generation method, processing method, storage medium and terminal |
CN111582409B (en) * | 2020-06-29 | 2023-12-26 | 腾讯科技(深圳)有限公司 | Training method of image tag classification network, image tag classification method and device |
CN111832580B (en) * | 2020-07-22 | 2023-07-28 | 西安电子科技大学 | SAR target recognition method combining less sample learning and target attribute characteristics |
CN112001258B (en) * | 2020-07-27 | 2023-07-11 | 上海东普信息科技有限公司 | Method, device, equipment and storage medium for identifying on-time arrival of logistics truck |
CN112287751B (en) * | 2020-09-21 | 2024-05-07 | 深圳供电局有限公司 | Excitation surge current identification method, device, computer equipment and storage medium |
CN112070093B (en) * | 2020-09-22 | 2024-06-28 | 杭州网易智企科技有限公司 | Method for generating image classification model, image classification method, device and equipment |
CN112508078B (en) * | 2020-12-02 | 2024-06-14 | 携程旅游信息技术(上海)有限公司 | Image multitasking multi-label recognition method, system, equipment and medium |
CN112541542B (en) * | 2020-12-11 | 2023-09-29 | 第四范式(北京)技术有限公司 | Method and device for processing multi-classification sample data and computer readable storage medium |
CN112651438A (en) * | 2020-12-24 | 2021-04-13 | 世纪龙信息网络有限责任公司 | Multi-class image classification method and device, terminal equipment and storage medium |
CN113762321A (en) * | 2021-04-13 | 2021-12-07 | 北京沃东天骏信息技术有限公司 | Multi-modal classification model generation method and device |
CN113610766A (en) * | 2021-07-12 | 2021-11-05 | 北京阅视智能技术有限责任公司 | Microscopic image analysis method, microscopic image analysis device, storage medium and electronic equipment |
CN113408482B (en) * | 2021-07-13 | 2023-10-10 | 杭州联吉技术有限公司 | Training sample generation method and generation device |
CN113673583A (en) * | 2021-07-30 | 2021-11-19 | 浙江大华技术股份有限公司 | Image recognition method, recognition network training method and related device |
CN114445884B (en) * | 2022-01-04 | 2024-04-30 | 深圳数联天下智能科技有限公司 | Method for training multi-target detection model, detection method and related device |
CN114529888B (en) * | 2022-01-12 | 2024-10-15 | 盛视科技股份有限公司 | Non-motor vehicle driving recognition method, device, computer and readable storage medium |
CN114612681A (en) * | 2022-01-30 | 2022-06-10 | 西北大学 | GCN-based multi-label image classification method, model construction method and device |
CN114638787B (en) * | 2022-02-23 | 2024-03-22 | 青岛海信网络科技股份有限公司 | Method for detecting whether non-motor vehicle hangs up or not and electronic equipment |
CN115116028B (en) * | 2022-06-06 | 2024-07-19 | 安徽理工大学 | Unmanned electric locomotive obstacle detection method and electronic equipment based on Tiny-Yolov4 |
CN116091867B (en) * | 2023-01-12 | 2023-09-29 | 北京邮电大学 | Model training and image recognition method, device, equipment and storage medium |
CN117496275B (en) * | 2023-12-29 | 2024-04-02 | 深圳市软盟技术服务有限公司 | Class learning-based depth image classification network training method, electronic equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105654066A (en) * | 2016-02-02 | 2016-06-08 | 北京格灵深瞳信息技术有限公司 | Vehicle identification method and device |
WO2017158058A1 (en) * | 2016-03-15 | 2017-09-21 | Imra Europe Sas | Method for classification of unique/rare cases by reinforcement learning in neural networks |
CN107330396A (en) * | 2017-06-28 | 2017-11-07 | 华中科技大学 | A kind of pedestrian's recognition methods again based on many attributes and many strategy fusion study |
CN107886073A (en) * | 2017-11-10 | 2018-04-06 | 重庆邮电大学 | A kind of more attribute recognition approaches of fine granularity vehicle based on convolutional neural networks |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106203330A (en) * | 2016-07-08 | 2016-12-07 | 西安理工大学 | A kind of vehicle classification method based on convolutional neural networks |
CN108256498A (en) * | 2018-02-01 | 2018-07-06 | 上海海事大学 | A kind of non power driven vehicle object detection method based on EdgeBoxes and FastR-CNN |
CN109325547A (en) * | 2018-10-23 | 2019-02-12 | 苏州科达科技股份有限公司 | Non-motor vehicle image multi-tag classification method, system, equipment and storage medium |
-
2018
- 2018-10-23 CN CN201811240000.3A patent/CN109325547A/en active Pending
-
2019
- 2019-10-15 WO PCT/CN2019/111320 patent/WO2020083073A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105654066A (en) * | 2016-02-02 | 2016-06-08 | 北京格灵深瞳信息技术有限公司 | Vehicle identification method and device |
WO2017158058A1 (en) * | 2016-03-15 | 2017-09-21 | Imra Europe Sas | Method for classification of unique/rare cases by reinforcement learning in neural networks |
CN107330396A (en) * | 2017-06-28 | 2017-11-07 | 华中科技大学 | A kind of pedestrian's recognition methods again based on many attributes and many strategy fusion study |
CN107886073A (en) * | 2017-11-10 | 2018-04-06 | 重庆邮电大学 | A kind of more attribute recognition approaches of fine granularity vehicle based on convolutional neural networks |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020083073A1 (en) * | 2018-10-23 | 2020-04-30 | 苏州科达科技股份有限公司 | Non-motorized vehicle image multi-label classification method, system, device and storage medium |
CN109947940B (en) * | 2019-02-15 | 2023-09-05 | 平安科技(深圳)有限公司 | Text classification method, device, terminal and storage medium |
CN109947940A (en) * | 2019-02-15 | 2019-06-28 | 平安科技(深圳)有限公司 | File classification method, device, terminal and storage medium |
CN111368931A (en) * | 2020-03-09 | 2020-07-03 | 第四范式(北京)技术有限公司 | Method and device for training image classification model, computer device and storage medium |
CN111368931B (en) * | 2020-03-09 | 2023-11-17 | 第四范式(北京)技术有限公司 | Method for determining learning rate of image classification model |
CN111783574A (en) * | 2020-06-17 | 2020-10-16 | 李利明 | Meal image recognition method and device and storage medium |
CN111783574B (en) * | 2020-06-17 | 2024-02-23 | 李利明 | Meal image recognition method, device and storage medium |
CN111898475A (en) * | 2020-07-10 | 2020-11-06 | 浙江大华技术股份有限公司 | Method and device for estimating state of non-motor vehicle, storage medium, and electronic device |
CN111737521A (en) * | 2020-08-04 | 2020-10-02 | 北京微播易科技股份有限公司 | Video classification method and device |
CN112115880A (en) * | 2020-09-21 | 2020-12-22 | 成都数之联科技有限公司 | Ship pollution monitoring method, system, device and medium based on multi-label learning |
CN112598076A (en) * | 2020-12-29 | 2021-04-02 | 北京易华录信息技术股份有限公司 | Motor vehicle attribute identification method and system |
CN112598076B (en) * | 2020-12-29 | 2023-09-19 | 北京易华录信息技术股份有限公司 | Motor vehicle attribute identification method and system |
CN112446439A (en) * | 2021-01-29 | 2021-03-05 | 魔视智能科技(上海)有限公司 | Inference method and system for deep learning model dynamic branch selection |
CN112446439B (en) * | 2021-01-29 | 2021-04-23 | 魔视智能科技(上海)有限公司 | Inference method and system for deep learning model dynamic branch selection |
CN113313079A (en) * | 2021-07-16 | 2021-08-27 | 深圳市安软科技股份有限公司 | Training method and system of vehicle attribute recognition model and related equipment |
CN114429638A (en) * | 2022-04-06 | 2022-05-03 | 四川省大数据中心 | Construction drawing examination management system |
Also Published As
Publication number | Publication date |
---|---|
WO2020083073A1 (en) | 2020-04-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109325547A (en) | Non-motor vehicle image multi-tag classification method, system, equipment and storage medium | |
CN112434721A (en) | Image classification method, system, storage medium and terminal based on small sample learning | |
US20230316699A1 (en) | Image semantic segmentation algorithm and system based on multi-channel deep weighted aggregation | |
CN108648020A (en) | User behavior quantization method, system, equipment and storage medium | |
CN113642431B (en) | Training method and device of target detection model, electronic equipment and storage medium | |
CN110084221A (en) | A kind of serializing face critical point detection method of the tape relay supervision based on deep learning | |
CN111797589B (en) | Text processing network, neural network training method and related equipment | |
CN111553419B (en) | Image identification method, device, equipment and readable storage medium | |
CN112766229B (en) | Human face point cloud image intelligent identification system and method based on attention mechanism | |
CN112732921B (en) | False user comment detection method and system | |
CN112199536A (en) | Cross-modality-based rapid multi-label image classification method and system | |
CN111666919A (en) | Object identification method and device, computer equipment and storage medium | |
CN113032613B (en) | Three-dimensional model retrieval method based on interactive attention convolution neural network | |
CN113034592B (en) | Three-dimensional scene target detection modeling and detection method based on natural language description | |
CN110111365B (en) | Training method and device based on deep learning and target tracking method and device | |
EP4227858A1 (en) | Method for determining neural network structure and apparatus thereof | |
CN113298817A (en) | High-accuracy semantic segmentation method for remote sensing image | |
CN110110724A (en) | The text authentication code recognition methods of function drive capsule neural network is squeezed based on exponential type | |
CN113822264A (en) | Text recognition method and device, computer equipment and storage medium | |
CN116226785A (en) | Target object recognition method, multi-mode recognition model training method and device | |
CN112364974B (en) | YOLOv3 algorithm based on activation function improvement | |
CN115187772A (en) | Training method, device and equipment of target detection network and target detection method, device and equipment | |
CN112560948A (en) | Eye fundus map classification method and imaging method under data deviation | |
CN110728186B (en) | Fire detection method based on multi-network fusion | |
CN116258990A (en) | Cross-modal affinity-based small sample reference video target segmentation method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190212 |