CN111488891A - Image identification processing method, device, equipment and computer readable storage medium - Google Patents

Image identification processing method, device, equipment and computer readable storage medium Download PDF

Info

Publication number
CN111488891A
CN111488891A CN201910073615.XA CN201910073615A CN111488891A CN 111488891 A CN111488891 A CN 111488891A CN 201910073615 A CN201910073615 A CN 201910073615A CN 111488891 A CN111488891 A CN 111488891A
Authority
CN
China
Prior art keywords
image
images
neural network
convolutional neural
identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910073615.XA
Other languages
Chinese (zh)
Other versions
CN111488891B (en
Inventor
邢东佳
张志鹏
寿文卉
许利群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
China Mobile Communications Ltd Research Institute
Original Assignee
China Mobile Communications Group Co Ltd
China Mobile Communications Ltd Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, China Mobile Communications Ltd Research Institute filed Critical China Mobile Communications Group Co Ltd
Priority to CN201910073615.XA priority Critical patent/CN111488891B/en
Publication of CN111488891A publication Critical patent/CN111488891A/en
Application granted granted Critical
Publication of CN111488891B publication Critical patent/CN111488891B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

The invention provides an image identification processing method, an image identification processing device, image identification processing equipment and a computer readable storage medium, wherein the image identification processing method comprises the following steps: acquiring a plurality of similar images of a target image in a preset image set in the preset image set; according to the identification information of the similar images, the identification of the target image is updated; the similar images are images with the matching degree with the target image reaching a preset threshold, and the number of the similar images is at least three. The scheme can realize automatic correction of the image identifier without manually extracting manual features according to the image features, thereby reducing the labor cost and time cost, improving the working efficiency and being suitable for various image data; the problems that image identification processing schemes in the prior art all need manual participation, and are high in cost, low in efficiency and not universal are well solved.

Description

Image identification processing method, device, equipment and computer readable storage medium
Technical Field
The present invention relates to the field of image processing technologies, and in particular, to an image identifier processing method, an image identifier processing apparatus, an image identifier processing device, and a computer-readable storage medium.
Background
In recent years, with the development of internet technology, a deep convolutional neural network has been widely applied to the field of image classification, but in some fields requiring professional knowledge for image labeling, such as the medical field, the problems of unclear image classification boundary, low labeling personnel level and the like often exist, which directly results in low accuracy of image labeling, and thus the accuracy of a deep convolutional neural network model is seriously affected.
At present, although some optimized image classification methods based on the convolutional neural network exist, most of the image classification methods are performed based on network models and parameters, and the problem of inaccurate image labeling is not solved. The existing method for correcting the image data annotation usually needs a plurality of annotating personnel to annotate the same image at the same time, which needs a large amount of labor cost and also reduces the efficiency of data set optimization and model establishment. Meanwhile, the accuracy of the trained classification network model is not high.
In addition, although there is no technical solution for directly correcting image annotation at present, there are relatively close solutions for annotating images, such as: firstly, manually extracting low-level fusion characteristic vectors of an image to be marked, such as scalable color characteristics, homogeneous texture characteristics, edge histogram characteristics, grid color moment characteristics, wavelet moment characteristics and the like; then, self-coding is carried out on the fusion characteristic vectors of all the images in the image library by using a depth automatic coding machine to obtain corresponding self-coding characteristics; and then, the image to be labeled finds N neighbor training images according to self-coding characteristics, and the automatic labeling of the image to be labeled is completed by using the labels of the group of neighbor training images.
However, the existing image annotation correction method or automatic annotation image scheme has the following disadvantages:
the first disadvantage is that:
the method for manually correcting the image annotation by multiple people to label the same batch of images or to find out the depth of label requires a large amount of labor cost and time cost, wastes time and labor, has low working efficiency, and often needs to pay expensive labeling cost for the field requiring professional knowledge to label.
The second disadvantage is that:
at present, the existing automatic image labeling scheme usually adopts manual characteristics to represent an image, if the image is to be completely represented, a plurality of characteristic vectors are required to be adopted and screened, and the vectors can not necessarily well represent the image characteristics; the solution also has a problem of non-universality, i.e. for another batch of image data with different forms, the previously selected manual features may not be applicable, and different manual features need to be selected for different image characteristics.
Therefore, the existing image identification processing schemes all need manual participation, and have the problems of high cost, low efficiency and non-universality.
Disclosure of Invention
The invention aims to provide an image identification processing method, an image identification processing device, image identification processing equipment and a computer readable storage medium, and solves the problems that image identification processing schemes in the prior art all need manual participation, and are high in cost, low in efficiency and not universal.
In order to solve the above technical problem, an embodiment of the present invention provides an image identifier processing method, including:
acquiring a plurality of similar images of a target image in a preset image set in the preset image set;
according to the identification information of the similar images, the identification of the target image is updated;
the similar images are images with the matching degree with the target image reaching a preset threshold, and the number of the similar images is at least three.
Optionally, the obtaining of multiple similar images of the target image in the preset image set includes:
acquiring a feature vector of each image in a preset image set by using a convolutional neural network;
acquiring Euclidean distances between the target image and other images according to the characteristic vectors of the target image in the preset image set and the characteristic vectors of other images except the target image;
and acquiring a plurality of similar images of the target image from other images according to the Euclidean distance.
Optionally, the obtaining a feature vector of each image in the preset image set by using the convolutional neural network includes:
training a convolutional neural network by using all images in a preset image set;
and acquiring a characteristic vector obtained after each image in a preset image set passes through a penultimate full-connected layer of the convolutional neural network in the training process, wherein the characteristic vector is used as a characteristic vector corresponding to each image.
Optionally, after training the convolutional neural network by using all images in the preset image set, the method further includes:
obtaining a loss value of the convolutional neural network trained at this time;
judging whether the loss value meets a stable condition;
and if not, continuing to train the convolutional neural network according to the target image with the updated identifier, and returning to the operation of acquiring a plurality of similar images of the target image in the preset image set.
Optionally, the training the convolutional neural network continuously according to the target image updated by the identifier includes:
judging whether each image in the preset image set is respectively used as a target image to be subjected to identification updating;
if yes, forming an image set with an updated identifier according to the target image with the updated identifier;
and training the convolutional neural network again by using all the images in the image set after the identification is updated.
Optionally, the determining whether the loss value meets a stability condition includes:
and judging whether the loss value meets a stable condition or not according to the change rate of the loss value.
Optionally, the stabilizing conditions are:
Figure BDA0001958049740000031
therein, lossiRepresenting said loss value, losskRepresenting the loss value of the required summation operation,
Figure BDA0001958049740000032
representing the rate of change of said loss value.
Optionally, after determining whether the loss value meets the stability condition, the method further includes:
and if the loss value meets a stable condition, storing the parameter information of the convolutional neural network.
Optionally, after storing the parameter information of the convolutional neural network, the method further includes:
and identifying the image to be identified by utilizing a convolutional neural network according to the parameter information.
Optionally, the performing, according to the identification information of the multiple similar images, identification update on the target image includes:
and according to the identification information of the plurality of similar images, utilizing a voting method to update the identification of the target image.
An embodiment of the present invention further provides an image identifier processing apparatus, including:
the first acquisition module is used for acquiring a plurality of similar images of a target image in a preset image set in the preset image set;
the first processing module is used for carrying out identification updating on the target image according to the identification information of the similar images;
the similar images are images with the matching degree with the target image reaching a preset threshold, and the number of the similar images is at least three.
Optionally, the first obtaining module includes:
the first obtaining submodule is used for obtaining a characteristic vector of each image in a preset image set by using a convolutional neural network;
the second obtaining submodule is used for obtaining Euclidean distances between the target image and other images according to the characteristic vectors of the target image in the preset image set and the characteristic vectors of other images except the target image;
and the third acquisition submodule is used for acquiring a plurality of similar images of the target image from other images according to the Euclidean distance.
Optionally, the first obtaining sub-module includes:
the first processing unit is used for training a convolutional neural network by utilizing all images in a preset image set;
and the second processing unit is used for acquiring a feature vector obtained by each image in the preset image set after passing through a penultimate full-link layer of the convolutional neural network in the training process, and taking the feature vector as a feature vector corresponding to each image.
Optionally, the method further includes:
the second acquisition module is used for acquiring a loss value of the convolutional neural network trained at this time after the convolutional neural network is trained by using all images in a preset image set;
the first judgment module is used for judging whether the loss value meets a stable condition or not;
and if the target images do not meet the preset image set, continuing to train the convolutional neural network according to the target images after the identification update, and returning to the operation of acquiring a plurality of similar images of the target images in the preset image set.
Optionally, the second processing module includes:
the first judgment submodule is used for judging whether each image in the preset image set is respectively used as a target image to be subjected to identification updating;
the first processing submodule is used for forming an image set after the identification is updated according to the target image after the identification is updated if the target image is the target image;
and the first updating submodule is used for retraining the convolutional neural network by utilizing all the images in the image set after the identification is updated.
Optionally, the first determining module includes:
and the second judgment submodule is used for judging whether the loss value meets a stable condition or not according to the change rate of the loss value.
Optionally, the stabilizing conditions are:
Figure BDA0001958049740000051
therein, lossiRepresenting said loss value, losskRepresenting the loss value of the required summation operation,
Figure BDA0001958049740000052
representing the rate of change of said loss value.
Optionally, the method further includes:
and the first storage module is used for storing the parameter information of the convolutional neural network if the loss value meets the stable condition after judging whether the loss value meets the stable condition.
Optionally, the method further includes:
and the first identification module is used for identifying the image to be identified by using the convolutional neural network according to the parameter information after the parameter information of the convolutional neural network is stored.
Optionally, the first processing module includes:
and the second processing submodule is used for carrying out identification updating on the target image by using a voting method according to the identification information of the plurality of similar images.
The embodiment of the invention also provides image identification processing equipment, which comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor; the processor implements the image identification processing method described above when executing the program.
Embodiments of the present invention further provide a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps in the image identification processing method.
The technical scheme of the invention has the following beneficial effects:
in the above scheme, the image identifier processing method obtains a plurality of similar images of a target image in a preset image set in the preset image set; according to the identification information of the similar images, the identification of the target image is updated; the similar images are images with the matching degree with the target image reaching a preset threshold value, and the number of the similar images is at least three; the automatic correction of the image identification can be realized, manual characteristics do not need to be manually extracted according to the image characteristics, so that the labor cost and the time cost are reduced, the working efficiency is improved, and the method is suitable for various image data; the problems that image identification processing schemes in the prior art all need manual participation, and are high in cost, low in efficiency and not universal are well solved.
Drawings
FIG. 1 is a schematic flow chart of an image identifier processing method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a specific application flow of the image identifier processing method according to the embodiment of the present invention;
fig. 3 is a schematic structural diagram of an image identifier processing apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the technical problems, technical solutions and advantages of the present invention more apparent, the following detailed description is given with reference to the accompanying drawings and specific embodiments.
Aiming at the problems of high cost, low efficiency and non-universality of image identification processing schemes in the prior art which need manual participation, the invention provides an image identification processing method, as shown in figure 1, which comprises the following steps:
step 11: acquiring a plurality of similar images of a target image in a preset image set in the preset image set;
step 12: according to the identification information of the similar images, the identification of the target image is updated;
the similar images are images with the matching degree with the target image reaching a preset threshold, and the number of the similar images is at least three.
The preset image set can be selected and set according to the actual requirement, for example, the image set consisting of only images of cats and dogs is selected.
The identification information may be annotation information for a subject in the image, such as a cat, a dog, or the like, or may be other annotation information for the image, such as weather information, scene information, or the like, which is not limited herein.
The image identification processing method provided by the embodiment of the invention obtains a plurality of similar images of a target image in a preset image set in the preset image set; according to the identification information of the similar images, the identification of the target image is updated; the similar images are images with the matching degree with the target image reaching a preset threshold value, and the number of the similar images is at least three; the automatic correction of the image identification can be realized, manual characteristics do not need to be manually extracted according to the image characteristics, so that the labor cost and the time cost are reduced, the working efficiency is improved, and the method is suitable for various image data; the problems that image identification processing schemes in the prior art all need manual participation, and are high in cost, low in efficiency and not universal are well solved.
The acquiring of multiple similar images of a target image in a preset image set in the preset image set includes: acquiring a feature vector of each image in a preset image set by using a convolutional neural network; acquiring Euclidean distances between the target image and other images according to the characteristic vectors of the target image in the preset image set and the characteristic vectors of other images except the target image; and acquiring a plurality of similar images of the target image from other images according to the Euclidean distance.
Specifically, the obtaining of multiple similar images of the target image from other images according to the euclidean distance may include: and sequencing the obtained Euclidean distances in an ascending order, and taking other images corresponding to the Euclidean distances of the first n bits as similar images of the target image, wherein n is more than or equal to 3 and less than or equal to the total number of other images. After sorting according to the Euclidean distance in an ascending order, the more front the image is, the smaller the corresponding Euclidean distance is, the higher the matching degree with the target image is, and the closer the image is to the target image, the more similar the image is.
In the embodiment of the present invention, the obtaining of the feature vector of each image in the preset image set by using the convolutional neural network includes: training a convolutional neural network by using all images in a preset image set; and acquiring a characteristic vector obtained after each image in a preset image set passes through a penultimate full-connected layer of the convolutional neural network in the training process, wherein the characteristic vector is used as a characteristic vector corresponding to each image.
Therefore, the image coding is carried out by utilizing the full connection layer of the convolutional neural network, the relevant features of the image can be automatically extracted and coded, the complex feature extraction process can be avoided, the method is suitable for various image data, and manual features do not need to be manually extracted according to the image features.
Further, after training the convolutional neural network with all the images in the preset image set, the method further includes: obtaining a loss value of the convolutional neural network trained at this time; judging whether the loss value meets a stable condition; and if not, continuing to train the convolutional neural network according to the target image with the updated identifier, and returning to the operation of acquiring a plurality of similar images of the target image in the preset image set.
Therefore, based on the convolutional neural network, the image identification correction can be performed through cyclic iteration in the training process, so that the labor cost and the time cost are further reduced, and the working efficiency is improved.
Wherein the continuing to train the convolutional neural network according to the target image updated by the identifier comprises: judging whether each image in the preset image set is respectively taken as a target image to be subjected to identification updating (namely judging whether the target image traverses the preset image set); if yes, forming an image set with an updated identifier according to the target image with the updated identifier; and training the convolutional neural network again by using all the images in the image set after the identification is updated.
Therefore, the accuracy of the trained convolutional neural network (image classification model) can be improved.
Specifically, the determining whether the loss value meets a stable condition includes: and judging whether the loss value meets a stable condition or not according to the change rate of the loss value.
More specifically, the stabilizing conditions are:
Figure BDA0001958049740000081
therein, lossiRepresenting said loss value, losskRepresenting the loss value of the required summation operation,
Figure BDA0001958049740000082
representing the rate of change of said loss value.
Here, k ═ i-2 · · i, that is, stability of the loss value is judged according to the loss values of the previous two times and the loss value of the current time; of course, k ═ i-3 · · i is also possible, i.e. the stability of the loss value is evaluated from the loss values of the previous three times and the loss value of the current time; the stability of the loss value may be evaluated based on the loss values of the first four times and the loss value of the current time, or k-i-5 · · i, and the like, which is not limited herein.
Further, after determining whether the loss value meets a stable condition, the method further includes: and if the loss value meets a stable condition, storing the parameter information of the convolutional neural network.
Further, after storing the parameter information of the convolutional neural network, the method further includes: and identifying the image to be identified by utilizing a convolutional neural network according to the parameter information.
The trained convolutional neural network (image classification model) is adopted for identification, and the identification accuracy can be improved.
In this embodiment of the present invention, the updating the identifier of the target image according to the identifier information of the multiple similar images includes: and according to the identification information of the plurality of similar images, utilizing a voting method to update the identification of the target image.
Therefore, the labor cost can be reduced, and the automatic correction of the image identification can be realized.
Specifically, the updating the identification of the target image by using a voting method according to the identification information of the plurality of similar images includes: acquiring the number of similar images with consistent identification information in a plurality of similar images; and according to the identification information of the similar images with the largest number, performing identification updating on the target image (updating the identification information of the target image into the acquired identification information).
The image identifier processing method provided by the embodiment of the present invention is further described below, wherein the preset image set takes an image set M including M images as an example.
To solve the above technical problem, an embodiment of the present invention provides an image identifier processing method, which may be specifically as shown in fig. 2:
the image classification training is carried out on a convolutional neural network by utilizing an image set M, the convolutional neural network comprises a convolutional layer, a pooling layer, a fully-connected layer (at least two layers) and the like, the feature vector of each image after passing through the second last fully-connected layer is saved as the feature vector of the image in the training process (namely class binary Hash coding, in the invention, the feature vector is a continuous value and does not need to be processed into values of 0 and 1), and the loss value (loss value) when the training stops (after passing through the last fully-connected layer and stopping) is recorded and recorded as the loss valueiWhere i represents the ith iteration.
Then, any one image A is selected from the image set M, Euclidean distances between the feature vector of the A and feature vectors of other images left in the image set M are calculated, the Euclidean distances are sorted in an ascending order, and other images corresponding to the first n Euclidean distances are obtained. Wherein n is not less than 3 and not more than m, and n is 5 as an example for explanation: 5 images B1, …, B5 corresponding to the euclidean distances of the top 5 are acquired as 5 images most similar to a. Then, the identification information of image a is corrected and updated by voting based on the identification information of 5 images B1, …, B5 (which identification information is used more than that based on the number of identification information). According to the method, A is taken until A traverses the image set M (namely each image in M images is taken as an image A to carry out the operation of updating the identification information), and the image set M after updating the label is obtained.
Then, based on the updated and labeled image set M, the image classification training is carried out on the convolutional neural network, and the process is continuously repeated until the loss value of the convolutional neural network is stable, for example, the loss value can be made to be stableiSatisfy the requirement of
Figure BDA0001958049740000091
The iterative update (of the image identification and of the convolutional neural network) is ended. And if the loss value of the convolutional neural network is unstable, continuously updating the identification information of the image and the parameter information of the convolutional neural network. Therein, losskRepresenting the penalty value that needs to be summed. In the embodiment of the invention, whether the loss value is stable or not is actually determined according to the change rate of the loss value; wherein k may also be k ═ i-3 · i; or k-i-4. cndot. i, etc.
After the iterative update is finished, the image set M after multiple iterative corrections can be obtained, and the accuracy of the image classification model (convolutional neural network) trained in the process is improved. And when the convolutional neural network is used for updating the marking information, the parameter information of the convolutional neural network is correspondingly updated.
In this way, in the embodiment of the present invention, each training has M images, and a loss value is obtained through the convolutional neural network. The M images are determined according to the training purpose, such as classifying cats and dogs, and the images are images of cats or dogs, but not limited thereto.
As can be seen from the above, the embodiment of the present invention specifically provides an image classification and image identification information processing method based on a convolutional neural network, which mainly involves: (1) correcting the image label by a convolutional neural network by adopting a voting method; and (2) a process of iteratively correcting the image identification information. While correcting the image identification information, a well-trained convolutional neural network (image classification model) can be obtained.
Specifically, the scheme provided by the embodiment of the invention is as follows:
(1) the image classification method based on the convolutional neural network can correct the image identification information through loop iteration in the training process, so that the labor cost and the time cost can be reduced, and the working efficiency can be improved;
(2) the full-connection layer of the convolutional neural network is utilized to encode the image, the process can automatically extract the relevant features of the image and encode the image, the complex feature extraction process can be avoided, and the method is suitable for various image data and does not need to manually extract manual features according to the image features;
(3) meanwhile, the accuracy of the trained image classification model can be improved.
In conclusion, the scheme provided by the embodiment of the invention can reduce the labor cost and realize the automatic correction of the image identification information; the method has no complicated manual feature extraction process, is suitable for various image data, and does not need to manually extract manual features according to the image features; meanwhile, an image classification method based on the convolutional neural network is formed, and the accuracy of image classification can be improved.
An embodiment of the present invention further provides an image identifier processing apparatus, as shown in fig. 3, including:
the first obtaining module 31 is configured to obtain multiple similar images of a target image in a preset image set in the preset image set;
the first processing module 32 is configured to perform identifier update on the target image according to the identifier information of the multiple similar images;
the similar images are images with the matching degree with the target image reaching a preset threshold, and the number of the similar images is at least three.
The image identification processing device provided by the embodiment of the invention acquires a plurality of similar images of a target image in a preset image set in the preset image set; according to the identification information of the similar images, the identification of the target image is updated; the similar images are images with the matching degree with the target image reaching a preset threshold value, and the number of the similar images is at least three; the automatic correction of the image identification can be realized, manual characteristics do not need to be manually extracted according to the image characteristics, so that the labor cost and the time cost are reduced, the working efficiency is improved, and the method is suitable for various image data; the problems that image identification processing schemes in the prior art all need manual participation, and are high in cost, low in efficiency and not universal are well solved.
Wherein, the first obtaining module comprises: the first obtaining submodule is used for obtaining a characteristic vector of each image in a preset image set by using a convolutional neural network; the second obtaining submodule is used for obtaining Euclidean distances between the target image and other images according to the characteristic vectors of the target image in the preset image set and the characteristic vectors of other images except the target image; and the third acquisition submodule is used for acquiring a plurality of similar images of the target image from other images according to the Euclidean distance.
In this embodiment of the present invention, the first obtaining sub-module includes: the first processing unit is used for training a convolutional neural network by utilizing all images in a preset image set; and the second processing unit is used for acquiring a feature vector obtained by each image in the preset image set after passing through a penultimate full-link layer of the convolutional neural network in the training process, and taking the feature vector as a feature vector corresponding to each image.
Further, the image identification processing apparatus further includes: the second acquisition module is used for acquiring a loss value of the convolutional neural network trained at this time after the convolutional neural network is trained by using all images in a preset image set; the first judgment module is used for judging whether the loss value meets a stable condition or not; and if the target images do not meet the preset image set, continuing to train the convolutional neural network according to the target images after the identification update, and returning to the operation of acquiring a plurality of similar images of the target images in the preset image set.
Wherein the second processing module comprises: the first judgment submodule is used for judging whether each image in the preset image set is respectively used as a target image to be subjected to identification updating; the first processing submodule is used for forming an image set after the identification is updated according to the target image after the identification is updated if the target image is the target image; and the first updating submodule is used for retraining the convolutional neural network by utilizing all the images in the image set after the identification is updated.
Specifically, the first determining module includes: and the second judgment submodule is used for judging whether the loss value meets a stable condition or not according to the change rate of the loss value.
More specifically, the stabilizing conditions are:
Figure BDA0001958049740000111
therein, lossiRepresenting said loss value, losskRepresenting the loss value of the required summation operation,
Figure BDA0001958049740000112
representing the rate of change of said loss value.
Further, the image identification processing apparatus further includes: and the first storage module is used for storing the parameter information of the convolutional neural network if the loss value meets the stable condition after judging whether the loss value meets the stable condition.
Still further, the image identification processing apparatus further includes: and the first identification module is used for identifying the image to be identified by using the convolutional neural network according to the parameter information after the parameter information of the convolutional neural network is stored.
In an embodiment of the present invention, the first processing module includes: and the second processing submodule is used for carrying out identification updating on the target image by using a voting method according to the identification information of the plurality of similar images.
The implementation embodiments of the image identifier processing method are all applicable to the embodiment of the image identifier processing device, and the same technical effects can be achieved.
The embodiment of the invention also provides image identification processing equipment, which comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor; the processor implements the image identification processing method as described above when executing the program.
The implementation embodiments of the image identifier processing method are all applicable to the embodiment of the image identifier processing device, and the same technical effects can be achieved.
Embodiments of the present invention further provide a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps in the image identification processing method.
The implementation embodiments of the image identifier processing method are all applicable to the embodiment of the computer-readable storage medium, and the same technical effects can be achieved.
It should be noted that many of the functional components described in this specification are referred to as modules/sub-modules in order to more particularly emphasize their implementation independence.
In embodiments of the invention, the modules/sub-modules may be implemented in software for execution by various types of processors. An identified module of executable code may, for instance, comprise one or more physical or logical blocks of computer instructions which may, for instance, be constructed as an object, procedure, or function. Nevertheless, the executables of an identified module need not be physically located together, but may comprise disparate instructions stored in different bits which, when joined logically together, comprise the module and achieve the stated purpose for the module.
Indeed, a module of executable code may be a single instruction, or many instructions, and may even be distributed over several different code segments, among different programs, and across several memory devices. Likewise, operational data may be identified within the modules and may be embodied in any suitable form and organized within any suitable type of data structure. The operational data may be collected as a single data set, or may be distributed over different locations including over different storage devices, and may exist, at least partially, merely as electronic signals on a system or network.
When a module can be implemented by software, considering the level of existing hardware technology, a module that can be implemented by software can build corresponding hardware circuits including conventional very large scale integration (V L SI) circuits or gate arrays and existing semiconductors such as logic chips, transistors, or other discrete components to implement corresponding functions, without considering the cost.
While the preferred embodiments of the present invention have been described, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention.

Claims (22)

1. An image identifier processing method, comprising:
acquiring a plurality of similar images of a target image in a preset image set in the preset image set;
according to the identification information of the similar images, the identification of the target image is updated;
the similar images are images with the matching degree with the target image reaching a preset threshold, and the number of the similar images is at least three.
2. The image identifier processing method according to claim 1, wherein the acquiring multiple similar images of the target image in the preset image set comprises:
acquiring a feature vector of each image in a preset image set by using a convolutional neural network;
acquiring Euclidean distances between the target image and other images according to the characteristic vectors of the target image in the preset image set and the characteristic vectors of other images except the target image;
and acquiring a plurality of similar images of the target image from other images according to the Euclidean distance.
3. The image identifier processing method according to claim 2, wherein the obtaining the feature vector of each image in the preset image set by using a convolutional neural network comprises:
training a convolutional neural network by using all images in a preset image set;
and acquiring a characteristic vector obtained after each image in a preset image set passes through a penultimate full-connected layer of the convolutional neural network in the training process, wherein the characteristic vector is used as a characteristic vector corresponding to each image.
4. The image identifier processing method of claim 3, further comprising, after training the convolutional neural network with all images in the preset image set:
obtaining a loss value of the convolutional neural network trained at this time;
judging whether the loss value meets a stable condition;
and if not, continuing to train the convolutional neural network according to the target image with the updated identifier, and returning to the operation of acquiring a plurality of similar images of the target image in the preset image set.
5. The image tag processing method of claim 4, wherein the continuing training of the convolutional neural network according to the target image after tag update comprises:
judging whether each image in the preset image set is respectively used as a target image to be subjected to identification updating;
if yes, forming an image set with an updated identifier according to the target image with the updated identifier;
and training the convolutional neural network again by using all the images in the image set after the identification is updated.
6. The method according to claim 4, wherein the determining whether the loss value meets a stability condition comprises:
and judging whether the loss value meets a stable condition or not according to the change rate of the loss value.
7. The image marker processing method according to claim 6, wherein the stable condition is:
Figure FDA0001958049730000021
therein, lossiRepresenting said loss value, losskRepresenting the loss value of the required summation operation,
Figure FDA0001958049730000022
representing the rate of change of said loss value.
8. The image identifier processing method according to claim 4, further comprising, after determining whether the loss value satisfies a stable condition:
and if the loss value meets a stable condition, storing the parameter information of the convolutional neural network.
9. The image identifier processing method according to claim 8, further comprising, after storing the parameter information of the convolutional neural network:
and identifying the image to be identified by utilizing a convolutional neural network according to the parameter information.
10. The image identifier processing method according to claim 1, wherein the identifier updating of the target image according to the identifier information of the plurality of similar images includes:
and according to the identification information of the plurality of similar images, utilizing a voting method to update the identification of the target image.
11. An image tag processing apparatus, comprising:
the first acquisition module is used for acquiring a plurality of similar images of a target image in a preset image set in the preset image set;
the first processing module is used for carrying out identification updating on the target image according to the identification information of the similar images;
the similar images are images with the matching degree with the target image reaching a preset threshold, and the number of the similar images is at least three.
12. The image tag processing apparatus of claim 11, wherein the first obtaining module comprises:
the first obtaining submodule is used for obtaining a characteristic vector of each image in a preset image set by using a convolutional neural network;
the second obtaining submodule is used for obtaining Euclidean distances between the target image and other images according to the characteristic vectors of the target image in the preset image set and the characteristic vectors of other images except the target image;
and the third acquisition submodule is used for acquiring a plurality of similar images of the target image from other images according to the Euclidean distance.
13. The image identifier processing apparatus according to claim 12, wherein the first acquisition sub-module includes:
the first processing unit is used for training a convolutional neural network by utilizing all images in a preset image set;
and the second processing unit is used for acquiring a feature vector obtained by each image in the preset image set after passing through a penultimate full-link layer of the convolutional neural network in the training process, and taking the feature vector as a feature vector corresponding to each image.
14. The image tag processing apparatus according to claim 13, further comprising:
the second acquisition module is used for acquiring a loss value of the convolutional neural network trained at this time after the convolutional neural network is trained by using all images in a preset image set;
the first judgment module is used for judging whether the loss value meets a stable condition or not;
and if the target images do not meet the preset image set, continuing to train the convolutional neural network according to the target images after the identification update, and returning to the operation of acquiring a plurality of similar images of the target images in the preset image set.
15. The image tag processing apparatus of claim 14, wherein the second processing module comprises:
the first judgment submodule is used for judging whether each image in the preset image set is respectively used as a target image to be subjected to identification updating;
the first processing submodule is used for forming an image set after the identification is updated according to the target image after the identification is updated if the target image is the target image;
and the first updating submodule is used for retraining the convolutional neural network by utilizing all the images in the image set after the identification is updated.
16. The image tag processing apparatus according to claim 14, wherein the first determining means comprises:
and the second judgment submodule is used for judging whether the loss value meets a stable condition or not according to the change rate of the loss value.
17. The image tag processing apparatus according to claim 16, wherein the stable condition is:
Figure FDA0001958049730000041
therein, lossiRepresenting said loss value, losskRepresenting the loss value of the required summation operation,
Figure FDA0001958049730000042
representing the rate of change of said loss value.
18. The image tag processing apparatus according to claim 14, further comprising:
and the first storage module is used for storing the parameter information of the convolutional neural network if the loss value meets the stable condition after judging whether the loss value meets the stable condition.
19. The image tag processing apparatus according to claim 18, further comprising:
and the first identification module is used for identifying the image to be identified by using the convolutional neural network according to the parameter information after the parameter information of the convolutional neural network is stored.
20. The image tag processing apparatus of claim 11, wherein the first processing module comprises:
and the second processing submodule is used for carrying out identification updating on the target image by using a voting method according to the identification information of the plurality of similar images.
21. An image identification processing apparatus comprising a memory, a processor and a computer program stored on the memory and executable on the processor; characterized in that the processor implements the image identification processing method according to any one of claims 1 to 10 when executing the program.
22. A computer-readable storage medium, on which a computer program is stored, which program, when being executed by a processor, carries out the steps of the image identification processing method of any one of claims 1 to 10.
CN201910073615.XA 2019-01-25 2019-01-25 Image identification processing method, device, equipment and computer readable storage medium Active CN111488891B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910073615.XA CN111488891B (en) 2019-01-25 2019-01-25 Image identification processing method, device, equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910073615.XA CN111488891B (en) 2019-01-25 2019-01-25 Image identification processing method, device, equipment and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN111488891A true CN111488891A (en) 2020-08-04
CN111488891B CN111488891B (en) 2023-04-07

Family

ID=71810686

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910073615.XA Active CN111488891B (en) 2019-01-25 2019-01-25 Image identification processing method, device, equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN111488891B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112307238A (en) * 2020-10-30 2021-02-02 维沃移动通信有限公司 Image management method and device and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101395614A (en) * 2006-03-03 2009-03-25 皇家飞利浦电子股份有限公司 Identifying a set of image characteristics for assessing similarity of images
US20140093122A1 (en) * 2012-10-02 2014-04-03 Nitzan Rabinowitz Image identifiers and methods and systems of presenting image identifiers
CN104915643A (en) * 2015-05-26 2015-09-16 中山大学 Deep-learning-based pedestrian re-identification method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101395614A (en) * 2006-03-03 2009-03-25 皇家飞利浦电子股份有限公司 Identifying a set of image characteristics for assessing similarity of images
US20140093122A1 (en) * 2012-10-02 2014-04-03 Nitzan Rabinowitz Image identifiers and methods and systems of presenting image identifiers
CN104915643A (en) * 2015-05-26 2015-09-16 中山大学 Deep-learning-based pedestrian re-identification method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
李迎新等: "基于索引和相关反馈的图像检索研究", 《现代计算机》 *
王超: "基于卷积神经网络的图像分类技术研究与实现", 《电脑知识与技术》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112307238A (en) * 2020-10-30 2021-02-02 维沃移动通信有限公司 Image management method and device and electronic equipment

Also Published As

Publication number Publication date
CN111488891B (en) 2023-04-07

Similar Documents

Publication Publication Date Title
CN109255364B (en) Scene recognition method for generating countermeasure network based on deep convolution
CN110992365B (en) Loss function based on image semantic segmentation and design method thereof
CN106022232A (en) License plate detection method based on deep learning
CN110647907B (en) Multi-label image classification algorithm using multi-layer classification and dictionary learning
CN111079847B (en) Remote sensing image automatic labeling method based on deep learning
CN112541580A (en) Semi-supervised domain self-adaption method based on active counterstudy
Zin et al. Cow identification system using ear tag recognition
CN110738166B (en) Fishing administration monitoring system infrared target identification method based on PCNN and PCANet and storage medium
CN115439715A (en) Semi-supervised few-sample image classification learning method and system based on anti-label learning
CN116071560A (en) Fruit identification method based on convolutional neural network
CN111488891B (en) Image identification processing method, device, equipment and computer readable storage medium
CN112396042A (en) Real-time updated target detection method and system, and computer-readable storage medium
WO2023207535A1 (en) Pseudo label box generation method and apparatus, and electronic device
CN117197459A (en) Weak supervision semantic segmentation method based on saliency map and attention mechanism
CN111401394B (en) Image annotation method and device and computer readable storage medium
CN114357221B (en) Self-supervision active learning method based on image classification
CN113192108B (en) Man-in-loop training method and related device for vision tracking model
CN113344102B (en) Target image recognition method based on image HOG features and ELM model
CN115497021A (en) Method for identifying fine granularity of sow lactation behavior based on computer vision
Chen et al. IF-Net: an illumination-invariant feature network
CN113850166A (en) Ship image identification method and system based on convolutional neural network
CN113255722A (en) Image annotation method and device
CN113361530A (en) Image semantic accurate segmentation and optimization method using interaction means
CN111260659A (en) Image interactive segmentation method based on initial annotation point guidance
TWI815762B (en) Training method of image recognition deep learning model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant