CN114549938B

CN114549938B - Model training method, image information management method, image recognition method and device

Info

Publication number: CN114549938B
Application number: CN202210436659.6A
Authority: CN
Inventors: 丁明; 王杰; 钟忞盛
Original assignee: Guangzhou Xuanwu Wireless Technology Co Ltd
Current assignee: Guangzhou Xuantong Technology Co ltd
Priority date: 2022-04-25
Filing date: 2022-04-25
Publication date: 2022-09-09
Anticipated expiration: 2042-04-25
Also published as: CN114549938A

Abstract

The invention discloses a model training method, an image information management method, an image recognition method and a device. The model training method includes: randomly initializing an initial image processing model; the initial image processing model includes a feature extraction branch and a prototype building branch; fixing the parameters of the prototype building branch, and inputting the training image set into the initial image processing model to obtain the first image processing model. An output result; determine a loss function according to the first output result, and use the loss function to update the initialization parameters of the feature extraction branch; fix the updated parameters of the feature extraction branch, input the training image set into the initial image processing model, and obtain the second Output results; determine a loss function according to the second output result, and use the loss function to update the initialization parameters of the prototype construction branch; build a commodity fingerprint extraction model according to the updated parameters of the feature extraction branch, so that the commodity fingerprint output by the commodity fingerprint extraction model has better differentiation.

Description

Model training method, image information management method, image recognition method and device

技术领域technical field

本发明涉及图像处理技术领域，特别涉及一种模型训练方法、图像信息管理方法、图像识别方法及装置。The present invention relates to the technical field of image processing, in particular to a model training method, an image information management method, an image recognition method and a device.

背景技术Background technique

商品指纹技术是一种通过数字编码形式表示商品的独特性和其关系的技术，通过商品指纹建模技术可以得到商品图片的有效表示，无需人工做商品区分标记，因此商品指纹建模技术可快速推广至商品分类、相似商品检索等应用中。Commodity fingerprint technology is a technology that represents the uniqueness of commodities and their relationships through digital coding. Through commodity fingerprint modeling technology, an effective representation of commodity pictures can be obtained without manual identification of commodities. Therefore, commodity fingerprint modeling technology can quickly Promote it to applications such as product classification and similar product retrieval.

目前，应用商品指纹匹配技术进行商品识别时，需在商品指纹数据库中存储各商品的基准指纹，以使目标商品指纹在完整的指纹数据库中进行搜索和识别。然而，在实际应用场景中，快销行业商品往往种类繁多且推陈出新较快，且商品拍摄时易受斜拍、摆放姿态及光照等环境影响，使得商品指纹数据库内需保存的基准指纹规模庞大，占用了大量的内存，同时也使得商品指纹匹配过程需花费更多的指纹库遍历查找时间，识别效率低下。At present, when using the commodity fingerprint matching technology for commodity identification, it is necessary to store the reference fingerprints of each commodity in the commodity fingerprint database, so that the target commodity fingerprint can be searched and identified in the complete fingerprint database. However, in practical application scenarios, commodities in the fast-selling industry tend to be of various types and bring forth new ones quickly, and the commodities are easily affected by the environment such as oblique shooting, posture and lighting, which makes the scale of the benchmark fingerprints to be stored in the commodity fingerprint database huge. It takes up a lot of memory, and at the same time, the fingerprint matching process of the product takes more time to traverse the fingerprint database, and the identification efficiency is low.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于至少解决现有技术中存在的技术问题之一，提供一种模型训练方法、图像信息管理方法、图像识别方法及装置，可提高商品指纹的区分性，同时降低存储的指纹数量，所述技术方案如下：The purpose of the present invention is to solve at least one of the technical problems existing in the prior art, and to provide a model training method, an image information management method, an image recognition method and a device, which can improve the distinguishability of commodity fingerprints and reduce the number of stored fingerprints at the same time. , the technical solution is as follows:

第一方面，本发明提供一种模型训练方法，包括：In a first aspect, the present invention provides a model training method, comprising:

对初始图像处理模型的参数进行随机初始化；其中，所述初始图像处理模型包括特征提取分支和原型构建分支；Randomly initialize the parameters of the initial image processing model; wherein, the initial image processing model includes a feature extraction branch and a prototype building branch;

固定所述原型构建分支的初始化参数，将训练图像集输入至所述初始图像处理模型，得到第一输出结果；Fix the initialization parameters of the prototype construction branch, input the training image set into the initial image processing model, and obtain the first output result;

根据所述第一输出结果确定损失函数的值，利用所述损失函数的值对所述特征提取分支的初始化参数进行更新；Determine the value of the loss function according to the first output result, and use the value of the loss function to update the initialization parameter of the feature extraction branch;

固定所述特征提取分支更新后的参数，将所述训练图像集输入至特征提取分支参数更新后的初始图像处理模型中，得到第二输出结果；Fixing the updated parameters of the feature extraction branch, and inputting the training image set into the initial image processing model after updating the parameters of the feature extraction branch to obtain a second output result;

根据所述第二输出结果确定所述损失函数的值，利用所述损失函数的值对所述原型构建分支的初始化参数进行更新；Determine the value of the loss function according to the second output result, and use the value of the loss function to update the initialization parameter of the prototype building branch;

根据所述特征提取分支更新后的参数构建商品指纹提取模型。A commodity fingerprint extraction model is constructed according to the updated parameters of the feature extraction branch.

作为进一步改进，所述第一输出结果和所述第二输出结果均包括所述特征提取分支输出的商品指纹和所述原型构建分支输出的指纹原型。As a further improvement, both the first output result and the second output result include the commodity fingerprint output by the feature extraction branch and the fingerprint prototype output by the prototype building branch.

作为进一步改进，所述类别损失定义为：As a further improvement, the class loss is defined as:

式中，f _x表示所述特征提取分支输出的商品指纹，e表示所述原型构建分支输出的指纹原型，T表示温度系数，N表示图像样本数量；In the formula, f _x represents the commodity fingerprint output by the feature extraction branch, e represents the fingerprint prototype output by the prototype construction branch, T represents the temperature coefficient, and N represents the number of image samples;

所述类内类间损失定义为：The intra-class and inter-class loss is defined as:

式中，

表示边界阈值，Z _p表示第一输出结果或第二输出结果中与所述特征提取分支输出的商品指纹类别相同的结果，Z _n表示第一输出结果或第二输出结果中与所述特征提取分支输出的商品指纹类别不相同的结果。In the formula,

represents the boundary threshold , Z _p represents the first output result or the second output result that is the same as the product fingerprint category output by the feature extraction branch, Z _n represents the first output result or the second output result that is the same as the feature extraction result The product fingerprint categories output by the branch are not the same result.

第二方面，本发明还提供一种图像信息管理方法，包括：In a second aspect, the present invention also provides an image information management method, comprising:

获取场景图像集，利用图像检测模型获取所述场景图像集中的全量商品图像；Obtaining a scene image set, and using an image detection model to obtain the full amount of commodity images in the scene image set;

根据商品指纹提取模型提取所述商品图像中的商品指纹，得到商品指纹集；其中，The commodity fingerprints in the commodity images are extracted according to the commodity fingerprint extraction model to obtain a commodity fingerprint set; wherein,

所述商品指纹提取模型为采用第一方面所述的模型训练方法训练出的模型；The commodity fingerprint extraction model is a model trained by the model training method described in the first aspect;

确定所述商品指纹集的商品类别，利用聚类算法对各所述商品类别对应的商品指纹进行压缩；Determine the commodity category of the commodity fingerprint set, and use a clustering algorithm to compress commodity fingerprints corresponding to each commodity category;

将压缩后的商品指纹存储至商品指纹数据库。Store the compressed commodity fingerprints in the commodity fingerprint database.

作为进一步改进，所述利用聚类算法对各所述商品类别对应的商品指纹进行压缩，具体为：确定所述商品指纹集中各商品指纹的商品类别；根据预设的数量参数N，利用聚类算法依次将各商品类别下的商品指纹压缩为N个类簇，并将N个所述类簇的类簇中心设置为对应商品类别的目标商品指纹；其中，N为非零自然数；将各所述商品类别的目标商品指纹存储至商品指纹数据库。As a further improvement, the use of a clustering algorithm to compress the commodity fingerprints corresponding to each commodity category is specifically: determining the commodity category of each commodity fingerprint in the commodity fingerprint set; according to a preset quantity parameter N, using clustering The algorithm compresses the commodity fingerprints under each commodity category into N clusters in turn, and sets the cluster centers of the N clusters as the target commodity fingerprints of the corresponding commodity category; among them, N is a non-zero natural number; The target commodity fingerprint of the commodity category is stored in the commodity fingerprint database.

第三方面，本发明还提供一种图像识别方法，包括：In a third aspect, the present invention also provides an image recognition method, comprising:

获取目标商品图像，利用商品指纹提取模型提取所述目标商品图像的目标商品指纹；acquiring a target commodity image, and extracting the target commodity fingerprint of the target commodity image by using a commodity fingerprint extraction model;

基于所述目标商品指纹遍历商品指纹数据库中的基准指纹，将与所述目标商品指纹相似度最高的基准指纹所对应的商品类别确定为识别结果；其中，Based on the target commodity fingerprint traversing the benchmark fingerprints in the commodity fingerprint database, the commodity category corresponding to the benchmark fingerprint with the highest similarity with the target commodity fingerprint is determined as the identification result; wherein,

所述商品指纹提取模型为采用如第一方面所述的模型训练方法训练出的模型；The commodity fingerprint extraction model is a model trained by the model training method described in the first aspect;

所述商品指纹数据库由如第二方面所述的图像信息管理方法得到。The commodity fingerprint database is obtained by the image information management method according to the second aspect.

第四方面，本发明还提供一种模型训练装置，包括：In a fourth aspect, the present invention also provides a model training device, comprising:

初始化模块，用于对初始图像处理模型的参数进行随机初始化；其中，所述初始图像处理模型包括特征提取分支和原型构建分支；an initialization module for randomly initializing parameters of an initial image processing model; wherein, the initial image processing model includes a feature extraction branch and a prototype building branch;

第一训练模块，用于固定所述原型构建分支的初始化参数，将训练图像集输入至所述初始图像处理模型，得到第一输出结果；The first training module is used to fix the initialization parameters of the prototype construction branch, input the training image set into the initial image processing model, and obtain the first output result;

第二训练模块，用于固定所述特征提取分支更新后的参数，将所述训练图像集输入至特征提取分支参数更新后的初始图像处理模型中，得到第二输出结果；The second training module is used to fix the updated parameters of the feature extraction branch, and input the training image set into the initial image processing model after the updated parameters of the feature extraction branch to obtain a second output result;

确定模块，用于根据所述特征提取分支更新后的参数构建商品指纹提取模型。A determination module, configured to construct a commodity fingerprint extraction model according to the updated parameters of the feature extraction branch.

同时，本发明提供一种数据处理设备，包括处理器，所述处理器和存储器耦合，所述存储器存储有程序，所述程序由所述处理器执行，使得所述数据处理设备执行第一方面所述的模型训练方法，或第二方面所述的图像信息管理方法，或第三方面所述的图像识别方法。Meanwhile, the present invention provides a data processing device, comprising a processor, the processor is coupled with a memory, the memory stores a program, the program is executed by the processor, so that the data processing device executes the first aspect The model training method, the image information management method described in the second aspect, or the image recognition method described in the third aspect.

本发明还提供一种计算机可读存储介质，其上存储有计算机程序，所述计算机程序被处理器执行时，实现如上述第一方面所述的模型训练方法，或第二方面所述的图像信息管理方法，或第三方面所述的图像识别方法。The present invention also provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the model training method described in the first aspect above, or the image described in the second aspect. An information management method, or the image recognition method described in the third aspect.

相较于现有技术，本发明提供技术方案至少具有如下的有益效果：Compared with the prior art, the technical solutions provided by the present invention have at least the following beneficial effects:

1、通过训练包含特征提取分支和原型构建分支的双分支模型，可确保所构建的商品指纹提取模型所输出的商品指纹具备更好的区分性；1. By training the dual-branch model including the feature extraction branch and the prototype construction branch, it can ensure that the commodity fingerprint output by the constructed commodity fingerprint extraction model has better distinguishability;

2、通过利用双分支商品指纹提取模型和聚类方法对应用场景下的商品图像进行商品指纹提取和压缩，可在保障商品识别率的前提下，有效减小商品指纹库的存储规模，从而减少商品指纹的匹配搜索时间，提高商品指纹识别效率。2. By using the dual-branch commodity fingerprint extraction model and clustering method to extract and compress commodity images of commodity images in application scenarios, the storage scale of commodity fingerprint database can be effectively reduced on the premise of ensuring commodity recognition rate, thereby reducing The matching search time of commodity fingerprints improves the efficiency of commodity fingerprint identification.

附图说明Description of drawings

为了更清楚地说明本发明的技术方案，下面将对实施方式中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本发明的一些实施方式，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions of the present invention more clearly, the following will briefly introduce the accompanying drawings used in the embodiments. Obviously, the drawings in the following description are only some embodiments of the present invention, which are common in the art. As far as technical personnel are concerned, other drawings can also be obtained based on these drawings without any creative effort.

图1是本发明实施例提供的模型训练方法的流程示意图；1 is a schematic flowchart of a model training method provided by an embodiment of the present invention;

图2是本发明实施例提供的图像信息管理方法的流程示意图；2 is a schematic flowchart of an image information management method provided by an embodiment of the present invention;

图3是本发明实施例提供的模型训练装置的结构示意图。FIG. 3 is a schematic structural diagram of a model training apparatus provided by an embodiment of the present invention.

具体实施方式Detailed ways

本部分将详细描述本发明的具体实施例，本发明之较佳实施例在附图中示出，附图的作用在于用图形补充说明书文字部分的描述，使人能够直观地、形象地理解本发明的每个技术特征和整体技术方案，但其不能理解为对本发明保护范围的限制。This part will describe the specific embodiments of the present invention in detail, and the preferred embodiments of the present invention are shown in the accompanying drawings. Each technical feature and overall technical solution of the invention should not be construed as limiting the protection scope of the invention.

如图1所示，第一方面，本发明一个实施例提供一种模型训练方法，包括下述步骤S101至S104。As shown in FIG. 1 , in a first aspect, an embodiment of the present invention provides a model training method, including the following steps S101 to S104.

S101：对初始图像处理模型的参数进行随机初始化；其中，所述初始图像处理模型包括特征提取分支和原型构建分支。S101: Randomly initialize parameters of an initial image processing model; wherein, the initial image processing model includes a feature extraction branch and a prototype building branch.

S102：固定所述原型构建分支的初始化参数，将训练图像集输入至所述初始图像处理模型，得到第一输出结果；根据所述第一输出结果确定损失函数的值，利用所述损失函数的值对所述特征提取分支的初始化参数进行更新。S102: Fix the initialization parameters of the prototype construction branch, input the training image set into the initial image processing model, and obtain a first output result; determine the value of the loss function according to the first output result, and use the loss function value to update the initialization parameters of the feature extraction branch.

本实施例所使用的训练图像集具体为商品图像集，各商品图像通过对采集得到的大量货架、端架及冰箱等快销场景真实图像进行分割得到，具体可利用商品检测模型对所采集的真实图像进行图像分割。The training image set used in this embodiment is specifically a commodity image set, and each commodity image is obtained by segmenting a large number of collected real images of fast-selling scenes such as shelves, end racks, and refrigerators. real images for image segmentation.

S103：固定所述特征提取分支更新后的参数，将所述训练图像集输入至特征提取分支参数更新后的初始图像处理模型中，得到第二输出结果；根据所述第二输出结果确定所述损失函数的值，利用所述损失函数的值对所述原型构建分支的初始化参数进行更新。S103: Fix the updated parameters of the feature extraction branch, input the training image set into the initial image processing model after the updated parameters of the feature extraction branch, and obtain a second output result; determine the The value of the loss function, using the value of the loss function to update the initialization parameters of the prototype building branch.

S104：根据所述特征提取分支更新后的参数构建商品指纹提取模型。S104: Build a commodity fingerprint extraction model according to the updated parameters of the feature extraction branch.

需要说明的是，第一输出结果和所述第二输出结果均包括特征提取分支输出的商品指纹和原型构建分支输出的指纹原型。It should be noted that both the first output result and the second output result include the commodity fingerprint output by the feature extraction branch and the fingerprint prototype output by the prototype building branch.

具体地，损失函数包括类别损失和类内类间损失两个部分，其中，类别损失用于描述商品指纹与指纹原型间的相似度。Specifically, the loss function includes two parts: category loss and intra-class loss, where category loss is used to describe the similarity between commodity fingerprints and fingerprint prototypes.

具体地，所述类别损失定义为：Specifically, the class loss is defined as:

式中，

在本实施例中，损失函数可具体表示为：In this embodiment, the loss function can be specifically expressed as:

其中，α表示权重系数。Among them, α represents the weight coefficient.

示例性的，当获取训练图像集后，首先对初始图像处理模型中的特征提取分支及原型构建分支进行随机初始化，再重复下述训练步骤，直至模型收敛。Exemplarily, after acquiring the training image set, first randomly initialize the feature extraction branch and the prototype building branch in the initial image processing model, and then repeat the following training steps until the model converges.

训练图像输入初始图像处理模型中，经特征提取分支输出商品指纹f _x。首先保持原型构建分支参数不变，对特征提取分支进行训练，具体通过损失函数的反向传播对特征提取分支的参数进行更新，直至迭代结束。The training image is input into the initial image processing model, and the product fingerprint f _x is output through the feature extraction branch. First, keep the parameters of the prototype construction branch unchanged, and train the feature extraction branch. Specifically, the parameters of the feature extraction branch are updated through the back-propagation of the loss function until the iteration ends.

进一步地，保持更新后的特征提取分支参数不变，基于上述损失函数的反向传播对原型构建分支的参数进行更新。Further, keeping the updated parameters of the feature extraction branch unchanged, and updating the parameters of the prototype construction branch based on the back-propagation of the above loss function.

具体地，当每个epoch完成后，将初始图像处理模型的学习率指数衰减。Specifically, when each epoch is completed, the learning rate of the initial image processing model is exponentially decayed.

完成上述训练步骤后，保存特征提取分支参数，并将其设置为最终的商品指纹提取模型。After completing the above training steps, save the feature extraction branch parameters and set them as the final commodity fingerprint extraction model.

本发明上述实施例通过训练包含特征提取分支和原型构建分支的双分支模型，可使不同环境影响下的同类别商品的指纹分布围绕在同一指纹原型下，同时拉开不同类别商品的原型距离，确保所构建的所构建的商品指纹提取模型所输出的商品指纹具备更好的区分性，且能够更好地识别相似商品和未见商品。The above-mentioned embodiments of the present invention train the dual-branch model including the feature extraction branch and the prototype construction branch, so that the fingerprint distribution of the same category of commodities under different environmental influences can be surrounded by the same fingerprint prototype, and the prototype distance of different categories of commodities can be widened at the same time. It is ensured that the commodity fingerprint output by the constructed commodity fingerprint extraction model has better distinguishability, and can better identify similar commodities and unseen commodities.

如图2所示，第二方面，本发明一个实施例还提供一种图像信息管理方法，包括下述步骤S201至S203。As shown in FIG. 2, in a second aspect, an embodiment of the present invention further provides an image information management method, including the following steps S201 to S203.

S201：获取场景图像集，利用图像检测模型获取所述场景图像集中的全量商品图像。S201: Acquire a scene image set, and use an image detection model to acquire a full amount of commodity images in the scene image set.

可以理解的是，所述场景图像集具体包括货架、端架及冰箱等快销场景的真实图像，通过商品检测模型对真实图像进行图像分割，可得到场景图像集中的全量商品图像。It can be understood that the scene image set specifically includes real images of fast-selling scenes such as shelves, end racks, and refrigerators, and the real images are segmented by the commodity detection model to obtain the full amount of commodity images in the scene image set.

S202：根据商品指纹提取模型提取所述商品图像中的商品指纹，得到商品指纹集。其中，所述商品指纹提取模型为通过上述实施例所述的模型训练方法训练得到。S202: Extract commodity fingerprints in the commodity image according to the commodity fingerprint extraction model to obtain a commodity fingerprint set. Wherein, the commodity fingerprint extraction model is obtained by training the model training method described in the above embodiment.

S203：确定所述商品指纹集的商品类别，利用聚类算法对各所述商品类别对应的商品指纹进行压缩，将压缩后的商品指纹存储至商品指纹数据库。S203: Determine the commodity category of the commodity fingerprint set, use a clustering algorithm to compress commodity fingerprints corresponding to each commodity category, and store the compressed commodity fingerprints in a commodity fingerprint database.

在一个示例中，利用聚类算法对各商品类别对应的商品指纹进行压缩时，可先确定商品指纹集中各商品指纹的商品类别；再根据预设的数量参数N，利用聚类算法依次将各商品类别下的商品指纹压缩为N个类簇，并将N个类簇的类簇中心设置为对应商品类别的目标商品指纹；其中，N为非零自然数；最后将各商品类别的目标商品指纹存储至商品指纹数据库。In an example, when using the clustering algorithm to compress the commodity fingerprints corresponding to each commodity category, the commodity category of each commodity fingerprint in the commodity fingerprint set can be determined first; The commodity fingerprints under the commodity category are compressed into N clusters, and the cluster centers of the N clusters are set as the target commodity fingerprints of the corresponding commodity categories; among them, N is a non-zero natural number; finally, the target commodity fingerprints of each commodity category are set. Stored in the commodity fingerprint database.

具体地，所使用的聚类算法可为K-means算法。Specifically, the clustering algorithm used may be the K-means algorithm.

示例性的，当利用商品指纹提取模型提取得到第i类别的商品指纹集S _i时，可通过K-means算法将商品指纹集S _i聚类成N个类簇，并将N个类簇中心作为第i类别商品的N个商品指纹。Exemplarily, when the commodity fingerprint set S _i of the i -th category is extracted by using the commodity fingerprint extraction model, the commodity fingerprint set S _i can be clustered into N clusters by the K-means algorithm, and the centers of the N clusters can be divided into N clusters. As the N commodity fingerprints of the i -th category commodity.

需要说明的是，数量参数N为小于商品指纹集S _i中商品指纹数量的值，不同商品类别可依据商品外包装是否具有多个不同的面，确定对应的数量参数。It should be noted that the quantity parameter N is a value smaller than the number of commodity fingerprints in the commodity fingerprint set S _i , and different commodity categories can determine the corresponding quantity parameter according to whether the outer packaging of the commodity has multiple different surfaces.

具体地，N个类簇中心可通过下述公式计算得到：Specifically, the N cluster centers can be calculated by the following formula:

其中，

，y表示商品类别，M表示第i类别商品第N个类簇的商品图像数量。in,

, y represents the product category, and M represents the number of product images in the Nth cluster of the i -th product.

进一步地，将N个商品指纹及其对应的商品类别i写入商品指纹数据库中，直至在商品指纹数据库中存储全量商品类别的商品指纹。Further, N commodity fingerprints and their corresponding commodity categories i are written into the commodity fingerprint database, until commodity fingerprints of all commodity categories are stored in the commodity fingerprint database.

本发明上述实施例通过利用双分支商品指纹提取模型和聚类方法对应用场景下的商品图像进行商品指纹提取和压缩，可在保障商品识别率的前提下，以更少的指纹表征各场景下的商品，从而有效减小商品指纹库中基准指纹数量的存储规模，减少了商品指纹的匹配搜索时间，提高商品指纹识别效率。The above-mentioned embodiments of the present invention extract and compress commodity images under application scenarios by using a dual-branch commodity fingerprint extraction model and a clustering method, so that under the premise of ensuring the commodity recognition rate, fewer fingerprints can be used to characterize each scenario. Therefore, the storage scale of the number of reference fingerprints in the commodity fingerprint database is effectively reduced, the matching search time of commodity fingerprints is reduced, and the efficiency of commodity fingerprint identification is improved.

第三方面，本发明一个实施例还提供一种图像识别方法，具体包括：获取目标商品图像，利用商品指纹提取模型提取目标商品图像的目标商品指纹，再基于目标商品指纹遍历商品指纹数据库中的基准指纹，将与目标商品指纹相似度最高的基准指纹确定为识别结果。In a third aspect, an embodiment of the present invention also provides an image recognition method, which specifically includes: acquiring a target commodity image, extracting a target commodity fingerprint of the target commodity image by using a commodity fingerprint extraction model, and then traversing the commodity fingerprint database based on the target commodity fingerprint. The reference fingerprint, the reference fingerprint with the highest similarity with the target commodity fingerprint is determined as the identification result.

其中，所述商品指纹提取模型为第一方面所述的模型训练方法训练出的模型，所述商品指纹库通过第二方面所述的图像信息管理方法得到。Wherein, the commodity fingerprint extraction model is a model trained by the model training method described in the first aspect, and the commodity fingerprint database is obtained by the image information management method described in the second aspect.

具体地，当目标商品图像为已见类商品时，可直接对商品图像进行商品指纹提取，并采用余弦距离作为度量函数，在商品指纹数据库中提取与其余弦相似性最大的基准指纹所对应的商品类别作为最终识别结果。Specifically, when the target product image is a known product, the product fingerprint can be extracted directly from the product image, and the cosine distance is used as the metric function to extract the product corresponding to the reference fingerprint with the largest cosine similarity in the product fingerprint database. category as the final recognition result.

另一方面，当目标商品图像为未见类商品时，首先随机抽取M张未见类商品图像进行标记，即设置商品类别标签；再通过商品指纹提取模型对其进行指纹提取，将提取的商品指纹及其对应商品类别加入至商品指纹数据库中。On the other hand, when the target product image is an unseen product, M images of unseen products are randomly selected for marking, that is, the product category label is set; Fingerprints and their corresponding commodity categories are added to the commodity fingerprint database.

进一步地，从商品指纹数据库中取余弦相似性最大的基准指纹所对应的商品类别作为最终识别结果。Further, the commodity category corresponding to the reference fingerprint with the largest cosine similarity is taken from the commodity fingerprint database as the final identification result.

请参阅图3，本发明另一实施例还提供一种模型训练装置，包括初始化模块101、第一训练模块102、第二训练模块103和确定模块104。Referring to FIG. 3 , another embodiment of the present invention further provides a model training apparatus, including an initialization module 101 , a first training module 102 , a second training module 103 and a determination module 104 .

初始化模块101用于对初始图像处理模型的参数进行随机初始化；其中，所述初始图像处理模型包括特征提取分支和原型构建分支。The initialization module 101 is used to randomly initialize the parameters of the initial image processing model; wherein, the initial image processing model includes a feature extraction branch and a prototype building branch.

第一训练模块102用于固定所述原型构建分支的初始化参数，将训练图像集输入至所述初始图像处理模型，得到第一输出结果。The first training module 102 is configured to fix the initialization parameters of the prototype building branch, input the training image set to the initial image processing model, and obtain a first output result.

根据所述第一输出结果确定损失函数的值，利用所述损失函数的值对所述特征提取分支的初始化参数进行更新。The value of the loss function is determined according to the first output result, and the initialization parameter of the feature extraction branch is updated by using the value of the loss function.

第二训练模块103用于固定所述特征提取分支更新后的参数，将所述训练图像集输入至特征提取分支参数更新后的初始图像处理模型中，得到第二输出结果。The second training module 103 is configured to fix the updated parameters of the feature extraction branch, and input the training image set into the initial image processing model after the updated parameters of the feature extraction branch to obtain a second output result.

根据所述第二输出结果确定所述损失函数的值，利用所述损失函数的值对所述原型构建分支的初始化参数进行更新。The value of the loss function is determined according to the second output result, and the initialization parameter of the prototype building branch is updated by using the value of the loss function.

确定模块104用于根据所述特征提取分支更新后的参数构建商品指纹提取模型。The determining module 104 is configured to construct a commodity fingerprint extraction model according to the updated parameters of the feature extraction branch.

上述装置内的各模块之间信息交互、执行过程等内容，由于与本发明模型训练方法实施例基于同一构思，具体内容可参见本发明方法实施例中的叙述，此处不再赘述。Since the information exchange and execution process among the modules in the above-mentioned device are based on the same concept as the embodiment of the model training method of the present invention, the specific content can be found in the description in the method embodiment of the present invention, and will not be repeated here.

本发明提供一种数据处理设备，包括处理器，所述处理器和存储器耦合，所述存储器存储有程序，所述程序由所述处理器执行，使得所述数据处理设备执行上述第一方面所述的模型训练方法，或第二方面所述的图像信息管理方法，或第三方面所述的图像识别方法。The present invention provides a data processing device, comprising a processor, the processor is coupled with a memory, the memory stores a program, and the program is executed by the processor, so that the data processing device executes the above-mentioned first aspect. The model training method described in the second aspect, the image information management method described in the second aspect, or the image recognition method described in the third aspect.

本发明还提供一种计算机可读存储介质，其上存储有计算机程序，所述计算机程序被处理器执行时，实现上述第一方面所述的模型训练方法，或第二方面所述的图像信息管理方法，或第三方面所述的图像识别方法。The present invention also provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the model training method described in the first aspect or the image information described in the second aspect The management method, or the image recognition method described in the third aspect.

本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程，是可以通过计算机程序来指令相关的硬件来完成，所述的程序可存储于计算机可监听存储介质中，该程序在执行时，可包括如上述各方法的实施例的流程。其中，所述的存储介质可为磁碟、光盘、只读存储记忆体（Read-Only Memory，ROM）或随机存储记忆体（Random AccessMemory，RAM）等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be completed by instructing the relevant hardware through a computer program, and the program can be stored in a computer monitorable storage medium. , may include the flow of the above-mentioned method embodiments. The storage medium may be a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM) or the like.

以上实施例的各技术特征可以进行任意的组合，为使描述简洁，未对上述实施例中的各个技术特征所有可能的组合都进行描述，然而，只要这些技术特征的组合不存在矛盾，都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined arbitrarily. In order to make the description simple, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features It is considered to be the range described in this specification.

Claims

1. A method of model training, comprising:

carrying out random initialization on parameters of the initial image processing model; wherein the initial image processing model comprises a feature extraction branch and a prototype construction branch;

fixing the initialization parameters of the prototype building branch, and inputting a training image set to the initial image processing model to obtain a first output result;

determining a value of a loss function according to the first output result, and updating the initialization parameter of the feature extraction branch by using the value of the loss function; the loss function comprises a class loss and an intra-class inter-class loss; wherein,

the class loss is defined as

(ii) a In the formula,f _x a fingerprint of the commodity representing the output of the feature extraction branch,ea fingerprint prototype representing the prototype-building branch output,< , >the inner product is represented by the sum of the two,ithe commodity type is represented, T represents a temperature coefficient, and N represents the number of image samples;

the intra-class inter-class loss is defined as

(ii) a In the formula,

indicating a boundary threshold，z _p The result which is the same as the commodity fingerprint category output by the characteristic extraction branch in the first output result or the second output result is represented,z _n the result which is different from the commodity fingerprint category output by the characteristic extraction branch in the first output result or the second output result is represented;

fixing the updated parameters of the feature extraction branch, and inputting the training image set into the initial image processing model after the parameters of the feature extraction branch are updated to obtain a second output result;

determining the value of the loss function according to the second output result, and updating the initialization parameter of the prototype construction branch by using the value of the loss function;

and constructing a commodity fingerprint extraction model according to the parameters updated by the feature extraction branches.

2. The model training method of claim 1, wherein the first output result and the second output result each comprise:

and the commodity fingerprint output by the branch is extracted by the characteristic extraction module and the prototype of the fingerprint output by the prototype construction module.

3. An image information management method characterized by comprising:

acquiring a scene image set, and acquiring a full commodity image in the scene image set by using an image detection model;

extracting the commodity fingerprint in the commodity image according to a commodity fingerprint extraction model to obtain a commodity fingerprint set; wherein,

the commodity fingerprint extraction model is a model trained by the model training method according to any one of claims 1-2;

determining the commodity category of the commodity fingerprint set, and compressing the commodity fingerprint corresponding to each commodity category by using a clustering algorithm;

and storing the compressed commodity fingerprint into a commodity fingerprint database.

4. The image information management method according to claim 3, wherein the compressing of the commodity fingerprint corresponding to each of the commodity categories by using a clustering algorithm is specifically:

determining the commodity category of each commodity fingerprint in the commodity fingerprint set;

according to a preset quantity parameter N, sequentially compressing the commodity fingerprints under each commodity category into N categories by using a clustering algorithm, and setting the cluster centers of the N categories as target commodity fingerprints corresponding to the commodity categories; wherein N is a non-zero natural number;

and storing the target commodity fingerprint of each commodity category in a commodity fingerprint database.

5. An image recognition method, comprising:

acquiring a target commodity image, and extracting a target commodity fingerprint of the target commodity image by using a commodity fingerprint extraction model;

traversing the reference fingerprints in the commodity fingerprint database based on the target commodity fingerprint, and determining the commodity category corresponding to the reference fingerprint with the highest similarity with the target commodity fingerprint as an identification result; wherein,

the commodity fingerprint database is obtained by the image information management method according to any one of claims 3 to 4.

6. A model training apparatus, comprising:

the initialization module is used for carrying out random initialization on the parameters of the initial image processing model; wherein the initial image processing model comprises a feature extraction branch and a prototype construction branch;

the first training module is used for fixing the initialization parameters of the prototype building branch, inputting a training image set to the initial image processing model and obtaining a first output result;

the class loss is defined as

the intra-class inter-class loss is defined as

(ii) a In the formula,

the second training module is used for fixing the updated parameters of the feature extraction branch, and inputting the training image set into the initial image processing model after the parameters of the feature extraction branch are updated to obtain a second output result;

and the determining module is used for constructing a commodity fingerprint extraction model according to the parameters updated by the feature extraction branches.

7. The model training apparatus of claim 6, wherein the first output result and the second output result each comprise:

8. A data processing apparatus, characterized by comprising:

a processor coupled to a memory, the memory storing a program for execution by the processor to cause the data processing apparatus to perform the model training method of any one of claims 1 to 2, or the image information management method of any one of claims 3 to 4, or the image recognition method of claim 5.

9. A computer storage medium storing computer instructions for performing the model training method according to any one of claims 1 to 2, or the image information management method according to any one of claims 3 to 4, or the image recognition method according to claim 5.