WO2020125207A1 - Information promotion method and apparatus - Google Patents

Information promotion method and apparatus Download PDF

Info

Publication number
WO2020125207A1
WO2020125207A1 PCT/CN2019/114081 CN2019114081W WO2020125207A1 WO 2020125207 A1 WO2020125207 A1 WO 2020125207A1 CN 2019114081 W CN2019114081 W CN 2019114081W WO 2020125207 A1 WO2020125207 A1 WO 2020125207A1
Authority
WO
WIPO (PCT)
Prior art keywords
trademark
target
area
image
information
Prior art date
Application number
PCT/CN2019/114081
Other languages
French (fr)
Chinese (zh)
Inventor
冯展鹏
黄轩
王孝宇
Original Assignee
深圳云天励飞技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳云天励飞技术有限公司 filed Critical 深圳云天励飞技术有限公司
Publication of WO2020125207A1 publication Critical patent/WO2020125207A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions

Definitions

  • the present invention relates to the field of information promotion, and in particular, to an information promotion method and device.
  • Trademarks are used by producers and operators of commodities on their production, manufacture, processing, selection or distribution of commodities or service providers on the services they provide, and are used to distinguish the source of commodities or services, including text , Graphics, letters, numbers, three-dimensional signs, color combinations and sounds, as well as the combination of the above-mentioned elements, signs with distinctive characteristics.
  • the promotion information about trademarks displayed in shopping malls is generally fixed, and the promotion information seen by each consumer is the same. Because consumers have different interests and hobbies, many consumers promote the fixed broadcast promotion information. May not be interested, resulting in poor information promotion.
  • Embodiments of the present invention provide information promotion methods and devices to solve the problem of poor information promotion effects.
  • the first aspect provides an information promotion method, including:
  • an information promotion device including:
  • an acquisition module configured to acquire a first image, where the first image is an image captured by a shooting device corresponding to a display screen in a shopping mall;
  • a detection module configured to detect whether a trademark exists in the first image
  • an information determination module configured to determine the characteristic information of at least one first trademark in the first image if a trademark is detected in the first image
  • a search module for searching a second trademark in the trademark information database that matches the feature information of each first trademark
  • a display module configured to display promotion information corresponding to the second trademark on the display screen, where the promotion information includes location information of commodities corresponding to the second trademark in the mall.
  • an information promotion device including a processor, a memory, and a communication interface, where the processor, memory, and communication interface are connected to each other, wherein the communication interface is used to input or output data, the The memory is used to store program code, and the processor is used to call the program code to perform the method of the first aspect.
  • a computer storage medium stores a computer program
  • the computer program includes program instructions
  • the program instructions when executed by a processor, perform the method of the first aspect described above .
  • each first trademark in the first image is determined Search for the second trademark matching the characteristic information of each first trademark in the trademark information database, and display the promotion information corresponding to the second trademark on the display screen of the mall.
  • the promotion information corresponding to the detected trademark is convenient for consumers to view and find the position of the product corresponding to the trademark in the mall.
  • the first image is an image captured by a shooting device corresponding to the display screen in the mall, the image is displayed at the exhibition
  • the possibility of the consumer’s image near the screen is high, so the detected trademark is likely to be a trademark on the consumer’s clothing or other parts (such as a backpack), and the consumer’s dress can be used to feedback the user to a certain extent.
  • display promotional information that matches the trademark
  • It is equivalent to displaying the promotion information of the products that the user likes, and plays the role of targeted promotion information, that is, promotion according to the user's preference. Since promotion is based on the user's preference, accurate delivery of the promotion information can be achieved, which can make The effect of promotion is better.
  • FIG. 1 is a schematic diagram of a scenario provided by an embodiment of the present invention.
  • FIG. 2 is a schematic flowchart of an information promotion method provided by an embodiment of the present invention.
  • FIG. 3 is a schematic structural diagram of detecting a brand name according to an embodiment of the present invention.
  • FIG. 4 is a schematic flowchart of another information promotion method provided by an embodiment of the present invention.
  • FIG. 5 is a schematic diagram of a convolutional network structure in a target detection model provided by an embodiment of the present invention.
  • FIG. 6 is a schematic diagram showing the relationship between the target convolution feature map and the target convolution feature sub-map provided by an embodiment of the present invention
  • FIG. 7 is a schematic diagram of a mapping relationship between a target convolution feature map and a first image provided by an embodiment of the present invention.
  • FIG. 8 is a schematic diagram of the relationship between three target convolution feature maps and corresponding target convolution feature sub-graphs obtained after convolution processing based on a target detection model provided by an embodiment of the present invention
  • FIG. 9 is a schematic flowchart of yet another information promotion method provided by an embodiment of the present invention.
  • FIG. 10 is a schematic diagram of determining feature information of two first trademarks in a first image provided by an embodiment of the present invention.
  • FIG. 11 is a schematic diagram of determining regions corresponding to a plurality of first trademarks in a first image provided by an embodiment of the present invention.
  • FIG. 12 is a schematic structural diagram of an information promotion device according to an embodiment of the present invention.
  • FIG. 13 is a schematic structural diagram of another information promotion device provided by an embodiment of the present invention.
  • the technical solution of the present invention is suitable for detecting an image, thereby obtaining a corresponding trademark detection result, and searching and displaying a scene of promotional information corresponding to the trademark in the trademark information database according to the trademark detection result.
  • the shooting device 02 corresponding to the display screen 01 senses that there is a consumer directly opposite it, it periodically consumes the consumer directly opposite and within a preset distance Take pictures of different parts of the wearer’s clothing and send the captured images to the corresponding image processing device 03 for detection.
  • the trademark information database 04 is based on the detected trademark feature information Find the matching trademark, and determine the promotion information corresponding to the trademark, so that the display screen 01 displays the promotion information, so that consumers can better view the information corresponding to the trademark and find the product corresponding to the trademark in the mall.
  • the shooting device may be a camera on the display screen, or may be another shooting device corresponding to the display screen, for example, a device with a photographing function, such as a personal computer, laptop computer, smart phone, tablet computer, and portable wearable device, period It can be any value such as 0.1s, 0.3s, 0.5s, etc.
  • the preset distance can be any value such as 0.5m, lm, 1.5m.
  • FIG. 2 and FIG. 2 is a schematic flowchart of an information promotion method provided by an embodiment of the present invention. As shown in the figure, the method includes the following steps:
  • S101 Acquire a first image, where the first image is an image captured by a shooting device corresponding to a display screen in a shopping mall.
  • the photographing device photographs different parts of the consumer's clothing, backpack, etc. to obtain the first image.
  • the acquired first image may or may not include a trademark.
  • the acquired image may include a trademark and other decorative graphics or background graphics other than the trademark, as shown in FIG. 3, the "cloud” and “rain” shaped images are decorative backgrounds on the article pattern.
  • the shooting device may be a camera above the display screen, as shown by 02 in FIG. 1, or other equipment with a photographing function.
  • FIG. 3 is a schematic structural diagram of detecting a brand name provided by an embodiment of the present invention
  • the first image may be adjusted to a preset size to detect the first image Whether there is a trademark in.
  • the size of the first image is larger than the preset size
  • the The first image is processed to reduce the first image to the same size as the preset size; in the case that the size of the first image is smaller than the preset size, the first image is processed by upsampling to make The first image is enlarged to the same size as the preset size.
  • the preset size may be an integer multiple of 32*32, for example, the preset size may be 416*416
  • S103 In a case where a trademark is detected in the first image, determine feature information of at least one first trademark in the first image.
  • the subsequent steps are not performed; when the trademark is detected in the first image, at least one of the first image is determined Characteristic information of the first trademark.
  • the first trademark is a trademark existing in the first image, and the number of the first trademark may be plural.
  • the characteristic information of the first trademark includes the trademark name of the first trademark and the position coordinates of the first trademark in the first image.
  • the characteristic information of the first trademark in the first image includes the trademark names of the two first trademarks and the position coordinates of the two first trademarks in the first image.
  • the characteristic information in the first image determined is the trademark name "NIKE" and the area 10c corresponding to the position coordinates of the trademark in the first image
  • the matching information is searched for in the trademark information database according to the feature information of each first trademark.
  • Two trademarks, and the feature information of each first trademark may match multiple second trademarks in the trademark information database, then there may be multiple second trademarks.
  • Finding the second trademark matching the characteristic information of each first trademark in the trademark information database may include the following situations:
  • the trademark information database is searched for a trademark pattern whose similarity with the target image is higher than the similarity threshold, and The trademark pattern whose similarity is higher than the similarity threshold is determined as the second trademark.
  • the target image corresponding to the position coordinates in the image searches the trademark information database for a trademark pattern whose similarity to the target image is higher than the similarity threshold, and determines the trademark pattern as the second trademark.
  • the similarity threshold may be any value such as 80%, 85%, and 90%.
  • the trademark information database is used to store trademark information
  • the trademark information may include a trademark name and a trademark pattern corresponding to the trademark
  • the second trademark is a variety of trademarks stored in the trademark information database. It can be understood that the matching here refers to the same or the similarity reaching the similarity threshold can be considered as a match between the two, the similarity threshold can be any value such as 80%, 85%, 90%.
  • the promotion information corresponding to the second trademark in the trademark information database is determined.
  • the promotion information may include the position information of the commodity corresponding to the second trademark in the shopping mall.
  • the location information of the product corresponding to the second trademark in the shopping mall may specifically be: A rectangular coordinate system is established using the current terminal location as the coordinate origin, and the location information of the trademark in the rectangular coordinate system corresponds to the product corresponding to the second trademark. Location information in the mall.
  • the promotion information may also include the trademark name of the second trademark, where the trademark name may be a Chinese name or an English name, and an abbreviation, etc. For example, the name of "NIKE" may include "NIKE" and "Nike".
  • the promotion information may also include the trademark logo of the second trademark.
  • the color of the trademark logo may be black and white, or it may be color.
  • the pattern may be a flat graphic or a 3D graphic.
  • the promotion information may also include the trademark introduction of the second trademark.
  • the trademark introduction may be the basic information of the trademark, such as the source of the trademark, the moral of the trademark, the applicable group of the trademark, the price of the goods corresponding to the trademark, and the types of goods included in the trademark.
  • S105 Display promotion information corresponding to the second trademark on a display screen in a shopping mall, where the promotion information includes location information of commodities corresponding to the second trademark in the mall.
  • Step S104 determines the promotion information corresponding to the second trademark.
  • the priority may be set according to the order of acquiring the promotion information corresponding to each second trademark, so that the promotion information corresponding to a plurality of second trademarks may be switched and displayed in order.
  • the corresponding promotional information is also two, and the acquisition time of the promotional information corresponding to the second trademark A is at Before the promotion information corresponding to the second trademark B, priority is given to displaying the promotion information corresponding to the second trademark A within the time threshold.
  • the time threshold can be any value such as 3S, 5S, and 10S.
  • the promotion information corresponding to multiple second trademarks may be displayed on the display screen of the mall at the same time, for example, the display screen is divided into multiple areas, and the number of the areas corresponds to the number of promotion information corresponding to the second trademark Similarly, the area may be a plurality of areas with different sizes, so as to display promotion information corresponding to multiple second trademarks at the same time.
  • the present invention by detecting the acquired first image, it is detected whether there is a trademark in the first image, and when a trademark is detected in the image, at least one first trademark in the first image is determined Search for the second trademark matching the characteristic information of each first trademark in the trademark information database, and display the promotion information corresponding to the second trademark on the display screen of the mall.
  • the promotion information corresponding to the detected trademark is convenient for consumers to view and find the position of the product corresponding to the trademark in the mall.
  • the first image is an image captured by a shooting device corresponding to the display screen in the mall, the image is displayed at the exhibition
  • the possibility of the consumer’s image near the screen is high, so the detected trademark is likely to be a trademark on the consumer’s clothing or other parts (such as a backpack), and the consumer’s dress can be used to feedback the user to a certain extent.
  • displaying promotional information matching the trademark is equivalent to displaying the promotional information of the product that the user likes, and plays the role of targeted promotional information, that is, promotion according to the user's preferences, because it is based on the user's preferences Promoting can achieve accurate delivery of promotional information, which can make the promotion effect better.
  • the target detection model may be used to detect whether a trademark exists in the first image and determine the feature information of at least one first trademark in the first image, where the target detection model may be regression-based target detection
  • the target detection model of the algorithm may also be a target detection model based on other algorithms, and the specific algorithm adopted by the target detection model is not limited herein.
  • FIG. 4 is a schematic flowchart of an information promotion method provided by an embodiment of the present invention. As shown in the figure, the method includes:
  • S201 Perform a convolution process on the first image based on the convolution layer in the target detection model to obtain a target convolution Feature map.
  • the target convolution feature map includes multiple target convolution feature subgraphs.
  • the target detection model predicts the category and offset corresponding to the position frame (which position in the figure corresponds to the position frame) by using a convolution kernel on the feature map.
  • the structure of the convolutional network in the target detection model includes multiple general convolutional layers, convolutional feature layers, residual units, upsampling units, and fusion units. It can be understood that in addition to the residual units, In addition to the convolutional layer included in the upsampling unit, the convolutional feature layer, and the fusion unit, the other convolutional layers can all be general convolutional layers.
  • the general convolutional layer is a convolution that only plays a convolutional role in the convolutional network.
  • the convolutional feature layer is used to generate a target convolutional feature image for image detection.
  • Each convolutional layer corresponds to a convolution kernel of a different size.
  • the convolution kernels of different sizes are used to convolve the image to obtain convolution maps of different sizes.
  • Convolution kernels of different sizes correspond to different multiple prior frames.
  • the a priori frame corresponding to the convolution kernel performs prediction processing on the convolution graph corresponding to the convolution kernel, and multiple bounding boxes can be obtained.
  • the convolutional layer in the target detection model performs convolution processing on the first image to obtain multiple target convolution feature maps of different sizes
  • the multiple target convolution feature maps of different sizes specifically refer to:
  • the number of target convolution feature maps with size 64*64 is less than the number of target convolution feature maps with size 32*32, and the number of target convolution feature maps with size 32*32 is less than size 16*
  • the number of target convolutional feature maps of 16 is less than the number of target convolutional feature maps of size 8*8...
  • the size of the first image may be adjusted to the size of the input image corresponding to the target detection model (the size may be 416*416), and the size of the image is the size of the input image corresponding to the target detection model. Then input the image into the convolutional network of the target detection model, use the image as the input of the first convolutional layer in the convolutional network, and check it in sequence through the convolution corresponding to the convolutional layer in the convolutional network The output result of a convolutional layer is subjected to convolution processing, and then the output result of the convolutional feature layer in the convolutional network is determined as a plurality of target convolutional feature maps with different sizes.
  • the convolution processing of the convolution layer corresponding to the convolution layer on the result of the previous convolution layer specifically means that the matrix corresponding to the convolution kernel is multiplied by the matrix output by the previous convolution layer, and the convolution layer
  • the corresponding convolution kernel performs convolution processing on the result output by the previous convolution layer.
  • the result obtained is a matrix of the size corresponding to the convolution layer.
  • the corresponding image is the convolution map corresponding to the convolution layer.
  • FIG. 5 is a convolution network structure in the target detection model provided by the embodiment of the present invention.
  • the schematic diagram of the target detection model includes multiple general convolutional layers, three convolutional feature layers, five residual units, two upsampling units, and two fusion units.
  • the first image is resized and input into a convolutional network, and the first image is convoluted by a convolution kernel in a general convolution layer to obtain a general convolution map, and the convolution in the first residual unit
  • the layer performs a convolution process on the general convolution map to obtain a first convolution map, and performs a convolution process on the first convolution map through the convolution layer in the second residual unit to obtain a second convolution map, and a third convolution map.
  • the convolutional layer in the residual unit performs a convolution process on the second convolution map to obtain a third convolution map
  • the convolutional layer in the fourth residual unit performs a convolution process on the third convolution map to obtain a third convolution map.
  • the fifth convolutional graph is convoluted by the convolutional layer in the fifth residual unit to obtain a fifth convolutional graph.
  • the seventh convolutional graph is upsampled by an upsampling unit to obtain a second upsampling convolutional graph
  • the second upsampling convolutional graph and the third convolutional graph are fused by a fusion unit to obtain a second Fusion convolution map, convolution processing of the second
  • the number of the sixth convolution map, the seventh convolution map, and the eighth convolution map are multiple, and the number of the sixth convolution map, the seventh convolution map, and the eighth convolution map are in order Reduced, that is, the number of sixth convolution maps is greater than the number of seventh convolution maps, and the number of seventh convolution maps is greater than the eighth convolutional layer. That is, the target convolution feature maps obtained after processing are three different sizes of convolution maps, and the size and number of each convolution map are different.
  • there are multiple sixth convolution maps, and each sixth convolution map has the same size; there are multiple seventh convolution maps, each seventh convolution map has the same size; and there are multiple eighth convolution maps, Each eighth convolution graph has the same size.
  • One of the residual units processes the image as follows. Taking the first residual unit as an example, the input image is convoluted by the convolution kernel corresponding to the first convolution layer in the first residual unit , Get first Convolution sub-graph, the first convolution sub-layer convolution corresponding to the first convolution sub-convolution of the first convolution sub-graph to obtain a second convolution sub-graph, the first convolution sub-graph and The second convolution subgraph performs residual learning to obtain a first convolution graph corresponding to the first residual unit.
  • the convolution network may also It includes more general convolutional layers, convolutional feature layers, residual units, upsampling units and more fusion units.
  • the target convolution feature submap refers to the feature unit included in the target convolution feature map, for example, the size of the target convolution feature map is 3*3, then the target convolution feature map may be as shown in FIG. 6, 6 is a schematic diagram of the relationship between a target convolution feature map and a target convolution feature sub-map provided by an embodiment of the present invention.
  • the target convolution feature map includes a total of 9 feature units, and each feature unit is a target convolution feature map A cell is numbered 1-9, that is, the target convolution feature map contains 9 target convolution feature submaps.
  • S202 Determine target convolution feature information corresponding to each target convolution feature submap in multiple target convolution feature submaps, respectively.
  • the target convolution feature information corresponding to each target convolution feature submap refers to: using the a priori box corresponding to the target convolution feature map as a bounding box to use the target convolution feature sub in the target convolution feature map
  • the picture is the content circled in the center.
  • the convolution feature map is shown in FIG. 6.
  • FIG. 6 is a schematic diagram of the relationship between the target convolution feature map and the target convolution feature sub-map provided by an embodiment of the present invention.
  • the target convolution feature information corresponding to the target convolution feature sub-picture 9 is the content of the target convolution feature map corresponding to the three dashed boxes with different sizes in FIG. 6.
  • the a priori box corresponding to the target convolution feature map may be used as a bounding box to determine the information within the bounding box corresponding to each target convolution feature submap in the target convolution feature map, and the bounding box The information in is determined as the target convolution feature information of the target convolution feature submap corresponding to the bounding box, thereby determining the target convolution feature information corresponding to the target convolution feature map.
  • the target convolution feature information corresponding to the target convolution feature map may be:
  • the prior frame corresponding to the convolution feature of size 3*3 is used as the boundary Box, centering the bounding box on feature unit 1 to determine the information corresponding to the bounding box, and determining the information corresponding to the bounding box as the target convolution feature information corresponding to feature unit 1, thereby determining the three boundaries corresponding to feature unit 1
  • the information corresponding to the box is the three target convolution feature information; the boundary box is centered on the feature unit 2 to determine the boundary box Corresponding information, the information corresponding to the bounding box is determined as the target convolution feature information corresponding to the feature unit 2, thereby determining that the information corresponding to the three bounding boxes corresponding to the feature unit 2 is the three target convolution feature information;...
  • the boundary box is centered on the feature unit 9 to determine the information corresponding to the boundary box, and the information corresponding to the boundary box is determined as the target convolution feature information corresponding to the feature unit 9, thereby determining the three corresponding to the feature unit 9
  • the information corresponding to each bounding box is three target convolution feature information; finally, the target convolution feature information corresponding to feature unit 1 ⁇ feature unit 9 is determined as the target convolution feature information corresponding to the target convolution feature map, and 27 can be obtained Target convolution feature information.
  • S203 separately determining the matching probability between each target convolution feature information and multiple trademark names in the target detection model, and, the position coordinates corresponding to each target convolution feature information, and respectively in the first image
  • the area corresponding to the position coordinates corresponding to each target convolution feature information is used as the first area corresponding to each target convolution feature information.
  • the matching probability between each target convolution feature information and the trademark graphics corresponding to multiple trademark names in the target detection model can be calculated by a classifier in the target detection model, so as to determine each target volume separately
  • the matching probability between the product feature information and the trademark graphics corresponding to various trademark names in the target detection model is the same as the number of trademark graphics corresponding to the trademark name in the target detection model.
  • each target convolution feature information is matched with a trademark graphic corresponding to each trademark name in the target detection model, each target convolution feature information corresponds to multiple matching probabilities, and the matching probability is
  • the target convolution feature information is the possibility of trademark graphics corresponding to various trademark names in the target detection model.
  • the position coordinates corresponding to each target convolution feature information refer to the position coordinates corresponding to the time when each bounding box corresponding to each target convolution feature information is mapped back to the first image, each target convolution feature information corresponds to four Position coordinates, these four position coordinates correspond to the four vertices of the bounding box, and the coordinates of the four points obtained by mapping the four vertices of the bounding box back to the original image are the position coordinates corresponding to the target convolution feature information, by This can obtain the position coordinates corresponding to the convolution feature information of each target.
  • each target convolution feature map is obtained from the first image through size adjustment and convolution processing, there is a corresponding relationship between each point in each target convolution feature map and a point or area in the first image, According to this correspondence, the position coordinates of each boundary box corresponding to four points in the first image can be determined, and then each The position coordinates of the four points corresponding to the bounding boxes in the first image are determined as the position coordinates corresponding to the respective target convolution feature information corresponding to each bounding box, and the area formed by the points corresponding to the position coordinates is determined as each target volume The first region corresponding to the product feature information, therefore, it can be obtained that each target convolution feature information corresponds to each first region in the first image.
  • FIG. 7 is a schematic diagram of the mapping relationship between the target convolution feature map and the first image provided by an embodiment of the present invention.
  • the four vertices of the bounding box are al, a2, a3, and a4 respectively, and the four vertices correspond to the points bl, b2, b3, and b4 when mapped back to the first image, and the position coordinates of bl in the first image are (bl l, bl2), b2's position coordinates in the first image are (b21, b22), b3's position coordinates in the first image are (b31, b34), b4's position coordinates in the first image are (b41, b44), then the position coordinates of bl (bl l, bl 2), b2 (b21, b22), b3 (b31, b32) and
  • the position coordinates corresponding to each target convolution feature information can be determined according to the mapping relationship between the target convolution feature map corresponding to the target convolution feature information and the first image, that is, the position coordinates can be determined Corresponding points in the first image, the area combined by the points is the first area.
  • S204 Determine the maximum matching probability corresponding to each target convolution feature information according to the matching probability between each target convolution feature information and multiple trademark names in the target detection model, respectively, and corresponding each target convolution feature information
  • the maximum matching probability of is determined as the confidence of the first region corresponding to each target convolution feature information
  • the trademark name corresponding to the maximum matching probability corresponding to each target convolution feature information is used as the corresponding to each target convolution feature information
  • the brand name corresponding to the first area The brand name corresponding to the first area.
  • each target convolution feature in the convolution feature information set The information corresponds to multiple matching probabilities, and the maximum matching probability among the multiple matching probabilities corresponding to each target convolution feature information is determined as the confidence of the first region corresponding to each target convolution feature information, then each target volume can be obtained The confidence of the first region corresponding to the product feature information, and using the brand name corresponding to the maximum matching probability corresponding to each target convolution feature information as the brand name corresponding to the first region corresponding to each target convolution feature information.
  • the confidence of the first area is the largest matching probability among the matching probabilities between the image corresponding to the first area and the trademark graphics corresponding to multiple trademark names in the target detection model.
  • the matching probability of the target convolution feature information A and the trademark pattern corresponding to the trademark name 1 in the target detection model is determined 0.80; the matching probability of the target convolution feature information A and the trademark graphic corresponding to the trademark name 2 in the target detection model is 0.20; the matching probability of the target convolution feature information A and the trademark graphic corresponding to the trademark name 3 in the target detection model Is 0.15, then the trademark pattern corresponding to the trademark name 1 in the target detection model of the target convolution feature information A has the maximum matching probability, and the maximum matching probability is 0.80. Then the maximum matching probability is the confidence of the first area, the confidence is 0.80, and the trademark name corresponding to the first area is the trademark name 1.
  • FIG. 8 is obtained after convolution processing of a target detection model provided by an embodiment of the present invention.
  • a small square in C1 is a target convolution feature subgraph corresponding to the target convolution feature map in B1
  • a small square in C2 is a target convolution feature submap corresponding to the target convolution feature map in B2
  • a small square in C3 is a target convolution feature submap corresponding to the target convolution feature map in B3.
  • each small square in C1 represents a target convolution feature submap
  • each target Convolutional feature submap corresponds to 3 target convolutional feature information (that is, a small square in C1 corresponds to 3 target convolutional feature information, and a target convolutional feature information corresponds to a bounding box)
  • the number 1 in B1 is
  • the target convolution feature map numbered 2 in B1 corresponds to 27 target convolution feature information
  • the target convolution feature map numbered 10 in the map corresponds to 27 target convolution feature information.
  • the 1890 target convolution feature information is target convolution feature information 1, ..., target convolution feature information 1890.
  • the 50 matching probabilities corresponding to the target convolution feature information 2 and the bounding box corresponding to the target convolution feature information 2 can be mapped back to the first region 2 in the first image, ..., the target can be determined
  • the 50 matching probabilities corresponding to the convolution feature information 1890 and the bounding box corresponding to the target convolution feature information 1890 are mapped back to the first region 1890 in the first image.
  • the maximum matching probability among the 50 matching probabilities corresponding to the target convolution feature information 1 is determined as the confidence of the first region 1, and the maximum matching probability is determined
  • the corresponding brand name is used as the brand name corresponding to the first region 1 to obtain the confidence of the first region 1 corresponding to the target convolution feature information 1 and the brand name corresponding to the first region 1.
  • the confidence of the first region 2 corresponding to the target convolution feature information 2 and the brand name corresponding to the first region 2 can be obtained, ..., the first region corresponding to the target convolution feature information 1890 can be obtained The confidence of 1890 and the brand name corresponding to the first area 1890.
  • 1890 first regions can be obtained, each of which has a corresponding confidence level and a corresponding trademark name.
  • S205 Determine whether the confidence of each first area is greater than a first threshold.
  • the confidence of the first area is the largest matching probability among the matching probabilities between the image corresponding to the first area and the trademark graphics corresponding to multiple trademark names in the target detection model, the confidence of the first area is greater than
  • the first threshold value indicates that the image corresponding to the first area matches the trademark graphics corresponding to multiple trademark names in the target detection model with a high probability of matching, that is, the image corresponding to the first area corresponds to multiple trademark names in the target detection model. Is more likely to have a trademark graphic; the confidence of the first area is less than the first threshold, indicating that the image corresponding to the first area is less likely to be a trademark graphic corresponding to multiple trademark names in the target detection model.
  • the first threshold is 0.60, when there is at least one first region in the 1890 first regions When the confidence is greater than or equal to 0.60, it means that there is a trademark in the first image. When the confidence of each of the 1890 first regions is less than 0.60, it means that there is no trademark in the first image.
  • the first threshold may be any value such as 0.60, 0.75, 0.85, etc. So far, it is detected whether the trademark exists in the first image.
  • the image is detected by a target detection model
  • the convolutional network structure in the target detection model includes a residual unit, an upsampling unit, a fusion unit, a convolution feature layer, and a general convolution layer
  • the residual unit can enable the convolutional network structure to converge in deep conditions.
  • the convolutional layer in the residual unit can compress the features corresponding to the image obtained by each convolution, reducing the amount of parameters and calculations in the model ;
  • the upsampling unit upsamples the image so that the network layer number is very good when the network layer is deep; the fusion unit enables the network to learn deep and shallow features at the same time, so that the convolutional image corresponds to more features; volume
  • the product feature layer outputs three convolutional images of different scales, so that three convolutional images of different scales can be detected, and whether a trademark exists in the image can be detected. Because the amount of parameters and calculations in the model are reduced during the detection process, the efficiency of trademark detection is improved, and the accuracy of trademark detection is improved by detecting three convolutional images of different scales.
  • the target before using the above target detection detection model to detect the first image to determine the position coordinates of the first trademark in the first image and the trademark name corresponding to the first trademark, the target may also be trained first Detection model.
  • first mark the training sample image including marking the position of the trademark, trademark name and other information.
  • use the marked sample images to train the initial model, and save the model when the model converges and reaches a certain accuracy (referring to the loss function value in the model is less than the loss threshold and the accuracy is greater than the accuracy threshold), the saved model is the target detection detection model.
  • FIG. 9 This is a schematic flowchart of an information promotion method provided by an embodiment of the present invention. The method includes:
  • S301 Determine a set consisting of a first region with a confidence greater than or equal to a first threshold as a first set.
  • each first region has a corresponding confidence level. Assuming that the first threshold is 0.60, there are 200 first regions in the 1890 first regions. If the confidence is greater than or equal to 0.60, then the first set is the set consisting of the 200 first regions.
  • the first set consists of ⁇ first area 1, first area 2, first area 3, and first area 4 ⁇ , and the confidence of each first area is 0.42, 0.55, 0.85, 0.95, respectively , Then the first area 4 with a confidence level of 0.95 is determined as the first target area.
  • S303 When the number of first regions in the first set is one, determine the first region in the first set as the first target region.
  • the first region 1 is the first target region.
  • S304 Determine the position coordinates corresponding to the first target area and the brand name corresponding to the first target area as feature information of a first trademark.
  • the position coordinates corresponding to the first area 1 and the trademark name corresponding to the first area 1 are determined as the characteristic information of the first trademark, that is, the first image Characteristic information of a first trademark.
  • an area where the first trademark in the first image is located in the first image is determined, which is the first target area, and by determining the position coordinates and trademark of the first target area
  • the name can determine the position coordinates of the trademark in the first image and the name of the trademark, that is, the characteristic information of the first trademark in the first image. In some possible cases, there may be in the first image
  • S305 Determine a set composed of the first area after excluding the first target area in the first set as the second set.
  • the second set is ⁇ first Area 1, first area 2, first area 3 ⁇ .
  • S306. Determine whether the degree of coincidence between each first area and the first target area in the second set is greater than or equal to a second threshold.
  • the degree of coincidence between each first area and the first target area in the second set may be determined by comparing the IOU, and the calculated degree of coincidence may be compared with the second threshold.
  • I0U is also called the cross-combination ratio, that is, the degree of coincidence of the images in the two regions, and calculating the I0U of the first region 1 and the first region 2 specifically refers to calculating the coincidence of the first region 1 and the first region 2 in the first image Degree, the greater the I0U, the greater the degree of coincidence between the two regions, the smaller the I0U, the smaller the degree of coincidence between the two regions.
  • the intersection ratio of the first area 1 and the first area 2 may be calculated according to the position coordinates of the first area 1 and the position coordinates of the first area 2.
  • the second threshold is a critical point for evaluating the degree of coincidence between two regions, and the second threshold may specifically be 0.60, 0.85, etc. When the I0U of the two regions is greater than the second threshold, it means that the overlap of the two regions is high.
  • the second target area of the second set is the first area with the highest confidence in the second set.
  • the second threshold is 0.65
  • the second set includes ⁇ first area A, first area B, first area C, first area D ⁇ , and the confidence levels are 0.62, 0.68, 0.75, 0.85, respectively.
  • a target area is the first area E.
  • the second target area of the second set It is the first area D.
  • the third target area of the third set is the first area with the highest confidence in the third set.
  • the second threshold is 0.65
  • the second set includes ⁇ first region F, first region G, first region H, first region I ⁇
  • the confidence levels are 0.65, 0.69, 0.78, 0.86, respectively.
  • a target area is the first area J, and the degree of coincidence of the first area F, the first area G, and the first area J is less than 0.65, and the degree of coincidence of the first area H, the first area I, and the first area J is greater than 0.65, Then the second area includes the first area H and the first area I, then the third set is a set of the first area after the second area is excluded from the second set, that is, the third set is ⁇ first area F, first Area G ⁇ , the third target area is the first area G in the third set.
  • S309 Determine the position coordinates corresponding to the second target area and the brand name corresponding to the second target area as feature information of another first trademark.
  • step S307 If it is determined in step S307 that the second target area of the second set is the first area D, then the position coordinates corresponding to the first area D and the trademark name corresponding to the first area D are determined as the characteristics of the second trademark information.
  • S310 Determine the position coordinates corresponding to the third target area and the brand name corresponding to the third target area as feature information of another first trademark.
  • the characteristic information of the other first trademark is the characteristic information of the second first trademark in the first image.
  • the third target area is determined to be the first area G
  • the position coordinates corresponding to the first area G and the trademark name corresponding to the first area G are determined as the characteristic information of another first trademark, that is, the second first trademark Characteristic information.
  • the second target area and the third target area determined in steps S308-S309 are the first image
  • the area corresponding to another first trademark in the first image that is, the second first trademark in the first image
  • determining the characteristic information of the second first trademark in the first image according to the target area may be as shown in FIG. 10,
  • FIG. 10 is a schematic diagram of determining feature information of two first trademarks in a first image provided by an embodiment of the present invention. Since there are two first trademarks in the figure, the characteristics of one first trademark (first trademark) The information may be the trademark name “NIKE” in FIG. 10 and the area 10e corresponding to the position coordinates of the trademark in the first image, and the characteristic information of another first trademark (second first trademark) may be the trademark in FIG. 10 An area 10d corresponding to the name "Xtep" and the position coordinates of the trademark in the first image.
  • steps S305-S310 an area where another first trademark (the second first trademark) in the first image is located in the first image is determined, and the area is the second target area, which is determined by The position coordinates and the brand name of the second target area can determine the characteristic information of another first trademark (second first trademark) in the first image.
  • steps S305-S310 can also refer to the specific implementation manner of steps S305-S310 to determine whether there are more first trademarks in the first image and to determine more feature information of the first trademark, until the first image is sequentially determined Characteristic information of all first trademarks. The following describes the process of sequentially determining the characteristic information of all the first trademarks in the first image by way of example.
  • FIG. 11 is a schematic diagram of determining regions corresponding to multiple first trademarks in a first image provided by an embodiment of the present invention.
  • the second threshold is 0.60
  • the determined first There are 9 first areas in the first set in the image (the dotted frame in the figure represents the first area), namely A, B, C, D, E, F,
  • the confidence levels are 0.60, 0.62, 0.65, 0.73, 0.95, 0.75, 0.65, 0.68, 0.60, then the first area E with the confidence level of 0.95 is determined as the first target area, and the second set is In the first set, the set consisting of the first area after excluding the first target area, that is, the second set consists of A, B, C, D, F,
  • the first target area E is an area where a first trademark in the first image is located, and then the area where the first first trademark in the first image is located is determined.
  • the degree of coincidence of A, B, C, D, F, G, H, I, and E in the second set is separately determined.
  • the second target area of the second set is F
  • the second target area F is the second in the first image (The other) the area where the first trademark is located, then the area where the second first trademark in the first image is located is determined;
  • A, B, C, D, F, G, H, I in the second set I0U of at least one of the first region and E is greater than or equal to 0.60, assuming that A,
  • the I0U of B, C, D, F, G, H, and E are greater than or equal to 0.60, then the second area includes A, B, C , D, F, G, H, then the third set is the set of the first area excluding the second area in the second set, that is, the third set is I, the third target area is I, and the third target area I is the area where the second first trademark in the first image is located, and then the area where the second first trademark in the first image is located is determined.
  • the second area includes A
  • the third set is the set after the second set excludes the second area, that is, the third set consists of F, G,
  • the third target area is the area with the highest confidence in the third set, namely F. Then, the area after excluding F in the third area is determined as the fourth set. Since there are multiple first areas in the fourth set, continue to judge the coincidence of G, H, I in the fourth set with the third target area F Degree, assuming that the coincidence of G, H, I, and F are all greater than 0.60, then the third area includes G, H, and I.
  • the fourth set is the empty set after excluding the third area, and the third target area F is the first The area where the third first trademark in the image is located, then the area where the third first trademark in the first image is located is determined. Finally, it is determined that the areas corresponding to the three first trademarks in the first image, that is, the first target area, the second target area, and the third target area, that is, there are three first trademarks in the first image.
  • the position coordinates and trademark names corresponding to the three target regions corresponding to the three first trademarks existing in the first image are determined as the characteristic information of the three first trademarks corresponding to the first image, that is, the first target
  • the position coordinates and trademark names corresponding to the area are determined as the characteristic information of the first first trademark
  • the position coordinates and trademark names corresponding to the second target area are determined as the characteristic information of the second first trademark
  • the third target area is corresponding
  • the position coordinates and trademark name are determined as the characteristic information of the third first trademark. So far, the characteristic information of all the first trademarks in the first image is determined.
  • the duplicate trademark at the same position is removed by comparing the overlapping degree, the area where the trademark is located in the image and the trademark name corresponding to the trademark are determined, and the overlap is compared by comparing
  • the degree method removes duplicate trademarks and improves the speed and accuracy of trademark detection.
  • FIG. 12 is a schematic structural diagram of an information promotion device provided by an embodiment of the present invention.
  • the device 50 includes:
  • an acquisition module 501 configured to acquire a first image, where the first image is an image captured by a photographing device corresponding to a display screen in a shopping mall;
  • the detection module 502 is configured to detect whether a trademark exists in the first image;
  • the information determining module 503 is configured to determine the characteristic information of at least one first trademark in the first image when it is detected that the trademark exists in the first image;
  • a search module 504 configured to search for a second trademark matching the characteristic information of each first trademark in the trademark information database
  • the display module 505 is configured to display promotion information corresponding to the second trademark on the display screen, where the promotion information includes position information of the commodity corresponding to the second trademark in the mall.
  • the characteristic information of each first trademark includes the trademark name of the first trademark and the position coordinates of the first trademark in the first image
  • the search module 504 is specifically configured to The name of the first trademark is searched for in the trademark information database with the same name as the first trademark, and the trademark with the same name as the first trademark is determined as the second trademark; or, based on A target image corresponding to the position coordinates of the first trademark in the first image, search for a trademark pattern in the trademark information database whose similarity to the target image is higher than a similarity threshold, and compare the The trademark pattern whose similarity of the target image is higher than the similarity threshold is determined as the second trademark.
  • the display module 505 is specifically configured to switch and display the plurality of second trademarks in order on a display screen of a shopping mall in the case of multiple second trademarks
  • the promotion information corresponding to the trademark, or, when there are multiple second trademarks, the promotion information corresponding to the multiple second trademarks are displayed simultaneously on the display screen of the mall.
  • the detection module 502 includes:
  • the processing unit 5021 is configured to perform convolution processing on the first image based on the convolution layer in the target detection model to obtain a target convolution feature map, where the target convolution feature map includes multiple target convolution feature sub Figure;
  • the determining unit 5022 is configured to determine target convolution feature information corresponding to each target convolution feature sub-graph in the multiple target convolution feature sub-graphs, respectively;
  • the determining unit 5022 is further configured to determine a matching probability between each target convolution feature information and a plurality of trademark names in the target detection model, and a position corresponding to each target convolution feature information Coordinates, and the area corresponding to the position coordinates corresponding to the respective target convolution feature information in the first image is used as the first area corresponding to the respective target convolution feature information;
  • the matching unit 5023 is configured to determine the maximum corresponding to each target convolution feature information according to the matching probability between each target convolution feature information and multiple trademark names in the target detection model, respectively Matching probability, respectively determining the maximum matching probability corresponding to each target convolution feature information as the confidence of the first region corresponding to each target convolution feature information, and respectively corresponding to each target convolution feature information
  • the brand name corresponding to the maximum matching probability is used as the brand name corresponding to the first region corresponding to each target convolution feature information;
  • the determining unit 5024 is configured to determine whether the confidence of each first area is greater than a first threshold
  • the judgment unit 5024 is further configured to detect the presence of a trademark in the first image when the confidence of at least one first area is greater than or equal to the first threshold;
  • the judging unit 5024 is further configured to detect that there is no trademark in the first image when the confidence of each first area is less than the first threshold.
  • the matching unit 5023 is specifically configured to:
  • [0142] determine a set consisting of a first region with a confidence greater than or equal to the first threshold as the first set;
  • the first region with the highest confidence in the first set is determined as the first target region
  • the matching unit 5023 is specifically configured to:
  • the degree of coincidence of at least one first area in the second set with the first target area is greater than or equal to the second threshold
  • the The first area where the degree of coincidence is greater than or equal to the second threshold is determined as the second area
  • the second set is excluded from the second set
  • the set consisting of the first area after the area is determined as the third set, and the third target area of the third set is determined;
  • the trademark name corresponding to the third target area is determined to be another first trademark.
  • the acquired first image is detected by a target detection model
  • the convolutional network structure in the target detection model includes a residual unit, an upsampling unit, a fusion unit, a convolution feature layer, and
  • the residual unit can make the convolutional network structure converge in a deep situation.
  • the convolutional layer in the residual unit can compress the features corresponding to the image obtained by each convolution, reducing the model.
  • the upsampling unit upsamples the image so that the network layer number is very well expressed in deep conditions; the fusion unit enables the network to learn deep and shallow features at the same time, so that the convolved image corresponds to More features; the convolutional feature layer outputs three convolutional images of different scales, so that three convolutional images of different scales can be detected, which improves the efficiency and accuracy of trademark detection.
  • the duplicate trademarks are removed by comparing the coincidences, the characteristic information of each first trademark in the first image is determined, and the trademark information database is searched for matching the characteristic information of each first trademark The second trademark, and display the promotion information corresponding to the second trademark on the display screen of the mall.
  • the promotion information By displaying the promotion information corresponding to the detected trademark on the mall display screen, it is convenient for consumers to view and find the products corresponding to the trademark in the mall In the location, since the first image is an image captured by a shooting device corresponding to the display screen in the mall, the image is likely to be an image of the consumer near the display screen, then the detected trademark is likely to be The trademark on the consumer's clothing or other parts (such as a backpack), and the consumer's clothing can be used to feedback the user's preference for the product to a certain extent, and display promotional information matching the trademark, which is equivalent to displaying the user's favorite product
  • the promotion information has played the role of targeted promotion information, that is, promotion according to the user's preference, because the promotion is based on the user's preference, can achieve accurate delivery of promotion information, which can make the promotion effect better.
  • FIG. 13 is a schematic structural diagram of an information promotion device provided by an embodiment of the present invention,
  • the device includes a processor 601, a memory 602, and a communication interface 603.
  • the processor 601 is connected to the memory 602 and the communication interface 603.
  • the processor 601 may be connected to the memory 602 and the communication interface 603 through a bus.
  • the processor 601 is configured to support the information promotion apparatus to perform the corresponding functions in the information promotion method described in FIGS. 2, 4, and 9.
  • the processor 601 may be a central processing unit (CPU), a network processor (NP), a hardware chip, or any combination thereof.
  • the hardware chip may be an application specific integrated circuit (ASIC), a programmable logic device (programmable logic device, PLD), or a combination thereof.
  • the above PLD may be a complex programmable logic device (CPLD), field-programmable gate array (field-programmable gate array)
  • the memory 602 is used for storing program codes and the like.
  • the memory 602 may include a volatile memory (vo latile memory, VM), such as a random access memory (random access memory, RAM)
  • VM volatile memory
  • RAM random access memory
  • the memory 602 may also include non-volatile memory (non-volatile memory, NVM), such as read-only memory (read-only memory, ROM), flash memory (flash memory), hard disk (hard disk drive, HDD) or Solid-state drive (solid-state drive, SSD);
  • NVM non-volatile memory
  • ROM read-only memory
  • flash memory flash memory
  • HDD hard disk drive
  • SSD Solid-state drive
  • the memory 602 may also include a combination of the aforementioned types of memory.
  • the communication interface 603 is used to send or receive data.
  • the processor 601 may call the program code to perform the following operations:
  • the promotion information corresponding to the second trademark is displayed on the display screen, and the promotion information includes location information of the commodity corresponding to the second trademark in the mall.
  • each operation may also correspond to the method shown in FIG. 2, FIG. 4, and FIG. 9.
  • the processor 601 can also cooperate with the communication interface 603 to perform other operations in the above method embodiments.

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Image Analysis (AREA)

Abstract

An information promotion method and a related apparatus, the method comprising: acquiring a first image, the first image being an image photographed by a photographic apparatus corresponding to a display screen in a shopping mall (S101); detecting whether a trademark is present in the first image (S102); when detecting that a trademark is present, determining feature information of at least one first trademark in the first image (S103); looking up a second trademark matching the feature information of each first trademark in a trademark information database (S104); and displaying promotion information corresponding to the second trademark on the display screen, the promotion information comprising position information of the product corresponding to the second trademark in the shopping mall (S105). The technical solution of the present method can solve the problem of the effect of information promotion not being good enough.

Description

信息推广方法和装置 技术领域 Information promotion method and device
[0001] 本发明涉及信息推广领域, 尤其涉及信息推广方法和装置。 [0001] The present invention relates to the field of information promotion, and in particular, to an information promotion method and device.
[0002] 本申请要求于 2018年 12月 21日提交中国专利局, 申请号为 201811579857.8、 发 明名称为“信息推广方法和装置”的中国专利申请的优先权, 其全部内容通过引用 结合在本申请中。 [0002] This application requires the priority of the Chinese patent application filed on December 21, 2018, with the application number 201811579857.8 and the invention titled "Information Promotion Method and Device", the entire contents of which are incorporated by reference in this application in.
背景技术 Background technique
[0003] 商标是商品的生产者、 经营者在其生产、 制造、 加工、 拣选或者经销的商品上 或者服务的提供者在其提供的服务上采用的, 用于区别商品或服务来源, 包括 文字、 图形、 字母、 数字、 三维标志、 颜色组合和声音等, 以及上述要素的组 合, 具有显著特征的标志。 [0003] Trademarks are used by producers and operators of commodities on their production, manufacture, processing, selection or distribution of commodities or service providers on the services they provide, and are used to distinguish the source of commodities or services, including text , Graphics, letters, numbers, three-dimensional signs, color combinations and sounds, as well as the combination of the above-mentioned elements, signs with distinctive characteristics.
[0004] 目前商场中展示的关于商标的推广信息一般为固定的, 每个消费者看到的推广 信息都相同, 由于消费者的兴趣爱好差异较大, 所以很多消费者对固定播放的 推广信息可能不感兴趣, 导致信息推广效果比较差。 [0004] At present, the promotion information about trademarks displayed in shopping malls is generally fixed, and the promotion information seen by each consumer is the same. Because consumers have different interests and hobbies, many consumers promote the fixed broadcast promotion information. May not be interested, resulting in poor information promotion.
发明概述 Summary of the invention
技术问题 technical problem
问题的解决方案 Solution to the problem
技术解决方案 Technical solution
[0005] 本发明实施例提供信息推广方法和装置, 解决信息推广效果不好的问题。 [0005] Embodiments of the present invention provide information promotion methods and devices to solve the problem of poor information promotion effects.
[0006] 第一方面, 提供信息推广方法, 包括: [0006] The first aspect provides an information promotion method, including:
[0007] 获取第一图像, 所述第一图像为商场中的展示屏对应的拍摄装置拍摄到的图像 [0007] acquiring a first image, where the first image is an image captured by a shooting device corresponding to a display screen in a shopping mall
[0008] 检测所述第一图像中是否存在商标; [0008] detecting whether a trademark exists in the first image;
[0009] 在检测到所述第一图像中存在商标的情况下, 确定所述第一图像中至少一个第 一商标的特征信息; [0009] in the case where a trademark is detected in the first image, determining feature information of at least one first trademark in the first image;
[0010] 在商标信息库中查找与每个第一商标的特征信息匹配的第二商标; [0011] 在所述展示屏中显示所述第二商标对应的推广信息, 所述推广信息包括所述第 二商标对应的商品在所述商场中的位置信息。 [0010] Find a second trademark matching the characteristic information of each first trademark in the trademark information database; [0011] The promotion information corresponding to the second trademark is displayed on the display screen, and the promotion information includes location information of commodities corresponding to the second trademark in the mall.
[0012] 第二方面, 提供一种信息推广装置, 包括: [0012] In a second aspect, an information promotion device is provided, including:
[0013] 获取模块, 用于获取第一图像, 所述第一图像为商场中的展示屏对应的拍摄装 置拍摄到的图像; [0013] an acquisition module, configured to acquire a first image, where the first image is an image captured by a shooting device corresponding to a display screen in a shopping mall;
[0014] 检测模块, 用于检测所述第一图像中是否存在商标; [0014] a detection module, configured to detect whether a trademark exists in the first image;
[0015] 信息确定模块, 用于在检测到所述第一图像中存在商标的情况下, 确定所述第 一图像中至少一个第一商标的特征信息; [0015] an information determination module, configured to determine the characteristic information of at least one first trademark in the first image if a trademark is detected in the first image;
[0016] 查找模块, 用于在商标信息库中查找与每个第一商标的特征信息匹配的第二商 标; [0016] a search module for searching a second trademark in the trademark information database that matches the feature information of each first trademark;
[0017] 显示模块, 用于在所述展示屏中显示所述第二商标对应的推广信息, 所述推广 信息包括所述第二商标对应的商品在所述商场中的位置信息。 [0017] a display module, configured to display promotion information corresponding to the second trademark on the display screen, where the promotion information includes location information of commodities corresponding to the second trademark in the mall.
[0018] 第三方面, 提供一种信息推广装置, 包括处理器、 存储器以及通信接口, 所述 处理器、 存储器和通信接口相互连接, 其中, 所述通信接口用于输入或输出数 据, 所述存储器用于存储程序代码, 所述处理器用于调用所述程序代码, 以执 行上述第一方面的方法。 [0018] In a third aspect, an information promotion device is provided, including a processor, a memory, and a communication interface, where the processor, memory, and communication interface are connected to each other, wherein the communication interface is used to input or output data, the The memory is used to store program code, and the processor is used to call the program code to perform the method of the first aspect.
[0019] 第四方面, 提供一种计算机存储介质, 所述计算机存储介质存储有计算机程序 , 所述计算机程序包括程序指令, 所述程序指令当被处理器执行时, 执行上述 第一方面的方法。 [0019] In a fourth aspect, a computer storage medium is provided, where the computer storage medium stores a computer program, the computer program includes program instructions, and the program instructions, when executed by a processor, perform the method of the first aspect described above .
[0020] 本发明实施例中, 通过对获取到的第一图像进行检测, 检测第一图像中是否存 在商标, 在检测到图像中存在商标的情况下, 确定第一图像中每个第一商标的 特征信息, 在商标信息库中查找与每个第一商标的特征信息匹配的第二商标, 并在商场的展示屏中显示与第二商标对应的推广信息, 通过在商场展示屏上显 示与检测到的商标对应的推广信息, 便于消费者观看并查找商标对应的商品在 商场中的位置, 由于第一图像为商场中的展示屏对应的拍摄装置拍摄到的图像 , 该图像为在该展示屏附近的消费者的图像的可能性较高, 那么检测到的商标 很可能为该消费者服装或者其他部位 (如背包) 上的商标, 而消费者的着装在 一定程度上可以用于反馈用户对商品的喜好, 显示与商标相匹配的推广信息, 相当于是显示用户喜欢的商品的推广信息, 起到了针对性地投放推广信息的作 用, 即根据用户的喜好进行推广, 由于是根据用户的喜好进行推广, 可以实现 推广信息的精准投放, 从而可以使得推广的效果更好。 [0020] In the embodiment of the present invention, by detecting the acquired first image, it is detected whether there is a trademark in the first image, and when a trademark is detected in the image, each first trademark in the first image is determined Search for the second trademark matching the characteristic information of each first trademark in the trademark information database, and display the promotion information corresponding to the second trademark on the display screen of the mall. The promotion information corresponding to the detected trademark is convenient for consumers to view and find the position of the product corresponding to the trademark in the mall. Since the first image is an image captured by a shooting device corresponding to the display screen in the mall, the image is displayed at the exhibition The possibility of the consumer’s image near the screen is high, so the detected trademark is likely to be a trademark on the consumer’s clothing or other parts (such as a backpack), and the consumer’s dress can be used to feedback the user to a certain extent. For product preferences, display promotional information that matches the trademark, It is equivalent to displaying the promotion information of the products that the user likes, and plays the role of targeted promotion information, that is, promotion according to the user's preference. Since promotion is based on the user's preference, accurate delivery of the promotion information can be achieved, which can make The effect of promotion is better.
发明的有益效果 Beneficial effects of invention
对附图的简要说明 Brief description of the drawings
附图说明 BRIEF DESCRIPTION
[0021] 为了更清楚地说明本发明实施例中的技术方案, 下面将对实施例中所需要使用 的附图作简单地介绍, 显而易见地, 下面描述中的附图仅仅是本发明的一些实 施例, 对于本领域普通技术人员来讲, 在不付出创造性劳动的前提下, 还可以 根据这些附图获得其他的附图。 [0021] In order to more clearly explain the technical solutions in the embodiments of the present invention, the following will briefly introduce the drawings required in the embodiments. Obviously, the drawings in the following description are only some implementations of the present invention For example, for those of ordinary skill in the art, without paying any creative labor, other drawings can also be obtained based on these drawings.
[0022] 图 1是本发明实施例提供的一种场景示意图; [0022] FIG. 1 is a schematic diagram of a scenario provided by an embodiment of the present invention;
[0023] 图 2本发明实施例提供的一种信息推广方法的流程示意图; [0023] FIG. 2 is a schematic flowchart of an information promotion method provided by an embodiment of the present invention;
[0024] 图 3是本发明实施例提供的一种检测商标名称的结构示意图; [0024] FIG. 3 is a schematic structural diagram of detecting a brand name according to an embodiment of the present invention;
[0025] 图 4是本发明实施例提供的另一种信息推广方法的流程示意图; [0025] FIG. 4 is a schematic flowchart of another information promotion method provided by an embodiment of the present invention;
[0026] 图 5是本发明实施例提供的目标检测模型中的卷积网络结构的示意图; [0026] FIG. 5 is a schematic diagram of a convolutional network structure in a target detection model provided by an embodiment of the present invention;
[0027] 图 6是本发明实施例提供的目标卷积特征图与目标卷积特征子图之间的关系示 意图; [0027] FIG. 6 is a schematic diagram showing the relationship between the target convolution feature map and the target convolution feature sub-map provided by an embodiment of the present invention;
[0028] 图 7是本发明实施例提供的目标卷积特征图与第一图像之间的映射关系的示意 图; [0028] FIG. 7 is a schematic diagram of a mapping relationship between a target convolution feature map and a first image provided by an embodiment of the present invention;
[0029] 图 8是本发明实施例提供的一种基于目标检测模型卷积处理后得到的三种目标 卷积特征图与对应的目标卷积特征子图之间的关系示意图; [0029] FIG. 8 is a schematic diagram of the relationship between three target convolution feature maps and corresponding target convolution feature sub-graphs obtained after convolution processing based on a target detection model provided by an embodiment of the present invention;
[0030] 图 9是本发明实施例提供的又一种信息推广方法的流程示意图; [0030] FIG. 9 is a schematic flowchart of yet another information promotion method provided by an embodiment of the present invention;
[0031] 图 10是本发明实施例提供的一种确定出第一图像中的两个第一商标的特征信息 的示意图; [0031] FIG. 10 is a schematic diagram of determining feature information of two first trademarks in a first image provided by an embodiment of the present invention;
[0032] 图 11是本发明实施例提供的一种在第一图像中确定出多个第一商标对应的区域 的示意图; [0032] FIG. 11 is a schematic diagram of determining regions corresponding to a plurality of first trademarks in a first image provided by an embodiment of the present invention;
[0033] 图 12是本发明实施例提供的一种信息推广装置的组成结构示意图; [0033] FIG. 12 is a schematic structural diagram of an information promotion device according to an embodiment of the present invention;
[0034] 图 13是本发明实施例提供的另一种信息推广装置的组成结构示意图。 发明实施例 [0034] FIG. 13 is a schematic structural diagram of another information promotion device provided by an embodiment of the present invention. Invention Example
本发明的实施方式 Embodiments of the invention
[0035] 本发明的技术方案适用于对图像进行检测, 从而得到对应的商标检测结果, 并 根据该商标检测结果在商标信息库中查找并显示商标对应的推广信息的场景。 在一种可能的场景下, 如图 1所示, 展示屏 01对应的拍摄装置 02在感应到其正对 面有消费者时, 周期性的对其正对面的且距离在预设距离内的消费者服装上的 不同部位进行拍照, 将拍摄的图像发送给相应的图像处理装置 03进行检测, 当 检测到图像中包含有商标的情况下, 根据检测到的商标的特征信息在商标信息 库 04中查找匹配的商标, 并确定该商标对应的推广信息, 以使展示屏 01显示该 推广信息, 使消费者能更好的观看到商标对应的信息以及查找到在该商场中商 标对应的商品。 该拍摄装置可以为展示屏上的摄像头, 也可以为展示屏对应的 其他拍摄装置, 例如可以为具有拍照功能的设备, 例如个人计算机、 笔记本电 脑、 智能手机、 平板电脑和便携式可穿戴设备, 周期可以为 0.1s、 0.3s、 0.5s等任 意值, 该预设距离可以为 0.5m、 lm、 1.5m等任意值。 [0035] The technical solution of the present invention is suitable for detecting an image, thereby obtaining a corresponding trademark detection result, and searching and displaying a scene of promotional information corresponding to the trademark in the trademark information database according to the trademark detection result. In a possible scenario, as shown in FIG. 1, when the shooting device 02 corresponding to the display screen 01 senses that there is a consumer directly opposite it, it periodically consumes the consumer directly opposite and within a preset distance Take pictures of different parts of the wearer’s clothing and send the captured images to the corresponding image processing device 03 for detection. When it is detected that the image contains a trademark, the trademark information database 04 is based on the detected trademark feature information Find the matching trademark, and determine the promotion information corresponding to the trademark, so that the display screen 01 displays the promotion information, so that consumers can better view the information corresponding to the trademark and find the product corresponding to the trademark in the mall. The shooting device may be a camera on the display screen, or may be another shooting device corresponding to the display screen, for example, a device with a photographing function, such as a personal computer, laptop computer, smart phone, tablet computer, and portable wearable device, period It can be any value such as 0.1s, 0.3s, 0.5s, etc. The preset distance can be any value such as 0.5m, lm, 1.5m.
[0036] 请参见图 2, 图 2是本发明实施例提供的一种信息推广方法的流程示意图, 如图 所示, 该方法包括如下步骤: [0036] Please refer to FIG. 2 and FIG. 2 is a schematic flowchart of an information promotion method provided by an embodiment of the present invention. As shown in the figure, the method includes the following steps:
[0037] S101 , 获取第一图像, 第一图像为商场中的展示屏对应的拍摄装置拍摄到的图 像。 [0037] S101: Acquire a first image, where the first image is an image captured by a shooting device corresponding to a display screen in a shopping mall.
[0038] 本发明实施例中, 拍摄装置对消费者服装、 背包等不同部位进行拍照, 获取到 第一图像。 其中, 获取到的第一图像中可以包括商标, 也可以不包括商标。 例 如, 获取到的图像可以包括商标以及物品上具有装饰作用的图案或者背景等除 商标以外的图形, 如图 3中所示, “云”与“雨”形图像为物品上具有装饰作用的背 景图案。 拍摄装置可以为展示屏上方的摄像头, 如图 1中 02所示, 也可以为其他 具有拍照功能的设备。 [0038] In the embodiment of the present invention, the photographing device photographs different parts of the consumer's clothing, backpack, etc. to obtain the first image. Among them, the acquired first image may or may not include a trademark. For example, the acquired image may include a trademark and other decorative graphics or background graphics other than the trademark, as shown in FIG. 3, the "cloud" and "rain" shaped images are decorative backgrounds on the article pattern. The shooting device may be a camera above the display screen, as shown by 02 in FIG. 1, or other equipment with a photographing function.
[0039] S102, 检测第一图像中是否存在商标。 [0039] S102, detecting whether a trademark exists in the first image.
[0040] 本发明实施例中, 如图 3所示,图 3是本发明实施例提供的一种检测商标名称的 结构示意图, 可以将获取的第一图像调整至预设尺寸后检测第一图像中是否存 在商标。 具体地, 在第一图像的尺寸大于预设尺寸的情况下, 通过下采样的方 式对第一图像进行处理, 使第一图像缩小为与预设尺寸大小相同的图像; 在第 一图像的尺寸小于预设尺寸的情况下, 通过上采样的方式对第一图像进行处理 , 使第一图像放大为与预设尺寸大小相同的图像。 预设尺寸可以为 32*32的整数 倍数, 例如预设尺寸可以为 416*416 [0040] In the embodiment of the present invention, as shown in FIG. 3, FIG. 3 is a schematic structural diagram of detecting a brand name provided by an embodiment of the present invention, and the first image may be adjusted to a preset size to detect the first image Whether there is a trademark in. Specifically, in the case where the size of the first image is larger than the preset size, the The first image is processed to reduce the first image to the same size as the preset size; in the case that the size of the first image is smaller than the preset size, the first image is processed by upsampling to make The first image is enlarged to the same size as the preset size. The preset size may be an integer multiple of 32*32, for example, the preset size may be 416*416
[0041] 检测第一图像中是否存在商标的具体方式参考图 4对应的实施例, 此处不做过 多描述。 [0041] For a specific method of detecting whether a trademark exists in the first image, refer to the embodiment corresponding to FIG. 4, and details are not described here.
[0042] S103 , 在检测到第一图像中存在商标的情况下, 确定第一图像中至少一个第一 商标的特征信息。 [0042] S103: In a case where a trademark is detected in the first image, determine feature information of at least one first trademark in the first image.
[0043] 本发明实施例中, 在检测到第一图像中不存在商标的情况下, 则不再进行后续 步骤; 在检测到第一图像中存在商标的情况下, 确定第一图像中至少一个第一 商标的特征信息。 这里, 第一商标为第一图像中存在的商标, 该第一商标的数 量可以为多个。 第一商标的特征信息包括第一商标的商标名称以及第一商标在 第一图像中的位置坐标。 例如, 第一图像中存在两个第一商标时, 则第一图像 中第一商标的特征信息包括两个第一商标的商标名称以及两个第一商标在第一 图像中的位置坐标。 例如, 如图 3所示, 由于图 3中只有一个商标, 所以确定出 来的第一图像中的特征信息为商标名称“NIKE”以及商标在第一图像中的位置坐 标对应的区域 10c [0043] In the embodiment of the present invention, when it is detected that the trademark does not exist in the first image, the subsequent steps are not performed; when the trademark is detected in the first image, at least one of the first image is determined Characteristic information of the first trademark. Here, the first trademark is a trademark existing in the first image, and the number of the first trademark may be plural. The characteristic information of the first trademark includes the trademark name of the first trademark and the position coordinates of the first trademark in the first image. For example, when there are two first trademarks in the first image, the characteristic information of the first trademark in the first image includes the trademark names of the two first trademarks and the position coordinates of the two first trademarks in the first image. For example, as shown in FIG. 3, since there is only one trademark in FIG. 3, the characteristic information in the first image determined is the trademark name "NIKE" and the area 10c corresponding to the position coordinates of the trademark in the first image
[0044] S104, 在商标信息库中查找与每个第一商标的的特征信息匹配的第二商标。 [0044] S104, searching for a second trademark matching the feature information of each first trademark in the trademark information database.
[0045] 本发明实施例中, 由于步骤 S103确定到的第一图像中第一商标的特征信息可能 为多个, 则分别根据每个第一商标的特征信息在商标信息库中查找匹配的第二 商标, 且每个第一商标的特征信息可能与商标信息库中的多个第二商标匹配, 则第二商标也可能为多个。 在商标信息库中查找与每个第一商标的特征信息匹 配的第二商标可以包括以下几种情况: [0045] In the embodiment of the present invention, since there may be multiple feature information of the first trademark in the first image determined in step S103, the matching information is searched for in the trademark information database according to the feature information of each first trademark. Two trademarks, and the feature information of each first trademark may match multiple second trademarks in the trademark information database, then there may be multiple second trademarks. Finding the second trademark matching the characteristic information of each first trademark in the trademark information database may include the following situations:
[0046] 第一种情况, 根据第一商标的商标名称在商标信息库中查找与第一商标的商标 名称相同的商标, 将与第一商标的商标名称相同的商标确定为第二商标。 [0046] In the first case, a trademark with the same trademark name as the first trademark is searched in the trademark information database based on the trademark name of the first trademark, and the trademark with the same trademark name as the first trademark is determined as the second trademark.
[0047] 第二种情况, 根据第一商标在第一图像中的位置坐标对应的目标图像在商标信 息库中查找与目标图像的相似度高于相似度阈值的商标图案, 将与目标图像的 相似度高于相似度阈值的商标图案确定为第二商标。 可以根据第一商标在第一 图像中的位置坐标对应的目标图像在商标信息库中查找与目标图像的相似度高 于相似度阈值的商标图案, 将该商标图案确定为第二商标。 相似度阈值可以为 8 0%、 85%、 90%等任意值。 [0047] In the second case, according to the target image corresponding to the position coordinates of the first trademark in the first image, the trademark information database is searched for a trademark pattern whose similarity with the target image is higher than the similarity threshold, and The trademark pattern whose similarity is higher than the similarity threshold is determined as the second trademark. According to the first trademark in the first The target image corresponding to the position coordinates in the image searches the trademark information database for a trademark pattern whose similarity to the target image is higher than the similarity threshold, and determines the trademark pattern as the second trademark. The similarity threshold may be any value such as 80%, 85%, and 90%.
[0048] 第三种情况, 还可以根据第一商标的商标名称与第一商标在第一图像中的位置 坐标对应的商标图案同时在商标信息库中查找相匹配的商标, 将查找到的商标 确定为第二商标。 [0048] In the third case, it is also possible to search for a matching trademark in the trademark information database according to the trademark name of the first trademark and the trademark pattern corresponding to the position coordinates of the first trademark in the first image, and search for the found trademark Determined as the second trademark.
[0049] 这里, 商标信息库用于存储商标信息, 该商标信息可以包括商标名称以及商标 对应的商标图案, 第二商标为商标信息库中存储的多种商标。 可以理解的是, 这里的匹配指相同或者相似度达到相似度阈值即可认为两者相匹配, 相似度阈 值可以为 80%、 85%、 90%等任意值。 [0049] Here, the trademark information database is used to store trademark information, and the trademark information may include a trademark name and a trademark pattern corresponding to the trademark, and the second trademark is a variety of trademarks stored in the trademark information database. It can be understood that the matching here refers to the same or the similarity reaching the similarity threshold can be considered as a match between the two, the similarity threshold can be any value such as 80%, 85%, 90%.
[0050] 在查找到与第一图像中第一商标的特征信息匹配的第二商标后, 确定商标信息 库中第二商标对应的推广信息。 其中, 推广信息可以包括第二商标对应的商品 在商场中的位置信息。 第二商标对应的商品在商场中的位置信息具体可以为: 以当前终端所在位置为坐标原点建立直角坐标系, 该商标在直角坐标系中的位 置信息即对应于该第二商标对应的商品在商场中的位置信息。 推广信息还可以 包括第二商标的商标名称, 这里, 商标名称可以为中文名称或者英文名称, 以 及简称等, 例如“NIKE”的名称可以包括“NIKE”、 “耐克”。 推广信息还可以包括 第二商标的商标图案, 商标图案的颜色可以为黑白色, 也可以为彩色, 该图案 可以为平面图形, 也可以为 3D图形。 推广信息还可以包括第二商标的商标介绍 , 商标介绍可以为该商标的基本信息, 例如商标来源、 商标寓意、 商标适用群 体、 商标对应的商品的价格以及商标包含的商品种类等信息。 [0050] After finding the second trademark that matches the feature information of the first trademark in the first image, the promotion information corresponding to the second trademark in the trademark information database is determined. The promotion information may include the position information of the commodity corresponding to the second trademark in the shopping mall. The location information of the product corresponding to the second trademark in the shopping mall may specifically be: A rectangular coordinate system is established using the current terminal location as the coordinate origin, and the location information of the trademark in the rectangular coordinate system corresponds to the product corresponding to the second trademark. Location information in the mall. The promotion information may also include the trademark name of the second trademark, where the trademark name may be a Chinese name or an English name, and an abbreviation, etc. For example, the name of "NIKE" may include "NIKE" and "Nike". The promotion information may also include the trademark logo of the second trademark. The color of the trademark logo may be black and white, or it may be color. The pattern may be a flat graphic or a 3D graphic. The promotion information may also include the trademark introduction of the second trademark. The trademark introduction may be the basic information of the trademark, such as the source of the trademark, the moral of the trademark, the applicable group of the trademark, the price of the goods corresponding to the trademark, and the types of goods included in the trademark.
[0051] S105 , 在商场中的展示屏中显示第二商标对应的推广信息, 推广信息包括第二 商标对应的商品在商场中的位置信息。 [0051] S105: Display promotion information corresponding to the second trademark on a display screen in a shopping mall, where the promotion information includes location information of commodities corresponding to the second trademark in the mall.
[0052] 通过步骤 S104确定出第二商标对应的推广信息。 在第二商标为多个的情况下, 可以有下面两种方式在商场的展示屏中显示多个第二商标对应的推广信息: [0052] Step S104 determines the promotion information corresponding to the second trademark. When there are multiple second trademarks, there may be the following two ways to display promotion information corresponding to multiple second trademarks on the display screen of the mall:
[0053] 第一种情况, 可以根据各个第二商标对应的推广信息的获取时间顺序设置优先 级, 从而按顺序切换显示多个第二商标对应的推广信息。 例如, 当第二商标为 两个时, 对应的推广信息也为两个, 且第二商标 A对应的推广信息的获取时间在 第二商标 B对应的推广信息之前, 则优先在时间阈值内显示第二商标 A对应的推 广信息, 当达到时间阈值时, 显示第二商标 B对应的推广信息, 当再次达到时间 阈值时, 显示第二商标 A对应的推广信息, 依次按顺序切换显示各个第二商标对 应的推广信息。 时间阈值可以为 3S、 5S、 10S等任意值。 [0053] In the first case, the priority may be set according to the order of acquiring the promotion information corresponding to each second trademark, so that the promotion information corresponding to a plurality of second trademarks may be switched and displayed in order. For example, when there are two second trademarks, the corresponding promotional information is also two, and the acquisition time of the promotional information corresponding to the second trademark A is at Before the promotion information corresponding to the second trademark B, priority is given to displaying the promotion information corresponding to the second trademark A within the time threshold. When the time threshold is reached, the promotion information corresponding to the second trademark B is displayed. When the time threshold is reached again, the display is displayed. The promotion information corresponding to the second trademark A is sequentially switched to display the promotion information corresponding to each second trademark. The time threshold can be any value such as 3S, 5S, and 10S.
[0054] 第二种情况, 可以在商场的展示屏上同时显示多个第二商标对应的推广信息, 例如将展示屏划分为多个区域, 该区域的数量与第二商标对应的推广信息数量 相同, 该区域可以为大小不等的多个区域, 从而实现同时显示多个第二商标对 应的推广信息。 [0054] In the second case, the promotion information corresponding to multiple second trademarks may be displayed on the display screen of the mall at the same time, for example, the display screen is divided into multiple areas, and the number of the areas corresponds to the number of promotion information corresponding to the second trademark Similarly, the area may be a plurality of areas with different sizes, so as to display promotion information corresponding to multiple second trademarks at the same time.
[0055] 本发明实施例中, 通过对获取到的第一图像进行检测, 检测第一图像中是否存 在商标, 在检测到图像中存在商标的情况下, 确定第一图像中至少一个第一商 标的特征信息, 在商标信息库中查找与每个第一商标的特征信息匹配的第二商 标, 并在商场的展示屏中显示与第二商标对应的推广信息, 通过在商场展示屏 上显示与检测到的商标对应的推广信息, 便于消费者观看并查找商标对应的商 品在商场中的位置, 由于第一图像为商场中的展示屏对应的拍摄装置拍摄到的 图像, 该图像为在该展示屏附近的消费者的图像的可能性较高, 那么检测到的 商标很可能为该消费者服装或者其他部位 (如背包) 上的商标, 而消费者的着 装在一定程度上可以用于反馈用户对商品的喜好, 显示与商标相匹配的推广信 息, 相当于是显示用户喜欢的商品的推广信息, 起到了针对性地投放推广信息 的作用, 即根据用户的喜好进行推广, 由于是根据用户的喜好进行推广, 可以 实现推广信息的精准投放, 从而可以使得推广的效果更好。 [0055] In the embodiment of the present invention, by detecting the acquired first image, it is detected whether there is a trademark in the first image, and when a trademark is detected in the image, at least one first trademark in the first image is determined Search for the second trademark matching the characteristic information of each first trademark in the trademark information database, and display the promotion information corresponding to the second trademark on the display screen of the mall. The promotion information corresponding to the detected trademark is convenient for consumers to view and find the position of the product corresponding to the trademark in the mall. Since the first image is an image captured by a shooting device corresponding to the display screen in the mall, the image is displayed at the exhibition The possibility of the consumer’s image near the screen is high, so the detected trademark is likely to be a trademark on the consumer’s clothing or other parts (such as a backpack), and the consumer’s dress can be used to feedback the user to a certain extent. For product preferences, displaying promotional information matching the trademark is equivalent to displaying the promotional information of the product that the user likes, and plays the role of targeted promotional information, that is, promotion according to the user's preferences, because it is based on the user's preferences Promoting can achieve accurate delivery of promotional information, which can make the promotion effect better.
[0056] 在一种可能的实现方式中, 可以通过目标检测模型检测第一图像中是否存在商 标以及确定第一图像中至少一个第一商标的特征信息, 其中目标检测模型可以 为基于回归目标检测算法的目标检测模型, 也可以是基于其他算法的的目标检 测模型, 所述目标检测模型采用的具体算法在此不做限定。 通过目标检测模型 检测第一图像中是否存在商标以及确定第一图像中第一商标的特征信息的具体 步骤如图 4所示, 图 4是本发明实施例提供的一种信息推广方法的流程示意图, 如图所示, 该方法包括: [0056] In a possible implementation manner, the target detection model may be used to detect whether a trademark exists in the first image and determine the feature information of at least one first trademark in the first image, where the target detection model may be regression-based target detection The target detection model of the algorithm may also be a target detection model based on other algorithms, and the specific algorithm adopted by the target detection model is not limited herein. The specific steps of detecting whether there is a trademark in the first image through the target detection model and determining the characteristic information of the first trademark in the first image are shown in FIG. 4, which is a schematic flowchart of an information promotion method provided by an embodiment of the present invention. As shown in the figure, the method includes:
[0057] S201 , 基于目标检测模型中的卷积层对第一图像进行卷积处理, 得到目标卷积 特征图, 目标卷积特征图包括多个目标卷积特征子图。 [0057] S201: Perform a convolution process on the first image based on the convolution layer in the target detection model to obtain a target convolution Feature map. The target convolution feature map includes multiple target convolution feature subgraphs.
[0058] 本发明实施例中, 目标检测模型通过在特征图上通过卷积核来预测位置框对应 的类别和偏移量 (位置框对应图中的哪一个位置) 。 该目标检测模型中的卷积 网络结构包括多个一般卷积层、 卷积特征层、 残差单元、 上采样单元、 以及融 合单元, 可以理解的是, 该卷积网络中除了残差单元、 上采样单元、 卷积特征 层以及融合单元中包括的卷积层以外, 其他的卷积层的都可以为一般卷积层, 一般卷积层为卷积网络中只起卷积作用的卷积层, 卷积特征层用于生成进行图 像检测的目标卷积特征图。 每个卷积层对应不同尺寸的卷积核, 通过不同尺寸 的卷积核对图像进行卷积处理, 得到不同尺寸的卷积图, 不同尺寸的卷积核对 应不同的多个先验框, 通过卷积核对应的先验框对与卷积核对应的卷积图进行 预测处理, 可得到多个边界框。 [0058] In the embodiment of the present invention, the target detection model predicts the category and offset corresponding to the position frame (which position in the figure corresponds to the position frame) by using a convolution kernel on the feature map. The structure of the convolutional network in the target detection model includes multiple general convolutional layers, convolutional feature layers, residual units, upsampling units, and fusion units. It can be understood that in addition to the residual units, In addition to the convolutional layer included in the upsampling unit, the convolutional feature layer, and the fusion unit, the other convolutional layers can all be general convolutional layers. The general convolutional layer is a convolution that only plays a convolutional role in the convolutional network. Layer, the convolutional feature layer is used to generate a target convolutional feature image for image detection. Each convolutional layer corresponds to a convolution kernel of a different size. The convolution kernels of different sizes are used to convolve the image to obtain convolution maps of different sizes. Convolution kernels of different sizes correspond to different multiple prior frames. The a priori frame corresponding to the convolution kernel performs prediction processing on the convolution graph corresponding to the convolution kernel, and multiple bounding boxes can be obtained.
[0059] 这里, 目标检测模型中的卷积层对第一图像进行卷积处理, 得到多个尺寸不同 的目标卷积特征图, 多个尺寸不同的目标卷积特征图具体是指: 通过目标检测 模型中的卷积特征层分别输出的结果, 其中, 每个卷积特征层对应多个尺寸相 同的目标卷积特征图, 卷积特征层对应的尺寸越小, 则卷积特征层对应的目标 卷积特征图的数量越多。 例如, 尺寸为 64*64的目标卷积特征图的数量少于尺寸 为 32*32的目标卷积特征图的数量, 尺寸为 32*32的目标卷积特征图的数量少于 尺寸为 16*16的目标卷积特征图的数量, 尺寸为 16*16的目标卷积特征图的数量 少于尺寸为 8*8的目标卷积特征图的数量 ...... [0059] Here, the convolutional layer in the target detection model performs convolution processing on the first image to obtain multiple target convolution feature maps of different sizes, and the multiple target convolution feature maps of different sizes specifically refer to: The output results of the convolutional feature layers in the detection model, where each convolutional feature layer corresponds to multiple target convolutional feature maps of the same size, the smaller the size of the convolutional feature layer, the corresponding to the convolutional feature layer The greater the number of target convolutional feature maps. For example, the number of target convolution feature maps with size 64*64 is less than the number of target convolution feature maps with size 32*32, and the number of target convolution feature maps with size 32*32 is less than size 16* The number of target convolutional feature maps of 16, the number of target convolutional feature maps of size 16*16 is less than the number of target convolutional feature maps of size 8*8...
[0060] 具体实现中, 可以将第一图像的尺寸调整为目标检测模型对应的输入图像的尺 寸 (该尺寸可以是 416*416) , 该图像的尺寸为目标检测模型对应的输入图像的 尺寸。 然后将该图像输入到目标检测模型的卷积网络中, 将该图像作为卷积网 络中的第一个卷积层的输入, 依次通过该卷积网络中的卷积层对应的卷积核对 上一个卷积层输出的结果进行卷积处理, 然后将该卷积网络中的卷积特征层输 出的结果确定为多个尺寸不同的目标卷积特征图。 其中, 通过卷积层对应的卷 积核对上一个卷积层输出的结果进行卷积处理具体是指通过该卷积核对应的矩 阵与上一个卷积层输出的矩阵相乘, 通过卷积层对应的卷积核对上一个卷积层 输出的结果进行卷积处理得到的结果为该卷积层对应的尺寸的矩阵, 该矩阵对 应的图像即为该卷积层对应的卷积图。 [0060] In a specific implementation, the size of the first image may be adjusted to the size of the input image corresponding to the target detection model (the size may be 416*416), and the size of the image is the size of the input image corresponding to the target detection model. Then input the image into the convolutional network of the target detection model, use the image as the input of the first convolutional layer in the convolutional network, and check it in sequence through the convolution corresponding to the convolutional layer in the convolutional network The output result of a convolutional layer is subjected to convolution processing, and then the output result of the convolutional feature layer in the convolutional network is determined as a plurality of target convolutional feature maps with different sizes. Among them, the convolution processing of the convolution layer corresponding to the convolution layer on the result of the previous convolution layer specifically means that the matrix corresponding to the convolution kernel is multiplied by the matrix output by the previous convolution layer, and the convolution layer The corresponding convolution kernel performs convolution processing on the result output by the previous convolution layer. The result obtained is a matrix of the size corresponding to the convolution layer. The corresponding image is the convolution map corresponding to the convolution layer.
[0061] 下面通过对目标检测模型中的卷积层对第一图像进行卷积处理的过程进行具体 说明, 参见图 5 , 图 5是本发明实施例提供的目标检测模型中的卷积网络结构的 示意图, 目标检测模型的卷积网络包括多个一般卷积层、 三个卷积特征层、 五 个残差单元、 两个上采样单元、 以及两个融合单元。 [0061] The following describes the process of convolving the first image by the convolution layer in the target detection model. Referring to FIG. 5, FIG. 5 is a convolution network structure in the target detection model provided by the embodiment of the present invention. The schematic diagram of the target detection model includes multiple general convolutional layers, three convolutional feature layers, five residual units, two upsampling units, and two fusion units.
[0062] 将第一图像调整尺寸后输入卷积网络中, 通过一般卷积层中的卷积核对第一图 像进行卷积处理, 得到一般卷积图, 通过第一残差单元中的卷积层对一般卷积 图进行卷积处理, 得到第一卷积图, 通过第二残差单元中的卷积层对第一卷积 图进行卷积处理后得到第二卷积图, 通过第三残差单元中的卷积层对第二卷积 图进行卷积处理后得到第三卷积图, 通过第四残差单元中的卷积层对第三卷积 图进行卷积处理后得到第四卷积图, 通过第五残差单元中的卷积层对第四卷积 图进行卷积处理后得到第五卷积图。 通过一般卷积层中的卷积核对第五卷积图 进行卷积处理, 得到第六卷积图, 通过上采样单元对第六卷积图进行上采样处 理, 得到第一上采样卷积图, 通过融合单元融合第一上采样卷积图与第四卷积 图, 得到第一融合卷积图, 通过一般卷积层中的卷积核对第一融合卷积图进行 卷积处理, 得到第七卷积图, 通过上采样单元对第七卷积图进行上采样处理, 得到第二上采样卷积图, 通过融合单元融合第二上采样卷积图与第三卷积图, 得到第二融合卷积图, 通过一般卷积层中的卷积核对第二融合卷积图进行卷积 处理, 得到第八卷积图, 然后将第六卷积图、 第七卷积图、 第八卷积图确定为 目标卷积特征图。 [0062] The first image is resized and input into a convolutional network, and the first image is convoluted by a convolution kernel in a general convolution layer to obtain a general convolution map, and the convolution in the first residual unit The layer performs a convolution process on the general convolution map to obtain a first convolution map, and performs a convolution process on the first convolution map through the convolution layer in the second residual unit to obtain a second convolution map, and a third convolution map. The convolutional layer in the residual unit performs a convolution process on the second convolution map to obtain a third convolution map, and the convolutional layer in the fourth residual unit performs a convolution process on the third convolution map to obtain a third convolution map. For the four convolutional graphs, the fifth convolutional graph is convoluted by the convolutional layer in the fifth residual unit to obtain a fifth convolutional graph. Perform a convolution process on the fifth convolutional map through the convolution kernel in the general convolutional layer to obtain a sixth convolutional map, and perform an upsampling process on the sixth convolutional map through an upsampling unit to obtain a first upsampled convolutional map , Fuse the first up-sampling convolution map and the fourth convolution map by the fusion unit to obtain the first fusion convolution map, and perform convolution processing on the first fusion convolution map by the convolution kernel in the general convolution layer to obtain the first Seven convolutional graphs, the seventh convolutional graph is upsampled by an upsampling unit to obtain a second upsampling convolutional graph, and the second upsampling convolutional graph and the third convolutional graph are fused by a fusion unit to obtain a second Fusion convolution map, convolution processing of the second fusion convolution map through the convolution kernel in the general convolution layer to obtain the eighth convolution map, and then the sixth convolution map, the seventh convolution map, the eighth volume The product graph is determined as the target convolution feature graph.
[0063] 其中, 第六卷积图、 第七卷积图、 第八卷积图的数量都为多个, 且第六卷积图 、 第七卷积图、 第八卷积图的数量依次减少, 即第六卷积图的数量多于第七卷 积图的数量, 第七卷积图的数量多于第八卷积层。 即处理后得到的目标卷积特 征图为三种不同尺寸的卷积图, 且每种卷积图的尺寸与数量不一样。 这里, 第 六卷积图有多个, 且每个第六卷积图尺寸相同; 第七卷积图有多个, 每个第七 卷积图尺寸相同; 第八卷积图有多个, 每个第八卷积图的尺寸相同。 [0063] Among them, the number of the sixth convolution map, the seventh convolution map, and the eighth convolution map are multiple, and the number of the sixth convolution map, the seventh convolution map, and the eighth convolution map are in order Reduced, that is, the number of sixth convolution maps is greater than the number of seventh convolution maps, and the number of seventh convolution maps is greater than the eighth convolutional layer. That is, the target convolution feature maps obtained after processing are three different sizes of convolution maps, and the size and number of each convolution map are different. Here, there are multiple sixth convolution maps, and each sixth convolution map has the same size; there are multiple seventh convolution maps, each seventh convolution map has the same size; and there are multiple eighth convolution maps, Each eighth convolution graph has the same size.
[0064] 其中的一个残差单元对图像的处理方法如下, 以第一残差单元为例, 通过第一 残差单元中的第一卷积层对应的卷积核对输入的图像进行卷积处理, 得到第一 卷积子图, 通过第一残差单元中的第二卷积层对应的卷积核对第一卷积子图进 行卷积处理, 得到第二卷积子图, 对第一卷积子图与第二卷积子图进行残差学 习, 得到第一残差单元对应的第一卷积图。 [0064] One of the residual units processes the image as follows. Taking the first residual unit as an example, the input image is convoluted by the convolution kernel corresponding to the first convolution layer in the first residual unit , Get first Convolution sub-graph, the first convolution sub-layer convolution corresponding to the first convolution sub-convolution of the first convolution sub-graph to obtain a second convolution sub-graph, the first convolution sub-graph and The second convolution subgraph performs residual learning to obtain a first convolution graph corresponding to the first residual unit.
[0065] 需说明的是, 这里的举例仅用于说明利用卷积网络对第一图像进行卷积处理的 过程, 不对本发明实施例进行限制, 在可选实施例中, 卷积网络还可以包括更 多的一般卷积层、 卷积特征层、 残差单元、 上采样单元和更多的融合单元。 [0065] It should be noted that the examples here are only used to illustrate the process of using a convolution network to perform a convolution process on the first image, and do not limit the embodiment of the present invention. In an alternative embodiment, the convolution network may also It includes more general convolutional layers, convolutional feature layers, residual units, upsampling units and more fusion units.
[0066] 这里, 目标卷积特征子图是指目标卷积特征图包含的特征单元, 例如, 目标卷 积特征图的尺寸为 3*3 , 则目标卷积特征图可以如图 6所示, 图 6是本发明实施例 提供的目标卷积特征图与目标卷积特征子图之间的关系示意图, 目标卷积特征 图一共包含 9个特征单元, 每个特征单元为目标卷积特征图中的一个单元格, 编 号为 1~9, 即目标卷积特征图包含 9个目标卷积特征子图。 [0066] Here, the target convolution feature submap refers to the feature unit included in the target convolution feature map, for example, the size of the target convolution feature map is 3*3, then the target convolution feature map may be as shown in FIG. 6, 6 is a schematic diagram of the relationship between a target convolution feature map and a target convolution feature sub-map provided by an embodiment of the present invention. The target convolution feature map includes a total of 9 feature units, and each feature unit is a target convolution feature map A cell is numbered 1-9, that is, the target convolution feature map contains 9 target convolution feature submaps.
[0067] S202, 分别确定多个目标卷积特征子图中各个目标卷积特征子图对应的目标卷 积特征信息。 [0067] S202: Determine target convolution feature information corresponding to each target convolution feature submap in multiple target convolution feature submaps, respectively.
[0068] 这里, 各个目标卷积特征子图对应的目标卷积特征信息是指: 以目标卷积特征 图对应的先验框作为边界框在该目标卷积特征图中以目标卷积特征子图为中心 所圈出来的内容。 例如, 卷积特征图如图 6所示, 图 6是本发明实施例提供的目 标卷积特征图与目标卷积特征子图之间的关系示意图, 则对于该目标卷积特征 图中的目标卷积特征子图 9, 该目标卷积特征子图 9对应的目标卷积特征信息为 图 6中的三个尺寸不同的虚线框所对应的该目标卷积特征图的内容。 [0068] Here, the target convolution feature information corresponding to each target convolution feature submap refers to: using the a priori box corresponding to the target convolution feature map as a bounding box to use the target convolution feature sub in the target convolution feature map The picture is the content circled in the center. For example, the convolution feature map is shown in FIG. 6. FIG. 6 is a schematic diagram of the relationship between the target convolution feature map and the target convolution feature sub-map provided by an embodiment of the present invention. For the target in the target convolution feature map Convolution feature sub-picture 9, the target convolution feature information corresponding to the target convolution feature sub-picture 9 is the content of the target convolution feature map corresponding to the three dashed boxes with different sizes in FIG. 6.
[0069] 具体实现中, 可以以目标卷积特征图对应的先验框作为边界框, 确定目标卷积 特征图中的各个目标卷积特征子图对应的边界框内的信息, 将该边界框内的信 息确定为该边界框对应的目标卷积特征子图的目标卷积特征信息, 从而确定目 标卷积特征图对应的目标卷积特征信息。 [0069] In a specific implementation, the a priori box corresponding to the target convolution feature map may be used as a bounding box to determine the information within the bounding box corresponding to each target convolution feature submap in the target convolution feature map, and the bounding box The information in is determined as the target convolution feature information of the target convolution feature submap corresponding to the bounding box, thereby determining the target convolution feature information corresponding to the target convolution feature map.
[0070] 例如, 目标卷积特征图如图 6所示, 则确定目标卷积特征图对应的目标卷积特 征信息可以为: 以尺寸为 3*3的卷积特征对应的先验框作为边界框, 将边界框以 特征单元 1为中心, 确定该边界框对应的信息, 将该边界框对应的信息确定为特 征单元 1对应的目标卷积特征信息, 从而确定特征单元 1对应的三个边界框对应 的信息为三个目标卷积特征信息; 将边界框以特征单元 2为中心, 确定该边界框 对应的信息, 将该边界框对应的信息确定为特征单元 2对应的目标卷积特征信息 , 从而确定特征单元 2对应的三个边界框对应的信息为三个目标卷积特征信息; ...... ; 将边界框以特征单元 9为中心, 确定该边界框对应的信息, 将该边界框对 应的信息确定为特征单元 9对应的目标卷积特征信息, 从而确定特征单元 9对应 的三个边界框对应的信息为三个目标卷积特征信息; 最后将特征单元 1~特征单 元 9对应的目标卷积特征信息确定为目标卷积特征图对应的目标卷积特征信息, 可得到 27个目标卷积特征信息。 [0070] For example, if the target convolution feature map is as shown in FIG. 6, the target convolution feature information corresponding to the target convolution feature map may be: The prior frame corresponding to the convolution feature of size 3*3 is used as the boundary Box, centering the bounding box on feature unit 1 to determine the information corresponding to the bounding box, and determining the information corresponding to the bounding box as the target convolution feature information corresponding to feature unit 1, thereby determining the three boundaries corresponding to feature unit 1 The information corresponding to the box is the three target convolution feature information; the boundary box is centered on the feature unit 2 to determine the boundary box Corresponding information, the information corresponding to the bounding box is determined as the target convolution feature information corresponding to the feature unit 2, thereby determining that the information corresponding to the three bounding boxes corresponding to the feature unit 2 is the three target convolution feature information;... ...; The boundary box is centered on the feature unit 9 to determine the information corresponding to the boundary box, and the information corresponding to the boundary box is determined as the target convolution feature information corresponding to the feature unit 9, thereby determining the three corresponding to the feature unit 9 The information corresponding to each bounding box is three target convolution feature information; finally, the target convolution feature information corresponding to feature unit 1~feature unit 9 is determined as the target convolution feature information corresponding to the target convolution feature map, and 27 can be obtained Target convolution feature information.
[0071] S203 , 分别确定各个目标卷积特征信息与目标检测模型中多个商标名称之间的 匹配概率, 以及, 各个目标卷积特征信息对应的位置坐标, 并分别将在第一图 像中与各个目标卷积特征信息对应的位置坐标所对应的区域作为各个目标卷积 特征信息对应的第一区域。 [0071] S203, separately determining the matching probability between each target convolution feature information and multiple trademark names in the target detection model, and, the position coordinates corresponding to each target convolution feature information, and respectively in the first image The area corresponding to the position coordinates corresponding to each target convolution feature information is used as the first area corresponding to each target convolution feature information.
[0072] 具体地, 可以通过目标检测模型中的分类器对各个目标卷积特征信息与目标检 测模型中的多种商标名称对应的商标图形之间的匹配概率进行计算, 从而分别 确定各个目标卷积特征信息与目标检测模型中的多种商标名称对应的商标图形 之间的匹配概率。 确定到的匹配概率的数量和目标检测模型中的商标名称对应 的商标图形的数量相同。 [0072] Specifically, the matching probability between each target convolution feature information and the trademark graphics corresponding to multiple trademark names in the target detection model can be calculated by a classifier in the target detection model, so as to determine each target volume separately The matching probability between the product feature information and the trademark graphics corresponding to various trademark names in the target detection model. The number of determined matching probabilities is the same as the number of trademark graphics corresponding to the trademark name in the target detection model.
[0073] 这里, 由于每个目标卷积特征信息都与目标检测模型中的每种商标名称对应的 商标图形之间进行匹配, 则每个目标卷积特征信息对应多个匹配概率, 匹配概 率即该目标卷积特征信息是目标检测模型中的各种商标名称对应的商标图形的 可能性。 [0073] Here, since each target convolution feature information is matched with a trademark graphic corresponding to each trademark name in the target detection model, each target convolution feature information corresponds to multiple matching probabilities, and the matching probability is The target convolution feature information is the possibility of trademark graphics corresponding to various trademark names in the target detection model.
[0074] 这里, 各个目标卷积特征信息对应的位置坐标是指将各个目标卷积特征信息对 应的各个边界框映射回第一图像时所对应的位置坐标, 每个目标卷积特征信息 对应四个位置坐标, 这四个位置坐标分别对应边界框的四个顶点, 将边界框的 四个顶点映射回原图所得到的四个点的坐标即为目标卷积特征信息对应的位置 坐标, 由此可得到各个目标卷积特征信息对应的位置坐标。 [0074] Here, the position coordinates corresponding to each target convolution feature information refer to the position coordinates corresponding to the time when each bounding box corresponding to each target convolution feature information is mapped back to the first image, each target convolution feature information corresponds to four Position coordinates, these four position coordinates correspond to the four vertices of the bounding box, and the coordinates of the four points obtained by mapping the four vertices of the bounding box back to the original image are the position coordinates corresponding to the target convolution feature information, by This can obtain the position coordinates corresponding to the convolution feature information of each target.
[0075] 由于各个目标卷积特征图是由第一图像经过尺寸调整以及卷积处理而来, 所以 各个目标卷积特征图中的每个点与第一图像中的点或区域存在对应关系, 根据 该对应关系可确定各个边界框在第一图像中对应四个点的位置坐标, 进而将各 个边界框在第一图像中对应的四个点的位置坐标确定为各个边界框对应的各个 目标卷积特征信息对应的位置坐标, 将该位置坐标对应的点所形成的区域确定 为各个目标卷积特征信息对应的第一区域, 因此, 可得到每个目标卷积特征信 息对应于第一图像中的每个第一区域。 [0075] Since each target convolution feature map is obtained from the first image through size adjustment and convolution processing, there is a corresponding relationship between each point in each target convolution feature map and a point or area in the first image, According to this correspondence, the position coordinates of each boundary box corresponding to four points in the first image can be determined, and then each The position coordinates of the four points corresponding to the bounding boxes in the first image are determined as the position coordinates corresponding to the respective target convolution feature information corresponding to each bounding box, and the area formed by the points corresponding to the position coordinates is determined as each target volume The first region corresponding to the product feature information, therefore, it can be obtained that each target convolution feature information corresponds to each first region in the first image.
[0076] 举例来进行说明, 例如, 目标卷积特征信息对应的边界框如图 7所示, 图 7是本 发明实施例提供的目标卷积特征图与第一图像之间的映射关系的示意图, 边界 框的四个顶点分别为 al、 a2、 a3以及 a4, 该四个顶点映射回第一图像时分别对应 点 bl、 b2、 b3以及 b4, bl在第一图像中的位置坐标为 (bl l, bl2) , b2在第一 图像中的位置坐标为 (b21 , b22) , b3在第一图像中的位置坐标为 (b31 , b34 ) , b4在第一图像中的位置坐标为 (b41 , b44) , 则将 bl的位置坐标 (bl l , bl 2) 、 b2的位置坐标 (b21 , b22) 、 b3的位置坐标 (b31 , b32) 以及 b4的位置坐 标 (b41 , b42) 确定为目标卷积特征信息对应的位置坐标, 将点 bl、 b2、 b3以 及 b4所在第一图像中形成的区域确定为目标卷积特征信息对应的第一区域。 [0076] For example, for example, the bounding box corresponding to the target convolution feature information is shown in FIG. 7, and FIG. 7 is a schematic diagram of the mapping relationship between the target convolution feature map and the first image provided by an embodiment of the present invention. , The four vertices of the bounding box are al, a2, a3, and a4 respectively, and the four vertices correspond to the points bl, b2, b3, and b4 when mapped back to the first image, and the position coordinates of bl in the first image are (bl l, bl2), b2's position coordinates in the first image are (b21, b22), b3's position coordinates in the first image are (b31, b34), b4's position coordinates in the first image are (b41, b44), then the position coordinates of bl (bl l, bl 2), b2 (b21, b22), b3 (b31, b32) and b4 (b41, b42) are determined as the target volume The position coordinates corresponding to the product feature information determine the region formed in the first image where the points bl, b2, b3, and b4 are located as the first region corresponding to the target convolution feature information.
[0077] 具体实现中, 可以根据该目标卷积特征信息对应的目标卷积特征图与第一图像 之间的映射关系确定各个目标卷积特征信息对应的位置坐标, 即可确定该位置 坐标在第一图像中对应的点, 点组合成的区域为第一区域。 [0077] In a specific implementation, the position coordinates corresponding to each target convolution feature information can be determined according to the mapping relationship between the target convolution feature map corresponding to the target convolution feature information and the first image, that is, the position coordinates can be determined Corresponding points in the first image, the area combined by the points is the first area.
[0078] S204, 分别根据各个目标卷积特征信息与目标检测模型中多个商标名称之间的 匹配概率, 确定各个目标卷积特征信息对应的最大匹配概率, 分别将各个目标 卷积特征信息对应的最大匹配概率确定为各个目标卷积特征信息对应的第一区 域的置信度, 并分别将各个目标卷积特征信息对应的最大匹配概率所对应的商 标名称作为与各个目标卷积特征信息对应的第一区域所对应的商标名称。 [0078] S204: Determine the maximum matching probability corresponding to each target convolution feature information according to the matching probability between each target convolution feature information and multiple trademark names in the target detection model, respectively, and corresponding each target convolution feature information The maximum matching probability of is determined as the confidence of the first region corresponding to each target convolution feature information, and the trademark name corresponding to the maximum matching probability corresponding to each target convolution feature information is used as the corresponding to each target convolution feature information The brand name corresponding to the first area.
[0079] 本发明实施例中, 由于各个目标卷积特征信息都与目标检测模型中的每种商标 名称对应的商标图形之间进行匹配, 则卷积特征信息集合中的每个目标卷积特 征信息对应多个匹配概率, 分别将各个目标卷积特征信息对应的多个匹配概率 中最大的匹配概率确定为各个目标卷积特征信息对应的第一区域的置信度, 则 可得到每个目标卷积特征信息对应的第一区域的置信度, 并分别将各个目标卷 积特征信息对应的最大匹配概率所对应的商标名称作为与各个目标卷积特征信 息对应的第一区域所对应的商标名称。 [0080] 这里, 第一区域的置信度为第一区域对应的图像与目标检测模型中多个商标名 称对应的商标图形之间的匹配概率中最大的匹配概率。 [0079] In the embodiment of the present invention, since each target convolution feature information is matched with a trademark graphic corresponding to each trademark name in the target detection model, each target convolution feature in the convolution feature information set The information corresponds to multiple matching probabilities, and the maximum matching probability among the multiple matching probabilities corresponding to each target convolution feature information is determined as the confidence of the first region corresponding to each target convolution feature information, then each target volume can be obtained The confidence of the first region corresponding to the product feature information, and using the brand name corresponding to the maximum matching probability corresponding to each target convolution feature information as the brand name corresponding to the first region corresponding to each target convolution feature information. [0080] Here, the confidence of the first area is the largest matching probability among the matching probabilities between the image corresponding to the first area and the trademark graphics corresponding to multiple trademark names in the target detection model.
[0081] 以一个目标卷积特征信息 (即一个边界框内的信息) 为例进行说明, 例如, 确 定出目标卷积特征信息 A与目标检测模型中的商标名称 1对应的商标图形的匹配 概率为 0.80; 目标卷积特征信息 A与目标检测模型中的商标名称 2对应的商标图 形的匹配概率为 0.20; 目标卷积特征信息 A与目标检测模型中的商标名称 3对应 的商标图形的匹配概率为 0.15 , 则目标卷积特征信息 A与目标检测模型中的商标 名称 1对应的商标图形具有最大匹配概率, 且最大匹配概率为 0.80。 则该最大匹 配概率为第一区域的置信度, 该置信度为 0.80, 该第一区域对应的商标名称为商 标名称 1。 [0081] Taking one target convolution feature information (ie, information within a bounding box) as an example for illustration, for example, the matching probability of the target convolution feature information A and the trademark pattern corresponding to the trademark name 1 in the target detection model is determined 0.80; the matching probability of the target convolution feature information A and the trademark graphic corresponding to the trademark name 2 in the target detection model is 0.20; the matching probability of the target convolution feature information A and the trademark graphic corresponding to the trademark name 3 in the target detection model Is 0.15, then the trademark pattern corresponding to the trademark name 1 in the target detection model of the target convolution feature information A has the maximum matching probability, and the maximum matching probability is 0.80. Then the maximum matching probability is the confidence of the first area, the confidence is 0.80, and the trademark name corresponding to the first area is the trademark name 1.
[0082] 为了便于理解, 下面通过举例的方式对步骤 S202 S204中的过程进行介绍, 例 如, 如图 8所示, 图 8是本发明实施例提供的一种目标检测模型卷积处理后得到 的三种目标卷积特征图与对应的目标卷积特征子图之间的关系示意图, 假设通 过 S201的方式对输入的图像 A进行处理后得到三种尺寸不同的目标卷积特征图 B1 、 : B2、 B3, 假设 B1的尺寸为 3*3、 数量为 10、 编号分别为 1~10, B2的尺寸为 6*6 、 数量为 6、 编号分别为
Figure imgf000015_0001
B3的尺寸为 9*9、 数量为 4、 编号分别为 17~20。 C1中的一个小方格为 B1中的目标卷积特征图对应的一个目标卷积特征子图, C2 中的一个小方格为 B2中的目标卷积特征图对应的一个目标卷积特征子图, C3中 的一个小方格为 B3中的目标卷积特征图对应的一个目标卷积特征子图。 可得到 B 1中编号为 1的目标卷积特征图中有 9个目标卷积特征子图, 如 C1所示, C1中的每 一个小方格代表一个目标卷积特征子图, 每个目标卷积特征子图对应 3个目标卷 积特征信息 (即 C1中一个小方格对应 3个目标卷积特征信息, 且一个目标卷积特 征信息对应一个边界框) , 则 B1中编号为 1的目标卷积特征图对应 27 (3*3*3=27 ) 个目标卷积特征信息, B1中编号为 2的目标卷积特征图对应 27个目标卷积特征 信息, ...... B1中编号为 10的目标卷积特征图对应 27个目标卷积特征信息。 对于 尺寸为 3*3的目标卷积特征图 B1而言, 由于 B1的数量为 10个, 则可得到 270 (27* 10=270) 个目标卷积特征信息。 可知, 尺寸为 6*6的目标卷积特征图 B2中编号为 11的目标卷积特征子图中有 108 (6*6*3=108) 个目标卷积特征信息, ......, 尺 寸为 6*6的目标卷积特征图 B2中编号为 16的目标卷积特征子图中有 108个目标卷 积特征信息对于尺寸为 6*6的目标卷积特征图 B2而言, 由于 B2的数量为 6个, 则 可得到 648 ( 108*6=648) 个目标卷积特征信息。 可知, 尺寸为 9*9的目标卷积特 征图 B3中编号为 17的目标卷积特征子图中有 243 (9*9*3=243) 个目标卷积特征 信息, ......, 尺寸为 9*9的目标卷积特征图 B3中编号为 20的目标卷积特征子图中 有 243个目标卷积特征信息, 对于尺寸为 9*9的目标卷积特征图 B3而言, 由于 B3 的数量为 4个, 则可得到 972 (243*4=972) 个目标卷积特征信息。 这里, 可得到 三种不同尺寸的目标卷积特征图 B l、 B2、 B3对应的 270+648+972=1890个目标卷 积特征信息。
[0082] For ease of understanding, the following describes the process in steps S202 to S204 by way of example. For example, as shown in FIG. 8, FIG. 8 is obtained after convolution processing of a target detection model provided by an embodiment of the present invention. A schematic diagram of the relationship between the three target convolution feature maps and the corresponding target convolution feature submaps. Suppose that after processing the input image A by means of S201, three target convolution feature maps B1 and B2 of different sizes are obtained: , B3, assuming that the size of B1 is 3*3, the quantity is 10, and the numbers are 1~10, the size of B2 is 6*6, the quantity is 6, and the numbers are respectively
Figure imgf000015_0001
The size of B3 is 9*9, the quantity is 4, and the numbers are 17~20 respectively. A small square in C1 is a target convolution feature subgraph corresponding to the target convolution feature map in B1, a small square in C2 is a target convolution feature submap corresponding to the target convolution feature map in B2 In the figure, a small square in C3 is a target convolution feature submap corresponding to the target convolution feature map in B3. There are 9 target convolution feature subgraphs in the target convolution feature map numbered 1 in B 1, as shown in C1, each small square in C1 represents a target convolution feature submap, each target Convolutional feature submap corresponds to 3 target convolutional feature information (that is, a small square in C1 corresponds to 3 target convolutional feature information, and a target convolutional feature information corresponds to a bounding box), then the number 1 in B1 is The target convolution feature map corresponds to 27 (3*3*3=27) target convolution feature information, the target convolution feature map numbered 2 in B1 corresponds to 27 target convolution feature information, ... B1 The target convolution feature map numbered 10 in the map corresponds to 27 target convolution feature information. For the target convolution feature map B1 with a size of 3*3, since the number of B1 is 10, 270 (27*10=270) target convolution feature information can be obtained. It can be seen that there are 108 (6*6*3=108) target convolution feature information in the target convolution feature submap numbered 11 in the target convolution feature graph B2 of size 6*6,... , ruler There are 108 target convolution feature information in the target convolution feature sub-number of 16 in the target convolution feature map B2 with the size of 6*6. For the target convolution feature map B2 with the size of 6*6, due to B2 The number is 6, then 648 (108*6=648) target convolution feature information can be obtained. It can be seen that there are 243 (9*9*3=243) target convolution feature information in the target convolution feature subgraph numbered 17 in the target convolution feature graph B3 of size 9*9,... There are 243 target convolution feature information in the target convolution feature sub-picture numbered 20 in the target convolution feature map B3 of size 9*9. For the target convolution feature map B3 of size 9*9, Since the number of B3 is four, 972 (243*4=972) target convolution feature information can be obtained. Here, 270+648+972=1890 target convolution feature information corresponding to three different sizes of target convolution feature maps Bl, B2, and B3 can be obtained.
[0083] 假设该 1890个目标卷积特征信息分别为目标卷积特征信息 1, ......, 目标卷积 特征信息 1890。 首先确定目标卷积特征信息 1与目标检测模型中的多种商标名称 对应的商标图形之间的匹配概率。 例如, 目标检测模型中有 50个商标名称对应 的商标图形, 则可得到目标卷积特征信息 1对应的 50个匹配概率, 并将目标卷积 特征信息 1对应的边界框映射回第一图像中的区域确定为目标卷积特征信息 1对 应的第一区域 1。 同理, 可确定目标卷积特征信息 2对应的 50个匹配概率以及目 标卷积特征信息 2对应的边界框映射回第一图像中的第一区域 2, ......, 可确定 目标卷积特征信息 1890对应的 50个匹配概率以及目标卷积特征信息 1890对应的 边界框映射回第一图像中的第一区域 1890。 [0083] Assume that the 1890 target convolution feature information is target convolution feature information 1, ..., target convolution feature information 1890. First, determine the matching probability between the target convolution feature information 1 and the trademark graphics corresponding to multiple trademark names in the target detection model. For example, if there are 50 trademark graphics corresponding to the trademark name in the target detection model, 50 matching probabilities corresponding to the target convolution feature information 1 can be obtained, and the bounding box corresponding to the target convolution feature information 1 can be mapped back into the first image Is determined as the first region 1 corresponding to the target convolution feature information 1. Similarly, the 50 matching probabilities corresponding to the target convolution feature information 2 and the bounding box corresponding to the target convolution feature information 2 can be mapped back to the first region 2 in the first image, ..., the target can be determined The 50 matching probabilities corresponding to the convolution feature information 1890 and the bounding box corresponding to the target convolution feature information 1890 are mapped back to the first region 1890 in the first image.
[0084] 其次, 以目标卷积特征信息 1为例, 将目标卷积特征信息 1对应的 50个匹配概率 中最大的匹配概率确定为第一区域 1的置信度, 并将该最大的匹配概率所对应的 商标名称作为第一区域 1对应的商标名称, 从而得到目标卷积特征信息 1对应的 第一区域 1的置信度以及第一区域 1对应的商标名称。 同理, 可得到目标卷积特 征信息 2对应的第一区域 2的置信度以及第一区域 2对应的商标名称, ......, 可得 到目标卷积特征信息 1890对应的第一区域 1890的置信度以及第一区域 1890对应 的商标名称。 [0084] Second, taking the target convolution feature information 1 as an example, the maximum matching probability among the 50 matching probabilities corresponding to the target convolution feature information 1 is determined as the confidence of the first region 1, and the maximum matching probability is determined The corresponding brand name is used as the brand name corresponding to the first region 1 to obtain the confidence of the first region 1 corresponding to the target convolution feature information 1 and the brand name corresponding to the first region 1. Similarly, the confidence of the first region 2 corresponding to the target convolution feature information 2 and the brand name corresponding to the first region 2 can be obtained, ..., the first region corresponding to the target convolution feature information 1890 can be obtained The confidence of 1890 and the brand name corresponding to the first area 1890.
[0085] 最后, 可得到 1890个第一区域, 每个第一区域都有对应的置信度以及对应的商 标名称。 [0085] Finally, 1890 first regions can be obtained, each of which has a corresponding confidence level and a corresponding trademark name.
[0086] S205, 判断各个第一区域的置信度是否大于第一阈值。 [0087] 这里, 由于第一区域的置信度为第一区域对应的图像与目标检测模型中多个商 标名称对应的商标图形之间的匹配概率中最大的匹配概率, 第一区域的置信度 大于第一阈值说明第一区域对应的图像与该目标检测模型中的多个商标名称对 应的商标图形的匹配概率较大, 即该第一区域对应的图像为该目标检测模型中 多个商标名称对应的商标图形的可能性较大; 第一区域的置信度小于第一阈值 说明第一区域对应的图像为该目标检测模型中多个商标名称对应的商标图形的 可能性较小。 [0086] S205. Determine whether the confidence of each first area is greater than a first threshold. [0087] Here, since the confidence of the first area is the largest matching probability among the matching probabilities between the image corresponding to the first area and the trademark graphics corresponding to multiple trademark names in the target detection model, the confidence of the first area is greater than The first threshold value indicates that the image corresponding to the first area matches the trademark graphics corresponding to multiple trademark names in the target detection model with a high probability of matching, that is, the image corresponding to the first area corresponds to multiple trademark names in the target detection model. Is more likely to have a trademark graphic; the confidence of the first area is less than the first threshold, indicating that the image corresponding to the first area is less likely to be a trademark graphic corresponding to multiple trademark names in the target detection model.
[0088] S206 , 当至少一个第一区域的置信度大于或等于第一阈值时, 检测到第一图像 中存在商标。 [0088] S206, when the confidence of at least one first area is greater than or equal to the first threshold, it is detected that a trademark exists in the first image.
[0089] S207 , 当各个第一区域的置信度均小于第一阈值时, 检测到第一图像中不存在 商标。 [0089] S207, when the confidence of each first area is less than the first threshold, it is detected that there is no trademark in the first image.
[0090] 如前所述, 假设确定了 1890个第一区域, 每个第一区域都有对应的置信度, 假 设第一阈值为 0.60, 当 1890个第一区域中至少存在一个第一区域的置信度大于或 等于 0.60时, 则说明第一图像中存在商标, 当 1890个第一区域中每个第一区域的 置信度均小于 0.60时, 则说明第一图像中不存在商标。 这里, 第一阈值可以为 0. 60、 0.75、 0.85等任意数值。 至此, 检测出了第一图像中是否存在商标。 [0090] As mentioned above, it is assumed that 1890 first regions are determined, and each first region has a corresponding confidence level. Assuming that the first threshold is 0.60, when there is at least one first region in the 1890 first regions When the confidence is greater than or equal to 0.60, it means that there is a trademark in the first image. When the confidence of each of the 1890 first regions is less than 0.60, it means that there is no trademark in the first image. Here, the first threshold may be any value such as 0.60, 0.75, 0.85, etc. So far, it is detected whether the trademark exists in the first image.
[0091] 本发明实施例中, 通过目标检测模型对图像进行检测, 该目标检测模型中的卷 积网络结构包括残差单元、 上采样单元、 融合单元、 卷积特征层以及一般卷积 层, 残差单元可以使卷积网络结构在很深的情况下仍能收敛, 残差单元中的卷 积层可以压缩每次卷积得到的图像对应的特征, 减少了模型中的参数量以及计 算量; 上采样单元对图像进行上采样使网络层数在很深的情况下表达效果也很 好; 融合单元使网络同时学习深层与浅层特征, 使卷积后的图像对应的特征更 多; 卷积特征层输出三个不同尺度的卷积后的图像, 从而可以对三个不同尺度 的卷积后的图像进行检测, 可以检测出图像中是否存在商标。 由于检测过程中 减少了模型中的参数量以及计算量, 从而提高了商标检测的效率, 且通过对三 个不同尺度的卷积后的图像进行检测, 提高了商标检测的准确度。 [0091] In the embodiment of the present invention, the image is detected by a target detection model, the convolutional network structure in the target detection model includes a residual unit, an upsampling unit, a fusion unit, a convolution feature layer, and a general convolution layer, The residual unit can enable the convolutional network structure to converge in deep conditions. The convolutional layer in the residual unit can compress the features corresponding to the image obtained by each convolution, reducing the amount of parameters and calculations in the model ; The upsampling unit upsamples the image so that the network layer number is very good when the network layer is deep; the fusion unit enables the network to learn deep and shallow features at the same time, so that the convolutional image corresponds to more features; volume The product feature layer outputs three convolutional images of different scales, so that three convolutional images of different scales can be detected, and whether a trademark exists in the image can be detected. Because the amount of parameters and calculations in the model are reduced during the detection process, the efficiency of trademark detection is improved, and the accuracy of trademark detection is improved by detecting three convolutional images of different scales.
[0092] 可选地, 在利用上述目标检测检测模型对第一图像进行检测, 以确定第一商标 在第一图像中的位置坐标和第一商标对应的商标名称之前, 还可以先训练目标 检测模型, 训练目标检测模型时, 首先对训练样本图像进行标注, 包括对商标 的位置、 商标名称等信息进行标注。 然后利用标注好的样本图像对初始模型进 行训练, 待模型收敛并达到一定精度 (指模型中的损失函数值小于损失阈值且 精度大于精度阈值) 时保存模型, 该保存下来的模型就是目标检测检测模型。 [0092] Optionally, before using the above target detection detection model to detect the first image to determine the position coordinates of the first trademark in the first image and the trademark name corresponding to the first trademark, the target may also be trained first Detection model. When training the target detection model, first mark the training sample image, including marking the position of the trademark, trademark name and other information. Then use the marked sample images to train the initial model, and save the model when the model converges and reaches a certain accuracy (referring to the loss function value in the model is less than the loss threshold and the accuracy is greater than the accuracy threshold), the saved model is the target detection detection model.
[0093] 在一种可能的实现方式中, 在检测到第一图像中存在商标的情况下, 确定第一 图像中至少一个第一商标的特征信息的方法如下, 如图 9所示, 图 9为本发明实 施例提供的一种信息推广方法的流程示意图, 该方法包括: [0093] In a possible implementation manner, in a case where a trademark is detected in the first image, a method for determining feature information of at least one first trademark in the first image is as follows, as shown in FIG. 9 and FIG. 9 This is a schematic flowchart of an information promotion method provided by an embodiment of the present invention. The method includes:
[0094] S301 , 将置信度大于或等于第一阈值的第一区域组成的集合确定为第一集合。 [0094] S301: Determine a set consisting of a first region with a confidence greater than or equal to a first threshold as a first set.
[0095] 如前所述, 假设确定了 1890个第一区域, 每个第一区域都有对应的置信度, 假 设第一阈值为 0.60, 该 1890个第一区域中有 200个第一区域的置信度大于或等于 0 .60, 则第一集合为该 200个第一区域组成的集合。 [0095] As mentioned above, it is assumed that 1890 first regions are determined, and each first region has a corresponding confidence level. Assuming that the first threshold is 0.60, there are 200 first regions in the 1890 first regions. If the confidence is greater than or equal to 0.60, then the first set is the set consisting of the 200 first regions.
[0096] S302, 在第一集合中的第一区域的数量为多个的情况下, 将第一集合中置信度 最大的第一区域确定为第一目标区域。 [0096] S302. In a case where the number of first regions in the first set is multiple, determine the first region with the highest confidence in the first set as the first target region.
[0097] 例如, 第一集合由{第一区域 1, 第一区域 2, 第一区域 3, 第一区域 4}组成时, 且各个第一区域的置信度分别为 0.42、 0.55、 0.85、 0.95,则将置信度为 0.95的第 一区域 4确定为第一目标区域。 For example, when the first set consists of {first area 1, first area 2, first area 3, and first area 4}, and the confidence of each first area is 0.42, 0.55, 0.85, 0.95, respectively , Then the first area 4 with a confidence level of 0.95 is determined as the first target area.
[0098] S303 , 在第一集合中的第一区域的数量为一个的情况下, 将第一集合中的第一 区域确定为第一目标区域。 [0098] S303: When the number of first regions in the first set is one, determine the first region in the first set as the first target region.
[0099] 例如, 第一集合由{第一区域 1}组成时, 则第一区域 1为第一目标区域。 [0099] For example, when the first set consists of {first region 1}, then the first region 1 is the first target region.
[0100] S304, 将第一目标区域对应的位置坐标和第一目标区域对应的商标名称确定为 一个第一商标的特征信息。 [0100] S304: Determine the position coordinates corresponding to the first target area and the brand name corresponding to the first target area as feature information of a first trademark.
[0101] 例如, 第一目标区域为第一区域 1, 则将第一区域 1对应的位置坐标以及第一区 域 1对应的商标名称确定为一个第一商标的特征信息, 即第一图像中第一个第一 商标的特征信息。 [0101] For example, if the first target area is the first area 1, the position coordinates corresponding to the first area 1 and the trademark name corresponding to the first area 1 are determined as the characteristic information of the first trademark, that is, the first image Characteristic information of a first trademark.
[0102] 这里, 通过步骤 S301-S304, 确定出第一图像中的第一个商标在第一图像中所 在的区域, 该区域为第一目标区域, 通过确定第一目标区域的位置坐标和商标 名称, 可确定出该商标在第一图像中的位置坐标和该商标的名称, 即确定出第 一图像中第一个商标的特征信息。 在一些可能的情况中, 第一图像中可能存在 多个商标, 在确定一个商标的特征信息后, 还可以通过步骤 S305~S310确定第一 图像中是否有第二个商标的存在, 并在第二个商标存在的情况下确定第二个商 标的特征信息。 [0102] Here, through steps S301-S304, an area where the first trademark in the first image is located in the first image is determined, which is the first target area, and by determining the position coordinates and trademark of the first target area The name can determine the position coordinates of the trademark in the first image and the name of the trademark, that is, the characteristic information of the first trademark in the first image. In some possible cases, there may be in the first image For multiple trademarks, after determining the characteristic information of one trademark, it is also possible to determine whether there is a second trademark in the first image through steps S305-S310, and determine the second trademark's presence if the second trademark exists Feature information.
[0103] S305, 将第一集合中排除第一目标区域之后的第一区域组成的集合确定为第二 集合。 [0103] S305: Determine a set composed of the first area after excluding the first target area in the first set as the second set.
[0104] 假设第一集合由{第一区域 1, 第一区域 2, 第一区域 3 , 第一区域 4}组成, 且第 一目标区域为第一区域 4, 则第二集合为{第一区域 1, 第一区域 2, 第一区域 3}组 成的集合。 [0104] Assuming that the first set consists of {first area 1, first area 2, first area 3, first area 4}, and the first target area is the first area 4, then the second set is {first Area 1, first area 2, first area 3}.
[0105] S306, 判断第二集合中每个第一区域与第一目标区域的重合度是否大于或等于 第二阈值。 [0105] S306. Determine whether the degree of coincidence between each first area and the first target area in the second set is greater than or equal to a second threshold.
[0106] 本发明实施例中, 可以通过比较 I0U的方式判断第二集合中每个第一区域与第 一目标区域的重合度, 并将计算得到的重合度与第二阈值进行比较。 I0U又称为 交并比, 即两个区域中图像的重合度, 计算第一区域 1与第一区域 2的 I0U具体是 指计算第一区域 1与第一区域 2在第一图像中的重合度, I0U越大则代表两个区域 的重合度越大, I0U越小则代表两个区域的重合度越小。 第一区域 1与第一区域 2 的 I0U等于第一区域 1与第一区域 2的交集除以第一区域 1与第一区域 2的并集, 用 公式表达为: I0U=[area (C) fiarea (D) ]/ [area (C) Uarea (D) ], area (C) 为第一区域 1, area (D) 为第一区域 2。 具体实现中, 可以根据第一区域 1的位置 坐标和第一区域 2的位置坐标计算第一区域 1与第一区域 2的交并比。 这里, 第二 阈值为一个评估两个区域之间的重合度的临界点, 第二阈值具体可以为 0.60、 0. 85 , 等等。 当两个区域的 I0U大于第二阈值时, 说明两个区域的重合度较高。 [0106] In the embodiment of the present invention, the degree of coincidence between each first area and the first target area in the second set may be determined by comparing the IOU, and the calculated degree of coincidence may be compared with the second threshold. I0U is also called the cross-combination ratio, that is, the degree of coincidence of the images in the two regions, and calculating the I0U of the first region 1 and the first region 2 specifically refers to calculating the coincidence of the first region 1 and the first region 2 in the first image Degree, the greater the I0U, the greater the degree of coincidence between the two regions, the smaller the I0U, the smaller the degree of coincidence between the two regions. The I0U of the first area 1 and the first area 2 is equal to the intersection of the first area 1 and the first area 2 divided by the union of the first area 1 and the first area 2, expressed by the formula: I0U=[area (C) fiarea (D) ]/ [area (C) Uarea (D) ], area (C) is the first area 1, and area (D) is the first area 2. In a specific implementation, the intersection ratio of the first area 1 and the first area 2 may be calculated according to the position coordinates of the first area 1 and the position coordinates of the first area 2. Here, the second threshold is a critical point for evaluating the degree of coincidence between two regions, and the second threshold may specifically be 0.60, 0.85, etc. When the I0U of the two regions is greater than the second threshold, it means that the overlap of the two regions is high.
[0107] S307, 当第二集合中每个第一区域与第一目标区域的重合度均小于第二阈值时 [0107] S307, when the degree of coincidence between each first area and the first target area in the second set is less than the second threshold
, 确定第二集合的第二目标区域。 To determine the second target area of the second set.
[0108] 这里, 第二集合的第二目标区域为第二集合中置信度最大的第一区域。 例如, 假设第二阈值为 0.65, 第二集合中包括{第一区域 A,第一区域 B, 第一区域 C, 第 一区域 D}, 且置信度分别为 0.62、 0.68、 0.75、 0.85, 第一目标区域为第一区域 E , 当第一区域 A、 第一区域 B、 第一区域 C、 第一区域 D与第一区域 E的重合度均 小于 0.65, 则第二集合的第二目标区域为第一区域 D。 [0109] S308, 当第二集合中至少一个第一区域与第一目标区域的重合度大于或等于第 二阈值时, 将第二集合中与第一目标区域的重合度大于或等于第二阈值的第一 区域确定为第二区域, 并将第二集合中排除第二区域之后的第一区域组成的集 合确定为第三集合, 且确定第三集合的第三目标区域。 [0108] Here, the second target area of the second set is the first area with the highest confidence in the second set. For example, assuming that the second threshold is 0.65, the second set includes {first area A, first area B, first area C, first area D}, and the confidence levels are 0.62, 0.68, 0.75, 0.85, respectively. A target area is the first area E. When the coincidence of the first area A, the first area B, the first area C, the first area D and the first area E is less than 0.65, the second target area of the second set It is the first area D. [0109] S308, when the degree of coincidence between at least one first area and the first target area in the second set is greater than or equal to the second threshold, the degree of coincidence with the first target area in the second set is greater than or equal to the second threshold The first area of is determined as the second area, and the set of the first area after the second area is excluded from the second set is determined as the third set, and the third target area of the third set is determined.
[0110] 这里, 第三集合的第三目标区域为第三集合中置信度最大的第一区域。 例如, 假设第二阈值为 0.65, 第二集合中包括{第一区域 F,第一区域 G, 第一区域 H, 第 一区域 I}, 且置信度分别为 0.65、 0.69、 0.78、 0.86, 第一目标区域为第一区域 J , 且第一区域 F、 第一区域 G与第一区域 J的重合度小于 0.65, 第一区域 H、 第一 区域 I与第一区域 J的重合度大于 0.65, 则第二区域包括第一区域 H、 第一区域 I, 则第三集合为第二集合中排除第二区域之后的第一区域组成的集合, 即第三集 合为{第一区域 F、 第一区域 G}, 第三目标区域为第三集合中的第一区域 G。 [0110] Here, the third target area of the third set is the first area with the highest confidence in the third set. For example, assuming that the second threshold is 0.65, the second set includes {first region F, first region G, first region H, first region I}, and the confidence levels are 0.65, 0.69, 0.78, 0.86, respectively. A target area is the first area J, and the degree of coincidence of the first area F, the first area G, and the first area J is less than 0.65, and the degree of coincidence of the first area H, the first area I, and the first area J is greater than 0.65, Then the second area includes the first area H and the first area I, then the third set is a set of the first area after the second area is excluded from the second set, that is, the third set is {first area F, first Area G}, the third target area is the first area G in the third set.
[0111] 可选地, 在当第二集合中的每个第一区域与第一目标区域的重合度均大于或等 于第二阈值的情况下, 确定第二集合中不存在第二目标区域, 即确定第一图像 中不存在第二个商标。 [0111] Optionally, when the degree of coincidence of each first region and the first target region in the second set is greater than or equal to the second threshold, it is determined that the second target region does not exist in the second set, That is, it is determined that the second trademark does not exist in the first image.
[0112] S309 , 将第二目标区域对应的位置坐标和第二目标区域对应的商标名称确定为 另一个第一商标的特征信息。 [0112] S309: Determine the position coordinates corresponding to the second target area and the brand name corresponding to the second target area as feature information of another first trademark.
[0113] 通过步骤 S305-S308确定了第一图像中存在另一个第一商标, 即第一图像中存 在两个第一商标, 则将第一图像中的第二个商标对应的第二目标区域对应的位 置坐标和第二目标区域对应的商标名称确定为第二个商标的特征信息。 [0113] It is determined through steps S305-S308 that there is another first trademark in the first image, that is, there are two first trademarks in the first image, and then the second target area corresponding to the second trademark in the first image The corresponding position coordinates and the trademark name corresponding to the second target area are determined as the characteristic information of the second trademark.
[0114] 如步骤 S307中确定出第二集合的第二目标区域为第一区域 D, 则将第一区域 D 对应的位置坐标和第一区域 D对应的商标名称确定为第二个商标的特征信息。 [0114] If it is determined in step S307 that the second target area of the second set is the first area D, then the position coordinates corresponding to the first area D and the trademark name corresponding to the first area D are determined as the characteristics of the second trademark information.
[0115] S310, 将第三目标区域对应的位置坐标和第三目标区域对应的商标名称确定为 另一个第一商标的特征信息。 [0115] S310: Determine the position coordinates corresponding to the third target area and the brand name corresponding to the third target area as feature information of another first trademark.
[0116] 这里, 可知第一图像中有两个第一商标, 另一个第一商标的特征信息为第一图 像中的第二个第一商标的特征信息。 假设确定出第三目标区域为第一区域 G, 则 将第一区域 G对应的位置坐标和第一区域 G对应的商标名称确定为另一个第一商 标的特征信息, 即第二个第一商标的特征信息。 [0116] Here, it can be known that there are two first trademarks in the first image, and the characteristic information of the other first trademark is the characteristic information of the second first trademark in the first image. Assuming that the third target area is determined to be the first area G, the position coordinates corresponding to the first area G and the trademark name corresponding to the first area G are determined as the characteristic information of another first trademark, that is, the second first trademark Characteristic information.
[0117] 例如, 步骤 S308-S309中确定出来的第二目标区域与第三目标区域为第一图像 中的另一个第一商标 (即第一图像中的第二个第一商标) 对应的区域, 根据目 标区域确定第一图像中第二个第一商标的特征信息可以如图 10中所示, 图 10是 本发明实施例提供的一种确定出第一图像中两个第一商标的特征信息的示意图 , 由于图中有两个第一商标, 一个第一商标 (第一个商标) 的特征信息可以为 图 10中的商标名称“NIKE”以及商标在第一图像中的位置坐标对应的区域 10e, 另 一个第一商标 (第二个第一商标) 的特征信息可以为图 10中的商标名称“特步”以 及商标在第一图像中的位置坐标对应的区域 10d。 [0117] For example, the second target area and the third target area determined in steps S308-S309 are the first image The area corresponding to another first trademark in the first image (that is, the second first trademark in the first image), and determining the characteristic information of the second first trademark in the first image according to the target area may be as shown in FIG. 10, FIG. 10 is a schematic diagram of determining feature information of two first trademarks in a first image provided by an embodiment of the present invention. Since there are two first trademarks in the figure, the characteristics of one first trademark (first trademark) The information may be the trademark name “NIKE” in FIG. 10 and the area 10e corresponding to the position coordinates of the trademark in the first image, and the characteristic information of another first trademark (second first trademark) may be the trademark in FIG. 10 An area 10d corresponding to the name "Xtep" and the position coordinates of the trademark in the first image.
[0118] 这里, 通过步骤 S305-S310, 确定出第一图像中的另一个第一商标 (第二个第 一商标) 在第一图像中所在的区域, 该区域为第二目标区域, 通过确定第二目 标区域的位置坐标和商标名称, 可确定出第一图像中的另一个第一商标 (第二 个第一商标) 的特征信息。 同理, 还可以参考步骤 S305-S310的具体实现方式确 定第一图像中是否有更多的第一商标的存在以及确定更多的第一商标的特征信 息, 直到依次确定出第一图像中的所有第一商标的特征信息。 以下通过举例的 方式来对依次确定出第一图像中所有第一商标的特征信息的过程进行描述。 [0118] Here, through steps S305-S310, an area where another first trademark (the second first trademark) in the first image is located in the first image is determined, and the area is the second target area, which is determined by The position coordinates and the brand name of the second target area can determine the characteristic information of another first trademark (second first trademark) in the first image. Similarly, you can also refer to the specific implementation manner of steps S305-S310 to determine whether there are more first trademarks in the first image and to determine more feature information of the first trademark, until the first image is sequentially determined Characteristic information of all first trademarks. The following describes the process of sequentially determining the characteristic information of all the first trademarks in the first image by way of example.
[0119] 参见图 11, 图 11是本发明实施例提供的一种在第一图像中确定出多个第一商标 对应的区域的示意图, 首先, 假设第二阈值为 0.60, 确定出的第一图像中第一集 合有 9个第一区域 (图中虚线框代表第一区域) , 分别为 A、 B、 C、 D、 E、 F、 [0119] Referring to FIG. 11, FIG. 11 is a schematic diagram of determining regions corresponding to multiple first trademarks in a first image provided by an embodiment of the present invention. First, assuming that the second threshold is 0.60, the determined first There are 9 first areas in the first set in the image (the dotted frame in the figure represents the first area), namely A, B, C, D, E, F,
G、 H、 I, 置信度分别为 0.60、 0.62、 0.65、 0.73、 0.95、 0.75、 0.65、 0.68、 0.60 , 则将置信度为 0.95的第一区域 E确定为第一目标区域, 第二集合为第一集合中 排除第一目标区域后的第一区域组成的集合, 即第二集合由 A、 B、 C、 D、 F、G, H, I, the confidence levels are 0.60, 0.62, 0.65, 0.73, 0.95, 0.75, 0.65, 0.68, 0.60, then the first area E with the confidence level of 0.95 is determined as the first target area, and the second set is In the first set, the set consisting of the first area after excluding the first target area, that is, the second set consists of A, B, C, D, F,
G、 H、 I组成。 则第一目标区域 E为第一图像中的一个第一商标所在的区域, 则 确定出第一图像中的第一个第一商标所在的区域。 G, H, I composition. Then, the first target area E is an area where a first trademark in the first image is located, and then the area where the first first trademark in the first image is located is determined.
[0120] 其次, 分别判断第二集合中 A、 B、 C、 D、 F、 G、 H、 I与 E的重合度。 当 A、 B 、 C、 D、 F、 G、 H、 I与 E的 I0U都小于 0.60, 则第二集合的第二目标区域为 F, 则第二目标区域 F为第一图像中的第二个 (另一个) 第一商标所在的区域, 则确 定出第一图像中的第二个第一商标所在的区域; 当第二集合中 A、 B、 C、 D、 F 、 G、 H、 I中至少一个第一区域与 E的 I0U大于或等于 0.60, 假设第二集合中 A、 [0120] Second, the degree of coincidence of A, B, C, D, F, G, H, I, and E in the second set is separately determined. When the I0U of A, B, C, D, F, G, H, I, and E are all less than 0.60, the second target area of the second set is F, and the second target area F is the second in the first image (The other) the area where the first trademark is located, then the area where the second first trademark in the first image is located is determined; when A, B, C, D, F, G, H, I in the second set I0U of at least one of the first region and E is greater than or equal to 0.60, assuming that A,
B、 C、 D、 F、 G、 H、 与 E的 I0U都大于或等于 0.60, 则第二区域包括 A、 B、 C 、 D、 F、 G、 H, 则第三集合为第二集合中排除第二区域的第一区域组成的集合 , 即第三集合为 I, 则第三目标区域为 I, 则第三目标区域 I为第一图像中的第二 个第一商标所在的区域, 则确定出第一图像中的第二个第一商标所在的区域。 The I0U of B, C, D, F, G, H, and E are greater than or equal to 0.60, then the second area includes A, B, C , D, F, G, H, then the third set is the set of the first area excluding the second area in the second set, that is, the third set is I, the third target area is I, and the third target area I is the area where the second first trademark in the first image is located, and then the area where the second first trademark in the first image is located is determined.
[0121] 假设第二集合中 A、 B、 C、 D与 E的 IOU大于或等于 0.60, 则第二区域包括 A、 [0121] Assuming that the IOUs of A, B, C, D, and E in the second set are greater than or equal to 0.60, the second area includes A,
B、 C、 D, 第三集合为第二集合排除第二区域后的集合, 即第三集合由 F、 G、B, C, D, the third set is the set after the second set excludes the second area, that is, the third set consists of F, G,
H、 I组成, 且第三目标区域为第三集合中置信度最大的区域, 即 F。 则将第三区 域中排除 F后的区域确定为第四集合, 由于第四集合中还存在多个第一区域, 则 继续判断第四集合中 G、 H、 I与第三目标区域 F的重合度, 假设 G、 H、 I与 F的 重合度均大于 0.60, 则第三区域包括 G、 H、 I, 第四集合中排除第三区域后为空 集, 则第三目标区域 F为第一图像中的第三个第一商标所在的区域, 则确定出第 一图像中的第三个第一商标所在的区域。 最终确定出在第一图像中三个第一商 标对应的区域, 即第一目标区域、 第二目标区域、 第三目标区域, 即第一图像 中存在三个第一商标。 It consists of H and I, and the third target area is the area with the highest confidence in the third set, namely F. Then, the area after excluding F in the third area is determined as the fourth set. Since there are multiple first areas in the fourth set, continue to judge the coincidence of G, H, I in the fourth set with the third target area F Degree, assuming that the coincidence of G, H, I, and F are all greater than 0.60, then the third area includes G, H, and I. The fourth set is the empty set after excluding the third area, and the third target area F is the first The area where the third first trademark in the image is located, then the area where the third first trademark in the first image is located is determined. Finally, it is determined that the areas corresponding to the three first trademarks in the first image, that is, the first target area, the second target area, and the third target area, that is, there are three first trademarks in the first image.
[0122] 最后, 将第一图像中存在的三个第一商标对应的三个目标区域对应的位置坐标 和商标名称确定为第一图像对应的三个第一商标的特征信息, 即将第一目标区 域对应的位置坐标和商标名称确定为第一个第一商标的特征信息、 将第二目标 区域对应的位置坐标和商标名称确定为第二个第一商标的特征信息、 将第三目 标区域对应的位置坐标和商标名称确定为第三个第一商标的特征信息。 至此, 确定出了第一图像中所有第一商标的特征信息。 [0122] Finally, the position coordinates and trademark names corresponding to the three target regions corresponding to the three first trademarks existing in the first image are determined as the characteristic information of the three first trademarks corresponding to the first image, that is, the first target The position coordinates and trademark names corresponding to the area are determined as the characteristic information of the first first trademark, the position coordinates and trademark names corresponding to the second target area are determined as the characteristic information of the second first trademark, and the third target area is corresponding The position coordinates and trademark name are determined as the characteristic information of the third first trademark. So far, the characteristic information of all the first trademarks in the first image is determined.
[0123] 本发明实施例中, 当检测到图像中存在商标时, 通过比较重合度的方法去除了 同一位置的重复商标, 确定了图像中商标所在的区域以及商标对应的商标名称 , 通过比较重合度的方法去除了重复商标, 提高了商标检测的速度与准确度。 [0123] In the embodiment of the present invention, when a trademark is detected in the image, the duplicate trademark at the same position is removed by comparing the overlapping degree, the area where the trademark is located in the image and the trademark name corresponding to the trademark are determined, and the overlap is compared by comparing The degree method removes duplicate trademarks and improves the speed and accuracy of trademark detection.
[0124] 上面介绍了发明实施例的方法, 下面介绍发明实施例的装置。 [0124] The method of the embodiment of the invention is described above, and the device of the embodiment of the invention is described below.
[0125] 参见图 12, 图 12是本发明实施例提供的一种信息推广装置的组成结构示意图, 该装置 50包括: [0125] Referring to FIG. 12, FIG. 12 is a schematic structural diagram of an information promotion device provided by an embodiment of the present invention. The device 50 includes:
[0126] 获取模块 501, 用于获取第一图像, 所述第一图像为商场中的展示屏对应的拍 摄装置拍摄到的图像; [0126] an acquisition module 501, configured to acquire a first image, where the first image is an image captured by a photographing device corresponding to a display screen in a shopping mall;
[0127] 检测模块 502, 用于检测所述第一图像中是否存在商标; [0128] 信息确定模块 503 , 用于在检测到所述第一图像中存在商标的情况下, 确定所 述第一图像中至少一个第一商标的特征信息; [0127] The detection module 502 is configured to detect whether a trademark exists in the first image; [0128] The information determining module 503 is configured to determine the characteristic information of at least one first trademark in the first image when it is detected that the trademark exists in the first image;
[0129] 查找模块 504, 用于在商标信息库中查找与每个第一商标的特征信息匹配的第 二商标; [0129] A search module 504, configured to search for a second trademark matching the characteristic information of each first trademark in the trademark information database;
[0130] 显示模块 505 , 用于在所述展示屏中显示所述第二商标对应的推广信息, 所述 推广信息包括所述第二商标对应的商品在所述商场中的位置信息。 [0130] The display module 505 is configured to display promotion information corresponding to the second trademark on the display screen, where the promotion information includes position information of the commodity corresponding to the second trademark in the mall.
[0131] 在一种可能的设计中, 每个第一商标的特征信息包括第一商标的商标名称以及 第一商标在所述第一图像中的位置坐标, 所述查找模块 504具体用于根据所述第 一商标的名称在所述商标信息库中查找与所述第一商标的名称相同的商标, 将 所述与所述第一商标的名称相同的商标确定为第二商标; 或者, 根据所述第一 商标在所述第一图像中的位置坐标对应的目标图像, 在所述商标信息库中查找 与所述目标图像的相似度高于相似度阈值的商标图案, 将所述与所述目标图像 的相似度高于相似度阈值的商标图案确定为第二商标。 [0131] In a possible design, the characteristic information of each first trademark includes the trademark name of the first trademark and the position coordinates of the first trademark in the first image, the search module 504 is specifically configured to The name of the first trademark is searched for in the trademark information database with the same name as the first trademark, and the trademark with the same name as the first trademark is determined as the second trademark; or, based on A target image corresponding to the position coordinates of the first trademark in the first image, search for a trademark pattern in the trademark information database whose similarity to the target image is higher than a similarity threshold, and compare the The trademark pattern whose similarity of the target image is higher than the similarity threshold is determined as the second trademark.
[0132] 在一种可能的设计中, 所述显示模块 505具体用于在所述第二商标为多个的情 况下, 在商场的展示屏中按顺序切换显示所述多个所述第二商标对应的推广信 息, 或者, 在所述第二商标为多个的情况下, 在商场的展示屏中同时显示所述 多个所述第二商标对应的推广信息。 [0132] In a possible design, the display module 505 is specifically configured to switch and display the plurality of second trademarks in order on a display screen of a shopping mall in the case of multiple second trademarks The promotion information corresponding to the trademark, or, when there are multiple second trademarks, the promotion information corresponding to the multiple second trademarks are displayed simultaneously on the display screen of the mall.
[0133] 在一种可能的设计中, 所述检测模块 502包括: [0133] In a possible design, the detection module 502 includes:
[0134] 处理单元 5021 用于基于目标检测模型中的卷积层对所述第一图像进行卷积处 理, 得到目标卷积特征图, 所述目标卷积特征图包括多个目标卷积特征子图; [0134] The processing unit 5021 is configured to perform convolution processing on the first image based on the convolution layer in the target detection model to obtain a target convolution feature map, where the target convolution feature map includes multiple target convolution feature sub Figure;
[0135] 确定单元 5022, 用于分别确定所述多个目标卷积特征子图中各个目标卷积特征 子图对应的目标卷积特征信息; [0135] The determining unit 5022 is configured to determine target convolution feature information corresponding to each target convolution feature sub-graph in the multiple target convolution feature sub-graphs, respectively;
[0136] 所述确定单元 5022, 还用于分别确定各个目标卷积特征信息与所述目标检测模 型中多个商标名称之间的匹配概率, 以及, 所述各个目标卷积特征信息对应的 位置坐标, 并分别将在所述第一图像中与所述各个目标卷积特征信息对应的位 置坐标所对应的区域作为所述各个目标卷积特征信息对应的第一区域; [0136] The determining unit 5022 is further configured to determine a matching probability between each target convolution feature information and a plurality of trademark names in the target detection model, and a position corresponding to each target convolution feature information Coordinates, and the area corresponding to the position coordinates corresponding to the respective target convolution feature information in the first image is used as the first area corresponding to the respective target convolution feature information;
[0137] 匹配单元 5023 用于分别根据所述各个目标卷积特征信息与所述目标检测模型 中多个商标名称之间的匹配概率, 确定所述各个目标卷积特征信息对应的最大 匹配概率, 分别将所述各个目标卷积特征信息对应的最大匹配概率确定为所述 各个目标卷积特征信息对应的第一区域的置信度, 并分别将所述各个目标卷积 特征信息对应的最大匹配概率所对应的商标名称作为与所述各个目标卷积特征 信息对应的第一区域所对应的商标名称; [0137] The matching unit 5023 is configured to determine the maximum corresponding to each target convolution feature information according to the matching probability between each target convolution feature information and multiple trademark names in the target detection model, respectively Matching probability, respectively determining the maximum matching probability corresponding to each target convolution feature information as the confidence of the first region corresponding to each target convolution feature information, and respectively corresponding to each target convolution feature information The brand name corresponding to the maximum matching probability is used as the brand name corresponding to the first region corresponding to each target convolution feature information;
[0138] 判断单元 5024, 用于判断各个第一区域的置信度是否大于第一阈值; [0138] The determining unit 5024 is configured to determine whether the confidence of each first area is greater than a first threshold;
[0139] 所述判断单元 5024, 还用于当至少一个第一区域的置信度大于或等于所述第一 阈值时, 检测到所述第一图像中存在商标; [0139] The judgment unit 5024 is further configured to detect the presence of a trademark in the first image when the confidence of at least one first area is greater than or equal to the first threshold;
[0140] 所述判断单元 5024, 还用于当所述各个第一区域的置信度均小于所述第一阈值 时, 检测到所述第一图像中不存在商标。 [0140] The judging unit 5024 is further configured to detect that there is no trademark in the first image when the confidence of each first area is less than the first threshold.
[0141] 在一种可能的设计中, 所述匹配单元 5023, 具体用于: [0141] In a possible design, the matching unit 5023 is specifically configured to:
[0142] 将置信度大于或等于所述第一阈值的第一区域组成的集合确定为第一集合; [0142] determine a set consisting of a first region with a confidence greater than or equal to the first threshold as the first set;
[0143] 在所述第一集合中的第一区域的数量为多个的情况下, 将所述第一集合中置信 度最大的第一区域确定为第一目标区域; [0143] In a case where the number of first regions in the first set is multiple, the first region with the highest confidence in the first set is determined as the first target region;
[0144] 在所述第一集合中的第一区域的数量为一个的情况下, 将所述第一集合中的第 一区域确定为第一目标区域; [0144] When the number of first regions in the first set is one, determine the first region in the first set as the first target region;
[0145] 将所述第一目标区域对应的位置坐标和所述第一目标区域对应的商标名称确定 为所述第一特征信息, 并将所述第一特征信息确定为一个第一商标的特征信息 [0145] determining the position coordinates corresponding to the first target area and the trademark name corresponding to the first target area as the first feature information, and determining the first feature information as a feature of a first trademark Information
[0146] 在一种可能的设计中, 所述匹配单元 5023, 具体用于: [0146] In a possible design, the matching unit 5023 is specifically configured to:
[0147] 将所述第一集合中排除所述第一目标区域之后的第一区域组成的集合确定为第 二集合; [0147] determining the set consisting of the first area after excluding the first target area from the first set as the second set;
[0148] 判断所述第二集合中每个第一区域与所述第一目标区域的重合度是否大于或等 于第二阈值; [0148] determine whether the degree of coincidence of each first area in the second set with the first target area is greater than or equal to a second threshold;
[0149] 当所述第二集合中每个第一区域与所述第一目标区域的重合度均小于所述第二 阈值时, 确定所述第二集合的第二目标区域; [0149] When the degree of coincidence between each first area and the first target area in the second set is less than the second threshold, determine the second target area of the second set;
[0150] 当所述第二集合中至少一个第一区域与所述第一目标区域的重合度大于或等于 所述第二阈值时, 将所述第二集合中与所述第一目标区域的重合度大于或等于 所述第二阈值的第一区域确定为第二区域, 并将所述第二集合中排除所述第二 区域之后的第一区域组成的集合确定为第三集合, 且确定所述第三集合的第三 目标区域; [0150] when the degree of coincidence of at least one first area in the second set with the first target area is greater than or equal to the second threshold, the The first area where the degree of coincidence is greater than or equal to the second threshold is determined as the second area, and the second set is excluded from the second set The set consisting of the first area after the area is determined as the third set, and the third target area of the third set is determined;
[0151] 将所述第二目标区域对应的位置坐标和所述第二目标区域对应的商标名称确定 为另一个第一商标的特征信息, 或将所述第三目标区域对应的位置坐标和所述 第三目标区域对应的商标名称确定为另一个第一商标的。 [0151] determining the position coordinates corresponding to the second target area and the trademark name corresponding to the second target area as the characteristic information of another first trademark, or determining the position coordinates and the corresponding position of the third target area The trademark name corresponding to the third target area is determined to be another first trademark.
[0152] 需要说明的是, 图 12对应的实施例中未提及的内容可参见方法实施例的描述, 这里不再赘述。 [0152] It should be noted that, for the content that is not mentioned in the embodiment corresponding to FIG. 12, reference may be made to the description of the method embodiment, and details are not repeated here.
[0153] 本发明实施例中, 通过目标检测模型对获取到的第一图像进行检测, 该目标检 测模型中的卷积网络结构包括残差单元、 上采样单元、 融合单元、 卷积特征层 以及一般卷积层, 残差单元可以使卷积网络结构在很深的情况下仍能收敛, 残 差单元中的卷积层可以压缩每次卷积得到的图像对应的特征, 减少了模型中的 参数量以及计算量; 上采样单元对图像进行上采样使网络层数在很深的情况下 表达效果也很好; 融合单元使网络同时学习深层与浅层特征, 使卷积后的图像 对应的特征更多; 卷积特征层输出三个不同尺度的卷积后的图像, 从而可以对 三个不同尺度的卷积后的图像进行检测, 提高了商标检测的效率以及准确度, 在检测到图像中存在商标的情况下, 通过比较重合度的方法去除了重复商标, 确定了第一图像中每个第一商标的特征信息, 在商标信息库中查找与每个第一 商标的特征信息匹配的第二商标, 并在商场的展示屏中显示与第二商标对应的 推广信息, 通过在商场展示屏上显示与检测到的商标对应的推广信息, 便于消 费者观看并查找商标对应的商品在商场中的位置, 由于第一图像为商场中的展 示屏对应的拍摄装置拍摄到的图像, 该图像为在该展示屏附近的消费者的图像 的可能性较高, 那么检测到的商标很可能为该消费者服装或者其他部位 (如背 包) 上的商标, 而消费者的着装在一定程度上可以用于反馈用户对商品的喜好 , 显示与商标相匹配的推广信息, 相当于是显示用户喜欢的商品的推广信息, 起到了针对性地投放推广信息的作用, 即根据用户的喜好进行推广, 由于是根 据用户的喜好进行推广, 可以实现推广信息的精准投放, 从而可以使得推广的 效果更好。 [0153] In the embodiment of the present invention, the acquired first image is detected by a target detection model, the convolutional network structure in the target detection model includes a residual unit, an upsampling unit, a fusion unit, a convolution feature layer, and In general convolutional layers, the residual unit can make the convolutional network structure converge in a deep situation. The convolutional layer in the residual unit can compress the features corresponding to the image obtained by each convolution, reducing the model. The amount of parameters and the amount of calculation; the upsampling unit upsamples the image so that the network layer number is very well expressed in deep conditions; the fusion unit enables the network to learn deep and shallow features at the same time, so that the convolved image corresponds to More features; the convolutional feature layer outputs three convolutional images of different scales, so that three convolutional images of different scales can be detected, which improves the efficiency and accuracy of trademark detection. In the case of trademarks, the duplicate trademarks are removed by comparing the coincidences, the characteristic information of each first trademark in the first image is determined, and the trademark information database is searched for matching the characteristic information of each first trademark The second trademark, and display the promotion information corresponding to the second trademark on the display screen of the mall. By displaying the promotion information corresponding to the detected trademark on the mall display screen, it is convenient for consumers to view and find the products corresponding to the trademark in the mall In the location, since the first image is an image captured by a shooting device corresponding to the display screen in the mall, the image is likely to be an image of the consumer near the display screen, then the detected trademark is likely to be The trademark on the consumer's clothing or other parts (such as a backpack), and the consumer's clothing can be used to feedback the user's preference for the product to a certain extent, and display promotional information matching the trademark, which is equivalent to displaying the user's favorite product The promotion information has played the role of targeted promotion information, that is, promotion according to the user's preference, because the promotion is based on the user's preference, can achieve accurate delivery of promotion information, which can make the promotion effect better.
[0154] 参见图 13 , 图 13是本发明实施例提供的一种信息推广装置的组成结构示意图, 该装置包括处理器 601、 存储器 602以及通信接口 603。 处理器 601连接到存储器 6 02和通信接口 603, 例如处理器 601可以通过总线连接到存储器 602和通信接口 60 3。 [0154] Referring to FIG. 13, FIG. 13 is a schematic structural diagram of an information promotion device provided by an embodiment of the present invention, The device includes a processor 601, a memory 602, and a communication interface 603. The processor 601 is connected to the memory 602 and the communication interface 603. For example, the processor 601 may be connected to the memory 602 and the communication interface 603 through a bus.
[0155] 处理器 601被配置为支持所述信息推广的装置执行图 2、 图 4、 图 9所述的信息推 广的方法中相应的功能。 该处理器 601可以是中央处理器 (central processing unit , CPU) , 网络处理器 (network processor, NP) , 硬件芯片或者其任意组合。 上述硬件芯片可以是专用集成电路 (application specific integrated circuit, ASIC ) , 可编程逻辑器件 (programmable logic device, PLD) 或其组合。 上述 PLD可 以是复杂可编程逻辑器件 (complex programmable logic device, CPLD) , 现场 可编程逻辑门阵列 (field-programmable gate [0155] The processor 601 is configured to support the information promotion apparatus to perform the corresponding functions in the information promotion method described in FIGS. 2, 4, and 9. The processor 601 may be a central processing unit (CPU), a network processor (NP), a hardware chip, or any combination thereof. The hardware chip may be an application specific integrated circuit (ASIC), a programmable logic device (programmable logic device, PLD), or a combination thereof. The above PLD may be a complex programmable logic device (CPLD), field-programmable gate array (field-programmable gate array)
array, FPGA) , 通用阵列逻辑 (generic array logic , GAL) 或其任意组合。 array, FPGA), generic array logic (GAL) or any combination thereof.
[0156] 存储器 602存储器用于存储程序代码等。 存储器 602可以包括易失性存储器 (vo latile memory , VM) , 例如随机存取存储器 (random access memory, RAM) [0156] The memory 602 is used for storing program codes and the like. The memory 602 may include a volatile memory (vo latile memory, VM), such as a random access memory (random access memory, RAM)
; 存储器 602也可以包括非易失性存储器 (non-volatile memory, NVM) , 例如 只读存储器 (read-only memory, ROM) , 快闪存储器 (flash memory) , 硬盘 (hard disk drive, HDD) 或固态硬盘 (solid-state drive, SSD) ; 存储器 602还 可以包括上述种类的存储器的组合。 The memory 602 may also include non-volatile memory (non-volatile memory, NVM), such as read-only memory (read-only memory, ROM), flash memory (flash memory), hard disk (hard disk drive, HDD) or Solid-state drive (solid-state drive, SSD); The memory 602 may also include a combination of the aforementioned types of memory.
[0157] 所述通信接口 603用于发送或接收数据。 [0157] The communication interface 603 is used to send or receive data.
[0158] 处理器 601可以调用所述程序代码以执行以下操作: [0158] The processor 601 may call the program code to perform the following operations:
[0159] 通过通信接口 603获取第一图像, 所述第一图像为商场中的展示屏对应的拍摄 装置拍摄到的图像; [0159] acquiring a first image through the communication interface 603, the first image being an image captured by a shooting device corresponding to a display screen in a shopping mall;
[0160] 检测所述第一图像中是否存在商标; [0160] detecting whether a trademark exists in the first image;
[0161] 在检测到所述第一图像中存在商标的情况下, 确定所述第一图像中至少一个第 一商标的特征信息; [0161] in the case where a trademark is detected in the first image, determining the characteristic information of at least one first trademark in the first image;
[0162] 在商标信息库中查找与每个第一商标的特征信息匹配的第二商标; [0162] Find the second trademark matching the characteristic information of each first trademark in the trademark information database;
[0163] 在所述展示屏中显示所述第二商标对应的推广信息, 所述推广信息包括所述第 二商标对应的商品在所述商场中的位置信息。 [0163] The promotion information corresponding to the second trademark is displayed on the display screen, and the promotion information includes location information of the commodity corresponding to the second trademark in the mall.
[0164] 需要说明的是, 各个操作的实现还可以对应参照图 2、 图 4、 图 9所示的方法实 施例的相应描述; 所述处理器 601还可以与通信接口 603配合执行上述方法实施 例中的其他操作。 [0164] It should be noted that the implementation of each operation may also correspond to the method shown in FIG. 2, FIG. 4, and FIG. 9. Corresponding description of the embodiment; the processor 601 can also cooperate with the communication interface 603 to perform other operations in the above method embodiments.
[0165] [0165]

Claims

权利要求书 Claims
[权利要求 i] 一种信息推广方法, 其特征在于, 包括: [Claim i] An information promotion method, characterized in that it includes:
获取第一图像, 所述第一图像为商场中的展示屏对应的拍摄装置拍摄 到的图像; Acquiring a first image, where the first image is an image captured by a shooting device corresponding to a display screen in a shopping mall;
检测所述第一图像中是否存在商标; Detecting whether a trademark exists in the first image;
在检测到所述第一图像中存在商标的情况下, 确定所述第一图像中至 少一个第一商标的特征信息; 在商标信息库中查找与每个第一商标的特征信息匹配的第二商标; 在所述展示屏中显示所述第二商标对应的推广信息, 所述推广信息包 括所述第二商标对应的商品在所述商场中的位置信息。 When it is detected that a trademark exists in the first image, determine the feature information of at least one first trademark in the first image; find a second matching the feature information of each first trademark in the trademark information database A trademark; displaying promotion information corresponding to the second trademark on the display screen, where the promotion information includes location information of commodities corresponding to the second trademark in the mall.
[权利要求 2] 根据权利要求 1所述的方法, 其特征在于, 每个第一商标的特征信息 包括第一商标的商标名称以及第一商标在所述第一图像中的位置坐标 [Claim 2] The method according to claim 1, wherein the characteristic information of each first trademark includes the trademark name of the first trademark and the position coordinates of the first trademark in the first image
所述在商标信息库中查找与每个第一商标的特征信息匹配的第二商标 , 包括: The searching for the second trademark matching the characteristic information of each first trademark in the trademark information database includes:
根据所述第一商标的名称在所述商标信息库中查找与所述第一商标的 名称相同的商标, 将所述与所述第一商标的名称相同的商标确定为第 二商标; 或者, Searching for a trademark having the same name as the first trademark in the trademark information database according to the name of the first trademark, and determining the trademark having the same name as the first trademark to be the second trademark; or,
根据所述第一商标在所述第一图像中的位置坐标对应的目标图像, 在 所述商标信息库中查找与所述目标图像的相似度高于相似度阈值的商 标图案, 将所述与所述目标图像的相似度高于相似度阈值的商标图案 确定为第二商标。 According to the target image corresponding to the position coordinates of the first trademark in the first image, search for a trademark pattern in the trademark information database whose similarity to the target image is higher than a similarity threshold, and compare the The trademark pattern whose similarity of the target image is higher than the similarity threshold is determined as the second trademark.
[权利要求 3] 根据权利要求 1所述的方法, 其特征在于, 所述在所述展示屏中显示 所述第二商标对应的推广信息, 包括: [Claim 3] The method according to claim 1, wherein the displaying of the promotion information corresponding to the second trademark on the display screen includes:
在所述第二商标为多个的情况下, 在所述展示屏中按顺序切换显示所 述多个所述第二商标对应的推广信息, 或者, When there are multiple second trademarks, the promotion information corresponding to the multiple second trademarks is switched and displayed in sequence on the display screen, or,
在所述第二商标为多个的情况下, 在所述展示屏中同时显示所述多个 所述第二商标对应的推广信息。 When there are multiple second trademarks, the promotion information corresponding to the multiple second trademarks is displayed simultaneously on the display screen.
[权利要求 4] 根据权利要求 1所述的方法, 其特征在于, 所述检测所述第一图像中 是否存在商标, 包括: [Claim 4] The method according to claim 1, wherein the detecting whether a trademark exists in the first image includes:
基于目标检测模型中的卷积层对所述第一图像进行卷积处理, 得到目 标卷积特征图, 所述目标卷积特征图包括多个目标卷积特征子图; 分别确定所述多个目标卷积特征子图中各个目标卷积特征子图对应的 目标卷积特征信息; Performing a convolution process on the first image based on the convolution layer in the target detection model to obtain a target convolution feature map, where the target convolution feature map includes multiple target convolution feature submaps; each of the multiple Target convolution feature information corresponding to each target convolution feature subgraph in the target convolution feature subgraph;
分别确定各个目标卷积特征信息与所述目标检测模型中多个商标名称 之间的匹配概率, 以及, 所述各个目标卷积特征信息对应的位置坐标 , 并分别将在所述第一图像中与所述各个目标卷积特征信息对应的位 置坐标所对应的区域作为所述各个目标卷积特征信息对应的第一区域 分别根据所述各个目标卷积特征信息与所述目标检测模型中多个商标 名称之间的匹配概率, 确定所述各个目标卷积特征信息对应的最大匹 配概率, 分别将所述各个目标卷积特征信息对应的最大匹配概率确定 为所述各个目标卷积特征信息对应的第一区域的置信度, 并分别将所 述各个目标卷积特征信息对应的最大匹配概率所对应的商标名称作为 与所述各个目标卷积特征信息对应的第一区域所对应的商标名称; 判断各个第一区域的置信度是否大于第一阈值; 当至少一个第一区域的置信度大于或等于所述第一阈值时, 检测到所 述第一图像中存在商标; Separately determine the matching probability between each target convolution feature information and a plurality of trademark names in the target detection model, and, the position coordinates corresponding to each target convolution feature information, and respectively in the first image The area corresponding to the position coordinates corresponding to the respective target convolution feature information is used as the first area corresponding to the respective target convolution feature information according to the plurality of target convolution feature information and the target detection model respectively. The matching probability between brand names, determining the maximum matching probability corresponding to each target convolution feature information, and determining the maximum matching probability corresponding to each target convolution feature information as the corresponding The confidence of the first area, and the trademark name corresponding to the maximum matching probability corresponding to each target convolution feature information is used as the trademark name corresponding to the first area corresponding to each target convolution feature information; Whether the confidence of each first area is greater than a first threshold; when the confidence of at least one first area is greater than or equal to the first threshold, it is detected that a trademark exists in the first image;
当所述各个第一区域的置信度均小于所述第一阈值时, 检测到所述第 一图像中不存在商标。 When the confidence of each of the first regions is less than the first threshold, it is detected that there is no trademark in the first image.
[权利要求 5] 根据权利要求 4所述的方法, 其特征在于, 所述确定所述第一图像中 至少一个第一商标的特征信息, 包括: [Claim 5] The method according to claim 4, wherein the determining the characteristic information of at least one first trademark in the first image includes:
将置信度大于或等于所述第一阈值的第一区域组成的集合确定为第一 集合; Determining a set consisting of first regions with a confidence greater than or equal to the first threshold as the first set;
在所述第一集合中的第一区域的数量为多个的情况下, 将所述第一集 合中置信度最大的第一区域确定为第一目标区域; 在所述第一集合中的第一区域的数量为一个的情况下, 将所述第一集 合中的第一区域确定为第一目标区域; When the number of first regions in the first set is multiple, determine the first region with the highest confidence in the first set as the first target region; When the number of first regions in the first set is one, determine the first region in the first set as the first target region;
将所述第一目标区域对应的位置坐标和所述第一目标区域对应的商标 名称确定为一个第一商标的特征信息。 The position coordinates corresponding to the first target area and the trademark name corresponding to the first target area are determined as the characteristic information of a first trademark.
[权利要求 6] 根据权利要求 5所述的方法, 其特征在于, 所述确定所述第一图像中 至少一个第一商标的特征信息, 还包括: [Claim 6] The method according to claim 5, wherein the determining the characteristic information of at least one first trademark in the first image further comprises:
将所述第一集合中排除所述第一目标区域之后的第一区域组成的集合 确定为第二集合; Determining the set consisting of the first area after excluding the first target area in the first set as the second set;
判断所述第二集合中每个第一区域与所述第一目标区域的重合度是否 大于或等于第二阈值; Determine whether the degree of coincidence of each first area in the second set with the first target area is greater than or equal to a second threshold;
当所述第二集合中每个第一区域与所述第一目标区域的重合度均小于 所述第二阈值时, 确定所述第二集合的第二目标区域; When the degree of coincidence between each first area in the second set and the first target area is less than the second threshold, determine the second target area in the second set;
当所述第二集合中至少一个第一区域与所述第一目标区域的重合度大 于或等于所述第二阈值时, 将所述第二集合中与所述第一目标区域的 重合度大于或等于所述第二阈值的第一区域确定为第二区域, 并将所 述第二集合中排除所述第二区域之后的第一区域组成的集合确定为第 三集合, 且确定所述第三集合的第三目标区域; 将所述第二目标区域对应的位置坐标和所述第二目标区域对应的商标 名称确定为另一个第一商标的特征信息, 或将所述第三目标区域对应 的位置坐标和所述第三目标区域对应的商标名称确定为另一个第一商 标的特征信息。 When the degree of coincidence of at least one first area in the second set with the first target area is greater than or equal to the second threshold, the degree of coincidence with the first target area in the second set is greater than Or a first area equal to the second threshold is determined as a second area, and a set of the first area after excluding the second area in the second set is determined as a third set, and the first area is determined Three sets of third target areas; determining the position coordinates corresponding to the second target area and the trademark name corresponding to the second target area as feature information of another first trademark, or corresponding to the third target area The position coordinates of and the brand name corresponding to the third target area are determined to be characteristic information of another first brand.
[权利要求 7] —种信息推广装置, 其特征在于, 包括: [Claim 7] An information promotion device, characterized in that it includes:
获取模块, 用于获取第一图像, 所述第一图像为商场中的展示屏对应 的拍摄装置拍摄到的图像; An obtaining module, configured to obtain a first image, where the first image is an image captured by a shooting device corresponding to a display screen in a shopping mall;
检测模块, 用于检测所述第一图像中是否存在商标; A detection module, configured to detect whether a trademark exists in the first image;
信息确定模块, 用于在检测到所述第一图像中存在商标的情况下, 确 定所述第一图像中至少一个第一商标的特征信息; 查找模块, 用于在商标信息库中查找与每个第一商标的特征信息匹配 的第二商标; The information determination module is used to determine the characteristic information of at least one first trademark in the first image when a trademark is detected in the first image; the search module is used to search for and search for each trademark in the trademark information database Feature information matching of the first trademark 'S second trademark;
显示模块, 用于在所述展示屏中显示所述第二商标对应的推广信息, 所述推广信息包括所述第二商标对应的商品在所述商场中的位置信息 A display module, configured to display promotion information corresponding to the second trademark on the display screen, where the promotion information includes position information of commodities corresponding to the second trademark in the mall
[权利要求 8] 根据权利要求 7所述的装置, 其特征在于, 所述检测模块, 包括: 处理单元, 用于基于目标检测模型中的卷积层对所述第一图像进行卷 积处理, 得到目标卷积特征图, 所述目标卷积特征图包括多个目标卷 积特征子图; [Claim 8] The apparatus according to claim 7, wherein the detection module includes: a processing unit configured to perform convolution processing on the first image based on the convolution layer in the target detection model, Obtaining a target convolution feature map, where the target convolution feature map includes multiple target convolution feature subgraphs;
确定单元, 用于分别确定所述多个目标卷积特征子图中各个目标卷积 特征子图对应的目标卷积特征信息; A determining unit, configured to separately determine target convolution feature information corresponding to each target convolution feature subgraph in the multiple target convolution feature subgraphs;
所述确定单元, 还用于分别确定各个目标卷积特征信息与所述目标检 测模型中多个商标名称之间的匹配概率, 以及, 所述各个目标卷积特 征信息对应的位置坐标, 并分别将在所述第一图像中与所述各个目标 卷积特征信息对应的位置坐标所对应的区域作为所述各个目标卷积特 征信息对应的第一区域; The determining unit is further configured to separately determine the matching probability between each target convolution feature information and multiple trademark names in the target detection model, and, the position coordinates corresponding to each target convolution feature information, and respectively Use the area corresponding to the position coordinates corresponding to the respective target convolution feature information in the first image as the first area corresponding to the respective target convolution feature information;
匹配单元, 用于分别根据所述各个目标卷积特征信息与所述目标检测 模型中多个商标名称之间的匹配概率, 确定所述各个目标卷积特征信 息对应的最大匹配概率, 分别将所述各个目标卷积特征信息对应的最 大匹配概率确定为所述各个目标卷积特征信息对应的第一区域的置信 度, 并分别将所述各个目标卷积特征信息对应的最大匹配概率所对应 的商标名称作为与所述各个目标卷积特征信息对应的第一区域所对应 的商标名称; The matching unit is configured to determine the maximum matching probability corresponding to each target convolution feature information according to the matching probability between each target convolution feature information and multiple trademark names in the target detection model, respectively The maximum matching probability corresponding to each target convolution feature information is determined as the confidence of the first region corresponding to each target convolution feature information, and the corresponding maximum matching probability corresponding to each target convolution feature information is respectively The trademark name serves as the trademark name corresponding to the first area corresponding to each target convolution feature information;
判断单元, 用于判断各个第一区域的置信度是否大于第一阈值; 所述判断单元, 还用于当至少一个第一区域的置信度大于或等于所述 第一阈值时, 检测到所述第一图像中存在商标; 所述判断单元, 还用于当所述各个第一区域的置信度均小于所述第一 阈值时, 检测到所述第一图像中不存在商标。 A judging unit, used to judge whether the confidence of each first area is greater than a first threshold; the judging unit is also used to detect when the confidence of at least one first area is greater than or equal to the first threshold There is a trademark in the first image; the judgment unit is further configured to detect that there is no trademark in the first image when the confidence of each first area is less than the first threshold.
[权利要求 9] 根据权利要求 7所述的装置, 其特征在于, 所述装置还包括显示模块 , 用于在所述第二商标为多个的情况下, 在所述展示屏中按顺序切换 显示所述多个所述第二商标对应的推广信息, 或者在所述第二商标为 多个的情况下, 在所述展示屏中同时显示所述多个所述第二商标对应 的推广信息。 [Claim 9] The device according to claim 7, wherein the device further comprises a display module , For switching the display of promotion information corresponding to the plurality of second trademarks in sequence on the display screen when there are multiple second trademarks, or when there are multiple second trademarks In the case of, displaying promotion information corresponding to the plurality of second trademarks simultaneously on the display screen.
[权利要求 10] 根据权利要求 7所述的装置, 其特征在于, 所述检测模块包括处理单 语、 确定单元、 匹配单元及判断单元, 所述处理单元用于基于目标检 测模型中的卷积层对所述第一图像进行卷积处理, 得到目标卷积特征 图, 所述目标卷积特征图包括多个目标卷积特征子图; 所述确定单元 用于分别确定所述多个目标卷积特征子图中各个目标卷积特征子图对 应的目标卷积特征信息; 分别确定各个目标卷积特征信息与所述目标 检测模型中多个商标名称之间的匹配概率, 以及, 所述各个目标卷积 特征信息对应的位置坐标, 并分别将在所述第一图像中与所述各个目 标卷积特征信息对应的位置坐标所对应的区域作为所述各个目标卷积 特征信息对应的第一区域; 所述匹配单元用于分别根据所述各个目标 卷积特征信息与所述目标检测模型中多个商标名称之间的匹配概率, 确定所述各个目标卷积特征信息对应的最大匹配概率, 分别将所述各 个目标卷积特征信息对应的最大匹配概率确定为所述各个目标卷积特 征信息对应的第一区域的置信度, 并分别将所述各个目标卷积特征信 息对应的最大匹配概率所对应的商标名称作为与所述各个目标卷积特 征信息对应的第一区域所对应的商标名称; 所述判断单元用于判断各 个第一区域的置信度是否大于第一阈值; 及当至少一个第一区域的置 信度大于或等于所述第一阈值时, 检测到所述第一图像中存在商标; 当所述各个第一区域的置信度均小于所述第一阈值时, 检测到所述第 一图像中不存在商标。 [Claim 10] The device according to claim 7, wherein the detection module includes a processing monolingual, a determination unit, a matching unit and a judgment unit, the processing unit is used for convolution based on the target detection model The layer performs convolution processing on the first image to obtain a target convolution feature map, where the target convolution feature map includes multiple target convolution feature submaps; the determining unit is configured to determine the multiple target convolutions respectively Target convolution feature information corresponding to each target convolution feature sub-picture in the product feature sub-picture; determining the matching probability between each target convolution feature information and multiple trademark names in the target detection model, and, The position coordinates corresponding to the target convolution feature information, and the regions corresponding to the position coordinates corresponding to the respective target convolution feature information in the first image are used as the first corresponding to the respective target convolution feature information Area; the matching unit is used to determine the maximum matching probability corresponding to each target convolution feature information according to the matching probability between each target convolution feature information and multiple trademark names in the target detection model, Determine the maximum matching probability corresponding to each target convolution feature information as the confidence of the first region corresponding to each target convolution feature information, and respectively determine the maximum matching probability corresponding to each target convolution feature information The corresponding brand name is used as the brand name corresponding to the first area corresponding to each target convolution feature information; the judgment unit is used to determine whether the confidence of each first area is greater than the first threshold; and when at least one When the confidence of the first area is greater than or equal to the first threshold, the presence of a trademark in the first image is detected; when the confidence of each of the first areas is less than the first threshold, the detection is performed There is no trademark in the first image.
[权利要求 11] 根据权利要求 10所述的装置, 其特征在于, 所述匹配单元还用于将置 信度大于或等于所述第一阈值的第一区域组成的集合确定为第一集合 ; 在所述第一集合中的第一区域的数量为多个的情况下, 将所述第一 集合中置信度最大的第一区域确定为第一目标区域; 在所述第一集合 中的第一区域的数量为一个的情况下, 将所述第一集合中的第一区域 确定为第一目标区域; 将所述第一目标区域对应的位置坐标和所述第 一目标区域对应的商标名称确定为一个第一商标的特征信息。 [Claim 11] The apparatus according to claim 10, characterized in that the matching unit is further configured to determine a set consisting of a first region with a confidence greater than or equal to the first threshold as the first set; When the number of first regions in the first set is multiple, determine the first region with the highest confidence in the first set as the first target region; in the first set When the number of first regions in is one, determine the first region in the first set as the first target region; map the position coordinates corresponding to the first target region to the first target region The trademark name is determined as the characteristic information of the first trademark.
[权利要求 12] 根据权利要求 11所述的装置, 其特征在于, 所述匹配单元还用于将所 述第一集合中排除所述第一目标区域之后的第一区域组成的集合确定 为第二集合; 判断所述第二集合中每个第一区域与所述第一目标区域 的重合度是否大于或等于第二阈值; 当所述第二集合中每个第一区域 与所述第一目标区域的重合度均小于所述第二阈值时, 确定所述第二 集合的第二目标区域; 当所述第二集合中至少一个第一区域与所述第 一目标区域的重合度大于或等于所述第二阈值时, 将所述第二集合中 与所述第一目标区域的重合度大于或等于所述第二阈值的第一区域确 定为第二区域, 并将所述第二集合中排除所述第二区域之后的第一区 域组成的集合确定为第三集合, 且确定所述第三集合的第三目标区域 ; 将所述第二目标区域对应的位置坐标和所述第二目标区域对应的商 标名称确定为另一个第一商标的特征信息, 或将所述第三目标区域对 应的位置坐标和所述第三目标区域对应的商标名称确定为另一个第一 商标的特征信息。 [Claim 12] The apparatus according to claim 11, wherein the matching unit is further configured to determine a set composed of a first area after excluding the first target area from the first set as the first Two sets; judging whether the degree of coincidence between each first area in the second set and the first target area is greater than or equal to a second threshold; when each first area in the second set and the first When the degree of coincidence of the target areas is less than the second threshold, determine the second target area of the second set; when the degree of coincidence of at least one first area in the second set with the first target area is greater than or When it is equal to the second threshold, determine the first area in the second set whose degree of coincidence with the first target area is greater than or equal to the second threshold as the second area, and determine the second set The set consisting of the first area after excluding the second area is determined as the third set, and the third target area of the third set is determined; the position coordinates corresponding to the second target area and the second The trademark name corresponding to the target area is determined to be characteristic information of another first trademark, or the position coordinates corresponding to the third target area and the trademark name corresponding to the third target area are determined to be characteristic information of another first trademark .
[权利要求 13] 一种信息推广装置, 其特征在于, 包括处理器、 存储器以及通信接口 , 所述处理器、 存储器和通信接口相互连接, 其中, 所述通信接口用 于输入或输出数据, 所述存储器用于存储程序代码, 所述处理器用于 调用所述程序代码, 执行如权利要求 1-6任一项所述的方法。 [Claim 13] An information promotion device, comprising a processor, a memory and a communication interface, wherein the processor, the memory and the communication interface are connected to each other, wherein the communication interface is used to input or output data, so The memory is used to store program code, and the processor is used to call the program code to execute the method according to any one of claims 1-6.
[权利要求 14] 一种计算机存储介质, 其特征在于, 所述计算机存储介质存储有计算 机程序, 所述计算机程序包括程序指令, 所述程序指令当被处理器执 行时, 执行如权利要求 1-6任一项所述的方法。 [Claim 14] A computer storage medium, characterized in that the computer storage medium stores a computer program, the computer program includes program instructions, and when executed by a processor, the program instructions are executed as claimed in claim 1- 6. The method of any one of the above.
PCT/CN2019/114081 2018-12-21 2019-10-29 Information promotion method and apparatus WO2020125207A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811579857.8 2018-12-21
CN201811579857.8A CN109816429B (en) 2018-12-21 2018-12-21 Information popularization method and device

Publications (1)

Publication Number Publication Date
WO2020125207A1 true WO2020125207A1 (en) 2020-06-25

Family

ID=66602267

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/114081 WO2020125207A1 (en) 2018-12-21 2019-10-29 Information promotion method and apparatus

Country Status (2)

Country Link
CN (1) CN109816429B (en)
WO (1) WO2020125207A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109816429B (en) * 2018-12-21 2022-04-29 深圳云天励飞技术有限公司 Information popularization method and device
CN110543839A (en) * 2019-08-20 2019-12-06 南京掌控网络科技有限公司 commodity goods laying rate acquisition method based on computer vision

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101388067A (en) * 2007-09-12 2009-03-18 中国科学院自动化研究所 Implantation method for interaction entertainment trademark advertisement
CN107067286A (en) * 2017-04-28 2017-08-18 深圳齐心集团股份有限公司 Push the method and stationery of stationery auxiliary products
CN108229495A (en) * 2017-06-23 2018-06-29 北京市商汤科技开发有限公司 Target object detection method, device, electronic equipment and storage medium
CN108985314A (en) * 2018-05-24 2018-12-11 北京飞搜科技有限公司 Object detection method and equipment
CN109816429A (en) * 2018-12-21 2019-05-28 深圳云天励飞技术有限公司 Information popularization method and apparatus

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110295693A1 (en) * 2010-06-01 2011-12-01 Microsoft Corporation Generating Tailored Content Based On Scene Image Detection
WO2016036338A1 (en) * 2014-09-02 2016-03-10 Echostar Ukraine, L.L.C. Detection of items in a home
CN105354334B (en) * 2015-11-27 2019-04-26 广州视源电子科技股份有限公司 A kind of information issuing method and Intelligent mirror based on Intelligent mirror
CN106651955B (en) * 2016-10-10 2020-01-14 北京小米移动软件有限公司 Method and device for positioning target object in picture
CN107862316B (en) * 2017-10-19 2021-06-04 珠海格力电器股份有限公司 Convolution operation method and device
CN108322545A (en) * 2018-02-13 2018-07-24 广东欧珀移动通信有限公司 Information-pushing method, device, server and computer-readable medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101388067A (en) * 2007-09-12 2009-03-18 中国科学院自动化研究所 Implantation method for interaction entertainment trademark advertisement
CN107067286A (en) * 2017-04-28 2017-08-18 深圳齐心集团股份有限公司 Push the method and stationery of stationery auxiliary products
CN108229495A (en) * 2017-06-23 2018-06-29 北京市商汤科技开发有限公司 Target object detection method, device, electronic equipment and storage medium
CN108985314A (en) * 2018-05-24 2018-12-11 北京飞搜科技有限公司 Object detection method and equipment
CN109816429A (en) * 2018-12-21 2019-05-28 深圳云天励飞技术有限公司 Information popularization method and apparatus

Also Published As

Publication number Publication date
CN109816429A (en) 2019-05-28
CN109816429B (en) 2022-04-29

Similar Documents

Publication Publication Date Title
US10831819B2 (en) Hue-based color naming for an image
US10528618B2 (en) Image processing and matching
US9633448B1 (en) Hue-based color naming for an image
CN110942370B (en) Descriptor and image based project model
CN110008808B (en) Panorama segmentation method, device and system and storage medium
US9665962B2 (en) Image distractor detection and processng
US9542736B2 (en) Evaluating image sharpness
CN111144242B (en) Three-dimensional target detection method, device and terminal
CN105517679B (en) Determination of the geographic location of a user
US20150117784A1 (en) Image foreground detection
TWI781554B (en) Method of determining item name of object, device, computer equipment and storage medium
US10417276B2 (en) Thumbnail generation from panoramic images
US20140093122A1 (en) Image identifiers and methods and systems of presenting image identifiers
KR102218608B1 (en) Real time overlay placement in videos for augmented reality applications
CN108961267B (en) Picture processing method, picture processing device and terminal equipment
TWI745818B (en) Method and electronic equipment for visual positioning and computer readable storage medium thereof
WO2020125207A1 (en) Information promotion method and apparatus
US20200134056A1 (en) Digital Image Search Training using Aggregated Digital Images
WO2016011763A1 (en) Image presentation method, apparatus and device, and nonvolatile computer storage medium
CN108280135B (en) Method and device for realizing visualization of data structure and electronic equipment
US8885952B1 (en) Method and system for presenting similar photos based on homographies
WO2020068241A1 (en) Fashion by trend user interfaces
CN115170168A (en) Advertisement pushing method and device, cloud server and storage medium
WO2019075644A1 (en) Portrait photograph searching method, and terminal
US20210295423A1 (en) Automatic clustering and mapping of user generated content with curated content

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19899261

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19899261

Country of ref document: EP

Kind code of ref document: A1