WO2024051609A1 - Advertisement creative data selection method and apparatus, model training method and apparatus, and device and storage medium - Google Patents

Advertisement creative data selection method and apparatus, model training method and apparatus, and device and storage medium Download PDF

Info

Publication number
WO2024051609A1
WO2024051609A1 PCT/CN2023/116575 CN2023116575W WO2024051609A1 WO 2024051609 A1 WO2024051609 A1 WO 2024051609A1 CN 2023116575 W CN2023116575 W CN 2023116575W WO 2024051609 A1 WO2024051609 A1 WO 2024051609A1
Authority
WO
WIPO (PCT)
Prior art keywords
advertising
creative
data
feature vector
image
Prior art date
Application number
PCT/CN2023/116575
Other languages
French (fr)
Chinese (zh)
Inventor
刘银星
阮涛
张政
吕晶晶
詹科
Original Assignee
北京沃东天骏信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京沃东天骏信息技术有限公司 filed Critical 北京沃东天骏信息技术有限公司
Publication of WO2024051609A1 publication Critical patent/WO2024051609A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0276Advertisement creation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0202Market predictions or forecasting for commercial activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • G06V10/765Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects using rules for classification or partitioning the feature space
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks

Definitions

  • This application relates to the field of artificial intelligence technology, such as advertising creative data selection methods and devices, model training methods and devices, equipment, and storage media.
  • the advertising creative selection method generally uses image processing technology to process image elements in advertising materials, and uses natural language processing models to identify and process text elements in advertising materials. These methods are targeted at specific fields and have poor generalizability. Moreover, these methods cannot directly select the best advertising creative elements for users, which reduces the user experience.
  • This application provides advertising creative data selection methods and devices, model training methods and devices, equipment, and storage media to automatically and accurately select optimal target advertising creative data from candidate advertising creative data.
  • this application provides a method for selecting advertising creative data, including:
  • candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising images and advertising copy;
  • the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the recommendation probability value based on the fusion result.
  • this application also provides a model training method, which method includes:
  • training sample data includes a sample range corresponding to the sample item
  • the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the predicted recommendation probability value based on the fusion result.
  • this application also provides an advertising creative data selection device, which includes:
  • the data acquisition module is configured to obtain candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising pictures and advertising copy;
  • the probability value obtaining module is configured to obtain the sparse feature vector and picture feature vector corresponding to the candidate advertisement creative data, and obtain the candidate advertisement based on the sparse feature vector, the picture feature vector and the pre-trained creative selection model. Recommendation probability value corresponding to creative data;
  • a data selection module configured to select target advertising creative data according to the recommendation probability value
  • the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the recommendation probability value based on the fusion result.
  • this application also provides a model training device, which includes:
  • a sample data acquisition module configured to obtain training sample data, wherein the training sample data includes sample advertising creative data corresponding to sample items and standard recommendation probability values corresponding to the sample advertising creative data, and the sample advertising creative data includes advertisements. images and advertising copy;
  • a vector acquisition module configured to obtain the sparse feature vector and the picture feature vector corresponding to the sample advertising creative data, and obtain the sample advertising creative based on the sparse feature vector, the picture feature vector and the creative selection model to be trained.
  • a model training module configured to determine a loss function based on the standard recommendation probability value and the predicted recommendation probability value, adjust the network parameters in the creative selection model to be trained based on the loss function, and adjust the network parameters in the creative selection model to be trained, and perform Stop training when iterating the stop condition;
  • the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the predicted recommendation probability value based on the fusion result.
  • this application also provides a computer device, including a memory, a processor and a storage device.
  • a computer program is stored in a memory and can be run on a processor.
  • the processor executes the program, the above-mentioned advertising creative data selection method or model training method is implemented.
  • the present application also provides a computer-readable storage medium on which a computer program is stored.
  • the program is executed by a processor, the above-mentioned advertising creative data selection method or model training method is implemented.
  • Figure 1 is a flow chart of an advertising creative data selection method provided by an embodiment of the present application.
  • Figure 2 is a schematic structural diagram of a creative selection model provided by an embodiment of the present application.
  • Figure 3 is a flow chart of a method for obtaining image feature vectors provided by an embodiment of the present application.
  • Figure 4 is a flow chart of a method for obtaining candidate advertising creative data provided by an embodiment of the present application
  • Figure 5 is a flow chart of another method of obtaining candidate advertising creative data provided by an embodiment of the present application.
  • Figure 6 is a flow chart for optimizing candidate advertising creative data provided by an embodiment of the present application.
  • Figure 7 is a schematic flow chart of an advertising creative data selection and optimization process provided by an embodiment of the present application.
  • Figure 8 is a flow chart of a model training method provided by an embodiment of the present application.
  • Figure 9 is a schematic structural diagram of an advertising creative data selection device provided by an embodiment of the present application.
  • Figure 10 is a schematic structural diagram of a model training device provided by an embodiment of the present application.
  • Figure 11 is a schematic structural diagram of a computer device provided by an embodiment of the present application.
  • Figure 1 is a flow chart of an advertising creative data selection method provided by an embodiment of the present application. This embodiment can automatically and accurately select optimal target advertising creative data from candidate advertising creative data. This method can be executed by the advertising creative data selection device in the embodiment of the present application. The device can be implemented in software and/or hardware. As shown in Figure 1, the method includes the following steps:
  • the target item represents the item for which corresponding advertising creative data needs to be generated or selected.
  • the candidate advertising creative data includes advertising images and advertising copy.
  • the advertising material data (advertising copy and advertising pictures) corresponding to the item can be obtained through the item details page.
  • the advertising copy of the target item can be obtained; by performing image recognition and target detection on the content in the details page, the advertising image of the target item can be obtained.
  • the advertising material data corresponding to the target item can be obtained from the advertising creative materials provided by advertising companies. Screen the advertising creative data corresponding to the target item according to the needs to obtain the candidate advertising creative data corresponding to the target item.
  • the target item is a mobile phone. On the mobile website, view the content of the detail pages of multiple mobile phones.
  • S120 Obtain the sparse feature vector and picture feature vector corresponding to the candidate advertising creative data, and obtain the recommendation probability value corresponding to the candidate advertising creative data based on the sparse feature vector, picture feature vector and the pre-trained creative selection model.
  • the creative selection model is used to: use the self-attention mechanism to fuse sparse feature vectors and image feature vectors; and output recommendation probability values based on the fusion results.
  • the creative selection model includes a multilayer perceptron (MLP) module, a self-attention module and an output module; among them, the MLP module is used to output the first feature vector based on sparse feature vectors; the self-attention module The output module is configured to output a second feature vector based on the sparse feature vector and the image feature vector; the output module is configured to output a recommendation probability value based on the first feature vector and the second feature vector.
  • MLP multilayer perceptron
  • a sparse feature vector is a vector used to reflect multiple types of sparse features.
  • the sparse features of this solution include item features, user features and creative features.
  • Item characteristics include information such as item identification number, advertising space identification number, brand identification number, and item type target identification number.
  • User characteristics include the user’s age, gender, preferences, etc.
  • Creative features include background template features, copywriting features and picture features.
  • Background template features include background template identification number, template style, template layout, template main color and other features.
  • Copywriting features include Features such as main copy, sub-copy, and bubble copy (the type of copy used to indicate that items are on sale or hot-selling), etc.
  • the copy characteristics can be obtained from the advertising copy in the advertising creative data.
  • Image features include whether there is text on the advertising image, whether there are people, and the creative type.
  • the image feature vector is a vector used to reflect the image features of the advertising image.
  • the image features can be obtained from the advertising images in the advertising creative data.
  • the MLP module can map multiple input feature vectors to a single output feature vector.
  • the self-attention module can quickly extract important features from sparse feature vectors.
  • the self-attention module includes the multi-head self-attention module in the Transformer model.
  • the Transformer model is a neural network model.
  • the Transformer model can learn the context of the data by tracking the relationships in the sequence data.
  • the Transformer model includes a multi-head self-attention module.
  • the multi-head self-attention module can extract feature information from multiple dimensions, and the multi-head self-attention module is highly parallel and can combine information from different dimensions to capture multiple ranges of dependencies within the sequence.
  • FIG. 2 is a schematic structural diagram of a creative selection model provided by an embodiment of the present application.
  • the sparse feature matrix and image feature vector are input into the creative selection model, and the sparse feature matrix can be converted into a sparse feature vector using the vector conversion table.
  • the recommendation probability value corresponding to the candidate advertising creative data (the output result of the output module) is predicted.
  • the creative selection model Before using the creative selection model, the creative selection model needs to be trained. Collect a large number of advertising creative plans and sort out advertising creative data (background template information, item information, copywriting information, picture information, etc.) from the advertising creative plans.
  • the annotated sparse feature vectors and image feature vectors are used as sample data, and the recommended probability values (1 or 0) of the annotated sparse feature vectors and image feature vectors are used as sample labels.
  • S130 Select target advertising creative data according to the recommendation probability value.
  • the target advertising creative data is the optimal advertising creative data selected from multiple candidate advertising creative data.
  • the target advertising creative data includes the target item's advertising copy, advertising pictures, background templates (template style, background color, layout, etc.) and other data related to the target item's advertising plan.
  • the recommendation probability value is the probability value output by the creative selection model to recommend the corresponding candidate advertising creative data to the user. The larger the recommendation probability value is, the higher the creative selection model believes that the corresponding candidate advertising creative data is.
  • a preset probability value can be set according to requirements. When the recommendation probability value is greater than the preset probability value, the corresponding candidate advertising creative data is determined as the target advertising creative data. Or, directly select the candidate advertising creative data with the largest recommendation probability value as the target advertising creative data.
  • this plan also includes the following steps A1-Step A2:
  • Step A1 Obtain the first coding information of the advertising image and the second coding information of the advertising copy in the target advertising creative data, and generate a uniform resource locator (URL, Uniform) corresponding to the target advertising creative data based on the first coding information and the second coding information.
  • URL uniform resource locator
  • URL is a concise representation of the location and access method of resources obtained from the Internet. It is the address of standard resources on the Internet. Every file on the Internet has a unique URL, which contains information indicating the location of the file and what the browser should do with it.
  • the first encoding information can be obtained by URL encoding the advertising image
  • the second encoding information can be obtained by URL encoding the advertising copy.
  • a URL can be generated using the first encoding information and the second encoding information, that is, the URL corresponding to the target advertising creative data. Through the URL corresponding to the target advertising creative data, the address of the advertising creative data can be directly accessed.
  • Step A2 After receiving the access request for the URL sent by the client, obtain the advertising image and advertising copy in the target advertising creative data according to the URL, and perform a combined image operation on the obtained advertising image and advertising copy to obtain the target Advertising creative image; send the target advertising creative image to the client for display.
  • the address of the advertising creative data can be directly accessed.
  • the advertising image and advertising copy pointed to by the URL are obtained based on the URL.
  • the obtained advertising image and advertising copy are combined to obtain an advertising image, and the advertising image is used as the target advertising creative image.
  • image processing software is used to fuse the advertising copy and advertising images together with the category information of the target item. In the process of combining pictures, the position and size of the advertising copy and advertising image can be appropriately adjusted according to the needs and actual environment, and finally the target advertising creative image can be obtained.
  • Generating the URL corresponding to the target advertising creative data through the above steps can save the resources occupied by image storage, and the advertising image can also be updated at any time as the URL encoding changes, improving the efficiency of providing advertising creative data to users.
  • the technical solution of this embodiment is to obtain candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising pictures and advertising copy.
  • the candidate advertising creative data includes advertising pictures and advertising copy.
  • the sparse feature vector and picture feature vector corresponding to the candidate advertising creative data and obtain the recommendation probability value corresponding to the candidate advertising creative data based on the sparse feature vector, picture feature vector and the pre-trained creative selection model; among which, the creative selection model includes MLP module, self-attention module and output module, MLP module is used based on sparse features
  • the vector outputs a first feature vector, the self-attention module is used to output a second feature vector based on the sparse feature vector and the picture feature vector; the output module is used to output a recommendation probability value based on the first feature vector and the second feature vector.
  • the solution of this embodiment can use a creative selection model to automatically select target advertising creative data from the collected candidate advertising creative data.
  • the self-attention module included in the creative selection model can identify (Identifier, ID) classes of sparse feature vectors. Features are fused with image feature vectors, so that images and text can be fused to solve the multi-modal creative selection problem in the advertising system, and it has good universality.
  • Figure 3 is a flow chart of a method for obtaining a picture feature vector provided by an embodiment of the present application. This embodiment explains the method of obtaining a picture feature vector based on the above embodiment. As shown in Figure 3, the method of this embodiment includes the following steps:
  • the candidate advertising creative data includes advertising copy and advertising images.
  • the image feature vector needs to be input into the multi-head self-attention module. Therefore, it is necessary to input the advertising images in the candidate advertising creative data into the pre-trained residual neural network model to obtain the image feature vector.
  • training the residual neural network model includes the following steps B1 to B3:
  • Step B1 Obtain the sample image and the classification label corresponding to the sample image.
  • the classification label is the product word of the sample item contained in the sample picture.
  • the product word is a vocabulary used to characterize the type of sample item and does not contain brand information.
  • the category words of items are used as classification labels.
  • the category word of an item includes the brand information of the item, and the product word can be used to indicate the type of item and does not contain brand information. For example, if the sample item is a mobile phone of brand A, the corresponding category words include mobile phones and brand A, and the corresponding product words only include mobile phones.
  • a sample image is an advertising image in an advertising plan for a sample item.
  • the method of using product words as sample labels is suitable for large-scale, multi-classification tasks. It can improve the generalization of image feature vectors generated by the residual neural network model and avoid insufficient expression of image content caused by excessive concentration of item category information.
  • item type information is often more important than item brand information.
  • item type information is often more important than item brand information.
  • An advertising plan for mobile phones may need to highlight the advertising copy (describing the performance of the phone, etc.), while an advertising plan for clothing may need to highlight advertising images.
  • the advertising plans are mostly the same.
  • the difference in their advertising plans may only be the content described in the advertising copy. Therefore, using the product words of sample items as classification labels can be closer to the actual situation and improve the accuracy of the recommendation probability value of the creative selection model.
  • Step B2 Input the sample image into the residual neural network model to obtain the predicted classification output by the residual neural network model.
  • the residual neural network model is a type of convolutional neural network model, such as the residual network (Residual Network, ResNet) model. Residual neural network models are mostly suitable for image classification and object recognition.
  • the residual neural network model is easy to optimize, and the accuracy can be improved by increasing a certain depth of the network model.
  • the internal residual block uses skip connections, which alleviates the vanishing gradient problem caused by increasing depth in deep neural networks. Input the sample image into the residual neural network model, and the residual neural network model predicts the type corresponding to the sample image through computational reasoning.
  • the tail of the residual neural network model includes three fully connected layers for outputting 32-dimensional vectors, 128-dimensional vectors and 256-dimensional vectors respectively. That is, these three fully connected layers are added to the tail of the ResNet model.
  • the residual neural network model includes convolutional layers, pooling layers, activation functions and fully connected layers. Among them, operations such as convolution layer, pooling layer and activation function map the original data to the feature space of the hidden layer to obtain the feature vector.
  • the fully connected layer can map the feature vector of the distributed feature representation to the sample label space.
  • the fully connected layer can extract the features of the feature vector and classify the sample images according to the feature vectors of the sample images. Depending on the size of the sample data and the classification requirements of the sample data, output vectors of different dimensions can be set for the residual neural network model.
  • the category of the sample image can be flexibly and accurately predicted based on business needs and the size of the sample data. .
  • Step B3 Determine the loss function based on the predicted classification and classification label, adjust the network parameters in the residual neural network model based on the loss function, and stop training when the preset iteration stop conditions are met.
  • Predictive classification refers to the category of the sample image predicted by the residual neural network model after inputting the sample image into the residual neural network model.
  • the classification label is the true category of the annotated sample image.
  • the "gap" between the predicted classification and the sample label can be calculated through the loss function.
  • the network parameters in the residual neural network model can be continuously adjusted, so that the predicted classification and The classification labels get closer and closer until the training stops when the preset iteration stopping condition is met.
  • the preset iteration stop condition includes that the prediction accuracy of the residual neural network model reaches the preset accuracy range.
  • the preset accuracy range includes [75%, 80%]. The higher the prediction accuracy of the residual neural network model, the more accurate the category of the predicted sample image. But at the same time, the computational complexity of the residual neural network model becomes greater, resulting in the residual neural network in practical applications.
  • the preset accuracy range can be set to [75%, 80%].
  • the prediction accuracy of the residual neural network model is not less than 75% and not greater than 80%, the training of the residual neural network model can be stopped.
  • using the product words of the sample items as classification labels can improve the accuracy of the recommendation probability value of the creative selection model.
  • the category of the sample image can be flexibly and accurately predicted based on business needs and the size of the sample data. , improving the accuracy of the recommended probability value of the creative selection model.
  • Image feature vectors are vectors that represent some features of the image in the form of vectors.
  • the image feature vector output by the residual neural network model can be used to represent the category of the image. For example, if a picture of a mobile phone is input into the residual neural network model, the residual neural network model predicts that the feature of the picture is "category: mobile phone picture” and outputs a feature vector of the picture with the characteristics of "mobile phone”.
  • the technical solution of this embodiment is to input the advertisement pictures in the candidate advertisement creative data into the pre-trained residual neural network model; and obtain the picture feature vector output by the residual neural network model.
  • the solution of this embodiment can flexibly and accurately predict the category of the sample picture based on business needs and the size of the sample data, and use the product words of the sample items as classification labels, improving the accuracy of the recommendation probability value of the creative selection model.
  • Figure 4 is a flow chart of a method for obtaining candidate advertisement creative data provided by an embodiment of the present application. This embodiment explains the method of obtaining candidate advertisement creative data based on the above embodiment. As shown in Figure 4, the method of this embodiment includes the following steps:
  • S310 Obtain multiple advertising material data corresponding to the target item, where the advertising material data includes advertising copy and advertising images.
  • the advertising material data (advertising copy and advertising pictures) corresponding to the item can be obtained through the item details page.
  • the advertising copy is obtained; by performing image recognition and image cropping on the content in the details page, the advertising image is obtained.
  • the advertising material data corresponding to the item can be obtained from the advertising creative materials provided by advertising companies.
  • S320 Select at least one advertisement copy and at least one advertisement image from the plurality of advertisement creative data according to the online click data corresponding to the plurality of advertisement creative data.
  • Online click data is used to represent the click volume of creative data.
  • the click volume can reflect the user's How much you like the creative. The more clicks a creative gets, the more people are likely to like it. Therefore, at least one advertisement copy and at least one advertisement image can be selected from the plurality of advertisement creative data based on the online click data respectively corresponding to the plurality of advertisement creative data.
  • selecting at least one advertising copy and at least one advertising image from multiple advertising creative data includes the following steps C1 to C2:
  • Step C1 For each creative data, determine the score of the creative data based on the average number of online clicks of the creative data and the cumulative number of times the creative data is selected.
  • the number of online clicks on the creative data can be determined by viewing and clicking on the target item's details page.
  • the score is:
  • n j is the cumulative number of times the current creative data has been selected
  • n represents the number of creative data. The greater the average number of online clicks and the cumulative number of times the creative data is selected, the higher the score of the creative data. The smaller the average number of online clicks and the cumulative number of times the creative data is selected, the lower the score of the creative data.
  • the scores of the advertising copy and advertising image in the creative are calculated respectively.
  • the selected advertising copy forms a copy group, and the score of each advertising copy in the copy group is calculated.
  • the score for ad copy is the average number of online clicks for the advertising copy in the copywriting group
  • n j is the cumulative number of times the current advertising copy has been selected
  • n represents the number of advertising copywriting in the copywriting group.
  • the selected advertising images form an image group, and the score of each advertising image in the image group is calculated.
  • n j is the cumulative number of times the current advertising image has been selected
  • n represents the number of advertising images in the image group.
  • Step C2 Select at least one advertising copy and at least one advertising image from multiple creative data based on the score of each creative data.
  • At least one advertising image is selected from the advertising creative data based on the score of the advertising image. Select at least one ad copy from the creative data based on the ad copy's score.
  • the score of the creative data can be accurately calculated, and the creative data can be selected based on the score of the creative data.
  • the appropriate creative data can be accurately and quickly selected, and the process of combining the creative data can be avoided.
  • S330 Combine the selected advertising copy and advertising image to obtain at least one candidate advertising creative data.
  • the selected advertising copy and advertising pictures are combined in pairs to obtain at least one candidate advertising creative data. For example, based on the score of each creative data, the advertising copy selected from multiple creative data is copy A and copy B. The advertising image selected from multiple creative data is image C. Then the candidate advertising creative data can be obtained as AC and BC based on the advertising copy and advertising pictures.
  • the technical solution of this embodiment is to obtain multiple advertising material data corresponding to the target item, where the advertising material data includes advertising copy and advertising pictures, and according to the online click data corresponding to the multiple advertising material data, from the multiple advertising material data Select at least one advertising copy and at least one advertising picture, and combine the selected advertising copy and advertising pictures to obtain at least one candidate advertising creative data.
  • the solution of this embodiment can accurately calculate the score of the advertising material data, select the advertising material data based on the score status of the advertising material data, accurately and quickly select the appropriate advertising material data, and avoid the occurrence of mutual interaction between the advertising material data.
  • the combinatorial explosion problem caused by the combination process is Moreover, the candidate advertising creative data obtained by combining the selected advertising copy and advertising images is more accurate and more in line with the user's wishes.
  • Figure 5 is a flow chart of another method for obtaining candidate advertisement creative data provided by an embodiment of the present application. This embodiment explains the method of obtaining candidate advertisement creative data based on the above embodiment. As shown in Figure 5, the method of this embodiment includes the following steps:
  • S410 Identify and extract advertising copy from the item details page and/or advertising creative materials of the target item.
  • the advertising material data corresponding to the item can be obtained through the item details page.
  • Ad copy can be extracted from creative data.
  • identifying and extracting advertising copy from the item details page and/or advertising creative materials of the target item includes the following steps D1 to D3:
  • Step D1 Based on the preset character recognition model, identify the candidate copy from the item details page and/or advertising creative materials of the target item.
  • Character recognition models include optical character recognition (Optical Character Recognition, OCR) models.
  • OCR Optical Character Recognition
  • the copywriting in the item details page and/or advertising creative materials can be identified and extracted through the OCR model to obtain the advertising copy, and the identified advertising copy can be used as the target item. Choose copy.
  • Step D2 Based on the first word list containing preset interest point words, filter out the benefits from the candidate copy. Click on copywriting.
  • Benefit point words are words used in advertising copy to express item characteristics, item benefits/advantages, consumer interests, emotions/values, etc. to users.
  • the default interest point vocabulary is a table set according to needs and actual environment to record the interest point vocabulary of target items. For example, if the target item is a camera, the first word list of the interest point vocabulary contains:
  • Item characteristics small size, high pixels; Item benefits/advantages: easily take clear and beautiful photos; Consumer benefits: Easy to carry and easy to operate; Emotions/values: Record life and show the most real world.
  • the interest point vocabulary is selected from the candidate copy according to the first word list.
  • Step D3 Based on the preset word limit conditions and/or the second word list containing preset non-selling point words, screen out the selling point copy from the remaining copy after excluding the benefit point copy from the candidate copy.
  • Selling point copywriting is valuable copywriting that can increase users' purchasing interest and promote product sales.
  • Selling point copy can describe the selling points of the product in simple language, so there is a certain word limit for selling point copy.
  • Selling point copy can be screened out from the remaining copy after removing benefit point copy from the candidate copy based on the preset word count limit.
  • advertising copy that meets the word limit is not necessarily selling point copy.
  • the selling point copy can be screened out from the remaining copy after excluding the benefit point copy from the candidate copy based on a second word list containing preset non-selling point words.
  • the OCR model can be used to accurately and quickly mine and identify the advertising copy in the item details page, and obtain the final interest point copy and selling point copy through the first word list and the second word list, which can be used to solve the problem.
  • S420 Position and crop the item image in the item details page of the target item to obtain an advertising image.
  • the position of the image of the target item is uncertain, and the saliency algorithm can be used to divide the item details page into multiple specific areas with unique properties (such as text areas and picture areas). Identify the item pictures on the detail page in the segmented picture area, analyze the size of the item pictures, and intelligently crop the item pictures. For example, when the main body of the item in the item picture is too small and the user cannot clearly see the main body of the item from the item picture, the item picture can be modified. Crop to get an appropriately sized item image. Use the cropped item image as an advertising image.
  • S430 Select at least one advertisement copy and at least one advertisement image from the plurality of advertisement creative data according to the online click data respectively corresponding to the plurality of advertisement creative data.
  • S440 Combine the selected advertising copy and advertising image to obtain at least one candidate advertising creative data.
  • the technical solution of this embodiment is to identify and extract advertising copy from the item details page of the target item and/or advertising creative materials; position and crop the item pictures in the item details page of the target item to obtain the advertisement picture; According to the online click data corresponding to the multiple creative data, at least one advertising copy and at least one advertising image are selected from the multiple advertising creative data; the selected advertising copy and advertising image are combined to obtain at least one candidate advertising creative data.
  • the technical solution of this embodiment can accurately and quickly mine and identify advertising copy in item detail pages, and solve the problem of insufficient online copy materials through selling point copy. Through intelligent cropping of advertising images, target item images that can highlight the target items are obtained, so that the advertising images can fully display the target items.
  • Figure 6 is a flow chart for optimizing candidate advertising creative data provided by an embodiment of the present application. This embodiment describes a method for optimizing candidate advertising creative data based on the above embodiment. As shown in Figure 6, the method in this embodiment includes the following steps:
  • S510 Combine the selected advertising copy and advertising images to obtain at least one copy and image combination.
  • the selected advertising copy and advertising images are combined in pairs to obtain at least one copy and image combination. For example, based on the score of each creative data, the advertising copy selected from multiple creative data is copy A and copy B. The advertising image selected from multiple creative data is image C. Then according to the advertising copy and advertising image, the copy image combination can be obtained as AC and BC.
  • S520 Combine at least one copywriting picture combination and at least one preset background template to obtain at least one creative combination.
  • the default background template is a layout template with a fixed style of copywriting and picture combination that is set in advance according to the characteristics and needs of the item. After obtaining the copywriting picture combination, you can select at least one background template for the copywriting picture combination based on the category information of the items and the characteristics of the items, etc., and combine the copywriting picture combination with at least one preset background template in pairs to obtain at least one creative combination. .
  • S530 Screen out at least one creative combination as candidate advertising creative data from at least one creative combination based on preset filtering factors.
  • the preset filtering factors include category information of the target items, and/or color information of the advertising images and background templates in each creative combination.
  • candidate advertising creative data needs to be filtered out based on the category information of the item and/or the color information of the advertising image and background template in each creative combination.
  • the color information may include the main color.
  • the K-Means clustering algorithm can be used to perform cluster analysis and main color extraction on the color of the picture, and identify the main color of the picture.
  • At least one creative combination from at least one creative combination as candidate advertising creative data based on the category information of the target item it may be based on the category information of the target item and the preset corresponding relationship between the item category and the background template style. , determine the background template style corresponding to the target item, and then select a creative combination that matches the background template style from at least one creative combination.
  • At least one creative combination as candidate advertising creative data from at least one creative combination based on the color information of the advertising image and background template in each creative combination it may be based on the main color of the advertising image and the main color of the background template, using The Hue-Saturation-Value (HSV) color model selects at least one creative combination from at least one creative combination as candidate advertising creative data by giving priority to adjacent color matching and contrasting color matching.
  • HSV Hue-Saturation-Value
  • the preset threshold can be set in advance according to specific needs. When the size of the target item area in the advertisement image of the target item is smaller than the preset threshold, the target item in the image may be too small and the user cannot see the item intuitively and clearly.
  • the target detection algorithm can be used to identify the target item area in the advertising image, and the advertising image can be intelligently cropped to obtain a target item image that can highlight the target item, and the advertising image contained in the candidate advertising creative data can be updated based on the target item image.
  • S550 Polish the target item area in the advertisement image according to the color information of the advertisement image in the candidate advertisement creative data.
  • the polishing process includes at least one of adjusting brightness, adjusting contrast, and adjusting saturation.
  • image analysis is performed on the advertising image based on the color of the advertising image and the color of the target item area.
  • the color of the target item area is too dark, which may result in the target item area not being eye-catching enough, the color brightness of the target item area can be brightened to highlight the target item and attract users to click or trigger the target item.
  • the color contrast between the target item area and the advertising image is weak, which may cause the target item area to blend into the background of the advertising image, the contrast can be adjusted to highlight the target item.
  • the color saturation of the target item area and the advertising image is weak, you can adjust the saturation to highlight the target item.
  • polishing the target item area in the advertising image includes the following steps E1 - step E2:
  • Step E1 Determine whether the advertising image is a color image or a black image based on the pixel values of the pixels contained in the advertising image in the candidate advertising creative data.
  • the pixel information in the picture can reflect the color information of the image.
  • Step E2 When the advertising image is a color image, polish the target item area in the advertising image based on the first preset brightness parameter value, the first preset contrast parameter value and the first preset saturation parameter value. .
  • the first preset brightness parameter value, the first preset contrast parameter value and the first preset saturation parameter value can be set in advance according to needs. For example, when the advertising picture is a color picture, the first preset brightness parameter value can be set to 15, the first preset contrast parameter value can be set to 10, and the first preset saturation parameter value can be set to 10. According to the above-mentioned first preset brightness parameter value, first preset contrast parameter value and first preset saturation parameter value, the target item area in the advertising image is retouched.
  • Step E3 When the advertising image is a black image, polish the target item area in the advertising image based on the second preset brightness parameter value, the second preset contrast parameter value and the second preset saturation parameter value. .
  • the second preset brightness parameter value is greater than the first preset brightness parameter value
  • the second preset contrast parameter value is greater than the first preset contrast parameter value
  • the second preset saturation parameter value is greater than the first preset saturation parameter value.
  • the second preset brightness parameter value, the second preset contrast parameter value and the second preset saturation parameter value can be set in advance according to needs.
  • the advertising image is a black image
  • the second preset brightness parameter value may be set to 20
  • the second preset contrast parameter value may be set to 15
  • the second preset saturation parameter value may be set to 15. According to the above-mentioned second preset brightness parameter value, second preset contrast parameter value and second preset saturation parameter value, the target item area in the advertising image is retouched.
  • the advertising image can be polished according to the color of the advertising image, making the advertising items in the advertising image more attractive and giving full play to the display function of the advertising image for the items.
  • Figure 7 is a schematic flow chart of an advertising creative data selection and optimization process provided by an embodiment of the present application.
  • the advertising copy of the target item is identified and extracted through the OCR model in the material mining module.
  • the model and target detection model identify advertising images of target items.
  • At least one advertising copy and at least one advertising image are selected from multiple creative data based on the scores.
  • the candidate advertising creative data is obtained through the creative element combination module, the candidate advertising creative data is input into the creative selection module, and the target advertising creative data is obtained using the output value of the creative selection model in the creative selection module.
  • the solution of this embodiment is to obtain at least one copywriting and picture combination by combining the selected advertising copy and advertising pictures; to obtain at least one creative combination by combining at least one copywriting and picture combination with at least one preset background template; based on the preset Screening factors, selecting at least one creative combination from at least one creative combination as candidate advertising creative data; when the size of the target item area in the advertising image contained in the candidate advertising creative data is smaller than a preset threshold, the target item area is Crop, and use the cropped target item image to update the advertising image contained in the candidate advertising creative data; and polish the target item area in the advertising image according to the color information of the advertising image in the candidate advertising creative data.
  • the solution of this embodiment can filter creative combinations according to the category information, color information, etc.
  • the advertising images are polished according to the color of the advertising images, making the advertising items in the advertising images more attractive and giving full play to the display role of the advertising images in displaying the items.
  • Figure 8 is a flow chart of a model training method provided by an embodiment of the present application.
  • This embodiment can train an initial model to obtain a creative selection model.
  • This method can be executed by the model training device in the embodiment of the present application.
  • the device can It is implemented using software and/or hardware, as shown in Figure 8.
  • the method includes the following steps:
  • the training sample data includes sample advertising creative data corresponding to the sample items and standard recommendation probability values corresponding to the sample advertising creative data.
  • the sample advertising creative data includes advertising pictures and advertising copy.
  • the sample data can be obtained from a historical material database that stores sample advertising creative data and standard recommendation probability values corresponding to the sample advertising creative data. For example, based on big data and data analysis algorithms, some advertising creative data and standard recommendation probability values corresponding to the advertising creative data can be determined, and these advertising creative data and their corresponding standard probability recommendation values are stored in the historical material database. Sample data can be obtained from the historical material database.
  • S620 Obtain the sparse feature vector and picture feature vector corresponding to the sample advertising creative data, and obtain the predicted recommendation probability value corresponding to the sample advertising creative data based on the sparse feature vector, picture feature vector and the creative selection model to be trained.
  • a sparse feature vector is a vector used to reflect multiple types of sparse features.
  • the image feature vector is used is a vector that reflects the image features of the advertising image.
  • the image features can be obtained from the advertising images in the sample advertising creative data.
  • the predicted recommendation probability value is the recommendation probability value corresponding to the sparse feature vector and the picture feature vector output by the untrained creative selection model based on the sparse feature vector and the picture feature vector.
  • the creative selection model to be trained can output the predicted recommendation probability value corresponding to the sample advertising creative data through model calculation.
  • S630 Determine the loss function based on the standard recommendation probability value and the predicted recommendation probability value, adjust the network parameters in the creative selection model based on the loss function, and stop training when the preset iteration stop conditions are met.
  • the loss function is a function that maps a random event or the value of its related random variable into a non-negative real number to represent the "risk” or “loss” of the random event.
  • the network parameters in the model are configuration variables inside the model, and the values of the model parameters can be adjusted according to the loss function.
  • the creative selection model is used to: use a self-attention mechanism to fuse sparse feature vectors and picture feature vectors, and output predicted recommendation probability values based on the fusion results.
  • the creative selection model includes a multi-layer perceptron neural network MLP module, a self-attention module and an output module.
  • the MLP module is used to output the first feature vector based on the sparse feature vector;
  • the self-attention module is used to output the second feature vector based on the sparse feature vector and the picture feature vector;
  • the output module is used to output the first feature vector based on the first feature vector and the second feature vector. Output the predicted recommendation probability value.
  • the creative selection model to be trained is continuously optimized based on the loss function and the "gap". Adjust the network parameters of the creative selection model so that the "gap" between the predicted recommendation probability value and the standard recommendation probability value is continuously narrowed.
  • the creative selection model after training can be obtained.
  • the technical solution of this embodiment can obtain training sample data, obtain the sparse feature vectors and picture feature vectors corresponding to the sample advertising creative data, and obtain the sample advertising creative data based on the sparse feature vectors, picture feature vectors and the creative selection model to be trained.
  • the corresponding predicted recommendation probability value Determine the loss function based on the standard recommendation probability value and the predicted recommendation probability value, adjust the network parameters in the creative selection model based on the loss function, and stop training when the preset iteration stop conditions are met.
  • the technical solution of this embodiment can continuously optimize the creative selection model, making the predicted recommendation probability value output by the creative selection model closer to the standard recommendation probability value, and improving the accuracy of the predicted recommendation probability value.
  • Figure 9 is a schematic structural diagram of an advertising creative data selection device provided by an embodiment of the present application.
  • Book Embodiments can automatically and accurately select optimal target advertising creative data from candidate advertising creative data.
  • the device can be implemented in the form of software and/or hardware.
  • the device can be integrated in any device that provides the function of selecting advertising creative data.
  • the devices for selecting advertising creative data include:
  • the data acquisition module 910 is configured to acquire the candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising pictures and advertising copy; the probability value obtaining module 920 is configured to acquire the sparse advertising creative data corresponding to the candidate advertising creative data. feature vector and picture feature vector, and based on the sparse feature vector, the picture feature vector and the pre-trained creative selection model, obtain the recommendation probability value corresponding to the candidate advertising creative data; the data selection module 930 is set to The recommended probability value selects target advertising creative data; wherein the creative selection model is used to: use a self-attention mechanism to fuse based on the sparse feature vector and the picture feature vector, and output the recommended probability value based on the fusion result .
  • the creative selection model includes an MLP module, a self-attention module and an output module;
  • the MLP module is used to output a first feature vector based on the sparse feature vector; the self-attention module is used to output a second feature vector based on the sparse feature vector and the picture feature vector; the output module , used to output the recommendation probability value based on the first feature vector and the second feature vector.
  • the probability value obtaining module 920 is configured as:
  • the probability value obtaining module 920 is further configured to:
  • the classification label is a product word of a sample item contained in the sample picture, and the product word is used to characterize the type of the sample item and does not contain Vocabulary of brand information; input the sample picture into the residual neural network model to obtain the predicted classification output by the residual neural network model; determine a loss function based on the predicted classification and the classification label, based on the loss The function adjusts the network parameters in the residual neural network model and stops training when the preset iteration stop conditions are met.
  • the preset iteration stop condition includes that the prediction accuracy of the residual neural network model reaches a preset accuracy range, and the preset accuracy range may include [75%, 90%].
  • the tail of the residual neural network model includes three fully connected layers for outputting 32-dimensional vectors, 129-dimensional vectors, and 256-dimensional vectors respectively.
  • the self-attention module includes a multi-head self-attention module in the Transformer model.
  • the data acquisition module 910 is configured as:
  • the advertising material data includes advertising copy and advertising pictures; according to the online click data corresponding to the multiple advertising material data, from the multiple advertising material data Select at least one advertising copy and at least one advertising picture; combine the selected advertising copy and advertising pictures to obtain at least one candidate advertising creative data.
  • the data acquisition module 910 is further configured to:
  • the data acquisition module 910 is further configured to:
  • the candidate copy is identified from the item details page and/or advertising creative material of the target item; based on the first word list containing the preset interest point vocabulary, the candidate copy is identified Screening out the benefit point copywriting; based on the preset word limit conditions and/or a second vocabulary list containing preset non-selling point words, screening out the selling point copywriting from the remaining copywriting after removing the benefit point copywriting from the candidate copywriting .
  • the data acquisition module 910 is further configured to:
  • the score of the advertising material data is determined based on the average number of online clicks of the advertising material data and the cumulative number of times the advertising material data is selected; based on the score of each advertising material data, from At least one advertising copy and at least one advertising image are selected from the plurality of advertising material data.
  • the data acquisition module 910 is further configured to:
  • the data acquisition module 910 is further configured to:
  • the target item area in the advertising image contained in the candidate advertising creative data is smaller than the preset threshold, the target item area is cropped, and the cropped target item image is used to update the candidate advertising creative data. Advertising images included.
  • the data acquisition module 910 is further configured to:
  • the advertising image in the The target item area is subjected to retouching processing; wherein the retouching processing includes at least one of adjusting brightness, adjusting contrast, and adjusting saturation.
  • the data acquisition module 910 is further configured to:
  • the advertising image is a color image or a black image; in the case where the advertising image is a color image, based on the first preset brightness parameter value , the first preset contrast parameter value and the first preset saturation parameter value, to polish the target item area in the advertising picture; when the advertising picture is a black picture, based on the second preset brightness
  • the parameter value, the second preset contrast parameter value and the second preset saturation parameter value are used to polish the target item area in the advertising image; wherein the second preset brightness parameter value is greater than the first Preset brightness parameter value, the second preset contrast parameter value is greater than the first preset contrast parameter value, and the second preset saturation parameter value is greater than the first preset saturation parameter value.
  • the device is further configured to:
  • the advertising image and advertising copy in the target advertising creative data are obtained according to the URL, and the obtained advertising image and advertising copy are combined to obtain the target Advertising creative image; sending the target advertising creative image to the client for display.
  • the above-mentioned products can execute the methods provided by any embodiment of this application, and have corresponding functional modules and effects for executing the methods.
  • Figure 10 is a schematic structural diagram of a model training device provided by an embodiment of the present application.
  • the device can be implemented in the form of software and/or hardware.
  • the device can be integrated in any device that provides model training functions, as shown in Figure 10
  • the model training device includes:
  • the sample data acquisition module 1010 is configured to obtain training sample data.
  • the training sample data includes sample advertising creative data corresponding to sample items and standard recommendation probability values corresponding to the sample advertising creative data.
  • the sample advertising creative data includes advertising pictures. and advertising copy;
  • the vector acquisition module 1020 is configured to obtain the sparse feature vector and picture feature vector corresponding to the sample advertising creative data, and based on the sparse feature vector, the picture feature vector and the creative selection model to be trained, obtain The predicted recommendation probability value corresponding to the sample advertising creative data;
  • the model training module 1030 is configured to determine a loss function based on the standard recommendation probability value and the predicted recommendation probability value, and select the creative selection model based on the loss function. Adjust the network parameters and stop when the preset iteration stop conditions are met. Stop training; wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the predicted recommendation probability value based on the fusion result.
  • the creative selection model includes an MLP module, a self-attention module and an output module;
  • the MLP module is used to output a first feature vector based on the sparse feature vector; the self-attention module is used to output a second feature vector based on the sparse feature vector and the picture feature vector; the output module , used to output the predicted recommendation probability value based on the first feature vector and the second feature vector.
  • the above-mentioned products can execute the methods provided by any embodiment of this application, and have corresponding functional modules and effects for executing the methods.
  • Figure 11 is a schematic structural diagram of a computer device provided by an embodiment of the present application. 11 illustrates a block diagram of an exemplary computer device 12 suitable for implementing embodiments of the present application.
  • the computer device 12 shown in FIG. 11 is only an example and should not bring any limitations to the functions and scope of use of the embodiments of the present application.
  • computer device 12 is embodied in the form of a general purpose computing device.
  • the components of computer device 12 may include, but are not limited to, one or more processors or processing units 16, memory 28, and a bus 18 connecting various system components, including memory 28 and processing unit 16.
  • Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, a graphics accelerated port, a processor, or a local bus using any of a variety of bus structures.
  • these architectures include, but are not limited to, Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MAC) bus, Enhanced ISA bus, Video Electronics Standards Association (Video Electronics Standards) Association, VESA) local bus and Peripheral Component Interconnect (PCI) bus.
  • Computer device 12 includes a variety of computer system readable media. These media can be any available media that can be accessed by computer device 12, including volatile and nonvolatile media, removable and non-removable media.
  • Memory 28 may include computer system readable media in the form of volatile memory, such as random access memory (RAM) 30 and/or cache memory 32 .
  • Computer device 12 may include other removable/non-removable, volatile/non-volatile computer system storage media.
  • storage system 34 may be configured to read and write to non-removable, non-volatile magnetic media (not shown in Figure 11, commonly referred to as a "hard drive”).
  • a disk drive configured to read and write to a removable non-volatile disk (eg, a "floppy disk”) may be provided, and a disk drive configured to read and write to a removable non-volatile disk may be provided.
  • optical disc drive that reads and writes optical discs (such as Compact Disc Read-Only Memory (CD-ROM), Digital Video Disc-Read Only Memory (DVD-ROM) or other optical media).
  • each drive may be connected to bus 18 through one or more data media interfaces.
  • the memory 28 may include at least one program product having a set of (eg, at least one) program modules configured to perform the functions of embodiments of the present application.
  • a program/utility 40 having a set of (at least one) program modules 42, which may be stored, for example, in memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other programs Modules, as well as program data, each or a combination of these examples may include an implementation of a network environment.
  • Program modules 42 generally perform functions and/or methods in the embodiments described herein.
  • Computer device 12 may also communicate with one or more external devices 14 (e.g., keyboard, pointing device, display 24, etc.), with one or more devices that enable a user to interact with computer device 12, and/or with Any device (eg, network card, modem, etc.) that enables the computer device 12 to communicate with one or more other computing devices.
  • This communication may occur through an input/output (I/O) interface 22 .
  • the display 24 does not exist as an independent entity, but is embedded in the mirror. When the display surface of the display 24 is not displayed, the display surface of the display 24 and the mirror surface are visually integrated.
  • the computer device 12 can also communicate with one or more networks (such as a local area network (Local Area Network, LAN), a wide area network (Wide Area Network, WAN), and/or a public network, such as the Internet) through the network adapter 20.
  • network adapter 20 communicates with other modules of computer device 12 via bus 18 .
  • other hardware and/or software modules may be used in conjunction with the computer device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, disk arrays (Redundant Arrays). of Independent Disks, RAID) systems, tape drives, and data backup storage systems, etc.
  • the processing unit 16 executes a variety of functional applications and data processing by running programs stored in the memory 28, for example, implementing an advertising creative data selection method provided in the embodiment of the present application: obtaining candidate advertising creative data corresponding to the target item; Wherein, the candidate advertising creative data includes advertising pictures and advertising copy; the sparse feature vectors and picture feature vectors corresponding to the candidate advertising creative data are obtained, and based on the sparse feature vectors, the picture feature vectors and pre-trained creative Select a model to obtain the recommendation probability value corresponding to the candidate advertising creative data; wherein the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and based on the fusion result Output the recommended probability value, or implement a model training method provided by the embodiment of the present application: obtain training sample data, wherein the training sample data includes sample advertising creative data corresponding to the sample item and the sample advertising creative data Corresponding standard recommendation probability value, the sample Advertising creative data includes advertising pictures and advertising copy; obtain the sparse feature vectors
  • the predicted recommendation probability value corresponding to the sample advertising creative data determine the loss function according to the standard recommendation probability value and the predicted recommendation probability value, and adjust the network parameters in the creative selection model to be trained based on the loss function , and stop training when the preset iteration stop conditions are met; wherein the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and based on the fusion result Output the predicted recommendation probability value.
  • Embodiments of the present application provide a computer-readable storage medium on which a computer program is stored.
  • an advertising creative data selection method as provided in all embodiments of the present application is implemented: obtaining the corresponding target item.
  • Candidate advertising creative data wherein, the candidate advertising creative data includes advertising pictures and advertising copy; obtain the sparse feature vector and picture feature vector corresponding to the candidate advertising creative data, and based on the sparse feature vector, the picture feature vector and a pre-trained creative selection model to obtain the recommendation probability value corresponding to the candidate advertising creative data;
  • the creative selection model is used to: use a self-attention mechanism to compare the sparse feature vector and the picture feature vector Perform fusion, and output the recommended probability value based on the fusion result, or implement a model training method provided by the embodiment of the present application: obtain training sample data, wherein the training sample data includes sample advertising creative data corresponding to the sample item The standard recommendation probability value corresponding to the sample advertising creative data, which includes advertising pictures and advertising copy; obtains the
  • the computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium.
  • the computer-readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device or device, or any combination thereof.
  • Examples of computer-readable storage media include: electrical connections having one or more conductors, portable computer disks, hard drives, RAM, ROM, Erasable Programmable Read-Only Memory Memory, EPROM or flash memory), optical fiber, CD-ROM, optical storage device, magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that The program may be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device .
  • Program code embodied on a computer-readable medium can be transmitted using any appropriate medium, including but not limited to wireless, wire, optical cable, radio frequency (Radio Frequency, RF), etc., or any suitable combination of the above.
  • any appropriate medium including but not limited to wireless, wire, optical cable, radio frequency (Radio Frequency, RF), etc., or any suitable combination of the above.
  • Computer program code for performing the operations of the present application may be written in one or more programming languages, including object-oriented programming languages such as Java, Smalltalk, C++, and conventional procedural programming languages, or a combination thereof.
  • a programming language such as "C” or a similar programming language.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer may be connected to the user computer through any kind of network, including a LAN or WAN, or may be connected to an external computer (eg, through the Internet using an Internet service provider).

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Evolutionary Computation (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Game Theory and Decision Science (AREA)
  • General Business, Economics & Management (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Data Mining & Analysis (AREA)
  • Image Analysis (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

An advertisement creative data selection method and apparatus, a model training method and apparatus, and a device and a storage medium. The advertisement creative data selection method comprises: acquiring candidate advertisement creative data corresponding to a target article (S110); acquiring a sparse feature vector and a picture feature vector which correspond to the candidate advertisement creative data, and obtaining, on the basis of the sparse feature vector, the picture feature vector and a pre-trained creative selection model, a recommendation probability value corresponding to the candidate advertisement creative data (S120); and selecting target advertisement creative data according to the recommendation probability value (S130).

Description

广告创意数据选取方法及装置、模型训练方法及装置、设备、存储介质Advertising creative data selection method and device, model training method and device, equipment, storage media
本申请要求在2022年09月09日提交中国专利局、申请号为202211104745.3的中国专利申请的优先权,该申请的全部内容通过引用结合在本申请中。This application claims priority to the Chinese patent application with application number 202211104745.3, which was submitted to the China Patent Office on September 9, 2022. The entire content of this application is incorporated into this application by reference.
技术领域Technical field
本申请涉及人工智能技术领域,例如涉及广告创意数据选取方法及装置、模型训练方法及装置、设备、存储介质。This application relates to the field of artificial intelligence technology, such as advertising creative data selection methods and devices, model training methods and devices, equipment, and storage media.
背景技术Background technique
随着人工智能技术的不断发展,图像处理技术和自然语言处理技术已经被应用到广告行业的领域中。With the continuous development of artificial intelligence technology, image processing technology and natural language processing technology have been applied to the advertising industry.
广告创意选择方法一般是利用图像处理技术,对广告素材中的图像元素进行处理,利用自然语言处理模型对广告素材中的文字元素进行识别处理。这些方法都是针对特定领域的,普适性不好。并且这些方法不能直接为用户选择出最佳的广告创意元素,降低了用户体验。The advertising creative selection method generally uses image processing technology to process image elements in advertising materials, and uses natural language processing models to identify and process text elements in advertising materials. These methods are targeted at specific fields and have poor generalizability. Moreover, these methods cannot directly select the best advertising creative elements for users, which reduces the user experience.
发明内容Contents of the invention
本申请提供广告创意数据选取方法及装置、模型训练方法及装置、设备、存储介质,以实现自动精准的从候选广告创意数据中选取出最优的目标广告创意数据。This application provides advertising creative data selection methods and devices, model training methods and devices, equipment, and storage media to automatically and accurately select optimal target advertising creative data from candidate advertising creative data.
第一方面,本申请提供了一种广告创意数据选取方法,包括:In the first aspect, this application provides a method for selecting advertising creative data, including:
获取目标物品对应的候选广告创意数据;其中,所述候选广告创意数据包含广告图片和广告文案;Obtain candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising images and advertising copy;
获取所述候选广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及预先训练的创意选择模型,获得所述候选广告创意数据对应的推荐概率值;Obtain the sparse feature vector and picture feature vector corresponding to the candidate advertising creative data, and obtain the recommendation probability value corresponding to the candidate advertising creative data based on the sparse feature vector, the picture feature vector and the pre-trained creative selection model ;
根据所述推荐概率值选取目标广告创意数据;Select target advertising creative data according to the recommended probability value;
其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述推荐概率值。Wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the recommendation probability value based on the fusion result.
第二方面,本申请还提供了一种模型训练方法,所述方法包括:In a second aspect, this application also provides a model training method, which method includes:
获取训练样本数据,其中,所述训练样本数据包括样本物品对应的样本广 告创意数据和所述样本广告创意数据对应的标准推荐概率值,所述样本广告创意数据包含广告图片和广告文案;Obtain training sample data, wherein the training sample data includes a sample range corresponding to the sample item The standard recommendation probability value corresponding to the advertising creative data and the sample advertising creative data, where the sample advertising creative data includes advertising images and advertising copy;
获取所述样本广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及待训练的创意选择模型,获得所述样本广告创意数据对应的预测推荐概率值;Obtain the sparse feature vector and picture feature vector corresponding to the sample advertising creative data, and obtain the predicted recommendation probability corresponding to the sample advertising creative data based on the sparse feature vector, the picture feature vector and the creative selection model to be trained. value;
根据所述标准推荐概率值和所述预测推荐概率值确定损失函数,基于所述损失函数对所述待训练的创意选择模型中的网络参数进行调整,并在满足预设迭代停止条件时停止训练;Determine a loss function according to the standard recommendation probability value and the predicted recommendation probability value, adjust the network parameters in the creative selection model to be trained based on the loss function, and stop training when the preset iteration stop conditions are met ;
其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述预测推荐概率值。Wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the predicted recommendation probability value based on the fusion result.
第三方面,本申请还提供了一种广告创意数据选取装置,该装置包括:In a third aspect, this application also provides an advertising creative data selection device, which includes:
数据获取模块,设置为获取目标物品对应的候选广告创意数据;其中,所述候选广告创意数据包含广告图片和广告文案;The data acquisition module is configured to obtain candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising pictures and advertising copy;
概率值得到模块,设置为获取所述候选广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及预先训练的创意选择模型,获得所述候选广告创意数据对应的推荐概率值;The probability value obtaining module is configured to obtain the sparse feature vector and picture feature vector corresponding to the candidate advertisement creative data, and obtain the candidate advertisement based on the sparse feature vector, the picture feature vector and the pre-trained creative selection model. Recommendation probability value corresponding to creative data;
数据选取模块,设置为根据所述推荐概率值选取目标广告创意数据;A data selection module configured to select target advertising creative data according to the recommendation probability value;
其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述推荐概率值。Wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the recommendation probability value based on the fusion result.
第四方面,本申请还提供一种模型训练装置,所述装置包括:In a fourth aspect, this application also provides a model training device, which includes:
样本数据获取模块,设置为获取训练样本数据,其中,所述训练样本数据包括样本物品对应的样本广告创意数据和所述样本广告创意数据对应的标准推荐概率值,所述样本广告创意数据包含广告图片和广告文案;A sample data acquisition module configured to obtain training sample data, wherein the training sample data includes sample advertising creative data corresponding to sample items and standard recommendation probability values corresponding to the sample advertising creative data, and the sample advertising creative data includes advertisements. images and advertising copy;
向量获取模块,设置为获取所述样本广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及待训练的创意选择模型,获得所述样本广告创意数据对应的预测推荐概率值;A vector acquisition module configured to obtain the sparse feature vector and the picture feature vector corresponding to the sample advertising creative data, and obtain the sample advertising creative based on the sparse feature vector, the picture feature vector and the creative selection model to be trained. The predicted recommendation probability value corresponding to the data;
模型训练模块,设置为根据所述标准推荐概率值和所述预测推荐概率值确定损失函数,基于所述损失函数对所述待训练的创意选择模型中的网络参数进行调整,并在满足预设迭代停止条件时停止训练;A model training module configured to determine a loss function based on the standard recommendation probability value and the predicted recommendation probability value, adjust the network parameters in the creative selection model to be trained based on the loss function, and adjust the network parameters in the creative selection model to be trained, and perform Stop training when iterating the stop condition;
其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述预测推荐概率值。Wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the predicted recommendation probability value based on the fusion result.
第五方面,本申请还提供了一种计算机设备,包括存储器、处理器及存储 在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现上述的广告创意数据选取方法或模型训练方法。In a fifth aspect, this application also provides a computer device, including a memory, a processor and a storage device. A computer program is stored in a memory and can be run on a processor. When the processor executes the program, the above-mentioned advertising creative data selection method or model training method is implemented.
第六方面,本申请还提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现上述的广告创意数据选取方法或模型训练方法。In a sixth aspect, the present application also provides a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, the above-mentioned advertising creative data selection method or model training method is implemented.
附图说明Description of the drawings
图1为本申请实施例提供的一种广告创意数据选取方法的流程图;Figure 1 is a flow chart of an advertising creative data selection method provided by an embodiment of the present application;
图2为本申请实施例提供的一种创意选择模型的结构示意图;Figure 2 is a schematic structural diagram of a creative selection model provided by an embodiment of the present application;
图3为本申请实施例提供的一种获取图片特征向量的方法流程图;Figure 3 is a flow chart of a method for obtaining image feature vectors provided by an embodiment of the present application;
图4为本申请实施例提供的一种获取候选广告创意数据方法的流程图;Figure 4 is a flow chart of a method for obtaining candidate advertising creative data provided by an embodiment of the present application;
图5为本申请实施例提供的另一种获取候选广告创意数据方法的流程图;Figure 5 is a flow chart of another method of obtaining candidate advertising creative data provided by an embodiment of the present application;
图6为本申请实施例提供的一种对候选广告创意数据进行优化处理的流程图;Figure 6 is a flow chart for optimizing candidate advertising creative data provided by an embodiment of the present application;
图7为本申请实施例提供的一种广告创意数据选取和优化过程的流程示意图;Figure 7 is a schematic flow chart of an advertising creative data selection and optimization process provided by an embodiment of the present application;
图8为本申请实施例提供的一种模型训练方法的流程图;Figure 8 is a flow chart of a model training method provided by an embodiment of the present application;
图9为本申请实施例提供的一种广告创意数据选取装置的结构示意图;Figure 9 is a schematic structural diagram of an advertising creative data selection device provided by an embodiment of the present application;
图10为本申请实施例提供的一种模型训练装置的结构示意图;Figure 10 is a schematic structural diagram of a model training device provided by an embodiment of the present application;
图11为本申请实施例提供的一种计算机设备的结构示意图。Figure 11 is a schematic structural diagram of a computer device provided by an embodiment of the present application.
具体实施方式Detailed ways
下面结合附图和实施例对本申请进行说明。此处所描述的具体实施例仅仅用于解释本申请。为了便于描述,附图中仅示出了与本申请相关的部分。The present application will be described below in conjunction with the drawings and embodiments. The specific embodiments described herein are merely illustrative of the application. For convenience of description, only parts relevant to the present application are shown in the drawings.
相似的标号和字母在下面的附图中表示类似项,因此,一旦一项在一个附图中被定义,则在随后的附图中不需要对其进行定义和解释。同时,在本申请的描述中,术语“第一”、“第二”等仅用于区分描述,而不能理解为指示或暗示相对重要性。Similar reference numbers and letters refer to similar items in the following figures, so that once an item is defined in one figure, it does not need to be defined or explained in subsequent figures. Meanwhile, in the description of the present application, the terms "first", "second", etc. are only used to differentiate the description and cannot be understood as indicating or implying relative importance.
图1为本申请实施例提供的一种广告创意数据选取方法的流程图,本实施例能够实现自动精准的从候选广告创意数据中选取出最优的目标广告创意数据, 该方法可以由本申请实施例中的广告创意数据选取装置来执行,该装置可采用软件和/或硬件的方式实现,如图1所示,该方法包括如下步骤:Figure 1 is a flow chart of an advertising creative data selection method provided by an embodiment of the present application. This embodiment can automatically and accurately select optimal target advertising creative data from candidate advertising creative data. This method can be executed by the advertising creative data selection device in the embodiment of the present application. The device can be implemented in software and/or hardware. As shown in Figure 1, the method includes the following steps:
S110,获取目标物品对应的候选广告创意数据。S110: Obtain candidate advertising creative data corresponding to the target item.
目标物品表示需要为其生成或选择相应的广告创意数据的物品。候选广告创意数据可以有多个,每个候选广告创意数据用于描述目标物品的一个广告创意方案。其中,候选广告创意数据包含广告图片和广告文案。The target item represents the item for which corresponding advertising creative data needs to be generated or selected. There may be multiple candidate advertising creative data, and each candidate advertising creative data is used to describe an advertising creative plan for the target item. Among them, the candidate advertising creative data includes advertising images and advertising copy.
在一些物品网站、手机应用(Application,APP)中展示的物品页面中,可以通过物品详情页获取到物品对应的广告素材数据(广告文案和广告图片)。通过对详情页中的内容进行文本识别和文字提取等,可以得到目标物品的广告文案;通过对详情页中的内容进行图像识别和目标检测等,可以得到目标物品的广告图片。或者,可以在广告公司等提供的广告创意素材中,获取到目标物品对应的广告素材数据。根据需求对目标物品对应的广告素材数据进行筛选,得到目标物品对应的候选广告创意数据。示例的,目标物品为手机。在手机网站中,查看多款手机的详情页的内容。详情页中的上方有手机在多个角度的展示图片,图片下方有对应的广告文案。对手机详情页中展示的内容进行文本识别和文字提取,提取出手机详情页下方的广告文案。对详情页中的多个角度的手机图片进行定位,得到广告图片。当提取出的广告图像尺寸不合适时,可以对图片进行智能剪裁,得到最终的广告图片。在得到广告素材数据后,根据需求(例如点击量、敏感词汇等)对广告素材数据进行筛选,得到候选广告创意数据。In the item pages displayed on some item websites and mobile applications (Application, APP), the advertising material data (advertising copy and advertising pictures) corresponding to the item can be obtained through the item details page. By performing text recognition and text extraction on the content in the details page, the advertising copy of the target item can be obtained; by performing image recognition and target detection on the content in the details page, the advertising image of the target item can be obtained. Alternatively, the advertising material data corresponding to the target item can be obtained from the advertising creative materials provided by advertising companies. Screen the advertising creative data corresponding to the target item according to the needs to obtain the candidate advertising creative data corresponding to the target item. For example, the target item is a mobile phone. On the mobile website, view the content of the detail pages of multiple mobile phones. There are pictures of mobile phones displayed at multiple angles at the top of the details page, and corresponding advertising copy below the pictures. Perform text recognition and text extraction on the content displayed on the mobile phone details page, and extract the advertising copy at the bottom of the mobile phone details page. Position the mobile phone pictures from multiple angles in the details page to obtain the advertising pictures. When the size of the extracted advertising image is not suitable, the image can be intelligently cropped to obtain the final advertising image. After obtaining the creative data, the creative data is filtered according to needs (such as click volume, sensitive words, etc.) to obtain candidate advertising creative data.
S120,获取候选广告创意数据对应的稀疏特征向量和图片特征向量,并基于稀疏特征向量、图片特征向量以及预先训练的创意选择模型,获得候选广告创意数据对应的推荐概率值。S120: Obtain the sparse feature vector and picture feature vector corresponding to the candidate advertising creative data, and obtain the recommendation probability value corresponding to the candidate advertising creative data based on the sparse feature vector, picture feature vector and the pre-trained creative selection model.
创意选择模型用于:采用自注意力机制对稀疏特征向量和图片特征向量进行融合;并基于融合结果输出推荐概率值。本方案中,创意选择模型包括多层感知机神经网络(Multilayer Perceptron,MLP)模块、自注意力模块和输出模块;其中,MLP模块用于基于稀疏特征向量输出第一特征向量;自注意力模块用于基于稀疏特征向量和图片特征向量输出第二特征向量;输出模块用于基于第一特征向量和第二特征向量输出推荐概率值。The creative selection model is used to: use the self-attention mechanism to fuse sparse feature vectors and image feature vectors; and output recommendation probability values based on the fusion results. In this solution, the creative selection model includes a multilayer perceptron (MLP) module, a self-attention module and an output module; among them, the MLP module is used to output the first feature vector based on sparse feature vectors; the self-attention module The output module is configured to output a second feature vector based on the sparse feature vector and the image feature vector; the output module is configured to output a recommendation probability value based on the first feature vector and the second feature vector.
稀疏特征向量是用于反映多个类型的稀疏特征的向量。本方案的稀疏特征包括物品特征、用户特征和创意特征。物品特征包括物品标识号、广告位标识号、品牌标识号和物品类目标识号等信息。用户特征包括用户的年龄、性别和偏好等。创意特征包括背景模板特征、文案特征和图片特征。背景模板特征包括背景模板标识号、模板风格、模板布局、模板主颜色等特征。文案特征包括 主文案、副文案和气泡文案(用于表示物品正在促销、热卖的文案类型)等特征,文案特征可以通过广告创意数据中的广告文案得到。图片特征包括广告图片上是否有文字、是否有人以及创意类型等特征。图片特征向量是用于反映广告图片的图片特征的向量,图片特征可以通过广告创意数据中的广告图片得到。MLP模块可以将输入的多个特征向量映射到单一的输出的特征向量上。自注意力模块可以快速提取稀疏特征向量中的重要特征。本方案中,自注意力模块包括Transformer模型中的多头自注意力模块。其中,Transformer模型是一种神经网络模型,Transformer模型可以通过跟踪序列数据中的关系来学习数据的上下文,Transformer模型中包括多头自注意力模块。多头自注意力模块可以从多个维度提炼特征信息,并且多头自注意力模块并行度高,能够将不同的维度的信息组合起来,以此捕获序列内多种范围的依赖关系。A sparse feature vector is a vector used to reflect multiple types of sparse features. The sparse features of this solution include item features, user features and creative features. Item characteristics include information such as item identification number, advertising space identification number, brand identification number, and item type target identification number. User characteristics include the user’s age, gender, preferences, etc. Creative features include background template features, copywriting features and picture features. Background template features include background template identification number, template style, template layout, template main color and other features. Copywriting features include Features such as main copy, sub-copy, and bubble copy (the type of copy used to indicate that items are on sale or hot-selling), etc. The copy characteristics can be obtained from the advertising copy in the advertising creative data. Image features include whether there is text on the advertising image, whether there are people, and the creative type. The image feature vector is a vector used to reflect the image features of the advertising image. The image features can be obtained from the advertising images in the advertising creative data. The MLP module can map multiple input feature vectors to a single output feature vector. The self-attention module can quickly extract important features from sparse feature vectors. In this solution, the self-attention module includes the multi-head self-attention module in the Transformer model. Among them, the Transformer model is a neural network model. The Transformer model can learn the context of the data by tracking the relationships in the sequence data. The Transformer model includes a multi-head self-attention module. The multi-head self-attention module can extract feature information from multiple dimensions, and the multi-head self-attention module is highly parallel and can combine information from different dimensions to capture multiple ranges of dependencies within the sequence.
将稀疏特征矩阵、图片特征向量输入进创意选择模型,可以获得候选广告创意数据对应的推荐概率值。图2为本申请实施例提供的一种创意选择模型的结构示意图。如图2所示,将稀疏特征矩阵和图像特征向量输入进创意选择模型,利用向量转化表可以将稀疏特征矩阵转化为稀疏特征向量。将稀疏特征向量和图像特征向量输入进多头自注意力模块,将稀疏特征向量输入进MLP模块。最终根据多头自注意力模块输出的第二特征向量和MLP模块的输出的第一特征向量,预测出候选广告创意数据对应的推荐概率值(输出模块的输出结果)。By inputting the sparse feature matrix and image feature vector into the creative selection model, the recommendation probability value corresponding to the candidate advertising creative data can be obtained. Figure 2 is a schematic structural diagram of a creative selection model provided by an embodiment of the present application. As shown in Figure 2, the sparse feature matrix and image feature vector are input into the creative selection model, and the sparse feature matrix can be converted into a sparse feature vector using the vector conversion table. Input sparse feature vectors and image feature vectors into the multi-head self-attention module, and input sparse feature vectors into the MLP module. Finally, based on the second feature vector output by the multi-head self-attention module and the first feature vector output by the MLP module, the recommendation probability value corresponding to the candidate advertising creative data (the output result of the output module) is predicted.
在使用创意选择模型前,需要对创意选择模型进行训练。收集大量广告创意方案,从广告创意方案中整理出广告创意数据(背景模板信息、物品信息、文案信息和图片信息等)。将标注的稀疏特征向量和图片特征向量作为样本数据,将标注的稀疏特征向量和图片特征向量的推荐概率值(1或0)作为样本标签。将样本数据输入进创意选择模型,得到创意选择模型预测出的与样本数据对应的推荐概率值。再利用样本标签和预测出的推荐概率值计算损失函数,根据计算结果不断对创意选择模型的模型参数进行调整和训练,得到训练后的创意选择模型。Before using the creative selection model, the creative selection model needs to be trained. Collect a large number of advertising creative plans and sort out advertising creative data (background template information, item information, copywriting information, picture information, etc.) from the advertising creative plans. The annotated sparse feature vectors and image feature vectors are used as sample data, and the recommended probability values (1 or 0) of the annotated sparse feature vectors and image feature vectors are used as sample labels. Input the sample data into the creative selection model, and obtain the recommendation probability value predicted by the creative selection model corresponding to the sample data. Then use the sample labels and predicted recommendation probability values to calculate the loss function, and continuously adjust and train the model parameters of the creative selection model based on the calculation results to obtain the trained creative selection model.
本申请技术方案中对数据的获取、存储、使用、处理等均获得用户授权,符合国家法律法规的相关规定。The acquisition, storage, use and processing of data in the technical solution of this application are all authorized by the user and comply with the relevant provisions of national laws and regulations.
S130,根据推荐概率值选取目标广告创意数据。S130: Select target advertising creative data according to the recommendation probability value.
目标广告创意数据是从多个候选广告创意数据中选取出的最优广告创意数据。目标广告创意数据包括目标物品的广告文案、广告图片、背景模板(模板风格、背景颜色、布局方式等)等和目标物品的广告方案有关的数据。推荐概率值是创意选择模型输出的为用户推荐对应候选广告创意数据的概率值。推荐概率值越大,表示创意选择模型认为对应候选广告创意数据的优秀程度越高。 可以根据需求设置预设概率值,当推荐概率值大于预设概率值时,则将对应的候选广告创意数据确定为目标广告创意数据。或者,直接选取推荐概率值最大的候选广告创意数据作为目标广告创意数据。The target advertising creative data is the optimal advertising creative data selected from multiple candidate advertising creative data. The target advertising creative data includes the target item's advertising copy, advertising pictures, background templates (template style, background color, layout, etc.) and other data related to the target item's advertising plan. The recommendation probability value is the probability value output by the creative selection model to recommend the corresponding candidate advertising creative data to the user. The larger the recommendation probability value is, the higher the creative selection model believes that the corresponding candidate advertising creative data is. A preset probability value can be set according to requirements. When the recommendation probability value is greater than the preset probability value, the corresponding candidate advertising creative data is determined as the target advertising creative data. Or, directly select the candidate advertising creative data with the largest recommendation probability value as the target advertising creative data.
在选取目标广告创意数据之后,本方案中,还包括如下步骤A1-步骤A2:After selecting the target advertising creative data, this plan also includes the following steps A1-Step A2:
步骤A1:获取目标广告创意数据中广告图片的第一编码信息和广告文案的第二编码信息,根据第一编码信息和第二编码信息生成目标广告创意数据对应的统一资源定位符(URL,Uniform Resource Locator)。Step A1: Obtain the first coding information of the advertising image and the second coding information of the advertising copy in the target advertising creative data, and generate a uniform resource locator (URL, Uniform) corresponding to the target advertising creative data based on the first coding information and the second coding information. Resource Locator).
URL是从互联网上得到的资源的位置和访问方法的一种简洁的表示,是互联网上标准资源的地址。互联网上的每个文件都有一个唯一的URL,URL包含的信息指出文件的位置以及浏览器应该怎么处理。对广告图片进行URL编码,可以得到第一编码信息,对广告文案进行URL编码可以得到第二编码信息。利用第一编码信息和第二编码信息可以生成一个URL,即目标广告创意数据对应的URL。通过目标广告创意数据对应的URL,可以直接访问到该广告创意数据的地址。URL is a concise representation of the location and access method of resources obtained from the Internet. It is the address of standard resources on the Internet. Every file on the Internet has a unique URL, which contains information indicating the location of the file and what the browser should do with it. The first encoding information can be obtained by URL encoding the advertising image, and the second encoding information can be obtained by URL encoding the advertising copy. A URL can be generated using the first encoding information and the second encoding information, that is, the URL corresponding to the target advertising creative data. Through the URL corresponding to the target advertising creative data, the address of the advertising creative data can be directly accessed.
步骤A2:在接收到客户端发送的针对URL的访问请求的情况下,根据URL获取目标广告创意数据中的广告图片和广告文案,并将获取的广告图片和广告文案执行合图操作,得到目标广告创意图像;将目标广告创意图像发送给客户端进行展示。Step A2: After receiving the access request for the URL sent by the client, obtain the advertising image and advertising copy in the target advertising creative data according to the URL, and perform a combined image operation on the obtained advertising image and advertising copy to obtain the target Advertising creative image; send the target advertising creative image to the client for display.
通过目标广告创意数据对应的URL,可以直接访问到该广告创意数据的地址。当接收到客户端发送的URL的访问请求时,根据URL获取到该URL所指向的广告图片和广告文案。将获取的广告图片和广告文案进行合图操作,得到广告图片,并将该广告图片作为目标广告创意图像。例如,利用图像处理软件,结合目标物品的类别信息等将广告文案和广告图片融合在一起。合图过程中可以根据需求和实际环境对广告文案和广告图像的位置、大小进行适当调整,最终得到目标广告创意图像。Through the URL corresponding to the target advertising creative data, the address of the advertising creative data can be directly accessed. When receiving an access request for a URL sent by the client, the advertising image and advertising copy pointed to by the URL are obtained based on the URL. The obtained advertising image and advertising copy are combined to obtain an advertising image, and the advertising image is used as the target advertising creative image. For example, image processing software is used to fuse the advertising copy and advertising images together with the category information of the target item. In the process of combining pictures, the position and size of the advertising copy and advertising image can be appropriately adjusted according to the needs and actual environment, and finally the target advertising creative image can be obtained.
通过上述步骤生成目标广告创意数据对应的URL,可以节省图片存储占用的资源,并且广告图像也可以随着URL编码的变化随时更新,提升了为用户提供广告创意数据的效率。Generating the URL corresponding to the target advertising creative data through the above steps can save the resources occupied by image storage, and the advertising image can also be updated at any time as the URL encoding changes, improving the efficiency of providing advertising creative data to users.
本实施例的技术方案,通过获取目标物品对应的候选广告创意数据;其中,候选广告创意数据包含广告图片和广告文案。获取候选广告创意数据对应的稀疏特征向量和图片特征向量,并基于稀疏特征向量、图片特征向量以及预先训练的创意选择模型,获得候选广告创意数据对应的推荐概率值;其中,创意选择模型包括MLP模块、自注意力模块和输出模块,MLP模块用于基于稀疏特征 向量输出第一特征向量,自注意力模块用于基于稀疏特征向量和图片特征向量输出第二特征向量;输出模块用于基于第一特征向量和第二特征向量输出推荐概率值。根据推荐概率值选取目标广告创意数据。本实施例的方案可以利用创意选择模型自动从收集到的候选广告创意数据中选取目标广告创意数据,该创意选择模型中包括的自注意力模块可以对稀疏特征向量的标识(Identifier,ID)类特征与图片特征向量进行融合,从而可以将图片和文字进行融合处理,解决广告系统中基于多模态的创意选择问题,具有较好的普适性。The technical solution of this embodiment is to obtain candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising pictures and advertising copy. Obtain the sparse feature vector and picture feature vector corresponding to the candidate advertising creative data, and obtain the recommendation probability value corresponding to the candidate advertising creative data based on the sparse feature vector, picture feature vector and the pre-trained creative selection model; among which, the creative selection model includes MLP module, self-attention module and output module, MLP module is used based on sparse features The vector outputs a first feature vector, the self-attention module is used to output a second feature vector based on the sparse feature vector and the picture feature vector; the output module is used to output a recommendation probability value based on the first feature vector and the second feature vector. Select target advertising creative data based on recommendation probability values. The solution of this embodiment can use a creative selection model to automatically select target advertising creative data from the collected candidate advertising creative data. The self-attention module included in the creative selection model can identify (Identifier, ID) classes of sparse feature vectors. Features are fused with image feature vectors, so that images and text can be fused to solve the multi-modal creative selection problem in the advertising system, and it has good universality.
图3为本申请实施例提供的一种获取图片特征向量的方法流程图,本实施例以上述实施例为基础对获取图片特征向量的方法进行说明。如图3所示,本实施例的方法包括如下步骤:Figure 3 is a flow chart of a method for obtaining a picture feature vector provided by an embodiment of the present application. This embodiment explains the method of obtaining a picture feature vector based on the above embodiment. As shown in Figure 3, the method of this embodiment includes the following steps:
S210,将候选广告创意数据中的广告图片输入预先训练出的残差神经网络模型。S210. Input the advertising images in the candidate advertising creative data into the pre-trained residual neural network model.
候选广告创意数据中包括广告文案和广告图片。如图2所示,需要将图片特征向量输入进多头自注意力模块中。因此,需要将候选广告创意数据中的广告图片输入预先训练出的残差神经网络模型,得到图片特征向量。本方案实施例中,对残差神经网络模型进行训练,包括如下步骤B1-步骤B3:The candidate advertising creative data includes advertising copy and advertising images. As shown in Figure 2, the image feature vector needs to be input into the multi-head self-attention module. Therefore, it is necessary to input the advertising images in the candidate advertising creative data into the pre-trained residual neural network model to obtain the image feature vector. In this embodiment of the solution, training the residual neural network model includes the following steps B1 to B3:
步骤B1:获取样本图片和样本图片对应的分类标签。Step B1: Obtain the sample image and the classification label corresponding to the sample image.
分类标签为样本图片中包含的样本物品的产品词,产品词是用于表征样本物品的种类并且不包含品牌信息的词汇。相关技术中对残差神经网络模型进行训练时,采用物品的类目词作为分类标签。物品的类目词中包括物品的品牌信息,产品词可以用于表示物品的种类并且不包含品牌信息。例如,样本物品是品牌A的手机,则对应的类目词包括手机和品牌A,对应的产品词仅包括手机。样本图片是样本物品的广告方案中的广告图片。The classification label is the product word of the sample item contained in the sample picture. The product word is a vocabulary used to characterize the type of sample item and does not contain brand information. In related technologies, when training a residual neural network model, the category words of items are used as classification labels. The category word of an item includes the brand information of the item, and the product word can be used to indicate the type of item and does not contain brand information. For example, if the sample item is a mobile phone of brand A, the corresponding category words include mobile phones and brand A, and the corresponding product words only include mobile phones. A sample image is an advertising image in an advertising plan for a sample item.
可以从一些网站或手机APP中的详情页,获取到样本图片和样本图片对应的分类标签。或者,从广告商或专业人员提供的广告素材库中获取样本图片和样本图片对应的分类标签。示例的,可以根据物品的详情页/广告素材库中对所有物品的曝光量由高到低进行排序,选择出所有物品中曝光量前10000的物品对应的物品类别信息,将该类别信息作为样本标签。用产品词作为样本标签的方法适用于大规模、多分类的任务,可以提高残差神经网络模型生成图片特征向量的泛化性,避免了物品类别信息过于集中导致对图像内容表达的不充分。并且,在实际应用中,对于广告数据而言,物品的种类信息往往比物品的品牌信息更加重要。例如,手机类的广告方案和服装类的广告方案有很大的不同。 手机类的广告方案中可能需要突出广告文案(描述手机的性能等),而服装类的广告方案中可能更加需要突出广告图片。但是同一类的物品中,即使是不同品牌的物品,其广告方案也大都是相同的。例如不同品牌的手机,其广告方案的不同可能只是广告文案中描述的内容不同。因此,将样本物品的产品词作为分类标签可以更加贴近实际状况,提高了创意选择模型的推荐概率值的准确度。You can obtain sample pictures and the classification labels corresponding to the sample pictures from the detail pages of some websites or mobile APPs. Or, obtain sample images and classification labels corresponding to the sample images from the creative library provided by advertisers or professionals. For example, you can sort all items from high to low based on the exposure of all items in the item's detail page/ad creative library, select the item category information corresponding to the top 10,000 exposure items among all items, and use this category information as a sample. Label. The method of using product words as sample labels is suitable for large-scale, multi-classification tasks. It can improve the generalization of image feature vectors generated by the residual neural network model and avoid insufficient expression of image content caused by excessive concentration of item category information. Moreover, in practical applications, for advertising data, item type information is often more important than item brand information. For example, there are big differences between mobile phone advertising plans and clothing advertising plans. An advertising plan for mobile phones may need to highlight the advertising copy (describing the performance of the phone, etc.), while an advertising plan for clothing may need to highlight advertising images. However, within the same category of items, even items of different brands, the advertising plans are mostly the same. For example, for different brands of mobile phones, the difference in their advertising plans may only be the content described in the advertising copy. Therefore, using the product words of sample items as classification labels can be closer to the actual situation and improve the accuracy of the recommendation probability value of the creative selection model.
步骤B2:将样本图片输入残差神经网络模型,获取残差神经网络模型输出的预测分类。Step B2: Input the sample image into the residual neural network model to obtain the predicted classification output by the residual neural network model.
残差神经网络模型是卷积神经网络模型的一种,例如残差网络(Residual Network,ResNet)模型。残差神经网络模型多适用于图像分类和物体识别。残差神经网络模型易于优化,可以通过增加一定的网络模型深度从而提高准确率。其内部的残差块使用的是跳跃连接,缓解了在深度神经网络中增加深度带来的梯度消失问题。将样本图片输入进残差神经网络模型,残差神经网络模型通过计算推理预测出样本图片对应的种类。本方案实施例中,残差神经网络模型的尾部包含分别用于输出32维向量、128维向量和256维向量的三个全连接层,即在ResNet模型的尾部添加这三个全连接层。The residual neural network model is a type of convolutional neural network model, such as the residual network (Residual Network, ResNet) model. Residual neural network models are mostly suitable for image classification and object recognition. The residual neural network model is easy to optimize, and the accuracy can be improved by increasing a certain depth of the network model. The internal residual block uses skip connections, which alleviates the vanishing gradient problem caused by increasing depth in deep neural networks. Input the sample image into the residual neural network model, and the residual neural network model predicts the type corresponding to the sample image through computational reasoning. In the embodiment of this solution, the tail of the residual neural network model includes three fully connected layers for outputting 32-dimensional vectors, 128-dimensional vectors and 256-dimensional vectors respectively. That is, these three fully connected layers are added to the tail of the ResNet model.
残差神经网络模型中包括卷积层、池化层、激活函数和全连接层等。其中,卷积层、池化层和激活函数等操作是将原始数据映射到隐层的特征空间,得到特征向量。而全连接层可以将分布式特征表示的特征向量映射到样本标记空间。全连接层可以通过对特征向量的特征进行提取,根据样本图片的特征向量对样本图片进行分类。根据样本数据的数据量大小以及对样本数据的分类要求,可以为残差神经网络模型设置不同维度的输出向量。利用尾部包含有输出32维向量、128维向量和256维向量的三个全连接层的残差神经网络模型,可以根据业务需求和样本数据量的大小灵活、准确的预测出样本图片所在的类别。The residual neural network model includes convolutional layers, pooling layers, activation functions and fully connected layers. Among them, operations such as convolution layer, pooling layer and activation function map the original data to the feature space of the hidden layer to obtain the feature vector. The fully connected layer can map the feature vector of the distributed feature representation to the sample label space. The fully connected layer can extract the features of the feature vector and classify the sample images according to the feature vectors of the sample images. Depending on the size of the sample data and the classification requirements of the sample data, output vectors of different dimensions can be set for the residual neural network model. Utilizing a residual neural network model with three fully connected layers at the tail that output 32-dimensional vectors, 128-dimensional vectors and 256-dimensional vectors, the category of the sample image can be flexibly and accurately predicted based on business needs and the size of the sample data. .
步骤B3:基于预测分类和分类标签确定损失函数,基于损失函数对残差神经网络模型中的网络参数进行调整,并在满足预设迭代停止条件时停止训练。Step B3: Determine the loss function based on the predicted classification and classification label, adjust the network parameters in the residual neural network model based on the loss function, and stop training when the preset iteration stop conditions are met.
预测分类是将样本图片输入进残差神经网络模型后,残差神经网络模型经过计算预测出的样本图片的类别。分类标签是标注的样本图片的真实类别。Predictive classification refers to the category of the sample image predicted by the residual neural network model after inputting the sample image into the residual neural network model. The classification label is the true category of the annotated sample image.
通过损失函数可以计算出预测分类和样本标签之间的“差距”,根据基于预测分类和分类标签确定的损失函数,可以不断的对残差神经网络模型中的网络参数进行调整,使得预测分类和分类标签越来越接近,直到满足预设迭代停止条件时停止训练。其中,预设迭代停止条件包括残差神经网络模型的预测准确率达到预设准确率范围,本方案实施例中,预设准确率范围包括[75%,80%]。残差神经网络模型的预测准确率越高,预测出来的样本图片的类别越准确。但同时,残差神经网络模型的计算复杂度就越大,导致在实际应用中,残差神经网 络模型的计算速度慢,并且还有可能会造成过拟合的问题发生。因此,在提高残差神经网络模型预测准确率的基础上,为了避免残差神经网络模型的计算复杂度过大,可以将预设准确率范围设置为[75%,80%]。当残差神经网络模型的预测准确率不小于75%并且不大于80%时,就可以停止训练残差神经网络模型。The "gap" between the predicted classification and the sample label can be calculated through the loss function. According to the loss function determined based on the predicted classification and classification label, the network parameters in the residual neural network model can be continuously adjusted, so that the predicted classification and The classification labels get closer and closer until the training stops when the preset iteration stopping condition is met. The preset iteration stop condition includes that the prediction accuracy of the residual neural network model reaches the preset accuracy range. In the embodiment of this solution, the preset accuracy range includes [75%, 80%]. The higher the prediction accuracy of the residual neural network model, the more accurate the category of the predicted sample image. But at the same time, the computational complexity of the residual neural network model becomes greater, resulting in the residual neural network in practical applications. The calculation speed of the network model is slow and may cause over-fitting problems. Therefore, on the basis of improving the prediction accuracy of the residual neural network model, in order to avoid the excessive computational complexity of the residual neural network model, the preset accuracy range can be set to [75%, 80%]. When the prediction accuracy of the residual neural network model is not less than 75% and not greater than 80%, the training of the residual neural network model can be stopped.
上述步骤中,将样本物品的产品词作为分类标签可以提高创意选择模型的推荐概率值的准确度。利用尾部包含有输出32维向量、128维向量和256维向量的三个全连接层的残差神经网络模型,可以根据业务需求和样本数据量的大小灵活、准确的预测出样本图片所在的类别,提高了创意选择模型的推荐概率值的准确度。In the above steps, using the product words of the sample items as classification labels can improve the accuracy of the recommendation probability value of the creative selection model. Utilizing a residual neural network model with three fully connected layers at the tail that output 32-dimensional vectors, 128-dimensional vectors and 256-dimensional vectors, the category of the sample image can be flexibly and accurately predicted based on business needs and the size of the sample data. , improving the accuracy of the recommended probability value of the creative selection model.
S220,获取残差神经网络模型输出的图片特征向量。S220: Obtain the image feature vector output by the residual neural network model.
图片特征向量是将图片的一些特征用向量的形式表示出来的向量。本方案中,残差神经网络模型输出的图片特征向量可以用于表示图片的类别。例如,将手机的图片输入进残差神经网络模型中,残差神经网络模型预测出图片的特征为“类别:手机图片”,并输出带有“手机”特征的图片特征向量。Image feature vectors are vectors that represent some features of the image in the form of vectors. In this solution, the image feature vector output by the residual neural network model can be used to represent the category of the image. For example, if a picture of a mobile phone is input into the residual neural network model, the residual neural network model predicts that the feature of the picture is "category: mobile phone picture" and outputs a feature vector of the picture with the characteristics of "mobile phone".
本实施例的技术方案,将候选广告创意数据中的广告图片输入预先训练出的残差神经网络模型;获取残差神经网络模型输出的图片特征向量。本实施例的方案可以根据业务需求和样本数据量的大小灵活、准确的预测出样本图片所在的类别,将样本物品的产品词作为分类标签,提高了创意选择模型的推荐概率值的准确度。The technical solution of this embodiment is to input the advertisement pictures in the candidate advertisement creative data into the pre-trained residual neural network model; and obtain the picture feature vector output by the residual neural network model. The solution of this embodiment can flexibly and accurately predict the category of the sample picture based on business needs and the size of the sample data, and use the product words of the sample items as classification labels, improving the accuracy of the recommendation probability value of the creative selection model.
图4为本申请实施例提供的一种获取候选广告创意数据方法的流程图,本实施例以上述实施例为基础对获取候选广告创意数据方法进行说明。如图4所示,本实施例的方法包括如下步骤:Figure 4 is a flow chart of a method for obtaining candidate advertisement creative data provided by an embodiment of the present application. This embodiment explains the method of obtaining candidate advertisement creative data based on the above embodiment. As shown in Figure 4, the method of this embodiment includes the following steps:
S310,获取目标物品对应的多个广告素材数据,其中,广告素材数据包括广告文案和广告图片。S310: Obtain multiple advertising material data corresponding to the target item, where the advertising material data includes advertising copy and advertising images.
在一些物品网站、手机APP中展示的物品页面中,可以通过物品详情页获取到物品对应的广告素材数据(广告文案和广告图片)。通过对详情页中的内容进行文本识别和文字提取等,得到广告文案;通过对详情页中的内容进行图像识别和图像剪裁等,得到广告图片。或者,可以通过广告公司等提供的广告创意素材中,获取到物品对应的广告素材数据。In the item pages displayed on some item websites and mobile apps, the advertising material data (advertising copy and advertising pictures) corresponding to the item can be obtained through the item details page. By performing text recognition and text extraction on the content in the details page, the advertising copy is obtained; by performing image recognition and image cropping on the content in the details page, the advertising image is obtained. Alternatively, the advertising material data corresponding to the item can be obtained from the advertising creative materials provided by advertising companies.
S320,根据多个广告素材数据分别对应的线上点击数据,从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片。S320: Select at least one advertisement copy and at least one advertisement image from the plurality of advertisement creative data according to the online click data corresponding to the plurality of advertisement creative data.
线上点击数据用于表示广告素材数据的点击量,点击量可以反映出用户对 于广告素材的喜爱程度。一个广告素材的点击量越多,则表明该广告素材可能被更多人喜欢。因此,可以根据多个广告素材数据分别对应的线上点击数据,从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片。本方案实施例中,从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片包括如下步骤C1-步骤C2:Online click data is used to represent the click volume of creative data. The click volume can reflect the user's How much you like the creative. The more clicks a creative gets, the more people are likely to like it. Therefore, at least one advertisement copy and at least one advertisement image can be selected from the plurality of advertisement creative data based on the online click data respectively corresponding to the plurality of advertisement creative data. In this embodiment of the solution, selecting at least one advertising copy and at least one advertising image from multiple advertising creative data includes the following steps C1 to C2:
步骤C1:对于每个广告素材数据,根据广告素材数据的线上点击量均值和广告素材数据被选择的累计次数,确定广告素材数据的得分。Step C1: For each creative data, determine the score of the creative data based on the average number of online clicks of the creative data and the cumulative number of times the creative data is selected.
通过目标物品的详情页被浏览、点击的情况可以确定出广告素材数据的线上点击量。可以利用多臂老虎机(Multi-Armed Bandits,MAB)模型,采用置信度上界算法(Upper Confidence Bound,UCB),离线统计近一个月的广告素材数据的得分为:
The number of online clicks on the creative data can be determined by viewing and clicking on the target item's details page. You can use the Multi-Armed Bandits (MAB) model and use the Upper Confidence Bound algorithm (Upper Confidence Bound, UCB) to calculate the offline statistics of the advertising material data in the past month. The score is:
表示广告素材数据的线上点击量均值,nj是当前广告素材数据累积被选择的次数,n表示广告素材数据的数量。线上点击量均值和广告素材数据被选择的累计次数越大,广告素材数据的得分就越高。线上点击量均值和广告素材数据被选择的累计次数越小,广告素材数据的得分就越低。 Represents the average number of online clicks on the creative data, n j is the cumulative number of times the current creative data has been selected, and n represents the number of creative data. The greater the average number of online clicks and the cumulative number of times the creative data is selected, the higher the score of the creative data. The smaller the average number of online clicks and the cumulative number of times the creative data is selected, the lower the score of the creative data.
根据上述公式,分别计算出广告素材中广告文案和广告图片的得分。从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片后,由选取出的广告文案构成文案分组,计算出文案分组中每个广告文案的得分。在计算广告文案的得分时,为文案分组中广告文案的线上点击量均值,nj是当前广告文案累积被选择的次数,n表示文案分组中广告文案的数量。由选取出的广告图片构成图片分组,计算出图片分组中每个广告图片的得分。在计算广告图片的得分时,为图片分组中广告图片的线上点击量均值,nj是当前广告图片累积被选择的次数,n表示图片分组中广告图片的数量。According to the above formula, the scores of the advertising copy and advertising image in the creative are calculated respectively. After selecting at least one advertising copy and at least one advertising image from multiple creative data, the selected advertising copy forms a copy group, and the score of each advertising copy in the copy group is calculated. When calculating the score for ad copy, is the average number of online clicks for the advertising copy in the copywriting group, n j is the cumulative number of times the current advertising copy has been selected, and n represents the number of advertising copywriting in the copywriting group. The selected advertising images form an image group, and the score of each advertising image in the image group is calculated. When calculating the score for an ad image, is the average number of online clicks on the advertising images in the image group, n j is the cumulative number of times the current advertising image has been selected, and n represents the number of advertising images in the image group.
步骤C2:根据每个广告素材数据的得分,从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片。Step C2: Select at least one advertising copy and at least one advertising image from multiple creative data based on the score of each creative data.
在得到广告图片的得分和广告文案的得分后,根据广告图片的得分从广告素材数据中选取出至少一个广告图片。根据广告文案的得分从广告素材数据中选取出至少一个广告文案。After obtaining the score of the advertising image and the score of the advertising copy, at least one advertising image is selected from the advertising creative data based on the score of the advertising image. Select at least one ad copy from the creative data based on the ad copy's score.
利用上述步骤,可以准确的计算出广告素材数据的得分,结合广告素材数据的得分状况对广告素材数据进行选择,可以准确快速的选择出合适的广告素材数据,并且避免发生广告素材数据相互组合过程中带来的组合爆炸问题。 Using the above steps, the score of the creative data can be accurately calculated, and the creative data can be selected based on the score of the creative data. The appropriate creative data can be accurately and quickly selected, and the process of combining the creative data can be avoided. The combinatorial explosion problem caused by
S330,对选取的广告文案和广告图片进行组合,得到至少一个候选广告创意数据。S330: Combine the selected advertising copy and advertising image to obtain at least one candidate advertising creative data.
选取出至少一个广告文案和至少一个广告图片后,对选取出的的广告文案和广告图片进行两两组合,得到至少一个候选广告创意数据。例如,根据每个广告素材数据的得分,从多个广告素材数据中选取出的广告文案为文案A和文案B。从多个广告素材数据中选取出的广告图片为图片C。则根据广告文案和广告图片可以得到候选广告创意数据为AC和BC。After selecting at least one advertising copy and at least one advertising picture, the selected advertising copy and advertising pictures are combined in pairs to obtain at least one candidate advertising creative data. For example, based on the score of each creative data, the advertising copy selected from multiple creative data is copy A and copy B. The advertising image selected from multiple creative data is image C. Then the candidate advertising creative data can be obtained as AC and BC based on the advertising copy and advertising pictures.
本实施例的技术方案,获取目标物品对应的多个广告素材数据,其中广告素材数据包括广告文案和广告图片,根据多个广告素材数据分别对应的线上点击数据,从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片,对选取的广告文案和广告图片进行组合,得到至少一个候选广告创意数据。本实施例的方案,可以准确的计算出广告素材数据的得分,结合广告素材数据的得分状况对广告素材数据进行选择,可以准确快速的选择出合适的广告素材数据,并且避免发生广告素材数据相互组合过程中带来的组合爆炸问题。并且根据选取的广告文案和广告图片进行组合得到的候选广告创意数据更准确,更加符合用户的心意。The technical solution of this embodiment is to obtain multiple advertising material data corresponding to the target item, where the advertising material data includes advertising copy and advertising pictures, and according to the online click data corresponding to the multiple advertising material data, from the multiple advertising material data Select at least one advertising copy and at least one advertising picture, and combine the selected advertising copy and advertising pictures to obtain at least one candidate advertising creative data. The solution of this embodiment can accurately calculate the score of the advertising material data, select the advertising material data based on the score status of the advertising material data, accurately and quickly select the appropriate advertising material data, and avoid the occurrence of mutual interaction between the advertising material data. The combinatorial explosion problem caused by the combination process. Moreover, the candidate advertising creative data obtained by combining the selected advertising copy and advertising images is more accurate and more in line with the user's wishes.
图5为本申请实施例提供的另一种获取候选广告创意数据方法的流程图,本实施例以上述实施例为基础对获取候选广告创意数据方法进行说明。如图5所示,本实施例的方法包括如下步骤:Figure 5 is a flow chart of another method for obtaining candidate advertisement creative data provided by an embodiment of the present application. This embodiment explains the method of obtaining candidate advertisement creative data based on the above embodiment. As shown in Figure 5, the method of this embodiment includes the following steps:
S410,从目标物品的物品详情页和/或广告创意素材中,识别并提取出广告文案。S410: Identify and extract advertising copy from the item details page and/or advertising creative materials of the target item.
在一些物品网站、手机APP中展示的物品页面中,可以通过物品详情页获取到物品对应的广告素材数据。可以从广告素材数据中提取出广告文案。本方案实施例中,从目标物品的物品详情页和/或广告创意素材中,识别并提取出广告文案包括如下步骤D1-步骤D3:In the item pages displayed on some item websites and mobile apps, the advertising material data corresponding to the item can be obtained through the item details page. Ad copy can be extracted from creative data. In this embodiment of the solution, identifying and extracting advertising copy from the item details page and/or advertising creative materials of the target item includes the following steps D1 to D3:
步骤D1:基于预设字符识别模型,从所述目标物品的物品详情页和/或广告创意素材中,识别出待选文案。Step D1: Based on the preset character recognition model, identify the candidate copy from the item details page and/or advertising creative materials of the target item.
字符识别模型包括光学字符识别(Optical Character Recognition,OCR)模型。在目标物品的物品详情页和/或广告创意素材中,可以通过OCR模型对物品详情页中和/或广告创意素材中的文案进行识别提取,得到广告文案,并将识别出的广告文案作为待选文案。Character recognition models include optical character recognition (Optical Character Recognition, OCR) models. In the item details page and/or advertising creative materials of the target item, the copywriting in the item details page and/or advertising creative materials can be identified and extracted through the OCR model to obtain the advertising copy, and the identified advertising copy can be used as the target item. Choose copy.
步骤D2:基于包含预设利益点词汇的第一词表,从待选文案中筛选出利益 点文案。Step D2: Based on the first word list containing preset interest point words, filter out the benefits from the candidate copy. Click on copywriting.
利益点词汇是广告文案中向用户表达物品特征、物品利益/优势、消费者利益、情感/价值观等的词汇。预设利益点词汇表是根据需求和实际环境设定的记录目标物品利益点词汇的表格。例如目标物品为照相机,则在利益点词汇的第一词表中包含有:Benefit point words are words used in advertising copy to express item characteristics, item benefits/advantages, consumer interests, emotions/values, etc. to users. The default interest point vocabulary is a table set according to needs and actual environment to record the interest point vocabulary of target items. For example, if the target item is a camera, the first word list of the interest point vocabulary contains:
物品特征:体积小、像素高;物品利益/优势:简单拍摄出清晰、美观的照片;消费者利益:便于携带、易于操作;情感/价值观:记录生活,展现最真实的世界。Item characteristics: small size, high pixels; Item benefits/advantages: Easily take clear and beautiful photos; Consumer benefits: Easy to carry and easy to operate; Emotions/values: Record life and show the most real world.
在得到待选文案后,根据第一词表,从待选文案中筛选出利益点词汇。After obtaining the candidate copy, the interest point vocabulary is selected from the candidate copy according to the first word list.
步骤D3:基于预设字数限制条件和/或包含预设非卖点词汇的第二词表,从待选文案中除去利益点文案后剩余的文案中,筛选出卖点文案。Step D3: Based on the preset word limit conditions and/or the second word list containing preset non-selling point words, screen out the selling point copy from the remaining copy after excluding the benefit point copy from the candidate copy.
卖点文案是可以提升用户购买兴趣、促进产品销售的有价值的文案。卖点文案可以用简单的语言描述出商品的卖点,因此卖点文案有一定的字数限制。可以基于预设字数限制条件从待选文案中除去利益点文案后剩余的文案中,筛选出卖点文案。但在一些情况下,满足字数限制条件的广告文案也不一定就是卖点文案。在这种情况下,可以基于包含预设非卖点词汇的第二词表,从待选文案中除去利益点文案后剩余的文案中,筛选出卖点文案。为了更加精准的筛选出卖点文案,可以同时基于预设字数限制条件和第二词表,从待选文案中除去利益点文案后剩余的文案中,筛选出卖点文案。例如,预设字数限制条件是5,则首先从待选文案中除去利益点文案后剩余的文案中删除掉超过5字的广告文案,再根据第二词表对剩下的广告文案进行筛选,删除掉包含非卖点词汇的广告文案,最终得到卖点文案。Selling point copywriting is valuable copywriting that can increase users' purchasing interest and promote product sales. Selling point copy can describe the selling points of the product in simple language, so there is a certain word limit for selling point copy. Selling point copy can be screened out from the remaining copy after removing benefit point copy from the candidate copy based on the preset word count limit. However, in some cases, advertising copy that meets the word limit is not necessarily selling point copy. In this case, the selling point copy can be screened out from the remaining copy after excluding the benefit point copy from the candidate copy based on a second word list containing preset non-selling point words. In order to more accurately screen the selling point copywriting, you can filter the selling point copywriting from the remaining copywriting after excluding the benefit point copywriting from the candidate copywriting based on the preset word limit conditions and the second word list at the same time. For example, if the default word limit is 5, then first delete the advertising copy with more than 5 words from the remaining copy after excluding the benefit point copy from the candidate copy, and then filter the remaining advertising copy according to the second word list. Delete the ad copy that contains non-selling point words and finally get the selling point copy.
上述步骤中,可以利用OCR模型,准确、快速的对物品详情页中的广告文案进行挖掘和识别,通过第一词表和第二词表得到最终的利益点文案和卖点文案,可以用于解决线上文案素材不足的问题。In the above steps, the OCR model can be used to accurately and quickly mine and identify the advertising copy in the item details page, and obtain the final interest point copy and selling point copy through the first word list and the second word list, which can be used to solve the problem. The problem of insufficient online copywriting materials.
S420,对目标物品的物品详情页中的物品图片进行定位和裁剪,得到广告图片。S420: Position and crop the item image in the item details page of the target item to obtain an advertising image.
对物品详情页中的物品图片进行定位表示,在物品详情页中准确找出物品图片所在的位置。在物品详情页中,目标物品的图片的位置是不确定的,可以利用显著性算法将物品详情页分成多个特定的、具有独特性质的区域(如文字区和图片区)。在分割后的图片区域中识别出详情页的物品图片,再对物品图片的大小进行分析,对物品图片进行智能剪裁。例如,当物品图片中的物品主体过小,用户无法清楚的从物品图片中看到物品主体时,可以对物品图片进行 剪裁,得到适当大小的物品图片。将剪裁后的物品图片作为广告图片。Position and represent the item picture in the item details page, and accurately find the location of the item picture in the item details page. In the item details page, the position of the image of the target item is uncertain, and the saliency algorithm can be used to divide the item details page into multiple specific areas with unique properties (such as text areas and picture areas). Identify the item pictures on the detail page in the segmented picture area, analyze the size of the item pictures, and intelligently crop the item pictures. For example, when the main body of the item in the item picture is too small and the user cannot clearly see the main body of the item from the item picture, the item picture can be modified. Crop to get an appropriately sized item image. Use the cropped item image as an advertising image.
S430,根据多个广告素材数据分别对应的线上点击数据,从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片。S430: Select at least one advertisement copy and at least one advertisement image from the plurality of advertisement creative data according to the online click data respectively corresponding to the plurality of advertisement creative data.
S440,对选取的广告文案和广告图片进行组合,得到至少一个候选广告创意数据。S440: Combine the selected advertising copy and advertising image to obtain at least one candidate advertising creative data.
本实施例的技术方案,通过从目标物品的物品详情页和/或广告创意素材中,识别并提取出广告文案;对目标物品的物品详情页中的物品图片进行定位和裁剪,得到广告图片;根据多个广告素材数据分别对应的线上点击数据,从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片;对选取的广告文案和广告图片进行组合,得到至少一个候选广告创意数据。本实施例的技术方案,可以准确、快速的对物品详情页中的广告文案进行挖掘和识别,通过卖点文案解决线上文案素材不足的问题。通过对广告图片进行智能剪裁,得到可以突出目标物品的目标物品图片,使得广告图片可以充分展示出目标物品。The technical solution of this embodiment is to identify and extract advertising copy from the item details page of the target item and/or advertising creative materials; position and crop the item pictures in the item details page of the target item to obtain the advertisement picture; According to the online click data corresponding to the multiple creative data, at least one advertising copy and at least one advertising image are selected from the multiple advertising creative data; the selected advertising copy and advertising image are combined to obtain at least one candidate advertising creative data. The technical solution of this embodiment can accurately and quickly mine and identify advertising copy in item detail pages, and solve the problem of insufficient online copy materials through selling point copy. Through intelligent cropping of advertising images, target item images that can highlight the target items are obtained, so that the advertising images can fully display the target items.
图6为本申请实施例提供的一种对候选广告创意数据进行优化处理的流程图,本实施例以上述实施例为基础对候选广告创意数据进行优化处理的方法进行说明。如图6所示,本实施例的方法包括如下步骤:Figure 6 is a flow chart for optimizing candidate advertising creative data provided by an embodiment of the present application. This embodiment describes a method for optimizing candidate advertising creative data based on the above embodiment. As shown in Figure 6, the method in this embodiment includes the following steps:
S510,对选取的广告文案和广告图片进行组合,得到至少一个文案图片组合。S510: Combine the selected advertising copy and advertising images to obtain at least one copy and image combination.
从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片后,对选取出的广告文案和广告图片进行两两组合,得到至少一个文案图片组合。例如,根据每个广告素材数据的得分,从多个广告素材数据中选取出的广告文案为文案A和文案B。从多个广告素材数据中选取出的广告图片为图片C。则根据广告文案和广告图片可以得到文案图片组合为AC和BC。After selecting at least one advertising copy and at least one advertising image from multiple creative data, the selected advertising copy and advertising images are combined in pairs to obtain at least one copy and image combination. For example, based on the score of each creative data, the advertising copy selected from multiple creative data is copy A and copy B. The advertising image selected from multiple creative data is image C. Then according to the advertising copy and advertising image, the copy image combination can be obtained as AC and BC.
S520,对至少一个文案图片组合与至少一个预设背景模板进行组合,得到至少一个创意组合。S520: Combine at least one copywriting picture combination and at least one preset background template to obtain at least one creative combination.
预设背景模板是根据物品特性和需求等提前设置的,具有固定风格的文案图片组合的排版模板。在得到文案图片组合后,可以根据物品的类别信息和物品的特征等,为文案图片组合选择至少一个背景模板,将文案图片组合与至少一个预设背景模板进行两两组合,得到至少一个创意组合。The default background template is a layout template with a fixed style of copywriting and picture combination that is set in advance according to the characteristics and needs of the item. After obtaining the copywriting picture combination, you can select at least one background template for the copywriting picture combination based on the category information of the items and the characteristics of the items, etc., and combine the copywriting picture combination with at least one preset background template in pairs to obtain at least one creative combination. .
S530,基于预设筛选因素,从至少一个创意组合中筛选出至少一个创意组合作为候选广告创意数据。 S530: Screen out at least one creative combination as candidate advertising creative data from at least one creative combination based on preset filtering factors.
预设筛选因素包括目标物品的类目信息、和/或每个创意组合中广告图片和背景模板的颜色信息。要得到更加适合目标物品的广告创意数据,需要根据物品的类目信息、和/或每个创意组合中广告图片和背景模板的颜色信息筛选出候选广告创意数据,该颜色信息可以包括主颜色。例如,可以利用K-Means聚类算法对图片颜色进行聚类分析和主要颜色提取,识别出图片的主颜色。The preset filtering factors include category information of the target items, and/or color information of the advertising images and background templates in each creative combination. To obtain advertising creative data that is more suitable for the target item, candidate advertising creative data needs to be filtered out based on the category information of the item and/or the color information of the advertising image and background template in each creative combination. The color information may include the main color. For example, the K-Means clustering algorithm can be used to perform cluster analysis and main color extraction on the color of the picture, and identify the main color of the picture.
在基于目标物品的类目信息从至少一个创意组合中筛选出至少一个创意组合作为候选广告创意数据时,可以是根据目标物品的类目信息以及预先设置的物品类目与背景模板风格的对应关系,确定目标物品对应的背景模板风格,然后从至少一个创意组合中选取出与该背景模板风格匹配的创意组合。When selecting at least one creative combination from at least one creative combination as candidate advertising creative data based on the category information of the target item, it may be based on the category information of the target item and the preset corresponding relationship between the item category and the background template style. , determine the background template style corresponding to the target item, and then select a creative combination that matches the background template style from at least one creative combination.
在基于每个创意组合中广告图片和背景模板的颜色信息从至少一个创意组合中筛选出至少一个创意组合作为候选广告创意数据时,可以是根据广告图片的主颜色和背景模板的主颜色,利用色相-饱和度-明度(Hue-Saturation-Value,HSV)颜色模型,按照优先使用邻近色搭配、对比色搭配的方式,从至少一个创意组合中筛选出至少一个创意组合作为候选广告创意数据。When selecting at least one creative combination as candidate advertising creative data from at least one creative combination based on the color information of the advertising image and background template in each creative combination, it may be based on the main color of the advertising image and the main color of the background template, using The Hue-Saturation-Value (HSV) color model selects at least one creative combination from at least one creative combination as candidate advertising creative data by giving priority to adjacent color matching and contrasting color matching.
S540,在候选广告创意数据包含的广告图片中的目标物品区域的尺寸小于预设阈值的情况下,将目标物品区域进行裁剪,并使用裁剪得到的目标物品图片更新候选广告创意数据包含的广告图片。S540: When the size of the target item area in the advertising image contained in the candidate advertising creative data is smaller than the preset threshold, crop the target item area, and use the cropped target item image to update the advertising image contained in the candidate advertising creative data. .
预设阈值可以根据具体需求等提前设定。目标物品的广告图片中目标物品区域的尺寸小于预设阈值时,可能存在图片中目标物品过小导致用户无法直观、清楚的看到物品。可以利用目标检测算法识别出广告图片中的标物品区域,对广告图片进行智能剪裁,得到可以突出目标物品的目标物品图片,并基于目标物品图片更新候选广告创意数据包含的广告图片。The preset threshold can be set in advance according to specific needs. When the size of the target item area in the advertisement image of the target item is smaller than the preset threshold, the target item in the image may be too small and the user cannot see the item intuitively and clearly. The target detection algorithm can be used to identify the target item area in the advertising image, and the advertising image can be intelligently cropped to obtain a target item image that can highlight the target item, and the advertising image contained in the candidate advertising creative data can be updated based on the target item image.
S550,根据候选广告创意数据中广告图片的颜色信息,对广告图片中的目标物品区域进行润色处理。S550: Polish the target item area in the advertisement image according to the color information of the advertisement image in the candidate advertisement creative data.
润色处理包括调整亮度、调整对比度、以及调整饱和度中的至少一项。在得到更新后的候选广告创意数据包含的广告图片后,基于广告图片的颜色和目标物品区域的颜色,对广告图片进行图像分析。当目标物品区域的颜色过暗时,可能会导致目标物品区域不够醒目的情况发生,则可以将目标物品区域的颜色亮度调亮,以突出目标物品,吸引用户点击或触发目标物品。同理,当目标物品区域与广告图片的色彩对比度较弱时,可能会导致目标物品区域与广告图片的背景融为一体,则可以通过调整对比度以突出目标物品。当当目标物品区域与广告图片的色彩饱和度较弱时,可以通过调整饱和度以突出目标物品。本方案实施例中,根据候选广告创意数据中广告图片的颜色信息,对广告图片中的目标物品区域进行润色处理包括如下步骤E1-步骤E2: The polishing process includes at least one of adjusting brightness, adjusting contrast, and adjusting saturation. After obtaining the advertising image contained in the updated candidate advertising creative data, image analysis is performed on the advertising image based on the color of the advertising image and the color of the target item area. When the color of the target item area is too dark, which may result in the target item area not being eye-catching enough, the color brightness of the target item area can be brightened to highlight the target item and attract users to click or trigger the target item. Similarly, when the color contrast between the target item area and the advertising image is weak, which may cause the target item area to blend into the background of the advertising image, the contrast can be adjusted to highlight the target item. When the color saturation of the target item area and the advertising image is weak, you can adjust the saturation to highlight the target item. In this embodiment of the solution, based on the color information of the advertising image in the candidate advertising creative data, polishing the target item area in the advertising image includes the following steps E1 - step E2:
步骤E1:根据候选广告创意数据中广告图片包含的像素点的像素值,确定广告图片是彩色图片或黑色图片。Step E1: Determine whether the advertising image is a color image or a black image based on the pixel values of the pixels contained in the advertising image in the candidate advertising creative data.
图片中的像素点信息可以反映出图像的色彩信息。对广告图片中的像素点的像素值进行统计,当所有像素点中存在像素值超过190(像素值取值范围为0-255)的像素点,并且像素值超过190的像素点的数量超过总像素点的50%的时候,确定广告图片为白色图片。当像素值超过190的像素点的数量不超过总像素点的15%,并且,像素值不超过55的像素点的数量超过总像素点的50%时,确定广告图片为黑色图片。其他情况确定广告图片为彩色图片。当确定广告图片为白色图片时,不对广告图片进行润色处理。The pixel information in the picture can reflect the color information of the image. Calculate the pixel values of pixels in advertising images. When there are pixels with a pixel value exceeding 190 (pixel value range is 0-255) among all pixels, and the number of pixels with a pixel value exceeding 190 exceeds the total At 50% of the pixels, the advertising image is determined to be a white image. When the number of pixels with a pixel value exceeding 190 does not exceed 15% of the total pixels, and the number of pixels with a pixel value not exceeding 55 exceeds 50% of the total pixels, the advertising image is determined to be a black image. In other cases, it is determined that the advertising image is a color image. When it is determined that the advertising image is a white image, the advertising image will not be retouched.
步骤E2:在广告图片是彩色图片的情况下,基于第一预设亮度参数值、第一预设对比度参数值和第一预设饱和度参数值,对广告图片中的目标物品区域进行润色处理。Step E2: When the advertising image is a color image, polish the target item area in the advertising image based on the first preset brightness parameter value, the first preset contrast parameter value and the first preset saturation parameter value. .
第一预设亮度参数值、第一预设对比度参数值和第一预设饱和度参数值可以根据需求提前设定。例如,当广告图片是彩色图片时,可以将第一预设亮度参数值设置为15,将第一预设对比度参数值设置为10,将和第一预设饱和度参数值设置为10。根据上述的第一预设亮度参数值、第一预设对比度参数值和第一预设饱和度参数值,对广告图片中的目标物品区域进行润色处理。The first preset brightness parameter value, the first preset contrast parameter value and the first preset saturation parameter value can be set in advance according to needs. For example, when the advertising picture is a color picture, the first preset brightness parameter value can be set to 15, the first preset contrast parameter value can be set to 10, and the first preset saturation parameter value can be set to 10. According to the above-mentioned first preset brightness parameter value, first preset contrast parameter value and first preset saturation parameter value, the target item area in the advertising image is retouched.
步骤E3:在广告图片是黑色图片的情况下,基于第二预设亮度参数值、第二预设对比度参数值和第二预设饱和度参数值,对广告图片中的目标物品区域进行润色处理。Step E3: When the advertising image is a black image, polish the target item area in the advertising image based on the second preset brightness parameter value, the second preset contrast parameter value and the second preset saturation parameter value. .
第二预设亮度参数值大于第一预设亮度参数值,第二预设对比度参数值大于第一预设对比度参数值,第二预设饱和度参数值大于第一预设饱和度参数值。第二预设亮度参数值、第二预设对比度参数值和第二预设饱和度参数值可以根据需求提前设定。当广告图片是黑色图片时,由于黑色图片对于图像润色处理的效果不明显,因此,可以对黑色图片采用强润色处理。例如,可以将第二预设亮度参数值设置为20,将第二预设对比度参数值设置为15,将和第二预设饱和度参数值设置为15。根据上述的第二预设亮度参数值、第二预设对比度参数值和第二预设饱和度参数值,对广告图片中的目标物品区域进行润色处理。The second preset brightness parameter value is greater than the first preset brightness parameter value, the second preset contrast parameter value is greater than the first preset contrast parameter value, and the second preset saturation parameter value is greater than the first preset saturation parameter value. The second preset brightness parameter value, the second preset contrast parameter value and the second preset saturation parameter value can be set in advance according to needs. When the advertising image is a black image, since the black image has no obvious effect on image retouching, strong retouching can be used on the black image. For example, the second preset brightness parameter value may be set to 20, the second preset contrast parameter value may be set to 15, and the second preset saturation parameter value may be set to 15. According to the above-mentioned second preset brightness parameter value, second preset contrast parameter value and second preset saturation parameter value, the target item area in the advertising image is retouched.
通过上述步骤,可以根据广告图片的颜色对广告图片进行润色处理,使得广告图片中的广告物品更加吸引人,充分发挥了广告图片对于物品的展示作用。Through the above steps, the advertising image can be polished according to the color of the advertising image, making the advertising items in the advertising image more attractive and giving full play to the display function of the advertising image for the items.
图7为本申请实施例提供的一种广告创意数据选取和优化过程的流程示意图。如图7所示,从物品详情页和/或广告素材库中,通过素材挖掘模块中的OCR模型,识别并提取出目标物品的广告文案。通过素材挖掘模块中的图像分割模 型和目标检测模型识别出目标物品的广告图片。通过MAB分别计算出广告文案和广告图片的得分情况。在根据得分情况从多个广告素材数据中选取出至少一个广告文案和至少一个广告图片。通过创意元素组合模块得到候选广告创意数据,将候选广告创意数据输入进创意选择模块,利用创意选择模块中的创意选择模型的输出值得到目标广告创意数据。Figure 7 is a schematic flow chart of an advertising creative data selection and optimization process provided by an embodiment of the present application. As shown in Figure 7, from the item details page and/or the advertising material library, the advertising copy of the target item is identified and extracted through the OCR model in the material mining module. Through the image segmentation model in the material mining module The model and target detection model identify advertising images of target items. Calculate the scores of advertising copy and advertising images through MAB. At least one advertising copy and at least one advertising image are selected from multiple creative data based on the scores. The candidate advertising creative data is obtained through the creative element combination module, the candidate advertising creative data is input into the creative selection module, and the target advertising creative data is obtained using the output value of the creative selection model in the creative selection module.
本实施例的方案,通过对选取的广告文案和广告图片进行组合,得到至少一个文案图片组合;对至少一个文案图片组合与至少一个预设背景模板进行组合,得到至少一个创意组合;基于预设筛选因素,从至少一个创意组合中筛选出至少一个创意组合作为候选广告创意数据;在候选广告创意数据包含的广告图片中的目标物品区域的尺寸小于预设阈值的情况下,将目标物品区域进行裁剪,并使用裁剪得到的目标物品图片更新候选广告创意数据包含的广告图片;根据候选广告创意数据中广告图片的颜色信息,对广告图片中的目标物品区域进行润色处理。本实施例的方案,可以根据目标物品的类别信息、色彩信息等对创意组合进项筛选,得到更加美观,颜色更加接近人工设计的候选广告创意数据。最后根据广告图片的颜色对广告图片进行润色处理,使得广告图片中的广告物品更加吸引人,充分发挥了广告图片对于物品的展示作用。The solution of this embodiment is to obtain at least one copywriting and picture combination by combining the selected advertising copy and advertising pictures; to obtain at least one creative combination by combining at least one copywriting and picture combination with at least one preset background template; based on the preset Screening factors, selecting at least one creative combination from at least one creative combination as candidate advertising creative data; when the size of the target item area in the advertising image contained in the candidate advertising creative data is smaller than a preset threshold, the target item area is Crop, and use the cropped target item image to update the advertising image contained in the candidate advertising creative data; and polish the target item area in the advertising image according to the color information of the advertising image in the candidate advertising creative data. The solution of this embodiment can filter creative combinations according to the category information, color information, etc. of the target item, and obtain candidate advertising creative data that is more beautiful and whose color is closer to manual design. Finally, the advertising images are polished according to the color of the advertising images, making the advertising items in the advertising images more attractive and giving full play to the display role of the advertising images in displaying the items.
图8为本申请实施例提供的一种模型训练方法的流程图,本实施例能够对初始模型进行训练得到创意选择模型,该方法可以由本申请实施例中的模型训练装置来执行,该装置可采用软件和/或硬件的方式实现,如图8所示,该方法包括如下步骤:Figure 8 is a flow chart of a model training method provided by an embodiment of the present application. This embodiment can train an initial model to obtain a creative selection model. This method can be executed by the model training device in the embodiment of the present application. The device can It is implemented using software and/or hardware, as shown in Figure 8. The method includes the following steps:
S610,获取训练样本数据。S610, obtain training sample data.
训练样本数据包括样本物品对应的样本广告创意数据和样本广告创意数据对应的标准推荐概率值,样本广告创意数据包含广告图片和广告文案。在一种实施方式中,可以从存储着样本广告创意数据和样本广告创意数据对应的标准推荐概率值的历史素材数据库中获取到样本数据。例如,根据大数据以及数据分析算法可以确定出一些广告创意数据和广告创意数据对应的标准推荐概率值,将这些广告创意数据和其对应的标准概率推荐值存储在历史素材数据库。可以从历史素材数据库中获取到样本数据。The training sample data includes sample advertising creative data corresponding to the sample items and standard recommendation probability values corresponding to the sample advertising creative data. The sample advertising creative data includes advertising pictures and advertising copy. In one implementation, the sample data can be obtained from a historical material database that stores sample advertising creative data and standard recommendation probability values corresponding to the sample advertising creative data. For example, based on big data and data analysis algorithms, some advertising creative data and standard recommendation probability values corresponding to the advertising creative data can be determined, and these advertising creative data and their corresponding standard probability recommendation values are stored in the historical material database. Sample data can be obtained from the historical material database.
S620,获取样本广告创意数据对应的稀疏特征向量和图片特征向量,并基于稀疏特征向量、图片特征向量以及待训练的创意选择模型,获得样本广告创意数据对应的预测推荐概率值。S620: Obtain the sparse feature vector and picture feature vector corresponding to the sample advertising creative data, and obtain the predicted recommendation probability value corresponding to the sample advertising creative data based on the sparse feature vector, picture feature vector and the creative selection model to be trained.
稀疏特征向量是用于反映多个类型的稀疏特征的向量。图片特征向量是用 于反映广告图片的图片特征的向量,图片特征可以通过样本广告创意数据中的广告图片得到。预测推荐概率值是未训练完成的创意选择模型根据稀疏特征向量和图片特征向量,经过计算后输出的稀疏特征向量和图片特征向量对应的推荐概率值。将稀疏特征向量和图片特征向量输入进待训练的创意选择模型,待训练的创意选择模型经过模型计算可以输出样本广告创意数据对应的预测推荐概率值。A sparse feature vector is a vector used to reflect multiple types of sparse features. The image feature vector is used is a vector that reflects the image features of the advertising image. The image features can be obtained from the advertising images in the sample advertising creative data. The predicted recommendation probability value is the recommendation probability value corresponding to the sparse feature vector and the picture feature vector output by the untrained creative selection model based on the sparse feature vector and the picture feature vector. Input sparse feature vectors and picture feature vectors into the creative selection model to be trained. The creative selection model to be trained can output the predicted recommendation probability value corresponding to the sample advertising creative data through model calculation.
S630,根据标准推荐概率值和预测推荐概率值确定损失函数,基于损失函数对创意选择模型中的网络参数进行调整,并在满足预设迭代停止条件时停止训练。S630: Determine the loss function based on the standard recommendation probability value and the predicted recommendation probability value, adjust the network parameters in the creative selection model based on the loss function, and stop training when the preset iteration stop conditions are met.
损失函数是将随机事件或其有关随机变量的取值映射为非负实数以表示该随机事件的“风险”或“损失”的函数。模型中的网络参数是模型内部的配置变量,模型参数的值可以根据损失函数进行调整。创意选择模型用于:采用自注意力机制,对稀疏特征向量和图片特征向量进行融合,并基于融合结果输出预测推荐概率值。本方案中,创意选择模型包括多层感知机神经网络MLP模块、自注意力模块和输出模块。其中,MLP模块用于基于稀疏特征向量输出第一特征向量;自注意力模块用于基于稀疏特征向量和图片特征向量输出第二特征向量;输出模块用于基于第一特征向量和第二特征向量输出预测推荐概率值。The loss function is a function that maps a random event or the value of its related random variable into a non-negative real number to represent the "risk" or "loss" of the random event. The network parameters in the model are configuration variables inside the model, and the values of the model parameters can be adjusted according to the loss function. The creative selection model is used to: use a self-attention mechanism to fuse sparse feature vectors and picture feature vectors, and output predicted recommendation probability values based on the fusion results. In this solution, the creative selection model includes a multi-layer perceptron neural network MLP module, a self-attention module and an output module. Among them, the MLP module is used to output the first feature vector based on the sparse feature vector; the self-attention module is used to output the second feature vector based on the sparse feature vector and the picture feature vector; the output module is used to output the first feature vector based on the first feature vector and the second feature vector. Output the predicted recommendation probability value.
将稀疏特征向量和图片特征向量输入进待训练的创意选择模型,得到对应的预测推荐概率值。此时的预测推荐概率值与标准推荐概率值之间存在很大的“差距”,根据损失函数和该“差距”不断对待训练的创意选择模型进行优化。调整创意选择模型的网络参数,使得预测推荐概率值与标准推荐概率值之间的“差距”不断缩小,预设迭代停止条件时,便可以得到训练完成后的创意选择模型。Input the sparse feature vector and picture feature vector into the creative selection model to be trained, and obtain the corresponding predicted recommendation probability value. There is a big "gap" between the predicted recommendation probability value and the standard recommendation probability value at this time. The creative selection model to be trained is continuously optimized based on the loss function and the "gap". Adjust the network parameters of the creative selection model so that the "gap" between the predicted recommendation probability value and the standard recommendation probability value is continuously narrowed. When the iteration stop conditions are preset, the creative selection model after training can be obtained.
本实施例的技术方案,可以获取训练样本数据,获取样本广告创意数据对应的稀疏特征向量和图片特征向量,并基于稀疏特征向量、图片特征向量以及待训练的创意选择模型,获得样本广告创意数据对应的预测推荐概率值。根据标准推荐概率值和预测推荐概率值确定损失函数,基于损失函数对创意选择模型中的网络参数进行调整,并在满足预设迭代停止条件时停止训练。本实施例的技术方案可以对创意选择模型不断优化,使得创意选择模型输出的预测推荐概率值更加接近标准推荐概率值,提高了预测推荐概率值的准确性。The technical solution of this embodiment can obtain training sample data, obtain the sparse feature vectors and picture feature vectors corresponding to the sample advertising creative data, and obtain the sample advertising creative data based on the sparse feature vectors, picture feature vectors and the creative selection model to be trained. The corresponding predicted recommendation probability value. Determine the loss function based on the standard recommendation probability value and the predicted recommendation probability value, adjust the network parameters in the creative selection model based on the loss function, and stop training when the preset iteration stop conditions are met. The technical solution of this embodiment can continuously optimize the creative selection model, making the predicted recommendation probability value output by the creative selection model closer to the standard recommendation probability value, and improving the accuracy of the predicted recommendation probability value.
本申请技术方案中对数据的获取、存储、使用、处理等均符合国家法律法规的相关规定。The acquisition, storage, use and processing of data in the technical solution of this application all comply with the relevant provisions of national laws and regulations.
图9为本申请实施例提供的一种广告创意数据选取装置的结构示意图。本 实施例能够实现自动精准的从候选广告创意数据中选取出最优的目标广告创意数据,该装置可采用软件和/或硬件的方式实现,该装置可集成在任何提供广告创意数据选取的功能的设备中,如图9所示,广告创意数据选取的装置包括:Figure 9 is a schematic structural diagram of an advertising creative data selection device provided by an embodiment of the present application. Book Embodiments can automatically and accurately select optimal target advertising creative data from candidate advertising creative data. The device can be implemented in the form of software and/or hardware. The device can be integrated in any device that provides the function of selecting advertising creative data. In the equipment, as shown in Figure 9, the devices for selecting advertising creative data include:
数据获取模块910,设置为获取目标物品对应的候选广告创意数据;其中,所述候选广告创意数据包含广告图片和广告文案;概率值得到模块920,设置为获取所述候选广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及预先训练的创意选择模型,获得所述候选广告创意数据对应的推荐概率值;数据选取模块930,设置为根据所述推荐概率值选取目标广告创意数据;其中,所述创意选择模型用于:采用自注意力机制基于所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述推荐概率值。The data acquisition module 910 is configured to acquire the candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising pictures and advertising copy; the probability value obtaining module 920 is configured to acquire the sparse advertising creative data corresponding to the candidate advertising creative data. feature vector and picture feature vector, and based on the sparse feature vector, the picture feature vector and the pre-trained creative selection model, obtain the recommendation probability value corresponding to the candidate advertising creative data; the data selection module 930 is set to The recommended probability value selects target advertising creative data; wherein the creative selection model is used to: use a self-attention mechanism to fuse based on the sparse feature vector and the picture feature vector, and output the recommended probability value based on the fusion result .
所述创意选择模型包括MLP模块、自注意力模块和输出模块;其中:The creative selection model includes an MLP module, a self-attention module and an output module; where:
所述MLP模块,用于基于所述稀疏特征向量输出第一特征向量;所述自注意力模块,用于基于所述稀疏特征向量和所述图片特征向量输出第二特征向量;所述输出模块,用于基于所述第一特征向量和所述第二特征向量输出所述推荐概率值。The MLP module is used to output a first feature vector based on the sparse feature vector; the self-attention module is used to output a second feature vector based on the sparse feature vector and the picture feature vector; the output module , used to output the recommendation probability value based on the first feature vector and the second feature vector.
一实施例中,概率值得到模块920设置为:In one embodiment, the probability value obtaining module 920 is configured as:
将所述候选广告创意数据中的广告图片输入预先训练出的残差神经网络模型;获取所述残差神经网络模型输出的图片特征向量。Input the advertising pictures in the candidate advertising creative data into the pre-trained residual neural network model; obtain the picture feature vector output by the residual neural network model.
一实施例中,概率值得到模块920还设置为:In one embodiment, the probability value obtaining module 920 is further configured to:
获取样本图片和所述样本图片对应的分类标签;其中,所述分类标签为所述样本图片中包含的样本物品的产品词,所述产品词是用于表征所述样本物品的种类并且不包含品牌信息的词汇;将所述样本图片输入所述残差神经网络模型,获取所述残差神经网络模型输出的预测分类;基于所述预测分类和所述分类标签确定损失函数,基于所述损失函数对所述残差神经网络模型中的网络参数进行调整,并在满足预设迭代停止条件时停止训练。Obtain a sample picture and a classification label corresponding to the sample picture; wherein the classification label is a product word of a sample item contained in the sample picture, and the product word is used to characterize the type of the sample item and does not contain Vocabulary of brand information; input the sample picture into the residual neural network model to obtain the predicted classification output by the residual neural network model; determine a loss function based on the predicted classification and the classification label, based on the loss The function adjusts the network parameters in the residual neural network model and stops training when the preset iteration stop conditions are met.
一实施例中,所述预设迭代停止条件包括所述残差神经网络模型的预测准确率达到预设准确率范围,所述预设准确率范围可以包括[75%,90%]。In one embodiment, the preset iteration stop condition includes that the prediction accuracy of the residual neural network model reaches a preset accuracy range, and the preset accuracy range may include [75%, 90%].
一实施例中,所述残差神经网络模型的尾部包含分别用于输出32维向量、129维向量和256维向量的三个全连接层。In one embodiment, the tail of the residual neural network model includes three fully connected layers for outputting 32-dimensional vectors, 129-dimensional vectors, and 256-dimensional vectors respectively.
一实施例中,所述自注意力模块包括Transformer模型中的多头自注意力模块。 In one embodiment, the self-attention module includes a multi-head self-attention module in the Transformer model.
一实施例中,数据获取模块910设置为:In one embodiment, the data acquisition module 910 is configured as:
获取目标物品对应的多个广告素材数据,其中,所述广告素材数据包括广告文案和广告图片;根据所述多个广告素材数据分别对应的线上点击数据,从所述多个广告素材数据中选取出至少一个广告文案和至少一个广告图片;对选取的广告文案和广告图片进行组合,得到至少一个候选广告创意数据。Obtain multiple advertising material data corresponding to the target item, wherein the advertising material data includes advertising copy and advertising pictures; according to the online click data corresponding to the multiple advertising material data, from the multiple advertising material data Select at least one advertising copy and at least one advertising picture; combine the selected advertising copy and advertising pictures to obtain at least one candidate advertising creative data.
一实施例中,数据获取模块910还设置为:In one embodiment, the data acquisition module 910 is further configured to:
从所述目标物品的物品详情页和/或广告创意素材中,识别并提取出广告文案;对所述目标物品的物品详情页中的物品图片进行定位和裁剪,得到广告图片。Identify and extract advertising copy from the item details page of the target item and/or advertising creative materials; position and crop the item image in the item details page of the target item to obtain an advertising image.
一实施例中,数据获取模块910还设置为:In one embodiment, the data acquisition module 910 is further configured to:
基于预设字符识别模型,从所述目标物品的物品详情页和/或广告创意素材中,识别出待选文案;基于包含预设利益点词汇的第一词表,从所述待选文案中筛选出利益点文案;基于预设字数限制条件和/或包含预设非卖点词汇的第二词表,从所述待选文案中除去所述利益点文案后剩余的文案中,筛选出卖点文案。Based on the preset character recognition model, the candidate copy is identified from the item details page and/or advertising creative material of the target item; based on the first word list containing the preset interest point vocabulary, the candidate copy is identified Screening out the benefit point copywriting; based on the preset word limit conditions and/or a second vocabulary list containing preset non-selling point words, screening out the selling point copywriting from the remaining copywriting after removing the benefit point copywriting from the candidate copywriting .
一实施例中,数据获取模块910还设置为:In one embodiment, the data acquisition module 910 is further configured to:
对于每个广告素材数据,根据所述广告素材数据的线上点击量均值和所述广告素材数据被选择的累计次数,确定所述广告素材数据的得分;根据每个广告素材数据的得分,从所述多个广告素材数据中选取出至少一个广告文案和至少一个广告图片。For each advertising material data, the score of the advertising material data is determined based on the average number of online clicks of the advertising material data and the cumulative number of times the advertising material data is selected; based on the score of each advertising material data, from At least one advertising copy and at least one advertising image are selected from the plurality of advertising material data.
一实施例中,数据获取模块910还设置为:In one embodiment, the data acquisition module 910 is further configured to:
对选取的广告文案和广告图片进行组合,得到至少一个文案图片组合;对所述至少一个文案图片组合与至少一个预设背景模板进行组合,得到至少一个创意组合;基于预设筛选因素,从所述至少一个创意组合中筛选出至少一个创意组合作为候选广告创意数据;其中,所述预设筛选因素包括所述目标物品的类目信息、和/或每个创意组合中广告图片和背景模板的颜色信息。Combine the selected advertising copy and advertising pictures to obtain at least one copy and picture combination; combine the at least one copy and picture combination with at least one preset background template to obtain at least one creative combination; based on the preset filtering factors, select Filter out at least one creative combination from the at least one creative combination as candidate advertising creative data; wherein the preset filtering factors include category information of the target item, and/or the characteristics of the advertising pictures and background templates in each creative combination. Color information.
一实施例中,数据获取模块910还设置为:In one embodiment, the data acquisition module 910 is further configured to:
在所述候选广告创意数据包含的广告图片中的目标物品区域的尺寸小于预设阈值的情况下,将所述目标物品区域进行裁剪,并使用裁剪得到的目标物品图片更新所述候选广告创意数据包含的广告图片。When the size of the target item area in the advertising image contained in the candidate advertising creative data is smaller than the preset threshold, the target item area is cropped, and the cropped target item image is used to update the candidate advertising creative data. Advertising images included.
一实施例中,数据获取模块910还设置为:In one embodiment, the data acquisition module 910 is further configured to:
根据所述候选广告创意数据中广告图片的颜色信息,对所述广告图片中的 目标物品区域进行润色处理;其中,所述润色处理包括调整亮度、调整对比度、以及调整饱和度中的至少一项。According to the color information of the advertising image in the candidate advertising creative data, the advertising image in the The target item area is subjected to retouching processing; wherein the retouching processing includes at least one of adjusting brightness, adjusting contrast, and adjusting saturation.
一实施例中,数据获取模块910还设置为:In one embodiment, the data acquisition module 910 is further configured to:
根据所述候选广告创意数据中广告图片包含的像素点的像素值,确定所述广告图片是彩色图片或黑色图片;在所述广告图片是彩色图片的情况下,基于第一预设亮度参数值、第一预设对比度参数值和第一预设饱和度参数值,对所述广告图片中的目标物品区域进行润色处理;在所述广告图片是黑色图片的情况下,基于第二预设亮度参数值、第二预设对比度参数值和第二预设饱和度参数值,对所述广告图片中的目标物品区域进行润色处理;其中,所述第二预设亮度参数值大于所述第一预设亮度参数值,所述第二预设对比度参数值大于所述第一预设对比度参数值,所述第二预设饱和度参数值大于所述第一预设饱和度参数值。According to the pixel values of the pixels contained in the advertising images in the candidate advertising creative data, it is determined whether the advertising image is a color image or a black image; in the case where the advertising image is a color image, based on the first preset brightness parameter value , the first preset contrast parameter value and the first preset saturation parameter value, to polish the target item area in the advertising picture; when the advertising picture is a black picture, based on the second preset brightness The parameter value, the second preset contrast parameter value and the second preset saturation parameter value are used to polish the target item area in the advertising image; wherein the second preset brightness parameter value is greater than the first Preset brightness parameter value, the second preset contrast parameter value is greater than the first preset contrast parameter value, and the second preset saturation parameter value is greater than the first preset saturation parameter value.
一实施例中,所述装置还设置为:In one embodiment, the device is further configured to:
获取所述目标广告创意数据中广告图片的第一编码信息和广告文案的第二编码信息,根据所述第一编码信息和第二编码信息生成所述目标广告创意数据对应的URL;在接收到客户端发送的针对所述URL的访问请求的情况下,根据所述URL获取所述目标广告创意数据中的广告图片和广告文案,并将获取的广告图片和广告文案执行合图操作,得到目标广告创意图像;将所述目标广告创意图像发送给所述客户端进行展示。Obtain the first coding information of the advertising picture and the second coding information of the advertising copy in the target advertising creative data, and generate the URL corresponding to the target advertising creative data according to the first coding information and the second coding information; after receiving In the case of an access request for the URL sent by the client, the advertising image and advertising copy in the target advertising creative data are obtained according to the URL, and the obtained advertising image and advertising copy are combined to obtain the target Advertising creative image; sending the target advertising creative image to the client for display.
上述产品可执行本申请任意实施例所提供的方法,具备执行方法相应的功能模块和效果。The above-mentioned products can execute the methods provided by any embodiment of this application, and have corresponding functional modules and effects for executing the methods.
图10为本申请实施例提供的一种模型训练装置的结构示意图,该装置可采用软件和/或硬件的方式实现,该装置可集成在任何提供模型训练的功能的设备中,如图10所示,模型训练装置包括:Figure 10 is a schematic structural diagram of a model training device provided by an embodiment of the present application. The device can be implemented in the form of software and/or hardware. The device can be integrated in any device that provides model training functions, as shown in Figure 10 As shown, the model training device includes:
样本数据获取模块1010,设置为获取训练样本数据,所述训练样本数据包括样本物品对应的样本广告创意数据和所述样本广告创意数据对应的标准推荐概率值,所述样本广告创意数据包含广告图片和广告文案;向量获取模块1020,设置为获取所述样本广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及待训练的创意选择模型,获得所述样本广告创意数据对应的预测推荐概率值;模型训练模块1030,设置为根据所述标准推荐概率值和所述预测推荐概率值确定损失函数,基于所述损失函数对所述创意选择模型中的网络参数进行调整,并在满足预设迭代停止条件时停 止训练;其中,创意选择模型用于:采用自注意力机制,对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述预测推荐概率值。The sample data acquisition module 1010 is configured to obtain training sample data. The training sample data includes sample advertising creative data corresponding to sample items and standard recommendation probability values corresponding to the sample advertising creative data. The sample advertising creative data includes advertising pictures. and advertising copy; the vector acquisition module 1020 is configured to obtain the sparse feature vector and picture feature vector corresponding to the sample advertising creative data, and based on the sparse feature vector, the picture feature vector and the creative selection model to be trained, obtain The predicted recommendation probability value corresponding to the sample advertising creative data; the model training module 1030 is configured to determine a loss function based on the standard recommendation probability value and the predicted recommendation probability value, and select the creative selection model based on the loss function. Adjust the network parameters and stop when the preset iteration stop conditions are met. Stop training; wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the predicted recommendation probability value based on the fusion result.
所述创意选择模型包括MLP模块、自注意力模块和输出模块;其中:The creative selection model includes an MLP module, a self-attention module and an output module; where:
所述MLP模块,用于基于所述稀疏特征向量输出第一特征向量;所述自注意力模块,用于基于所述稀疏特征向量和所述图片特征向量输出第二特征向量;所述输出模块,用于基于所述第一特征向量和所述第二特征向量输出所述预测推荐概率值。The MLP module is used to output a first feature vector based on the sparse feature vector; the self-attention module is used to output a second feature vector based on the sparse feature vector and the picture feature vector; the output module , used to output the predicted recommendation probability value based on the first feature vector and the second feature vector.
上述产品可执行本申请任意实施例所提供的方法,具备执行方法相应的功能模块和效果。The above-mentioned products can execute the methods provided by any embodiment of this application, and have corresponding functional modules and effects for executing the methods.
图11为本申请实施例提供的一种计算机设备的结构示意图。图11示出了适于用来实现本申请实施方式的示例性计算机设备12的框图。图11显示的计算机设备12仅仅是一个示例,不应对本申请实施例的功能和使用范围带来任何限制。Figure 11 is a schematic structural diagram of a computer device provided by an embodiment of the present application. 11 illustrates a block diagram of an exemplary computer device 12 suitable for implementing embodiments of the present application. The computer device 12 shown in FIG. 11 is only an example and should not bring any limitations to the functions and scope of use of the embodiments of the present application.
如图11所示,计算机设备12以通用计算设备的形式表现。计算机设备12的组件可以包括但不限于:一个或者多个处理器或者处理单元16,内存28,连接不同系统组件(包括内存28和处理单元16)的总线18。As shown in Figure 11, computer device 12 is embodied in the form of a general purpose computing device. The components of computer device 12 may include, but are not limited to, one or more processors or processing units 16, memory 28, and a bus 18 connecting various system components, including memory 28 and processing unit 16.
总线18表示几类总线结构中的一种或多种,包括存储器总线或者存储器控制器,外围总线,图形加速端口,处理器或者使用多种总线结构中的任意总线结构的局域总线。举例来说,这些体系结构包括但不限于工业标准体系结构(Industry Standard Architecture,ISA)总线,微通道体系结构(Micro Channel Architecture,MAC)总线,增强型ISA总线、视频电子标准协会(Video Electronics Standards Association,VESA)局域总线以及外围组件互连(Peripheral Component Interconnect,PCI)总线。Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, a graphics accelerated port, a processor, or a local bus using any of a variety of bus structures. For example, these architectures include, but are not limited to, Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MAC) bus, Enhanced ISA bus, Video Electronics Standards Association (Video Electronics Standards) Association, VESA) local bus and Peripheral Component Interconnect (PCI) bus.
计算机设备12包括多种计算机系统可读介质。这些介质可以是任何能够被计算机设备12访问的可用介质,包括易失性和非易失性介质,可移动的和不可移动的介质。Computer device 12 includes a variety of computer system readable media. These media can be any available media that can be accessed by computer device 12, including volatile and nonvolatile media, removable and non-removable media.
内存28可以包括易失性存储器形式的计算机系统可读介质,例如随机存取存储器(Random Access Memory,RAM)30和/或高速缓存存储器32。计算机设备12可以包括其它可移动/不可移动的、易失性/非易失性计算机系统存储介质。仅作为举例,存储系统34可以设置为读写不可移动的、非易失性磁介质(图11未显示,通常称为“硬盘驱动器”)。尽管图11中未示出,可以提供设置为对可移动非易失性磁盘(例如“软盘”)读写的磁盘驱动器,以及对可移动非易失性 光盘(例如只读光盘存储器(Compact Disc Read-Only Memory,CD-ROM),数字视盘(Digital Video Disc-Read Only Memory,DVD-ROM)或者其它光介质)读写的光盘驱动器。在这些情况下,每个驱动器可以通过一个或者多个数据介质接口与总线18相连。内存28可以包括至少一个程序产品,该程序产品具有一组(例如至少一个)程序模块,这些程序模块被配置以执行本申请实施例的功能。Memory 28 may include computer system readable media in the form of volatile memory, such as random access memory (RAM) 30 and/or cache memory 32 . Computer device 12 may include other removable/non-removable, volatile/non-volatile computer system storage media. By way of example only, storage system 34 may be configured to read and write to non-removable, non-volatile magnetic media (not shown in Figure 11, commonly referred to as a "hard drive"). Although not shown in FIG. 11, a disk drive configured to read and write to a removable non-volatile disk (eg, a "floppy disk") may be provided, and a disk drive configured to read and write to a removable non-volatile disk may be provided. An optical disc drive that reads and writes optical discs (such as Compact Disc Read-Only Memory (CD-ROM), Digital Video Disc-Read Only Memory (DVD-ROM) or other optical media). In these cases, each drive may be connected to bus 18 through one or more data media interfaces. The memory 28 may include at least one program product having a set of (eg, at least one) program modules configured to perform the functions of embodiments of the present application.
具有一组(至少一个)程序模块42的程序/实用工具40,可以存储在例如内存28中,这样的程序模块42包括——但不限于——操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或一种组合中可能包括网络环境的实现。程序模块42通常执行本申请所描述的实施例中的功能和/或方法。A program/utility 40 having a set of (at least one) program modules 42, which may be stored, for example, in memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other programs Modules, as well as program data, each or a combination of these examples may include an implementation of a network environment. Program modules 42 generally perform functions and/or methods in the embodiments described herein.
计算机设备12也可以与一个或多个外部设备14(例如键盘、指向设备、显示器24等)通信,还可与一个或者多个使得用户能与该计算机设备12交互的设备通信,和/或与使得该计算机设备12能与一个或多个其它计算设备进行通信的任何设备(例如网卡,调制解调器等等)通信。这种通信可以通过输入/输出(Input/Output,I/O)接口22进行。另外,本实施例中的计算机设备12,显示器24不是作为独立个体存在,而是嵌入镜面中,在显示器24的显示面不予显示时,显示器24的显示面与镜面从视觉上融为一体。并且,计算机设备12还可以通过网络适配器20与一个或者多个网络(例如局域网(Local Area Network,LAN),广域网(Wide Area Network,WAN)和/或公共网络,例如因特网)通信。如图所示,网络适配器20通过总线18与计算机设备12的其它模块通信。应当明白,尽管图中未示出,可以结合计算机设备12使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理单元、外部磁盘驱动阵列、磁盘阵列(Redundant Arrays of Independent Disks,RAID)系统、磁带驱动器以及数据备份存储系统等。Computer device 12 may also communicate with one or more external devices 14 (e.g., keyboard, pointing device, display 24, etc.), with one or more devices that enable a user to interact with computer device 12, and/or with Any device (eg, network card, modem, etc.) that enables the computer device 12 to communicate with one or more other computing devices. This communication may occur through an input/output (I/O) interface 22 . In addition, in the computer device 12 in this embodiment, the display 24 does not exist as an independent entity, but is embedded in the mirror. When the display surface of the display 24 is not displayed, the display surface of the display 24 and the mirror surface are visually integrated. Moreover, the computer device 12 can also communicate with one or more networks (such as a local area network (Local Area Network, LAN), a wide area network (Wide Area Network, WAN), and/or a public network, such as the Internet) through the network adapter 20. As shown, network adapter 20 communicates with other modules of computer device 12 via bus 18 . It should be understood that, although not shown in the figures, other hardware and/or software modules may be used in conjunction with the computer device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, disk arrays (Redundant Arrays). of Independent Disks, RAID) systems, tape drives, and data backup storage systems, etc.
处理单元16通过运行存储在内存28中的程序,从而执行多种功能应用以及数据处理,例如实现本申请实施例所提供的一种广告创意数据选取方法:获取目标物品对应的候选广告创意数据;其中,所述候选广告创意数据包含广告图片和广告文案;获取所述候选广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及预先训练的创意选择模型,获得所述候选广告创意数据对应的推荐概率值;其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述推荐概率值,或实现本申请实施例所提供的一种模型训练方法:获取训练样本数据,其中,所述训练样本数据包括样本物品对应的样本广告创意数据和所述样本广告创意数据对应的标准推荐概率值,所述样本 广告创意数据包含广告图片和广告文案;获取所述样本广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及待训练的创意选择模型,获得所述样本广告创意数据对应的预测推荐概率值;根据所述标准推荐概率值和所述预测推荐概率值确定损失函数,基于所述损失函数对所述待训练的创意选择模型中的网络参数进行调整,并在满足预设迭代停止条件时的情况下停止训练;其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述预测推荐概率值。The processing unit 16 executes a variety of functional applications and data processing by running programs stored in the memory 28, for example, implementing an advertising creative data selection method provided in the embodiment of the present application: obtaining candidate advertising creative data corresponding to the target item; Wherein, the candidate advertising creative data includes advertising pictures and advertising copy; the sparse feature vectors and picture feature vectors corresponding to the candidate advertising creative data are obtained, and based on the sparse feature vectors, the picture feature vectors and pre-trained creative Select a model to obtain the recommendation probability value corresponding to the candidate advertising creative data; wherein the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and based on the fusion result Output the recommended probability value, or implement a model training method provided by the embodiment of the present application: obtain training sample data, wherein the training sample data includes sample advertising creative data corresponding to the sample item and the sample advertising creative data Corresponding standard recommendation probability value, the sample Advertising creative data includes advertising pictures and advertising copy; obtain the sparse feature vectors and picture feature vectors corresponding to the sample advertising creative data, and obtain the sparse feature vectors, the picture feature vectors and the creative selection model to be trained. The predicted recommendation probability value corresponding to the sample advertising creative data; determine the loss function according to the standard recommendation probability value and the predicted recommendation probability value, and adjust the network parameters in the creative selection model to be trained based on the loss function , and stop training when the preset iteration stop conditions are met; wherein the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and based on the fusion result Output the predicted recommendation probability value.
本申请实施例提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如本申请所有申请实施例提供的一种广告创意数据选取方法:获取目标物品对应的候选广告创意数据;其中,所述候选广告创意数据包含广告图片和广告文案;获取所述候选广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及预先训练的创意选择模型,获得所述候选广告创意数据对应的推荐概率值;其中,所述创意选择模型用于:采用自注意力机制,对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述推荐概率值,或实现本申请实施例所提供的一种模型训练方法:获取训练样本数据,其中,所述训练样本数据包括样本物品对应的样本广告创意数据和所述样本广告创意数据对应的标准推荐概率值,所述样本广告创意数据包含广告图片和广告文案;获取所述样本广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及待训练的创意选择模型,获得所述样本广告创意数据对应的预测推荐概率值;根据所述标准推荐概率值和所述预测推荐概率值确定损失函数,基于所述损失函数对所述待训练的创意选择模型中的网络参数进行调整,并在满足预设迭代停止条件时的情况下停止训练;其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述预测推荐概率值。Embodiments of the present application provide a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, an advertising creative data selection method as provided in all embodiments of the present application is implemented: obtaining the corresponding target item. Candidate advertising creative data; wherein, the candidate advertising creative data includes advertising pictures and advertising copy; obtain the sparse feature vector and picture feature vector corresponding to the candidate advertising creative data, and based on the sparse feature vector, the picture feature vector and a pre-trained creative selection model to obtain the recommendation probability value corresponding to the candidate advertising creative data; wherein the creative selection model is used to: use a self-attention mechanism to compare the sparse feature vector and the picture feature vector Perform fusion, and output the recommended probability value based on the fusion result, or implement a model training method provided by the embodiment of the present application: obtain training sample data, wherein the training sample data includes sample advertising creative data corresponding to the sample item The standard recommendation probability value corresponding to the sample advertising creative data, which includes advertising pictures and advertising copy; obtains the sparse feature vector and picture feature vector corresponding to the sample advertising creative data, and based on the sparse features vector, the picture feature vector and the creative selection model to be trained, to obtain the predicted recommendation probability value corresponding to the sample advertising creative data; determine a loss function based on the standard recommendation probability value and the predicted recommendation probability value, based on the The loss function adjusts the network parameters in the creative selection model to be trained, and stops training when the preset iteration stop conditions are met; wherein the creative selection model is used to: use a self-attention mechanism to The sparse feature vector and the picture feature vector are fused, and the predicted recommendation probability value is output based on the fusion result.
可以采用一个或多个计算机可读的介质的任意组合。计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质。计算机可读存储介质例如可以是但不限于电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的例子(非穷举的列表)包括:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、RAM、ROM、可擦式可编程只读存储器(Erasable Programmable Read-Only Memory,EPROM或闪存)、光纤、CD-ROM、光存储器件、磁存储器件、或者上述的任意合适的组合。在本文件中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程 序可以被指令执行系统、装置或者器件使用或者与其结合使用。Any combination of one or more computer-readable media may be employed. The computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. The computer-readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device or device, or any combination thereof. Examples of computer-readable storage media (a non-exhaustive list) include: electrical connections having one or more conductors, portable computer disks, hard drives, RAM, ROM, Erasable Programmable Read-Only Memory Memory, EPROM or flash memory), optical fiber, CD-ROM, optical storage device, magnetic storage device, or any suitable combination of the above. As used in this document, a computer-readable storage medium may be any tangible medium that contains or stores a program that The program may be used by or in conjunction with an instruction execution system, apparatus, or device.
计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。A computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device .
计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于无线、电线、光缆、射频(Radio Frequency,RF)等等,或者上述的任意合适的组合。Program code embodied on a computer-readable medium can be transmitted using any appropriate medium, including but not limited to wireless, wire, optical cable, radio frequency (Radio Frequency, RF), etc., or any suitable combination of the above.
可以以一种或多种程序设计语言或其组合来编写用于执行本申请操作的计算机程序代码,程序设计语言包括面向对象的程序设计语言,诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言,诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括LAN或WAN—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。 Computer program code for performing the operations of the present application may be written in one or more programming languages, including object-oriented programming languages such as Java, Smalltalk, C++, and conventional procedural programming languages, or a combination thereof. A programming language, such as "C" or a similar programming language. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In situations involving remote computers, the remote computer may be connected to the user computer through any kind of network, including a LAN or WAN, or may be connected to an external computer (eg, through the Internet using an Internet service provider).

Claims (18)

  1. 一种广告创意数据选取方法,包括:A method for selecting advertising creative data, including:
    获取目标物品对应的候选广告创意数据;其中,所述候选广告创意数据包含广告图片和广告文案;Obtain candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising images and advertising copy;
    获取所述候选广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及预先训练的创意选择模型,获得所述候选广告创意数据对应的推荐概率值;Obtain the sparse feature vector and picture feature vector corresponding to the candidate advertising creative data, and obtain the recommendation probability value corresponding to the candidate advertising creative data based on the sparse feature vector, the picture feature vector and the pre-trained creative selection model ;
    根据所述推荐概率值选取目标广告创意数据;Select target advertising creative data according to the recommended probability value;
    其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述推荐概率值。Wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the recommendation probability value based on the fusion result.
  2. 根据权利要求1所述的方法,其中,所述创意选择模型包括多层感知机神经网络MLP模块、自注意力模块和输出模块;其中:The method according to claim 1, wherein the creative selection model includes a multi-layer perceptron neural network MLP module, a self-attention module and an output module; wherein:
    所述MLP模块,用于基于所述稀疏特征向量输出第一特征向量;The MLP module is used to output a first feature vector based on the sparse feature vector;
    所述自注意力模块,用于基于所述稀疏特征向量和所述图片特征向量输出第二特征向量;The self-attention module is used to output a second feature vector based on the sparse feature vector and the picture feature vector;
    所述输出模块,用于基于所述第一特征向量和所述第二特征向量输出所述推荐概率值。The output module is configured to output the recommendation probability value based on the first feature vector and the second feature vector.
  3. 根据权利要求1所述的方法,其中,所述获取所述候选广告创意数据对应的图片特征向量,包括:The method according to claim 1, wherein said obtaining the image feature vector corresponding to the candidate advertising creative data includes:
    将所述候选广告创意数据中的广告图片输入预先训练出的残差神经网络模型;Input the advertising images in the candidate advertising creative data into the pre-trained residual neural network model;
    获取所述残差神经网络模型输出的图片特征向量。Obtain the image feature vector output by the residual neural network model.
  4. 根据权利要求3所述的方法,其中,所述残差神经网络模型的训练方法包括:The method according to claim 3, wherein the training method of the residual neural network model includes:
    获取样本图片和所述样本图片对应的分类标签;其中,所述分类标签为所述样本图片中包含的样本物品的产品词,所述产品词是用于表征所述样本物品的种类并且不包含品牌信息的词汇;Obtain a sample picture and a classification label corresponding to the sample picture; wherein the classification label is a product word of a sample item contained in the sample picture, and the product word is used to characterize the type of the sample item and does not contain vocabulary of brand messages;
    将所述样本图片输入待训练的残差神经网络模型,获取所述待训练的残差神经网络模型输出的预测分类;Input the sample picture into the residual neural network model to be trained, and obtain the predicted classification output by the residual neural network model to be trained;
    基于所述预测分类和所述分类标签确定损失函数,基于所述损失函数对所述待训练的残差神经网络模型中的网络参数进行调整,并在满足预设迭代停止条件的情况下停止训练。 Determine a loss function based on the predicted classification and the classification label, adjust network parameters in the residual neural network model to be trained based on the loss function, and stop training when preset iteration stop conditions are met. .
  5. 根据权利要求3所述的方法,其中,所述残差神经网络模型的尾部包含分别用于输出32维向量、128维向量和256维向量的三个全连接层。The method according to claim 3, wherein the tail of the residual neural network model includes three fully connected layers for outputting 32-dimensional vectors, 128-dimensional vectors and 256-dimensional vectors respectively.
  6. 根据权利要求1-5中任一项所述的方法,其中,所述获取目标物品对应的候选广告创意数据,包括:The method according to any one of claims 1-5, wherein said obtaining the candidate advertising creative data corresponding to the target item includes:
    获取所述目标物品对应的多个广告素材数据,其中,所述广告素材数据包括广告文案和广告图片;Obtain multiple advertising material data corresponding to the target item, wherein the advertising material data includes advertising copy and advertising pictures;
    根据所述多个广告素材数据分别对应的线上点击数据,从所述多个广告素材数据中选取出至少一个广告文案和至少一个广告图片;Select at least one advertising copy and at least one advertising image from the plurality of advertising material data according to the online click data corresponding to the plurality of advertising material data;
    对选取的广告文案和广告图片进行组合,得到至少一个候选广告创意数据。Combine the selected advertising copy and advertising images to obtain at least one candidate advertising creative data.
  7. 根据权利要求6所述的方法,其中,所述获取所述目标物品对应的多个广告素材数据,包括:The method according to claim 6, wherein said obtaining multiple advertising material data corresponding to the target item includes:
    从所述目标物品的物品详情页和广告创意素材中的至少之一中,识别并提取出广告文案;Identify and extract advertising copy from at least one of the item details page and advertising creative material of the target item;
    对所述目标物品的物品详情页中的物品图片进行定位和裁剪,得到广告图片。Position and crop the item image in the item details page of the target item to obtain an advertising image.
  8. 根据权利要求7所述的方法,其中,所述广告文案包括利益点文案和卖点文案;所述从所述目标物品的物品详情页和广告创意素材中的至少之一中,识别并提取出广告文案,包括:The method according to claim 7, wherein the advertising copy includes benefit point copy and selling point copy; and the advertisement is identified and extracted from at least one of the item details page of the target item and the advertising creative material. Copywriting, including:
    基于预设字符识别模型,从所述目标物品的物品详情页和广告创意素材中的至少之一中,识别出待选文案;Based on a preset character recognition model, identify the candidate copy from at least one of the item details page and advertising creative material of the target item;
    基于包含预设利益点词汇的第一词表,从所述待选文案中筛选出所述利益点文案;Based on the first vocabulary list containing preset interest point vocabulary, filter out the interest point copywriting from the candidate copywriting;
    基于预设字数限制条件和包含预设非卖点词汇的第二词表中的至少之一,从所述待选文案中除去所述利益点文案后剩余的文案中,筛选出所述卖点文案。Based on at least one of the preset word limit conditions and the second word list containing preset non-selling point words, the selling point copy is screened out from the remaining copy after excluding the benefit point copy from the candidate copy.
  9. 根据权利要求6所述的方法,其中,所述根据所述多个广告素材数据分别对应的线上点击数据,从所述多个广告素材数据中选取出至少一个广告文案和至少一个广告图片,包括:The method according to claim 6, wherein at least one advertising copy and at least one advertising image are selected from the plurality of advertising material data according to the online click data respectively corresponding to the plurality of advertising material data, include:
    对于每个广告素材数据,根据所述广告素材数据的线上点击量均值和所述广告素材数据被选择的累计次数,确定所述广告素材数据的得分;For each advertising material data, determine the score of the advertising material data based on the average number of online clicks of the advertising material data and the cumulative number of times the advertising material data is selected;
    根据每个广告素材数据的得分,从所述多个广告素材数据中选取出至少一个广告文案和至少一个广告图片。 According to the score of each advertising material data, at least one advertising copy and at least one advertising image are selected from the plurality of advertising material data.
  10. 根据权利要求6所述的方法,其中,所述对选取的广告文案和广告图片进行组合,得到至少一个候选广告创意数据,包括:The method according to claim 6, wherein the selected advertising copy and advertising image are combined to obtain at least one candidate advertising creative data, including:
    对所述选取的广告文案和广告图片进行组合,得到至少一个文案图片组合;Combine the selected advertising copy and advertising pictures to obtain at least one copy copy and picture combination;
    对所述至少一个文案图片组合与至少一个预设背景模板进行组合,得到至少一个创意组合;Combine the at least one copywriting picture combination and at least one preset background template to obtain at least one creative combination;
    基于预设筛选因素,从所述至少一个创意组合中筛选出至少一个创意组合作为候选广告创意数据;其中,所述预设筛选因素包括所述目标物品的类目信息、和每个创意组合中广告图片和背景模板的颜色信息中的至少之一。Based on preset screening factors, at least one creative combination is selected from the at least one creative combination as candidate advertising creative data; wherein the preset screening factors include category information of the target item, and the content of each creative combination. At least one of the color information of the ad image and background template.
  11. 根据权利要求6所述的方法,在所述得到至少一个候选广告创意数据之后,还包括:The method according to claim 6, after obtaining at least one candidate advertising creative data, further comprising:
    根据所述候选广告创意数据中广告图片的颜色信息,对所述广告图片中的目标物品区域进行润色处理;其中,所述润色处理包括调整亮度、调整对比度、以及调整饱和度中的至少一项。According to the color information of the advertising image in the candidate advertising creative data, the target item area in the advertising image is retouched; wherein the retouching process includes at least one of adjusting brightness, adjusting contrast, and adjusting saturation. .
  12. 根据权利要求11所述的方法,其中,所述根据所述候选广告创意数据中广告图片的颜色信息,对所述广告图片中的目标物品区域进行润色处理,包括:The method according to claim 11, wherein polishing the target item area in the advertising image according to the color information of the advertising image in the candidate advertising creative data includes:
    根据所述候选广告创意数据中广告图片包含的像素点的像素值,确定所述广告图片是彩色图片或黑色图片;Determine whether the advertising image is a color image or a black image based on the pixel values of the pixels contained in the advertising image in the candidate advertising creative data;
    在所述广告图片是彩色图片的情况下,基于第一预设亮度参数值、第一预设对比度参数值和第一预设饱和度参数值,对所述广告图片中的目标物品区域进行润色处理;When the advertising picture is a color picture, the target item area in the advertising picture is polished based on the first preset brightness parameter value, the first preset contrast parameter value and the first preset saturation parameter value. deal with;
    在所述广告图片是黑色图片的情况下,基于第二预设亮度参数值、第二预设对比度参数值和第二预设饱和度参数值,对所述广告图片中的目标物品区域进行润色处理;In the case where the advertising image is a black image, the target item area in the advertising image is retouched based on the second preset brightness parameter value, the second preset contrast parameter value and the second preset saturation parameter value. deal with;
    其中,所述第二预设亮度参数值大于所述第一预设亮度参数值,所述第二预设对比度参数值大于所述第一预设对比度参数值,所述第二预设饱和度参数值大于所述第一预设饱和度参数值。Wherein, the second preset brightness parameter value is greater than the first preset brightness parameter value, the second preset contrast parameter value is greater than the first preset contrast parameter value, and the second preset saturation parameter value is greater than the first preset contrast parameter value. The parameter value is greater than the first preset saturation parameter value.
  13. 根据权利要求1-5中任一项所述的方法,在所述选取目标广告创意数据之后,还包括:The method according to any one of claims 1-5, after selecting the target advertising creative data, further comprising:
    获取所述目标广告创意数据中广告图片的第一编码信息和广告文案的第二编码信息,根据所述第一编码信息和第二编码信息生成所述目标广告创意数据对应的统一资源定位符URL; Obtain the first coding information of the advertising image and the second coding information of the advertising copy in the target advertising creative data, and generate a unified resource locator URL corresponding to the target advertising creative data based on the first coding information and the second coding information. ;
    在接收到客户端发送的针对所述URL的访问请求的情况下,根据所述URL获取所述目标广告创意数据中的广告图片和广告文案,并将获取的广告图片和广告文案执行合图操作,得到目标广告创意图像;将所述目标广告创意图像发送给所述客户端进行展示。When an access request for the URL sent by the client is received, the advertising image and advertising copy in the target advertising creative data are obtained according to the URL, and the obtained advertising image and advertising copy are combined into a picture. , obtain the target advertisement creative image; send the target advertisement creative image to the client for display.
  14. 一种模型训练方法,包括:A model training method including:
    获取训练样本数据,其中,所述训练样本数据包括样本物品对应的样本广告创意数据和所述样本广告创意数据对应的标准推荐概率值,所述样本广告创意数据包含广告图片和广告文案;Obtain training sample data, wherein the training sample data includes sample advertising creative data corresponding to the sample items and standard recommendation probability values corresponding to the sample advertising creative data, and the sample advertising creative data includes advertising pictures and advertising copy;
    获取所述样本广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及待训练的创意选择模型,获得所述样本广告创意数据对应的预测推荐概率值;Obtain the sparse feature vector and picture feature vector corresponding to the sample advertising creative data, and obtain the predicted recommendation probability corresponding to the sample advertising creative data based on the sparse feature vector, the picture feature vector and the creative selection model to be trained. value;
    根据所述标准推荐概率值和所述预测推荐概率值确定损失函数,基于所述损失函数对所述待训练的创意选择模型中的网络参数进行调整,并在满足预设迭代停止条件的情况下停止训练;Determine a loss function according to the standard recommendation probability value and the predicted recommendation probability value, adjust the network parameters in the creative selection model to be trained based on the loss function, and when the preset iteration stop conditions are met Stop training;
    其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述预测推荐概率值。Wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the predicted recommendation probability value based on the fusion result.
  15. 一种广告创意数据选取装置,包括:An advertising creative data selection device, including:
    数据获取模块,设置为获取目标物品对应的候选广告创意数据;其中,所述候选广告创意数据包含广告图片和广告文案;The data acquisition module is configured to obtain candidate advertising creative data corresponding to the target item; wherein the candidate advertising creative data includes advertising pictures and advertising copy;
    概率值得到模块,设置为获取所述候选广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及预先训练的创意选择模型,获得所述候选广告创意数据对应的推荐概率值;The probability value obtaining module is configured to obtain the sparse feature vector and picture feature vector corresponding to the candidate advertisement creative data, and obtain the candidate advertisement based on the sparse feature vector, the picture feature vector and the pre-trained creative selection model. Recommendation probability value corresponding to creative data;
    数据选取模块,设置为根据所述推荐概率值选取目标广告创意数据;A data selection module configured to select target advertising creative data according to the recommendation probability value;
    其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述推荐概率值。Wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the recommendation probability value based on the fusion result.
  16. 一种模型训练装置,包括:A model training device including:
    样本数据获取模块,设置为获取训练样本数据,其中,所述训练样本数据包括样本物品对应的样本广告创意数据和所述样本广告创意数据对应的标准推荐概率值,所述样本广告创意数据包含广告图片和广告文案;A sample data acquisition module configured to obtain training sample data, wherein the training sample data includes sample advertising creative data corresponding to sample items and standard recommendation probability values corresponding to the sample advertising creative data, and the sample advertising creative data includes advertisements. images and advertising copy;
    向量获取模块,设置为获取所述样本广告创意数据对应的稀疏特征向量和图片特征向量,并基于所述稀疏特征向量、所述图片特征向量以及待训练的创 意选择模型,获得所述样本广告创意数据对应的预测推荐概率值;The vector acquisition module is configured to obtain the sparse feature vector and picture feature vector corresponding to the sample advertising creative data, and based on the sparse feature vector, the picture feature vector and the creative to be trained Select a model intentionally to obtain the predicted recommendation probability value corresponding to the sample advertising creative data;
    模型训练模块,设置为根据所述标准推荐概率值和所述预测推荐概率值确定损失函数,基于所述损失函数对所述待训练的创意选择模型中的网络参数进行调整,并在满足预设迭代停止条件的情况下停止训练;A model training module configured to determine a loss function based on the standard recommendation probability value and the predicted recommendation probability value, adjust the network parameters in the creative selection model to be trained based on the loss function, and adjust the network parameters in the creative selection model to be trained, and perform Stop training if the iteration stop condition is met;
    其中,所述创意选择模型用于:采用自注意力机制对所述稀疏特征向量和所述图片特征向量进行融合,并基于融合结果输出所述预测推荐概率值。Wherein, the creative selection model is used to: use a self-attention mechanism to fuse the sparse feature vector and the picture feature vector, and output the predicted recommendation probability value based on the fusion result.
  17. 一种电子设备,包括:An electronic device including:
    至少一个处理器;以及at least one processor; and
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively connected to the at least one processor; wherein,
    所述存储器存储有可被所述至少一个处理器执行的计算机程序,所述计算机程序被所述至少一个处理器执行,以使所述至少一个处理器能够执行权利要求1-13中任一项所述的广告创意数据选取方法,或权利要求14所述的模型训练方法。The memory stores a computer program executable by the at least one processor, the computer program being executed by the at least one processor, so that the at least one processor can perform any one of claims 1-13 The advertising creative data selection method, or the model training method of claim 14.
  18. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机指令,所述计算机指令用于使处理器执行时实现权利要求1-13中任一项所述的广告创意数据选取方法,或权利要求14所述的模型训练方法。 A computer-readable storage medium that stores computer instructions, and the computer instructions are used to implement the advertising creative data selection method described in any one of claims 1-13 when executed by a processor, Or the model training method according to claim 14.
PCT/CN2023/116575 2022-09-09 2023-09-01 Advertisement creative data selection method and apparatus, model training method and apparatus, and device and storage medium WO2024051609A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211104745.3A CN115564469A (en) 2022-09-09 2022-09-09 Advertisement creative selection and model training method, device, equipment and storage medium
CN202211104745.3 2022-09-09

Publications (1)

Publication Number Publication Date
WO2024051609A1 true WO2024051609A1 (en) 2024-03-14

Family

ID=84740904

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/116575 WO2024051609A1 (en) 2022-09-09 2023-09-01 Advertisement creative data selection method and apparatus, model training method and apparatus, and device and storage medium

Country Status (2)

Country Link
CN (1) CN115564469A (en)
WO (1) WO2024051609A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115564469A (en) * 2022-09-09 2023-01-03 北京沃东天骏信息技术有限公司 Advertisement creative selection and model training method, device, equipment and storage medium
CN116911928B (en) * 2023-09-12 2024-01-05 深圳须弥云图空间科技有限公司 Training method and device of advertisement recommendation model based on creative features
CN117058275B (en) * 2023-10-12 2024-04-12 深圳兔展智能科技有限公司 Commodity propaganda drawing generation method and device, computer equipment and storage medium
CN117829911B (en) * 2024-03-06 2024-06-04 湖南创研科技股份有限公司 AI-driven advertisement creative optimization method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140156379A1 (en) * 2012-11-30 2014-06-05 Adobe Systems Incorporated Method and Apparatus for Hierarchical-Model-Based Creative Quality Scores
CN110889718A (en) * 2019-11-15 2020-03-17 腾讯科技(深圳)有限公司 Method and apparatus for screening program, medium, and electronic device
CN112862516A (en) * 2021-01-14 2021-05-28 北京达佳互联信息技术有限公司 Resource delivery method and device, electronic equipment and storage medium
CN112905840A (en) * 2021-02-09 2021-06-04 北京有竹居网络技术有限公司 Video processing method, device, storage medium and equipment
CN114493683A (en) * 2022-01-11 2022-05-13 北京达佳互联信息技术有限公司 Advertisement material recommendation method, model training method and device and electronic equipment
CN114898349A (en) * 2022-05-25 2022-08-12 广州欢聚时代信息科技有限公司 Target commodity identification method and device, equipment, medium and product thereof
CN115564469A (en) * 2022-09-09 2023-01-03 北京沃东天骏信息技术有限公司 Advertisement creative selection and model training method, device, equipment and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140156379A1 (en) * 2012-11-30 2014-06-05 Adobe Systems Incorporated Method and Apparatus for Hierarchical-Model-Based Creative Quality Scores
CN110889718A (en) * 2019-11-15 2020-03-17 腾讯科技(深圳)有限公司 Method and apparatus for screening program, medium, and electronic device
CN112862516A (en) * 2021-01-14 2021-05-28 北京达佳互联信息技术有限公司 Resource delivery method and device, electronic equipment and storage medium
CN112905840A (en) * 2021-02-09 2021-06-04 北京有竹居网络技术有限公司 Video processing method, device, storage medium and equipment
CN114493683A (en) * 2022-01-11 2022-05-13 北京达佳互联信息技术有限公司 Advertisement material recommendation method, model training method and device and electronic equipment
CN114898349A (en) * 2022-05-25 2022-08-12 广州欢聚时代信息科技有限公司 Target commodity identification method and device, equipment, medium and product thereof
CN115564469A (en) * 2022-09-09 2023-01-03 北京沃东天骏信息技术有限公司 Advertisement creative selection and model training method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN115564469A (en) 2023-01-03

Similar Documents

Publication Publication Date Title
US11574470B2 (en) Suggested actions for images
WO2024051609A1 (en) Advertisement creative data selection method and apparatus, model training method and apparatus, and device and storage medium
US10956007B2 (en) Electronic device and method for providing search result thereof
CN114155543B (en) Neural network training method, document image understanding method, device and equipment
US10198635B2 (en) Systems and methods for associating an image with a business venue by using visually-relevant and business-aware semantics
US20190156122A1 (en) Intelligent digital image scene detection
EP2551792B1 (en) System and method for computing the visual profile of a place
TW201712600A (en) Methods and systems for detecting and recognizing text from images
CN112364204B (en) Video searching method, device, computer equipment and storage medium
CN110942011B (en) Video event identification method, system, electronic equipment and medium
US11587342B2 (en) Using attributes for identifying imagery for selection
CN114648392B (en) Product recommendation method and device based on user portrait, electronic equipment and medium
CN110781925B (en) Software page classification method and device, electronic equipment and storage medium
WO2020057145A1 (en) Method and device for generating painting display sequence, and computer storage medium
US11532036B2 (en) Digital image ordering using object position and aesthetics
WO2024045474A1 (en) Image copywriting generation method, device, and computer storage medium
CN112085568B (en) Commodity and rich media aggregation display method and equipment, electronic equipment and medium
WO2023020160A1 (en) Recommendation method and apparatus, training method and apparatus, device, and recommendation system
CN111522979B (en) Picture sorting recommendation method and device, electronic equipment and storage medium
CN114329069A (en) Intelligent system and method for visual search query
CN114119136A (en) Product recommendation method and device, electronic equipment and medium
CN110998507A (en) Electronic device and method for providing search result thereof
CN111539782B (en) Deep learning-based merchant information data processing method and system
WO2023124793A1 (en) Image pushing method and device
Angeli et al. Making paper labels smart for augmented wine recognition

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23862303

Country of ref document: EP

Kind code of ref document: A1