WO2022126901A1 - Procédé de recommandation de marchandises et dispositif associé correspondant - Google Patents

Procédé de recommandation de marchandises et dispositif associé correspondant Download PDF

Info

Publication number
WO2022126901A1
WO2022126901A1 PCT/CN2021/082934 CN2021082934W WO2022126901A1 WO 2022126901 A1 WO2022126901 A1 WO 2022126901A1 CN 2021082934 W CN2021082934 W CN 2021082934W WO 2022126901 A1 WO2022126901 A1 WO 2022126901A1
Authority
WO
WIPO (PCT)
Prior art keywords
commodity
node
embedding vector
sequence
attribute
Prior art date
Application number
PCT/CN2021/082934
Other languages
English (en)
Chinese (zh)
Inventor
陈浩
谯轶轩
高鹏
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN202011504942.5A external-priority patent/CN112633973B/zh
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2022126901A1 publication Critical patent/WO2022126901A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting

Definitions

  • the present application relates to the technical field of artificial intelligence, and in particular, to a product recommendation method, device, computer equipment and storage medium.
  • Graph embedding is one of the hot research fields in recommender systems and graph social networks in recent years. By learning the information contained in the nodes in the graph, the nodes are mapped to a quantifiable space, that is, the node embedding in the graph is generated. In downstream tasks, the complex relationship between nodes in the network can be more deeply understood by performing tasks such as similarity calculation, classification, and clustering on node embedding.
  • the historical browsing order records of the products can be obtained first, and a graph network of commodity nodes can be constructed based on the historical browsing order records. Then, based on the random walk method, the topology structure between the nodes in each graph in the constructed commodity node graph network is transformed into a text-like sequence structure. Further, the word2vec model combined with the method of negative sampling is used to embed the generated sequence structure into a new space, and the structural relationship of the browsing order between each node in the graph network is mined, so as to determine the embedding vector of each commodity node in the commodity node graph network, That is, the embedding vector. Finally, use the determined embedding vector of each commodity node to recommend similar commodities.
  • the above-determined embedding vector of commodity nodes can realize commodity recommendation to a certain extent, but the inventor found that the commodity node embedding vector determined only by the above-mentioned means only contains the order information between commodity nodes, and contains a single information. When recommended, the effect of product recommendation is not significant.
  • the purpose of the embodiments of the present application is to provide a method, apparatus, computer equipment and storage medium for recommending products, which are mainly used to solve the technical problem of poor product recommendation effect due to the single information contained in the embedded vector of existing product nodes.
  • the embodiment of the present application provides a method for recommending commodities, which adopts the following technical solutions:
  • the commodity node sequence and the commodity attribute node sequence are used as training corpus, respectively input the word2vec model, and the commodity node embedding vector and the commodity attribute node embedding vector are obtained by calculation;
  • the product corresponding to the comprehensive embedding vector close to the comprehensive embedding vector of the target product is the recommended product determined for the target product.
  • the embodiment of the present application also provides a product recommendation device, which adopts the following technical solutions:
  • an acquisition unit used to acquire historical commodity browsing records of multiple users
  • a generating unit generating a sequence of commodity nodes and a sequence of commodity attribute nodes according to the historical commodity browsing records
  • the training unit is used for combining the negative sampling strategy, using the commodity node sequence and the commodity attribute node sequence as training corpus, respectively inputting the word2vec model, and calculating the commodity node embedding vector and the commodity attribute node embedding vector;
  • a computing unit configured to perform vector splicing or vector summation operations on the commodity node embedding vector and the commodity attribute node embedding vector to obtain a comprehensive embedding vector corresponding to each commodity;
  • a response unit used to determine the target commodity accessed by the user
  • the recommendation unit is configured to determine the product corresponding to the comprehensive embedding vector similar to the comprehensive embedding vector of the target product, which is the recommended product determined for the target product.
  • the embodiment of the present application also provides a computer device, which adopts the following technical solutions:
  • a computer device comprising a memory and a processor, wherein computer-readable instructions are stored in the memory, and when the processor executes the computer-readable instructions, the processor implements the following steps of a commodity recommendation method:
  • the commodity node sequence and the commodity attribute node sequence are used as training corpus, respectively input the word2vec model, and the commodity node embedding vector and the commodity attribute node embedding vector are obtained by calculation;
  • the product corresponding to the comprehensive embedding vector that is similar to the comprehensive embedding vector of the target product is the recommended product determined for the target product.
  • the embodiments of the present application also provide a computer-readable storage medium, which adopts the following technical solutions:
  • a computer-readable storage medium wherein computer-readable instructions are stored on the computer-readable storage medium, and when the computer-readable instructions are executed by a processor, the steps of the following commodity recommendation method are implemented:
  • the commodity node sequence and the commodity attribute node sequence are used as training corpus, respectively input the word2vec model, and the commodity node embedding vector and the commodity attribute node embedding vector are obtained by calculation;
  • the product corresponding to the comprehensive embedding vector that is similar to the comprehensive embedding vector of the target product is the recommended product determined for the target product.
  • a commodity node sequence and a commodity attribute node sequence are further generated according to the historical commodity browsing records. Then, combined with the negative sampling strategy, the commodity node sequence and commodity attribute node sequence are used as training corpus, and input into the word2vec model respectively, and the commodity node embedding vector and commodity attribute node embedding vector are calculated. Finally, perform vector splicing or vector summation operations on the commodity node embedding vector and commodity attribute node embedding vector to obtain the comprehensive embedding vector corresponding to each commodity.
  • the commodity recommendation method that integrates commodity attribute information proposed in this application can better combine the structural information and attribute information of the nodes in the graph network.
  • the commodity comprehensive embedding vector calculated in this application not only contains the user's behavior information, but also contains the content information of the commodity itself, which can reflect the property information of the commodity itself.
  • the embedding vector in the prior art can achieve a more accurate product recommendation effect.
  • FIG. 1 is a flowchart of an embodiment of a product recommendation method of the present application
  • Fig. 2 is a flow chart of a specific implementation manner of step S120 in Fig. 1;
  • FIG. 3 is a flowchart of another embodiment of a product recommendation method of the present application.
  • FIG. 4 is a schematic diagram of an embodiment of a product recommendation device of the present application.
  • FIG. 5 is a schematic diagram of an embodiment of the generating unit 420
  • FIG. 6 is a schematic diagram of an embodiment of a computer device of the present application.
  • Embedding refers to using a low-dimensional vector to represent an object, which can be a word, a commodity, or a movie, etc.
  • the nature of the embedding vector is to make the objects corresponding to the vectors with similar distances have similar meanings. For example, the distance between embedding (Avengers) and embedding (Iron Man) will be very close, but embedding (Avengers) and embedding (Gone with the Wind) distance will be farther.
  • the Google team published a tool for converting words into vector form, the word2vec (word to vector) algorithm.
  • word2vec word to vector
  • a unique multi-dimensional word vector can be mapped for each word included in the training corpus after the training, thereby simplifying the processing of the text content.
  • word2vec can be used in semantic analysis scenarios.
  • the word vectors corresponding to the two words can be determined.
  • word2vec provides an efficient bag-of-words and skip-gram implementation for computing vector words.
  • an embodiment of the present application proposes a method for recommending products.
  • the final network embedding vector not only contains the user's behavior information, but also contains the content information of the product itself, which can be used to a greater extent. It characterizes the nature of the commodity itself.
  • FIG. 1 a flowchart of an embodiment of a product recommendation method of the present application is shown.
  • the described method for recommending products includes the following steps:
  • Step S110 obtaining historical commodity browsing records of multiple users.
  • a commodity recommendation method runs on an electronic device (for example, a server/terminal device) on which it can receive transmissions from an external device through a wired connection or a wireless connection, or actively collect the user's history.
  • an electronic device for example, a server/terminal device
  • the above wireless connection methods may include but are not limited to 3G/4G connection, WiFi connection, Bluetooth connection, WiMAX connection, Zigbee connection, ultra wide band (UWB) connection, and other currently known or developed in the future. wireless connection method.
  • the above-mentioned commodities may cover various concepts, for example, may be actual commodities in a shopping website, or may be virtual commodities such as movies, music, videos, or games, which are not specifically limited in this embodiment.
  • the obtained historical commodity browsing records may specifically include commodity name or commodity ID information of the commodity, and attribute information of the commodity.
  • the attribute information of the product, or the label information of the product can be classified in various ways according to different classification standards and classification levels.
  • commodities can be classified according to attributes and characteristics such as usage, raw materials, production methods, chemical composition, and usage status, and there are no specific restrictions on the classification standards or classification levels corresponding to the attributes of specific commodities.
  • the acquisition of historical commodity browsing records may be collected by means of data buried points and browsing time thresholds. Trigger the operation of collecting the browsing records of the user this time.
  • the acquisition of historical commodity browsing records may also be for commodities under a certain commodity attribute. product.
  • the above-mentioned historical commodity browsing records may also be stored in a node of a blockchain.
  • the blockchain referred to in this application is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • Step S120 Generate a commodity node sequence and a commodity attribute node sequence according to the historical commodity browsing records.
  • the historical commodity browsing records of multiple users recorded in the system can be counted to determine the commodity node set included in the historical commodity browsing records.
  • the commodity node set is processed by the commodity recommendation method of the graph network, and the commodity node sequence is output.
  • the classification standard corresponds to a set of commodity attributes, and the commodity node sequence is converted into a commodity attribute node sequence.
  • Commodity node sequence for example: commodity 1-commodity 6-commodity 3...commodity N, commodity attribute node sequence, such as category 1-category 4-category 3...category N, where the length of each sequence, that is, the included nodes The number can be set by the user in advance.
  • step S130 combining the negative sampling strategy, the commodity node sequence and the commodity attribute node sequence are used as training corpus, respectively input into the word2vec model, and the commodity node embedding vector and commodity attribute node embedding vector are calculated.
  • the sequence of commodity nodes and the sequence of commodity attribute nodes are used as training corpus, respectively input
  • the product node embedding vector corresponding to each product and the corresponding at least one product attribute category embedding vector are finally calculated respectively.
  • the negative sampling strategy formulated in this application is as follows Table 1 shown:
  • the structure of the graph network represents the behavior information of the user clicking on the product, and the side information represents the attribute information of the product.
  • the number of commodity nodes is generally much more than that of category nodes.
  • ⁇ r % commodity nodes and (1- ⁇ r )% category nodes are selected to participate in its training, and because there are fewer category nodes, in order to fully train them without losing their value
  • this application selects ⁇ c % category nodes and (1- ⁇ c )% commodity nodes to participate in training, where ⁇ r > ⁇ c .
  • Step S140 Perform vector splicing or vector summation operations on the commodity node embedding vector and the commodity attribute node embedding vector to obtain a comprehensive embedding vector corresponding to each commodity.
  • the comprehensive embedding vector corresponding to any commodity node ri can be calculated as:
  • S150 Determine the target commodity accessed by the user.
  • the target commodity may be the commodity currently being browsed by the user, or may be the commodity in the user's recent browsing record.
  • S160 Determine a product corresponding to a comprehensive embedding vector similar to the comprehensive embedding vector of the target product as a recommended product determined for the target product.
  • the comprehensive embedding vector of the commodity calculated above not only contains the behavior information of the user, but also contains the content information of the commodity itself, which can reflect the property information of the commodity itself. Therefore, the relationship between the commodity and the target commodity can be determined by calculation. Commodities corresponding to the integrated embedding vector close to the integrated embedding vector are the recommended commodities determined for the target commodity.
  • using the commodity node embedding vector and commodity attribute node embedding vector to perform commodity recommendation at the same time may include: calculating the comprehensive embedding vector of the target commodity and the combination of all other commodities except the target commodity.
  • the similarity is the cosine similarity. The larger the value of the cosine similarity, the closer the corresponding recommended product is to the target product. Therefore, a preset threshold or parameter N can be set to filter and determine the target product. Recommended product.
  • using the commodity node embedding vector and commodity attribute node embedding vector to perform commodity recommendation at the same time may include: determining the target commodity attribute corresponding to the target commodity; calculating the comprehensive embedding vector of the target commodity and In the attributes of the target product, the similarity between the comprehensive embedding vectors of all other products except the target product; the similarity is greater than the preset threshold, or the top N are ranked in the order of the similarity.
  • the commodity corresponding to the comprehensive embedding vector is the recommended commodity determined for the target commodity.
  • the attribute of the target product corresponding to the target product is determined by setting, and only the comprehensive embedding vector corresponding to the target product and the attribute of the target product are calculated, and the sum of the comprehensive embedding vector of each product in all other products except the target product is calculated. The similarity between the two, and then determine the recommended products of the target product, reducing the calculation pressure.
  • a commodity node sequence and a commodity attribute node sequence are further generated according to the historical commodity browsing records. Then, combined with the negative sampling strategy, the commodity node sequence and commodity attribute node sequence are used as training corpus, and input into the word2vec model respectively, and the commodity node embedding vector and commodity attribute node embedding vector are calculated. Finally, perform vector splicing or vector summation operations on the commodity node embedding vector and commodity attribute node embedding vector to obtain the comprehensive embedding vector corresponding to each commodity.
  • the commodity recommendation method that integrates commodity attribute information proposed in this application can better combine the structural information and attribute information of the nodes in the graph network.
  • the commodity comprehensive embedding vector calculated by this application not only contains the user's behavior information, but also contains the content information of the commodity itself, which can reflect more information on the nature of the commodity itself. Therefore, through this application
  • the comprehensive embedding vector of the commodity calculated in the embodiment is used for commodity recommendation, which can improve the accuracy of commodity recommendation.
  • FIG. 2 is a schematic diagram of an embodiment of step S120 shown in FIG. 1 , which may include:
  • step S121 the historical commodity browsing records are counted, and a graph network of commodity nodes is constructed.
  • the commodity set r in the historical commodity browsing record is selected as a graph node of the graph network
  • the set c of attribute information in the historical commodity browsing record is selected as the edge information of the graph network to be constructed.
  • the final graph network is generated by counting the historical commodity browsing records of all users and selecting nodes whose edge weights are greater than the preset value w, where V represents the set of commodity nodes in the graph, E represents the set of edges between items in the graph, e ij ⁇ E, e ij >w, and w is a positive integer.
  • Step S122 constructing a commodity attribute dictionary according to the graph network, where the commodity attribute dictionary includes corresponding relationships between different commodity nodes and different commodity attributes.
  • the attribute information of the commodities in the graph network is counted, and the mapping dictionary D from the commodity set r to the category set c is generated according to the node information in the graph, namely:
  • each commodity may correspond to one or more categories.
  • Step S123 converting the graph network into a sequence of commodity nodes by random walk.
  • the deepwalk algorithm is used for reference, and the commodity node sequence is obtained according to the commodity node set included in the graph network in a random walk manner.
  • the Deepwalk algorithm is a graph-structured data mining algorithm that combines random walk and word2vec algorithms. Random walk refers to randomly taking a node in the graph network as the starting point to generate a sequence of commodity nodes with a preset random walk sequence length.
  • specifically converting the graph network into a sequence of commodity nodes through a random walk method may include: sequentially normalizing the out-degrees of commodity nodes in the graph network, and determining each commodity node The out-degree probability of ; according to the out-degree probability, a random walk method is used to generate the commodity node sequence.
  • graphs can be divided into directed graphs and undirected graphs. All edges of a directed graph have a direction, that is, a direction from vertex to vertex is determined; while all edges of an undirected graph are bidirectional, that is, two vertices connected by an undirected edge can reach each other.
  • an undirected graph can be thought of as consisting of two directed edges where all edges are positive and negative.
  • the degree of a vertex refers to the number of edges connected to the vertex.
  • the number of out-edges of a vertex is called the out-degree of the vertex
  • the number of in-edges of a vertex is called the in-degree of the vertex.
  • random walk is performed according to the out-degree probability obtained after out-degree normalization, and a graph network that better reflects the closeness of commodity nodes can be obtained.
  • steps S122 and S123 do not necessarily require an execution order.
  • Step S124 based on the commodity attribute dictionary, convert the commodity node sequence into a commodity attribute node sequence.
  • the commodity node sequence set S r in the above step S123 is converted into the corresponding category node sequence set S c , that is, for any commodity sequence L r in S r , Its corresponding category sequence L c is:
  • the traditional word2vec combined with negative sampling model architecture is adopted, and the structure information and attribute information of the commodity nodes of the network nodes are integrated on the basis of it, which has the advantages of simple model structure, less parameter quantity and short training time. , which can be better and widely used in real-time graph network scenarios.
  • FIG. 3 is a schematic diagram of another embodiment of a product recommendation method in the embodiment of the present application, which may include:
  • Step S310 obtaining historical commodity browsing records of multiple users.
  • step S310 is similar to step S110 shown in FIG. 1 , and details are not described here.
  • step S320 the historical commodity browsing records are counted, and a graph network of commodity nodes is constructed.
  • step S320 is similar to step S121 shown in FIG. 2 , and details are not described here.
  • Step S330 constructing corresponding commodity attribute dictionaries for different classification standards preset by the graph network respectively, wherein each classification standard is preset corresponding to a commodity attribute set.
  • classification standard 1 and classification standard 2 can be set, and classification standard 1 can include a set of book categories, such as ⁇ city, romance, martial arts, fantasy, Suspense, game, reasoning ⁇ , the classification standard 2 is a collection of book author names, such as ⁇ Lu Yao, Guo Jingming, Higashino Keigo, Natsume Soseki, Salinger, Shakespeare, ... ⁇ and so on.
  • classification standard 1 and classification standard 2 can include a set of book categories, such as ⁇ city, romance, martial arts, fantasy, Suspense, game, reasoning ⁇
  • the classification standard 2 is a collection of book author names, such as ⁇ Lu Yao, Guo Jingming, Higashino Keigo, Natsume Soseki, Salinger, Shakespeare, ... ⁇ and so on.
  • corresponding commodity attribute dictionaries under various classification systems can be constructed.
  • step S122 in FIG. 2 for the process of constructing the commodity attribute dictionary under each classification system, reference may be made to step S122 in FIG
  • Step S340 converting the graph network into a sequence of commodity nodes by random walk.
  • step S340 is similar to step S123 shown in FIG. 2 , and details are not described here.
  • Step S350 based on the commodity attribute dictionary, convert the commodity node sequence into commodity attribute node sequences corresponding to different classification standards.
  • step S330 when a commodity attribute dictionary with multiple classification systems is generated, when the commodity node sequence is converted into a commodity attribute node sequence, commodity attribute nodes corresponding to different classification standards are also converted together. sequence.
  • step S124 for the process of converting the commodity attribute node sequence to obtain the commodity attribute node sequence, reference may be made to step S124 in FIG. 2 , and details are not described here.
  • step S360 combining the negative sampling strategy, the commodity node sequence and the commodity attribute node sequence are used as training corpus, respectively input the word2vec model, and the commodity node embedding vector and commodity attribute node embedding vector under different classification standards are calculated.
  • step S350 when the commodity attribute sequences of multiple classification systems are obtained after conversion, the commodity attribute sequence is used as the training corpus to calculate the commodity attribute node embedding vector, and the corresponding commodity attribute sequences under different classification systems are also obtained.
  • Product attribute node embedding vector For the specific calculation process of obtaining the commodity node embedding vector and commodity attribute node embedding vector under different classification standards, reference may be made to step S130 in FIG. 1 , which will not be repeated here.
  • Step S370 Perform a vector weighted sum operation on the commodity node embedding vector and the commodity attribute node embedding vector according to the preset weights corresponding to the commodity node embedding vectors and the corresponding weights of different classification standards, to obtain each The comprehensive embedding vector corresponding to the product.
  • the embedding vector and the corresponding commodity node embedding vector under different classification standards set the corresponding weight value. Therefore, the vector weighted summation operation is performed on the commodity node embedding vector corresponding to each commodity and the corresponding commodity attribute node embedding vector under at least one classification standard to obtain the comprehensive embedding vector corresponding to each commodity.
  • the comprehensive embedding vector may be obtained by combining commodity node embedding vector A, classification standard 1 commodity attribute node embedding vector B, and classification standard 2 commodity attribute node embedding vector C.
  • the priority order of importance is B-A-C.
  • weight parameters of different sizes can be set for A, B, and C respectively, such as 0.8, 0.9., and 0.6. Therefore, when the comprehensive embedding vector corresponding to a certain book is combined and calculated, the A, B and C corresponding to the book are first determined respectively, and then the respective embedding vectors are multiplied with the corresponding weights, and the final comprehensive calculation of the book is obtained by weighted combination calculation. Embedding vector.
  • S390 Determine a product corresponding to a comprehensive embedding vector similar to the comprehensive embedding vector of the target product as a recommended product determined for the target product.
  • steps S380-S390 are similar to the foregoing steps S150-S160, and are not repeated here.
  • the embedded vector considering that the embedded vector is in actual application scenarios, the impact of the embedded vector of commodity attribute nodes corresponding to different classification standards on the comprehensive embedded vector of the product is different.
  • the embedding vector and the corresponding commodity node embedding vector under different classification standards set the corresponding weight value. Therefore, the comprehensive embedding vector corresponding to the product obtained by the final calculation can better represent the essential properties of the product in different application scenarios. Therefore, product recommendation can be performed by using the comprehensive embedding vector of the product calculated in the embodiment of the present application, which can improve the accuracy of product recommendation. .
  • the present application may be used in numerous general purpose or special purpose computer system environments or configurations. For example: personal computers, server computers, handheld or portable devices, tablet devices, multiprocessor systems, microprocessor-based systems, set-top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, including A distributed computing environment for any of the above systems or devices, and the like.
  • the application may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer.
  • program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types.
  • the application may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network.
  • program modules may be located in both local and remote computer storage media including storage devices.
  • the aforementioned storage medium may be a non-volatile storage medium such as a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM) or the like.
  • the present application provides an embodiment of a product recommendation device
  • the device embodiment corresponds to the product recommendation method embodiment shown in FIG. 1
  • the device Specifically, it can be applied to various electronic devices.
  • the apparatus 400 for determining a network embedding vector includes:
  • an obtaining unit 410 configured to obtain historical commodity browsing records of multiple users
  • generating unit 420 generating a sequence of commodity nodes and a sequence of commodity attribute nodes according to the historical commodity browsing records
  • the training unit 430 is configured to combine the negative sampling strategy, use the commodity node sequence and the commodity attribute node sequence as training corpus, respectively input the word2vec model, and calculate and obtain the commodity node embedding vector and the commodity attribute node embedding vector;
  • a computing unit 440 configured to perform a vector splicing or vector sum operation on the commodity node embedding vector and the commodity attribute node embedding vector to obtain a comprehensive embedding vector corresponding to each commodity;
  • a response unit 450 configured to determine the target commodity accessed by the user
  • the recommending unit 460 is configured to determine a product corresponding to a comprehensive embedding vector similar to the comprehensive embedding vector of the target product, which is a recommended product determined for the target product.
  • FIG. 4 is a schematic diagram of an embodiment of the generating unit 420, which may include:
  • the first construction subunit 421 is used to count the historical commodity browsing records and construct a graph network of commodity nodes
  • the second construction subunit 422 is configured to construct a commodity attribute dictionary according to the graph network, and the commodity attribute dictionary includes the corresponding relationship between different commodity nodes and different commodity attributes;
  • the first conversion subunit 423 is used to convert the graph network into a sequence of commodity nodes by random walk;
  • the second conversion subunit 424 is configured to convert the sequence of commodity nodes into a sequence of commodity attribute nodes based on the commodity attribute dictionary.
  • the first conversion subunit is specifically configured to perform normalization processing on the out-degrees of commodity nodes in the graph network in turn, and determine the out-degree probability of each commodity node;
  • the degree probability adopts a random walk method to generate the commodity node sequence.
  • the second constructing subunit is specifically configured to construct corresponding commodity attribute dictionaries according to different classification standards preset by the graph network, wherein each classification standard is preset to correspond to a commodity attribute set;
  • a second conversion subunit specifically configured to convert the commodity node sequence into commodity attribute node sequences corresponding to different classification standards based on the commodity attribute dictionary
  • the training unit 430 is specifically configured to combine the negative sampling strategy, use the commodity node sequence and the commodity attribute node sequence as training corpus, respectively input the word2vec model, and calculate the commodity node embedding vector and commodity attribute node embedding under different classification standards. vector;
  • the calculation unit 440 is specifically configured to perform a vector weighted sum operation on the commodity node embedding vector and the commodity attribute node embedding vector according to the preset weights corresponding to the commodity node embedding vectors and the corresponding weights of different classification standards , to obtain the comprehensive embedding vector corresponding to each commodity.
  • the first construction subunit 421 is specifically configured to count the historical commodity browsing records, and select the commodity collection in the historical commodity browsing records as the graph nodes of the graph network to be constructed; determine according to the statistical results Edge weights between each commodity node; select commodity nodes whose edge weights are greater than a preset threshold to construct a graph network of commodity nodes.
  • the apparatus 400 for determining the network embedding vector may further include:
  • the recommended commodity determination unit is used for determining the target commodity accessed by the user; determining the commodity corresponding to the integrated embedding vector close to the integrated embedding vector of the target commodity is the recommended commodity determined for the target commodity.
  • Recommended product determination unit including:
  • a first similarity calculation subunit configured to calculate the similarity between the comprehensive embedding vector of the target commodity and the comprehensive embedding vectors of all other commodities except the target commodity;
  • the first recommended commodity determination subunit is used to determine that the similarity is greater than a preset threshold, or the commodities corresponding to the top N comprehensive embedding vectors in the order of the similarity are determined for the target commodity.
  • N is a positive integer.
  • the recommended commodity determination unit includes:
  • a first commodity attribute determination subunit configured to determine the target commodity attribute corresponding to the target commodity
  • a second similarity calculation subunit configured to calculate the similarity between the comprehensive embedding vector of the target commodity and the comprehensive embedding vectors of all other commodities except the target commodity in the attributes of the target commodity;
  • the second recommended commodity determination subunit is used to take the similarity greater than the preset threshold, or the commodity corresponding to the top N comprehensive embedding vectors ranked according to the similarity degree, is determined for the target commodity
  • N is a positive integer.
  • the device 400 for determining the network embedding vector after acquiring the historical commodity browsing records of multiple users, the device 400 for determining the network embedding vector further generates commodity node sequences and commodity attribute node sequences according to the historical commodity browsing records. Then, combined with the negative sampling strategy, the commodity node sequence and commodity attribute node sequence are used as training corpus, and input into the word2vec model respectively, and the commodity node embedding vector and commodity attribute node embedding vector are calculated. Finally, perform vector splicing or vector summation operations on the commodity node embedding vector and commodity attribute node embedding vector to obtain the comprehensive embedding vector corresponding to each commodity.
  • the commodity recommendation method that integrates commodity attribute information proposed in this application can better combine the structural information and attribute information of the nodes in the graph network.
  • the commodity comprehensive embedding vector calculated by this application not only contains the user's behavior information, but also contains the content information of the commodity itself, which can reflect the property information of the commodity itself.
  • FIG. 6 is a block diagram of the basic structure of a computer device according to this embodiment.
  • the computer device 6 includes a memory 601 , a processor 602 , and a network interface 603 that communicate with each other through a system bus. It should be pointed out that only the computer device 6 with components 601-603 is shown in the figure, but it should be understood that it is not required to implement all of the shown components, and more or less components may be implemented instead.
  • the computer device here is a device that can automatically perform numerical calculation and/or information processing according to pre-set or stored instructions, and its hardware includes but is not limited to microprocessors, special-purpose Integrated circuit (Application Specific Integrated Circuit, ASIC), programmable gate array (Field-Programmable Gate Array, FPGA), digital processor (Digital Signal Processor, DSP), embedded equipment, etc.
  • ASIC Application Specific Integrated Circuit
  • FPGA Field-Programmable Gate Array
  • DSP Digital Signal Processor
  • embedded equipment etc.
  • the computer equipment may be a desktop computer, a notebook computer, a palmtop computer, a cloud server and other computing equipment.
  • the computer device can perform human-computer interaction with the user through a keyboard, a mouse, a remote control, a touch pad or a voice control device.
  • the memory 601 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static Random Access Memory (SRAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), Programmable Read Only Memory (PROM), Magnetic Memory, Magnetic Disk, Optical Disk, etc.
  • the memory 601 may be an internal storage unit of the computer device 6 , such as a hard disk or a memory of the computer device 6 .
  • the memory 601 may also be an external storage device of the computer device 6, such as a plug-in hard disk, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, flash memory card (Flash Card), etc.
  • the memory 601 may also include both the internal storage unit of the computer device 6 and its external storage device.
  • the memory 601 is generally used to store the operating system and various application software installed on the computer device 6 , such as computer-readable instructions of the above-mentioned method for recommending products.
  • the memory 601 can also be used to temporarily store various types of data that have been output or will be output.
  • the processor 602 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips in some embodiments.
  • the processor 602 is typically used to control the overall operation of the computer device 6 .
  • the processor 602 is configured to execute computer-readable instructions stored in the memory 601 or process data, for example, computer-readable instructions for executing the method for recommending a commodity.
  • the network interface 603 may include a wireless network interface or a wired network interface, and the network interface 603 is generally used to establish a communication connection between the computer device 6 and other electronic devices.
  • the present application also provides another implementation manner, that is, to provide a computer-readable storage medium
  • the computer-readable storage medium may be non-volatile or volatile
  • the computer-readable storage medium stores Computer-readable instructions, executable by at least one processor, to cause the at least one processor to perform the steps of a method for recommending an item as described above.
  • the methods of the above embodiments can be implemented by means of software plus a necessary general hardware platform, and of course hardware can also be used, but in many cases the former is better implementation.
  • the technical solution of the present application can be embodied in the form of a software product in essence or in a part that contributes to the prior art, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, CD-ROM), including several instructions to make a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the commodity recommendation method described in the various embodiments of this application.
  • a storage medium such as ROM/RAM, magnetic disk, CD-ROM

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Economics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Development Economics (AREA)
  • Evolutionary Computation (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Des modes de réalisation de la présente demande se rapportent au domaine de l'intelligence artificielle, et concernent un procédé de recommandation de marchandises. Le procédé comprend principalement : après l'obtention d'un enregistrement historique de navigation de marchandises, la détermination séparée d'un vecteur d'incorporation de nœud de marchandise et d'un nœud d'incorporation de nœud d'attribut de marchandise, puis la fusion ou l'épissage des deux vecteurs d'incorporation pour obtenir un vecteur d'incorporation complet d'un nœud de marchandise, et la réalisation d'une recommandation de marchandises à l'aide du vecteur d'incorporation complet. Comme le vecteur d'incorporation complet représente les informations de comportement d'une marchandise et les informations d'attribut de la marchandise, par comparaison avec l'état de la technique, les informations représentées sont plus riches, et par conséquent, un effet de recommandation de marchandises plus précis peut être obtenu. La présente demande concerne en outre un appareil de recommandation de marchandises, un dispositif informatique et un support de stockage. De plus, la présente demande concerne en outre une technologie de chaînes de blocs, et l'enregistrement historique de navigation de marchandises d'un utilisateur peut être stocké dans une chaîne de blocs, ce qui permet d'améliorer la sécurité et la stabilité du stockage de l'enregistrement historique de navigation de marchandises.
PCT/CN2021/082934 2020-12-18 2021-03-25 Procédé de recommandation de marchandises et dispositif associé correspondant WO2022126901A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011504942.5 2020-12-18
CN202011504942.5A CN112633973B (zh) 2020-12-18 一种商品推荐方法及其相关设备

Publications (1)

Publication Number Publication Date
WO2022126901A1 true WO2022126901A1 (fr) 2022-06-23

Family

ID=75317141

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/082934 WO2022126901A1 (fr) 2020-12-18 2021-03-25 Procédé de recommandation de marchandises et dispositif associé correspondant

Country Status (1)

Country Link
WO (1) WO2022126901A1 (fr)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116308684A (zh) * 2023-05-18 2023-06-23 和元达信息科技有限公司 一种网购平台店铺信息推送方法及系统
CN117611245A (zh) * 2023-12-14 2024-02-27 浙江博观瑞思科技有限公司 用于电商运营活动策划的数据分析管理系统及方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190295124A1 (en) * 2018-03-26 2019-09-26 DoorDash, Inc. Dynamic predictive similarity grouping based on vectorization of merchant data
CN111639989A (zh) * 2020-04-28 2020-09-08 上海风秩科技有限公司 商品推荐方法和可读存储介质
CN111695960A (zh) * 2019-03-12 2020-09-22 阿里巴巴集团控股有限公司 一种对象推荐系统、方法、电子设备及存储介质
CN111815403A (zh) * 2020-06-19 2020-10-23 北京石油化工学院 商品推荐方法、装置及终端设备
CN112633973A (zh) * 2020-12-18 2021-04-09 平安科技(深圳)有限公司 一种商品推荐方法及其相关设备

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190295124A1 (en) * 2018-03-26 2019-09-26 DoorDash, Inc. Dynamic predictive similarity grouping based on vectorization of merchant data
CN111695960A (zh) * 2019-03-12 2020-09-22 阿里巴巴集团控股有限公司 一种对象推荐系统、方法、电子设备及存储介质
CN111639989A (zh) * 2020-04-28 2020-09-08 上海风秩科技有限公司 商品推荐方法和可读存储介质
CN111815403A (zh) * 2020-06-19 2020-10-23 北京石油化工学院 商品推荐方法、装置及终端设备
CN112633973A (zh) * 2020-12-18 2021-04-09 平安科技(深圳)有限公司 一种商品推荐方法及其相关设备

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116308684A (zh) * 2023-05-18 2023-06-23 和元达信息科技有限公司 一种网购平台店铺信息推送方法及系统
CN116308684B (zh) * 2023-05-18 2023-08-11 和元达信息科技有限公司 一种网购平台店铺信息推送方法及系统
CN117611245A (zh) * 2023-12-14 2024-02-27 浙江博观瑞思科技有限公司 用于电商运营活动策划的数据分析管理系统及方法
CN117611245B (zh) * 2023-12-14 2024-05-31 浙江博观瑞思科技有限公司 用于电商运营活动策划的数据分析管理系统及方法

Also Published As

Publication number Publication date
CN112633973A (zh) 2021-04-09

Similar Documents

Publication Publication Date Title
CN112148987B (zh) 基于目标对象活跃度的消息推送方法及相关设备
JP5615931B2 (ja) クラスタ化方法およびシステム
Zhu et al. Heterogeneous hypergraph embedding for document recommendation
US9122989B1 (en) Analyzing website content or attributes and predicting popularity
Xu et al. Improving user recommendation by extracting social topics and interest topics of users in uni-directional social networks
Xu et al. Interdisciplinary topics of information science: a study based on the terms interdisciplinarity index series
WO2019041521A1 (fr) Appareil et procédé d'extraction de mot-clé d'utilisateur et support de mémoire lisible par ordinateur
CN103678672A (zh) 一种信息推荐方法
CN107895277A (zh) 在应用程序中推送贷款广告的方法、电子装置及介质
JP2017142796A (ja) 情報の特定及び抽出
CN104598539B (zh) 一种互联网事件热度计算方法及终端
WO2022126901A1 (fr) Procédé de recommandation de marchandises et dispositif associé correspondant
WO2021175021A1 (fr) Procédé et appareil de poussée de produit, dispositif informatique et support d'enregistrement
CN114138985B (zh) 文本数据处理的方法、装置、计算机设备以及存储介质
KR20220034701A (ko) 태그 기반 콘텐츠 추천 방법 및 이를 수행하는 서버
Pan et al. Recommendation of crowdsourcing tasks based on word2vec semantic tags
CN112131261B (zh) 基于社区网络的社区查询方法、装置和计算机设备
Liu et al. Self-adapted mixture distance measure for clustering uncertain data
CN110162711A (zh) 一种基于网络嵌入法的资源智能推荐方法及系统
Wang et al. ST-SAGE: A spatial-temporal sparse additive generative model for spatial item recommendation
Shaowen et al. An improved collaborative filtering recommendation algorithm
Chen et al. Inferring tag co-occurrence relationship across heterogeneous social networks
JP2023554210A (ja) インテリジェント推奨用のソートモデルトレーニング方法及び装置、インテリジェント推奨方法及び装置、電子機器、記憶媒体、並びにコンピュータプログラム
CN115099875A (zh) 基于决策树模型的数据分类方法及相关设备
CN112633973B (zh) 一种商品推荐方法及其相关设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21904841

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21904841

Country of ref document: EP

Kind code of ref document: A1