WO2022218181A1 - Information recommendation method and device and computer storable medium - Google Patents

Information recommendation method and device and computer storable medium Download PDF

Info

Publication number
WO2022218181A1
WO2022218181A1 PCT/CN2022/085053 CN2022085053W WO2022218181A1 WO 2022218181 A1 WO2022218181 A1 WO 2022218181A1 CN 2022085053 W CN2022085053 W CN 2022085053W WO 2022218181 A1 WO2022218181 A1 WO 2022218181A1
Authority
WO
WIPO (PCT)
Prior art keywords
keyword
item
target
order
goods
Prior art date
Application number
PCT/CN2022/085053
Other languages
French (fr)
Chinese (zh)
Inventor
张青青
李山林
余聪
Original Assignee
北京沃东天骏信息技术有限公司
北京京东世纪贸易有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京沃东天骏信息技术有限公司, 北京京东世纪贸易有限公司 filed Critical 北京沃东天骏信息技术有限公司
Publication of WO2022218181A1 publication Critical patent/WO2022218181A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2477Temporal data queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Definitions

  • the present application is based on the CN application number 202110405818.1 and the filing date is April 15, 2021, and claims its priority.
  • the disclosure of the CN application is hereby incorporated into the present application as a whole.
  • the present disclosure relates to the field of computer technology, and in particular, to a method and apparatus for recommending information, and a computer-storable medium.
  • an information recommendation method comprising: determining at least one target keyword and its target vector according to the keywords to be processed, and the target keyword and the target vector are in one-to-one correspondence; for each target key word, obtain a plurality of reference keywords belonging to the same keyword category as each target keyword from the historical data corresponding to the object related to the keyword to be processed; multiple reference vectors of Keyword related information.
  • determining the comprehensive vector corresponding to the object includes: for the object, determining conversion data corresponding to each reference keyword according to corresponding historical data; according to the plurality of reference vectors and Its corresponding transformation data, the integrated vector is determined.
  • determining the comprehensive vector according to the multiple reference vectors and their corresponding conversion data includes: determining a corresponding conversion weight according to the conversion data corresponding to each reference keyword; using multiple conversion weights , performing a weighting operation on the multiple reference vectors to obtain the integrated vector.
  • the object is a user with goods
  • the historical data is historical data with goods
  • the historical data with goods of the user with goods includes daily data in a historical time period
  • the daily data is
  • the product data includes at least one of the click volume, order volume, click follower volume and online follower volume of the carrying product corresponding to each reference keyword in one day
  • the click follower volume is the user carrying the product within one day.
  • the number of fans who clicked on the goods with the goods among the online fans of the The click volume is positively correlated and negatively correlated with the online fan volume
  • the conversion rate is positively correlated with the order volume and negatively correlated with the click volume
  • the fan activity rate is the click volume and the online fan volume ratio between.
  • the click-through rate is positively correlated with the average click volume and negatively correlated with the average online fan volume
  • the average click volume is the average click volume of the multiple users with goods.
  • the average of clicks, the average online fans is the average of online fans of the multiple users with goods; the conversion rate is positively correlated with the average order volume and negatively correlated with the average clicks, so
  • the average order quantity is the average of the order quantities of the multiple users with goods.
  • determining the corresponding conversion weight includes: determining the average of the click-through rate, the conversion rate, and the fan activity rate as the conversion weight.
  • the historical data corresponding to the object includes at least one group of order data derived from the object, and each group of order data includes item identifiers of a plurality of items, from the data corresponding to the object related to the keyword to be processed.
  • acquiring a plurality of reference keywords belonging to the same keyword category as each target keyword includes: determining a plurality of item groups according to each group of order data derived from the object, and each item group includes two Item identifiers of different items; according to each group of order data derived from the object, determine the association weight value between the two items in each item group corresponding to the item identifier.
  • the item identifiers in the multiple item groups are used as vertices, and the associated weights are used as edges to construct a graph model; using the graph model, from the historical data corresponding to the object, obtain the Describe multiple reference keywords that each target keyword belongs to the same keyword category.
  • the order data further includes other order parameters corresponding to the item identifier
  • the graph model is used to obtain the same key as each target keyword from historical data corresponding to the object.
  • the multiple reference keywords of the word category include: using a random walk algorithm to randomly walk the graph model until the walking stop condition is met, and at least one item identifier is obtained as a reference item identifier; for each target Keywords, from each group of order data derived from the object, select the item ID or other order parameters that belong to the same keyword category as each target keyword and correspond to the reference item ID, as the reference keyword .
  • the order data further includes an order date
  • determining an association weight value between two items in each item group corresponding to the item identifiers includes: The difference between the order dates determines the association weight value between the two items in each item group corresponding to the item identifier, and the association weight value is negatively correlated with the difference between the order dates.
  • the order data further includes an order date
  • determining a plurality of groups of goods with goods includes: according to each group of order data, determining at least one set of items, each set of goods Including multiple item IDs arranged in order of order, the difference between the latest order date and the earliest order date corresponding to multiple item IDs in the same item set is less than the date threshold; for each item set, the Two adjacent item identifiers are divided into one item group to obtain the plurality of item groups.
  • recommending information related to the to-be-processed keyword includes: when the object is a plurality of objects, selecting an object to be recommended from the plurality of objects, and recommending the to-be-recommended object information about the object.
  • the at least one target keyword includes multiple target keywords
  • recommending information related to the keyword to be processed includes: when the object is one In the case of the target keyword, the keyword to be recommended is selected from the multiple target keywords, and the related information of the keyword to be recommended is recommended.
  • determining at least one target keyword according to the keyword to be processed includes: converting the keyword to be processed into at least one target keyword according to a keyword category corresponding to the keyword to be processed.
  • the keyword categories include user identification categories, item identification categories, brand identification categories, store identification categories, and categories.
  • an information recommendation device comprising: a first determination module configured to determine at least one target keyword and its target vector according to the keywords to be processed, the target keyword and the target vector one by one Corresponding; the acquisition module is configured to, for each target keyword, acquire a plurality of references belonging to the same keyword category with each target keyword from historical data corresponding to the object related to the keyword to be processed keywords; a second determining module, configured to determine a comprehensive vector corresponding to the object according to a plurality of reference vectors corresponding to the plurality of reference keywords, and the reference keywords and the reference vectors are in one-to-one correspondence; a recommendation module, is configured to recommend information related to the keyword to be processed according to the similarity between the integrated vector and each of the target vectors.
  • an information recommendation apparatus comprising: a memory; and a processor coupled to the memory, the processor configured to execute any one of the above based on instructions stored in the memory The information recommendation method described in the embodiment.
  • a computer-storable medium on which computer program instructions are stored, and when the instructions are executed by a processor, implement the information recommendation method described in any of the foregoing embodiments.
  • FIG. 1 is a flowchart illustrating an information recommendation method according to some embodiments of the present disclosure
  • FIG. 2 is a flowchart illustrating obtaining a plurality of reference keywords belonging to the same keyword category as each target keyword according to some embodiments of the present disclosure
  • FIG. 3 is a flow diagram illustrating determining a synthesis vector corresponding to an object according to some embodiments of the present disclosure
  • FIG. 4 is a block diagram illustrating an information recommendation apparatus according to some embodiments of the present disclosure.
  • FIG. 5 is a block diagram illustrating an information recommendation apparatus according to other embodiments of the present disclosure.
  • FIG. 6 is a block diagram illustrating a computer system for implementing some embodiments of the present disclosure.
  • the matching degree between the user carrying the goods and the goods to be carried is not considered, so the accuracy of the recommendation is poor.
  • the present disclosure proposes an information recommendation method, which can improve the accuracy of recommendation.
  • FIG. 1 is a flowchart illustrating an information recommendation method according to some embodiments of the present disclosure.
  • the information recommendation method includes: step S10, determining at least one target keyword and its target vector according to the keywords to be processed; step S20, for each target keyword, from the object related to the keyword to be processed In the corresponding historical data, obtain a plurality of reference keywords belonging to the same keyword category as each target keyword; Step S30, according to a plurality of reference vectors corresponding to the plurality of reference keywords, determine a comprehensive vector corresponding to the object; Step S40, recommend information related to the keyword to be processed according to the similarity between the comprehensive vector and each target vector.
  • the information recommendation method is performed by an information recommendation apparatus.
  • the information recommendation method can be applied to the scene of live broadcast with goods, and can also be applied to the scene of advertising media selection.
  • step S10 at least one target keyword and its target vector are determined according to the keywords to be processed.
  • Target keywords and target vectors correspond one-to-one.
  • the keyword to be processed may be a search term input by a search user.
  • the keywords to be processed include, but are not limited to, the item identification of the carried item, the user identification of the carried user, the brand identification of the carried item, and the category identification of the carried item.
  • the item identifier can be an item ID (Identity Document, identification number) or an item name
  • the user identifier can be a user ID or a user name
  • the brand identifier can be a brand ID or a brand name
  • the category identifier can be a category ID or category name.
  • the keyword to be processed is converted into at least one target keyword according to the keyword category corresponding to the keyword to be processed.
  • the keyword category includes but is not limited to user identification category, item identification category, brand identification category, store identification category, and category category.
  • the keyword category corresponding to the keyword to be processed is the item identification category
  • the keyword to be processed itself is used as the target keyword.
  • the keyword to be processed is converted into the item identification, category identification and brand identification corresponding to the user identification corresponding to the keyword to be processed. at least one of.
  • one user ID corresponds to multiple item IDs, multiple category IDs, or multiple brand IDs
  • the target vector corresponding to the target keyword formed by multiple item IDs may be multiple IDs of multiple item IDs Average or weighted average of vectors.
  • target vectors corresponding to target keywords composed of multiple category identifiers or multiple brand identifiers can be determined.
  • the keyword to be processed may also be converted into item IDs of multiple candidate items.
  • step S20 for each target keyword, multiple reference keywords belonging to the same keyword category as each target keyword are acquired from historical data corresponding to objects related to the keyword to be processed.
  • step S20 is implemented in the manner shown in FIG. 2 .
  • FIG. 2 is a flowchart illustrating obtaining a plurality of reference keywords belonging to the same keyword category as each target keyword according to some embodiments of the present disclosure.
  • acquiring a plurality of reference keywords belonging to the same keyword category as each target keyword includes steps S21 to S24.
  • the historical data corresponding to the above-mentioned object includes at least one group of order data derived from the object.
  • Each set of order data includes item identifiers for multiple items.
  • each group of order data derived from the object is the order data of each fan of the user with goods.
  • the multiple items included in each set of order data are multiple carry items.
  • a plurality of item groups are determined according to each group of order data derived from the object.
  • Each item group includes item IDs for two different items.
  • the order data including the order date as an example
  • each group of order data at least one item set is determined, and each item set includes multiple item identifiers arranged in the order of the order, and the multiple item identifiers in the same item set correspond to
  • the difference between the latest order date and the earliest order date is less than a date threshold (eg, 3 days).
  • a date threshold eg, 3 days.
  • order data of fan A can be divided into order set B and order set C.
  • the difference between the latest order date and the earliest order date in a single order collection is less than 3 days.
  • step S22 according to each group of order data derived from the object, determine the association weight value between the two items corresponding to the item identifiers in each item group. The larger the association weight value, the stronger the correlation between the two items.
  • the order data also includes the order date. According to the difference between the order dates corresponding to the two item identifiers in each item group, the association weight value between the two items corresponding to the item identifiers in each item group is determined. The association weight value is negatively correlated with the difference between the order dates.
  • the associated weight value is D is the difference between order dates (later order dates minus earlier order dates), and M is a preset positive integer. For example, M is set to 4. This is because the difference between order dates will not exceed 4.
  • step S24 a graph model is used to acquire a plurality of reference keywords belonging to the same keyword category as each target keyword from the historical data corresponding to the object.
  • the order data also includes other order parameters corresponding to the item identification.
  • the above step S24 can be implemented in the following manner.
  • the random walk algorithm uses the random walk algorithm to randomly walk the graph model until the stop condition of the walk is satisfied, and at least one item identifier is obtained as a reference item identifier.
  • the parameters of the random walk include the maximum step size of the walk (eg, 13), and the number of traversals per vertex (eg, 4).
  • the stop condition of the walk is the total length of the walk.
  • the random walk algorithm is an Alias sampling based random walk algorithm.
  • the reference keyword is also the item ID (ie, the reference item ID).
  • the reference keyword is the brand identification corresponding to the reference item identification.
  • the reference keyword is the category identification corresponding to the reference item identification.
  • the correlation between items can be constructed, so that the reference keyword most related to the target keyword can be determined by using the correlation between the items, and the recommendation can be further improved. accuracy.
  • a comprehensive vector corresponding to an object is determined according to a plurality of reference vectors corresponding to a plurality of reference keywords. Taking the object as a user with goods as an example, each user with goods corresponds to a comprehensive vector.
  • step S30 is implemented in the manner shown in FIG. 3 .
  • Figure 3 is a flow diagram illustrating determining a synthesis vector corresponding to an object in accordance with some embodiments of the present disclosure.
  • determining the integrated vector corresponding to the object includes steps S31 to S32.
  • step S31 the conversion data corresponding to each reference keyword is determined according to the historical data corresponding to the object.
  • Conversion data is data that measures the ability to generate specific contributions (eg, traffic contributions such as click-through rate, conversion rate, fan activity rate, etc.) based on each reference keyword.
  • the historical data is historical delivery data
  • the transformation data is delivery conversion data.
  • the object is an advertising medium
  • the historical data is historical advertising data
  • the conversion data is advertising conversion data.
  • a comprehensive vector is determined according to a plurality of reference vectors and their corresponding transformation data.
  • the corresponding conversion weight is determined according to the conversion data corresponding to each reference keyword. Using multiple transformation weights, weighting operations are performed on multiple reference vectors to obtain a comprehensive vector. By converting the conversion data into conversion weights and performing weighting operations on multiple reference vectors, the conversion ability of the object can be comprehensively evaluated, and the accuracy of the recommendation can be further improved.
  • the transformation weight is the weight value of each reference vector.
  • the historical delivery data of the user with goods includes the daily delivery data in the historical time period.
  • the daily delivery data includes at least one of the click volume, order volume, click followers volume and online followers volume of the delivered product corresponding to each reference keyword in one day.
  • the number of clicked followers refers to the number of followers who clicked on the goods with goods among the online followers of the users who brought goods in one day.
  • the conversion data includes the daily click through rate (Click Through Rate, CTR), conversion rate (Conversion Rate, CVR) and fan activity rate in the historical time period. at least one of.
  • Click-through rate is positively correlated with clicks and negatively correlated with online followers.
  • Conversion rate is positively correlated with order volume and negatively correlated with click volume.
  • the fan activity rate is the ratio between the number of clicked fans and the number of online fans.
  • the average value of the click-through rate, conversion rate, and fan activity rate can be determined as the conversion weight corresponding to the reference keyword.
  • the click-through rate is positively correlated with the average click volume and negatively correlated with the average online fan count.
  • the average click volume is the average of the click volume of multiple users with goods.
  • the average number of online fans is the average of the online fans of multiple users with goods.
  • Conversion rate is positively correlated with average order volume and negatively correlated with average click volume.
  • the average order size is the average of the order sizes of multiple users with goods.
  • the conversion data is modified by using the average online followers, average clicks and average orders corresponding to multiple users with goods, that is, using the Bayesian smoothing idea to modify the conversion data, which can make the conversion data closer to the real value , so as to further improve the accuracy of the recommendation.
  • step S40 information related to the keywords to be processed is recommended according to the similarity between the comprehensive vector and each target vector.
  • the similarity is the vector inner product or cosine similarity between vectors.
  • the objects to be recommended are selected from the multiple objects, and the relevant information of the objects to be recommended is recommended.
  • a user to be recommended is selected from multiple users with goods, and user information of the user to be recommended is recommended.
  • a user with goods matching the keyword to be processed is selected from multiple users with goods, so that the selected user with goods can carry goods for the item corresponding to the keyword to be processed.
  • the user information of the user with goods corresponding to the comprehensive vector with the largest sum of similarity of each target vector may be recommended.
  • target keywords When the object is one object, there are plural target keywords. From a plurality of target keywords, a keyword to be recommended is selected, and relevant information of the keyword to be recommended is recommended. For example, from a plurality of candidate items, an item to be recommended is selected, and item information of the item to be recommended is recommended. In this case, a matching item to be recommended is selected for a user with goods, so that the user with goods can carry goods for the item to be recommended. In some embodiments, the item information of the candidate item corresponding to the target vector with the greatest similarity with the comprehensive vector may be recommended.
  • the comprehensive vector of the object is determined by converting the keywords to be processed into the target keywords, and using the reference vectors of multiple reference keywords belonging to the same keyword category as the target keyword obtained based on historical data.
  • the recommendation is based on the similarity between the comprehensive vector and the target vector, and the matching degree between the object and the keyword to be processed is comprehensively considered, which can improve the accuracy of the recommendation.
  • FIG. 4 is a block diagram illustrating an information recommendation apparatus according to some embodiments of the present disclosure.
  • the information recommendation apparatus 4 includes a first determination module 41 , an acquisition module 42 , a second determination module 43 and a recommendation module 44 .
  • the first determination module 41 is configured to determine at least one target keyword and its target vector according to the keywords to be processed, for example, to perform step S10 as shown in FIG. 1 .
  • Target keywords and target vectors correspond one-to-one.
  • the acquisition module 42 is configured to, for each target keyword, acquire a plurality of reference keywords belonging to the same keyword category with each target keyword from the historical data corresponding to the object related to the keyword to be processed, for example, perform the following steps: Step S20 shown in FIG. 1 .
  • the second determining module 43 is configured to determine a comprehensive vector corresponding to the object according to the plurality of reference vectors corresponding to the plurality of reference keywords, for example, performing step S30 shown in FIG. 1 for one-to-one correspondence between the reference keywords and the reference vectors.
  • the recommendation module 44 is configured to recommend information related to the keyword to be processed according to the similarity between the integrated vector and each target vector, for example, to perform step S40 as shown in FIG. 1 .
  • FIG. 5 is a block diagram illustrating an information recommendation apparatus according to other embodiments of the present disclosure.
  • the information recommendation apparatus 5 includes a memory 51 ; and a processor 52 coupled to the memory 51 .
  • the memory 51 is used for storing instructions for executing the corresponding embodiments of the information recommendation method.
  • the processor 52 is configured to execute the information recommendation method in any of some embodiments of the present disclosure based on the instructions stored in the memory 51 .
  • FIG. 6 is a block diagram illustrating a computer system for implementing some embodiments of the present disclosure.
  • Computer system 60 may take the form of a general-purpose computing device.
  • Computer system 60 includes memory 610, processor 620, and bus 600 that connects various system components.
  • Memory 610 may include, for example, system memory, non-volatile storage media, and the like.
  • the system memory stores, for example, an operating system, an application program, a boot loader (Boot Loader), and other programs.
  • System memory may include volatile storage media such as random access memory (RAM) and/or cache memory.
  • RAM random access memory
  • the non-volatile storage medium stores, for example, instructions for performing the corresponding embodiments of at least one of the information recommendation methods.
  • Non-volatile storage media include, but are not limited to, magnetic disk memory, optical memory, flash memory, and the like.
  • Processor 620 may be implemented as a general purpose processor, digital signal processor (DSP), application specific integrated circuit (ASIC), field programmable gate array (FPGA) or other programmable logic device, discrete hardware components such as discrete gates or transistors.
  • each module such as the judging module and the determining module can be implemented by a central processing unit (CPU) running the instructions in the memory for executing the corresponding steps, or can be implemented by a dedicated circuit for executing the corresponding steps.
  • CPU central processing unit
  • bus 600 may use any of a variety of bus structures.
  • bus structures include, but are not limited to, Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) bus, Peripheral Component Interconnect (PCI) bus.
  • ISA Industry Standard Architecture
  • MCA Micro Channel Architecture
  • PCI Peripheral Component Interconnect
  • the computer system 60 may also include an input-output interface 630, a network interface 640, a storage interface 650, and the like.
  • the interfaces 630 , 640 , 650 and the memory 610 and the processor 620 can be connected through the bus 600 .
  • the input and output interface 630 may provide a connection interface for input and output devices such as a monitor, a mouse, and a keyboard.
  • Network interface 640 provides a connection interface for various networked devices.
  • the storage interface 650 provides a connection interface for external storage devices such as a floppy disk, a U disk, and an SD card.
  • These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer or other programmable device to produce a machine such that execution of the instructions by the processor produces one or more blocks in the flowchart and/or block diagrams device with the specified function.
  • Also stored in computer readable memory are these computer readable program instructions, which cause the computer to operate in a particular manner resulting in an article of manufacture including implementing the functions specified in one or more blocks of the flowchart and/or block diagrams instruction.
  • the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects.
  • the accuracy of recommendation can be improved.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Provided are an information recommendation method and device and a computer storable medium, relating to the technical field of computers. The information recommendation method comprises: determining at least one target keyword and a target vector thereof according to a keyword to be processed (S10); for each target keyword, obtaining, from historical data corresponding to an object related to said keyword, multiple reference keywords that belong to the same keyword category as each target keyword (S20); determining, according to multiple reference vectors corresponding to the multiple reference keywords, a comprehensive vector corresponding to the object (S30); and recommending, according to the similarity between the comprehensive vector and each target vector, information related to said keyword (S40). According to the present invention, the accuracy of recommendation can be improved.

Description

信息推荐方法及装置、计算机可存储介质Information recommendation method and device, and computer storable medium
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请是以CN申请号为202110405818.1,申请日为2021年4月15日的申请为基础,并主张其优先权,该CN申请的公开内容在此作为整体引入本申请中。The present application is based on the CN application number 202110405818.1 and the filing date is April 15, 2021, and claims its priority. The disclosure of the CN application is hereby incorporated into the present application as a whole.
技术领域technical field
本公开涉及计算机技术领域,特别涉及信息推荐方法及装置、计算机可存储介质。The present disclosure relates to the field of computer technology, and in particular, to a method and apparatus for recommending information, and a computer-storable medium.
背景技术Background technique
随着直播行业的快速发展,如何准确筛选带货用户进行直播带货成为推动直播行业发展的关键。With the rapid development of the live broadcast industry, how to accurately screen users with goods for live broadcast has become the key to promoting the development of the live broadcast industry.
相关技术中,针对一待带货物品,通过统计不同带货用户的带货频次、带货销量,为商家推荐带货频次高且带货销量大的带货用户。In the related art, for an item to be delivered, by counting the frequency of delivery and the sales volume of different delivery users, a seller with a high frequency of delivery and a large volume of delivery is recommended to the merchant.
发明内容SUMMARY OF THE INVENTION
根据本公开的第一方面,提供了一种信息推荐方法,包括:根据待处理关键词,确定至少一个目标关键词及其目标向量,目标关键词与目标向量一一对应;对于每个目标关键词,从与所述待处理关键词相关的对象对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词;根据与所述多个参考关键词对应的多个参考向量,确定与所述对象对应的综合向量,参考关键词与参考向量一一对应;根据所述综合向量与所述每个目标向量之间的相似度,推荐与所述待处理关键词相关的信息。According to a first aspect of the present disclosure, an information recommendation method is provided, comprising: determining at least one target keyword and its target vector according to the keywords to be processed, and the target keyword and the target vector are in one-to-one correspondence; for each target key word, obtain a plurality of reference keywords belonging to the same keyword category as each target keyword from the historical data corresponding to the object related to the keyword to be processed; multiple reference vectors of Keyword related information.
在一些实施例中,确定与所述对象对应的综合向量包括:对于所述对象,根据对应的历史数据,确定与所述每个参考关键词对应的转化数据;根据所述多个参考向量及其对应的转化数据,确定所述综合向量。In some embodiments, determining the comprehensive vector corresponding to the object includes: for the object, determining conversion data corresponding to each reference keyword according to corresponding historical data; according to the plurality of reference vectors and Its corresponding transformation data, the integrated vector is determined.
在一些实施例中,根据所述多个参考向量及其对应的转化数据,确定所述综合向量包括:根据与每个参考关键词对应的转化数据,确定相应的转化权重;利用多个转化权重,对所述多个参考向量进行加权操作,得到所述综合向量。In some embodiments, determining the comprehensive vector according to the multiple reference vectors and their corresponding conversion data includes: determining a corresponding conversion weight according to the conversion data corresponding to each reference keyword; using multiple conversion weights , performing a weighting operation on the multiple reference vectors to obtain the integrated vector.
在一些实施例中,所述对象为带货用户,所述历史数据为历史带货数据,所述带货用户的历史带货数据包括历史时间段内每天的带货数据,所述每天的带货数据包括与每个参考关键词对应的带货物品在一天内的点击量、订单量、点击粉丝量和在线粉丝量中的至少一种,所述点击粉丝量为一天内所述带货用户的在线粉丝中点击所述带货物品的粉丝数量,所述转化数据包括所述历史时间段内每天的点击通过率、转化率和粉丝活跃率中的至少一种,所述点击通过率与所述点击量成正相关且与所述在线粉丝量成负相关,所述转化率与所述订单量成正相关且与所述点击量成负相关,所述粉丝活跃率为点击粉丝量与在线粉丝量之间的比值。In some embodiments, the object is a user with goods, the historical data is historical data with goods, and the historical data with goods of the user with goods includes daily data in a historical time period, and the daily data is The product data includes at least one of the click volume, order volume, click follower volume and online follower volume of the carrying product corresponding to each reference keyword in one day, and the click follower volume is the user carrying the product within one day. The number of fans who clicked on the goods with the goods among the online fans of the The click volume is positively correlated and negatively correlated with the online fan volume, the conversion rate is positively correlated with the order volume and negatively correlated with the click volume, and the fan activity rate is the click volume and the online fan volume ratio between.
在一些实施例中,所述带货用户为多个,所述点击通过率与平均点击量成正相关且与平均在线粉丝量成负相关,所述平均点击量为所述多个带货用户的点击量的平均值,所述平均在线粉丝量为所述多个带货用户的在线粉丝量的平均值;所述转化率与平均订单量成正相关且与所述平均点击量成负相关,所述平均订单量为所述多个带货用户的订单量的平均值。In some embodiments, there are multiple users with goods, the click-through rate is positively correlated with the average click volume and negatively correlated with the average online fan volume, and the average click volume is the average click volume of the multiple users with goods. The average of clicks, the average online fans is the average of online fans of the multiple users with goods; the conversion rate is positively correlated with the average order volume and negatively correlated with the average clicks, so The average order quantity is the average of the order quantities of the multiple users with goods.
在一些实施例中,确定相应的转化权重包括:将所述点击通过率、所述转化率和所述粉丝活跃率的平均值,确定为转化权重。In some embodiments, determining the corresponding conversion weight includes: determining the average of the click-through rate, the conversion rate, and the fan activity rate as the conversion weight.
在一些实施例中,与所述对象对应的历史数据包括该对象衍生的至少一组订单数据,每组订单数据包括多个物品的物品标识,从与所述待处理关键词相关的对象对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词包括:根据所述对象衍生的每组订单数据,确定多个物品组,每个物品组包括两个不同物品的物品标识;根据所述对象衍生的每组订单数据,确定每个物品组中、与物品标识对应的两个物品之间的关联权重值,关联权重值越大,两个物品之间的相关性越强;以所述多个物品组中的物品标识为顶点、关联权重值为边,构建图模型;利用所述图模型,从与所述对象对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词。In some embodiments, the historical data corresponding to the object includes at least one group of order data derived from the object, and each group of order data includes item identifiers of a plurality of items, from the data corresponding to the object related to the keyword to be processed. In the historical data, acquiring a plurality of reference keywords belonging to the same keyword category as each target keyword includes: determining a plurality of item groups according to each group of order data derived from the object, and each item group includes two Item identifiers of different items; according to each group of order data derived from the object, determine the association weight value between the two items in each item group corresponding to the item identifier. The stronger the correlation is; the item identifiers in the multiple item groups are used as vertices, and the associated weights are used as edges to construct a graph model; using the graph model, from the historical data corresponding to the object, obtain the Describe multiple reference keywords that each target keyword belongs to the same keyword category.
在一些实施例中,所述订单数据还包括与物品标识对应的其他订单参数,利用所述图模型,从与所述对象对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词包括:利用随机游走算法,对所述图模型进行随机游走,直到满足游走停止条件,得到至少一个物品标识,作为参考物品标识;对于所述每个目标关键词,从所述对象衍生的每组订单数据中,选择与所述每个目标关键词属于同 一关键词类别、且与所述参考物品标识对应的物品标识或其他订单参数,作为参考关键词。In some embodiments, the order data further includes other order parameters corresponding to the item identifier, and the graph model is used to obtain the same key as each target keyword from historical data corresponding to the object. The multiple reference keywords of the word category include: using a random walk algorithm to randomly walk the graph model until the walking stop condition is met, and at least one item identifier is obtained as a reference item identifier; for each target Keywords, from each group of order data derived from the object, select the item ID or other order parameters that belong to the same keyword category as each target keyword and correspond to the reference item ID, as the reference keyword .
在一些实施例中,订单数据还包括订单日期,确定每个物品组中的与物品标识对应的两个物品之间的关联权重值包括:根据每个物品组中、与两个物品标识对应的订单日期之间的差值,确定每个物品组中的与物品标识对应的两个物品之间的关联权重值,关联权重值与订单日期之间的差值成负相关。In some embodiments, the order data further includes an order date, and determining an association weight value between two items in each item group corresponding to the item identifiers includes: The difference between the order dates determines the association weight value between the two items in each item group corresponding to the item identifier, and the association weight value is negatively correlated with the difference between the order dates.
在一些实施例中,订单数据还包括订单日期,根据与每个带货用户对应的订单数据,确定多个带货物品组包括:根据每组订单数据,确定至少一个物品集合,每个物品集合包括多个按订单先后顺序排列的多个物品标识,同一物品集合中的多个物品标识所对应的最晚订单日期与最早订单日期之间的差值小于日期阈值;针对每个物品集合,将相邻的两个物品标识划分到一个物品组,以得到所述多个物品组。In some embodiments, the order data further includes an order date, and according to the order data corresponding to each user with goods, determining a plurality of groups of goods with goods includes: according to each group of order data, determining at least one set of items, each set of goods Including multiple item IDs arranged in order of order, the difference between the latest order date and the earliest order date corresponding to multiple item IDs in the same item set is less than the date threshold; for each item set, the Two adjacent item identifiers are divided into one item group to obtain the plurality of item groups.
在一些实施例中,推荐与所述待处理关键词相关的信息包括:在所述对象为多个对象的情况下,从所述多个对象中,选择待推荐对象,并推荐所述待推荐对象的相关信息。In some embodiments, recommending information related to the to-be-processed keyword includes: when the object is a plurality of objects, selecting an object to be recommended from the plurality of objects, and recommending the to-be-recommended object information about the object.
在一些实施例中,在所述对象为一个对象的情况下,所述至少一个目标关键词包括多个目标关键词,推荐与所述待处理关键词相关的信息包括:在所述对象为一个对象的情况下,从所述多个目标关键词中,选择待推荐关键词,并推荐所述待推荐关键词的相关信息。In some embodiments, when the object is one object, the at least one target keyword includes multiple target keywords, and recommending information related to the keyword to be processed includes: when the object is one In the case of the target keyword, the keyword to be recommended is selected from the multiple target keywords, and the related information of the keyword to be recommended is recommended.
在一些实施例中,根据待处理关键词,确定至少一个目标关键词包括:根据与所述待处理关键词对应的关键词类别,将所述待处理关键词转化为至少一个目标关键词。In some embodiments, determining at least one target keyword according to the keyword to be processed includes: converting the keyword to be processed into at least one target keyword according to a keyword category corresponding to the keyword to be processed.
在一些实施例中,在所述对象为带货用户的情况下,所述关键词类别包括用户标识类、物品标识类、品牌标识类、店铺标识类和品类。In some embodiments, when the object is a user with goods, the keyword categories include user identification categories, item identification categories, brand identification categories, store identification categories, and categories.
根据本公开第二方面,提供了一种信息推荐装置,包括:第一确定模块,被配置为根据待处理关键词,确定至少一个目标关键词及其目标向量,目标关键词与目标向量一一对应;获取模块,被配置为对于每个目标关键词,从与所述待处理关键词相关的对象对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词;第二确定模块,被配置为根据与所述多个参考关键词对应的多个参考向量,确定与所述对象对应的综合向量,参考关键词与参考向量一一对应;推荐模块,被配 置为根据所述综合向量与所述每个目标向量之间的相似度,推荐与所述待处理关键词相关的信息。According to a second aspect of the present disclosure, there is provided an information recommendation device, comprising: a first determination module configured to determine at least one target keyword and its target vector according to the keywords to be processed, the target keyword and the target vector one by one Corresponding; the acquisition module is configured to, for each target keyword, acquire a plurality of references belonging to the same keyword category with each target keyword from historical data corresponding to the object related to the keyword to be processed keywords; a second determining module, configured to determine a comprehensive vector corresponding to the object according to a plurality of reference vectors corresponding to the plurality of reference keywords, and the reference keywords and the reference vectors are in one-to-one correspondence; a recommendation module, is configured to recommend information related to the keyword to be processed according to the similarity between the integrated vector and each of the target vectors.
根据本公开第三方面,提供了一种信息推荐装置,包括:存储器;以及耦接至所述存储器的处理器,所述处理器被配置为基于存储在所述存储器的指令,执行上述任一实施例所述的信息推荐方法。According to a third aspect of the present disclosure, there is provided an information recommendation apparatus, comprising: a memory; and a processor coupled to the memory, the processor configured to execute any one of the above based on instructions stored in the memory The information recommendation method described in the embodiment.
根据本公开的第四方面,提供了一种计算机可存储介质,其上存储有计算机程序指令,该指令被处理器执行时实现上述任一实施例所述的信息推荐方法。According to a fourth aspect of the present disclosure, there is provided a computer-storable medium on which computer program instructions are stored, and when the instructions are executed by a processor, implement the information recommendation method described in any of the foregoing embodiments.
附图说明Description of drawings
构成说明书的一部分的附图描述了本公开的实施例,并且连同说明书一起用于解释本公开的原理。The accompanying drawings, which form a part of the specification, illustrate embodiments of the present disclosure and together with the description serve to explain the principles of the present disclosure.
参照附图,根据下面的详细描述,可以更加清楚地理解本公开,其中:The present disclosure may be more clearly understood from the following detailed description with reference to the accompanying drawings, wherein:
图1是示出根据本公开一些实施例的信息推荐方法的流程图;FIG. 1 is a flowchart illustrating an information recommendation method according to some embodiments of the present disclosure;
图2是示出根据本公开一些实施例的获取与每个目标关键词属于同一关键词类别的多个参考关键词的流程图;FIG. 2 is a flowchart illustrating obtaining a plurality of reference keywords belonging to the same keyword category as each target keyword according to some embodiments of the present disclosure;
图3是示出根据本公开一些实施例的确定与对象对应的综合向量的流程图;3 is a flow diagram illustrating determining a synthesis vector corresponding to an object according to some embodiments of the present disclosure;
图4是示出根据本公开一些实施例的信息推荐装置的框图;4 is a block diagram illustrating an information recommendation apparatus according to some embodiments of the present disclosure;
图5是示出根据本公开另一些实施例的信息推荐装置的框图;FIG. 5 is a block diagram illustrating an information recommendation apparatus according to other embodiments of the present disclosure;
图6是示出用于实现本公开一些实施例的计算机系统的框图。6 is a block diagram illustrating a computer system for implementing some embodiments of the present disclosure.
具体实施方式Detailed ways
现在将参照附图来详细描述本公开的各种示例性实施例。应注意到:除非另外具体说明,否则在这些实施例中阐述的部件和步骤的相对布置、数字表达式和数值不限制本公开的范围。Various exemplary embodiments of the present disclosure will now be described in detail with reference to the accompanying drawings. It should be noted that the relative arrangement of the components and steps, the numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present disclosure unless specifically stated otherwise.
同时,应当明白,为了便于描述,附图中所示出的各个部分的尺寸并不是按照实际的比例关系绘制的。Meanwhile, it should be understood that, for the convenience of description, the dimensions of various parts shown in the accompanying drawings are not drawn in an actual proportional relationship.
以下对至少一个示例性实施例的描述实际上仅仅是说明性的,决不作为对本公开及其应用或使用的任何限制。The following description of at least one exemplary embodiment is merely illustrative in nature and is in no way intended to limit the disclosure, its application or uses in any way.
对于相关领域普通技术人员已知的技术、方法和设备可能不作详细讨论,但在适当情况下,所述技术、方法和设备应当被视为说明书的一部分。Techniques, methods, and apparatus known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate, such techniques, methods, and apparatus should be considered part of the specification.
在这里示出和讨论的所有示例中,任何具体值应被解释为仅仅是示例性的,而不是作为限制。因此,示例性实施例的其它示例可以具有不同的值。In all examples shown and discussed herein, any specific value should be construed as illustrative only and not as limiting. Accordingly, other examples of exemplary embodiments may have different values.
应注意到:相似的标号和字母在下面的附图中表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步讨论。It should be noted that like numerals and letters refer to like items in the following figures, so once an item is defined in one figure, it does not require further discussion in subsequent figures.
相关技术中,未考虑带货用户与待带货物品之间的匹配度,从而推荐的准确性较差。In the related art, the matching degree between the user carrying the goods and the goods to be carried is not considered, so the accuracy of the recommendation is poor.
针对上述技术问题,本公开提出了一种信息推荐方法,可以提高推荐的准确性。In view of the above technical problems, the present disclosure proposes an information recommendation method, which can improve the accuracy of recommendation.
图1是示出根据本公开一些实施例的信息推荐方法的流程图。FIG. 1 is a flowchart illustrating an information recommendation method according to some embodiments of the present disclosure.
如图1所示,信息推荐方法包括:步骤S10,根据待处理关键词,确定至少一个目标关键词及其目标向量;步骤S20,对于每个目标关键词,从与待处理关键词相关的对象对应的历史数据中,获取与每个目标关键词属于同一关键词类别的多个参考关键词;步骤S30,根据与多个参考关键词对应的多个参考向量,确定与对象对应的综合向量;步骤S40,根据综合向量与每个目标向量之间的相似度,推荐与待处理关键词相关的信息。在一些实施例中,信息推荐方法由信息推荐装置执行。该信息推荐方法可以应用到直播带货场景,也可以应用到广告媒体选择的场景。As shown in Figure 1, the information recommendation method includes: step S10, determining at least one target keyword and its target vector according to the keywords to be processed; step S20, for each target keyword, from the object related to the keyword to be processed In the corresponding historical data, obtain a plurality of reference keywords belonging to the same keyword category as each target keyword; Step S30, according to a plurality of reference vectors corresponding to the plurality of reference keywords, determine a comprehensive vector corresponding to the object; Step S40, recommend information related to the keyword to be processed according to the similarity between the comprehensive vector and each target vector. In some embodiments, the information recommendation method is performed by an information recommendation apparatus. The information recommendation method can be applied to the scene of live broadcast with goods, and can also be applied to the scene of advertising media selection.
在步骤S10中,根据待处理关键词,确定至少一个目标关键词及其目标向量。目标关键词与目标向量一一对应。在一些实施例中,待处理关键词可以为检索用户输入的检索词。In step S10, at least one target keyword and its target vector are determined according to the keywords to be processed. Target keywords and target vectors correspond one-to-one. In some embodiments, the keyword to be processed may be a search term input by a search user.
在一些实施例中,例如,待处理关键词包括但不限于带货物品的物品标识、带货用户的用户标识、带货物品的品牌标识和带货物品的品类标识。例如,物品标识可以为物品ID(Identity Document,身份标识号)或者物品名称,用户标识可以为用户ID或者用户名,品牌标识可以为品牌ID或者品牌名称,品类标识可以为品类ID或者品类名称。In some embodiments, for example, the keywords to be processed include, but are not limited to, the item identification of the carried item, the user identification of the carried user, the brand identification of the carried item, and the category identification of the carried item. For example, the item identifier can be an item ID (Identity Document, identification number) or an item name, the user identifier can be a user ID or a user name, the brand identifier can be a brand ID or a brand name, and the category identifier can be a category ID or category name.
在一些实施例中,根据与待处理关键词对应的关键词类别,将待处理关键词转化为至少一个目标关键词。在一些实施例中,在与待处理关键词相关的对象为带货用户的情况下,关键词类别包括但不限于用户标识类、物品标识类、品牌标识类、店铺标识类和品类。In some embodiments, the keyword to be processed is converted into at least one target keyword according to the keyword category corresponding to the keyword to be processed. In some embodiments, when the object related to the keyword to be processed is a user with goods, the keyword category includes but is not limited to user identification category, item identification category, brand identification category, store identification category, and category category.
例如,在待处理关键词所对应的关键词类别为物品标识类的情况下,将待处理关键词本身作为目标关键词。For example, when the keyword category corresponding to the keyword to be processed is the item identification category, the keyword to be processed itself is used as the target keyword.
又例如,在待处理关键词所对应的关键词类别为用户标识类的情况下,将待处理关键词转化为与待处理关键词所对应的用户标识对应的物品标识、品类标识和品牌标识中的至少一种。在一些实施例中,一个用户标识对应多个物品标识、多个品类标识或者多个品牌标识,则多个物品标识构成的目标关键词所对应的目标向量可以为多个物品标识的多个标识向量的平均值或者加权平均值。同理,可以确定多个品类标识或者多个品牌标识构成的目标关键词所对应的目标向量。For another example, in the case where the keyword category corresponding to the keyword to be processed is the user identification category, the keyword to be processed is converted into the item identification, category identification and brand identification corresponding to the user identification corresponding to the keyword to be processed. at least one of. In some embodiments, one user ID corresponds to multiple item IDs, multiple category IDs, or multiple brand IDs, then the target vector corresponding to the target keyword formed by multiple item IDs may be multiple IDs of multiple item IDs Average or weighted average of vectors. Similarly, target vectors corresponding to target keywords composed of multiple category identifiers or multiple brand identifiers can be determined.
又例如,在待处理关键词所对应的关键词类别为用户标识类的情况下,还可以将待处理关键词转化为多个候选物品的物品标识。For another example, in the case where the keyword category corresponding to the keyword to be processed is a user identification category, the keyword to be processed may also be converted into item IDs of multiple candidate items.
在步骤S20中,对于每个目标关键词,从与待处理关键词相关的对象对应的历史数据中,获取与每个目标关键词属于同一关键词类别的多个参考关键词。In step S20, for each target keyword, multiple reference keywords belonging to the same keyword category as each target keyword are acquired from historical data corresponding to objects related to the keyword to be processed.
例如,通过如图2所示的方式实现上述步骤S20。For example, the above-mentioned step S20 is implemented in the manner shown in FIG. 2 .
图2是示出根据本公开一些实施例的获取与每个目标关键词属于同一关键词类别的多个参考关键词的流程图。FIG. 2 is a flowchart illustrating obtaining a plurality of reference keywords belonging to the same keyword category as each target keyword according to some embodiments of the present disclosure.
如图2所示,获取与每个目标关键词属于同一关键词类别的多个参考关键词包括步骤S21-步骤S24。例如,与上述对象对应的历史数据包括该对象衍生的至少一组订单数据。每组订单数据包括多个物品的物品标识。例如,在对象为带货用户的情况下,该对象衍生的每组订单数据为带货用户的每个粉丝的订单数据。每组订单数据所包括的多个物品为多个带货物品。As shown in FIG. 2 , acquiring a plurality of reference keywords belonging to the same keyword category as each target keyword includes steps S21 to S24. For example, the historical data corresponding to the above-mentioned object includes at least one group of order data derived from the object. Each set of order data includes item identifiers for multiple items. For example, when the object is a user with goods, each group of order data derived from the object is the order data of each fan of the user with goods. The multiple items included in each set of order data are multiple carry items.
例如,一粉丝A的订单数据可以表示为{[商品ID=3645,日期=20200101,品牌ID=乔丹,店铺=乔丹专卖店,分类=运动鞋],[商品ID=4543,日期=20200101,品牌ID=耐克,店铺=耐克专卖店,分类=运动鞋],[商品ID=712345,日期=20200102,品牌ID=华为,店铺=华为专卖店,分类=手机],[商品ID=546574,日期=20200108,品牌ID=皮卡尔丹,店铺=皮卡尔丹专卖店,分类=牛仔裤]}。For example, the order data of a fan A can be expressed as {[item ID=3645, date=20200101, brand ID=Jordan, shop=Jordan store, category=sneakers], [item ID=4543, date=20200101, brand ID=Nike, Store=Nike Store, Category=Sneakers], [Item ID=712345, Date=20200102, Brand ID=Huawei, Store=Huawei Store, Category=Mobile], [Item ID=546574, Date= 20200108, Brand ID = Picardine, Store = Picardine, Category = Jeans]}.
在步骤S21中,根据对象衍生的每组订单数据,确定多个物品组。每个物品组包括两个不同物品的物品标识。以订单数据包括订单日期为例,根据每组订单数据,确定至少一个物品集合,每个物品集合包括多个按订单先后顺序排列的多个物品标识,同一物品集合中的多个物品标识所对应的最晚订单日期与最早订单日期之间的差值小 于日期阈值(例如为3天)。针对每个物品集合,将相邻的两个物品标识划分到一个物品组,以得到多个物品组。In step S21, a plurality of item groups are determined according to each group of order data derived from the object. Each item group includes item IDs for two different items. Taking the order data including the order date as an example, according to each group of order data, at least one item set is determined, and each item set includes multiple item identifiers arranged in the order of the order, and the multiple item identifiers in the same item set correspond to The difference between the latest order date and the earliest order date is less than a date threshold (eg, 3 days). For each item set, two adjacent item identifiers are divided into an item group to obtain multiple item groups.
例如,粉丝A的订单数据可以划分为订单集合B和订单集合C。订单集合B为{[商品ID=3645,日期=20200101,品牌ID=乔丹,店铺=乔丹专卖店,分类=运动鞋],[商品ID=4543,日期=20200101,品牌ID=耐克,店铺=耐克专卖店,分类=运动鞋],[商品ID=712345,日期=20200102,品牌ID=华为,店铺=华为专卖店,分类=手机]}。订单集合C为{[商品ID=546574,日期=20200108,品牌ID=皮卡尔丹,店铺=皮卡尔丹专卖店,分类=牛仔裤],[商品ID=546578,日期=20200109,品牌ID=皮卡尔丹,店铺=皮卡尔丹专卖店,分类=牛仔外套]}。单个订单集合中最晚订单日期与最早订单日期之间的差值小于3天。根据这两个订单集合可以确定两个物品集合,分别为{商品ID=3645,商品ID=4543,商品ID=712345}、{商品ID=546574,商品ID=546578}。For example, the order data of fan A can be divided into order set B and order set C. Order set B is {[Item ID=3645, Date=20200101, Brand ID=Jordan, Store=Jordan Store, Category=Sneakers], [Item ID=4543, Date=20200101, Brand ID=Nike, Store=Nike Store, Category=Sneakers], [Item ID=712345, Date=20200102, Brand ID=Huawei, Store=Huawei Store, Category=Mobile]}. Order set C is {[ItemID=546574, Date=20200108, BrandID=Picardin, Shop=Picardin, Category=Jeans], [ItemID=546578, Date=20200109, BrandID=Picard Dan, shop = Picardine, category = denim jacket]}. The difference between the latest order date and the earliest order date in a single order collection is less than 3 days. According to the two order sets, two item sets can be determined, namely {commodity ID=3645, commodity ID=4543, commodity ID=712345}, {commodity ID=546574, commodity ID=546578}.
在步骤S22中,根据对象衍生的每组订单数据,确定每个物品组中、与物品标识对应的两个物品之间的关联权重值。关联权重值越大,两个物品之间的相关性越强。In step S22, according to each group of order data derived from the object, determine the association weight value between the two items corresponding to the item identifiers in each item group. The larger the association weight value, the stronger the correlation between the two items.
在一些实施例中,订单数据还包括订单日期。根据每个物品组中、与两个物品标识对应的订单日期之间的差值,确定每个物品组中的与物品标识对应的两个物品之间的关联权重值。关联权重值与订单日期之间的差值成负相关。In some embodiments, the order data also includes the order date. According to the difference between the order dates corresponding to the two item identifiers in each item group, the association weight value between the two items corresponding to the item identifiers in each item group is determined. The association weight value is negatively correlated with the difference between the order dates.
例如,关联权重值为
Figure PCTCN2022085053-appb-000001
D为订单日期之间的差值(较晚的订单日期减去较早的订单日期),M为预设的正整数。例如,M设置为4。这是由于订单日期之间的差值不会超过4。
For example, the associated weight value is
Figure PCTCN2022085053-appb-000001
D is the difference between order dates (later order dates minus earlier order dates), and M is a preset positive integer. For example, M is set to 4. This is because the difference between order dates will not exceed 4.
在步骤S23中,以多个物品组中的物品标识为顶点、关联权重值为边,构建图模型。例如,在得到关联权重值后,对订单集合B进行处理,可以得到处理后的集合{[商品ID=3645,日期=20200101,品牌ID=乔丹,店铺=乔丹专卖店,分类=运动鞋],[商品ID=4543,日期=20200101,品牌ID=耐克,店铺=耐克专卖店,分类=运动鞋],权重=1}、和{[商品ID=4543,日期=20200101,品牌ID=耐克,店铺=耐克专卖店,分类=运动鞋],[商品ID=712345,日期=20200102,品牌ID=华为,店铺=华为专卖店,分类=手机],权重=0.75}。根据处理后的集合,可以构建图模型。In step S23, a graph model is constructed using the item identifiers in the multiple item groups as vertices and the associated weights as edges. For example, after obtaining the associated weight value, the order set B is processed, and the processed set {[item ID=3645, date=20200101, brand ID=Jordan, shop=Jordan store, category=sports shoes], [ItemID=4543, Date=20200101, BrandID=Nike, Store=Nike Store, Category=Sneakers], Weight=1}, and {[ItemID=4543, Date=20200101, BrandID=Nike, Store = Nike store, category = sports shoes], [item ID = 712345, date = 20200102, brand ID = Huawei, store = Huawei store, category = mobile], weight = 0.75}. From the processed collection, a graph model can be built.
在步骤S24中,利用图模型,从与对象对应的历史数据中,获取与每个目标关键词属于同一关键词类别的多个参考关键词。In step S24, a graph model is used to acquire a plurality of reference keywords belonging to the same keyword category as each target keyword from the historical data corresponding to the object.
在一些实施例中,订单数据还包括与物品标识对应的其他订单参数。可以通过如下的方式实现上述步骤S24。In some embodiments, the order data also includes other order parameters corresponding to the item identification. The above step S24 can be implemented in the following manner.
首先,利用随机游走算法,对图模型进行随机游走,直到满足游走停止条件,得到至少一个物品标识,作为参考物品标识。例如,随机游走的参数包括游走的最大步长(例如为13)、每个顶点的遍历次数(例如为4)。游走的停止条件为游走的总长度。在一些实施例中,随机游走算法为基于Alias(别名)采样的随机游走算法。First, use the random walk algorithm to randomly walk the graph model until the stop condition of the walk is satisfied, and at least one item identifier is obtained as a reference item identifier. For example, the parameters of the random walk include the maximum step size of the walk (eg, 13), and the number of traversals per vertex (eg, 4). The stop condition of the walk is the total length of the walk. In some embodiments, the random walk algorithm is an Alias sampling based random walk algorithm.
然后,对于每个目标关键词,从对象衍生的每组订单数据中,选择与每个目标关键词属于同一关键词类别、且与参考物品标识对应的物品标识或其他订单参数,作为参考关键词。以订单数据包括物品标识、品牌标识、品类标识为例,在目标关键词属于物品标识类的情况下,参考关键词也是物品标识(即参考物品标识)。在目标关键词属于品牌标识类的情况下,参考关键词为与参考物品标识对应的品牌标识。在目标关键词属于品类标识类的情况下,参考关键词为与参考物品标识对应的品类标识。Then, for each target keyword, from each group of order data derived from the object, select the item ID or other order parameters that belong to the same keyword category as each target keyword and correspond to the reference item ID as the reference keyword . Taking the order data including item ID, brand ID, and category ID as an example, in the case that the target keyword belongs to the item ID category, the reference keyword is also the item ID (ie, the reference item ID). In the case where the target keyword belongs to the brand identification category, the reference keyword is the brand identification corresponding to the reference item identification. In the case that the target keyword belongs to the category identification category, the reference keyword is the category identification corresponding to the reference item identification.
在上述实施例中,通过计算关联权重值,构建图模型,可以构建物品之间的关联性,从而利用物品之间的关联性,确定与目标关键词最具关联的参考关键词,进一步提高推荐的准确性。In the above embodiment, by calculating the correlation weight value and constructing a graph model, the correlation between items can be constructed, so that the reference keyword most related to the target keyword can be determined by using the correlation between the items, and the recommendation can be further improved. accuracy.
返回图1,在步骤S30中,根据与多个参考关键词对应的多个参考向量,确定与对象对应的综合向量。以对象为带货用户为例,每个带货用户对应一个综合向量。Returning to FIG. 1, in step S30, a comprehensive vector corresponding to an object is determined according to a plurality of reference vectors corresponding to a plurality of reference keywords. Taking the object as a user with goods as an example, each user with goods corresponds to a comprehensive vector.
例如,通过如图3所示的方式实现上述步骤S30。For example, the above-mentioned step S30 is implemented in the manner shown in FIG. 3 .
图3是示出根据本公开一些实施例的确定与对象对应的综合向量的流程图。Figure 3 is a flow diagram illustrating determining a synthesis vector corresponding to an object in accordance with some embodiments of the present disclosure.
如图3所示,确定与对象对应的综合向量包括步骤S31-步骤S32。As shown in FIG. 3 , determining the integrated vector corresponding to the object includes steps S31 to S32.
在步骤S31中,根据与对象对应的历史数据,确定与每个参考关键词对应的转化数据。转化数据为衡量基于每个参考关键词所产生的特定贡献(例如,点击通过率、转化率、粉丝活跃率等流量贡献)的能力的数据。在一些实施例中,在对象为带货用户的情况下,历史数据为历史带货数据,转化数据为带货转化数据。在另一些实施例中,在对象为广告媒体的情况下,历史数据为历史广告数据,转化数据为广告转化数据。In step S31, the conversion data corresponding to each reference keyword is determined according to the historical data corresponding to the object. Conversion data is data that measures the ability to generate specific contributions (eg, traffic contributions such as click-through rate, conversion rate, fan activity rate, etc.) based on each reference keyword. In some embodiments, when the object is a user with goods, the historical data is historical delivery data, and the transformation data is delivery conversion data. In other embodiments, when the object is an advertising medium, the historical data is historical advertising data, and the conversion data is advertising conversion data.
在步骤S32中,根据多个参考向量及其对应的转化数据,确定综合向量。通过结合转化数据来确定综合向量,不仅考虑了待处理关键词与对象之间的匹配度,还考虑了对象的转化能力,可以进一步提高推荐的准确性。In step S32, a comprehensive vector is determined according to a plurality of reference vectors and their corresponding transformation data. By combining the transformation data to determine the comprehensive vector, not only the matching degree between the keyword to be processed and the object, but also the transformation ability of the object is considered, which can further improve the accuracy of the recommendation.
在一些实施例中,根据与每个参考关键词对应的转化数据,确定相应的转化权重。利用多个转化权重,对多个参考向量进行加权操作,得到综合向量。通过将转化数据转换为转化权重,对多个参考向量进行加权操作,能够综合评价对象的转化能力,进一步提高推荐的准确性。转化权重即为每个参考向量的权重值。In some embodiments, the corresponding conversion weight is determined according to the conversion data corresponding to each reference keyword. Using multiple transformation weights, weighting operations are performed on multiple reference vectors to obtain a comprehensive vector. By converting the conversion data into conversion weights and performing weighting operations on multiple reference vectors, the conversion ability of the object can be comprehensively evaluated, and the accuracy of the recommendation can be further improved. The transformation weight is the weight value of each reference vector.
以对象为带货用户、历史数据为历史带货数据为例,带货用户的历史带货数据包括历史时间段内每天的带货数据。每天的带货数据包括与每个参考关键词对应的带货物品在一天内的点击量、订单量、点击粉丝量和在线粉丝量中的至少一种。点击粉丝量为一天内带货用户的在线粉丝中点击带货物品的粉丝数量。Taking the object as the user with goods and the historical data as the historical delivery data as an example, the historical delivery data of the user with goods includes the daily delivery data in the historical time period. The daily delivery data includes at least one of the click volume, order volume, click followers volume and online followers volume of the delivered product corresponding to each reference keyword in one day. The number of clicked followers refers to the number of followers who clicked on the goods with goods among the online followers of the users who brought goods in one day.
以对象为带货用户、历史数据为历史带货数据为例,转化数据包括历史时间段内每天的点击通过率(Click Through Rate,CTR)、转化率(Conversion Rate,CVR)和粉丝活跃率中的至少一种。点击通过率与点击量成正相关且与在线粉丝量成负相关。转化率与订单量成正相关且与点击量成负相关。粉丝活跃率为点击粉丝量与在线粉丝量之间的比值。Taking the object as the user with the goods and the historical data as the historical data of the goods as an example, the conversion data includes the daily click through rate (Click Through Rate, CTR), conversion rate (Conversion Rate, CVR) and fan activity rate in the historical time period. at least one of. Click-through rate is positively correlated with clicks and negatively correlated with online followers. Conversion rate is positively correlated with order volume and negatively correlated with click volume. The fan activity rate is the ratio between the number of clicked fans and the number of online fans.
以转化数据包括历史时间段内每天的点击通过率、转化率和粉丝活跃率为例,可以将点击通过率、转化率和粉丝活跃率的平均值,确定为与参考关键词对应的转化权重。Taking the conversion data including the daily click-through rate, conversion rate, and fan activity rate in a historical time period as an example, the average value of the click-through rate, conversion rate, and fan activity rate can be determined as the conversion weight corresponding to the reference keyword.
在一些实施例中,在带货用户为多个的情况下,点击通过率与平均点击量成正相关且与平均在线粉丝量成负相关。平均点击量为多个带货用户的点击量的平均值。平均在线粉丝量为多个带货用户的在线粉丝量的平均值。转化率与平均订单量成正相关且与平均点击量成负相关。平均订单量为多个带货用户的订单量的平均值。In some embodiments, when there are multiple users with goods, the click-through rate is positively correlated with the average click volume and negatively correlated with the average online fan count. The average click volume is the average of the click volume of multiple users with goods. The average number of online fans is the average of the online fans of multiple users with goods. Conversion rate is positively correlated with average order volume and negatively correlated with average click volume. The average order size is the average of the order sizes of multiple users with goods.
在上述实施例中,利用多个带货用户对应的平均在线粉丝量、平均点击量和平均订单量对转化数据进行修正,即利用贝叶斯平滑思想进行修正,可以使得转化数据更加接近真实值,从而进一步提高推荐的准确性。In the above embodiment, the conversion data is modified by using the average online followers, average clicks and average orders corresponding to multiple users with goods, that is, using the Bayesian smoothing idea to modify the conversion data, which can make the conversion data closer to the real value , so as to further improve the accuracy of the recommendation.
返回图1,在步骤S40中,根据综合向量与每个目标向量之间的相似度,推荐与待处理关键词相关的信息。例如,相似度为向量之间的向量内积或者余弦相似度。Returning to FIG. 1, in step S40, information related to the keywords to be processed is recommended according to the similarity between the comprehensive vector and each target vector. For example, the similarity is the vector inner product or cosine similarity between vectors.
在对象为多个对象的情况下,从多个对象中,选择待推荐对象,并推荐待推荐对象的相关信息。例如,从多个带货用户中选择待推荐用户,并推荐待推荐用户的用户信息。此种情况下,从多个带货用户选择与待处理关键词相匹配的带货用户,以便所 选择的带货用户针对与待处理关键词对应的物品进行带货。在一些实施例中,可以推荐与每个目标向量的相似度之和最大的综合向量所对应的带货用户的用户信息。In the case that the objects are multiple objects, the objects to be recommended are selected from the multiple objects, and the relevant information of the objects to be recommended is recommended. For example, a user to be recommended is selected from multiple users with goods, and user information of the user to be recommended is recommended. In this case, a user with goods matching the keyword to be processed is selected from multiple users with goods, so that the selected user with goods can carry goods for the item corresponding to the keyword to be processed. In some embodiments, the user information of the user with goods corresponding to the comprehensive vector with the largest sum of similarity of each target vector may be recommended.
在对象为一个对象的情况下,目标关键词为多个。从多个目标关键词中,选择待推荐关键词,并推荐待推荐关键词的相关信息。例如,从多个候选物品中,选择待推荐物品,并推荐待推荐物品的物品信息。此种情况下,为某个带货用户选择相匹配的待推荐物品,以便该带货用户针对该待推荐物品进行带货。在一些实施例中,可以推荐与综合向量之间的相似度最大的目标向量所对应的候选物品的物品信息。When the object is one object, there are plural target keywords. From a plurality of target keywords, a keyword to be recommended is selected, and relevant information of the keyword to be recommended is recommended. For example, from a plurality of candidate items, an item to be recommended is selected, and item information of the item to be recommended is recommended. In this case, a matching item to be recommended is selected for a user with goods, so that the user with goods can carry goods for the item to be recommended. In some embodiments, the item information of the candidate item corresponding to the target vector with the greatest similarity with the comprehensive vector may be recommended.
在上述实施例中,通过待处理关键词向目标关键词的转化,以及利用基于历史数据获取到的与目标关键词属于同一关键词类别的多个参考关键词的参考向量,确定对象的综合向量,以综合向量和目标向量之间的相似度为基准进行推荐,综合考虑了对象与待处理关键词的匹配度,可以提高推荐的准确性。In the above embodiment, the comprehensive vector of the object is determined by converting the keywords to be processed into the target keywords, and using the reference vectors of multiple reference keywords belonging to the same keyword category as the target keyword obtained based on historical data. , the recommendation is based on the similarity between the comprehensive vector and the target vector, and the matching degree between the object and the keyword to be processed is comprehensively considered, which can improve the accuracy of the recommendation.
图4是示出根据本公开一些实施例的信息推荐装置的框图。FIG. 4 is a block diagram illustrating an information recommendation apparatus according to some embodiments of the present disclosure.
如图4所示,信息推荐装置4包括第一确定模块41、获取模块42、第二确定模块43和推荐模块44。As shown in FIG. 4 , the information recommendation apparatus 4 includes a first determination module 41 , an acquisition module 42 , a second determination module 43 and a recommendation module 44 .
第一确定模块41被配置为根据待处理关键词,确定至少一个目标关键词及其目标向量,例如执行如图1所示的步骤S10。目标关键词与目标向量一一对应。The first determination module 41 is configured to determine at least one target keyword and its target vector according to the keywords to be processed, for example, to perform step S10 as shown in FIG. 1 . Target keywords and target vectors correspond one-to-one.
获取模块42被配置为对于每个目标关键词,从与待处理关键词相关的对象对应的历史数据中,获取与每个目标关键词属于同一关键词类别的多个参考关键词,例如执行如图1所示的步骤S20。The acquisition module 42 is configured to, for each target keyword, acquire a plurality of reference keywords belonging to the same keyword category with each target keyword from the historical data corresponding to the object related to the keyword to be processed, for example, perform the following steps: Step S20 shown in FIG. 1 .
第二确定模块43被配置为根据与多个参考关键词对应的多个参考向量,确定与对象对应的综合向量,例如执行如图1所示的步骤S30参考关键词与参考向量一一对应。The second determining module 43 is configured to determine a comprehensive vector corresponding to the object according to the plurality of reference vectors corresponding to the plurality of reference keywords, for example, performing step S30 shown in FIG. 1 for one-to-one correspondence between the reference keywords and the reference vectors.
推荐模块44被配置为根据综合向量与每个目标向量之间的相似度,推荐与待处理关键词相关的信息,例如执行如图1所示的步骤S40。The recommendation module 44 is configured to recommend information related to the keyword to be processed according to the similarity between the integrated vector and each target vector, for example, to perform step S40 as shown in FIG. 1 .
图5是示出根据本公开另一些实施例的信息推荐装置的框图。FIG. 5 is a block diagram illustrating an information recommendation apparatus according to other embodiments of the present disclosure.
如图5所示,信息推荐装置5包括存储器51;以及耦接至该存储器51的处理器52。存储器51用于存储执行信息推荐方法对应实施例的指令。处理器52被配置为基于存储在存储器51中的指令,执行本公开中任意一些实施例中的信息推荐方法。As shown in FIG. 5 , the information recommendation apparatus 5 includes a memory 51 ; and a processor 52 coupled to the memory 51 . The memory 51 is used for storing instructions for executing the corresponding embodiments of the information recommendation method. The processor 52 is configured to execute the information recommendation method in any of some embodiments of the present disclosure based on the instructions stored in the memory 51 .
图6是示出用于实现本公开一些实施例的计算机系统的框图。6 is a block diagram illustrating a computer system for implementing some embodiments of the present disclosure.
如图6所示,计算机系统60可以通用计算设备的形式表现。计算机系统60包括存储器610、处理器620和连接不同系统组件的总线600。As shown in FIG. 6, computer system 60 may take the form of a general-purpose computing device. Computer system 60 includes memory 610, processor 620, and bus 600 that connects various system components.
存储器610例如可以包括系统存储器、非易失性存储介质等。系统存储器例如存储有操作系统、应用程序、引导装载程序(Boot Loader)以及其他程序等。系统存储器可以包括易失性存储介质,例如随机存取存储器(RAM)和/或高速缓存存储器。非易失性存储介质例如存储有执行信息推荐方法中的至少一种的对应实施例的指令。非易失性存储介质包括但不限于磁盘存储器、光学存储器、闪存等。 Memory 610 may include, for example, system memory, non-volatile storage media, and the like. The system memory stores, for example, an operating system, an application program, a boot loader (Boot Loader), and other programs. System memory may include volatile storage media such as random access memory (RAM) and/or cache memory. The non-volatile storage medium stores, for example, instructions for performing the corresponding embodiments of at least one of the information recommendation methods. Non-volatile storage media include, but are not limited to, magnetic disk memory, optical memory, flash memory, and the like.
处理器620可以用通用处理器、数字信号处理器(DSP)、应用专用集成电路(ASIC)、现场可编程门阵列(FPGA)或其它可编程逻辑设备、分立门或晶体管等分立硬件组件方式来实现。相应地,诸如判断模块和确定模块的每个模块,可以通过中央处理器(CPU)运行存储器中执行相应步骤的指令来实现,也可以通过执行相应步骤的专用电路来实现。 Processor 620 may be implemented as a general purpose processor, digital signal processor (DSP), application specific integrated circuit (ASIC), field programmable gate array (FPGA) or other programmable logic device, discrete hardware components such as discrete gates or transistors. accomplish. Correspondingly, each module such as the judging module and the determining module can be implemented by a central processing unit (CPU) running the instructions in the memory for executing the corresponding steps, or can be implemented by a dedicated circuit for executing the corresponding steps.
总线600可以使用多种总线结构中的任意总线结构。例如,总线结构包括但不限于工业标准体系结构(ISA)总线、微通道体系结构(MCA)总线、外围组件互连(PCI)总线。The bus 600 may use any of a variety of bus structures. For example, bus structures include, but are not limited to, Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) bus, Peripheral Component Interconnect (PCI) bus.
计算机系统60还可以包括输入输出接口630、网络接口640、存储接口650等。这些接口630、640、650以及存储器610和处理器620之间可以通过总线600连接。输入输出接口630可以为显示器、鼠标、键盘等输入输出设备提供连接接口。网络接口640为各种联网设备提供连接接口。存储接口650为软盘、U盘、SD卡等外部存储设备提供连接接口。The computer system 60 may also include an input-output interface 630, a network interface 640, a storage interface 650, and the like. The interfaces 630 , 640 , 650 and the memory 610 and the processor 620 can be connected through the bus 600 . The input and output interface 630 may provide a connection interface for input and output devices such as a monitor, a mouse, and a keyboard. Network interface 640 provides a connection interface for various networked devices. The storage interface 650 provides a connection interface for external storage devices such as a floppy disk, a U disk, and an SD card.
这里,参照根据本公开实施例的方法、装置和计算机程序产品的流程图和/或框图描述了本公开的各个方面。应当理解,流程图和/或框图的每个框以及各框的组合,都可以由计算机可读程序指令实现。Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks, can be implemented by computer readable program instructions.
这些计算机可读程序指令可提供到通用计算机、专用计算机或其他可编程装置的处理器,以产生一个机器,使得通过处理器执行指令产生实现在流程图和/或框图中一个或多个框中指定的功能的装置。These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer or other programmable device to produce a machine such that execution of the instructions by the processor produces one or more blocks in the flowchart and/or block diagrams device with the specified function.
这些计算机可读程序指令也可存储在计算机可读存储器中,这些指令使得计算机以特定方式工作,从而产生一个制造品,包括实现在流程图和/或框图中一个或多个框中指定的功能的指令。Also stored in computer readable memory are these computer readable program instructions, which cause the computer to operate in a particular manner resulting in an article of manufacture including implementing the functions specified in one or more blocks of the flowchart and/or block diagrams instruction.
本公开可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。The present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects.
通过上述实施例中的信息推荐方法及装置、计算机可存储介质,可以提高推荐的准确性。Through the information recommendation method and device, and the computer-storable medium in the above-mentioned embodiments, the accuracy of recommendation can be improved.
至此,已经详细描述了根据本公开的信息推荐方法及装置、计算机可存储介质。为了避免遮蔽本公开的构思,没有描述本领域所公知的一些细节。本领域技术人员根据上面的描述,完全可以明白如何实施这里公开的技术方案。So far, the information recommendation method and apparatus, and the computer-storable medium according to the present disclosure have been described in detail. Some details that are well known in the art are not described in order to avoid obscuring the concept of the present disclosure. Those skilled in the art can fully understand how to implement the technical solutions disclosed herein based on the above description.

Claims (17)

  1. 一种信息推荐方法,包括:An information recommendation method, including:
    根据待处理关键词,确定至少一个目标关键词及其目标向量,目标关键词与目标向量一一对应;According to the keywords to be processed, at least one target keyword and its target vector are determined, and the target keyword and the target vector are in one-to-one correspondence;
    对于每个目标关键词,从与所述待处理关键词相关的对象所对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词;For each target keyword, obtain a plurality of reference keywords belonging to the same keyword category as each target keyword from the historical data corresponding to the object related to the keyword to be processed;
    根据与所述多个参考关键词对应的多个参考向量,确定与所述对象对应的综合向量,参考关键词与参考向量一一对应;According to a plurality of reference vectors corresponding to the plurality of reference keywords, a comprehensive vector corresponding to the object is determined, and the reference keywords and the reference vectors are in one-to-one correspondence;
    根据所述综合向量与所述每个目标向量之间的相似度,推荐与所述待处理关键词相关的信息。According to the similarity between the comprehensive vector and each target vector, information related to the keyword to be processed is recommended.
  2. 根据权利要求1所述的信息推荐方法,其中,确定与所述对象对应的综合向量包括:The information recommendation method according to claim 1, wherein determining the comprehensive vector corresponding to the object comprises:
    对于所述对象,根据对应的历史数据,确定与所述每个参考关键词对应的转化数据;For the object, according to the corresponding historical data, determine the conversion data corresponding to each of the reference keywords;
    根据所述多个参考向量及其对应的转化数据,确定所述综合向量。The integrated vector is determined according to the plurality of reference vectors and their corresponding transformation data.
  3. 根据权利要求2所述的信息推荐方法,其中,根据所述多个参考向量及其对应的转化数据,确定所述综合向量包括:The information recommendation method according to claim 2, wherein determining the comprehensive vector according to the plurality of reference vectors and their corresponding transformation data comprises:
    根据与每个参考关键词对应的转化数据,确定相应的转化权重;According to the conversion data corresponding to each reference keyword, determine the corresponding conversion weight;
    利用多个转化权重,对所述多个参考向量进行加权操作,得到所述综合向量。Using a plurality of transformation weights, a weighting operation is performed on the plurality of reference vectors to obtain the comprehensive vector.
  4. 根据权利要求3所述的信息推荐方法,其中,所述对象为带货用户,所述历史数据为历史带货数据,所述带货用户的历史带货数据包括历史时间段内每天的带货数据,所述每天的带货数据包括与每个参考关键词对应的带货物品在一天内的点击量、订单量、点击粉丝量和在线粉丝量中的至少一种,所述点击粉丝量为一天内所述带货用户的在线粉丝中点击所述带货物品的粉丝数量,The information recommendation method according to claim 3, wherein the object is a user with goods, the historical data is historical data with goods, and the historical data with goods of the user with goods includes daily goods in a historical time period. Data, the daily delivery data includes at least one of the click volume, order volume, click fan volume and online fan volume of the delivered product corresponding to each reference keyword in one day, and the click volume is: The number of fans who click on the item with the item among the online fans of the user with the item in one day,
    所述转化数据包括所述历史时间段内每天的点击通过率、转化率和粉丝活跃率中的至少一种,所述点击通过率与所述点击量成正相关且与所述在线粉丝量成负相关,所述转化率与所述订单量成正相关且与所述点击量成负相关,所述粉丝活跃率为点击粉丝量与在线粉丝量之间的比值。The conversion data includes at least one of click-through rate, conversion rate, and fan activity rate for each day in the historical time period, and the click-through rate is positively correlated with the click volume and negatively correlated with the online fan volume Correlation, the conversion rate is positively correlated with the order volume and negatively correlated with the click volume, and the fan activity rate is a ratio between the click volume and the online number of followers.
  5. 根据权利要求4所述的信息推荐方法,其中,所述带货用户为多个,所述点击通过率与平均点击量成正相关且与平均在线粉丝量成负相关,所述平均点击量为所述多个带货用户的点击量的平均值,所述平均在线粉丝量为所述多个带货用户的在线粉丝量的平均值;The information recommendation method according to claim 4, wherein there are multiple users with goods, the click-through rate is positively correlated with the average click volume and negatively correlated with the average online fan volume, and the average click volume is the The average value of the clicks of the multiple users with goods, and the average online fans is the average of the online fans of the multiple users with goods;
    所述转化率与平均订单量成正相关且与所述平均点击量成负相关,所述平均订单量为所述多个带货用户的订单量的平均值。The conversion rate is positively correlated with the average order volume and negatively correlated with the average click volume, and the average order volume is an average value of the order volumes of the plurality of users with goods.
  6. 根据权利要求4所述的信息推荐方法,其中,确定相应的转化权重包括:The information recommendation method according to claim 4, wherein determining the corresponding conversion weight comprises:
    将所述点击通过率、所述转化率和所述粉丝活跃率的平均值,确定为转化权重。The average value of the click-through rate, the conversion rate, and the fan activity rate is determined as the conversion weight.
  7. 根据权利要求1所述的信息推荐方法,其中,与所述对象对应的历史数据包括该对象衍生的至少一组订单数据,每组订单数据包括多个物品的物品标识,从与所述待处理关键词相关的对象对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词包括:The information recommendation method according to claim 1, wherein the historical data corresponding to the object includes at least one group of order data derived from the object, and each group of order data includes item identifiers of a plurality of items, from which data are to be processed. In the historical data corresponding to the keyword-related objects, acquiring a plurality of reference keywords belonging to the same keyword category as each target keyword includes:
    根据所述对象衍生的每组订单数据,确定多个物品组,每个物品组包括两个不同物品的物品标识;Determine a plurality of item groups according to each group of order data derived from the object, and each item group includes item identifiers of two different items;
    根据所述对象衍生的每组订单数据,确定每个物品组中、与物品标识对应的两个物品之间的关联权重值,关联权重值越大,两个物品之间的相关性越强;According to each group of order data derived from the object, determine the association weight value between the two items corresponding to the item identifier in each item group, the larger the association weight value, the stronger the correlation between the two items;
    以所述多个物品组中的物品标识为顶点、关联权重值为边,构建图模型;Constructing a graph model with item identifiers in the plurality of item groups as vertices and associated weights as edges;
    利用所述图模型,从与所述对象对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词。Using the graph model, a plurality of reference keywords belonging to the same keyword category as each target keyword are acquired from historical data corresponding to the object.
  8. 根据权利要求7所述的信息推荐方法,其中,所述订单数据还包括与物品标识对应的其他订单参数,利用所述图模型,从与所述对象对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词包括:The information recommendation method according to claim 7, wherein the order data further includes other order parameters corresponding to the item identifiers, and the graph model is used to obtain information related to each item from historical data corresponding to the object by using the graph model. Multiple reference keywords belonging to the same keyword category for each target keyword include:
    利用随机游走算法,对所述图模型进行随机游走,直到满足游走停止条件,得到至少一个物品标识,作为参考物品标识;A random walk algorithm is used to randomly walk the graph model until the walking stop condition is met, and at least one item identifier is obtained as a reference item identifier;
    对于所述每个目标关键词,从所述对象衍生的每组订单数据中,选择与所述每个目标关键词属于同一关键词类别、且与所述参考物品标识对应的物品标识或其他订单参数,作为参考关键词。For each target keyword, from each group of order data derived from the object, select an item ID or other order that belongs to the same keyword category as each target keyword and corresponds to the reference item ID parameters, as reference keywords.
  9. 根据权利要求7所述的信息推荐方法,其中,订单数据还包括订单日期,确定每个物品组中的与物品标识对应的两个物品之间的关联权重值包括:The information recommendation method according to claim 7, wherein the order data further includes an order date, and determining the association weight value between the two items corresponding to the item identifiers in each item group comprises:
    根据每个物品组中、与两个物品标识对应的订单日期之间的差值,确定每个物品组中的与物品标识对应的两个物品之间的关联权重值,关联权重值与订单日期之间的差值成负相关。According to the difference between the order dates corresponding to the two item IDs in each item group, determine the association weight value between the two items corresponding to the item ID in each item group, the association weight value and the order date The difference between them is negatively correlated.
  10. 根据权利要求7所述的信息推荐方法,其中,订单数据还包括订单日期,根据与每个带货用户对应的订单数据,确定多个带货物品组包括:The information recommendation method according to claim 7, wherein the order data further includes an order date, and according to the order data corresponding to each user with goods, determining the plurality of groups of goods with goods includes:
    根据每组订单数据,确定至少一个物品集合,每个物品集合包括多个按订单先后顺序排列的多个物品标识,同一物品集合中的多个物品标识所对应的最晚订单日期与最早订单日期之间的差值小于日期阈值;According to each group of order data, at least one item set is determined, each item set includes multiple item identifiers arranged in order of order, and the latest order date and earliest order date corresponding to multiple item identifiers in the same item set The difference between is less than the date threshold;
    针对每个物品集合,将相邻的两个物品标识划分到一个物品组,以得到所述多个物品组。For each item set, two adjacent item identifiers are divided into one item group to obtain the plurality of item groups.
  11. 根据权利要求1所述的信息推荐方法,其中,推荐与所述待处理关键词相关的信息包括:The information recommendation method according to claim 1, wherein recommending the information related to the keyword to be processed comprises:
    在所述对象为多个对象的情况下,从所述多个对象中,选择待推荐对象,并推荐所述待推荐对象的相关信息。When the objects are multiple objects, an object to be recommended is selected from the multiple objects, and relevant information of the object to be recommended is recommended.
  12. 根据权利要求1所述的信息推荐方法,其中,在所述对象为一个对象的情况下,所述至少一个目标关键词包括多个目标关键词,推荐与所述待处理关键词相关的信息包括:The information recommendation method according to claim 1, wherein, when the object is one object, the at least one target keyword includes a plurality of target keywords, and recommending information related to the keyword to be processed includes :
    在所述对象为一个对象的情况下,从所述多个目标关键词中,选择待推荐关键词,并推荐所述待推荐关键词的相关信息。In the case that the object is one object, a keyword to be recommended is selected from the plurality of target keywords, and related information of the keyword to be recommended is recommended.
  13. 根据权利要求1所述的信息推荐方法,其中,根据待处理关键词,确定至少一个目标关键词包括:The information recommendation method according to claim 1, wherein, according to the keywords to be processed, determining at least one target keyword comprises:
    根据与所述待处理关键词对应的关键词类别,将所述待处理关键词转化为至少一个目标关键词。Convert the keyword to be processed into at least one target keyword according to the keyword category corresponding to the keyword to be processed.
  14. 根据权利要求1所述的信息推荐方法,其中,在所述对象为带货用户的情况下,所述关键词类别包括用户标识类、物品标识类、品牌标识类、店铺标识类和品类。The information recommendation method according to claim 1, wherein, when the object is a user with goods, the keyword categories include user identification category, item identification category, brand identification category, store identification category and category category.
  15. 一种信息推荐装置,包括:An information recommendation device, comprising:
    第一确定模块,被配置为根据待处理关键词,确定至少一个目标关键词及其目标向量,目标关键词与目标向量一一对应;The first determining module is configured to determine at least one target keyword and its target vector according to the keywords to be processed, and the target keywords and target vectors correspond one-to-one;
    获取模块,被配置为对于每个目标关键词,从与所述待处理关键词相关的对象对应的历史数据中,获取与所述每个目标关键词属于同一关键词类别的多个参考关键词;The acquisition module is configured to, for each target keyword, acquire a plurality of reference keywords belonging to the same keyword category as each target keyword from historical data corresponding to objects related to the keyword to be processed ;
    第二确定模块,被配置为根据与所述多个参考关键词对应的多个参考向量,确定与所述对象对应的综合向量,参考关键词与参考向量一一对应;The second determining module is configured to determine a comprehensive vector corresponding to the object according to a plurality of reference vectors corresponding to the plurality of reference keywords, and the reference keywords and the reference vectors are in one-to-one correspondence;
    推荐模块,被配置为根据所述综合向量与所述每个目标向量之间的相似度,推荐与所述待处理关键词相关的信息。The recommendation module is configured to recommend information related to the keyword to be processed according to the similarity between the comprehensive vector and each target vector.
  16. 一种信息推荐装置,包括:An information recommendation device, comprising:
    存储器;以及memory; and
    耦接至所述存储器的处理器,所述处理器被配置为基于存储在所述存储器的指令,执行如权利要求1至14任一项所述的信息推荐方法。A processor coupled to the memory, the processor configured to perform the information recommendation method of any one of claims 1 to 14 based on instructions stored in the memory.
  17. 一种计算机可存储介质,其上存储有计算机程序指令,该指令被处理器执行时实现如权利要求1至14任一项所述的信息推荐方法。A computer-storable medium having computer program instructions stored thereon, the instructions, when executed by a processor, implement the information recommendation method as claimed in any one of claims 1 to 14.
PCT/CN2022/085053 2021-04-15 2022-04-02 Information recommendation method and device and computer storable medium WO2022218181A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110405818.1 2021-04-15
CN202110405818.1A CN113111182A (en) 2021-04-15 2021-04-15 Information recommendation method and device and computer-readable storage medium

Publications (1)

Publication Number Publication Date
WO2022218181A1 true WO2022218181A1 (en) 2022-10-20

Family

ID=76717194

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/085053 WO2022218181A1 (en) 2021-04-15 2022-04-02 Information recommendation method and device and computer storable medium

Country Status (2)

Country Link
CN (1) CN113111182A (en)
WO (1) WO2022218181A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113111182A (en) * 2021-04-15 2021-07-13 北京沃东天骏信息技术有限公司 Information recommendation method and device and computer-readable storage medium
CN117332160B (en) * 2023-12-01 2024-02-09 中航信移动科技有限公司 Multi-target identification display method, storage medium and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102008696B1 (en) * 2019-03-27 2019-08-08 주식회사 페이크럭스컴퍼니 Apparatus and method for recommending goods based on clustering of transaction history data
CN110365996A (en) * 2019-07-25 2019-10-22 深圳市元征科技股份有限公司 Management method, live streaming management platform, electronic equipment and storage medium is broadcast live
CN111523918A (en) * 2019-02-02 2020-08-11 北京极智嘉科技有限公司 Commodity clustering method, commodity clustering device, commodity clustering equipment and storage medium
CN112488781A (en) * 2020-11-10 2021-03-12 北京三快在线科技有限公司 Search recommendation method and device, electronic equipment and readable storage medium
CN112508594A (en) * 2020-10-13 2021-03-16 有米科技股份有限公司 Method and device for estimating commodity sales volume attribute of video with goods
CN113111182A (en) * 2021-04-15 2021-07-13 北京沃东天骏信息技术有限公司 Information recommendation method and device and computer-readable storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11076189B2 (en) * 2009-03-30 2021-07-27 Time Warner Cable Enterprises Llc Personal media channel apparatus and methods
CN107194769A (en) * 2017-05-17 2017-09-22 东莞市华睿电子科技有限公司 A kind of Method of Commodity Recommendation that content is searched for based on user
CN110399479A (en) * 2018-04-20 2019-11-01 北京京东尚科信息技术有限公司 Search for data processing method, device, electronic equipment and computer-readable medium
CN111159555A (en) * 2019-12-30 2020-05-15 北京每日优鲜电子商务有限公司 Commodity recommendation method, commodity recommendation device, server and storage medium
CN112637629B (en) * 2020-12-25 2023-06-20 百度在线网络技术(北京)有限公司 Live content recommendation method and device, electronic equipment and medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111523918A (en) * 2019-02-02 2020-08-11 北京极智嘉科技有限公司 Commodity clustering method, commodity clustering device, commodity clustering equipment and storage medium
KR102008696B1 (en) * 2019-03-27 2019-08-08 주식회사 페이크럭스컴퍼니 Apparatus and method for recommending goods based on clustering of transaction history data
CN110365996A (en) * 2019-07-25 2019-10-22 深圳市元征科技股份有限公司 Management method, live streaming management platform, electronic equipment and storage medium is broadcast live
CN112508594A (en) * 2020-10-13 2021-03-16 有米科技股份有限公司 Method and device for estimating commodity sales volume attribute of video with goods
CN112488781A (en) * 2020-11-10 2021-03-12 北京三快在线科技有限公司 Search recommendation method and device, electronic equipment and readable storage medium
CN113111182A (en) * 2021-04-15 2021-07-13 北京沃东天骏信息技术有限公司 Information recommendation method and device and computer-readable storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LIU, PINGSHENG ET AL.: "Research on the Influencing Mechanism of Live Broadcasting Marketing Pattern on Consumers' Purchase Decision", CHINA BUSINESS AND MARKET, vol. 10, no. 34, 13 October 2020 (2020-10-13), pages 38 - 47, XP009540361, ISSN: 1007-8266 *
SU XIUMEI: "An Empirical Study on the Influencing Factors of E-Commerce Live Streaming", 2019 INTERNATIONAL CONFERENCE ON ECONOMIC MANAGEMENT AND MODEL ENGINEERING (ICEMME), IEEE, 6 December 2019 (2019-12-06), pages 492 - 496, XP033707465, DOI: 10.1109/ICEMME49371.2019.00103 *

Also Published As

Publication number Publication date
CN113111182A (en) 2021-07-13

Similar Documents

Publication Publication Date Title
WO2022218181A1 (en) Information recommendation method and device and computer storable medium
US9892187B2 (en) Data analysis method, data analysis device, and storage medium storing processing program for same
CN105574089B (en) Knowledge graph generation method and device, and object comparison method and device
CN104252456B (en) A kind of weight method of estimation, apparatus and system
CN111724238B (en) Method, device and equipment for evaluating product recommendation accuracy and storage medium
US8655737B1 (en) Brand name synonymy
US20210073890A1 (en) Catalog-based image recommendations
CN105354202A (en) Data pushing method and apparatus
CN113032668A (en) Product recommendation method, device and equipment based on user portrait and storage medium
CN112818226A (en) Data processing method, recommendation device, electronic equipment and storage medium
CN111967924A (en) Commodity recommendation method, commodity recommendation device, computer device, and medium
CN110569446B (en) Method and system for constructing recommended object candidate set
Liu et al. Research of commodity recommendation workflow based on LSH algorithm
CN110427358B (en) Data cleaning method and device and information recommendation method and device
Johnson et al. K-strange points clustering algorithm
Wang et al. MIC-KMeans: a maximum information coefficient based high-dimensional clustering algorithm
Kapoor Data mining: Past, present and future scenario
CN111488479B (en) Hypergraph construction method and device, computer system and medium
Xu et al. Content-based recommendation in e-commerce
Nagpal et al. Analyzing software effort estimation using k means clustered regression approach
CN107622048B (en) Text mode recognition method and system
JP6499372B1 (en) PROCESSING SYSTEM, PROCESSING DEVICE, PROCESSING METHOD, PROGRAM, AND INFORMATION RECORDING MEDIUM
Mehrazma et al. Customer Clustering by Combining the Particle Swarm And K-Means Algorithms and Analyzing Their Behavior on Commercial Websites
Trihatmaja et al. Improving the performance of collaborative filtering using outlier labeling, clustering, and association rule mining
Ciesielczyk Using mutual information for feature selection in programmatic advertising

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22787408

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 080224)