WO2019119635A1 - Procédé de développement d'utilisateur initial, dispositif électronique et support de stockage lisible par ordinateur - Google Patents

Procédé de développement d'utilisateur initial, dispositif électronique et support de stockage lisible par ordinateur Download PDF

Info

Publication number
WO2019119635A1
WO2019119635A1 PCT/CN2018/076181 CN2018076181W WO2019119635A1 WO 2019119635 A1 WO2019119635 A1 WO 2019119635A1 CN 2018076181 W CN2018076181 W CN 2018076181W WO 2019119635 A1 WO2019119635 A1 WO 2019119635A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
seed
expanded
seed user
users
Prior art date
Application number
PCT/CN2018/076181
Other languages
English (en)
Chinese (zh)
Inventor
安欣
许开河
王建明
肖京
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2019119635A1 publication Critical patent/WO2019119635A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering

Definitions

  • the present application relates to the field of computer information technology, and in particular, to a seed user extension method, an electronic device, and a computer readable storage medium.
  • the present application proposes a seed user extension method, an electronic device, and a computer readable storage medium, which combines an unsupervised learning clustering method with a distance similarity algorithm to reduce the computational complexity of seed user expansion and improve The accuracy of the expansion.
  • the present application provides an electronic device including a memory, a processor, and a seed user extension system stored on the memory and operable on the processor, the seed user
  • the expansion system is executed by the processor, the following steps are implemented:
  • Clustering analysis is performed on a predetermined number of seed users by a preset clustering method, and the seed user is divided into a plurality of seed user communities having specific characteristics;
  • the user to be expanded is divided into the specific seed user community
  • the number of the users to be expanded into the specific seed user community is counted and sorted, and the expansion rules for expanding the users to be expanded into seed users are determined according to the ranking result.
  • the specific features include the geographic location of the user, whether it is a registered user, whether a particular product has been purchased.
  • the calculating the similarity between the user to be extended and the community of each seed user comprises: calculating the similarity between the user to be expanded and the center point of each seed user, as the similarity between the user to be expanded and each seed user community.
  • the extension rule is set to: select a specified number of users to be expanded according to the order of the number from high to low, and expand the selected users to be expanded into seed users.
  • the extension rule is configured to expand the user to be extended into a seed user if the number of the users to be expanded into a specific seed user community is greater than or equal to a second preset threshold, wherein the second pre- Let the threshold be set to a predetermined ratio of the total number of all seed user communities.
  • the present application further provides a seed user extension method, which is applied to an electronic device, and the method includes:
  • Clustering analysis is performed on a predetermined number of seed users by a preset clustering method, and the seed user is divided into a plurality of seed user communities having specific characteristics;
  • the user to be expanded is divided into the specific seed user community
  • the number of the users to be expanded into the specific seed user community is counted and sorted, and the expansion rules for expanding the users to be expanded into seed users are determined according to the ranking result.
  • the specific feature includes a geographical location of the user, whether it is a registered user, whether a specific product has been purchased;
  • the calculating the similarity between the user to be expanded and each seed user community includes:
  • the similarity between the user to be expanded and the center point of each seed user community is calculated as the similarity between the user to be expanded and each seed user community.
  • the extension rule is set to: select a specified number of users to be expanded according to the order of the number from high to low, and expand the selected users to be expanded into seed users.
  • the extension rule is configured to expand the user to be extended into a seed user if the number of the users to be expanded into a specific seed user community is greater than or equal to a second preset threshold, wherein the second pre- Let the threshold be set to a predetermined ratio of the total number of all seed user communities.
  • the present application further provides a computer readable storage medium storing a seed user extension system, the seed user extension system being executable by at least one processor, such that The at least one processor performs the steps of the seed user extension method as described above.
  • the electronic device, the seed user extension method and the computer readable storage medium proposed by the present application combine the unsupervised learning clustering method with the distance similarity algorithm to reduce the computational complexity of the seed user expansion. And improve the accuracy of the expansion.
  • 1 is a schematic diagram of an optional hardware architecture of an electronic device of the present application
  • FIG. 2 is a schematic diagram of a program module of an embodiment of a seed user extension system in an electronic device of the present application
  • FIG. 3 is a schematic diagram of an implementation process of an embodiment of a seed user extension method according to the present application.
  • FIG. 1 it is a schematic diagram of an optional hardware architecture of the electronic device 2 of the present application.
  • the electronic device 2 may include, but is not limited to, a memory 21, a processor 22, and a network interface 23 that can communicate with each other through a system bus. It is pointed out that FIG. 1 only shows the electronic device 2 with the components 21-23, but it should be understood that not all illustrated components are required to be implemented, and more or fewer components may be implemented instead.
  • the electronic device 2 may be a computing device such as a rack server, a blade server, a tower server, or a rack server.
  • the electronic device 2 may be an independent server or a server cluster composed of multiple servers. .
  • the memory 21 includes at least one type of readable storage medium including a flash memory, a hard disk, a multimedia card, a card type memory (eg, SD or DX memory, etc.), a random access memory (RAM), a static Random access memory (SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, and the like.
  • the memory 21 may be an internal storage unit of the electronic device 2, such as a hard disk or memory of the electronic device 2.
  • the memory 21 may also be an external storage device of the electronic device 2, such as a plug-in hard disk equipped on the electronic device 2, a smart memory card (SMC), and a secure digital device. (Secure Digital, SD) card, flash card, etc.
  • the memory 21 may also include both an internal storage unit of the electronic device 2 and an external storage device thereof.
  • the memory 21 is generally used to store an operating system installed in the electronic device 2 and various types of application software, such as program code of the seed user extension system 20. Further, the memory 21 can also be used to temporarily store various types of data that have been output or are to be output.
  • the processor 22 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor, or other data processing chip in some embodiments.
  • the processor 22 is typically used to control the overall operation of the electronic device 2, such as performing control and processing related to data interaction or communication with the electronic device 2.
  • the processor 22 is configured to run program code or process data stored in the memory 21, such as running the seed user extension system 20 and the like.
  • the network interface 23 may comprise a wireless network interface or a wired network interface, which is typically used to establish a communication connection between the electronic device 2 and other electronic devices.
  • the network interface 23 is configured to connect the electronic device 2 to an external data platform through a network, and establish a data transmission channel and a communication connection between the electronic device 2 and an external data platform.
  • the network may be an intranet, an Internet, a Global System of Mobile communication (GSM), a Wideband Code Division Multiple Access (WCDMA), a 4G network, or a 5G network.
  • Wireless or wired networks such as network, Bluetooth, Wi-Fi, etc.
  • the seed user extension system 20 may be divided into one or more program modules, the one or more program modules being stored in the memory 21 and being processed by one or more processors ( This embodiment is executed by the processor 22) to complete the application.
  • the seed user extension system 20 can be divided into an analysis module 201, a calculation module 202, and an expansion module 203.
  • a program module as referred to in the present application refers to a series of computer program instruction segments capable of performing a specific function, and is more suitable than the program to describe the execution process of the seed user extension system 20 in the electronic device 2. The function of each program module 201-203 will be described in detail below.
  • the analyzing module 201 is configured to perform cluster analysis on a predetermined number (which may be a large data level) of seed users by using a preset clustering method (such as an unsupervised learning K-means clustering method), and the seed is The user is divided into a number of seed user communities with specific characteristics (or salient features).
  • the specific features include, but are not limited to, the geographic location of the user, whether it is a registered user, whether a specific product has been purchased (such as a property insurance), and the like.
  • the K-means clustering method includes the following steps:
  • k data objects are arbitrarily selected from N data objects (ie, N seed users) as cluster centers (ie, initial cluster centers);
  • (A2) Calculate the similarity of each remaining data object to each cluster center for the remaining data objects (such as Euclidean distance, the smaller the distance, the higher the similarity), and each according to the similarity
  • the remaining data objects are assigned to clusters represented by cluster centers that are most similar to them;
  • (A4) Iterate the steps A2 to A3 until the preset standard measure function begins to converge.
  • the mean square error can be used as a preset standard measure function.
  • the calculating module 202 is configured to calculate, by using a preset similarity calculation method, a similarity between the user(s) to be extended and each seed user community (ie, a seed user community having a specific feature).
  • the predetermined similarity calculation method may be: a similarity calculation method such as an Euclidean distance, an included cosine, and a Hamming distance.
  • the calculating the similarity between the user to be extended and each seed user community comprises: calculating the similarity between the user to be expanded and the center point of each seed user community (calculation method and user similarity The calculation method is consistent) as the similarity between the user to be expanded and each seed user community.
  • the seed user community is a set of users with similar characteristics, and each set is aggregated to a central point, which is the center point of the seed user community. Since there is no need to calculate similarities with each user in the seed user community, the computational complexity can be greatly reduced.
  • Representing the cosine similarity of the user a to be expanded and the center point b of a seed user community Represents the rating vector of the user a to be expanded, A scoring vector representing the center point b of a seed user community.
  • the expansion module 203 is configured to divide the user to be expanded into the specific seed user community if the similarity between the user to be expanded and the specific seed user community is greater than or equal to a first preset threshold (eg, 80%).
  • a first preset threshold eg, 80%
  • the seed user community obtained after cluster analysis includes three: B1, B2, and B3, and the similarity S1 of the user A to be expanded and the seed user community B1 is 60%, and the similarity with the seed user community B2 is S2.
  • the user A to be expanded is divided into specific seed user communities B2 and B3.
  • the expansion module 203 is further configured to count and sort the number of the users to be expanded into a specific seed user community, and determine an extension rule for expanding the user to be expanded into a seed user according to the ranking result. Among them, the higher the ranking, the higher the similarity.
  • the extension rule is set to select a user to be expanded according to a specified number (such as the first two digits) according to the number from the highest to the lowest, and expand the selected user to be expanded into a seed user.
  • a specified number such as the first two digits
  • the user A to be expanded is divided into five seed user communities at the same time, and the user B to be expanded is divided into three seed user communities at the same time. If the extended user C is simultaneously divided into two seed user communities, the users A and B will be expanded. Expanded to seed users.
  • the extension rule may be further configured to: if the number of users to be expanded into a specific seed user community is greater than or equal to a second preset threshold, expand the user to be expanded into a seed user.
  • the second preset threshold may be set to a predetermined ratio (eg, 50%) of the total number of all seed user communities (ie, seed user communities having specific characteristics). For example, assuming that the total number of all seed user communities is 4 and the predetermined ratio is 50%, the second preset threshold is 2.
  • the seed user extension system 20 proposed by the present application combines the unsupervised learning clustering method with the distance similarity algorithm, thereby reducing the computational complexity of the seed user expansion and improving the accuracy of the expansion. degree.
  • the present application also proposes a seed user extension method.
  • FIG. 3 it is a schematic flowchart of an implementation process of an embodiment of a seed user extension method of the present application.
  • the order of execution of the steps in the flowchart shown in FIG. 3 may be changed according to different requirements, and some steps may be omitted.
  • Step S31 performing cluster analysis on a predetermined number of seed users (which may be a large data level) by a preset clustering method (such as an unsupervised learning K-means clustering method), and dividing the seed user into several A seed user community with specific characteristics (or salient features).
  • the specific features include, but are not limited to, the geographic location of the user, whether it is a registered user, whether a specific product has been purchased (such as a property insurance), and the like.
  • the K-means clustering method includes the following steps:
  • k data objects are arbitrarily selected from N data objects (ie, N seed users) as cluster centers (ie, initial cluster centers);
  • (A2) Calculate the similarity of each remaining data object to each cluster center for the remaining data objects (such as Euclidean distance, the smaller the distance, the higher the similarity), and each according to the similarity
  • the remaining data objects are assigned to clusters represented by cluster centers that are most similar to them;
  • (A4) Iterate the steps A2 to A3 until the preset standard measure function begins to converge.
  • the mean square error can be used as a preset standard measure function.
  • Step S32 Calculate the similarity between the user(s) to be extended and each seed user community (ie, a seed user community with a specific feature) by a preset similarity calculation method.
  • the predetermined similarity calculation method may be: a similarity calculation method such as an Euclidean distance, an included cosine, and a Hamming distance.
  • the calculating the similarity between the user to be extended and each seed user community comprises: calculating the similarity between the user to be expanded and the center point of each seed user community (calculation method and user similarity The calculation method is consistent) as the similarity between the user to be expanded and each seed user community.
  • the seed user community is a set of users with similar characteristics, and each set is aggregated to a central point, which is the center point of the seed user community. Since there is no need to calculate similarities with each user in the seed user community, the computational complexity can be greatly reduced.
  • Representing the cosine similarity of the user a to be expanded and the center point b of a seed user community Represents the rating vector of the user a to be expanded, A scoring vector representing the center point b of a seed user community.
  • step S33 if the similarity between the user to be expanded and the specific seed user community is greater than or equal to a first preset threshold (such as 80%), the user to be expanded is divided into the specific seed user community.
  • a first preset threshold such as 80%
  • the seed user community obtained after cluster analysis includes three: B1, B2, and B3, and the similarity S1 of the user A to be expanded and the seed user community B1 is 60%, and the similarity with the seed user community B2 is S2.
  • the user A to be expanded is divided into specific seed user communities B2 and B3.
  • step S34 the number of the users to be expanded into the specific seed user community is counted and sorted, and the expansion rule of expanding the user to be expanded into the seed user is determined according to the ranking result. Among them, the higher the ranking, the higher the similarity.
  • the extension rule is set to select a user to be expanded according to a specified number (such as the first two digits) according to the number from the highest to the lowest, and expand the selected user to be expanded into a seed user.
  • a specified number such as the first two digits
  • the user A to be expanded is divided into five seed user communities at the same time, and the user B to be expanded is divided into three seed user communities at the same time. If the extended user C is simultaneously divided into two seed user communities, the users A and B will be expanded. Expanded to seed users.
  • the extension rule may be further configured to: if the number of users to be expanded into a specific seed user community is greater than or equal to a second preset threshold, expand the user to be expanded into a seed user.
  • the second preset threshold may be set to a predetermined ratio (eg, 50%) of the total number of all seed user communities (ie, seed user communities having specific characteristics). For example, assuming that the total number of all seed user communities is 4 and the predetermined ratio is 50%, the second preset threshold is 2.
  • the seed user extension method proposed by the present application combines the unsupervised learning clustering method with the distance similarity algorithm, thereby reducing the computational complexity of the seed user expansion and improving the accuracy of the expansion.
  • the present application further provides a computer readable storage medium (such as a ROM/RAM, a magnetic disk, an optical disk), the computer readable storage medium storing a seed user extension system 20, the seed user
  • the extension system 20 can be executed by at least one processor 22 to cause the at least one processor 22 to perform the steps of the seed user extension method as described below.
  • the specific features include, but are not limited to, the geographic location of the user, whether it is a registered user, whether a specific product has been purchased (such as a property insurance), and the like.
  • the K-means clustering method includes the following steps:
  • k data objects are arbitrarily selected from N data objects (ie, N seed users) as cluster centers (ie, initial cluster centers);
  • (A2) Calculate the similarity of each remaining data object to each cluster center for the remaining data objects (such as Euclidean distance, the smaller the distance, the higher the similarity), and each according to the similarity
  • the remaining data objects are assigned to clusters represented by cluster centers that are most similar to them;
  • (A4) Iterate the steps A2 to A3 until the preset standard measure function begins to converge.
  • the mean square error can be used as a preset standard measure function.
  • the predetermined similarity calculation method may be: a similarity calculation method such as an Euclidean distance, an included cosine, and a Hamming distance.
  • the calculating the similarity between the user to be extended and each seed user community comprises: calculating the similarity between the user to be expanded and the center point of each seed user community (calculation method and user similarity The calculation method is consistent) as the similarity between the user to be expanded and each seed user community.
  • the seed user community is a set of users with similar characteristics, and each set is aggregated to a central point, which is the center point of the seed user community. Since there is no need to calculate similarities with each user in the seed user community, the computational complexity can be greatly reduced.
  • Representing the cosine similarity of the user a to be expanded and the center point b of a seed user community Represents the rating vector of the user a to be expanded, A scoring vector representing the center point b of a seed user community.
  • the seed user community obtained after cluster analysis includes three: B1, B2, and B3, and the similarity S1 of the user A to be expanded and the seed user community B1 is 60%, and the similarity with the seed user community B2 is S2.
  • the user A to be expanded is divided into specific seed user communities B2 and B3.
  • the extension rule is set to select a user to be expanded according to a specified number (such as the first two digits) according to the number from the highest to the lowest, and expand the selected user to be expanded into a seed user.
  • a specified number such as the first two digits
  • the user A to be expanded is divided into five seed user communities at the same time, and the user B to be expanded is divided into three seed user communities at the same time. If the extended user C is simultaneously divided into two seed user communities, the users A and B will be expanded. Expanded to seed users.
  • the extension rule may be further configured to: if the number of users to be expanded into a specific seed user community is greater than or equal to a second preset threshold, expand the user to be expanded into a seed user.
  • the second preset threshold may be set to a predetermined ratio (eg, 50%) of the total number of all seed user communities (ie, seed user communities having specific characteristics). For example, assuming that the total number of all seed user communities is 4 and the predetermined ratio is 50%, the second preset threshold is 2.
  • the computer readable storage medium proposed by the present application combines the unsupervised learning clustering method with the distance similarity algorithm, thereby reducing the computational complexity of the seed user expansion and improving the computational complexity. The accuracy of the expansion.
  • the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and can also be implemented by hardware, but in many cases, the former is A better implementation.
  • the technical solution of the present application which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk,
  • the optical disc includes a number of instructions for causing a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present application.

Abstract

La présente invention concerne un procédé de développement d'utilisateur initial, le procédé comprenant les étapes : la réalisation d'une analyse de groupement sur un nombre prédéterminé d'utilisateurs initiaux en utilisant un procédé de groupement prédéfini, et la division des utilisateurs initiaux en plusieurs communautés d'utilisateurs initiaux qui ont des caractéristiques spécifiques ; le calcul, en utilisant un procédé de calcul de similarité prédéfini, de la similarité entre un utilisateur à développer et chaque communauté d'utilisateurs initiaux ; si la similarité entre l'utilisateur à développer et la communauté spécifique d'utilisateurs initiaux est supérieure ou égale à une première valeur de seuil prédéfinie, la classification de l'utilisateur à développer dans la communauté spécifique d'utilisateurs initiaux ; et la compilation de statistiques du nombre de communautés spécifiques d'utilisateurs initiaux, dans lesquelles chaque utilisateur à développer est classifié, et le classement de celles-ci, et la détermination, selon un résultat de classement, d'une règle de développement pour développer un utilisateur à développer en l'utilisateur initial. La présente invention peut réduire la complexité de calcul d'un développement d'utilisateur initial et améliorer la précision de développement.
PCT/CN2018/076181 2017-12-18 2018-02-10 Procédé de développement d'utilisateur initial, dispositif électronique et support de stockage lisible par ordinateur WO2019119635A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711364792.0A CN107944931A (zh) 2017-12-18 2017-12-18 种子用户拓展方法、电子设备及计算机可读存储介质
CN201711364792.0 2017-12-18

Publications (1)

Publication Number Publication Date
WO2019119635A1 true WO2019119635A1 (fr) 2019-06-27

Family

ID=61943698

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/076181 WO2019119635A1 (fr) 2017-12-18 2018-02-10 Procédé de développement d'utilisateur initial, dispositif électronique et support de stockage lisible par ordinateur

Country Status (2)

Country Link
CN (1) CN107944931A (fr)
WO (1) WO2019119635A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112967100A (zh) * 2021-04-02 2021-06-15 杭州网易云音乐科技有限公司 相似人群扩展方法、装置、计算设备以及介质

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108876470B (zh) * 2018-06-29 2022-03-01 腾讯科技(深圳)有限公司 标签用户扩展方法、计算机设备及存储介质
CN109376764B (zh) * 2018-09-13 2021-12-07 北京字节跳动网络技术有限公司 基于聚类的数据收集方法、装置和计算机可读存储介质
CN111563761A (zh) * 2020-01-19 2020-08-21 深圳前海微众银行股份有限公司 人群拓展方法、装置、设备及存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102737327A (zh) * 2011-03-31 2012-10-17 国际商业机器公司 一种划分客户群集的计算机实现的方法和系统
US20150302436A1 (en) * 2003-08-25 2015-10-22 Thomas J. Reynolds Decision strategy analytics
CN106022800A (zh) * 2016-05-16 2016-10-12 北京百分点信息科技有限公司 一种用户特征数据的处理方法和装置
CN107067045A (zh) * 2017-05-31 2017-08-18 北京京东尚科信息技术有限公司 数据聚类方法、装置、计算机可读介质和电子设备

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101763404B (zh) * 2009-12-10 2012-03-21 陕西鼎泰科技发展有限责任公司 基于模糊聚类的网络文本数据检测方法
US20140136451A1 (en) * 2012-11-09 2014-05-15 Apple Inc. Determining Preferential Device Behavior
CN103177090B (zh) * 2013-03-08 2016-11-23 亿赞普(北京)科技有限公司 一种基于大数据的话题检测方法及装置
CN106021376B (zh) * 2016-05-11 2019-05-10 上海点融信息科技有限责任公司 用于处理用户信息的方法和设备
CN105956628B (zh) * 2016-05-13 2021-01-26 北京京东尚科信息技术有限公司 数据分类方法和用于数据分类的装置
CN106296422A (zh) * 2016-07-29 2017-01-04 重庆邮电大学 一种融合多算法的社交网络垃圾用户检测方法
CN106446078A (zh) * 2016-09-08 2017-02-22 乐视控股(北京)有限公司 一种信息的推荐方法和推荐装置

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150302436A1 (en) * 2003-08-25 2015-10-22 Thomas J. Reynolds Decision strategy analytics
CN102737327A (zh) * 2011-03-31 2012-10-17 国际商业机器公司 一种划分客户群集的计算机实现的方法和系统
CN106022800A (zh) * 2016-05-16 2016-10-12 北京百分点信息科技有限公司 一种用户特征数据的处理方法和装置
CN107067045A (zh) * 2017-05-31 2017-08-18 北京京东尚科信息技术有限公司 数据聚类方法、装置、计算机可读介质和电子设备

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112967100A (zh) * 2021-04-02 2021-06-15 杭州网易云音乐科技有限公司 相似人群扩展方法、装置、计算设备以及介质
CN112967100B (zh) * 2021-04-02 2024-03-15 杭州网易云音乐科技有限公司 相似人群扩展方法、装置、计算设备以及介质

Also Published As

Publication number Publication date
CN107944931A (zh) 2018-04-20

Similar Documents

Publication Publication Date Title
CN109871886B (zh) 基于谱聚类的异常点比例优化方法、装置及计算机设备
TWI718643B (zh) 異常群體識別方法及裝置
WO2021068610A1 (fr) Procédé et appareil de recommandation de ressources, dispositif électronique et support d'informations
WO2019119635A1 (fr) Procédé de développement d'utilisateur initial, dispositif électronique et support de stockage lisible par ordinateur
CN108615119B (zh) 一种异常用户的识别方法及设备
CN107786943B (zh) 一种用户分群方法及计算设备
CN107679084B (zh) 聚类标签生成方法、电子设备及计算机可读存储介质
CN107741986B (zh) 用户行为预测及相应信息推荐方法和装置
CN109918498B (zh) 一种问题入库方法和装置
WO2020114108A1 (fr) Procédé et dispositif d'interprétation de résultats de regroupement
WO2019085120A1 (fr) Procédé de recommandation à filtrage collaboratif, dispositif électronique, et support d'informations lisible par ordinateur
WO2021042844A1 (fr) Procédé et appareil de regroupement de données à grande échelle, dispositif informatique et support de stockage lisible par ordinateur
CN110825894A (zh) 数据索引建立、数据检索方法、装置、设备和存储介质
WO2019061664A1 (fr) Dispositif électronique, procédé de recommandation de produit basé sur des données de navigation sur internet d'un utilisateur et support d'enregistrement
US20230205755A1 (en) Methods and systems for improved search for data loss prevention
WO2019085463A1 (fr) Procédé de recommandation de demande de service, serveur d'application et support de stockage lisible par ordinateur
CN108961019B (zh) 一种用户账户的检测方法和装置
WO2021109724A1 (fr) Procédé et appareil de détection d'anomalie de journal
WO2019041528A1 (fr) Procédé, appareil électronique et support d'informations lisible par ordinateur permettant de déterminer la polarité d'un sentiment portant sur les actualités
WO2021135104A1 (fr) Procédé et appareil de poussée d'objet à base de données à multiples sources, dispositif et support de stockage
WO2019095587A1 (fr) Procédé de reconnaissance faciale, serveur d'application et support de stockage lisible par ordinateur
JP5518856B2 (ja) 改良された画像認識用の支援装置
CN110134721B (zh) 基于位图的数据统计方法、装置及电子设备
CN110334104B (zh) 一种榜单更新方法、装置、电子设备及存储介质
WO2017095413A1 (fr) Mise à jour automatique incrémentielle de listes de voisins classés selon les k-ièmes voisins les plus proches

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18891887

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 22/09/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18891887

Country of ref document: EP

Kind code of ref document: A1