CN102135983A - Group dividing method and device based on network user behavior - Google Patents

Group dividing method and device based on network user behavior Download PDF

Info

Publication number
CN102135983A
CN102135983A CN2011100091670A CN201110009167A CN102135983A CN 102135983 A CN102135983 A CN 102135983A CN 2011100091670 A CN2011100091670 A CN 2011100091670A CN 201110009167 A CN201110009167 A CN 201110009167A CN 102135983 A CN102135983 A CN 102135983A
Authority
CN
China
Prior art keywords
user
scope
point
behavior
space
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011100091670A
Other languages
Chinese (zh)
Inventor
崔鸿雁
陈睿杰
蔡云龙
刘韵洁
陈建亚
冯辰
谢明志
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Posts and Telecommunications
Original Assignee
Beijing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications filed Critical Beijing University of Posts and Telecommunications
Priority to CN2011100091670A priority Critical patent/CN102135983A/en
Publication of CN102135983A publication Critical patent/CN102135983A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

本发明公开了一种基于网络用户行为的群体划分方法和装置,其中,所述方法为:根据使用者要求,确定行为特征参数集;根据已确定的行为特征参数集,构建用户类型空间;从用户行为服务器中读入数据,将每个用户多种参数归一化后,在用户类型空间中形成用户星座图,对星座图中各点进行初始化并确定探查范围,根据在该点探查范围内的点与其本身的距离计算移动距离向量,同时根据移动距离向量更新探查范围,重复以上步骤,直到所有点都不移动;根据初始探查范围,将邻近的点构成一个用户群。

Figure 201110009167

The invention discloses a group division method and device based on network user behavior, wherein the method includes: determining a behavior characteristic parameter set according to user requirements; constructing a user type space according to the determined behavior characteristic parameter set; The data is read in the user behavior server, and after normalizing various parameters of each user, a user constellation diagram is formed in the user type space, and each point in the constellation diagram is initialized and the detection range is determined. The moving distance vector is calculated based on the distance between the point and itself, and the detection range is updated according to the moving distance vector, and the above steps are repeated until all points do not move; according to the initial detection range, the adjacent points form a user group.

Figure 201110009167

Description

基于网络用户行为的群体划分方法和装置Group division method and device based on network user behavior

技术领域technical field

本发明涉及网络用户行为分析领域,特别是一种实现分析用户业务行为特性和规律的方法和装置。The invention relates to the field of network user behavior analysis, in particular to a method and device for analyzing user business behavior characteristics and rules.

背景技术Background technique

目前,随着互联网用户的急速增加,互联网业务也随之丰富,不同的业务对应不同的人群。因此,让一个业务更加适合其所对应用户就需要对使用该业务的用户进行群体划分,然后根据不同的用户群有针对性的改进现有业务。用户群划分技术是未来用户行为分析的一个关键技术。At present, with the rapid increase of Internet users, Internet services are also enriched, and different services correspond to different groups of people. Therefore, to make a service more suitable for its corresponding users, it is necessary to divide the users who use the service into groups, and then improve the existing services in a targeted manner according to different user groups. User group segmentation technology is a key technology for future user behavior analysis.

现有技术中的用户群划分方法,主要是基于LM(Link Mining链接挖掘)的基于逻辑链接的划分方法,该方法不利于在电信网络中多业务交叉使用,互联网和非互联网多种用户的情况下做出分析,同时,过大的程序复杂度也使得该方法应用范围受限。The user group division method in the prior art is mainly based on the division method based on logical links based on LM (Link Mining), which is not conducive to the cross-use of multiple services in the telecommunications network, and the situation of multiple users of the Internet and non-Internet At the same time, the excessive program complexity also limits the scope of application of this method.

发明内容Contents of the invention

本发明实施例提供基于网络用户行为的群体划分方法和装置,使用户群划分更符合电信网络监控情况和满足使用者特殊要求,同时扩大了使用范围。The embodiment of the present invention provides a group division method and device based on network user behavior, which makes the user group division more in line with the monitoring situation of the telecommunication network and meets the special requirements of users, and at the same time expands the scope of use.

本发明实施例公开了一种基于网络用户行为的群体划分方法,包括:确定行为特征参数集;再根据所确定的行为特征参数集构建用户类型空间,从用户行为服务器中读入数据,将每个用户多种参数归一化后,在用户类型空间中形成用户星座图,对星座图中各点进行初始化并确定探查范围,根据在该点探查范围内的点与其本身的距离计算移动距离向量,同时根据移动距离向量更新探查范围,重复以上步骤,直到所有点都不移动;根据初始探查范围,将邻近的点构成一个用户群。The embodiment of the present invention discloses a group division method based on network user behavior, including: determining a behavior characteristic parameter set; constructing a user type space according to the determined behavior characteristic parameter set, reading data from a user behavior server, and After the various parameters of each user are normalized, a user constellation diagram is formed in the user type space, each point in the constellation diagram is initialized and the detection range is determined, and the moving distance vector is calculated according to the distance between the points within the detection range of the point and itself , and at the same time update the detection range according to the moving distance vector, repeat the above steps until all points do not move; according to the initial detection range, the adjacent points form a user group.

本发明实施例还公开了一种基于网络用户行为的群体划分的装置,其特征在于,包括:The embodiment of the present invention also discloses a device for group division based on network user behavior, which is characterized in that it includes:

参数集确定单元,用于确定行为特征参数集;A parameter set determining unit, configured to determine a behavior characteristic parameter set;

空间确定单元,用于根据行为特征参数集确定用户类型空间;a space determining unit, configured to determine the user type space according to the behavior characteristic parameter set;

聚类单元,从用户行为服务器中读入数据,将每个用户多种参数数值归一化后,在用户类型空间中形成用户星座图并确定探查范围,根据在该点探查范围内的点与其本身的距离计算移动距离向量,同时根据移动距离向量更新探查范围,重复以上步骤,直到所有点都不移动The clustering unit reads data from the user behavior server, and after normalizing the various parameter values of each user, forms a user constellation diagram in the user type space and determines the detection range. Calculate the moving distance vector based on its own distance, and at the same time update the detection range according to the moving distance vector, repeat the above steps until all points do not move

群划分单元,用于根据初始探查范围,将邻近的点构成一个用户群。The group division unit is configured to form a user group from adjacent points according to the initial detection range.

从上述本发明的实施例可以看出,本发明实施例的用户行为空间可以变化满足了电信网络多业务交叉使用要求,引入使用者要求作为行为特征参数集的对应关系,表示参数之间的重要性程度,因此考虑了使用者的特殊要求。同时本发明的聚类方法不局限在网络用户的群体划分,扩大了适用范围。It can be seen from the above-mentioned embodiments of the present invention that the user behavior space in the embodiments of the present invention can be changed to meet the requirements for cross-use of multiple services in the telecommunication network, and user requirements are introduced as the corresponding relationship of behavior characteristic parameter sets, indicating the importance of parameters. The degree of flexibility, therefore taking into account the special requirements of users. At the same time, the clustering method of the present invention is not limited to the group division of network users, which expands the scope of application.

附图说明Description of drawings

图1为本发明实现用户群划分方法的一个实施例流程图;Fig. 1 is a flow chart of an embodiment of the method for realizing user group division in the present invention;

图2为本发明实现网络选择的方法的另一个实施例流程图;FIG. 2 is a flow chart of another embodiment of the method for realizing network selection in the present invention;

图3为本发明距离向量聚类方法的流程图;Fig. 3 is the flowchart of distance vector clustering method of the present invention;

图4为本发明实现用户群划分装置的一个实施例结构图。Fig. 4 is a structural diagram of an embodiment of the device for realizing user group division according to the present invention.

具体实施方式Detailed ways

下面结合附图对本发明实施例进行详细描述。Embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

请参阅图1,为本发明实现网络选择的方法的一个实施例流程图,包括以下步骤:Referring to Fig. 1, it is a flowchart of an embodiment of a method for network selection in the present invention, including the following steps:

步骤101:确定行为特征参数集;Step 101: determining a behavior characteristic parameter set;

步骤102:根据已确定的行为特征参数集,构建用户类型空间;Step 102: Construct a user type space according to the determined behavior characteristic parameter set;

步骤103:从用户行为服务器中读入数据,将每个用户多种参数归一化后,在用户类型空间中形成用户星座图,对星座图中各点进行初始化并确定探查范围,根据在该点探查范围内的点与其本身的距离计算移动距离向量,同时根据移动距离向量更新探查范围,重复以上步骤,直到所有点都不移动;Step 103: Read data from the user behavior server, normalize various parameters of each user, form a user constellation diagram in the user type space, initialize each point in the constellation diagram and determine the detection range, according to the The distance between the points within the point detection range and itself is calculated to calculate the moving distance vector, and at the same time, the detection range is updated according to the moving distance vector, and the above steps are repeated until all points do not move;

步骤104:根据初始探查范围,将邻近的点构成一个用户群;Step 104: according to the initial detection range, form a user group with adjacent points;

上述实施例在用户群划分中从用户行为服务器中读入数据,将每个用户多种参数数值归一化后,在用户类型空间中形成用户星座图并确定探查范围,根据在该点探查范围内的点与其本身的距离计算移动距离向量,同时根据移动距离向量更新探查范围,重复以上步骤,直到所有点都不移动,从而达到用户聚类效果。其中,本发明实施例的用户行为空间可以变化满足了有限数据的要求,引入使用者要求作为行为特征参数集的对应关系,表示参数之间的重要性程度,因此考虑了业务开发者要求。同时本发明的聚类方法不局限在网络用户的群体划分,扩大了适用范围。In the above embodiment, data is read from the user behavior server in user group division, and after normalizing various parameter values of each user, a user constellation diagram is formed in the user type space and the detection range is determined. According to the detection range at this point Calculate the moving distance vector based on the distance between the points within and itself, and at the same time update the detection range according to the moving distance vector, repeat the above steps until all points do not move, so as to achieve the user clustering effect. Among them, the user behavior space in the embodiment of the present invention can be changed to meet the requirements of limited data, and user requirements are introduced as the corresponding relationship of behavior characteristic parameter sets to indicate the importance of parameters, so the requirements of business developers are considered. At the same time, the clustering method of the present invention is not limited to the group division of network users, which expands the scope of application.

请参阅图2,为本发明实现用户群划分的方法的另一个实施例流程图。本实施例中,通过用户界面,获取使用者所关注的信息。本实施例中,行为特征参数在UBAS中存储,比如用户静态类型和用户动态话题,针对IP地址也做出了区域的划分,当使用者需要划分用户群是,包括以下步骤:Please refer to FIG. 2 , which is a flow chart of another embodiment of the method for realizing user group division in the present invention. In this embodiment, the information concerned by the user is obtained through the user interface. In this embodiment, behavior characteristic parameters are stored in UBAS, such as user static type and user dynamic topic, and IP addresses are also divided into regions. When users need to divide user groups, the following steps are included:

步骤201:用户群划分装置UGDD(User Group Diving Device)从当前UBAS(User Behavior analysis Server)获取与使用者要求和各行为特征参数,根据使用者要求与行为特征参数集之间的对应关系,确定相应的行为特征参数集。Step 201: The user group division device UGDD (User Group Diving Device) obtains the user requirements and each behavior characteristic parameter from the current UBAS (User Behavior analysis Server), and determines according to the corresponding relationship between the user requirement and the behavior characteristic parameter set The corresponding set of behavioral feature parameters.

其中UGDD为当前UBAS的一个组成部分,每个UBAS中至少有一个UGDD。Among them, UGDD is a component of the current UBAS, and each UBAS has at least one UGDD.

为了确定行为特征参数集与使用者要求之间的对应关系,在用户群划分前预先读入用户热点话题及关注度:“钓鱼岛,1.0”、“日本,0.6”、“中国,0.8”用户静态类型及关注度:“博客,0.5”、“新闻,0.9”,地域为:“河南”,时间为:20100712作为使用者要求数据和UBAS中已经处理完毕的各种行为特征参数。In order to determine the corresponding relationship between the behavioral feature parameter set and the user’s requirements, the user’s hot topics and attention levels are read in advance before the user group is divided: "Diaoyu Island, 1.0", "Japan, 0.6", "China, 0.8" user static Type and attention: "blog, 0.5", "news, 0.9", region: "Henan", time: 20100712 as user request data and various behavior characteristic parameters that have been processed in UBAS.

所以在本实施例中选取的行为特征参数集为:Therefore, the behavior characteristic parameter set selected in this embodiment is:

表一根据使用者要求所得到的行为特征参数集Table 1 Behavioral feature parameter set obtained according to user requirements

Figure BSA00000419781300041
Figure BSA00000419781300041

步骤202:UGDD,根据行为特征参数集,构建用户类型欧式空间,同时确定归一化标准。Step 202: UGDD, according to the behavior characteristic parameter set, constructs user type Euclidean space, and determines the normalization standard at the same time.

上述构建用户类型欧式空间为在行为特征参数集中,除去“时间”和“地域”两个参数,还有5个参数,所以此时构建5维坐标欧式空间,5个坐标轴分别为:钓鱼岛轴,日本轴,中国轴,博客轴和新闻轴。The user type Euclidean space constructed above is in the behavior characteristic parameter set, except for the two parameters of "time" and "region", there are still 5 parameters, so at this time, a 5-dimensional coordinate Euclidean space is constructed, and the 5 coordinate axes are: Diaoyu Island axis , Japanese axis, Chinese axis, blog axis and news axis.

上述确定归一化标准S=(1.0,0.6,0.8,0.5,0.9)。The normalization standard S=(1.0, 0.6, 0.8, 0.5, 0.9) is determined above.

步骤203:UGDD从UBAS中读取数据,通过归一化标准归一化用户数据,在用户类型空间中形成星座图,用距离向量聚类算法聚类,将相似用户集中Step 203: UGDD reads data from UBAS, normalizes user data by normalization standard, forms constellation diagram in user type space, uses distance vector clustering algorithm to cluster, and gathers similar users

上述UGDD从UBAS中读取数据,通过归一化标准归一化用户数据为:The above UGDD reads data from UBAS, and normalizes the user data through the normalization standard as follows:

UGDD在UBAS中读取20100712河南的数据的结果是共有T个用户,该T个用户每个用户对于202中5个坐标轴有各自的数值,此时用户i可表示为

Figure BSA00000419781300042
据归一化标准S=(1.0,0.6,0.8,0.5,0.9),由公式
Figure BSA00000419781300043
将这T个用户均归一化,得到用户i为IPi=(pi1,pi2,...,pin),这时将这T个用户点入到用户类型空间中形成星座图,每一个点根据公式,计算其初始探查范围w0,令探查范围wi=w0,最小探查范围wmin=w0/4。The result of UGDD reading the data of 20100712 Henan in UBAS is that there are a total of T users, and each of the T users has its own value for the five coordinate axes in 202. At this time, user i can be expressed as
Figure BSA00000419781300042
According to the normalization standard S=(1.0,0.6,0.8,0.5,0.9), by the formula
Figure BSA00000419781300043
The T users are all normalized, and the user i is obtained as IP i =(p i1 , p i2 ,..., pin ), and these T users are entered into the user type space to form a constellation diagram, Each point according to the formula, Calculate the initial detection range w 0 , set the detection range w i =w 0 , and the minimum detection range w min =w 0 /4.

上述距离向量聚类算法聚类,将相似用户集中的具体实现方式请参阅图3,包括以下步骤:For the above-mentioned distance vector clustering algorithm clustering, please refer to Figure 3 for the specific implementation of clustering similar users, including the following steps:

步骤301:确定该n维欧式空间中,L表示向量长度,dij表示点i与j的距离;Step 301: Determine the n-dimensional Euclidean space, L represents the length of the vector, d ij represents the distance between point i and j;

本实施例中,该n维欧式空间就是用户类型空间In this embodiment, the n-dimensional Euclidean space is the user type space

步骤302:计算移动路径;Step 302: Calculate the moving path;

若点i与点j的距离dij<w,则称点j为点i的邻近点或点i为j的邻近点,点i的移动路径di通过公式

Figure BSA00000419781300052
If the distance d ij <w between point i and point j, then point j is called the adjacent point of point i or point i is the adjacent point of j, and the moving path d i of point i is passed by the formula
Figure BSA00000419781300052

计算而得;calculated;

步骤303:聚类移动;Step 303: clustering movement;

令Ldi为移动路径di的长度,令IDi为IDi+di进行移动,同时更新探查范围,使wi=Ldi;对空间中每个点进行一次上述移动;Let L di be the length of the moving path d i , let ID i move as ID i +d i , and update the detection range at the same time, so that w i =L di ; carry out the above-mentioned move once for each point in the space;

步骤304:移动结束判决;Step 304: move end judgment;

当所有点都不移动时,聚类结束,结束后相似用户就已经集中了。When all points do not move, the clustering ends, and similar users have been concentrated after the end.

步骤203:UGDD根据初始探查范围,将邻近的点构成一个用户群.同时将结果反馈给UBAS;Step 203: UGDD forms a user group with adjacent points according to the initial detection range, and feeds back the result to UBAS at the same time;

初始探查范围为w0,每个点i根据初始探查范围,查找邻近点,将点i及其邻近点设为一个群,重复上述探查过程,若该点已归入一个群则不进行探查,所有节点都归入群后结束;得到划分的网络用户群。The initial detection range is w 0 , each point i searches for neighboring points according to the initial detection range, sets point i and its neighboring points as a group, repeats the above detection process, if the point has been classified into a group, no detection is performed, After all the nodes are classified into the group, it ends; the divided network user groups are obtained.

UGDD将划分的各个用户群存入UBAS中。UGDD stores each divided user group into UBAS.

从上述实施例可看出本发明用户行为空间是可以变化的,当UBAS中行为特征参数数据较少时所生成的空间也会较小,满足了有限数据的要求,引入使用者要求作为行为特征参数集的对应关系,表示参数之间的重要性程度,因此考虑了业务开发者要求。同时本发明的聚类方法不局限在网络用户的群体划分,扩大了适用范围。It can be seen from the above embodiments that the user behavior space of the present invention can be changed. When the behavior characteristic parameter data in UBAS is less, the generated space will also be smaller, which meets the requirements of limited data and introduces user requirements as behavior characteristics. The correspondence between parameter sets indicates the degree of importance between parameters, so the requirements of business developers are taken into account. At the same time, the clustering method of the present invention is not limited to the group division of network users, which expands the scope of application.

以上实施例以当前的用户行为分析服务器UBAS为行为特征数据的来源,但本发明不限于这两个分析模型,比如当行为特征不为网络行为特征而是其他行为特征的时候,其用户群划分方法和上述实施例类似,此处不再赘述。请参阅图4,为本发明实现用户群划分的装置的一个实施例结构图,该装置包括参数集确定单元401、空间确定单元402、聚类单元403、群划分单元404The above embodiments use the current user behavior analysis server UBAS as the source of behavior feature data, but the present invention is not limited to these two analysis models. For example, when the behavior feature is not a network behavior feature but other behavior features, its user group division The method is similar to the above embodiment, and will not be repeated here. Please refer to FIG. 4 , which is a structural diagram of an embodiment of a device for realizing user group division in the present invention, the device includes a parameter set determination unit 401, a space determination unit 402, a clustering unit 403, and a group division unit 404

参数集确定单元401,用于根据用户要求,确定行为特征参数集;A parameter set determining unit 401, configured to determine a behavior characteristic parameter set according to user requirements;

空间确定单元402,用于根据参数集确定单元401确定的行为特征参数集确定用户类型空间;A space determining unit 402, configured to determine the user type space according to the behavior characteristic parameter set determined by the parameter set determining unit 401;

聚类单元403,根据空间确定单元402,从参数集中,将每个用户多种参数数值归一化后,在用户类型空间中形成用户星座图并确定探查范围,根据在该点探查范围内的点与其本身的距离计算移动距离向量,同时根据移动距离向量更新探查范围,重复以上步骤,直到所有点都不移动The clustering unit 403, according to the space determination unit 402, from the parameter set, after normalizing various parameter values of each user, forms a user constellation diagram in the user type space and determines the detection range, according to the detection range at this point The distance between the point and itself is used to calculate the moving distance vector, and at the same time, the detection range is updated according to the moving distance vector, and the above steps are repeated until all points do not move

该装置还包括:The unit also includes:

群划分单元404,用于根据聚类单元403所得的结果,利用初始探查范围,将邻近的点构成一个用户群。The group division unit 404 is configured to form a user group from adjacent points by using the initial detection range according to the result obtained by the clustering unit 403 .

本实施例中,用户行为空间不局限与互联网行为特征要求,引入使用者要求作为行为特征参数集的对应关系,表示参数之间的重要性程度,满足了使用者的特殊要求,同时考虑了电信网络多业务交叉使用的情况。同时本发明的聚类方法不局限在网络用户的群体划分,扩大了适用范围。In this embodiment, the user behavior space is not limited to the requirements of Internet behavior characteristics, and user requirements are introduced as the corresponding relationship of behavior characteristic parameter sets to indicate the importance of parameters, which meets the special requirements of users and takes into account the telecommunications The case of cross-use of multiple services on the network. At the same time, the clustering method of the present invention is not limited to the group division of network users, which expands the scope of application.

本领域普通技术人员可以理解实现上述实施例方法中的全部或部分步骤,是可以通过程序来指令相关的硬件来完成,所述的程序可以存储于计算机可读取存储介质中,该程序包括如下步骤:根据使用者要求,确定行为特征参数集;根据已确定的行为特征参数集,构建用户类型空间;从用户行为服务器中读入数据,将每个用户多种参数归一化后,在用户类型空间中形成用户星座图,对星座图中各点进行初始化并确定探查范围,根据在该点探查范围内的点与其本身的距离计算移动距离向量,同时根据移动距离向量更新探查范围,重复以上步骤,直到所有点都不移动;根据初始探查范围,将邻近的点构成一个用户群。所述的存储介质,如:ROM/RAM、磁碟、光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the method of the above-mentioned embodiments can be completed by instructing related hardware through a program. The program can be stored in a computer-readable storage medium, and the program includes the following Steps: Determine the behavioral characteristic parameter set according to the user's requirements; construct the user type space according to the determined behavioral characteristic parameter set; Form the user constellation diagram in the type space, initialize each point in the constellation diagram and determine the detection range, calculate the moving distance vector according to the distance between the points within the detection range of the point and itself, and update the detection range according to the moving distance vector, repeat the above Steps until all points do not move; according to the initial exploration range, the adjacent points form a user group. The storage medium, such as: ROM/RAM, magnetic disk, optical disk, etc.

以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应该视为本发明的保护范围。The above is only a preferred embodiment of the present invention, it should be pointed out that, for those of ordinary skill in the art, without departing from the principle of the present invention, some improvements and modifications can also be made, and these improvements and modifications can also be made. It should be regarded as the protection scope of the present invention.

Claims (9)

1. colony's division methods of a user behavior Network Based is characterized in that, comprises
Determine the cybernetics control number collection;
According to fixed cybernetics control number collection, make up the user type space;
From the user behavior server, read in data, after the multiple parameter normalization of each user, in the user type space, form user's planisphere, each point in the planisphere is carried out initialization and determines the scope of detecting, according to detect point in the scope and the distance calculation displacement vector of itself at this point, upgrade simultaneously and detect scope, repeat above step, have a few mobile up to institute according to the displacement vector;
According to initially detecting scope, contiguous point is constituted a customer group.
2. method according to claim 1 is characterized in that, also comprises before described definite cybernetics control number:
In predetermined cybernetics control number scope, according to user's the needs setting and the corresponding relation of current behavior characteristic parameter collection;
Then described definite current behavior characteristic parameter collection is for obtaining the dynamic topic of user, user's static types and IP address from user behavior analysis server UBAS (User Behavior Analysis Server), according to user's the needs and the corresponding relation of current behavior characteristic parameter collection, determine the corresponding cybernetics control number collection of needs with the user;
3. method according to claim 2 is characterized in that, described cybernetics control number scope comprises:
The dynamic topic of user is by being used the determined keyword of Network by the user in a period of time in the user behavior server, this keyword can since the user who is analyzed for geographic position and time of use business different and different;
User's static types is meant the professional type of using, and comprising: news, physical culture, amusement, science and technology, finance and economics, automobile, number, mobile phone, house property, recreation, tourism, education, 13 classes of blog; These classifications are to add up by the historical data of user behavior server to draw.
4. method according to claim 2 is characterized in that, the user requires to comprise:
To degree of concern some or the dynamic topic of several users or user's static types, the selection of time period, the selection of geographic area scope.
5. method according to claim 1 is characterized in that, described definite cybernetics control number collection comprises:
Obtain with the user from UBAS and to require corresponding behavioural analysis parameter set.
6. according to claim 1,2 or 5 any described methods, it is characterized in that,, make up the user type space and comprise according to fixed cybernetics control number collection:
This user type space is a theorem in Euclid space;
With the orthogonal axis in the dynamic topic structure of user's static types and user in the parameter set space;
User's static types and the dynamic topic quantity of user sum n are called range parameter;
With the attention rate of each user's static types and the dynamic topic of user in the parameter set as data normalization standard S=(s 1, s 2..., s n);
7. according to claim 1,2 or 5 any described methods, it is characterized in that, describedly from the user behavior server, read in data, after the multiple parameter values normalization of each user, form user's planisphere and determine the scope of detecting in the user type space, according to the point and the distance calculation displacement vector of itself detected at this point in the scope, scope is detected in renewal according to the displacement vector simultaneously, repeat above step, up to have a few not move and comprise:
To from the user behavior server, reading in data, after the multiple parameter values normalization of each user, in the user type space, form user's planisphere and determine the scope of detecting according to:
From the user behavior server, read each user i data and indicate this user, obtain data set with the IP address
Figure FSA00000419781200021
According to number
According to normalization standard S=(s 1, s 2..., s n), by formula
Figure FSA00000419781200022
Normalization,
Each parameter of user i obtains IP i=(p I1, p I2..., p In) simultaneously, click and enter in the user type space,
Form planisphere, each is put according to formula,
Figure FSA00000419781200023
Calculate its initial spy
Look into scope w 0, scope w is detected in order i=w 0, minimum is detected scope w Min=w 0/ 4.
To according to point and the distance calculation displacement vector of itself detected at this point in the scope, upgrade according to the displacement vector simultaneously and detect scope, repeat above step, up to have a few mobilely according to the distance vector clustering algorithm, carry out mobile cluster;
The vector clustering algorithm of adjusting the distance carries out according to following steps:
A) in this n dimension theorem in Euclid space, L represents vector length, d IjThe distance of expression point i and j;
B) if the some i with the some j apart from d Ij<w claims that then neighbor point or the some i of some j for some i is j
Neighbor point, the some i mobile route d iBy
Calculate and get, simultaneously, make L DiBe mobile route d iLength, make ID iBe ID i+ d iMove, upgrade simultaneously and detect scope, make w i=L DiEach point in the space is carried out once above-mentioned moving;
C) repeat b), when institute has a few when not mobile the cluster end;
8. according to claim 1,2,5,6 or 7 any described methods, it is characterized in that described basis is initially detected scope, contiguous point constituted a customer group comprise:
Initially the scope of detecting is w 0, each some i searches neighbor point according to initially detecting scope, will put i and neighbor point thereof and be made as a group, repeats above-mentioned probe procedure, if this point has been included into a group and has not then detected, all nodes all are included into the group back to be finished; The network user group who obtains dividing.
9. the colony's division methods and the device of a user behavior Network Based is characterized in that, comprising:
The parameter set determining unit is used for determining the cybernetics control number collection;
The space determining unit is used for determining the user type space according to the cybernetics control number collection;
Cluster cell, from the user behavior server, read in data, after the multiple parameter values normalization of each user, in the user type space, form user's planisphere and determine the scope of detecting, according to detect point in the scope and the distance calculation displacement vector of itself at this point, upgrade simultaneously and detect scope, repeat above step, have a few mobile up to institute according to the displacement vector
Group's division unit is used for according to initially detecting scope contiguous point being constituted a customer group.
CN2011100091670A 2011-01-17 2011-01-17 Group dividing method and device based on network user behavior Pending CN102135983A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011100091670A CN102135983A (en) 2011-01-17 2011-01-17 Group dividing method and device based on network user behavior

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011100091670A CN102135983A (en) 2011-01-17 2011-01-17 Group dividing method and device based on network user behavior

Publications (1)

Publication Number Publication Date
CN102135983A true CN102135983A (en) 2011-07-27

Family

ID=44295770

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011100091670A Pending CN102135983A (en) 2011-01-17 2011-01-17 Group dividing method and device based on network user behavior

Country Status (1)

Country Link
CN (1) CN102135983A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102438035A (en) * 2011-09-29 2012-05-02 惠州市德赛西威汽车电子有限公司 Automatic microblog interconnection access method suitable for automobile users
CN102509248A (en) * 2011-11-22 2012-06-20 昆明理工大学 Multi-objective categorization method for travel grouping and system adopting same
CN102937951A (en) * 2011-08-15 2013-02-20 北京百度网讯科技有限公司 Method for building internet protocol (IP) address classification model, user classifying method and device
CN103514167A (en) * 2012-06-15 2014-01-15 富士通株式会社 Data processing method and device
CN104573034A (en) * 2015-01-15 2015-04-29 中国联合网络通信集团有限公司 CDR call ticket based user group division method and system
CN105224681A (en) * 2015-10-23 2016-01-06 武汉大学 Based on the customer requirement retrieval method and system of family's place of working context environmental
WO2016000562A1 (en) * 2014-06-30 2016-01-07 Tencent Technology (Shenzhen) Company Limited Method and apparatus for grouping network service users
CN105763631A (en) * 2016-04-13 2016-07-13 北京网康科技有限公司 Peer group identification method and device
CN107623715A (en) * 2017-08-08 2018-01-23 阿里巴巴集团控股有限公司 A kind of identity information acquisition methods and device
CN109145934A (en) * 2017-12-22 2019-01-04 北京数安鑫云信息技术有限公司 User behavior data processing method, medium, equipment and device based on log
CN109522815A (en) * 2018-10-26 2019-03-26 深圳博为教育科技有限公司 A kind of focus appraisal procedure, device and electronic equipment
CN110737848A (en) * 2018-07-03 2020-01-31 百度在线网络技术(北京)有限公司 Mining processing method and device for user work unit and electronic equipment

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102937951B (en) * 2011-08-15 2016-11-02 北京百度网讯科技有限公司 Set up the method for IP address sort model, the method and device to user's classification
CN102937951A (en) * 2011-08-15 2013-02-20 北京百度网讯科技有限公司 Method for building internet protocol (IP) address classification model, user classifying method and device
CN102438035B (en) * 2011-09-29 2014-10-29 惠州市德赛西威汽车电子有限公司 Automatic microblog interconnection accessing method suitable for automobile user
CN102438035A (en) * 2011-09-29 2012-05-02 惠州市德赛西威汽车电子有限公司 Automatic microblog interconnection access method suitable for automobile users
CN102509248B (en) * 2011-11-22 2016-03-30 昆明理工大学 A kind ofly be applied to the travel Multi-Target Classification Method of forming a team and system
CN102509248A (en) * 2011-11-22 2012-06-20 昆明理工大学 Multi-objective categorization method for travel grouping and system adopting same
CN103514167A (en) * 2012-06-15 2014-01-15 富士通株式会社 Data processing method and device
CN103514167B (en) * 2012-06-15 2017-03-01 富士通株式会社 Data processing method and equipment
WO2016000562A1 (en) * 2014-06-30 2016-01-07 Tencent Technology (Shenzhen) Company Limited Method and apparatus for grouping network service users
CN105281925A (en) * 2014-06-30 2016-01-27 腾讯科技(深圳)有限公司 Network service user group dividing method and device
CN105281925B (en) * 2014-06-30 2019-05-14 腾讯科技(深圳)有限公司 The method and apparatus that network service groups of users divides
US9817885B2 (en) 2014-06-30 2017-11-14 Tencent Technology (Shenzhen) Company Limited Method and apparatus for grouping network service users
CN104573034A (en) * 2015-01-15 2015-04-29 中国联合网络通信集团有限公司 CDR call ticket based user group division method and system
CN104573034B (en) * 2015-01-15 2018-03-23 中国联合网络通信集团有限公司 User group's division method and system based on CDR tickets
CN105224681A (en) * 2015-10-23 2016-01-06 武汉大学 Based on the customer requirement retrieval method and system of family's place of working context environmental
CN105224681B (en) * 2015-10-23 2018-07-27 武汉大学 Customer requirement retrieval method and system based on family's place of working context environmental
CN105763631B (en) * 2016-04-13 2019-03-29 北京网康科技有限公司 A kind of peer group recognition methods and device
CN105763631A (en) * 2016-04-13 2016-07-13 北京网康科技有限公司 Peer group identification method and device
CN107623715A (en) * 2017-08-08 2018-01-23 阿里巴巴集团控股有限公司 A kind of identity information acquisition methods and device
CN107623715B (en) * 2017-08-08 2020-06-09 阿里巴巴集团控股有限公司 Identity information acquisition method and device
CN109145934A (en) * 2017-12-22 2019-01-04 北京数安鑫云信息技术有限公司 User behavior data processing method, medium, equipment and device based on log
CN109145934B (en) * 2017-12-22 2019-05-21 北京数安鑫云信息技术有限公司 User behavior data processing method, medium, equipment and device based on log
CN110737848A (en) * 2018-07-03 2020-01-31 百度在线网络技术(北京)有限公司 Mining processing method and device for user work unit and electronic equipment
CN109522815A (en) * 2018-10-26 2019-03-26 深圳博为教育科技有限公司 A kind of focus appraisal procedure, device and electronic equipment

Similar Documents

Publication Publication Date Title
CN102135983A (en) Group dividing method and device based on network user behavior
Li et al. Privacy leakage via de-anonymization and aggregation in heterogeneous social networks
WO2022179384A1 (en) Social group division method and division system, and related apparatuses
CN104281882B (en) The method and system of prediction social network information stream row degree based on user characteristics
US11675928B2 (en) Electronic device for obfuscating and decoding data and method for controlling same
CN109460520B (en) Point-of-interest recommendation method based on geo-social relationship and deep implicit interest mining
CN109783614B (en) A differential privacy leak detection method and system for text to be published in social networks
CN106874435A (en) User portrait construction method and device
CN103744957A (en) Sequence mode mining method based on Web user time attributes
US9177039B2 (en) Communication assistance device, communication assistance method, and computer readable recording medium
CN112052404A (en) Group discovery method, system, device and medium for multi-source heterogeneous relation network
CN110990683A (en) Microblog rumor integrated identification method and device based on region and emotional characteristics
CN109992784A (en) A Heterogeneous Network Construction and Distance Measurement Method for Fusing Multimodal Information
CN113422761A (en) Malicious social user detection method based on counterstudy
WO2023155306A1 (en) Data recommendation method and apparatus based on graph neural network and electronic device
US9244970B2 (en) Communication assistance device, communication assistance method, and computer readable recording medium
CN107133268B (en) Collaborative filtering method for Web service recommendation
CN115131058B (en) Account identification method, device, equipment and storage medium
CN113556368A (en) User identification method, device, server and storage medium
CN109492924B (en) A Second-Order Influence Evaluation Method Based on Weibo Users&#39; Self and Behavioral Value
CN112163170A (en) A method and system for improving social network alignment based on virtual nodes and meta-learning
CN116401394B (en) Object set, image generation method and device, electronic equipment and storage medium
CN108763400B (en) Object dividing method and device based on object behaviors and theme preferences
CN109150974B (en) User identity linking method based on neighbor iteration similarity
CN106709597B (en) Method and device for parallel optimization processing of TSP problem based on artificial bee colony algorithm

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20110727