CN104679787A - Interest information statistical method and device - Google Patents

Interest information statistical method and device Download PDF

Info

Publication number
CN104679787A
CN104679787A CN201310636603.6A CN201310636603A CN104679787A CN 104679787 A CN104679787 A CN 104679787A CN 201310636603 A CN201310636603 A CN 201310636603A CN 104679787 A CN104679787 A CN 104679787A
Authority
CN
China
Prior art keywords
keyword
focus
interest information
lists
keywords
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310636603.6A
Other languages
Chinese (zh)
Other versions
CN104679787B (en
Inventor
陈嘉
曾嘉
袁明轩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN202011588377.5A priority Critical patent/CN113051467A/en
Priority to CN201310636603.6A priority patent/CN104679787B/en
Publication of CN104679787A publication Critical patent/CN104679787A/en
Application granted granted Critical
Publication of CN104679787B publication Critical patent/CN104679787B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

An embodiment of the invention discloses an interest information statistical method and device and relates to the technical field of information. The method and device can improve statistical coverage of interest information. The method includes first acquiring coordinate information that user equipment corresponds to, then acquiring keywords of hotspots that the user equipment correspond to and finally configuring keywords of the hotspots to be interest information that the user equipment corresponds to. The method and device is suitable for conducting statistical on interest information of users.

Description

The statistical method of interest information and device
Technical field
The present invention relates to areas of information technology, particularly a kind of statistical method of interest information and device.
Background technology
Along with the development of infotech, the interest information of intelligent acquisition user is more and more important.Wherein, interest information can be shopping information, cuisines information, body-building information etc., and the intelligent interest information obtaining user refers to user by UE(User Equipment, subscriber equipment) in the various information that reports, obtain the interest information of this user.
At present, the text message that server is issued in social networks by UE, obtains the interest information that UE is corresponding.Wherein, text message comprises: status information, review information, label information etc.But obtain interest information corresponding to UE by text message, the text message only initiatively submitted to according to user in social networks due to server obtains interest information corresponding to UE, thus causes the statistics coverage rate of interest information lower.
Summary of the invention
The embodiment of the present invention provides a kind of statistical method and device of interest information, can improve the statistics coverage rate of interest information.
The technical scheme that the embodiment of the present invention adopts is:
First aspect, the embodiment of the present invention provides a kind of statistical method of interest information, comprising:
Obtain the coordinate information that user equipment (UE) is corresponding;
The coordinate information corresponding according to described UE, obtains the keyword of each focus corresponding to described UE;
The keyword of each focus described is configured to interest information corresponding to described UE.
In the first implementation of first aspect, the described coordinate information corresponding according to described UE, the step obtaining the keyword of each focus corresponding to described UE comprises:
The distance obtained between the coordinate information corresponding with described UE is less than or equal to each focus of predetermined threshold value and the keyword of each focus described.
In conjunction with the first implementation of first aspect or first aspect, in the second implementation of first aspect, distance between the coordinate information that described acquisition is corresponding with described UE also comprises after being less than or equal to the step of each focus of predetermined threshold value and the keyword of each focus described:
The keyword of each focus described is sorted, and generates lists of keywords corresponding to described UE;
Obtain the top n keyword in lists of keywords corresponding to described UE, wherein, N be greater than or equal to 1 integer;
The step that the described keyword by each focus described is configured to interest information corresponding to described UE comprises:
Top n keyword in lists of keywords corresponding for described UE is configured to interest information corresponding to described UE.
In conjunction with the first implementation of first aspect or first aspect, or the second implementation of first aspect, in the third implementation of first aspect, before the described step that the keyword of each focus described is sorted, also comprise:
According to word frequency-reverse document-frequency TF-IDF algorithm, calculate the weighted value that the keyword of each focus described is corresponding respectively;
Described the step that the keyword of each focus described sorts to be comprised:
The keyword of each focus described is sorted according to weighted value order from high to low.
In conjunction with the first implementation of first aspect or first aspect, or the second implementation of first aspect, or the third implementation of first aspect, in the 4th kind of implementation of first aspect, described according to word frequency-reverse document-frequency TF-IDF algorithm, before calculating the step of the weighted value of the keyword difference correspondence of each focus described, also comprise:
Obtain the lists of keywords that each UE is corresponding respectively;
Described according to word frequency-reverse document-frequency TF-IDF algorithm, described in calculating, the step of the weighted value that the keyword of each focus is corresponding respectively comprises:
The lists of keywords corresponding according to each UE described, according to TF-IDF algorithm, calculates the weighted value that the keyword of each focus described is corresponding respectively.
Second aspect, the present invention carries the statistic device of embodiment for a kind of interest information, comprising:
Acquiring unit, for obtaining coordinate information corresponding to user equipment (UE);
Described acquiring unit, also for the coordinate information corresponding according to described UE, obtains the keyword of each focus corresponding to described UE;
Dispensing unit, is configured to interest information corresponding to described UE for the keyword of each focus described in being obtained by described acquiring unit.
In the first implementation of second aspect,
Described acquiring unit, is also less than or equal to each focus of predetermined threshold value and the keyword of each focus described for the distance obtained between the coordinate information corresponding with described UE.
In conjunction with the first implementation of second aspect or second aspect, in the second implementation of second aspect, described device also comprises:
Sequencing unit, sorts for the keyword of each focus described in obtaining described acquiring unit;
Generation unit, for generating lists of keywords corresponding to described UE after the sequence of described sequencing unit;
Described acquiring unit, also for obtaining the top n keyword in lists of keywords corresponding to described UE that described generation unit generates, wherein, N be greater than or equal to 1 integer;
Described dispensing unit, is also configured to interest information corresponding to described UE for the top n keyword in lists of keywords corresponding to the described UE obtained by described acquiring unit.
In conjunction with the first implementation of second aspect or second aspect, or the second implementation of second aspect, in the third implementation of second aspect, described device also comprises:
Computing unit, for according to word frequency-reverse document-frequency TF-IDF algorithm, calculates the weighted value that the keyword of each focus described that described acquiring unit obtains is corresponding respectively;
Described sequencing unit, the weighted value order from high to low also for being calculated according to described computing unit by the keyword of each focus described sorts.
In conjunction with the first implementation of second aspect or second aspect, or the second implementation of second aspect, or the third implementation of second aspect, in the 4th kind of implementation of second aspect,
Described acquiring unit, also for obtaining each UE lists of keywords corresponding respectively;
Described computing unit, also for the lists of keywords that each UE described in obtaining according to described acquiring unit is corresponding, according to TF-IDF algorithm, calculates the weighted value that the keyword of each focus described is corresponding respectively.
The statistical method of the interest information that the embodiment of the present invention provides and device, first coordinate information corresponding to user equipment (UE) is obtained, then corresponding according to UE coordinate information, obtains the keyword of each focus corresponding to UE, finally the keyword of each focus is configured to interest information corresponding to UE.Obtain compared with interest information corresponding to UE with current by text message, the real time position that the embodiment of the present invention is corresponding according to UE, infer each focus corresponding to UE, thus can interest information corresponding to Real-time Obtaining UE, and then the statistics coverage rate of interest information can be improved.
Accompanying drawing explanation
In order to be illustrated more clearly in the technical scheme in the embodiment of the present invention, be briefly described to the accompanying drawing used required in embodiment or description of the prior art below, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to these accompanying drawings.
The statistical method process flow diagram of a kind of interest information that Fig. 1 provides for the embodiment of the present invention one;
The structural representation of the statistic device of a kind of interest information that Fig. 2 provides for the embodiment of the present invention one;
The structural representation of a kind of server that Fig. 3 provides for the embodiment of the present invention one;
The statistical method process flow diagram of a kind of interest information that Fig. 4 provides for the embodiment of the present invention two;
The structural representation of the statistic device of a kind of interest information that Fig. 5 provides for the embodiment of the present invention two;
The structural representation of a kind of server that Fig. 6 provides for the embodiment of the present invention two;
The corresponding relation schematic diagram of a kind of coordinate information that Fig. 7 provides for the embodiment of the present invention two and focus.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, be clearly and completely described the technical scheme in the embodiment of the present invention, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making other embodiments all obtained under creative work prerequisite, belong to the scope of protection of the invention.
For making the advantage of technical solution of the present invention clearly, below in conjunction with drawings and Examples, the present invention is elaborated.
Embodiment one
The embodiment of the present invention provides a kind of statistical method of interest information, and as shown in Figure 1, described method comprises:
101, server obtains coordinate information corresponding to user equipment (UE).
Wherein, coordinate information can be the longitude and latitude value that user's real time position is corresponding.In embodiments of the present invention, operator, according to base station location and the distance between base station and UE, obtains the coordinate information that UE is corresponding.
For the embodiment of the present invention, step 101 is specifically as follows, if UE residence time in the round territory that pre-set radius is corresponding is greater than or equal to Preset Time, then server obtains the longitude corresponding to the center of circle in circle territory and latitude value as coordinate information corresponding to UE.Wherein, pre-set radius and Preset Time can be configured by server, and also can be configured by UE by user, the embodiment of the present invention does not limit.Such as, pre-set radius can be 300 meters, 350 meters, 500 meters etc.; Preset Time can be 1 minute, 2 minutes, 4 minutes etc.
102, the coordinate information that server is corresponding according to UE, obtains the keyword of each focus corresponding to UE.
Wherein, focus and focus keyword can be obtained by map system by server.Such as, focus can be market, National Stadium, Tsing-Hua University etc., keyword corresponding to focus market can be market, cuisines, shopping, amusement etc., keyword corresponding to focus National Stadium can be National Stadium, Bird's Nest, Olympic Green etc., and keyword corresponding to focus Tsing-Hua University can be Tsing-Hua University, education, institution of higher learning, Site of Qing Hua Yuan etc.In embodiments of the present invention, the focus that coordinate information is corresponding can be one, and also can be multiple, the embodiment of the present invention limit.
For the embodiment of the present invention, server, by obtaining each focus near coordinate information, can be avoided the situation causing because user trajectory is sparse the focus of acquisition less, thus can improve the statistics coverage rate of interest information.
103, the keyword of each focus is configured to interest information corresponding to UE by server.
Wherein, interest information may be used for the interest inferring user.Such as, interest information can be: cuisines, chafing dish, Sichuan cuisine, body-building, swimming, the Water Cube etc.
Further, as the specific implementation of method shown in Fig. 1, embodiments provide a kind of statistic device of interest information, as shown in Figure 2, the entity of described device can be server, and described device comprises: acquiring unit 21, dispensing unit 22.
Acquiring unit 21, for obtaining coordinate information corresponding to user equipment (UE).
Acquiring unit 21, also for the coordinate information corresponding according to UE, obtains the keyword of each focus corresponding to UE.
Dispensing unit 22, the keyword for each focus obtained by acquiring unit 21 is configured to interest information corresponding to UE.
Again further, the entity of the statistic device of described interest information can be server, as shown in Figure 3, described server can comprise: processor 31, input equipment 32, output device 33, storer 34, and described input equipment 32, output device 33 and storer 34 are connected with processor 31 respectively.
Processor 31, for obtaining coordinate information corresponding to user equipment (UE).
Processor 31, also for the coordinate information corresponding according to UE, obtains the keyword of each focus corresponding to UE.
Processor 31, also for the keyword of each focus is configured to interest information corresponding to UE.
It should be noted that, other the corresponding descriptions in the statistic device of the interest information provided in the embodiment of the present invention corresponding to each functional unit, the correspondence in reference diagram 1 can describe, do not repeat them here.
The statistical method of the interest information that the embodiment of the present invention provides and device, first coordinate information corresponding to user equipment (UE) is obtained, then corresponding according to UE coordinate information, obtains the keyword of each focus corresponding to UE, finally the keyword of each focus is configured to interest information corresponding to UE.Obtain compared with interest information corresponding to UE with current by text message, the real time position that the embodiment of the present invention is corresponding according to UE, infer each focus corresponding to UE, thus can interest information corresponding to Real-time Obtaining UE, and then the statistics coverage rate of interest information can be improved.
Embodiment two
The embodiment of the present invention provides a kind of statistical method of interest information, and as shown in Figure 4, described method comprises:
401, server obtains coordinate information corresponding to user equipment (UE).
Wherein, coordinate information can be the longitude and latitude value that user's real time position is corresponding.In embodiments of the present invention, operator, according to base station location and the distance between base station and UE, obtains the coordinate information that UE is corresponding.
For the embodiment of the present invention, step 401 is specifically as follows, if UE residence time in the round territory that pre-set radius is corresponding is greater than or equal to Preset Time, then server obtains the longitude corresponding to the center of circle in circle territory and latitude value as coordinate information corresponding to UE.Wherein, pre-set radius and Preset Time can be configured by server, and also can be configured by UE by user, the embodiment of the present invention does not limit.Such as, pre-set radius can be 200 meters, 300 meters, 400 meters etc.; Preset Time can be 1 minute, 3 minutes, 5 minutes etc.
402, the distance that server obtains between the coordinate information corresponding with UE is less than or equal to each focus of predetermined threshold value and the keyword of each focus.
Wherein, predetermined threshold value is for obtaining each hotspot location corresponding to coordinate information, and predetermined threshold value can be configured by server.Such as, predetermined threshold value can be 500 meters, 600 meters, 800 meters etc.
For the embodiment of the present invention, focus can be obtained by map system by server.Such as, focus can be Wangfujing Dajie, Peking University, Nanluoguxiang etc.
Such as shown in Fig. 7, some O is the coordinate information that UE is corresponding, and D is predetermined threshold value, and each focus that the distance between the coordinate information corresponding with UE is less than or equal to predetermined threshold value comprises: the Tian'anmen Square, Zhongshan Park and Palace Museum.
For the embodiment of the present invention, server is less than or equal to each focus of predetermined threshold value by the distance obtained between the coordinate information corresponding with UE, the situation causing because user trajectory is sparse the hotspot location of acquisition less can be avoided, thus the statistics coverage rate of interest information can be improved.
For the embodiment of the present invention, the coordinate information that UE is corresponding can be one or more, and the focus that the distance between the coordinate information corresponding with UE is less than or equal to predetermined threshold value can be one or more, and server can first according to formula obtain the keyword set of a jth focus corresponding to i-th coordinate information; Then according to formula T i=T i 1∪ T i 2∪ ... ∪ T i p, obtain the keyword set that i-th coordinate information is corresponding; Last according to formula T=T 1∪ T 2∪ ... ∪ T s, obtain the keyword set that UE is corresponding.
Wherein, T i jbe the keyword set of the jth focus that i-th coordinate information is corresponding, be the kth keyword that a jth focus that i-th coordinate information is corresponding comprises, be the keyword number that a jth focus that i-th coordinate information is corresponding comprises, T ibe the keyword set of i-th coordinate information, p is the focus number that i-th coordinate information is corresponding, and T is the keyword set that UE is corresponding, and s is the coordinate information number that UE is corresponding.In embodiments of the present invention, the keyword of each focus and keyword set corresponding to UE.
403, server obtains each UE lists of keywords corresponding respectively.
For the embodiment of the present invention, the lists of keywords that each UE is corresponding respectively can be carried out generating and preserving by server in advance.
404, the lists of keywords that server is corresponding according to each UE, according to TF-IDF algorithm, calculates the weighted value that the keyword of each focus is corresponding respectively.
Wherein, TF(term frequency, word frequency) refer to the frequency that keyword occurs in lists of keywords; IDF(inverse document frequency, reverse document-frequency) for measuring general importance corresponding to keyword.
Particularly, the total degree that the number of times that first server can be occurred by certain keyword in lists of keywords occurs divided by keyword whole in lists of keywords, obtain the TF value that this keyword is corresponding, then pass through UE sum divided by comprising to the UE number of this keyword, and result is taken the logarithm, obtain the IDF value that this keyword is corresponding, be multiplied finally by by IDF value corresponding with keyword for TF value corresponding for keyword, obtain the weighted value that this keyword is corresponding.
Such as, UE adds up to 1000, the total degree that in the lists of keywords that certain UE is corresponding, whole keyword occurs is 5000, the number of times that certain keyword occurs in lists of keywords is 300, the UE that corresponding lists of keywords comprises this keyword adds up to 10, then the TF value that this keyword is corresponding is 0.06, and the IDF value that this keyword is corresponding is 2, further, the weighted value that this keyword is corresponding is 0.12.
405, the keyword of each focus sorts according to weighted value order from high to low by server, and generates lists of keywords corresponding to UE.
For the embodiment of the present invention, if there is the identical situation of weighted value corresponding to multiple keyword in the keyword of each focus, then can sort according to the keyword that random order is identical to these weighted values, the embodiment of the present invention does not limit.Such as, can sort according to the keyword that TF value order is from high to low identical to weighted value, also can sort according to the keyword that IDF value order is from high to low identical to weighted value.
406, server obtains the top n keyword in lists of keywords corresponding to UE.
Wherein, N be greater than or equal to 1 integer.Such as, N can be 100,120 or 150 etc.
407, the top n keyword in lists of keywords corresponding for UE is configured to interest information corresponding to UE by server.
For the embodiment of the present invention, by the top n keyword in lists of keywords corresponding for UE is configured to interest information corresponding to UE, when can avoid keyword whole in lists of keywords corresponding for UE to be configured to interest information corresponding to UE, there is the situation that interest information corresponding to UE is too much, thus the configuration complexity of interest information can be reduced.
Further, as the specific implementation of method shown in Fig. 4, embodiments provide a kind of statistic device of interest information, as shown in Figure 5, the entity of described device can be server, and described device comprises: acquiring unit 51, dispensing unit 52.
Acquiring unit 51, for obtaining coordinate information corresponding to user equipment (UE).
Acquiring unit 51, also for the coordinate information corresponding according to UE, obtains the keyword of each focus corresponding to UE.
Dispensing unit 52, the keyword for each focus obtained by acquiring unit 51 is configured to interest information corresponding to UE.
Acquiring unit 51, is also less than or equal to each focus of predetermined threshold value and the keyword of each focus for the distance obtained between the coordinate information corresponding with UE.
Described device can also comprise: sequencing unit 53, generation unit 54.
Sequencing unit 53, the keyword for each focus obtained acquiring unit 51 sorts.
Generation unit 54, for generating lists of keywords corresponding to UE after sequencing unit 53 sorts.
Acquiring unit 51, also for obtaining the top n keyword in lists of keywords corresponding to UE that generation unit 54 generates.
Wherein, N be greater than or equal to 1 integer.
Dispensing unit 52, is also configured to interest information corresponding to UE for the top n keyword in lists of keywords corresponding to the UE obtained by acquiring unit 51.
Described device can also comprise: computing unit 55.
Computing unit 55, for according to word frequency-reverse document-frequency TF-IDF algorithm, calculates the weighted value that the keyword of each focus that acquiring unit 51 obtains is corresponding respectively.
Sequencing unit 53, the weighted value order from high to low also for being calculated according to computing unit 55 by the keyword of each focus sorts.
Acquiring unit 51, also for obtaining each UE lists of keywords corresponding respectively.
Computing unit 55, also for the lists of keywords that each UE obtained according to acquiring unit 51 is corresponding, according to TF-IDF algorithm, calculates the weighted value that the keyword of each focus is corresponding respectively.
Again further, the entity of the statistic device of described interest information can be server, as shown in Figure 6, described server can comprise: processor 61, input equipment 62, output device 63, storer 64, and described input equipment 62, output device 63 and storer 64 are connected with processor 61 respectively.
Processor 61, for obtaining coordinate information corresponding to user equipment (UE).
Processor 61, also for the coordinate information corresponding according to UE, obtains the keyword of each focus corresponding to UE.
Processor 61, also for the keyword of each focus is configured to interest information corresponding to UE.
Processor 61, is also less than or equal to each focus of predetermined threshold value and the keyword of each focus for the distance obtained between the coordinate information corresponding with UE.
Processor 61, also for sorting to the keyword of each focus.
Processor 61, also for generating lists of keywords.
Processor 61, also for obtaining the top n keyword in lists of keywords corresponding to UE.
Wherein, N be greater than or equal to 1 integer.
Processor 61, also for the top n keyword in lists of keywords corresponding for UE is configured to interest information corresponding to UE.
Processor 61, also for according to word frequency-reverse document-frequency TF-IDF algorithm, calculates the weighted value that the keyword of each focus is corresponding respectively.
Processor 61, also for being sorted according to weighted value order from high to low by the keyword of each focus.
Processor 61, also for obtaining each UE lists of keywords corresponding respectively.
Processor 61, also for the lists of keywords corresponding according to each UE, according to TF-IDF algorithm, calculates the weighted value that the keyword of each focus is corresponding respectively.
It should be noted that, other the corresponding descriptions in the statistic device of the interest information provided in the embodiment of the present invention corresponding to each functional unit, the correspondence in reference diagram 4 can describe, do not repeat them here.
The statistical method of the interest information that the embodiment of the present invention provides and device, first coordinate information corresponding to user equipment (UE) is obtained, then corresponding according to UE coordinate information, obtains the keyword of each focus corresponding to UE, finally the keyword of each focus is configured to interest information corresponding to UE.Obtain compared with interest information corresponding to UE with current by text message, the real time position that the embodiment of the present invention is corresponding according to UE, infer each focus corresponding to UE, thus can interest information corresponding to Real-time Obtaining UE, and then the statistics coverage rate of interest information can be improved.
The statistic device of the interest information that the embodiment of the present invention provides can realize the above-mentioned embodiment of the method provided, and concrete function realizes the explanation referred in embodiment of the method, does not repeat them here.The statistical method of the interest information that the embodiment of the present invention provides and device go for adding up the interest information of user, but are not limited only to this.
One of ordinary skill in the art will appreciate that all or part of flow process realized in above-described embodiment method, that the hardware that can carry out instruction relevant by computer program has come, described program can be stored in a computer read/write memory medium, this program, when performing, can comprise the flow process of the embodiment as above-mentioned each side method.Wherein, described storage medium can be magnetic disc, CD, read-only store-memory body (Read-Only Memory, ROM) or random store-memory body (Random Access Memory, RAM) etc.
The above; be only the specific embodiment of the present invention, but protection scope of the present invention is not limited thereto, is anyly familiar with those skilled in the art in the technical scope that the present invention discloses; the change that can expect easily or replacement, all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of claim.

Claims (10)

1. a statistical method for interest information, is characterized in that, comprising:
Obtain the coordinate information that user equipment (UE) is corresponding;
The coordinate information corresponding according to described UE, obtains the keyword of each focus corresponding to described UE;
The keyword of each focus described is configured to interest information corresponding to described UE.
2. the statistical method of interest information according to claim 1, is characterized in that, the described coordinate information corresponding according to described UE, and the step obtaining the keyword of each focus corresponding to described UE comprises:
The distance obtained between the coordinate information corresponding with described UE is less than or equal to each focus of predetermined threshold value and the keyword of each focus described.
3. the statistical method of interest information according to claim 2, it is characterized in that, distance between the coordinate information that described acquisition is corresponding with described UE also comprises after being less than or equal to the step of each focus of predetermined threshold value and the keyword of each focus described:
The keyword of each focus described is sorted, and generates lists of keywords corresponding to described UE;
Obtain the top n keyword in lists of keywords corresponding to described UE, wherein, N be greater than or equal to 1 integer;
The step that the described keyword by each focus described is configured to interest information corresponding to described UE comprises:
Top n keyword in lists of keywords corresponding for described UE is configured to interest information corresponding to described UE.
4. the statistical method of interest information according to claim 3, is characterized in that, before the described step sorted to the keyword of each focus described, also comprises:
According to word frequency-reverse document-frequency TF-IDF algorithm, calculate the weighted value that the keyword of each focus described is corresponding respectively;
Described the step that the keyword of each focus described sorts to be comprised:
The keyword of each focus described is sorted according to weighted value order from high to low.
5. require the statistical method of the interest information described in 4 as requested, it is characterized in that, described according to word frequency-reverse document-frequency TF-IDF algorithm, before calculating the step of the weighted value of the keyword difference correspondence of each focus described, also comprise:
Obtain the lists of keywords that each UE is corresponding respectively;
Described according to word frequency-reverse document-frequency TF-IDF algorithm, described in calculating, the step of the weighted value that the keyword of each focus is corresponding respectively comprises:
The lists of keywords corresponding according to each UE described, according to TF-IDF algorithm, calculates the weighted value that the keyword of each focus described is corresponding respectively.
6. a statistic device for interest information, is characterized in that, comprising:
Acquiring unit, for obtaining coordinate information corresponding to user equipment (UE);
Described acquiring unit, also for the coordinate information corresponding according to described UE, obtains the keyword of each focus corresponding to described UE;
Dispensing unit, is configured to interest information corresponding to described UE for the keyword of each focus described in being obtained by described acquiring unit.
7. the statistic device of interest information according to claim 6, is characterized in that,
Described acquiring unit, is also less than or equal to each focus of predetermined threshold value and the keyword of each focus described for the distance obtained between the coordinate information corresponding with described UE.
8. the statistic device of interest information according to claim 7, is characterized in that, described device also comprises:
Sequencing unit, sorts for the keyword of each focus described in obtaining described acquiring unit;
Generation unit, for generating lists of keywords corresponding to described UE after the sequence of described sequencing unit;
Described acquiring unit, also for obtaining the top n keyword in lists of keywords corresponding to described UE that described generation unit generates, wherein, N be greater than or equal to 1 integer;
Described dispensing unit, is also configured to interest information corresponding to described UE for the top n keyword in lists of keywords corresponding to the described UE obtained by described acquiring unit.
9. the statistic device of interest information according to claim 8, is characterized in that, described device also comprises:
Computing unit, for according to word frequency-reverse document-frequency TF-IDF algorithm, calculates the weighted value that the keyword of each focus described that described acquiring unit obtains is corresponding respectively;
Described sequencing unit, the weighted value order from high to low also for being calculated according to described computing unit by the keyword of each focus described sorts.
10. require the statistic device of the interest information described in 9 as requested, it is characterized in that,
Described acquiring unit, also for obtaining each UE lists of keywords corresponding respectively;
Described computing unit, also for the lists of keywords that each UE described in obtaining according to described acquiring unit is corresponding, according to TF-IDF algorithm, calculates the weighted value that the keyword of each focus described is corresponding respectively.
CN201310636603.6A 2013-11-27 2013-11-27 Interest information statistical method and device Active CN104679787B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202011588377.5A CN113051467A (en) 2013-11-27 2013-11-27 Interest information statistical method and device
CN201310636603.6A CN104679787B (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310636603.6A CN104679787B (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN202011588377.5A Division CN113051467A (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Publications (2)

Publication Number Publication Date
CN104679787A true CN104679787A (en) 2015-06-03
CN104679787B CN104679787B (en) 2021-01-01

Family

ID=53314841

Family Applications (2)

Application Number Title Priority Date Filing Date
CN202011588377.5A Pending CN113051467A (en) 2013-11-27 2013-11-27 Interest information statistical method and device
CN201310636603.6A Active CN104679787B (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN202011588377.5A Pending CN113051467A (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Country Status (1)

Country Link
CN (2) CN113051467A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016202214A2 (en) * 2015-06-19 2016-12-22 阿里巴巴集团控股有限公司 Method and device for displaying keyword
CN108628832A (en) * 2018-05-08 2018-10-09 中国联合网络通信集团有限公司 A kind of information keyword acquisition methods and device
CN111475601A (en) * 2020-04-09 2020-07-31 云南电网有限责任公司电力科学研究院 Method and device for acquiring hot subject of power work order

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102456055A (en) * 2010-10-28 2012-05-16 腾讯科技(深圳)有限公司 Method and device for retrieving interest points
CN103064924A (en) * 2012-12-17 2013-04-24 浙江鸿程计算机系统有限公司 Travel destination situation recommendation method based on geotagged photo excavation
CN103150309A (en) * 2011-12-07 2013-06-12 清华大学 Method and system for searching POI (Point of Interest) points of awareness map in space direction

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102456055A (en) * 2010-10-28 2012-05-16 腾讯科技(深圳)有限公司 Method and device for retrieving interest points
CN103150309A (en) * 2011-12-07 2013-06-12 清华大学 Method and system for searching POI (Point of Interest) points of awareness map in space direction
CN103064924A (en) * 2012-12-17 2013-04-24 浙江鸿程计算机系统有限公司 Travel destination situation recommendation method based on geotagged photo excavation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHISHENG LI等: "IR-Tree: An Efficient Index for Geographic Document Search", 《IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016202214A2 (en) * 2015-06-19 2016-12-22 阿里巴巴集团控股有限公司 Method and device for displaying keyword
WO2016202214A3 (en) * 2015-06-19 2017-02-09 阿里巴巴集团控股有限公司 Method and device for displaying keyword
US11403357B2 (en) 2015-06-19 2022-08-02 Advanced New Technologies Co., Ltd. Enhancing accuracy of presented search keywords
US11727075B2 (en) 2015-06-19 2023-08-15 Advanced New Technologies Co., Ltd. Enhancing accuracy of presented search keywords
CN108628832A (en) * 2018-05-08 2018-10-09 中国联合网络通信集团有限公司 A kind of information keyword acquisition methods and device
CN108628832B (en) * 2018-05-08 2022-03-18 中国联合网络通信集团有限公司 Method and device for acquiring information keywords
CN111475601A (en) * 2020-04-09 2020-07-31 云南电网有限责任公司电力科学研究院 Method and device for acquiring hot subject of power work order

Also Published As

Publication number Publication date
CN113051467A (en) 2021-06-29
CN104679787B (en) 2021-01-01

Similar Documents

Publication Publication Date Title
CN102831170B (en) The method for pushing of activity information and device
RU2571573C1 (en) Method and server for searching for nearby user in social networking services
CN107332889A (en) A kind of high in the clouds information management control system and control method based on cloud computing
CN104867402B (en) A kind of method and device thereof and terminal device of offline inverse geocoding
CN103167511B (en) The acquisition processing method of base stations in wireless communication networks station spacing and device
CN103914877A (en) Three-dimensional model multi-detail-level structure based on extension combination
CN104679787A (en) Interest information statistical method and device
CN110225453A (en) Mobile terminal locating method, device, electronic equipment and storage medium
CN104102635A (en) Method and device for digging knowledge graph
Yuan et al. Irregular distribution of wind power prediction
CN104268201B (en) Space magnanimity multivariate data based on GIS platform unifies indexing means
CN109413661A (en) A kind of computer installation away from method and device
CN113488996A (en) Power distribution network protogram modeling method based on distributed parallel graph computing framework
CN104980948A (en) Electromagnetic radiation statistical method, system thereof and mobile terminal
US10387545B2 (en) Processing page
CN104111981A (en) Method and device used for providing post messages
CN103487057B (en) Paths planning method based on end points extension and device
CN104811372A (en) Multi-user communication method based on geographic position and spatial range
CN108960624A (en) Grid similarity determination method, device and system based on user's visiting information
CN105809296A (en) Public transportation transfer method and system in combination with rail transit information and mobile equipment
CN105282720B (en) A kind of method for filtering spam short messages and device
CN103984684A (en) LBS (location based service)-based reachable area determining method and equipment
CN103207896B (en) Method and system for stable and efficient self-adaptive clustering
CN104392490A (en) Power network resource grid meteorological influence scope analysis method based on GIS platform
CN104156475A (en) Geographic information reading method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant