CN104679787B - Interest information statistical method and device - Google Patents

Interest information statistical method and device Download PDF

Info

Publication number
CN104679787B
CN104679787B CN201310636603.6A CN201310636603A CN104679787B CN 104679787 B CN104679787 B CN 104679787B CN 201310636603 A CN201310636603 A CN 201310636603A CN 104679787 B CN104679787 B CN 104679787B
Authority
CN
China
Prior art keywords
keywords
hotspot
unit
interest information
acquiring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310636603.6A
Other languages
Chinese (zh)
Other versions
CN104679787A (en
Inventor
陈嘉
曾嘉
袁明轩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201310636603.6A priority Critical patent/CN104679787B/en
Priority to CN202011588377.5A priority patent/CN113051467A/en
Publication of CN104679787A publication Critical patent/CN104679787A/en
Application granted granted Critical
Publication of CN104679787B publication Critical patent/CN104679787B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The embodiment of the invention discloses a method and a device for counting interest information, relates to the technical field of information, and can improve the counting coverage rate of the interest information. The method comprises the following steps: the method comprises the steps of firstly obtaining coordinate information corresponding to user equipment, then obtaining keywords of each hotspot corresponding to the user equipment according to the coordinate information corresponding to the user equipment, and finally configuring the keywords of each hotspot as interest information corresponding to the user equipment. The embodiment of the invention is suitable for counting the interest information of the user.

Description

Interest information statistical method and device
Technical Field
The present invention relates to the field of information technologies, and in particular, to a method and an apparatus for counting interest information.
Background
With the continuous development of information technology, it is more and more important to intelligently acquire interest information of a user. The interest information may be shopping information, food information, fitness information, and the like, and the intelligently acquiring the interest information of the User means acquiring the interest information of the User from various information reported by the User through a User Equipment (UE).
Currently, a server obtains interest information corresponding to a UE through text information published by the UE in a social network. Wherein the text information includes: status information, comment information, tag information, and the like. However, the interest information corresponding to the UE is obtained through the text information, and the server obtains the interest information corresponding to the UE only according to the text information actively submitted by the user in the social network, so that the statistical coverage of the interest information is low.
Disclosure of Invention
The embodiment of the invention provides a method and a device for counting interest information, which can improve the counting coverage rate of the interest information.
The embodiment of the invention adopts the technical scheme that:
in a first aspect, an embodiment of the present invention provides a method for counting interest information, including:
acquiring coordinate information corresponding to User Equipment (UE);
acquiring key words of each hotspot corresponding to the UE according to the coordinate information corresponding to the UE;
and configuring the keywords of each hotspot as interest information corresponding to the UE.
In a first implementation manner of the first aspect, the step of obtaining, according to the coordinate information corresponding to the UE, keywords of each hotspot corresponding to the UE includes:
and acquiring each hotspot and the keyword of each hotspot, wherein the distance between the hotspot and the coordinate information corresponding to the UE is less than or equal to a preset threshold value.
With reference to the first aspect or the first implementation manner of the first aspect, in a second implementation manner of the first aspect, after the step of obtaining each hotspot whose distance between the coordinate information corresponding to the UE is less than or equal to a preset threshold and the keyword of each hotspot, the method further includes:
sorting the keywords of each hotspot and generating a keyword list corresponding to the UE;
acquiring the first N keywords in a keyword list corresponding to the UE, wherein N is an integer greater than or equal to 1;
the step of configuring the keywords of the hotspots as the interest information corresponding to the UE includes:
and configuring the first N keywords in the keyword list corresponding to the UE as interest information corresponding to the UE.
With reference to the first aspect, or the first implementation manner of the first aspect, or the second implementation manner of the first aspect, in a third implementation manner of the first aspect, before the step of ranking the keywords of the respective hotspots, the method further includes:
calculating the weight values corresponding to the keywords of each hotspot according to a word frequency-reverse file frequency TF-IDF algorithm;
the step of ranking the keywords of each hotspot comprises:
and sorting the keywords of each hotspot according to the sequence of the weight values from high to low.
With reference to the first aspect, or the first implementation manner of the first aspect, or the second implementation manner of the first aspect, or the third implementation manner of the first aspect, in a fourth implementation manner of the first aspect, before the step of calculating, according to a term frequency-inverse file frequency TF-IDF algorithm, weight values corresponding to the keywords of the respective hotspots, the method further includes:
acquiring a keyword list corresponding to each UE;
the step of calculating the weight values corresponding to the keywords of each hotspot according to a word frequency-inverse file frequency TF-IDF algorithm comprises the following steps:
and calculating the weight values corresponding to the keywords of each hotspot respectively according to the keyword list corresponding to each UE and a TF-IDF algorithm.
In a second aspect, an embodiment of the present invention provides an interest information statistics apparatus, including:
the device comprises an acquisition unit, a processing unit and a processing unit, wherein the acquisition unit is used for acquiring coordinate information corresponding to User Equipment (UE);
the obtaining unit is further configured to obtain, according to the coordinate information corresponding to the UE, a keyword of each hotspot corresponding to the UE;
a configuration unit, configured to configure the keywords of the hotspots acquired by the acquisition unit as interest information corresponding to the UE.
In a first implementation form of the second aspect,
the acquiring unit is further configured to acquire each hotspot where a distance between the coordinate information corresponding to the UE is less than or equal to a preset threshold and a keyword of each hotspot.
With reference to the second aspect or the first implementation manner of the second aspect, in a second implementation manner of the second aspect, the apparatus further includes:
the sorting unit is used for sorting the keywords of the hotspots acquired by the acquiring unit;
a generating unit, configured to generate a keyword list corresponding to the UE after being sorted by the sorting unit;
the obtaining unit is further configured to obtain the first N keywords in the keyword list corresponding to the UE generated by the generating unit, where N is an integer greater than or equal to 1;
the configuration unit is further configured to configure the top N keywords in the keyword list corresponding to the UE acquired by the acquisition unit as interest information corresponding to the UE.
With reference to the second aspect, or the first implementation manner of the second aspect, or the second implementation manner of the second aspect, in a third implementation manner of the second aspect, the apparatus further includes:
the calculating unit is used for calculating the weight values corresponding to the keywords of the hot spots acquired by the acquiring unit according to a word frequency-reverse file frequency TF-IDF algorithm;
the sorting unit is further configured to sort the keywords of each hotspot in an order from high to low according to the weight values calculated by the calculating unit.
With reference to the second aspect or the first implementation manner of the second aspect, or the second implementation manner of the second aspect, or the third implementation manner of the second aspect, in a fourth implementation manner of the second aspect,
the acquiring unit is further configured to acquire a keyword list corresponding to each UE;
the calculating unit is further configured to calculate, according to the keyword list corresponding to each UE acquired by the acquiring unit, weight values corresponding to the keywords of each hotspot respectively according to a TF-IDF algorithm.
According to the method and the device for counting the interest information, coordinate information corresponding to User Equipment (UE) is obtained firstly, then keywords of each hot spot corresponding to the UE are obtained according to the coordinate information corresponding to the UE, and finally the keywords of each hot spot are configured into the interest information corresponding to the UE. Compared with the prior art that the interest information corresponding to the UE is acquired through the text information, the embodiment of the invention conjectures each hotspot corresponding to the UE according to the real-time position corresponding to the UE, thereby acquiring the interest information corresponding to the UE in real time and further improving the statistical coverage rate of the interest information.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed for the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings without creative efforts.
Fig. 1 is a flowchart of a statistical method for interest information according to an embodiment of the present invention;
fig. 2 is a schematic structural diagram of an interest information statistics apparatus according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of a server according to an embodiment of the present invention;
FIG. 4 is a flowchart of a statistical method for interest information according to a second embodiment of the present invention;
fig. 5 is a schematic structural diagram of an interest information statistics apparatus according to a second embodiment of the present invention;
fig. 6 is a schematic structural diagram of a server according to a second embodiment of the present invention;
fig. 7 is a schematic diagram of a correspondence relationship between coordinate information and a hotspot provided in the second embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In order to make the advantages of the technical solutions of the present invention clearer, the present invention is described in detail below with reference to the accompanying drawings and examples.
Example one
An embodiment of the present invention provides a statistical method for interest information, as shown in fig. 1, the method includes:
101. and the server acquires the coordinate information corresponding to the user equipment UE.
The coordinate information may be a longitude value and a latitude value corresponding to the real-time position of the user. In the embodiment of the invention, the operator acquires the coordinate information corresponding to the UE according to the position of the base station and the distance between the base station and the UE.
For the embodiment of the present invention, in step 101, specifically, if the stay time of the UE in the circular domain corresponding to the preset radius is greater than or equal to the preset time, the server obtains the longitude value and the latitude value corresponding to the center of the circular domain as the coordinate information corresponding to the UE. The preset radius and the preset time may be configured by a server, or may be configured by a user through a UE, which is not limited in the embodiments of the present invention. For example, the preset radius may be 300 meters, 350 meters, 500 meters, etc.; the preset time may be 1 minute, 2 minutes, 4 minutes, etc.
102. And the server acquires the keywords of each hotspot corresponding to the UE according to the coordinate information corresponding to the UE.
The hot spots and the keywords of the hot spots can be acquired by the server through the map system. For example, the hotspot may be a mall, a national stadium, a qinghua university, etc., the keyword corresponding to the hotspot mall may be a mall, a food, shopping, entertainment, etc., the keyword corresponding to the hotspot national stadium may be a national stadium, a bird nest, an olympic park, etc., and the keyword corresponding to the hotspot qinghua university may be a qinghua university, an education, an advanced schoolhouse, a qinghua garden, etc. In the embodiment of the present invention, one or more hotspots corresponding to the coordinate information may be provided, and the embodiment of the present invention is not limited.
For the embodiment of the invention, the server can avoid the situation that the acquired hotspots are few due to sparse user tracks by acquiring the hotspots near the coordinate information, so that the statistical coverage rate of the interest information can be improved.
103. And the server configures the keywords of each hotspot as interest information corresponding to the UE.
Wherein the interest information may be used to infer the interests of the user. For example, the interest information may be: food, chafing dish, Sichuan dish, body building, swimming, water cube, etc.
Further, as a specific implementation of the method shown in fig. 1, an embodiment of the present invention provides an interest information statistics apparatus, as shown in fig. 2, an entity of the apparatus may be a server, and the apparatus includes: an acquisition unit 21 and a configuration unit 22.
An obtaining unit 21, configured to obtain coordinate information corresponding to the user equipment UE.
The obtaining unit 21 is further configured to obtain, according to the coordinate information corresponding to the UE, a keyword of each hotspot corresponding to the UE.
The configuring unit 22 is configured to configure the keywords of each hotspot acquired by the acquiring unit 21 as interest information corresponding to the UE.
Still further, the entity of the interest information statistic device may be a server, as shown in fig. 3, and the server may include: the device comprises a processor 31, an input device 32, an output device 33 and a memory 34, wherein the input device 32, the output device 33 and the memory 34 are respectively connected with the processor 31.
And the processor 31 is configured to acquire coordinate information corresponding to the user equipment UE.
The processor 31 is further configured to obtain, according to the coordinate information corresponding to the UE, a keyword of each hotspot corresponding to the UE.
The processor 31 is further configured to configure the keywords of each hotspot as interest information corresponding to the UE.
It should be noted that, for other corresponding descriptions corresponding to each functional unit in the statistics apparatus for interest information provided in the embodiment of the present invention, reference may be made to the corresponding description in fig. 1, which is not described herein again.
According to the method and the device for counting the interest information, coordinate information corresponding to User Equipment (UE) is obtained firstly, then keywords of each hot spot corresponding to the UE are obtained according to the coordinate information corresponding to the UE, and finally the keywords of each hot spot are configured into the interest information corresponding to the UE. Compared with the prior art that the interest information corresponding to the UE is acquired through the text information, the embodiment of the invention conjectures each hotspot corresponding to the UE according to the real-time position corresponding to the UE, thereby acquiring the interest information corresponding to the UE in real time and further improving the statistical coverage rate of the interest information.
Example two
An embodiment of the present invention provides a statistical method for interest information, as shown in fig. 4, the method includes:
401. and the server acquires the coordinate information corresponding to the user equipment UE.
The coordinate information may be a longitude value and a latitude value corresponding to the real-time position of the user. In the embodiment of the invention, the operator acquires the coordinate information corresponding to the UE according to the position of the base station and the distance between the base station and the UE.
For the embodiment of the present invention, in step 401, specifically, if the staying time of the UE in the circular domain corresponding to the preset radius is greater than or equal to the preset time, the server obtains the longitude value and the latitude value corresponding to the center of the circular domain as the coordinate information corresponding to the UE. The preset radius and the preset time may be configured by a server, or may be configured by a user through a UE, which is not limited in the embodiments of the present invention. For example, the preset radius may be 200 meters, 300 meters, 400 meters, etc.; the preset time may be 1 minute, 3 minutes, 5 minutes, etc.
402. The server acquires each hotspot and the keyword of each hotspot, wherein the distance between the coordinate information corresponding to the UE is smaller than or equal to a preset threshold value.
The preset threshold is used for acquiring each hotspot position corresponding to the coordinate information, and the preset threshold can be configured by the server. For example, the preset threshold may be 500 meters, 600 meters, 800 meters, etc.
For the embodiment of the invention, the hot spot can be acquired by the server through the map system. For example, the hotspot may be the princf well avenue, the university of beijing, the southern gong-drum lane, and the like.
For example, as shown in fig. 7, the point O is coordinate information corresponding to the UE, the point D is a preset threshold, and each hotspot whose distance from the coordinate information corresponding to the UE is less than or equal to the preset threshold includes: tiananmen square, Zhongshan park and Imperial palace courtyard.
For the embodiment of the invention, the server acquires each hotspot of which the distance between the coordinate information corresponding to the UE is smaller than or equal to the preset threshold value, so that the situation that the acquired hotspot positions are few due to sparse user tracks can be avoided, and the statistical coverage rate of the interest information can be improved.
For the embodiment of the present invention, the number of the coordinate information corresponding to the UE may be one or more, the number of the hot spots having a distance smaller than or equal to the preset threshold value from the coordinate information corresponding to the UE may be one or more, and the server may first perform the following operations according to a formula
Figure BDA0000423695970000081
Acquiring a keyword set of a jth hotspot corresponding to ith coordinate information; then according to the formula Ti=Ti 1∪Ti 2∪…∪Ti pAcquiring a keyword set corresponding to the ith coordinate information; finally according to the formula T = T1∪T2∪…∪TsAnd acquiring a keyword set corresponding to the UE.
Wherein, Ti jThe keyword set of the jth hotspot corresponding to the ith coordinate information,
Figure BDA0000423695970000082
for the kth keyword included in the jth hotspot corresponding to the ith coordinate information,
Figure BDA0000423695970000083
the number of keywords T included in the jth hotspot corresponding to the ith coordinate informationiThe number of the hot spots corresponding to the ith coordinate information is p, the number of the key words corresponding to the ith coordinate information is T, and the number of the coordinate information corresponding to the UE is s. In the embodiment of the present invention, the keywords of each hotspot are the keyword set corresponding to the UE.
403. And the server acquires the keyword lists respectively corresponding to the UE.
For the embodiment of the present invention, the keyword lists respectively corresponding to each UE may be generated and stored in advance by the server.
404. And the server calculates the weight values corresponding to the keywords of each hotspot according to the keyword list corresponding to each UE and the TF-IDF algorithm.
Wherein TF (term frequency) refers to the frequency of occurrence of a keyword in a keyword list; IDF (inverse document frequency) is used to measure the general importance corresponding to the keyword.
Specifically, the server may first obtain a TF value corresponding to a certain keyword in the keyword list by dividing the number of times that the certain keyword appears in the keyword list by the total number of times that all keywords appear in the keyword list, then obtain an IDF value corresponding to the certain keyword by dividing the total number of UEs included in the certain keyword by the number of UEs included in the certain keyword, and taking a logarithm of the result, and finally obtain a weight value corresponding to the certain keyword by multiplying the TF value corresponding to the certain keyword by the IDF value corresponding to the certain keyword.
For example, the total number of UEs is 1000, the total number of occurrences of all keywords in a keyword list corresponding to a certain UE is 5000, the number of occurrences of a certain keyword in the keyword list is 300, and the total number of UEs including the keyword in the corresponding keyword list is 10, then the TF value corresponding to the keyword is 0.06, the IDF value corresponding to the keyword is 2, and further, the weight value corresponding to the keyword is 0.12.
405. And the server sorts the keywords of each hotspot according to the sequence of the weighted values from high to low and generates a keyword list corresponding to the UE.
For the embodiment of the present invention, if there is a situation that the weight values corresponding to a plurality of keywords are the same in the keywords of each hotspot, the keywords having the same weight values may be sorted according to any order, and the embodiment of the present invention is not limited. For example, the keywords having the same weight value may be sorted in the order of TF value from high to low, or the keywords having the same weight value may be sorted in the order of IDF value from high to low.
406. The server obtains the first N keywords in the keyword list corresponding to the UE.
Wherein N is an integer greater than or equal to 1. For example, N may be 100, 120, 150, etc.
407. And the server configures the first N keywords in the keyword list corresponding to the UE as interest information corresponding to the UE.
For the embodiment of the invention, the first N keywords in the keyword list corresponding to the UE are configured as the interest information corresponding to the UE, so that the situation that the interest information corresponding to the UE is too much when all the keywords in the keyword list corresponding to the UE are configured as the interest information corresponding to the UE can be avoided, and the configuration complexity of the interest information can be reduced.
Further, as a specific implementation of the method shown in fig. 4, an embodiment of the present invention provides an interest information statistics apparatus, as shown in fig. 5, an entity of the apparatus may be a server, and the apparatus includes: an acquisition unit 51 and a configuration unit 52.
An obtaining unit 51, configured to obtain coordinate information corresponding to the user equipment UE.
The obtaining unit 51 is further configured to obtain, according to the coordinate information corresponding to the UE, a keyword of each hotspot corresponding to the UE.
The configuring unit 52 is configured to configure the keywords of each hotspot acquired by the acquiring unit 51 as interest information corresponding to the UE.
The obtaining unit 51 is further configured to obtain each hotspot where a distance between the coordinate information corresponding to the UE is smaller than or equal to a preset threshold and a keyword of each hotspot.
The apparatus may further include: sorting unit 53, generating unit 54.
The sorting unit 53 is configured to sort the keywords of each hotspot acquired by the acquiring unit 51.
A generating unit 54, configured to generate a keyword list corresponding to the UE after being sorted by the sorting unit 53.
The obtaining unit 51 is further configured to obtain the top N keywords in the keyword list corresponding to the UE generated by the generating unit 54.
Wherein N is an integer greater than or equal to 1.
The configuring unit 52 is further configured to configure the top N keywords in the keyword list corresponding to the UE acquired by the acquiring unit 51 as interest information corresponding to the UE.
The apparatus may further include: a calculation unit 55.
The calculating unit 55 is configured to calculate, according to the term frequency-inverse file frequency TF-IDF algorithm, weight values corresponding to the keywords of each hotspot acquired by the acquiring unit 51.
The sorting unit 53 is further configured to sort the keywords of each hotspot in an order from high to low according to the weight values calculated by the calculating unit 55.
The obtaining unit 51 is further configured to obtain keyword lists corresponding to the respective UEs.
The calculating unit 55 is further configured to calculate, according to the keyword list corresponding to each UE acquired by the acquiring unit 51 and according to the TF-IDF algorithm, weight values corresponding to the keywords of each hotspot respectively.
Still further, the entity of the interest information statistic device may be a server, as shown in fig. 6, and the server may include: the device comprises a processor 61, an input device 62, an output device 63 and a memory 64, wherein the input device 62, the output device 63 and the memory 64 are respectively connected with the processor 61.
And the processor 61 is configured to acquire coordinate information corresponding to the user equipment UE.
The processor 61 is further configured to obtain, according to the coordinate information corresponding to the UE, a keyword of each hotspot corresponding to the UE.
The processor 61 is further configured to configure the keywords of each hotspot as interest information corresponding to the UE.
The processor 61 is further configured to acquire each hotspot where a distance between the coordinate information corresponding to the UE is less than or equal to a preset threshold and a keyword of each hotspot.
The processor 61 is further configured to rank the keywords of each hotspot.
The processor 61 is further configured to generate a keyword list.
The processor 61 is further configured to obtain the first N keywords in the keyword list corresponding to the UE.
Wherein N is an integer greater than or equal to 1.
The processor 61 is further configured to configure the first N keywords in the keyword list corresponding to the UE as interest information corresponding to the UE.
The processor 61 is further configured to calculate weight values corresponding to the keywords of each hotspot according to a term frequency-inverse document frequency TF-IDF algorithm.
The processor 61 is further configured to sort the keywords of each hotspot in an order from high to low in weight value.
The processor 61 is further configured to obtain a keyword list corresponding to each UE.
The processor 61 is further configured to calculate, according to the keyword list corresponding to each UE and according to the TF-IDF algorithm, weight values corresponding to the keywords of each hotspot respectively.
It should be noted that, for other corresponding descriptions corresponding to each functional unit in the statistics apparatus for interest information provided in the embodiment of the present invention, reference may be made to the corresponding description in fig. 4, which is not described herein again.
According to the method and the device for counting the interest information, coordinate information corresponding to User Equipment (UE) is obtained firstly, then keywords of each hot spot corresponding to the UE are obtained according to the coordinate information corresponding to the UE, and finally the keywords of each hot spot are configured into the interest information corresponding to the UE. Compared with the prior art that the interest information corresponding to the UE is acquired through the text information, the embodiment of the invention conjectures each hotspot corresponding to the UE according to the real-time position corresponding to the UE, thereby acquiring the interest information corresponding to the UE in real time and further improving the statistical coverage rate of the interest information.
The statistical device of interest information provided by the embodiment of the present invention can implement the method embodiment provided above, and for specific function implementation, reference is made to the description in the method embodiment, which is not repeated herein. The method and the device for counting the interest information provided by the embodiment of the invention can be suitable for counting the interest information of the user, but are not limited to the method and the device.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), or the like.
The above description is only for the specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (6)

1. A statistical method for interest information, comprising:
acquiring coordinate information corresponding to User Equipment (UE);
acquiring each hotspot and a keyword of each hotspot, wherein the distance between each hotspot and the coordinate information corresponding to the UE is less than or equal to a preset threshold value;
sorting the keywords of each hotspot and generating a keyword list corresponding to the UE;
acquiring the first N keywords in a keyword list corresponding to the UE, wherein N is an integer greater than or equal to 1;
and configuring the first N keywords in the keyword list corresponding to the UE as interest information corresponding to the UE, wherein the interest information is used for inferring the interest of the user.
2. The method for statistics of interest information according to claim 1, wherein the step of ranking the keywords of the respective hotspots further comprises:
calculating the weight values corresponding to the keywords of each hotspot according to a word frequency-reverse file frequency TF-IDF algorithm;
the step of ranking the keywords of each hotspot comprises:
and sorting the keywords of each hotspot according to the sequence of the weight values from high to low.
3. The method for counting interest information according to claim 2, wherein before the step of calculating the weight values corresponding to the keywords of the respective hotspots according to a term frequency-inverse document frequency TF-IDF algorithm, the method further comprises:
acquiring a keyword list corresponding to each UE;
the step of calculating the weight values corresponding to the keywords of each hotspot according to a word frequency-inverse file frequency TF-IDF algorithm comprises the following steps:
and calculating the weight values corresponding to the keywords of each hotspot respectively according to the keyword list corresponding to each UE and a TF-IDF algorithm.
4. An apparatus for statistics of interest information, comprising:
the device comprises an acquisition unit, a processing unit and a processing unit, wherein the acquisition unit is used for acquiring coordinate information corresponding to User Equipment (UE);
the acquiring unit is further configured to acquire, according to the coordinate information corresponding to the UE, each hotspot where a distance between the coordinate information corresponding to the UE is smaller than or equal to a preset threshold and a keyword of each hotspot;
the sorting unit is used for sorting the keywords of the hotspots acquired by the acquiring unit;
a generating unit, configured to generate a keyword list corresponding to the UE after being sorted by the sorting unit;
the obtaining unit is further configured to obtain the first N keywords in the keyword list corresponding to the UE generated by the generating unit, where N is an integer greater than or equal to 1;
the configuration unit is further configured to configure the top N keywords in the keyword list corresponding to the UE acquired by the acquisition unit as interest information corresponding to the UE;
the sorting unit is further configured to sort the keywords of each hotspot in an order from high to low according to the weight values calculated by the calculating unit.
5. The apparatus for statistics of interest information according to claim 4, wherein the apparatus further comprises:
the calculating unit is used for calculating the weight values corresponding to the keywords of the hot spots acquired by the acquiring unit according to a word frequency-reverse file frequency TF-IDF algorithm;
the sorting unit is further configured to sort the keywords of each hotspot in an order from high to low according to the weight values calculated by the calculating unit.
6. The apparatus for statistics of interest information according to claim 5,
the acquiring unit is further configured to acquire a keyword list corresponding to each UE;
the calculating unit is further configured to calculate, according to the keyword list corresponding to each UE acquired by the acquiring unit, weight values corresponding to the keywords of each hotspot respectively according to a TF-IDF algorithm.
CN201310636603.6A 2013-11-27 2013-11-27 Interest information statistical method and device Active CN104679787B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201310636603.6A CN104679787B (en) 2013-11-27 2013-11-27 Interest information statistical method and device
CN202011588377.5A CN113051467A (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310636603.6A CN104679787B (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN202011588377.5A Division CN113051467A (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Publications (2)

Publication Number Publication Date
CN104679787A CN104679787A (en) 2015-06-03
CN104679787B true CN104679787B (en) 2021-01-01

Family

ID=53314841

Family Applications (2)

Application Number Title Priority Date Filing Date
CN202011588377.5A Pending CN113051467A (en) 2013-11-27 2013-11-27 Interest information statistical method and device
CN201310636603.6A Active CN104679787B (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN202011588377.5A Pending CN113051467A (en) 2013-11-27 2013-11-27 Interest information statistical method and device

Country Status (1)

Country Link
CN (2) CN113051467A (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106257448A (en) 2015-06-19 2016-12-28 阿里巴巴集团控股有限公司 The methods of exhibiting of a kind of key word and device
CN108628832B (en) * 2018-05-08 2022-03-18 中国联合网络通信集团有限公司 Method and device for acquiring information keywords
CN111475601A (en) * 2020-04-09 2020-07-31 云南电网有限责任公司电力科学研究院 Method and device for acquiring hot subject of power work order

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103064924A (en) * 2012-12-17 2013-04-24 浙江鸿程计算机系统有限公司 Travel destination situation recommendation method based on geotagged photo excavation
CN103150309A (en) * 2011-12-07 2013-06-12 清华大学 Method and system for searching POI (Point of Interest) points of awareness map in space direction

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102456055B (en) * 2010-10-28 2014-11-12 腾讯科技(深圳)有限公司 Method and device for retrieving interest points

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103150309A (en) * 2011-12-07 2013-06-12 清华大学 Method and system for searching POI (Point of Interest) points of awareness map in space direction
CN103064924A (en) * 2012-12-17 2013-04-24 浙江鸿程计算机系统有限公司 Travel destination situation recommendation method based on geotagged photo excavation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
IR-Tree: An Efficient Index for Geographic Document Search;Zhisheng Li等;《IEEE Transactions on Knowledge and Data Engineering》;20110430;第23卷(第4期);第585-599页 *

Also Published As

Publication number Publication date
CN104679787A (en) 2015-06-03
CN113051467A (en) 2021-06-29

Similar Documents

Publication Publication Date Title
Mestav et al. Bayesian state estimation for unobservable distribution systems via deep learning
AU2012339408B2 (en) Method and server for searching for nearby user in social network
CN104424235B (en) The method and apparatus for realizing user profile cluster
Chiang Networked life: 20 questions and answers
CN108520023A (en) A kind of identification of thunderstorm core and method for tracing based on Hybrid Clustering Algorithm
CN106303955A (en) For the method and apparatus that hotspot and POI are mated
CN105488044A (en) Data processing method and device
CN108882174B (en) Mobile terminal positioning method and device, electronic equipment and storage medium
Chen et al. Detecting climate change deniers on twitter using a deep neural network
CN105468598A (en) Friend recommendation method and device
Kabaila et al. Model‐averaged confidence intervals
CN104376083B (en) It is a kind of that method is recommended based on concern relation and the figure of multi-user's behavior
CN108230016B (en) Agricultural product market price transmission analysis method and analysis device
CN104679787B (en) Interest information statistical method and device
US11030532B2 (en) Information processing apparatus, information processing method, and non-transitory computer readable storage medium
Alizadeh Noughabi et al. An entropy test for the Rayleigh distribution and power comparison
CN104572757A (en) Microblog group processing method and device
CN113379176A (en) Telecommunication network abnormal data detection method, device, equipment and readable storage medium
Yao et al. A spatial co-location mining algorithm that includes adaptive proximity improvements and distant instance references
CN103646035B (en) A kind of information search method based on heuristic
US20160142357A1 (en) Method and system for exchanging messages on the basis of current position
Gan et al. Wind power ramp forecasting based on least-square support vector machine
CN104933082B (en) Evaluation information processing method and its device
CN103530789A (en) Method, device and apparatus for determining key index terms
Yuan et al. Location recommendation algorithm based on temporal and geographical similarity in location-based social networks

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant