CN103207901B - Method and apparatus for obtaining the home location of an IP address based on a search engine
- Publication number: CN103207901B
- Application number: CN201310091285.XA
- Authority: CN (China)
- Legal status: Active
- Landscape: Information Retrieval, DB Structures and FS Structures Therefor
Abstract
The present invention provides a method and apparatus for obtaining the home location of an IP address based on a search engine. The method includes: S1, obtaining user search records from a period of time, where each user search record includes a User ID, a query, and an IP address, and identifying the place names and the region-attribute words in the queries of the user search records; S2, obtaining the confidence of the region-attribute words by training on sample user search records whose IP address home locations have been labeled in advance; S3, determining the home location of the IP address according to the User IDs in the user search records, the place names and region-attribute words identified in the queries, and the confidence of the region-attribute words. The present invention can accurately obtain the home location of an IP address based on a search engine.
Description
[Technical field]
The present invention relates to Internet Protocol (IP) address technology, and in particular to a method and apparatus for obtaining the home location of an IP address based on a search engine.
[Background art]
With the continuing development of search engine technology, the regional expansion feature of search engines is receiving increasing attention. The "regional expansion feature" means that a search engine returns search results with regional characteristics according to the geographic location of the user. For example, when a user located in Beijing searches for the query "weather", the search engine returns the Beijing weather forecast. A feature of this kind meets user needs more intelligently and more precisely.
One of the keys to implementing the regional expansion feature is determining the home location of an IP address. In existing methods, usually only a network operator knows the home locations of the IP addresses it administers. A company that needs IP address home location information can obtain it only through business cooperation with third parties such as network operators, which adds cost.
[Summary of the invention]
In view of this, the present invention provides a method and apparatus for obtaining the home location of an IP address based on a search engine, which can accurately obtain the geographic location information of an IP address.
The specific technical solution is as follows:
A method for obtaining the home location of an IP address based on a search engine, the method comprising:
S1, obtaining user search records from a period of time, where each user search record includes a user identifier (ID), a query, and an IP address, and identifying the place names and the region-attribute words in the queries of the user search records;
S2, obtaining the confidence of the region-attribute words by training on sample user search records whose IP address home locations have been labeled in advance;
S3, determining the home location of the IP address according to the User IDs in the user search records, the place names and region-attribute words identified in the queries, and the confidence of the region-attribute words.
According to a preferred embodiment of the present invention, identifying the place names and the region-attribute words in the queries of the user search records in step S1 specifically includes:
S11, performing word segmentation on the queries in the user search records and identifying the place names therein;
S12, extracting the non-place-name tokens from the queries, and taking as region-attribute words those non-place-name tokens whose co-occurrence rate with place names in the queries is higher than a preset threshold.
According to a preferred embodiment of the present invention, after step S12 the method further includes:
S13, performing semantic analysis on the region-attribute words and extracting the region-attribute words whose semantic weight is higher than a preset threshold.
According to a preferred embodiment of the present invention, after step S13 the method further includes:
S14, normalizing the region-attribute words extracted in step S13 according to the category to which each region-attribute word belongs.
According to a preferred embodiment of the present invention, step S2 specifically includes:
obtaining the confidence P[M] of a region-attribute word M according to the formula [formula image not reproduced], where T[place name i] is the number of records in the training sample in which the region-attribute word M co-occurs with place name i, R[place name i] is the number of records in the training sample in which M co-occurs with place name i and the pre-labeled IP address home location is the region corresponding to place name i, and n is the number of place names that co-occur with M in the training sample.
According to a preferred embodiment of the present invention, determining the home location of the IP address in step S3 is:
calculating, according to a preset rule, the first weight with which the IP address belongs to each region corresponding to the place names, and determining the home location of the IP address according to the first weight.
According to a preferred embodiment of the present invention, calculating, according to the preset rule, the first weight with which the IP address belongs to each region corresponding to the place names specifically includes:
obtaining the first weight Z[L] with which the IP address belongs to region L according to the formula [formula image not reproduced], where Cid is the number of User IDs contained in the user search records of the IP address that contain place names, C[L, word i] is the number of User IDs corresponding to those records in which the place name corresponding to region L co-occurs with region-attribute word i, P[word i] is the confidence of region-attribute word i, and m is the number of region-attribute words in the user search records of the IP address that contain place names.
According to a preferred embodiment of the present invention, determining the home location of the IP address according to the first weight is:
among the first weights with which the IP address belongs to the regions corresponding to the place names, taking the region with the highest first weight as the home location of the IP address.
According to a preferred embodiment of the present invention, the method further includes:
S4, calculating, according to a preset rule, the second weight with which the IP address belongs to each region, based on the User IDs and the default city information that users set in a map search engine during a period of time obtained in advance;
in which case determining the home location of the IP address according to the first weight is specifically:
combining the first weight and the second weight with which the IP address belongs to each region to obtain the final home location of the IP address.
According to a preferred embodiment of the present invention, calculating the second weight with which the IP address belongs to each region specifically includes:
taking, as the second weight with which the IP address belongs to a given region, the ratio of the number of User IDs whose default city set in the map search engine belongs to that region to the total number of User IDs.
According to a preferred embodiment of the present invention, combining the first weight and the second weight with which the IP address belongs to each region to obtain the final home location of the IP address specifically includes:
multiplying the first weight and the second weight with which the IP address belongs to each region to obtain the combined weight with which the IP address belongs to each region, and taking the region with the highest combined weight as the home location of the IP address.
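The second weight and the combined weight described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names, the dictionary layout, and the sample values are hypothetical, and it assumes the User IDs counted are those observed on the IP address in question.

```python
from collections import Counter

def second_weights(default_city_by_user):
    """Second weight per region: share of User IDs whose default city is that region.

    default_city_by_user is a hypothetical mapping {user_id: default_city}.
    """
    total = len(default_city_by_user)
    counts = Counter(default_city_by_user.values())
    return {city: n / total for city, n in counts.items()}

def final_location(first, second):
    """Multiply first and second weights per region and pick the highest product."""
    # A region missing from the second weights contributes a product of 0.
    combined = {region: w * second.get(region, 0.0) for region, w in first.items()}
    best = max(combined, key=combined.get)
    return best, combined

# Hypothetical sample data for one IP address.
second = second_weights({"u1": "Nanjing", "u2": "Nanjing", "u3": "Beijing", "u4": "Nanjing"})
best, combined = final_location({"Nanjing": 0.5, "Beijing": 0.25}, second)
```

Because the two weights are multiplied, a region that no user has chosen as a default city ends with a combined weight of zero regardless of its first weight. `final_location` expects at least one region in `first`.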
An apparatus for obtaining the home location of an IP address based on a search engine, the apparatus comprising:
a preprocessing unit, configured to obtain user search records from a period of time, where each user search record includes a User ID, a query, and an IP address, and to identify the place names and the region-attribute words in the queries of the user search records;
a training unit, configured to obtain the confidence of the region-attribute words by training on sample user search records whose IP address home locations have been labeled in advance;
a determination unit, configured to determine the home location of the IP address according to the User IDs in the user search records, the place names and region-attribute words identified in the queries, and the confidence of the region-attribute words.
According to a preferred embodiment of the present invention, when identifying the place names and the region-attribute words in the queries of the user search records, the preprocessing unit specifically executes:
S21, performing word segmentation on the queries in the user search records and identifying the place names therein;
S22, extracting the non-place-name tokens from the queries, and taking as region-attribute words those non-place-name tokens whose co-occurrence rate with place names in the queries is higher than a preset threshold.
According to a preferred embodiment of the present invention, after executing S22 the preprocessing unit further executes:
S23, performing semantic analysis on the region-attribute words and extracting the region-attribute words whose semantic weight is higher than a preset threshold.
According to a preferred embodiment of the present invention, after executing S23 the preprocessing unit further executes:
S24, normalizing the region-attribute words extracted in step S23 according to the category to which each region-attribute word belongs.
According to a preferred embodiment of the present invention, the training unit specifically executes:
obtaining the confidence P[M] of a region-attribute word M according to the formula [formula image not reproduced], where T[place name i] is the number of records in the training sample in which the region-attribute word M co-occurs with place name i, R[place name i] is the number of records in the training sample in which M co-occurs with place name i and the pre-labeled IP address home location is the region corresponding to place name i, and n is the number of place names that co-occur with M in the training sample.
According to a preferred embodiment of the present invention, when determining the home location of the IP address, the determination unit specifically executes:
calculating, according to a preset rule, the first weight with which the IP address belongs to each region corresponding to the place names, and determining the home location of the IP address according to the first weight.
According to a preferred embodiment of the present invention, when calculating, according to the preset rule, the first weight with which the IP address belongs to each region corresponding to the place names, the determination unit specifically executes:
obtaining the first weight Z[L] with which the IP address belongs to region L according to the formula [formula image not reproduced], where Cid is the number of User IDs contained in the user search records of the IP address that contain place names, C[L, word i] is the number of User IDs corresponding to those records in which the place name corresponding to region L co-occurs with region-attribute word i, P[word i] is the confidence of region-attribute word i, and m is the number of region-attribute words in the user search records of the IP address that contain place names.
According to a preferred embodiment of the present invention, when determining the home location of the IP address according to the first weight, the determination unit specifically executes:
among the first weights with which the IP address belongs to the regions corresponding to the place names, taking the region with the highest first weight as the home location of the IP address.
According to a preferred embodiment of the present invention, the apparatus further includes:
a map information determination unit, configured to calculate, according to a preset rule, the second weight with which the IP address belongs to each region, based on the User IDs and the default city information that users set in a map search engine during a period of time obtained in advance;
in which case, when determining the home location of the IP address according to the first weight, the determination unit specifically executes:
combining the first weight and the second weight with which the IP address belongs to each region to obtain the final home location of the IP address.
According to a preferred embodiment of the present invention, when calculating the second weight with which the IP address belongs to each region, the map information determination unit specifically executes:
taking, as the second weight with which the IP address belongs to a given region, the ratio of the number of User IDs whose default city set in the map search engine belongs to that region to the total number of User IDs.
According to a preferred embodiment of the present invention, when combining the first weight and the second weight with which the IP address belongs to each region to obtain the final home location of the IP address, the determination unit specifically executes:
multiplying the first weight and the second weight with which the IP address belongs to each region to obtain the combined weight with which the IP address belongs to each region, and taking the region with the highest combined weight as the home location of the IP address.
As can be seen from the above technical solutions, the present invention analyzes the queries (Query) in user search records obtained in advance over a period of time, identifies the place names and region-attribute words in them, and, in combination with the trained confidence of the region-attribute words and the User IDs, obtains the home location of an IP address. It can additionally combine information such as the User IDs and the default cities that users set when using a map search engine to obtain the final home location of the IP address. The present invention enables Internet companies to use a search engine to automatically analyze and obtain the home locations of users' IP addresses.
[Brief description of the drawings]
Fig. 1 is a flowchart of the method for obtaining the home location of an IP address based on a search engine, provided by Embodiment 1 of the present invention;
Fig. 2 is a flowchart of the method, provided by Embodiment 1 of the present invention, for identifying the place names and region-attribute words in a Query;
Fig. 3 is an example of user search records labeled in advance with IP address home locations, provided by Embodiment 1 of the present invention;
Fig. 4 is an example of user search records provided by Embodiment 1 of the present invention;
Fig. 5 is an example of records of User IDs and the default city information set by users in a map search engine, provided by Embodiment 1 of the present invention;
Fig. 6 is a schematic diagram of the apparatus, provided by Embodiment 2 of the present invention, for identifying the place names and region-attribute words in a Query.
[Detailed description of the embodiments]
To make the objectives, technical solutions, and advantages of the present invention clearer, the present invention is described in detail below with reference to the drawings and specific embodiments.
Analysis of users' search behavior shows that users commonly use a search engine to obtain information related to where they are. The Queries that users submit to a search engine therefore often imply information about their geographic location. The present invention obtains the geographic location information of an IP address by analyzing users' search records over a period of time.
Embodiment 1
Fig. 1 is a flowchart of the method for obtaining the home location of an IP address based on a search engine provided by Embodiment 1 of the present invention. As shown in Fig. 1, the method includes:
S101, analyzing user search records obtained in advance over a period of time, and identifying the place names and region-attribute words in the queries (Query) searched by users.
Information from users' visits to the search engine may be recorded in advance over a period of time. This information may include the User ID, the Query the user searched, and the IP address; it is assembled into user search records and saved. The User ID is an ID assigned to a user the first time the user visits the search engine website through a browser on a terminal (PC, mobile phone, tablet computer, etc.). The ID is stored in a cookie on the user's terminal, so when the user later visits the search engine website again, the User ID can be read directly from that cookie. The length of time for which user search records are kept can be set as needed; for example, the user search records from a 30-day period may be kept. "00017255861E0FE2D25B26B6BDB1139A, 114.112.29.35, Beijing bus route 362" is an example of a user search record, in which "00017255861E0FE2D25B26B6BDB1139A" is the User ID, "114.112.29.35" is the IP address, and "Beijing bus route 362" is the Query searched by the user.
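As an illustration of the record layout in the example above, the following minimal sketch parses one record into its three fields. The comma-separated storage format, the `SearchRecord` type, and the `parse_record` function are assumptions made for this sketch; the text does not specify how records are stored.

```python
from typing import NamedTuple

class SearchRecord(NamedTuple):
    user_id: str  # ID read from the cookie on the user's terminal
    ip: str       # IP address the search came from
    query: str    # the Query the user searched

def parse_record(line: str) -> SearchRecord:
    # Split on the first two commas only, so commas inside the query survive.
    user_id, ip, query = (part.strip() for part in line.split(",", 2))
    return SearchRecord(user_id, ip, query)

record = parse_record(
    "00017255861E0FE2D25B26B6BDB1139A, 114.112.29.35, Beijing bus route 362"
)
```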
To obtain the home location of an IP address by analyzing the Queries searched by users, after the user search records for the period of time have been obtained, the Queries are further analyzed to identify the place names and region-attribute words in them. A region-attribute word is a word with high regional relevance. For example, "public transport" and "weather" have high regional relevance while "gravitation" has low regional relevance, so "public transport" and "weather" can be regarded as region-attribute words. As shown in Fig. 2, the place names and region-attribute words in a Query can be identified through the following steps S1011-S1012:
S1011, performing word segmentation on the Query and obtaining the place names in the Query.
Word segmentation is first performed on the Query to divide it into individual tokens. This processing belongs to the prior art and is not described further here. The tokens that are place names are then identified; this can be done by matching each token of the Query against the place names in a pre-established place-name dictionary.
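The dictionary matching described above can be sketched as follows, assuming the Query has already been segmented into tokens by some existing segmenter. The `PLACE_NAMES` set is a hypothetical toy dictionary and `split_tokens` is a name introduced only for this illustration.

```python
# Hypothetical toy place-name dictionary; a real one would be far larger.
PLACE_NAMES = {"Beijing", "Nanjing", "Tianjin"}

def split_tokens(tokens):
    """Separate the tokens of one segmented Query into place names and other tokens."""
    places = [t for t in tokens if t in PLACE_NAMES]
    others = [t for t in tokens if t not in PLACE_NAMES]
    return places, others

places, others = split_tokens(["Beijing", "bus", "route", "362"])
```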
Further, in this step a place name identified in the Query may be normalized to the region to which it belongs, according to the hierarchical relationship of its geographic location. For example, for the Query "how to get from Pingguoyuan to Beiying by subway", "Pingguoyuan" and "Beiying" are identified as place names. The regions to which these two place names belong can then be looked up in the pre-established place-name dictionary, which shows that both are located in Beijing. The place names "Pingguoyuan" and "Beiying" identified in the Query can therefore be normalized to "Beijing"; that is, the place name in the Query is determined to be "Beijing".
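A minimal sketch of this place-name normalization follows. The `PARENT_REGION` table is a hypothetical stand-in for the region lookup in the place-name dictionary, and the pinyin spellings "Pingguoyuan" and "Beiying" stand for the two Beijing place names in the example above.

```python
# Hypothetical lookup from a place name to the region it belongs to.
PARENT_REGION = {
    "Pingguoyuan": "Beijing",
    "Beiying": "Beijing",
    "Beijing": "Beijing",
}

def normalize_places(places):
    """Map each place name to its region, keeping each region once, in first-seen order."""
    regions = []
    for place in places:
        region = PARENT_REGION.get(place, place)  # unknown names are kept as-is
        if region not in regions:
            regions.append(region)
    return regions

regions = normalize_places(["Pingguoyuan", "Beiying"])
```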
S1012, extracting the non-place-name tokens in the Query, checking the co-occurrence rate of each non-place-name token with place names, and taking the non-place-name tokens whose co-occurrence rate with place names is higher than a preset threshold as region-attribute words.
After the Query has been segmented and its place names identified, the tokens in the Query that are not place names (hereinafter, non-place-name tokens) can be extracted, and the co-occurrence rate of each with place names checked. The co-occurrence rate with place names is the frequency with which a given non-place-name token appears in a Query together with any place name. It can be obtained as follows: over the Queries in the user search records obtained in advance for the period of time, count the number N1 of Queries in which the non-place-name token and any place name appear together, and the number N2 of Queries in which the non-place-name token appears; the co-occurrence rate of the token with place names is then N1/N2. For example, if the token "dining room" appears in 2000 Queries of the user search records, and "dining room" appears together with some place name in 400 of those Queries, the co-occurrence rate of "dining room" with place names is 400/2000 = 0.2. After the co-occurrence rate of every non-place-name token has been obtained, the non-place-name tokens whose co-occurrence rate is higher than the preset threshold are taken as region-attribute words.
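The N1/N2 computation above can be sketched as follows. The function names, the input layout (one token list per Query), and the 0.5 threshold are assumptions for illustration; the text leaves the threshold value open.

```python
from collections import Counter

def cooccurrence_rates(segmented_queries, place_names):
    """Return {token: N1/N2} for every non-place-name token."""
    place_names = set(place_names)
    n1 = Counter()  # Queries containing the token together with any place name
    n2 = Counter()  # Queries containing the token at all
    for tokens in segmented_queries:
        unique = set(tokens)  # count each token once per Query
        has_place = bool(unique & place_names)
        for token in unique - place_names:
            n2[token] += 1
            if has_place:
                n1[token] += 1
    return {token: n1[token] / n2[token] for token in n2}

def region_attribute_words(rates, threshold):
    """Keep the tokens whose co-occurrence rate is above the threshold."""
    return {token for token, rate in rates.items() if rate > threshold}

# Toy data: "dining room" co-occurs with a place name in 1 of 5 Queries.
queries = [["Beijing", "dining room"]] + [["dining room"]] * 4 + [["Nanjing", "weather"]]
rates = cooccurrence_rates(queries, {"Beijing", "Nanjing"})
selected = region_attribute_words(rates, threshold=0.5)  # threshold value is hypothetical
```

In this toy data "dining room" has the same rate, 0.2, as in the 400/2000 example above, so it falls below the illustrative threshold while "weather" passes.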
Through steps S1011-S1012 above, the region-attribute words in the Queries searched by users can be obtained. Further, core region-attribute words can be extracted from them through the following step S1013.
S1013, performing semantic analysis on the obtained region-attribute words and extracting the core region-attribute words.
Semantic analysis can be performed on the obtained region-attribute words, and a weight set for each according to how important its meaning is within the Query: the more important the meaning, the higher the weight. The region-attribute words whose weight is higher than a preset threshold are then extracted as core region-attribute words. For example, suppose a Query contains the two region-attribute words "weather" and "". After weights are set through semantic analysis, the weight of "weather" is higher than the preset threshold and the weight of "" is lower than it, so "weather" is extracted as the core region-attribute word. Analyzing the tokens in a Query and setting weights according to their meaning belongs to the prior art and is not described further here.
Core region-attribute words can be extracted from the region-attribute words through step S1013. Further, the core region-attribute words obtained can be normalized through the following step S1014 to obtain the final core region-attribute words.
The core region-attribute words obtained in step S1013 can be normalized. Normalization means mapping words of the same type to a single word. For example, "public transport", "bus", and "public bus" all belong to the category "public transport", so each of them among the core region-attribute words is normalized to "public transport"; "dining room", "restaurant", and "eatery" all belong to the category "dining room", so each of them is normalized to "dining room". These examples are illustrative only, and the embodiments of the present invention are not limited to them. Normalization of the core region-attribute words can be implemented with a pre-trained text classifier: the classifier classifies the core region-attribute words obtained, and each core region-attribute word is normalized to the category it belongs to, giving the final core region-attribute words. This method belongs to the prior art and is not described further here.
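The effect of this normalization can be illustrated with a minimal sketch. Note the substitution: the text uses a pre-trained text classifier, whereas this sketch swaps in a fixed lookup table purely to show the input and output of the step. The `CATEGORY` table and its entries are hypothetical.

```python
# Hypothetical category table standing in for the pre-trained text classifier.
CATEGORY = {
    "public transport": "public transport",
    "bus": "public transport",
    "public bus": "public transport",
    "dining room": "dining room",
    "restaurant": "dining room",
    "eatery": "dining room",
}

def normalize_word(word):
    """Map a core region-attribute word to its category; unknown words are unchanged."""
    return CATEGORY.get(word, word)

normalized = [normalize_word(w) for w in ["bus", "restaurant", "weather"]]
```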
Through steps S1011-S1014 above, the place names and the region-attribute words (or the core region-attribute words, or the final core region-attribute words after normalization) in the Queries searched by users can be identified. From this information, the home location of the IP address can be obtained through the analysis that begins at step S102.
S102, obtaining the confidence of the region-attribute words by training on sample user search records whose IP address home locations have been labeled in advance.
To obtain the home location of an IP address accurately, the confidence of the region-attribute words in the Queries is obtained first. The confidence of a region-attribute word characterizes how much influence that word has when determining the home location of an IP address. It can be obtained by training on sample user search records labeled in advance with IP address home locations. Specifically, the confidence of a given region-attribute word can be trained as follows. Obtain user search records that contain place names and are labeled in advance with IP address home locations. Count the number of records whose Query contains both the region-attribute word and each place name, denoted T[place name 1], T[place name 2], ..., T[place name n]. Also count, among the records in which the region-attribute word co-occurs with a given place name, the number whose labeled IP home location is that place, denoted R[place name 1], R[place name 2], ..., R[place name n]. The confidence of the region-attribute word, denoted P, is then given by [formula image not reproduced]. For example, Fig. 3 shows sample user search records labeled in advance with IP address home locations. To obtain the confidence of the region-attribute word "public transport" from the example in Fig. 3, count how often "public transport" co-occurs with each place name in the Queries. For instance, "public transport" and "Nanjing" appear together in the Queries of 4 records, so T[Nanjing] = 4; in 3 of those records the IP address home location is Nanjing, so R[Nanjing] = 3. T[Beijing], T[Tianjin], R[Beijing], R[Tianjin], and so on are counted for "public transport" in the same way. Finally, the confidence of "public transport" is [formula image not reproduced]. Note that if core region-attribute words were further extracted from the region-attribute words in step S101, or final core region-attribute words were obtained through further normalization, then what the above training yields is the confidence of the core region-attribute words or of the final core region-attribute words after normalization.
S103, calculating, according to a preset rule and based on the User IDs, the Queries searched by users, and the confidence of the region-attribute words, the first weight with which the IP address belongs to each region named in the Queries, and taking the region with the highest first weight as the home location of the IP address.
After the user search records have been analyzed, the place names and the region-attribute words (or the core region-attribute words, or the final core region-attribute words after normalization) in the Queries have been identified, and the confidence P of each region-attribute word (or core region-attribute word, or final core region-attribute word after normalization) has been obtained, the first weight with which the IP address belongs to each region named in its corresponding Queries can be calculated according to a preset rule, and the region with the highest first weight taken as the home location of the IP address. The following is a preferred way, provided by the present invention, of calculating the first weight with which a given IP address belongs to each region. Select the user search records of the IP address whose Query contains a place name. Count the number of User IDs in those records, denoted Cid. Count the number of User IDs corresponding to the Queries that contain both the place name of the region and each region-attribute word (or core region-attribute word, or final core region-attribute word after normalization), denoted C[place name, word 1], C[place name, word 2], ..., C[place name, word m]. The first weight with which the IP address belongs to the region, denoted Z[place name], is then given by [formula image not reproduced], where P[word 1], P[word 2], ..., P[word m] are the confidences of the respective region-attribute words (or core region-attribute words, or final core region-attribute words after normalization). The first weight with which a given IP address belongs to each region can be calculated in this way, and finally the region with the highest first weight is taken as the home location of the IP address. It should be understood that the above method of calculating the first weight is only one preferred embodiment provided by the present invention; in practical applications different rules can be set as needed to calculate the first weight with which an IP address belongs to each region, and the present invention places no limitation on this.
The calculation of the first weight value with which an IP address belongs to each region is further described below with a specific example. Fig. 4 shows example user search records, extracted from the user search records of a previously obtained period of time, whose IP address is "114.112.29.35" and whose Query contains a place name. As shown in Fig. 4, the place names of two regions, "Nanjing" and "Beijing", appear in the Queries of these records, so the above method can be used to calculate the first weight values with which the IP address belongs to "Nanjing" and to "Beijing" respectively. Three different user IDs appear in these records, so Cid = 3. Suppose that two words with a regional attribute, "public transport" and "weather", appear in the Queries of these records. "Public transport" and "Nanjing" co-occur in the records of two different user IDs, so C[Nanjing, public transport] = 2; "weather" and "Nanjing" co-occur in the records of one user ID, so C[Nanjing, weather] = 1. In the same way, C[Beijing, public transport] = 0 and C[Beijing, weather] = 1. Suppose the confidences of "public transport" and "weather" are P[public transport] = 0.6 and P[weather] = 0.75 respectively. The first weight value with which the IP address "114.112.29.35" belongs to "Nanjing" is then

$$Z[\text{Nanjing}] = \frac{2 \times 0.6 + 1 \times 0.75}{3} = 0.65$$

and the first weight value with which it belongs to "Beijing" is

$$Z[\text{Beijing}] = \frac{0 \times 0.6 + 1 \times 0.75}{3} = 0.25$$

The first weight value for "Nanjing" is higher than that for "Beijing", so the home location of the IP address is determined to be "Nanjing". It should be understood that this example is given for illustration only, and the embodiments of the invention are not limited to it.
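The worked example above can be sketched in code. This is a minimal sketch, assuming the first weight value is the confidence-weighted sum of the per-word user-ID counts divided by Cid, which reproduces the values Z[Nanjing] = 0.65 and Z[Beijing] = 0.25 used later in the text. The function name `first_weights`, the record layout and the sample records are hypothetical; Fig. 4 itself is not reproduced here, so the records are merely chosen to match the counts stated in the example.

```python
from collections import defaultdict

def first_weights(records, confidence):
    """Compute Z[place] for one IP address.

    records: iterable of (user_id, place_name, regional_words), one per
             search record whose Query contains a place name (assumed layout).
    confidence: dict mapping each regional-attribute word to its confidence P.
    """
    all_users = set()                 # distinct user IDs -> Cid
    users_by_pair = defaultdict(set)  # (place, word) -> user IDs -> C[place, word]
    places = set()
    for user_id, place, words in records:
        all_users.add(user_id)
        places.add(place)
        for word in words:
            if word in confidence:
                users_by_pair[(place, word)].add(user_id)
    cid = len(all_users)
    if cid == 0:
        return {}
    return {
        place: sum(len(users_by_pair[(place, word)]) * p
                   for word, p in confidence.items()) / cid
        for place in sorted(places)
    }

# Hypothetical records consistent with the counts in the example:
# Cid = 3, C[Nanjing, public transport] = 2, C[Nanjing, weather] = 1,
# C[Beijing, public transport] = 0, C[Beijing, weather] = 1.
records = [
    ("u1", "Nanjing", ["public transport"]),
    ("u2", "Nanjing", ["public transport"]),
    ("u2", "Nanjing", ["weather"]),
    ("u3", "Beijing", ["weather"]),
]
z = first_weights(records, {"public transport": 0.6, "weather": 0.75})
print(round(z["Nanjing"], 2), round(z["Beijing"], 2))  # 0.65 0.25
print(max(z, key=z.get))                               # Nanjing
```

Counting distinct user IDs rather than raw records follows the text, which defines both Cid and C[place name, word] as numbers of user IDs.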
With the method of steps S101-S103 above, the home location of an IP address can be obtained accurately by analyzing the query words (Query) in the user search records of a previously obtained period of time, in combination with the user IDs. Afterwards, the records whose IP address home location has been obtained can in turn be used as samples for training the confidence P of the words with a regional attribute (or core regional-attribute words, or final core regional-attribute words after normalization) used in the above method.
Further, the method provided by the invention may also include the following steps S104-S105, which use a map search engine as an additional source for obtaining the home location of a user's IP address.
S104: According to the default city information set in the map search engine by users within a previously obtained period of time, together with the user IDs, calculate according to a preset rule the second weight value with which the IP address belongs to each region.
In general, when a map search engine provides a map search service, it lets the user set a default city, so that on visiting the map search engine website the user can search for map information directly within that default city. The default city a user sets in the map search engine is usually the place where the user is located. Therefore, the home location of an IP address can be obtained by analyzing the default city information set by users in the map search engine within a period of time, in combination with the user IDs.
The default city set when a user accesses the map search engine website within a period of time, together with information such as the user ID and the IP address, can be saved in advance as a record. For example, "43179D117F6AC7BD4856744B31F4E0E8, 125.34.37.129, Beijing" is one saved record of a user's default city, user ID and IP address, in which "43179D117F6AC7BD4856744B31F4E0E8" is the user ID, "125.34.37.129" is the user's IP address, and "Beijing" is the default city set by the user.
The home location of an IP address can then be obtained from the default city information set in the map search engine by users within the previously obtained period of time, together with the user IDs. Specifically, the second weight value Z[map, place name] with which the user IP address belongs to each city is calculated from the default city information and the user IDs, and the city with the highest second weight value Z[map, place name] is taken as the home location of the IP address. Here, the second weight value Z[map, place name] with which the IP address belongs to a given city is the ratio of the number of user IDs in the previously obtained records whose default city is that city to the total number of user IDs. Fig. 5 shows example records of default city information and user IDs for the IP address "218.25.103.196". As shown in Fig. 5, the records obtained for this IP address contain 4 user IDs, of which 3 have the default city "Shenyang" and 1 has the default city "Changchun". The second weight value with which the IP address belongs to "Shenyang" is therefore Z[map, Shenyang] = 3/4 = 0.75, and the second weight value with which it belongs to "Changchun" is Z[map, Changchun] = 1/4 = 0.25, so the home location of the IP address is determined to be "Shenyang".
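The ratio described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `second_weights` and the record layout are hypothetical, and the sketch assumes each user ID has a single default city (if a user ID appears more than once, the last record wins).

```python
from collections import Counter

def second_weights(records):
    """Compute Z[map, city] for one IP address.

    records: iterable of (user_id, default_city) pairs saved for that IP
             address (assumed layout).
    """
    city_by_user = dict(records)           # one default city per user ID
    counts = Counter(city_by_user.values())
    total = len(city_by_user)              # total number of user IDs
    return {city: n / total for city, n in counts.items()}

# The Fig. 5 example: 4 user IDs, 3 with "Shenyang" and 1 with "Changchun".
records = [
    ("u1", "Shenyang"),
    ("u2", "Shenyang"),
    ("u3", "Shenyang"),
    ("u4", "Changchun"),
]
z_map = second_weights(records)
print(z_map["Shenyang"], z_map["Changchun"])  # 0.75 0.25
print(max(z_map, key=z_map.get))              # Shenyang
```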
Through step S104, the home location of an IP address can be obtained from information such as the default city set by users in the map search engine and the user IDs. Afterwards, step S105 can integrate the home location obtained from the query words (Query) and user IDs in the user search records with the home location obtained from the default cities set in the map search engine and the user IDs.
S105: Integrate the home location of the IP address according to the first weight value and the second weight value with which the IP address belongs to each region.
The home location of the IP address can be integrated according to the first weight value and the second weight value with which the IP address belongs to each region, specifically in the following manner:
The first weight value Z[place name] and the second weight value Z[map, place name] with which the IP address belongs to the same region are multiplied to obtain the combined weight value with which the IP address belongs to each region, and the region with the highest combined weight value is taken as the final home location of the IP address. For example, suppose the first weight values with which an IP address belongs to "Nanjing" and "Beijing", obtained from the user IDs and the query words (Query) searched by users, are Z[Nanjing] = 0.65 and Z[Beijing] = 0.25, and the second weight values with which it belongs to "Nanjing" and "Beijing", obtained from the default cities set in the map search engine and the user IDs, are Z[map, Nanjing] = 0.45 and Z[map, Beijing] = 0.3. The combined weight value with which the IP address belongs to "Nanjing" is then Z[Nanjing] × Z[map, Nanjing] = 0.2925, and the combined weight value with which it belongs to "Beijing" is Z[Beijing] × Z[map, Beijing] = 0.075. The combined weight value for "Nanjing" is higher, so the final home location of the IP address is determined to be "Nanjing".
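The multiplication step can be sketched as follows. The function name `combine_weights` is hypothetical. The text does not say how to treat a region that has only one of the two weight values; this sketch assumes such regions are left out, which has the same effect on the ranking as treating the missing weight as 0.

```python
def combine_weights(z_query, z_map):
    """Multiply first and second weight values region by region.

    z_query: dict region -> first weight value Z[place name]
    z_map:   dict region -> second weight value Z[map, place name]
    Returns (combined weights, region with the highest combined weight).
    """
    # Assumption: only regions present in both inputs are considered.
    shared = z_query.keys() & z_map.keys()
    combined = {region: z_query[region] * z_map[region] for region in shared}
    best = max(combined, key=combined.get) if combined else None
    return combined, best

combined, best = combine_weights(
    {"Nanjing": 0.65, "Beijing": 0.25},
    {"Nanjing": 0.45, "Beijing": 0.3},
)
print(round(combined["Nanjing"], 4), round(combined["Beijing"], 4))  # 0.2925 0.075
print(best)                                                          # Nanjing
```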
The above describes the method provided by Embodiment one of the invention. As can be seen, based on a search engine, the invention can accurately determine the home location of a user IP address from the user IDs and query words (Query) in previously obtained user search records. The invention can also obtain the home location of an IP address from the default city information that users set in a map search engine together with the user IDs, and integrate the results of the two methods to obtain a more accurate result. With the method provided by the invention, an Internet company can use search engine data to determine where its users are located, and can therefore offer them search services with regional characteristics.
Embodiment two
Fig. 6 is a schematic diagram of a device for obtaining the home location of an IP address based on a search engine, provided by Embodiment two of the invention. As shown in Fig. 6, the device includes a preprocessing unit 10, a training unit 20 and a judgment unit 30, and may further include a map information judgment unit 40.
The preprocessing unit 10 obtains the user search records within a period of time, each user search record including a user ID, a query word and an IP address, and identifies the place names and the words with a regional attribute in the query words of the user search records.
The information generated when users access the search engine within a period of time can be recorded in advance. This information may include the user ID, the query word (Query) searched by the user and the IP address, and is saved in the form of user search records. The user ID is an ID assigned to the user when the user first accesses the search engine website through a browser on the user's PC. It is stored in a Cookie on the user's PC, so that when the user later accesses the search engine website again, the user ID can be read directly from that Cookie. The length of time for which user search records are kept can be set as needed; for example, the user search records of the last 30 days may be kept. "00017255861E0FE2D25B26B6BDB1139A, 114.112.29.35, Beijing Route 362 public transport" is an example of a user search record, in which "00017255861E0FE2D25B26B6BDB1139A" is the user ID, "114.112.29.35" is the IP address, and "Beijing Route 362 public transport" is the query word (Query) searched by the user.
In order to determine the home location of an IP address from the query words (Query) searched by users, the preprocessing unit 10, after obtaining the user search records of a period of time, can further analyze the Queries to identify the place names and the words with a regional attribute in them. A word with a regional attribute is a word that is strongly correlated with regions. For example, "public transport" and "weather" are strongly correlated with regions, whereas "gravitation" is only weakly correlated, so "public transport" and "weather" can be regarded as words with a regional attribute. The preprocessing unit 10 can perform the following operations S2011-S2012 to identify the place names and the words with a regional attribute in a Query:
S2011: Perform word segmentation on the Query and obtain the place names in the Query.
The preprocessing unit 10 can first perform word segmentation on the Query, dividing it into individual tokens. This process is prior art and is not described further here. The tokens that are place names are then identified; the preprocessing unit 10 can do this by matching each token of the Query against the place names in a pre-established place-name dictionary.
Further, in this step the preprocessing unit 10 can also normalize each place name identified in the Query to the region it belongs to, according to its geographical containment relationship. For example, suppose a Query asks how to get from "apple orchard" to "northern shadow" by subway, and "apple orchard" and "northern shadow" are identified as place names. The regions these two place names belong to can then be looked up in the pre-established place-name dictionary, which shows that both are located in Beijing. The place names "apple orchard" and "northern shadow" identified in the Query can therefore be normalized to "Beijing"; that is, the place name in the Query is determined to be "Beijing".
S2012: Extract the tokens in the Query that are not place names, check the co-occurrence rate of each non-place-name token with place names, and take the non-place-name tokens whose co-occurrence rate with place names is higher than a preset threshold as words with a regional attribute.
After the Query has been segmented and the place names in it identified, the preprocessing unit 10 can extract the tokens that are not place names and check the co-occurrence rate of each such token with place names. The co-occurrence rate with place names is the frequency with which a given non-place-name token appears in a Query together with any place name. The preprocessing unit 10 can obtain it as follows: among the query words (Query) searched by users within the previously obtained period of time, count the number N1 of Queries in which the token appears together with any place name, and the number N2 of Queries in which the token appears at all; the co-occurrence rate of the token with place names is then N1/N2. For example, if the token "dining room" appears in 2000 Queries of the user search records of the previously obtained period, and appears together with some place name in 400 of them, the co-occurrence rate of "dining room" with place names is 400/2000 = 0.2. Once the co-occurrence rate of each non-place-name token has been obtained, the tokens whose co-occurrence rate is higher than the preset threshold are taken as words with a regional attribute.
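The N1/N2 statistic and the threshold test can be sketched as follows. This assumes the Queries have already been segmented and their place names identified; the function names, the input layout and the sample data are hypothetical, and the threshold is a free parameter because the text only calls it "preset".

```python
from collections import Counter

def cooccurrence_rates(queries):
    """Return N1/N2 for every non-place-name token.

    queries: iterable of (tokens, place_names) pairs, one per Query, where
             place_names is the subset of tokens recognised as place names.
    """
    n1 = Counter()  # Queries containing the token together with any place name
    n2 = Counter()  # Queries containing the token at all
    for tokens, place_names in queries:
        for token in set(tokens) - set(place_names):
            n2[token] += 1
            if place_names:
                n1[token] += 1
    return {token: n1[token] / n2[token] for token in n2}

def regional_words(queries, threshold):
    """Tokens whose co-occurrence rate with place names exceeds the threshold."""
    return {t for t, rate in cooccurrence_rates(queries).items() if rate > threshold}

# Scaled-down version of the "dining room" example (1 of 5 instead of 400 of 2000).
queries = [
    (["Beijing", "dining room"], ["Beijing"]),
    (["dining room", "menu"], []),
    (["dining room", "design"], []),
    (["dining room", "table"], []),
    (["dining room"], []),
    (["Nanjing", "weather"], ["Nanjing"]),
]
rates = cooccurrence_rates(queries)
print(rates["dining room"])                    # 0.2
print(regional_words(queries, threshold=0.5))  # {'weather'}
```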
By performing operations S2011-S2012 above, the preprocessing unit 10 can obtain the words with a regional attribute in the query words (Query) searched by users. Further, the preprocessing unit 10 can also perform the following operation S2013 to extract core regional-attribute words from the words with a regional attribute so obtained.
S2013: Perform word-sense analysis on the obtained words with a regional attribute, and extract the core regional-attribute words.
The preprocessing unit 10 can perform word-sense analysis on the obtained words with a regional attribute and assign each of them a weight according to how important its sense is within the Query: the more important the sense of a word, the higher its weight value. The words with a regional attribute whose weight value is higher than a preset threshold are then extracted as core regional-attribute words. For example, suppose a Query contains two words with a regional attribute, "weather" and " ". After weights have been set by word-sense analysis, the weight value of "weather" is higher than the preset threshold and that of " " is lower than it, so "weather" is extracted as the core regional-attribute word. Performing word-sense analysis on the tokens of a Query and setting weights according to word sense is prior art and is not described further here.
After performing operation S2013, the preprocessing unit 10 has extracted the core regional-attribute words from the words with a regional attribute. Further, the preprocessing unit 10 can also perform the following operation S2014 to normalize the core regional-attribute words and obtain the final core regional-attribute words.
S2014: Normalize the obtained core regional-attribute words to obtain the final core regional-attribute words.
The preprocessing unit 10 can normalize the core regional-attribute words obtained in step S2013. Normalization here means mapping words of the same type to a single representative. For example, "public transport", "bus" and "bus" all belong to the category "public transport", so the words "public transport", "bus" and "bus" among the core regional-attribute words are all normalized to "public transport"; likewise, "dining room", "restaurant" and "restaurant" all belong to the category "dining room", so they are all normalized to "dining room". It should be understood that these examples are given for illustration only, and the embodiments of the invention are not limited to them. The normalization of core regional-attribute words can be implemented with a pre-trained text classifier: the classifier classifies the obtained core regional-attribute words, and each word is normalized to the category it belongs to, giving the final core regional-attribute words. This method is prior art and is not described further here.
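The effect of this normalization can be illustrated with a small sketch. Note that it swaps in a plain lookup table for the pre-trained text classifier named in the text, purely to show the input and output of the step; the table contents and the function name `normalize` are hypothetical.

```python
# Hypothetical category table standing in for the pre-trained text classifier.
CATEGORY_OF = {
    "public transport": "public transport",
    "bus": "public transport",
    "dining room": "dining room",
    "restaurant": "dining room",
}

def normalize(word, category_of=CATEGORY_OF):
    """Map a core regional-attribute word to its category.

    Words the table does not know are returned unchanged (an assumption of
    this sketch; a real classifier would always assign some category).
    """
    return category_of.get(word, word)

core_words = ["bus", "restaurant", "weather"]
print([normalize(w) for w in core_words])
# ['public transport', 'dining room', 'weather']
```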
After performing operations S2011-S2014 above, the preprocessing unit 10 has identified the place names in the query words (Query) searched by users, as well as the words with a regional attribute (or core regional-attribute words, or final core regional-attribute words after normalization).
The training unit 20 is configured to obtain the confidence of the words with a regional attribute by training on samples consisting of user search records whose IP address home location has been labelled in advance.
In order to obtain the home location of an IP address more accurately, the training unit 20 can obtain the confidence of the words with a regional attribute in the Queries. The confidence of a word with a regional attribute characterizes how much influence that word has when the home location of an IP address is determined. It can be obtained by training on samples consisting of user search records whose IP address home location has been labelled in advance. Specifically, the training unit 20 can perform the following operations to train the confidence of a given word with a regional attribute: obtain the user search records that contain place names and whose IP address home location has been labelled in advance; count the number of those records whose Query contains both the word and each place name, denoted T[place name 1], T[place name 2], ..., T[place name n] respectively; and at the same time count, among the records in which the word co-occurs with a given place name, the number whose IP home location is that place, denoted R[place name 1], R[place name 2], ..., R[place name n] respectively. The confidence of the word with a regional attribute, denoted P, is then computed from these counts.
It should be noted that if the preprocessing unit 10 has further extracted core regional-attribute words from the words with a regional attribute, or has further normalized them into final core regional-attribute words, then what the training unit 20 obtains in the above training process is the confidence of the core regional-attribute words or of the final core regional-attribute words after normalization.
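The counting of T and R described above can be sketched as follows. The formula that combines them into P is not reproduced in this text, so the last step is an assumption: this sketch takes P as the total of the R counts divided by the total of the T counts, that is, the fraction of co-occurrence records in which the place name in the Query matches the labelled home location. Averaging the per-place ratios R/T would be another plausible reading. The function name and sample layout are hypothetical.

```python
from collections import Counter

def train_confidence(samples, word):
    """Estimate the confidence P of one regional-attribute word.

    samples: iterable of (place_name, regional_words, labelled_home) tuples,
             one per labelled search record whose Query contains a place name.
    """
    t = Counter()  # T[place]: records where the word co-occurs with the place name
    r = Counter()  # R[place]: those of the T records whose labelled home is that place
    for place, words, labelled_home in samples:
        if word in words:
            t[place] += 1
            if labelled_home == place:
                r[place] += 1
    total = sum(t.values())
    # Assumed combination rule: sum of R over sum of T.
    return sum(r.values()) / total if total else 0.0

samples = [
    ("Nanjing", ["weather"], "Nanjing"),
    ("Nanjing", ["weather"], "Nanjing"),
    ("Beijing", ["weather"], "Beijing"),
    ("Beijing", ["weather"], "Shanghai"),   # user asked about another city
    ("Beijing", ["public transport"], "Beijing"),
]
print(train_confidence(samples, "weather"))  # 0.75
```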
The judgment unit 30 is configured to determine the home location of the IP address according to the user IDs in the user search records, the place names and the words with a regional attribute identified in the query words, and the confidence of the words with a regional attribute.
Preferably, the first weight value with which the IP address belongs to each region corresponding to the place names can be calculated according to a preset rule, and the home location of the IP address determined according to the first weight value.
After the user search records have been analyzed, the place names and the words with a regional attribute (or core regional-attribute words, or final core regional-attribute words after normalization) in the query words (Query) have been identified, and the confidence P of each such word has been obtained, the judgment unit 30 can calculate, according to a preset rule, the first weight value with which the IP address belongs to each region corresponding to a place name in its Queries, and take the region with the highest first weight value as the home location of the IP address. The following is a preferred embodiment provided by the invention for calculating the first weight value with which a given IP address belongs to each region: select the user search records of that IP address whose Query contains a place name, and count the number of user IDs in those records, denoted Cid; then count the number of user IDs corresponding to the Queries that contain both the place name of a region and each word with a regional attribute (or core regional-attribute word, or final core regional-attribute word after normalization), denoted C[place name, word 1], C[place name, word 2], ..., C[place name, word m] respectively. The first weight value with which the IP address belongs to that region, denoted Z[place name], is then

$$Z[\text{place name}] = \frac{1}{\mathrm{Cid}} \sum_{i=1}^{m} C[\text{place name}, \text{word } i] \cdot P[\text{word } i]$$

where P[word 1], P[word 2], ..., P[word m] are the confidences of the respective words with a regional attribute (or core regional-attribute words, or final core regional-attribute words after normalization). The first weight value with which a given IP address belongs to each region can be calculated in this way, and the region with the highest first weight value is finally taken as the home location of the IP address. It should be understood that the above method of calculating the first weight value is only one preferred embodiment provided by the invention; in practical applications, different rules may be set as needed to calculate the first weight value with which an IP address belongs to each region, and the invention places no limitation on this.
With the preprocessing unit 10, training unit 20 and judgment unit 30 described above, the home location of an IP address can be obtained accurately by analyzing the query words (Query) in the user search records of a previously obtained period of time, in combination with the user IDs. Afterwards, the records whose IP address home location has been obtained can in turn be used as samples for training the confidence P of the words with a regional attribute (or core regional-attribute words, or final core regional-attribute words after normalization) used above.
Further, the device provided by the invention may also include the following map information judgment unit 40, which uses a map search engine as an additional source for obtaining the home location of a user's IP address.
The map information judgment unit 40 is configured to calculate, according to a preset rule, the second weight value with which the IP address belongs to each region, from the default city information set in the map search engine by users within a previously obtained period of time and the user IDs.
In general, when a map search engine provides a map search service, it lets the user set a default city, so that on visiting the map search engine website the user can search for map information directly within that default city. The default city a user sets in the map search engine is usually the place where the user is located. Therefore, the home location of an IP address can be obtained by analyzing the default city information set by users in the map search engine within a period of time, in combination with the user IDs.
The default city set when a user accesses the map search engine website within a period of time, together with information such as the user ID and the IP address, can be saved in advance as a record. For example, "43179D117F6AC7BD4856744B31F4E0E8, 125.34.37.129, Beijing" is one saved record of a user's default city, user ID and IP address, in which "43179D117F6AC7BD4856744B31F4E0E8" is the user ID, "125.34.37.129" is the user's IP address, and "Beijing" is the default city set by the user.
Afterwards, the map information judgment unit 40 can obtain the home location of the IP address from the default city information set in the map search engine by users within the previously obtained period of time, together with the user IDs. Specifically, the map information judgment unit 40 can perform the following operations: calculate, from the default city information and the user IDs, the second weight value Z[map, place name] with which the user IP address belongs to each city, and take the city with the highest second weight value Z[map, place name] as the home location of the IP address. Here, the second weight value Z[map, place name] with which the IP address belongs to a given city is the ratio of the number of user IDs in the previously obtained records whose default city is that city to the total number of user IDs.
The map information judgment unit 40 can thus obtain the home location of an IP address from information such as the default city set by users in the map search engine and the user IDs. Afterwards, the judgment unit 30 can further integrate the home location obtained from the query words (Query) and user IDs in the user search records with the home location obtained from the default cities set in the map search engine and the user IDs.
The judgment unit 30 can integrate the home location of the IP address according to the first weight value and the second weight value with which the IP address belongs to each region, specifically in the following manner:
The first weight value Z[place name] and the second weight value Z[map, place name] with which the IP address belongs to the same region are multiplied to obtain the combined weight value with which the IP address belongs to each region, and the region with the highest combined weight value is taken as the final home location of the IP address. For example, suppose the first weight values with which an IP address belongs to "Nanjing" and "Beijing", obtained from the user IDs and the query words (Query) searched by users, are Z[Nanjing] = 0.65 and Z[Beijing] = 0.25, and the second weight values with which it belongs to "Nanjing" and "Beijing", obtained from the default cities set in the map search engine and the user IDs, are Z[map, Nanjing] = 0.45 and Z[map, Beijing] = 0.3. The combined weight value with which the IP address belongs to "Nanjing" is then Z[Nanjing] × Z[map, Nanjing] = 0.2925, and the combined weight value with which it belongs to "Beijing" is Z[Beijing] × Z[map, Beijing] = 0.075. The combined weight value for "Nanjing" is higher, so the final home location of the IP address is determined to be "Nanjing".
The above are merely preferred embodiments of the invention and are not intended to limit it. Any modification, equivalent substitution or improvement made within the spirit and principles of the invention shall fall within the scope of protection of the invention.
Claims (20)
1. A method for obtaining the home location of an Internet Protocol (IP) address based on a search engine, characterized in that the method comprises:
S1: obtaining the user search records within a period of time, each user search record including a user identifier (ID), a query word and a user IP address, and identifying the place names and the words with a regional attribute in the query words of the user search records;
S2: obtaining the confidence of the words with a regional attribute by training on samples consisting of user search records whose IP address home location has been labelled in advance;
S3: determining the home location of the IP address according to the user IDs in the user search records, the place names and the words with a regional attribute identified in the query words, and the confidence of the words with a regional attribute;
wherein S2 specifically comprises:
obtaining the confidence P[M] of a word M with a regional attribute by using a formula over T[place name i] and R[place name i] (i = 1, ..., n), wherein T[place name i] is the number of records in the training samples in which the word M co-occurs with place name i, R[place name i] is the number of records in the training samples in which the word M co-occurs with place name i and the pre-labelled IP address home location is the region corresponding to place name i, and n is the number of place names that co-occur with M in the training samples.
2. The method according to claim 1, characterized in that identifying, in step S1, the place names and the words with a regional attribute in the query words of the user search records specifically comprises:
S11: segmenting the query words in the user search records and identifying the place names therein;
S12: extracting the non-place-name tokens in the query words, and taking the non-place-name tokens whose co-occurrence rate with place names in the query words is higher than a preset threshold as words with a regional attribute.
3. The method according to claim 2, characterized in that, after step S12, the method further comprises:
S13: performing word-sense analysis on the words with a regional attribute, and extracting the words with a regional attribute whose word-sense weight value is higher than a preset threshold.
4. The method according to claim 3, characterized in that, after step S13, the method further comprises:
S14: normalizing the words with a regional attribute extracted in step S13 according to the category to which each word belongs.
5. The method according to any one of claims 1 to 4, characterized in that determining the home location of the IP address in step S3 comprises:
calculating, according to a preset rule, the first weight value with which the IP address belongs to each region corresponding to the place names, and determining the home location of the IP address according to the first weight value.
6. The method according to claim 5, characterized in that calculating, according to a preset rule, the first weight value with which the user IP address belongs to each region corresponding to the place names specifically comprises:
obtaining the first weight value Z[L] with which the IP address belongs to region L according to the formula

$$Z[L] = \frac{1}{\mathrm{Cid}} \sum_{i=1}^{m} C[L, \text{word } i] \cdot P[\text{word } i]$$

wherein Cid is the number of user IDs contained in the user search records of the IP address that contain a place name, C[L, word i] is the number of user IDs corresponding to the records, among those user search records, in which the place name corresponding to region L co-occurs with the word i with a regional attribute, P[word i] is the confidence of the word i with a regional attribute, and m is the number of words with a regional attribute in those user search records.
7. The method according to claim 5, characterized in that determining the home location of the IP address according to the first weight value comprises:
taking, among the first weight values with which the IP address belongs to each region corresponding to the place names, the region with the highest first weight value as the home location of the IP address.
8. The method according to claim 5, characterized in that the method further comprises:
S4: calculating, according to a preset rule, the second weight value with which the IP address belongs to each region, from the default city information set in a map search engine by users within a previously obtained period of time and the user IDs;
wherein determining the home location of the IP address according to the first weight value specifically comprises:
integrating the first weight value and the second weight value with which the IP address belongs to each region to obtain the final home location of the IP address.
9. The method according to claim 8, characterized in that calculating the second weight value with which the IP address belongs to each region specifically comprises:
taking the ratio of the number of user IDs whose default city set in the map search engine, as previously obtained, belongs to a given region to the total number of user IDs as the second weight value with which the IP address belongs to that region.
10. The method according to claim 8, characterized in that combining the first weight value and the second weight value of the IP address
belonging to each region to obtain the final home location of the IP address specifically comprises:
Multiplying the first weight value by the second weight value of the IP address belonging to each region to obtain a combined weight value
for each region, and taking the region with the highest combined weight value as the home location of the IP address.
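The multiplication step can be sketched as follows. The claim does not say how to treat a region that has a first weight value but no second weight value; this sketch assumes such a region gets a combined weight of zero. The function name is hypothetical.

```python
from typing import Dict, Optional


def combined_home_region(
    first: Dict[str, float], second: Dict[str, float]
) -> Optional[str]:
    """Multiply the two weights per region and return the region with the
    highest product, or None if there are no candidate regions."""
    combined = {
        # A region missing from `second` is treated as weight 0.0 (an assumption).
        region: weight * second.get(region, 0.0)
        for region, weight in first.items()
    }
    if not combined:
        return None
    return max(combined, key=lambda region: combined[region])
```

Because the weights are multiplied, a region needs support from both the search records and the map settings to rank highly; a strong signal from only one source can be outweighed.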
11. A device for obtaining the home location of an IP address based on a search engine, characterized in that the device comprises:
A preprocessing unit, configured to obtain user search records within a period of time, each user search record including a User ID, query terms
and an IP address, and to identify the place-name words and the words with a regional attribute in the query terms of the user search records;
A training unit, configured to obtain the confidence of the words with a regional attribute by training on user search records
whose IP address home location has been labeled in advance;
A judgement unit, configured to determine the home location of the IP address according to the User IDs in the user search records, the place-name words
and the words with a regional attribute identified in the query terms, and the confidence of the words with a regional attribute;
Wherein the training unit is specifically configured to obtain, according to a formula (not reproduced in this text), the confidence P[M] of a word M with a regional attribute,
where T[place name i] is the number of records in the training sample in which the word M co-occurs with place-name word i,
R[place name i] is the number of those co-occurrence records in which the pre-labeled home location of the IP address is
the region corresponding to place-name word i, and n is the number of place-name words that co-occur with M in the training sample.
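Since the formula for P[M] is missing from this text, the following Python sketch is only one plausible reading of the definitions of T, R and n: it assumes the pooled ratio P[M] = (sum of R[place name i]) / (sum of T[place name i]) over the n place names that co-occur with M. An average of the per-place ratios R/T would be another reading consistent with the same definitions. The function name and argument layout are hypothetical.

```python
from typing import Dict


def word_confidence(t_counts: Dict[str, int], r_counts: Dict[str, int]) -> float:
    """Assumed form: P[M] = sum_i R[place i] / sum_i T[place i].

    t_counts -- place name -> records where word M co-occurs with that place name
    r_counts -- place name -> those records whose labeled IP home location
                is the region of that place name
    """
    total_cooccurrences = sum(t_counts.values())
    if total_cooccurrences == 0:
        return 0.0
    # Only place names that actually co-occur with M are counted.
    matching = sum(r_counts.get(place, 0) for place in t_counts)
    return matching / total_cooccurrences
```

Under this reading, a word that mostly appears with the name of the searcher's own region (for example a word for local weather) scores close to 1, while a word that often appears with distant place names (for example a word for travel) scores lower.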
12. The device according to claim 11, characterized in that, when identifying the place-name words and the words with a regional attribute
in the query terms of the user search records, the preprocessing unit specifically performs:
S21. Segmenting the query terms in the user search records into words and identifying the place-name words among them;
S22. Extracting the non-place-name segments from the query terms, and taking as words with a regional attribute those non-place-name segments
whose rate of co-occurrence with place-name words in the query terms is higher than a preset threshold.
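Steps S21 and S22 can be sketched in Python under several stated assumptions. Whitespace splitting stands in for a real word segmenter, place names are recognized by lookup in a hypothetical gazetteer set, and the co-occurrence rate of a word is taken to be the fraction of queries containing that word that also contain a place name. The claim does not define the rate precisely, so this definition is an assumption.

```python
from collections import Counter
from typing import Iterable, Set


def regional_attribute_words(
    queries: Iterable[str], place_names: Set[str], threshold: float
) -> Set[str]:
    """Return non-place-name words whose co-occurrence rate with place names
    exceeds the threshold."""
    seen = Counter()        # queries containing the word
    with_place = Counter()  # queries containing the word and some place name
    for query in queries:
        tokens = set(query.split())          # S21: stand-in for segmentation
        has_place = bool(tokens & place_names)
        for word in tokens - place_names:    # S22: non-place-name segments
            seen[word] += 1
            if has_place:
                with_place[word] += 1
    return {word for word in seen if with_place[word] / seen[word] > threshold}
```

With a threshold of 0.6, a word that appears in three queries, two of which also contain a place name, has a rate of about 0.67 and is kept; a word that never appears next to a place name is dropped.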
13. The device according to claim 12, characterized in that, after performing S22, the preprocessing unit further performs:
S23. Carrying out semantic analysis on the words with a regional attribute, and extracting those words with a regional attribute
whose semantic weight value is higher than a preset threshold.
14. The device according to claim 13, characterized in that, after performing S23, the preprocessing unit further performs:
S24. Normalizing the words with a regional attribute extracted in step S23 according to the category to which each
word belongs.
15. The device according to any one of claims 11 to 14, characterized in that, when determining the home location of the IP address,
the judgement unit specifically performs:
Calculating, according to a preset rule, a first weight value of the IP address belonging to each region corresponding to the place-name words,
and determining the home location of the IP address according to the first weight value.
16. The device according to claim 15, characterized in that, when calculating according to a preset rule the first weight value of the IP address
belonging to each region corresponding to the place-name words, the judgement unit specifically performs:
Obtaining, according to a formula (not reproduced in this text), the first weight value Z[L] of the IP address belonging to region L,
where Cid is the number of User IDs contained in the search records of the IP address that contain place-name words, C[L, word i] is
the number of User IDs associated with those search records in which the place-name word for region L co-occurs with the regional-attribute word i,
P[word i] is the confidence of the regional-attribute word i, and m is the number of words with a regional attribute
in the search records of the IP address that contain place-name words.
17. The device according to claim 15, characterized in that, when determining the home location of the IP address according to the first weight value,
the judgement unit specifically performs:
Among the first weight values of the IP address belonging to each region corresponding to the place-name words, taking the region with the highest
first weight value as the home location of the IP address.
18. The device according to claim 15, characterized in that the device further comprises:
A map-information judgement unit, configured to calculate, according to a preset rule, a second weight value of the IP address belonging to each region,
based on the User IDs and on the default-city information, obtained in advance, that users set in google maps
during a period of time;
When determining the home location of the IP address according to the first weight value, the judgement unit specifically performs:
Combining the first weight value and the second weight value of the IP address belonging to each region to obtain the final home location of the IP address.
19. The device according to claim 18, characterized in that, when calculating the second weight value of the IP address belonging to each region,
the map-information judgement unit specifically performs:
Taking, as the second weight value of the IP address belonging to a given region, the ratio of the number of User IDs whose default city
set in google maps (obtained in advance) belongs to that region to the total number of User IDs.
20. The device according to claim 18, characterized in that, when combining the first weight value and the second weight value of the IP address
belonging to each region to obtain the final home location of the IP address, the judgement unit specifically performs:
Multiplying the first weight value by the second weight value of the IP address belonging to each region to obtain a combined weight value
for each region, and taking the region with the highest combined weight value as the home location of the IP address.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310091285.XA CN103207901B (en) | 2013-03-21 | 2013-03-21 | A kind of method and apparatus that IP address ownership place is obtained based on search engine |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103207901A CN103207901A (en) | 2013-07-17 |
CN103207901B CN103207901B (en) | 2019-03-08 |
Family
ID=48755123
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310091285.XA Active CN103207901B (en) | 2013-03-21 | 2013-03-21 | A kind of method and apparatus that IP address ownership place is obtained based on search engine |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103207901B (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104780235B (en) * | 2014-01-14 | 2019-08-06 | 腾讯科技(深圳)有限公司 | IP attribution inquiry method, device and server |
CN104780234B (en) * | 2014-01-14 | 2019-09-17 | 腾讯科技(深圳)有限公司 | IP attribution inquiry method, apparatus and system |
CN104168163A (en) * | 2014-08-27 | 2014-11-26 | 福建富士通信息软件有限公司 | Intelligent network line quality detection and data analysis method |
CN105335480A (en) * | 2015-10-13 | 2016-02-17 | 国家电网公司 | Internet website liability subject identifying method |
CN106096040B (en) * | 2016-06-29 | 2019-06-04 | 中国人民解放军国防科学技术大学 | Organization web ownership place method of discrimination and its device based on search engine |
CN106357835B (en) * | 2016-09-05 | 2020-03-06 | 百度在线网络技术(北京)有限公司 | Method and equipment for determining region of target IP address |
CN111327721B (en) * | 2020-02-28 | 2023-01-10 | 加和(北京)信息科技有限公司 | IP address positioning method and device, storage medium and electronic device |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102012900A (en) * | 2009-09-04 | 2011-04-13 | 阿里巴巴集团控股有限公司 | An information retrieval method and system |
CN102033947A (en) * | 2010-12-22 | 2011-04-27 | 百度在线网络技术(北京)有限公司 | Region recognizing device and method based on retrieval word |
CN102880721A (en) * | 2012-10-15 | 2013-01-16 | 瑞庭网络技术(上海)有限公司 | Implementation method of vertical search engine |
CN102932492A (en) * | 2011-09-12 | 2013-02-13 | 微软公司 | Correlation of users to ip address lease events |
Non-Patent Citations (2)
Title |
---|
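Using "Baidu" to search network information resources; Huang Xi'an; Sci-Tech Information Development & Economy; 2005-12-31; Vol. 15 (No. 4); pp. 257-259 |
Understanding user behavior based on geographic information; Xie Xing et al.; https://wenku.baidu.com/view/927fed7202768e73876.html; 2011-02-21; main text, p. 6 |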
Also Published As
Publication number | Publication date |
---|---|
CN103207901A (en) | 2013-07-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103207901B (en) | A kind of method and apparatus that IP address ownership place is obtained based on search engine | |
CN110472066B (en) | Construction method of urban geographic semantic knowledge map | |
CN101320375B (en) | Digital book search method based on user click action | |
CN107577688B (en) | Original article influence analysis system based on media information acquisition | |
CN103955505B (en) | A kind of event method of real-time and system based on microblogging | |
CN103678576B (en) | The text retrieval system analyzed based on dynamic semantics | |
CN105244031A (en) | Speaker identification method and device | |
CN110162695A (en) | A kind of method and apparatus of information push | |
CN106933947B (en) | A kind of searching method and device, electronic equipment | |
CN105843850B (en) | Search optimization method and device | |
CN109582969A (en) | Methodology for Entities Matching, device and electronic equipment | |
US20140025701A1 (en) | Query expansion | |
CN105653706A (en) | Multilayer quotation recommendation method based on literature content mapping knowledge domain | |
CN105718585B (en) | Document and label word justice correlating method and its device | |
CN106204156A (en) | A kind of advertisement placement method for network forum and device | |
CN103226578A (en) | Method for identifying websites and finely classifying web pages in medical field | |
CN109558587B (en) | Method for classifying public opinion tendency recognition aiming at category distribution imbalance | |
CN102541936A (en) | Method and device for acquiring popularity of POI (Point of Interest) | |
CN109636495A (en) | A kind of online recommended method of scientific and technological information based on big data | |
CN110737821B (en) | Similar event query method, device, storage medium and terminal equipment | |
CN108241690A (en) | A kind of data processing method and device, a kind of device for data processing | |
CN110705292B (en) | Entity name extraction method based on knowledge base and deep learning | |
CN102646124A (en) | Method for automatically identifying address information | |
WO2010096986A1 (en) | Mobile search method and device | |
CN110012122A (en) | A kind of domain name similarity analysis method of word-based embedded technology |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||