CN103207901B - A kind of method and apparatus that IP address ownership place is obtained based on search engine - Google Patents

A kind of method and apparatus that IP address ownership place is obtained based on search engine Download PDF

Info

Publication number
CN103207901B
CN103207901B CN201310091285.XA CN201310091285A CN103207901B CN 103207901 B CN103207901 B CN 103207901B CN 201310091285 A CN201310091285 A CN 201310091285A CN 103207901 B CN103207901 B CN 103207901B
Authority
CN
China
Prior art keywords
word
address
user
weighted value
record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310091285.XA
Other languages
Chinese (zh)
Other versions
CN103207901A (en
Inventor
阮星华
才鑫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201310091285.XA priority Critical patent/CN103207901B/en
Publication of CN103207901A publication Critical patent/CN103207901A/en
Application granted granted Critical
Publication of CN103207901B publication Critical patent/CN103207901B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a kind of method and apparatus for obtaining IP address ownership place based on search engine, wherein method includes: that the user obtained in a period of time searches for record, it includes User ID, query word and IP address that the user, which searches for record, and identifies the ground noun that the user searches in the query word of record and the word with Regional Property;S2, the confidence level of the word with Regional Property is obtained as sample training using user's search record of preparatory mark IP address ownership place;S3, the User ID searched in record according to the user, the confidence level of the ground noun in the query word identified and word and the word with Regional Property with Regional Property, determine the ownership place of the IP address.The present invention can accurately obtain the ownership place of IP address based on search engine.

Description

A kind of method and apparatus that IP address ownership place is obtained based on search engine
[technical field]
The present invention relates to Internet protocol (IP) addressing techniques, more particularly to one kind is based on search engine acquisition IP The method and apparatus of location ownership place.
[background technique]
With the continuous development of search engine technique, the regional expansion function of search engine is also increasingly by the weight of people Depending on." regional expansion function " i.e. search engine, which refers to be returned according to the geographical location where user to user, has searching for regional characteristic Rope is as a result, for example, being located at Pekinese's user search queries word is " weather ", then it is pre- to return to Pekinese's weather to it for search engine It notifies breath, similar " regional expansion function " intelligently can more accurately meet user demand.
And one of the key point for realizing " regional expansion function " is exactly the ownership place for determining IP address.Existing method In, usually only network operator's ownership place for will appreciate that its administrative IP address needs the public affairs of IP address information of home location Department can only be obtained by business associate to third parties such as network operators, and certain cost is increased.
[summary of the invention]
In view of this, the present invention provides a kind of method and apparatus for obtaining IP address ownership place based on search engine, energy Enough accurate geographical location information obtained where IP address.
Specific technical solution is as follows:
A method of IP address ownership place is obtained based on search engine, this method comprises:
S1, the user obtained in a period of time search for record, and it includes user identifier (ID), inquiry that the user, which searches for record, Word and IP address, and identify the ground noun that the user searches in the query word of record and the word with Regional Property;
S2, record is searched for using the user of preparatory mark IP address ownership place obtain described having region as sample training The confidence level of the word of attribute;
S3, the User ID in recording, the ground noun in the query word identified are searched for according to the user and is had The confidence level of the word of Regional Property and the word with Regional Property, determines the ownership place of the IP address.
One example is preferably implemented according to the present invention, the place name that the user searches in the query word of record is identified in step S1 Word and word with Regional Property specifically include:
S11, the query word searched in record to the user segment, and identify ground therein noun;
S12, the non-place name extracted in query word segment, and will be higher than preset threshold with co-occurrence rate of the ground noun in query word Non- place name participle as with Regional Property word.
According to one preferred embodiment of the present invention, after the step S12 further include:
S13, meaning of a word analysis is carried out to the word with Regional Property, extracts the band that meaning of a word weighted value is higher than preset threshold There is the word of Regional Property.
According to one preferred embodiment of the present invention, after the step S13 further include:
The generic of the word of S14, basis with Regional Property has Regional Property to what the step S13 was extracted Word be normalized.
According to one preferred embodiment of the present invention, the step S2 is specifically included:
According to formulaObtain the confidence level P [M] of the word M with Regional Property, wherein T [ Name i] it is the word M and the record number of ground noun i co-occurrence that Regional Property is had in the training sample, R [place name i] is the training Word M and the IP address ownership place marked in advance when ground noun i co-occurrence in sample with Regional Property is corresponding for the ground noun i The record number of region, n are the ground noun number in training sample with M co-occurrence.
According to one preferred embodiment of the present invention, the ownership place of the IP address is determined described in step S3 are as follows:
The first power that IP address belongs to the corresponding each region of described ground noun is calculated according to preset rule Weight values determine the ownership place of the IP address according to first weighted value.
According to one preferred embodiment of the present invention, institute is belonged to according to preset rule calculating IP address described When stating the first weighted value of the corresponding each region of ground noun, specifically include:
According to formulaObtain the first power that IP address belongs to region L Weight values Z [L], wherein Cid is that the user of the IP address containing ground noun searches for the User ID number for including, C [L, word in record I] it is that the user of the IP address containing ground noun searches in record the region corresponding ground L noun and has Regional Property User ID number corresponding to the record of the co-occurrence of word i, P [word i] are the confidence level of the word i with Regional Property, and m is described contains The user of the IP address of ground noun searches for the number of the word in record with Regional Property.
According to one preferred embodiment of the present invention, the ownership place that the IP address is determined according to first weighted value are as follows:
IP address is belonged in the first weighted value of the corresponding each region of described ground noun, the first weighted value highest Ownership place of the region as the IP address.
According to one preferred embodiment of the present invention, this method further include:
S4, according to the default urban information that the user in a period of time for obtaining in advance is arranged in google maps with And User ID, the second weighted value that IP address belongs to each region is calculated according to preset rule;
The ownership place that the IP address is determined according to first weighted value specifically:
The first weighted value and the second weighted value that IP address belongs to each region are integrated, the final ownership of IP address is obtained Ground.
According to one preferred embodiment of the present invention, second weighted value for calculating IP address and belonging to each region, specifically Include:
The default city that the user obtained in advance is arranged in google maps is belonged to the user of a certain region The ratio of ID number and total User ID number belongs to the second weighted value of a certain region as IP address.
According to one preferred embodiment of the present invention, first weighted value and second integrated IP address and belong to each region Weighted value, the final ownership place for obtaining IP address specifically include:
IP address is belonged into the first weighted value of each region and the second weighted value is multiplied, IP address is obtained and belongs to respectively The synthetic weights weight values of a region, and using the highest region of synthetic weights weight values as the ownership place of IP address.
A kind of device obtaining IP address ownership place based on search engine, the device include:
Pretreatment unit searches for record for obtaining the user in a period of time, and it includes user that the user, which searches for record, ID, query word and IP address, and identify ground noun and belong to region that the user searches in the query word of record The word of property;
Training unit obtains institute for searching for record as sample training using the user of preparatory mark IP address ownership place State the confidence level of the word with Regional Property;
Judgement unit, for searching for the User ID in recording, the place name in the query word identified according to the user The confidence level of word and word and the word with Regional Property with Regional Property, determines the ownership place of the IP address.
According to one preferred embodiment of the present invention, the pretreatment unit is in the query word for identifying user's search record Ground noun and when word with Regional Property, it is specific to execute:
S21, the query word searched in record to the user segment, and identify ground therein noun;
S22, the non-place name extracted in query word segment, and will be higher than preset threshold with co-occurrence rate of the ground noun in query word Non- place name participle as with Regional Property word.
According to one preferred embodiment of the present invention, the pretreatment unit also executes after executing S22:
S23, meaning of a word analysis is carried out to the word with Regional Property, extracts the band that meaning of a word weighted value is higher than preset threshold There is the word of Regional Property.
According to one preferred embodiment of the present invention, the pretreatment unit also executes after executing S23:
The generic of the word of S24, basis with Regional Property has Regional Property to what the step S23 was extracted Word be normalized.
According to one preferred embodiment of the present invention, the training unit specifically executes:
According to formulaObtain the confidence level P [M] of the word M with Regional Property, wherein T [ Name i] it is the word M and the record number of ground noun i co-occurrence that Regional Property is had in the training sample, R [place name i] is the training Word M and the IP address ownership place marked in advance when ground noun i co-occurrence in sample with Regional Property is corresponding for the ground noun i The record number of region, n are the ground noun number in training sample with M co-occurrence.
According to one preferred embodiment of the present invention, the judgement unit is specific to execute in the ownership place for determining the IP address:
The first power that IP address belongs to the corresponding each region of described ground noun is calculated according to preset rule Weight values determine the ownership place of the IP address according to first weighted value.
According to one preferred embodiment of the present invention, the judgement unit calculates IP address according to preset rule and returns It is specific to execute when belonging to the first weighted value of the corresponding each region of described ground noun:
According to formulaObtain the first power that IP address belongs to region L Weight values Z [L], wherein Cid is that the user of the IP address containing ground noun searches for the User ID number for including, C [L, word in record I] it is that the user of the IP address containing ground noun searches in record the region corresponding ground L noun and has Regional Property User ID number corresponding to the record of the co-occurrence of word i, P [word i] are the confidence level of the word i with Regional Property, and m is described contains The user of the IP address of ground noun searches for the number of the word in record with Regional Property.
According to one preferred embodiment of the present invention, the judgement unit determines the ownership of the IP address according to first weighted value It is specific to execute when ground:
IP address is belonged in the first weighted value of the corresponding each region of described ground noun, the first weighted value highest Ownership place of the region as the IP address.
According to one preferred embodiment of the present invention, the device further include:
Cartographic information judgement unit, for being set in google maps according to the user in a period of time obtained in advance The default urban information and User ID set calculate the second power that IP address belongs to each region according to preset rule Weight values;
It is specific to execute when the judgement unit determines the ownership place of the IP address according to first weighted value:
The first weighted value and the second weighted value that IP address belongs to each region are integrated, the final ownership of IP address is obtained Ground.
According to one preferred embodiment of the present invention, the cartographic information judgement unit calculates IP address and belongs to each region It is specific to execute when the second weighted value:
The default city that the user obtained in advance is arranged in google maps is belonged to the user of a certain region The ratio of ID number and total User ID number belongs to the second weighted value of a certain region as IP address.
According to one preferred embodiment of the present invention, the judgement unit integrates the first weight that IP address belongs to each region Value and the second weighted value, specific to execute when obtaining the final ownership place of IP address:
IP address is belonged into the first weighted value of each region and the second weighted value is multiplied, IP address is obtained and belongs to respectively The synthetic weights weight values of a region, and using the highest region of synthetic weights weight values as the ownership place of IP address.
As can be seen from the above technical solutions, user searches for record in a period of time that the present invention is obtained in advance by analysis In query word Query, identify therein ground noun and the word with Regional Property, and combined training obtain have region The word and User ID of attribute, can obtain the ownership place of IP address, use map according to user at the same time it can also combine The information such as the default city being arranged when search engine and User ID, the final ownership place of Integration obtaining IP address.This hair It is bright that Internet company is enabled to automatically analyze the ownership place for obtaining User ID address using search engine.
[Detailed description of the invention]
Fig. 1 is the method flow diagram for obtaining IP address ownership place provided by the embodiment of the present invention one based on search engine;
Fig. 2 is the provided ground noun identified in query word Query of the embodiment of the present invention one and has Regional Property Word method flow diagram;
Fig. 3 is the user's search record exemplary diagram for marking IP address ownership place provided by the embodiment of the present invention one in advance;
Fig. 4 is that user provided by the embodiment of the present invention one searches for record exemplary diagram;
Fig. 5 be the default urban information that is arranged in google maps of user provided by the embodiment of the present invention one and User ID records exemplary diagram;
Fig. 6 is the provided ground noun identified in query word Query of the embodiment of the present invention two and has Regional Property Word schematic device.
[specific embodiment]
To make the objectives, technical solutions, and advantages of the present invention clearer, right in the following with reference to the drawings and specific embodiments The present invention is described in detail.
Search behavior of the user using search engine when is analyzed it can be found that user usually can be obtained by search engine For information about, therefore, user often implies its geographical position in the query word Query that search engine is searched in its location The information set.The present invention is exactly based on the search record of user in analysis a period of time to obtain the geographical location of IP address Information.
Embodiment one
Fig. 1 is the method flow diagram for obtaining IP address ownership place provided by the embodiment of the present invention one based on search engine, As shown in Figure 1, this method comprises:
The user in a period of time that S101, analysis obtain in advance searches for record, the query word Query of identification user's search In ground noun and word with Regional Property.
Can in pre-recorded a period of time user access search engine when information, those information may include user Those information are formed user's search record and saved by the query word Query and IP address that ID, user search for. Wherein, User ID is the browser access search engine net in user's first pass terminal (PC, mobile phone, tablet computer etc.) When standing, for the ID of user's distribution, which is stored in the Cookie at the end user PC, is drawn later when user accesses search again When holding up website, User ID directly can be obtained from the Cookie at the end user PC.The length of time for saving user's search record can To be set as needed, for example, the user that can be saved in 30 days searches for record."00017255861E0FE2D25B26 B6BDB1139A, 114.112.29.35,362 tunnel public transport of Beijing " is the example that a user searches for record, wherein " 000172 55861E0FE2D25B26B6BDB1139A " is User ID, and " 114.112.29.35 " is IP address, and " 362 tunnel of Beijing is public Hand over " it is the query word Query that user searches for.
Query word Query in order to be searched for according to user analyzes to obtain the ownership place of IP address, can obtain After searching for record to the user in a period of time obtained in advance, further analysis handles the query word Query of user's search, with Identification ground noun and the word with Regional Property from Query.Word with Regional Property refers to that region correlation is higher Word, for example, the region correlation of " public transport " and " weather " is higher, and the region correlation of " gravitation " is lower, it is believed that " public transport " and " weather " is the word with Regional Property.As shown in Fig. 2, can be identified by following step S1011-S1012 Ground noun in Query and the word with Regional Property:
S1011, word segmentation processing is carried out to Query, and obtains the ground noun in Query.
Word segmentation processing first can be carried out to Query, Query is divided into independent participle one by one, which belongs to existing There is technology, does not repeat excessively herein.Later, the participle for belonging to ground noun in the participle of Query is identified, it can be by by Query In participle matched with the ground noun in the dictionary of place name pre-established respectively to complete this identification process.
It further, can also be in this step by the place name root in the Query identified according to its geographical location Subordinate relation by its normalizing be its affiliated region, for example, a certain query word Query be " by the subway apple orchard to north shadow how Walk ", identify wherein " apple orchard " and " northern shadow " be place name, can further be inquired in the dictionary of place name pre-established this two The affiliated region of a place name learns that " apple orchard " and " northern shadow " is all located at Beijing, therefore, can will identify in the Query Ground noun " apple orchard " and " northern shadow " normalizing be " Beijing ", that is, differentiate that ground noun in the Query is " Beijing ".
S1012, non-place name participle in Query is extracted, and checks the co-occurrence rate of each non-place name participle and ground noun, by it In with the co-occurrence rate of ground noun be higher than the non-place name participle of preset threshold as the word with Regional Property.
After Query is segmented and identifies ground therein noun, the participle of non-ground noun in Query can be extracted (subsequent to be known as non-place name participle), and check the co-occurrence rate of each non-place name participle and ground noun.Co-occurrence rate with ground noun is Refer to that a certain non-place name participle appears in the frequency in Query, each non-place name participle and ground noun with all ground noun simultaneously Co-occurrence rate can be obtained by following methods: same in the query word Query of the user's search for a period of time that statistics obtains in advance When occur a certain non-place name occur in the Query number N1 and query word Query of a certain non-place name participle and any ground noun The Query number N2 of participle, then the co-occurrence rate of a certain non-place name participle and ground noun is N1/N2.For example, " dining room " this participle Occurred in 2000 Query that the user in a period of time obtained in advance searches for record, and " dining room " and any place name Word occurred in 400 Query jointly, then the co-occurrence rate of " dining room " and ground noun is 400/2000=0.2.It is obtaining often The non-place name for being higher than preset threshold with the co-occurrence rate of ground noun is segmented and is made with after the co-occurrence rate of ground noun by one non-place name participle For the word with Regional Property.
Regional Property is had in the query word Query of the available user's search of S1011-S1012 through the above steps Word, further, can also by following step S1013 in the obtained word with Regional Property with extracting core Domain Properties word.
S1013, meaning of a word analysis is carried out to the obtained word with Regional Property, and extracts core Regional Property word.
Meaning of a word analysis can be carried out to the obtained word with Regional Property, according to each word with Regional Property Significance level of the meaning of a word in Query sets weight for each word with Regional Property, wherein the meaning of a word is more important to be had The weighted value of the word of Regional Property is higher, can finally extract the word conduct with Regional Property that weighted value is higher than preset threshold Core Regional Property word.For example, have in a certain Query " weather " and " " two words for having a Regional Property, pass through the meaning of a word point After analysis setting weight, the weighted value of " weather " is higher than preset threshold, and " " weighted value be less than preset threshold, therefore, extract " weather " is used as core Regional Property word.Part of speech analysis is carried out to the participle in Query, and weight is set according to the meaning of a word and is belonged to now There is technology, does not repeat excessively herein.
Core Regional Property word can be extracted from the word with Regional Property by step S1013, further, Obtained core Regional Property word can also be normalized by following step S1014, obtain final core Regional Property word.
Core Regional Property word obtained in step S1013 can be normalized, normalized refers to will Belong to same type of word to be normalized, for example, " public transport ", " bus ", " bus " belong to " public transport " this kind Not, therefore, " public transport " in core Regional Property word, " bus ", " bus " are all normalized to " public transport ", " meal The Room ", " restaurant ", " restaurant " belong to " dining room " this classification, therefore, by core Regional Property word " dining room ", " restaurant ", " restaurant " is all normalized to " dining room ".It is understood that the example above is merely for exemplary purpose, the embodiment of the present invention is not It is limited to this.The normalized of core Regional Property word can be realized by preparatory trained text classifier, that is, With preparatory trained text classification, it classifies to obtained core Regional Property word, and each core region is belonged to Property word be normalized to its generic, obtain final core Regional Property word, this method belongs to the prior art, not excessive herein It repeats.
S1011-S1014 can recognize that the ground noun in the query word Query of user's search through the above steps, and Word (or final core Regional Property word after core Regional Property word, or normalization) with Regional Property, can pass through step Rapid S102 obtains the ownership place of IP address according to those information analyses.
S102, it obtains belonging to region as sample training using user's search record of preparatory mark IP address ownership place The confidence level of the word of property.
In order to accurately obtain the ownership place of IP address, the confidence of the word with Regional Property in Query can be first obtained Degree, the confidence level of a certain word with Regional Property are to characterize the word with Regional Property when differentiating IP address ownership place The significance level of influence power.The confidence level of word with Regional Property can pass through the use to be labelled with IP address ownership place in advance Search record in family obtains as training after sample, can specifically be trained by following methods obtain it is a certain with Regional Property The confidence level of word: user's search record of IP address ownership place is obtained with ground noun and is labelled in advance, statistics is in those notes The word of Regional Property and the record number of each ground noun are had comprising this simultaneously in the Query of record, is denoted as T [place name 1], T respectively [place name 2] ... T [place name n], while counting should word and some place name Term co-occurrence with Regional Property in those records When, IP ownership place be the place name record number, be denoted as respectively R [place name 1], R [place name 2] ... R [place name n], by this with ground The confidence level of the word of Domain Properties is denoted as P, thenFor example, Fig. 3 is to be labelled in advance The user of IP address ownership place searches for record exemplary diagram, and the word " public transport " with Regional Property is obtained from example shown in Fig. 3 Confidence level, then count " public transport " and the co-occurrence frequency of each place name in Query, e.g., " public transport " and " Nanjing " are recorded at 4 Query in occurred together, then [Nanjing]=4 T, wherein the IP address ownership place for having 3 records are Nanjing, then R [Nanjing]= 3, it is also possible to T [Beijing], T [Tianjin], R [Beijing], R [Tianjin] etc. be counted for " public transport ", finally, " public transport " is set Reliability is It should be noted that if in step s101 further from Core Regional Property word is extracted in word with Regional Property, or the final core region after further being normalized Attribute word, then training obtains being the final core region category after core Regional Property word or normalization in above-mentioned training process The confidence level of property word.
S103, it is pressed in advance according to the confidence level of User ID, the query word Query that user searches for and the word with Regional Property The rule of setting calculates the first weighted value of IP address with belonging to each region in Query noun, by the first weighted value highest Ownership place of the region as IP address.
In the search record of analysis user, identifies the ground noun in the query word Query of user's search and have region The word (or final core Regional Property word after core Regional Property word, or normalization) of attribute, and obtaining each band It, can after having the confidence level P of word (or final core Regional Property word after core Regional Property word, or normalization) of Regional Property To calculate the first weight of IP address with belonging to each region in its corresponding Query noun according to preset rule Value, and using the highest region of the first weighted value as the ownership place of IP address.It is below a kind of preferred implementation provided by the invention Mode calculates the first weighted value that a certain IP address belongs to each region: with choosing the IP containing ground noun in Query The user of location searches for record, counts the User ID number that the user containing the IP address searches in record, is denoted as Cid, statistics is simultaneously Comprising the region noun and each word (or core Regional Property word, or the final core after normalizing with Regional Property Regional Property word) Query corresponding to User ID number, be denoted as respectively C [place name, word 1], C [place name, word 2] ... C [place name, Word m], the first weighted value which belongs to the region is denoted as Z [place name], then Wherein, word 1, word 2 ... word m refers to that (the final core region after or core Regional Property word, or normalization belongs to each word with Regional Property Property word) confidence level.The first weighted value that a certain IP address belongs to each region can be calculated by the above method, finally, Using the highest region of the first weighted value as the ownership place of IP address.It is understood that above-mentioned calculating IP address belongs to respectively The method of first weighted value of a region is only a kind of preferred embodiment provided by the invention, in practical applications can be according to need Different rules is set to calculate the first weighted value that IP address belongs to each region, the present invention is without limitation.
It above-mentioned IP address is further described below by specific example belongs to the first weighted value of each region and calculated Journey, for example, Fig. 4 is to search for the IP address extracted in record example from the user in a period of time obtained in advance to be Containing region the user of noun searches for record exemplary diagram in " 114.112.29.35 " and Query, as shown in figure 4, those users search Occur the ground noun of the two regions of " Nanjing " and " Beijing " in the Query of Suo Jilu altogether, then can be distinguished using the above method Calculate the first weighted value that the IP belongs to " Nanjing " and " Beijing ".Searched in record in those users, occur altogether 3 it is different User ID, then Cid=3, it is assumed that have " public transport " and " weather " two words for ground in the Query that those users search for record The word of Domain Properties occurred, then C wherein " public transport " and " Nanjing " are searched in record in the corresponding user of two different user ID altogether [Nanjing, public transport]=2, " weather " and " Nanjing " is searched in record in the user that 1 User ID is answered altogether to be occurred, then [Nanjing, the day C Gas]=1, it is also possible to obtain C [Beijing, public transport]=0, C [Beijing, weather]=1, it is assumed that the confidence level of " public transport " and " weather " point Not Wei P [public transport]=0.6, P [weather]=0.75, then IP address " 114.112.29.35 " belongs to first weighted value in " Nanjing " For Belong to " north First weighted value in capital " is As it can be seen that the first weighted value that the IP address belongs to " Nanjing " is higher than the first weighted value for belonging to " Beijing ", therefore, determining should The ownership place of IP address is " Nanjing ".It is understood that the example above is merely for exemplary purpose, the embodiment of the present invention is not It is limited to this.
Method provided by above-mentioned steps S101-S103 can be searched by analyzing user in a period of time obtained in advance Query word Query in Suo Jilu, and User ID is combined, the accurate ownership place for obtaining IP address.It later, can be further The record that IP address ownership place will be taken, which obtains in the above method as sample for training, has Regional Property The confidence level P of word (or final core Regional Property word after core Regional Property word, or normalization).
Further, method provided by the present invention can also include the following steps S104-S105 to combine map to search Index holds up the ownership place for obtaining the IP address of user.
S104, the default urban information being arranged in google maps according to the user in a period of time obtained in advance And User ID, the second weighted value that IP address belongs to each region is calculated according to preset rule.
In general, google maps when providing a user map search service, can set default city for user, with Just user can believe search correlation map when using google maps website is accessed directly in the default city of its setting Breath, and default city of the user set by google maps is often exactly its location, therefore, analysis is used in a period of time The ownership place of default urban information and the combination available IP address of User ID that family is arranged in google maps.
It can be in advance by default city set when user's access google maps website in a period of time, Yi Jiyong The information such as family ID and IP address save after forming record.Such as " 43179D117F6AC7BD4856744B31F4E0E8, 125.34.37.129, the note in default city set by user and User ID and IP address that Beijing " is saved by one Record, wherein " 43179D117F6AC7BD4856744B31F4E0E8 " be User ID, " 125.34.37.129 " for User IP Location, " Beijing " are default city set by user.
The default urban information that can be arranged in google maps according to the user in a period of time obtained in advance And User ID, obtain the ownership place of IP address, specific method can be with are as follows: according to the default urban information of user setting with And User ID calculates User IP and belongs to the second weighted value Z [map, place name] of different cities, and by the second weighted value Z [ Figure, place name] ownership place of highest city as IP address, wherein User IP belongs to the second weighted value Z in a certain city [map, place name] is the User ID number in the city and the ratio of total User ID number for default city in the record that obtains in advance Example.Fig. 5 be IP address be " 218.25.103.196 " default urban information and User ID record exemplary diagram, as shown in figure 5, In the record of the acquired IP address, share 4 User ID, wherein the corresponding default city of 3 User ID be " Shenyang ", 1 The corresponding default city of a User ID is " Changchun ", then the IP address belongs to the second weighted value Z [map, Shenyang] in " Shenyang " =3/4=0.75, which belongs to the second weighted value Z [map, Changchun]=1/4=0.25 in " Changchun ", therefore, it is determined that the IP The ownership place of address is " Shenyang ".
By step S104 can according to user using google maps when the default city that is arranged and User ID etc. Information obtains the ownership place of IP address.Later, it can further be searched in record by step S105 to according to user The ownership place for the User IP that query word Query and User ID obtain, and write from memory according to what user was arranged in google maps The ownership place for recognizing the User IP of city and User ID acquisition is integrated.
S105, the first weighted value for belonging to each region according to IP address and the second weighted value are to IP address Ownership place integrated.
The first weighted value and the second weighted value that each region can be belonged to according to IP address are to IP address Ownership place integrated, can specifically be realized using following manner:
IP address is belonged to the first weighted value Z [place name] and the second weighted value Z [map, place name] phase of same region Multiply, obtains the synthetic weights weight values that IP address belongs to each region, and using the highest region of synthetic weights weight values as IP address Final ownership place.For example, a certain IP address belongs to " Nanjing " according to what the query word Query that User ID and user search for was obtained First weighted value in " Beijing " is respectively Z [Nanjing]=0.65, Z [Beijing]=0.25, which searches according to user in map The second weighted value for belonging to " Nanjing " and " Beijing " that index holds up the default city of middle setting and User ID obtains is respectively Z [map, Nanjing]=0.45, Z [map, Beijing]=0.3, then it is Z [Nanjing] that the IP address, which belongs to the synthetic weights weight values in " Nanjing ", Z [map, Nanjing]=0.2925, the synthetic weights weight values for belonging to " Nanjing " are Z [Beijing] Z [map, Beijing]=0.075, the IP Synthetic weights weight values of the address attribution in " Nanjing " are higher, determine that the final ownership place of the IP address is " Nanjing ".
The above-mentioned description to be carried out to method provided by the embodiment of the present invention one, it can be seen that the present invention can be based on Search engine searches for the User ID in recording and query word Query with accurately analyzing User IP according to the user obtained in advance The ownership place of location, meanwhile, the present invention can also be obtained according to the default urban information and User ID that user is arranged in map The ownership place of IP address, and the analysis result of two methods is integrated, obtain more accurate result.By this hair It is bright provided by method, enable Internet company using Search Engine Analysis obtain user location, so as into One step provides a user the search service with regional characteristic.
Embodiment two
Fig. 6 is a kind of device signal that IP address ownership place is obtained based on search engine provided by the embodiment of the present invention two Figure, as shown in fig. 6, the device includes: pretreatment unit 10, training unit 20 and judgement unit 30, which can also be into one Step includes cartographic information judgement unit 40.
Pretreatment unit 10 obtains the user in a period of time and searches for record, which searches for record and include User ID, look into Word and IP address are ask, and identifies ground noun that the user searches in the query word of record and with Regional Property Word.
Can in pre-recorded a period of time user access search engine when information, those information may include user Those information are formed user's search record and saved by the query word Query and IP address that ID, user search for. Wherein, User ID is in the browser access search engine web site at the end user first pass PC, for the ID of user's distribution, the use Family ID is stored in the Cookie at the end user PC, can be directly from user later when user accesses search engine web site again User ID is obtained in the Cookie at the end PC.The length of time for saving user's search record, which can according to need, to be set, for example, The user that can be saved in 30 days searches for record." 00017255861E0FE2D25B26B6BDB1139A, 114.112.29.35, 362 tunnel public transport of Beijing " is the example that a user searches for record, wherein " 00017255861E0FE2D25B26B6BDB113 9A " is User ID, and " 114.112.29.35 " is IP address, and " 362 tunnel public transport of Beijing " is the query word of user's search Query。
Query word Query in order to be searched for according to user analyzes to obtain the ownership place of IP address, and pretreatment is single Member 10 can further analyze looking into for processing user's search after the user in a period of time obtained in advance searches for record Word Query is ask, to identify ground noun from Query and with the word of Regional Property.Word with Regional Property refers to region The higher word of correlation, for example, the region correlation of " public transport " and " weather " is higher, and the region correlation of " gravitation " compared with It is low, it is believed that " public transport " and " weather " is the word with Regional Property.Pretreatment unit 10 can execute operations described below S2011-S2012 identifies the ground noun in Query and the word with Regional Property:
S2011, word segmentation processing is carried out to Query, and obtains the ground noun in Query.
Pretreatment unit 10 first can carry out word segmentation processing to Query, and Query is divided into independent participle one by one, The process belongs to the prior art, does not repeat excessively herein.Later, the participle for belonging to ground noun in the participle of Query is identified, it is pre- to locate Manage unit 10 can be by matching the participle in Query with the ground noun in the dictionary of place name pre-established come complete respectively At this identification process.
Further, pretreatment unit 10 can also be in this step by the place name root in the Query identified According to its geographical location subordinate relation by its normalizing be its affiliated region, for example, a certain query word Query be " apple by the subway Garden to northern shadow how to get to ", identify that wherein " apple orchard " and " northern shadow ", can be further in the ground noun pre-established for place name The affiliated region that the two place names are inquired in allusion quotation learns that " apple orchard " and " northern shadow " is all located at Beijing, therefore, can will be at this The ground noun " apple orchard " and " northern shadow " normalizing identified in Query is " Beijing ", that is, differentiates that the ground noun in the Query is " Beijing ".
S2012, the participle for extracting non-ground noun in Query, and check the co-occurrence rate of each non-place name participle and ground noun, The non-place name participle of preset threshold will be wherein higher than with the co-occurrence rate of ground noun as the word for having Regional Property.
After Query is segmented and identifies ground therein noun, pretreatment unit 10 can be extracted in Query non-ly The participle of noun, and check the co-occurrence rate of each non-place name participle and ground noun.Refer to the co-occurrence rate of ground noun a certain non- Noun participle appears in the frequency in Query with all ground noun simultaneously, and pretreatment unit 10 can execute operations described below acquisition The co-occurrence rate of each non-place name participle and ground noun: the query word Query of the user's search for a period of time that statistics obtains in advance In occur occurring this in the Query number N1 and query word Query of a certain non-place name participle and any ground noun simultaneously it is a certain non- The Query number N2 of place name participle, then the co-occurrence rate of a certain non-place name participle and ground noun is N1/N2.For example, " dining room " this Segment in a period of time obtained in advance user search for record 2000 Query in occurred, and " dining room " with it is any Ground noun occurred in 400 Query jointly, then the co-occurrence rate of " dining room " and ground noun is 400/2000=0.2.It is obtaining Each non-place name participle will be higher than the non-ground status of preset threshold with after the co-occurrence rate of ground noun with the co-occurrence rate of ground noun Word is as the word for having Regional Property.
By the query word Query for executing the available user of aforesaid operations S2011-S2012 pretreatment unit 10 search In have Regional Property word, further, operations described below S2013 can also be performed in obtained band in pretreatment unit 10 Have and extracts core Regional Property word in the word of Regional Property.
S2013, meaning of a word analysis is carried out with Regional Property to obtained, and extracts core Regional Property word.
Pretreatment unit 10 can carry out meaning of a word analysis to the obtained word with Regional Property, according to each with ground Significance level of the meaning of a word of the word of Domain Properties in Query sets weight for each word with Regional Property, wherein the meaning of a word The weighted value of the more important word with Regional Property is higher, can finally extract weighted value higher than preset threshold with region The word of attribute is as core Regional Property word.For example, have in a certain Query " weather " and " " two with Regional Property Word, after analyzing setting weight by the meaning of a word, the weighted value of " weather " is higher than preset threshold, and " " weighted value be less than default threshold Therefore value extracts " weather " and is used as core Regional Property word.Part of speech analysis is carried out to the participle in Query, and is set according to the meaning of a word Determine weight and belong to the prior art, does not repeat excessively herein.
After executing operation S2013, pretreatment unit 10 can extract core region from the word with Regional Property and belong to Property word, further, pretreatment unit 10 can also be performed operations described below S2014 to obtained core Regional Property word into Row normalized obtains final core Regional Property word.
S2014, obtained core Regional Property word is normalized, obtains final core Regional Property word.
Pretreatment unit 10 can be normalized core Regional Property word obtained in step S2013, normalizing Same type of word will be belonged to and be normalized by changing processing i.e. finger, for example, " public transport ", " bus ", " bus " belong to " public transport " in core Regional Property word, " bus ", " bus " are therefore all normalized to by " public transport " this classification " public transport ", " dining room ", " restaurant ", " restaurant " belong to " dining room " this classification, therefore, by " the meal in core Regional Property word The Room ", " restaurant ", " restaurant " are all normalized to " dining room ".It is understood that the example above is merely for exemplary purpose, this hair Bright embodiment is without being limited thereto.Preparatory trained text classification can be passed through to the normalized of core Regional Property word Device is realized, that is, with preparatory trained text classification, it classifies to obtained core Regional Property word, and will be each A core Regional Property word is normalized to its generic, obtains final core Regional Property word, and this method belongs to existing skill Art does not repeat excessively herein.
After executing aforesaid operations S2011-S2014, pretreatment unit 10 can recognize that the query word Query of user's search In ground noun, and word (the final core Regional Property after or core Regional Property word, or normalization with Regional Property Word).
Training unit 20 is obtained for searching for record as sample training using the user of preparatory mark IP address ownership place The confidence level of the word with Regional Property.
In order to accurately obtain the ownership place of IP address excessively, it can be obtained by training unit 20 in Query and have region The confidence level of the word of attribute, the confidence level of a certain word with Regional Property are to characterize the word with Regional Property with differentiating IP The significance level of influence power when the ownership place of location.The confidence level of word with Regional Property can be by be labelled with IP address in advance The user of ownership place searches for record and obtains as training after sample, and training unit 20 can specifically execute operations described below to train and obtain The confidence level of a certain word with Regional Property: user's search of IP address ownership place is obtained with ground noun and is labelled in advance Record counts the record number of the word and each ground noun in the Query of those records while comprising this with Regional Property, point Be not denoted as T [place name 1], T [place name 2] ... T [place name n], at the same count in those records should with Regional Property word with When some place name Term co-occurrence, IP ownership place be the place name record number, be denoted as respectively R [place name 1], R [place name 2] ... R [ Name n], the confidence level of the word with Regional Property is denoted as P, then
Belong to it should be noted that if pretreatment unit 10 is further extracted core region from the word with Regional Property Property word, or the final core Regional Property word after further being normalized, then training unit 20 is in above-mentioned training process Middle training obtains the confidence level of the final core Regional Property word after being core Regional Property word or normalizing.
Judgement unit 30, for searching for the User ID in recording, the ground in the query word identified according to the user The confidence level of noun and word and the word with Regional Property with Regional Property, determines the ownership place of the IP address. Preferably, IP address can be calculated according to preset rule belong to the first of the corresponding each region of described ground noun Weighted value determines the ownership place of the IP address according to first weighted value.
In the search record of analysis user, identifies the ground noun in the query word Query of user's search and have region The word (final core Regional Property word after or core Regional Property word, or normalization) of attribute, obtained it is each with ground After the confidence level P of the word (or final core Regional Property word after core Regional Property word, or normalization) of Domain Properties, differentiate single Member 30 can be calculated according to preset rule IP address with belonging to each region in its corresponding Query noun first Weighted value, and using the highest region of the first weighted value as the ownership place of IP address, here is provided by the invention a kind of preferred Embodiment calculates the first weighted value that a certain IP address belongs to each region: choose in Query containing ground noun should The user of IP address searches for record, counts the User ID number that the user containing the IP address searches in record, is denoted as Cid, counts Comprising the region noun and each word (or core Regional Property word, or final after normalizing with Regional Property simultaneously Core Regional Property word) Query corresponding to User ID number, be denoted as C [place name, word 1], C [place name, word 2] ... C respectively The first weighted value that the IP address belongs to the region is denoted as Z [place name], then by [place name, word m] Wherein, word 1, word 2 ... word m refers to that (the final core region after or core Regional Property word, or normalization belongs to each word with Regional Property Property word) confidence level.The first weighted value that a certain IP address belongs to each region can be calculated by the above method, finally, Using the highest region of the first weighted value as the ownership place of IP address.It is understood that above-mentioned calculating IP address belongs to respectively The method of first weighted value of a region is only a kind of preferred embodiment provided by the invention, in practical applications can be according to need Different rules is set to calculate the weighted value that IP address belongs to each region, the present invention is without limitation.
It, can be by analyzing one section obtained in advance using above-mentioned pretreatment unit 10, training unit 20, judgement unit 30 User searches for the query word Query in record in time, and combines User ID, the accurate ownership place for obtaining IP address.It Afterwards, the record that IP address ownership place can further will be taken is used to train the band obtained in the above method as sample There is the confidence level P of the word (or final core Regional Property word after core Regional Property word, or normalization) of Regional Property.
Further, device provided by the present invention can also include following apparatus cartographic information judgement unit 40 to tie Close the ownership place that google maps obtain the IP address of user.
Cartographic information judgement unit 40, the user in a period of time obtained in advance for basis is in google maps The default urban information and User ID of setting calculate IP address according to preset rule and belong to the second of each region Weighted value.
In general, google maps when providing a user map search service, can set default city for user, with Just user can believe search correlation map when using google maps website is accessed directly in the default city of its setting Breath, and default city of the user set by google maps is often exactly its location, therefore, analysis is used in a period of time The ownership place of default urban information and the combination available IP address of User ID that family is arranged in google maps.
It can be in advance by default city set when user's access google maps website in a period of time, Yi Jiyong The information such as family ID and IP address save after forming record.Such as " 43179D117F6AC7BD4856744B31F4E0E8, 125.34.37.129, the note in default city set by user and User ID and IP address that Beijing " is saved by one Record, wherein " 43179D117F6AC7BD4856744B31F4E0E8 " be User ID, " 125.34.37.129 " for User IP Location, " Beijing " are default city set by user.
Later, cartographic information judgement unit 40 can draw according to the user in a period of time obtained in advance in map search The default urban information and User ID for holding up middle setting, obtain the ownership place of IP address, and cartographic information judgement unit 40 has Body can execute operations described below: calculating User IP according to the default urban information and User ID of user setting and belong to different cities The second weighted value Z [map, place name] in city, and by highest city the returning as IP address of the second weighted value Z [map, place name] Possession, wherein User IP belongs to the second weighted value Z [map, place name] in a certain city to write from memory in the record that obtains in advance Recognize the ratio of User ID number and total User ID number that city is the city.
Cartographic information judgement unit 40 can according to user using google maps when the default city that is arranged and use The information such as family ID obtain the ownership place of IP address.Later, judgement unit 30 further can search for record to according to user In the ownership place of User IP that obtains of query word Query and User ID, and be arranged in google maps according to user Default city and the ownership place of User IP that obtains of User ID integrated.
Judgement unit 30 can belong to the first weighted value and the second weighted value pair of each region according to IP address The ownership place of IP address is integrated, and can specifically be realized using following manner:
IP address is belonged to the first weighted value Z [place name] and the second weighted value Z [map, place name] phase of same region Multiply, obtains the synthetic weights weight values that IP address belongs to each region, and using the highest region of synthetic weights weight values as IP address Final ownership place.For example, a certain IP address belongs to " Nanjing " according to what the query word Query that User ID and user search for was obtained First weighted value in " Beijing " is respectively Z [Nanjing]=0.65, Z [Beijing]=0.25, which searches according to user in map The second weighted value for belonging to " Nanjing " and " Beijing " that index holds up the default city of middle setting and User ID obtains is respectively Z [map, Nanjing]=0.45, Z [map, Beijing]=0.3, then it is Z [Nanjing] that the IP address, which belongs to the synthetic weights weight values in " Nanjing ", Z [map, Nanjing]=0.2925, the synthetic weights weight values for belonging to " Nanjing " are Z [Beijing] Z [map, Beijing]=0.075, the IP Synthetic weights weight values of the address attribution in " Nanjing " are higher, determine that the final ownership place of the IP address is " Nanjing ".
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention Within mind and principle, any modification, equivalent substitution, improvement and etc. done be should be included within the scope of the present invention.

Claims (20)

1. a kind of method for obtaining internet protocol address ownership place based on search engine, which is characterized in that this method comprises:
S1, the user obtained in a period of time search for record, and it includes user identifier ID, query word and use that the user, which searches for record, Family IP address, and identify the ground noun that the user searches in the query word of record and the word with Regional Property;
S2, record is searched for using the user of preparatory mark IP address ownership place obtain described having Regional Property as sample training Word confidence level;
S3, the User ID in recording, the ground noun in the query word identified are searched for according to the user and with region The confidence level of the word of attribute and the word with Regional Property, determines the ownership place of the IP address;
The S2 is specifically included:
It utilizesObtain the confidence level P [M] of the word M with Regional Property, wherein T [place name i] is The record number of word M and ground noun i co-occurrence in the training sample with Regional Property, R [place name i] is in the training sample Word M with Regional Property and the IP address ownership place marked in advance when ground noun i co-occurrence are the corresponding region the ground noun i Number is recorded, n is the ground noun number in training sample with M co-occurrence.
2. the method according to claim 1, wherein identifying that the user searches for the query word of record in step S1 In ground noun and word with Regional Property specifically include:
S11, the query word searched in record to the user segment, and identify ground therein noun;
S12, the non-place name extracted in query word segment, and will be higher than the non-of preset threshold with co-occurrence rate of the ground noun in query word Place name participle is as the word for having Regional Property.
3. according to the method described in claim 2, it is characterized in that, after the step S12 further include:
S13, meaning of a word analysis is carried out to the word with Regional Property, extracts meaning of a word weighted value higher than preset threshold with ground The word of Domain Properties.
4. according to the method described in claim 3, it is characterized in that, after the step S13 further include:
The generic of the word of S14, basis with Regional Property, the word with Regional Property that the step S13 is extracted It is normalized.
5. according to claim 1 to method described in 4 any claims, which is characterized in that determine the IP address described in step S3 Ownership place are as follows:
The first weighted value that IP address belongs to the corresponding each region of described ground noun is calculated according to preset rule, The ownership place of the IP address is determined according to first weighted value.
6. according to the method described in claim 5, it is characterized in that, calculating User IP according to preset rule described When location belongs to the first weighted value of described ground noun corresponding each region, specifically include:
According to formulaObtain the first weighted value that IP address belongs to region L Z [L], wherein Cid is that the user of the IP address containing ground noun searches for the User ID number for including in record, and C [L, word i] is The user of the IP address containing ground noun searches for the region corresponding ground L noun and the word i for having Regional Property in record Co-occurrence record corresponding to User ID number, P [word i] is the confidence level of the word i with Regional Property, and m is described containing ground The user of the IP address of noun searches for the number of the word in record with Regional Property.
7. according to the method described in claim 5, it is characterized in that, described determine returning for the IP address according to first weighted value Possession are as follows:
IP address is belonged in the first weighted value of the corresponding each region of described ground noun, the first weighted value is highestly Ownership place of the domain as the IP address.
8. according to the method described in claim 5, it is characterized in that, this method further include:
The default urban information and use that the user in a period of time that S4, basis obtain in advance is arranged in google maps Family ID calculates the second weighted value that IP address belongs to each region according to preset rule;
The ownership place that the IP address is determined according to first weighted value specifically:
The first weighted value and the second weighted value that IP address belongs to each region are integrated, the final ownership place of IP address is obtained.
9. according to the method described in claim 8, it is characterized in that, second power for calculating IP address and belonging to each region Weight values specifically include:
The default city that the user obtained in advance is arranged in google maps is belonged to the User ID number of a certain region The second weighted value of a certain region is belonged to as IP address with the ratio of total User ID number.
10. according to the method described in claim 8, it is characterized in that, first for integrating IP address and belonging to each region Weighted value and the second weighted value, the final ownership place for obtaining IP address specifically include:
IP address is belonged into the first weighted value of each region and the second weighted value is multiplied, IP address is obtained and belongs to eachly The synthetic weights weight values in domain, and using the highest region of synthetic weights weight values as the ownership place of IP address.
11. a kind of device for obtaining IP address ownership place based on search engine, which is characterized in that the device includes:
Pretreatment unit searches for record for obtaining the user in a period of time, and the user searches for record and includes User ID, looks into Word and IP address are ask, and identifies ground noun that the user searches in the query word of record and with Regional Property Word;
Training unit obtains the band for searching for record as sample training using the user of preparatory mark IP address ownership place There is the confidence level of the word of Regional Property;
Judgement unit, for according to the user search for the User ID in record, the ground noun in the query word that is identified with And the confidence level of the word and the word with Regional Property with Regional Property, determine the ownership place of the IP address;
Wherein, training unit is specifically used for: according toObtain the confidence level of the word M with Regional Property P [M], wherein T [place name i] is the record number of the word M and ground noun i co-occurrence in the training sample with Regional Property, R [ Name i] be word M in the training sample with Regional Property with noun i co-occurrence when the IP address ownership place that marks in advance be The record number of the corresponding region ground noun i, n are the ground noun number in training sample with M co-occurrence.
12. device according to claim 11, which is characterized in that the pretreatment unit is identifying user's search note It is specific to execute when ground noun in the query word of record and the word with Regional Property:
S21, the query word searched in record to the user segment, and identify ground therein noun;
S22, the non-place name extracted in query word segment, and will be higher than the non-of preset threshold with co-occurrence rate of the ground noun in query word Place name participle is as the word for having Regional Property.
13. device according to claim 12, which is characterized in that the pretreatment unit also executes after executing S22:
S23, meaning of a word analysis is carried out to the word with Regional Property, extracts meaning of a word weighted value higher than preset threshold with ground The word of Domain Properties.
14. device according to claim 13, which is characterized in that the pretreatment unit also executes after executing S23:
The generic of the word of S24, basis with Regional Property, the word with Regional Property that the step S23 is extracted It is normalized.
15. device described in 1 to 14 any claim according to claim 1, which is characterized in that the judgement unit is determining the IP It is specific to execute when the ownership place of address:
The first weighted value that IP address belongs to the corresponding each region of described ground noun is calculated according to preset rule, The ownership place of the IP address is determined according to first weighted value.
16. device according to claim 15, which is characterized in that the judgement unit is calculated according to preset rule It is specific to execute when IP address belongs to the first weighted value of described ground noun corresponding each region:
According to formulaObtain the first weighted value that IP address belongs to region L Z [L], wherein Cid is that the user of the IP address containing ground noun searches for the User ID number for including in record, and C [L, word i] is The user of the IP address containing ground noun searches for the region corresponding ground L noun and the word i for having Regional Property in record Co-occurrence record corresponding to User ID number, P [word i] is the confidence level of the word i with Regional Property, and m is described containing ground The user of the IP address of noun searches for the number of the word in record with Regional Property.
17. device according to claim 15, which is characterized in that the judgement unit is determined according to first weighted value should It is specific to execute when the ownership place of IP address:
IP address is belonged in the first weighted value of the corresponding each region of described ground noun, the first weighted value is highestly Ownership place of the domain as the IP address.
18. device according to claim 15, which is characterized in that the device further include:
Cartographic information judgement unit, for what is be arranged in google maps according to the user in a period of time obtained in advance Default urban information and User ID, calculates the second weight that IP address belongs to each region according to preset rule Value;
It is specific to execute when the judgement unit determines the ownership place of the IP address according to first weighted value:
The first weighted value and the second weighted value that IP address belongs to each region are integrated, the final ownership place of IP address is obtained.
19. device according to claim 18, which is characterized in that the cartographic information judgement unit calculates IP address ownership It is specific to execute when the second weighted value of each region:
The default city that the user obtained in advance is arranged in google maps is belonged to the User ID number of a certain region The second weighted value of a certain region is belonged to as IP address with the ratio of total User ID number.
20. device according to claim 18, which is characterized in that the judgement unit is integrated IP address and belonged to eachly First weighted value and the second weighted value in domain, specific to execute when obtaining the final ownership place of IP address:
IP address is belonged into the first weighted value of each region and the second weighted value is multiplied, IP address is obtained and belongs to eachly The synthetic weights weight values in domain, and using the highest region of synthetic weights weight values as the ownership place of IP address.
CN201310091285.XA 2013-03-21 2013-03-21 A kind of method and apparatus that IP address ownership place is obtained based on search engine Active CN103207901B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310091285.XA CN103207901B (en) 2013-03-21 2013-03-21 A kind of method and apparatus that IP address ownership place is obtained based on search engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310091285.XA CN103207901B (en) 2013-03-21 2013-03-21 A kind of method and apparatus that IP address ownership place is obtained based on search engine

Publications (2)

Publication Number Publication Date
CN103207901A CN103207901A (en) 2013-07-17
CN103207901B true CN103207901B (en) 2019-03-08

Family

ID=48755123

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310091285.XA Active CN103207901B (en) 2013-03-21 2013-03-21 A kind of method and apparatus that IP address ownership place is obtained based on search engine

Country Status (1)

Country Link
CN (1) CN103207901B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104780235B (en) * 2014-01-14 2019-08-06 腾讯科技(深圳)有限公司 IP attribution inquiry method, device and server
CN104780234B (en) * 2014-01-14 2019-09-17 腾讯科技(深圳)有限公司 IP attribution inquiry method, apparatus and system
CN104168163A (en) * 2014-08-27 2014-11-26 福建富士通信息软件有限公司 Intelligent network line quality detection and data analysis method
CN105335480A (en) * 2015-10-13 2016-02-17 国家电网公司 Internet website liability subject identifying method
CN106096040B (en) * 2016-06-29 2019-06-04 中国人民解放军国防科学技术大学 Organization web ownership place method of discrimination and its device based on search engine
CN106357835B (en) * 2016-09-05 2020-03-06 百度在线网络技术(北京)有限公司 Method and equipment for determining region of target IP address
CN111327721B (en) * 2020-02-28 2023-01-10 加和(北京)信息科技有限公司 IP address positioning method and device, storage medium and electronic device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102012900A (en) * 2009-09-04 2011-04-13 阿里巴巴集团控股有限公司 An information retrieval method and system
CN102033947A (en) * 2010-12-22 2011-04-27 百度在线网络技术(北京)有限公司 Region recognizing device and method based on retrieval word
CN102880721A (en) * 2012-10-15 2013-01-16 瑞庭网络技术(上海)有限公司 Implementation method of vertical search engine
CN102932492A (en) * 2011-09-12 2013-02-13 微软公司 Correlation of users to ip address lease events

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102012900A (en) * 2009-09-04 2011-04-13 阿里巴巴集团控股有限公司 An information retrieval method and system
CN102033947A (en) * 2010-12-22 2011-04-27 百度在线网络技术(北京)有限公司 Region recognizing device and method based on retrieval word
CN102932492A (en) * 2011-09-12 2013-02-13 微软公司 Correlation of users to ip address lease events
CN102880721A (en) * 2012-10-15 2013-01-16 瑞庭网络技术(上海)有限公司 Implementation method of vertical search engine

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
利用"百度"搜索网络信息资源;黄西安;《科技情报开发与经济》;20051231;第15卷(第4期);第257-259页
基于地理信息的用户行为理解;谢幸等;《https://wenku.baidu.com/view/927fed7202768e73876.html》;20110221;正文第6页

Also Published As

Publication number Publication date
CN103207901A (en) 2013-07-17

Similar Documents

Publication Publication Date Title
CN103207901B (en) A kind of method and apparatus that IP address ownership place is obtained based on search engine
CN110472066B (en) Construction method of urban geographic semantic knowledge map
CN101320375B (en) Digital book search method based on user click action
CN107577688B (en) Original article influence analysis system based on media information acquisition
CN103955505B (en) A kind of event method of real-time and system based on microblogging
CN103678576B (en) The text retrieval system analyzed based on dynamic semantics
CN105244031A (en) Speaker identification method and device
CN110162695A (en) A kind of method and apparatus of information push
CN106933947B (en) A kind of searching method and device, electronic equipment
CN105843850B (en) Search optimization method and device
CN109582969A (en) Methodology for Entities Matching, device and electronic equipment
US20140025701A1 (en) Query expansion
CN105653706A (en) Multilayer quotation recommendation method based on literature content mapping knowledge domain
CN105718585B (en) Document and label word justice correlating method and its device
CN106204156A (en) A kind of advertisement placement method for network forum and device
CN103226578A (en) Method for identifying websites and finely classifying web pages in medical field
CN109558587B (en) Method for classifying public opinion tendency recognition aiming at category distribution imbalance
CN102541936A (en) Method and device for acquiring popularity of POI (Point of Interest)
CN109636495A (en) A kind of online recommended method of scientific and technological information based on big data
CN110737821B (en) Similar event query method, device, storage medium and terminal equipment
CN108241690A (en) A kind of data processing method and device, a kind of device for data processing
CN110705292B (en) Entity name extraction method based on knowledge base and deep learning
CN102646124A (en) Method for automatically identifying address information
WO2010096986A1 (en) Mobile search method and device
CN110012122A (en) A kind of domain name similarity analysis method of word-based embedded technology

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant