CN101350154A - Method and apparatus for ordering electronic map data - Google Patents

Method and apparatus for ordering electronic map data Download PDF

Info

Publication number
CN101350154A
CN101350154A CNA2008102224228A CN200810222422A CN101350154A CN 101350154 A CN101350154 A CN 101350154A CN A2008102224228 A CNA2008102224228 A CN A2008102224228A CN 200810222422 A CN200810222422 A CN 200810222422A CN 101350154 A CN101350154 A CN 101350154A
Authority
CN
China
Prior art keywords
map data
electronic map
keyword
web page
importance degree
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2008102224228A
Other languages
Chinese (zh)
Other versions
CN101350154B (en
Inventor
董正斌
佟子健
王云峰
王登
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd filed Critical Beijing Sogou Technology Development Co Ltd
Priority to CN 200810222422 priority Critical patent/CN101350154B/en
Publication of CN101350154A publication Critical patent/CN101350154A/en
Application granted granted Critical
Publication of CN101350154B publication Critical patent/CN101350154B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a priority method of electronic map data and a device, which solves the problem that a traditional artificial priority method causes a poor priority effect, wastes manpower and has high cost. The method comprises that a key word of each electronic map data is extracted, web sets of search results, which correspond to each electronic map data are obtained through that the key word is used to search. According the corresponding wet sets of search results of each electronic map data, the importance of the electronic map data is calculated, and the electronic map data is sorted according to the importance. The invention uses the internet popularity of internet to depict the importance degree of POI data, because the depiction represents the recognition of vast netizen even broad masses, the priority effect is better and has excellent mass basis and rationality. Further, a machine is used to automatically score and sort, thereby the man labor is effectively saved, the efficiency is higher, and the cost is very low.

Description

A kind of sort method of electronic map data and device
Technical field
The present invention relates to networking technology area, particularly relate to a kind of sort method and device of electronic map data.
Background technology
Along with DEVELOPMENT OF GEOGRAPHICAL INFORMATION SYSTEM and perfect, the technology of designing and developing of electronic chart also reaches its maturity.In the electronic chart, there are class data to be called interest point data (being Point of Interest, the POI data), are meant the interested data of people, as the geography information of buildingss such as restaurant, park, market, or the information in some streets or the like.Usually, the POI data comprise the information of title, classification, longitude, four aspects of latitude, also comprise some other information sometimes, as the address, and phone, postcode or the like.The POI data are one of most important elements of electronic chart, also are the information that people pay close attention to when using electronic chart the most.
An electronic chart comprises a lot of POI data usually, and these POI data have contained the most geographical information in this body of a map or chart.But, the significance level of geography information is different in this electronic chart, more important than " square, Zhong Guan-cun " as " Tian'anmen Square ", " Peking University " is more important than " affiliated middle school of Peking University ", and the difference of this geography information importance causes the importance of POI data there are differences.
The POI ordering is meant the ordering of the POI data being carried out according to the difference of POI data importance, and the importance of POI data is embodied in the importance of its geography information that refers to.POI ordering can be applicable in the ordering of search engine, promptly according to the importance of POI data to the displaying of sorting of the Query Result of electronic chart.
At present, also ripe without comparison POI sort method.Traditionally, the developer of electronic chart can ask some editors or general public, according to people the familiarity of POI data is come the POI data are sorted, this core concept that sorts according to familiarity is: if the geographic position that POI data are referred to is extremely important, then it necessarily is familiar with by people.This thought has certain rationality, because the user of the geography information of electronic chart and even reality is a general public, therefore the geography information of being familiar with by general public should have higher importance.
But there are the following problems for this method:
The first, though can portray the significance level of POI data with familiarity, how calculating familiarity is a very problem of difficulty.Therefore, the method for above-mentioned artificial ordering can't be represented users owing to have only minimum some people to participate in, so the ordering effect does not ensure the ordering weak effect; And, since fewer in number, so error rate is also than higher.
The second, because the POI data volume is very big, and renewal is very fast, thus adopt the very labor intensive that manually sorts, and also cost is very expensive.
Therefore, this artificial sort method can't obtain actual use.
Summary of the invention
Technical matters to be solved by this invention provides a kind of sort method and device of electronic map data, causes ordering weak effect, labor intensive, problem that cost is too high to solve traditional artificial sort method.
For solving the problems of the technologies described above,, the invention discloses following technical scheme according to specific embodiment provided by the invention:
A kind of sort method of electronic map data comprises:
Extract the keyword of each electronic map data;
Utilize described keyword to search for, obtain the search result web page set of corresponding each electronic map data;
According to the corresponding search result web page set of each electronic map data, calculate the importance degree of this electronic map data;
According to described importance degree described electronic map data is sorted.
Wherein, described corresponding search result web page set according to each electronic map data, calculate the importance degree of this electronic map data, specifically comprise:, calculate the second value that is used to represent first numerical value of webpage significance level and is used to represent webpage and keyword matching degree respectively at each search result web page in the set; According to first numerical value and the second value of all search result web page in the corresponding set, calculate the importance degree of this electronic map data.
Wherein, described first numerical value and second value according to all search result web page in the corresponding set, calculate the importance degree of this electronic map data, specifically comprise: first numerical value and the second value of each search result web page multiplies each other in will gathering, and then the multiplied result of all search result web page is sued for peace in will gathering, and obtains the importance degree of this electronic map data.
Preferably, described first numerical value obtains by calculating the webpage rank.
Preferably, after the importance degree of described this electronic map data of calculating, also comprise: according to the different weights that classification had under the electronic map data, the importance degree of this electronic map data be multiply by the weighted value of classification under this electronic map data, obtain adjusted result data, be used for ordering.
Wherein, the described keyword that extracts each electronic map data specifically comprises: the name that extracts each electronic map data is referred to as keyword.
Preferably, also comprise: extract the address information of each electronic map data, with title together as keyword.
Preferably, before the described keyword that extracts each electronic map data, also comprise: original electronic map data is carried out pre-service, and described pre-service comprises removes irrelevant symbol, character code conversion, adjusts consolidation form; The pre-service result is used for the extraction of keyword;
Preferably, after according to described importance degree described electronic map data being sorted, also comprise: in the electronic chart retrieval, the query word of importing according to the user returns the result for retrieval that is complementary, and the forward electronic map data of ordering in the result for retrieval is preferentially shown.
Preferably, after according to described importance degree described electronic map data being sorted, also comprise: when the figure layer shows, choose the forward electronic map data of indication range internal sort and show.
Preferably, after according to described importance degree described electronic map data being sorted, also comprise: forward electronic map data preferentially upgrades to sorting.
The present invention also provides a kind of collator of electronic map data, comprising:
Keyword extracting unit is used to extract the keyword of each electronic map data;
Query unit is used to utilize described keyword to search for, and obtains the search result web page set of corresponding each electronic map data;
Computing unit is used for the corresponding search result web page set according to each electronic map data, calculates the importance degree of this electronic map data;
Sequencing unit is used for according to described importance degree described electronic map data being sorted.
Wherein, described computing unit specifically comprises: first computation subunit is used for calculating first numerical value that is used to represent the webpage significance level respectively at each search result web page of set; Second computation subunit is used for calculating the second value that is used to represent webpage and keyword matching degree respectively at each search result web page of set; The COMPREHENSIVE CALCULATING subelement is used for first numerical value and second value according to all search result web page of each electronic map data corresponding set, calculates the importance degree of this electronic map data.
Wherein, first numerical value and the second value of each search result web page multiplied each other during described COMPREHENSIVE CALCULATING subelement will be gathered, and then the multiplied result of all search result web page is sued for peace in will gathering, and obtains the importance degree of this electronic map data.
Preferably, described first computation subunit obtains first numerical value by calculating the webpage rank.
Preferably, described device also comprises: adjustment unit, be used for different weights according to classification had under the electronic map data, the importance degree of this electronic map data be multiply by the weighted value of classification under this electronic map data, obtain adjusted result data, and output to sequencing unit be used for the ordering.
Wherein, described keyword extracting unit is referred to as keyword with the name of the electronic map data that extracts.
Preferably, described keyword extracting unit is also with the address information of the electronic map data that extracts, with title together as keyword.
Preferably, described device also comprises: pretreatment unit is used for original electronic map data is carried out pre-service, and the pre-service result is outputed to keyword extracting unit; Wherein, described pre-service comprises the irrelevant symbol of removal, character code conversion, adjusts consolidation form.
Preferably, described device also comprises: retrieval unit, be used in the electronic chart retrieval, and the query word of importing according to the user returns the result for retrieval that is complementary, and the forward electronic map data of ordering in the result for retrieval is preferentially shown.
Preferably, described device also comprises: figure layer display unit is used for choosing the forward electronic map data of indication range internal sort and showing when the figure layer shows.
Preferably, described device also comprises: data updating unit is used for the forward electronic map data that sorts is preferentially upgraded.
The present invention also provides a kind of search engine system, and described system comprises the described device of above-mentioned arbitrary device embodiment.
According to specific embodiment provided by the invention, the present invention has following technique effect:
At first, the present invention utilizes Internet technology that the POI data are sorted, the network popularity of internet usage is portrayed the significance level of POI data, and the network popularity is to calculate according to the results web page that keyword (being to go out from the POI extracting data) returns search engine.Because numerous netizens and even broad masses' understanding has been represented in this portrayal, therefore utilize the network popularity to come the POI data are sorted, the effect of ordering is relatively good, has good mass foundation and rationality.And, use machine automatically the POI data to be given a mark and sorted, greatly saved manpower, efficient is higher, and cost is very cheap.
Secondly, when utilizing the significance level of network popularity portrayal POI data, the present invention has mainly used these two indexs of matching degree of significance level, webpage and the keyword of webpage, and each index also has different computing method.
Once more, the present invention has also taken into full account the influence of the classification of POI data to the POI significance level, the classification information of utilizing the POI data comes thereby basic network popularity score is adjusted the final score that obtains POI, thereby has portrayed the significance level of POI data more exactly.
Description of drawings
Fig. 1 is the sort method process flow diagram of the embodiment of the invention one described a kind of electronic map data;
Fig. 2 is the sort method schematic flow sheet of the embodiment of the invention two described a kind of POI data;
Fig. 3 is the collator structural drawing of the described a kind of electronic map data of the embodiment of the invention.
Embodiment
For above-mentioned purpose of the present invention, feature and advantage can be become apparent more, the present invention is further detailed explanation below in conjunction with the drawings and specific embodiments.
Embodiment one:
At the artificial sort method of traditional POI, the embodiment of the invention provides a kind of sort method that utilizes Internet technology to carry out.With reference to Fig. 1, be the sort method process flow diagram of the embodiment of the invention one described a kind of electronic map data.In the present embodiment, described electronic map data describes with the POI data instance, but described electronic map data includes but not limited to the POI data.
S101 extracts the keyword of each POI data;
Present embodiment need go out a keyword from each POI extracting data, is used for inquiring about in the search engine of internet.Because each POI data has some attributes, comprises title, classification, coordinate or other attribute information, can from these attribute informations, extract when therefore extracting and to represent the speech of these POI data as keyword.In the present embodiment, the essential part of keyword is the title of POI, because title is POI data most important parts.
Preferably, when extracting the title of POI data, need carry out some to title and handle, as information such as the branch in the removal title, branch officies.Because often there are the situation of branch, branch office in the title as food and drink, company, the inside, and the purpose of POI ordering is for home office, main office being come forward position, so at this moment just can remove the character of this branch, branch office.As " branch, xx company five road junction ", just can remove only surplus " xx company " to " branch, five road junctions ".
Preferably, also can add some other information as the replenishing of title, as address, district etc.Because some title is too short, do not have practical significance, as speech such as public lavatory, parking lots, at this time just can add the address of POI come in and title together as keyword, the better effects if of Chu Liing like this.
S102 utilizes described keyword to search for, and obtains the search result web page set of corresponding each POI data;
The results set that returns is inquired about and obtained to the keyword that said extracted goes out in search engine.
S103 according to the corresponding search result web page set of each POI data, calculates the importance degree of these POI data;
The present invention utilizes the network popularity of internet to portray the significance level of POI data, and the network popularity of POI is according to search result web page set that should POI is calculated.Wherein, described network popularity is meant the well-known degree of a title in network.
At each POI data, utilize the keyword that extracts to inquire about and can access a plurality of search result web page (being collections of web pages), and each webpage has two indexs: one is the significance level of webpage, and another is the matching degree of webpage and keyword.Present embodiment mainly utilizes described two indexs to weigh the network popularity of POI data.
Because every kind of index all has different computing method, present embodiment only adopts wherein a kind of method relatively more commonly used.For the significance level of webpage, adopt the method for calculating webpage rank (PageRank).The PageRank of webpage is a kind of index of tolerance webpage significance level, is to calculate according to the hyperlink between the webpage, stems from the PageRank algorithm that Google founder proposes.Certainly, the significance level that also can represent webpage with the flow of webpage.For the matching degree (MatchRank) of webpage and keyword, usually the computing method that adopt are: if keyword complete appearance in webpage, then matching degree is higher, if keyword occurs after by cutting, then matching degree is lower.The present invention is including but not limited to above computing method.
After obtaining the PageRank and MatchRank of each webpage, the PageRank and the MatchRank of each webpage multiplied each other, and then, promptly obtain a POI data computing result the multiplied result addition of all webpages of the same POI data of correspondence.In the present embodiment, adopting the mode to the marking of POI data, is a score value that the network popularity of these POI data is portrayed so described result of calculation obtains.
Need to prove that above-mentioned PageRank and MatchRank according to webpage adopts the calculating of the addition again of multiplying each other to obtain the method for a POI score value,, the present invention includes but be not limited to described method only as a kind of implementation of present embodiment.
S104 sorts to described POI data according to described importance degree.
After obtaining the score of each POI data, utilize described score promptly can all POI data to be sorted.
By above-mentioned treatment scheme as can be known, the network popularity of internet usage of the present invention is portrayed the significance level of POI data, because numerous netizens and even broad masses' understanding has been represented in this portrayal, therefore utilize the network popularity to come the POI data are sorted, the effect of ordering is relatively good, has good mass foundation and rationality.And, use machine automatically the POI data to be given a mark and sorted, greatly saved manpower, efficient is higher, and cost is very cheap.
Embodiment two:
The embodiment of the invention two provides a kind of concrete application example.
With reference to Fig. 2, be the sort method schematic flow sheet of the embodiment of the invention two described a kind of POI data.
S201 carries out pre-service to original POI data;
Original POI data are carried out cleaning and filtering, and major function is the input standard that makes data fit certain.Described pre-service mainly comprises removes irrelevant symbol, character code conversion, three parts of adjustment consolidation form.Wherein,
1) remove irrelevant symbol: because may there be some irrelevant symbols in the source or the other problems of data in the data, these symbols do not have practical significance, as! , symbol such as #, also have mess code etc., these irrelevant symbols need be removed, play a cleaning and filtering effect;
2) character code conversion: make the coding unanimity of character, the justice that can help giving a mark later.Change full-shape as half-angle, the traditional font commentaries on classics is simplified etc.;
3) adjust form: the input format of data should be unified, and is beneficial to programming like this.
S202 at pretreated POI data, extracts the keyword of each POI data;
In the leaching process, can identify information such as the branch that comprises in the title, branch office, remove these information then according to the bank of geographical names and another name storehouse.For example " branch, xx company five road junction ", if " five road junctions " is a speech in the bank of geographical names, " branch " is the speech in the peculiar dictionary, so just can remove only surplus " xx company " to " branch, five road junctions ".
S203 utilizes described keyword to search for, and obtains the search result web page set of corresponding each POI data;
S204 at each POI data, calculates the basic score value that is used to represent this POI data significance level according to corresponding search result web page set;
In the present embodiment, the score value that calculates according to the PageRank and the MatchRank of webpage is as the basic score value of POI data, and this basic score value is the portrayal to the network popularity of these POI data.
S205 adjusts described basic score value according to the classification information of POI data;
Because the POI data have a lot of classifications, and different classes of data have different character on network.For example, the POI data of food and drink class more receive publicity on network than the POI data of government bodies class, but the POI data of government bodies' class are more even more important than the POI data of food and drink class, because people more pay close attention to the POI data of government bodies' class in real life.Therefore, for the score of the different classes of POI data of balance, present embodiment has been introduced the classification weight, need adjust the basic score of POI according to the weight of classification, makes the important POI score of classification improve, and the unessential POI score of classification reduces.The weight of classification can rule of thumb be set, and also can use some training datas to train acquisition.Adjustment process is: multiply by the weight size of classification under it with the basic score of POI data, so just obtain final score.
For example, two POI data are arranged, one is The Third Affiliated Hospital of Peking University, and one is the Guo Lin home cooking.Because the title of food and drink class occurs in webpage often, so the basic of Guo Lin home cooking must be divided into 5 fens, and The Third Affiliated Hospital of Peking University must be divided into 4 fens.But according to people's experience and custom, hospital can be more important than food and drink class, so the classification weight of hospital's class is bigger, be made as 1.5, and the weight of food and drink is lower, is made as 0.8.The score of final like this two POI is respectively: the 4 * 1.5=6 of The Third Affiliated Hospital of Peking University, Guo Lin home cooking 5 * 0.8=4.Thereby The Third Affiliated Hospital of Peking University is than the score height of Guo Lin home cooking, and it is forward to sort, and this has just met people's general understanding.
S206 sorts to described POI data according to described adjusted final score value.
Comparative example one and embodiment two, embodiment two have increased the adjustment process of preprocessing process and basic score value.Embodiment two has also taken into full account the influence of the classification of POI data to the POI significance level, the classification information of utilizing the POI data comes thereby basic network popularity score is adjusted the final score that obtains POI, thereby has portrayed the significance level of POI data more exactly.
The ordering of electronic chart POI data has a lot of practical values, for example:
1) query and search aspect: the user imports a query word when electronic map query, can return a lot of result for retrieval, and these result for retrieval all mate with this query word, but often also has the branch of significance level among these results.After if POI sorted, just can in coupling, be presented at the front to important POI, unessential putting behind, more convenient like this user's use.For example, inquiry " Quanjude ", a lot of branch and some subsidiary corporatioies or the training organization that Quanjude can occur, they all mate with this query word, but can not be presented at the front to some subsidiary corporatioies and training organization, because generally these are not too important, and should come the front to important home office or branch.For another example: inquiry Peking University, Peking University and its cum rights can appear, and Peking University should make number one, but should there be the branch of the front and back of an ordering in its numerous cum rights.
2) a figure layer demonstration aspect: electronic chart generally is made up of multi-layer image very, when the user when checking certain figure layer, POI that should figure layer should be shown the confession user and check.But the user in certain figure layer focus around perhaps a lot of POI is arranged, if these POI are all shown, then full page can be very mixed and disorderly and too fat to move, this just is unfavorable for that the user checks.Therefore, need choose a part of POI according to significance level and show, so not only the user can view the information that oneself needs, and whole display effect is relatively good.
3) Data Update aspect: because the POI renewal speed is very fast, and renewal amount is bigger, if can only upgrade earlier at important data under the energy condition of limited.
At said method embodiment, the present invention also provides a kind of collator embodiment of electronic map data.With reference to Fig. 3, be the collator structural drawing of the described a kind of electronic map data of the embodiment of the invention.Described device mainly comprises:
Keyword extracting unit U32 is used to extract the keyword of each electronic map data;
Query unit U33 is used to utilize described keyword to search for, and obtains the search result web page set of corresponding each electronic map data;
Computing unit U34 is used for the corresponding search result web page set according to each electronic map data, calculates the importance degree of this electronic map data;
Sequencing unit U36 is used for according to described importance degree described electronic map data being sorted.
Wherein, described computing unit U34 specifically comprises:
First computation subunit is used for calculating first numerical value that is used to represent the webpage significance level respectively at each search result web page of set; The significance level of webpage can be represented by webpage rank (PageRank), so described first numerical value promptly refers to calculate the PageRank of gained; Certainly, also can represent with the flow of webpage;
Second computation subunit is used for calculating the second value that is used to represent webpage and query word matching degree respectively at each search result web page of set; The matching degree of webpage and query word (MatchRank) can be calculated by several different methods;
The COMPREHENSIVE CALCULATING subelement is used at each electronic map data, according to first numerical value and the second value of all search result web page in the corresponding set, calculates the result data that is used to represent this electronic map data significance level.A kind of account form is: first numerical value and the second value of each search result web page multiplied each other during described COMPREHENSIVE CALCULATING subelement will be gathered, and then the multiplied result of all search result web page is sued for peace in will gathering, and obtains the significance level value of this electronic map data.
Wherein, described keyword extracting unit U32 is referred to as keyword with the name of the electronic map data that extracts; Perhaps, with the address information of the electronic map data that extracts, with title together as keyword.Preferably, when extracting title, remove the information that comprises branch, branch office.
Preferably, in another device embodiment of the present invention, described device also comprises adjustment unit U35, be used for different weights according to classification had under the electronic map data, the importance degree of this electronic map data be multiply by the weighted value of classification under this electronic map data, obtain adjusted result data, and output to sequencing unit U36 be used for the ordering.
Preferably, in another device embodiment of the present invention, described device also comprises pretreatment unit U31, is used for original electronic map data is carried out pre-service, and the pre-service result is outputed to keyword extracting unit U32; Wherein, described pre-service comprises the irrelevant symbol of removal, carries out the character code conversion, adjusts consolidation form.
Preferably, in another device embodiment of the present invention, described device also comprises retrieval unit U37, is used for retrieving at electronic chart, query word according to user's input returns the result for retrieval that is complementary, and the forward electronic map data of ordering in the result for retrieval is preferentially shown.
Preferably, in another device embodiment of the present invention, described device also comprises figure layer display unit U38, is used for choosing the forward electronic map data of indication range internal sort and showing when the figure layer shows.
Preferably, in another device embodiment of the present invention, described device also comprises data updating unit U39, is used for the forward electronic map data that sorts is preferentially upgraded.
The part that does not describe in detail in the device shown in Figure 3 can be considered for length referring to the relevant portion of Fig. 1, method shown in Figure 2, is not described in detail in this.
In addition, the present invention also provides a kind of search engine system, and described system comprises the described device of above-mentioned arbitrary device embodiment.Described search engine system can provide the result for retrieval of high-quality more in the search application facet of electronic map data.
More than to the sort method and the device of a kind of electronic map data provided by the present invention, be described in detail, used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, part in specific embodiments and applications all can change.In sum, this description should not be construed as limitation of the present invention.

Claims (23)

1, a kind of sort method of electronic map data is characterized in that, comprising:
Extract the keyword of each electronic map data;
Utilize described keyword to search for, obtain the search result web page set of corresponding each electronic map data;
According to the corresponding search result web page set of each electronic map data, calculate the importance degree of this electronic map data;
According to described importance degree described electronic map data is sorted.
2, method according to claim 1 is characterized in that, described corresponding search result web page according to each electronic map data is gathered, and calculates the importance degree of this electronic map data, specifically comprises:
At each search result web page in the set, calculate the second value that is used to represent first numerical value of webpage significance level and is used to represent webpage and keyword matching degree respectively;
According to first numerical value and the second value of all search result web page in the corresponding set, calculate the importance degree of this electronic map data.
3, method according to claim 2 is characterized in that, described first numerical value and second value according to all search result web page in the corresponding set calculate the importance degree of this electronic map data, specifically comprise:
First numerical value and the second value of each search result web page in the set are multiplied each other, and then the multiplied result of all search result web page is sued for peace in will gathering, and obtains the importance degree of this electronic map data.
4, according to claim 2 or 3 described methods, it is characterized in that: described first numerical value obtains by calculating the webpage rank.
5, according to claim 2 or 3 described methods, it is characterized in that, after the importance degree of described this electronic map data of calculating, also comprise:
According to the different weights that classification had under the electronic map data, the importance degree of this electronic map data be multiply by the weighted value of classification under this electronic map data, obtain adjusted result data, be used for ordering.
6, method according to claim 1 is characterized in that, the described keyword that extracts each electronic map data specifically comprises:
The name that extracts each electronic map data is referred to as keyword.
7, method according to claim 6 is characterized in that, also comprises:
Extract the address information of each electronic map data, with title together as keyword.
8, method according to claim 1 is characterized in that, before the described keyword that extracts each electronic map data, also comprises:
Original electronic map data is carried out pre-service, and described pre-service comprises removes irrelevant symbol, character code conversion, adjustment consolidation form;
The pre-service result is used for the extraction of keyword.
9, method according to claim 1 is characterized in that, after according to described importance degree described electronic map data being sorted, also comprises:
In the electronic chart retrieval, the query word of importing according to the user returns the result for retrieval that is complementary, and the forward electronic map data of ordering in the result for retrieval is preferentially shown.
10, method according to claim 1 is characterized in that, after according to described importance degree described electronic map data being sorted, also comprises:
When the figure layer shows, choose the forward electronic map data of indication range internal sort and show.
11, method according to claim 1 is characterized in that, after according to described importance degree described electronic map data being sorted, also comprises:
Forward electronic map data preferentially upgrades to sorting.
12, a kind of collator of electronic map data is characterized in that, comprising:
Keyword extracting unit is used to extract the keyword of each electronic map data;
Query unit is used to utilize described keyword to search for, and obtains the search result web page set of corresponding each electronic map data;
Computing unit is used for the corresponding search result web page set according to each electronic map data, calculates the importance degree of this electronic map data;
Sequencing unit is used for according to described importance degree described electronic map data being sorted.
13, device according to claim 12 is characterized in that, described computing unit specifically comprises:
First computation subunit is used for calculating first numerical value that is used to represent the webpage significance level respectively at each search result web page of set;
Second computation subunit is used for calculating the second value that is used to represent webpage and keyword matching degree respectively at each search result web page of set;
The COMPREHENSIVE CALCULATING subelement is used for first numerical value and second value according to all search result web page of each electronic map data corresponding set, calculates the importance degree of this electronic map data.
14, device according to claim 13 is characterized in that:
First numerical value and the second value of each search result web page multiplied each other during described COMPREHENSIVE CALCULATING subelement will be gathered, and then the multiplied result of all search result web page is sued for peace in will gathering, and obtains the importance degree of this electronic map data.
15, device according to claim 13 is characterized in that: described first computation subunit obtains first numerical value by calculating the webpage rank.
16, device according to claim 12 is characterized in that, described device also comprises:
Adjustment unit, be used for different weights according to classification had under the electronic map data, the importance degree of this electronic map data be multiply by the weighted value of classification under this electronic map data, obtain adjusted result data, and output to sequencing unit and be used for ordering.
17, device according to claim 12 is characterized in that: described keyword extracting unit is referred to as keyword with the name of the electronic map data that extracts.
18, device according to claim 17 is characterized in that: described keyword extracting unit is also with the address information of the electronic map data that extracts, with title together as keyword.
19, device according to claim 12 is characterized in that, described device also comprises:
Pretreatment unit is used for original electronic map data is carried out pre-service, and the pre-service result is outputed to keyword extracting unit; Wherein, described pre-service comprises the irrelevant symbol of removal, character code conversion, adjusts consolidation form.
20, device according to claim 12 is characterized in that, described device also comprises:
Retrieval unit is used in the electronic chart retrieval, and the query word of importing according to the user returns the result for retrieval that is complementary, and the forward electronic map data of ordering in the result for retrieval is preferentially shown.
21, device according to claim 12 is characterized in that, described device also comprises:
Figure layer display unit is used for choosing the forward electronic map data of indication range internal sort and showing when the figure layer shows.
22, device according to claim 12 is characterized in that, described device also comprises:
Data updating unit is used for the forward electronic map data that sorts is preferentially upgraded.
23, a kind of search engine system is characterized in that, described system comprises the described device of the arbitrary claim of claim 12 to 22.
CN 200810222422 2008-09-16 2008-09-16 Method and apparatus for ordering electronic map data Active CN101350154B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200810222422 CN101350154B (en) 2008-09-16 2008-09-16 Method and apparatus for ordering electronic map data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200810222422 CN101350154B (en) 2008-09-16 2008-09-16 Method and apparatus for ordering electronic map data

Publications (2)

Publication Number Publication Date
CN101350154A true CN101350154A (en) 2009-01-21
CN101350154B CN101350154B (en) 2013-01-30

Family

ID=40268929

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200810222422 Active CN101350154B (en) 2008-09-16 2008-09-16 Method and apparatus for ordering electronic map data

Country Status (1)

Country Link
CN (1) CN101350154B (en)

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011072411A1 (en) * 2009-12-14 2011-06-23 北京友迈在地科技有限公司 Method and system for displaying special symbols in priority order in electronic map
CN102541936A (en) * 2010-12-31 2012-07-04 高德软件有限公司 Method and device for acquiring popularity of POI (Point of Interest)
CN102890725A (en) * 2012-11-02 2013-01-23 瑞庭网络技术(上海)有限公司 Result ranking method for search engine
CN103185596A (en) * 2011-12-30 2013-07-03 上海博泰悦臻电子设备制造有限公司 Interest point searching method and interest point searching device
CN103258057A (en) * 2013-06-03 2013-08-21 北京奇虎科技有限公司 Method and device for displaying point of interest on electronic map interface
CN103336807A (en) * 2013-06-25 2013-10-02 百度在线网络技术(北京)有限公司 Method and system for displaying POI (points of interest)
CN103577442A (en) * 2012-07-30 2014-02-12 腾讯科技(深圳)有限公司 Method and device for calculating map data importance
CN104123318A (en) * 2013-04-28 2014-10-29 百度在线网络技术(北京)有限公司 Method and system for displaying interest points in map
CN104281577A (en) * 2013-07-02 2015-01-14 威盛电子股份有限公司 Method for ordering data files
CN104281576A (en) * 2013-07-02 2015-01-14 威盛电子股份有限公司 Display method for landmark data
CN104317909A (en) * 2014-10-27 2015-01-28 百度在线网络技术(北京)有限公司 Method and device for verifying data of points of interest
CN104462143A (en) * 2013-09-24 2015-03-25 高德软件有限公司 Method and device for establishing chain brand word bank and category word bank
CN104899200A (en) * 2014-03-04 2015-09-09 高德软件有限公司 POI search feedback method and device
CN105069079A (en) * 2015-07-31 2015-11-18 北京奇虎科技有限公司 Method and device for screening point of interest POI data
CN105222803A (en) * 2015-10-20 2016-01-06 北京百度网讯科技有限公司 Map POI display packing and terminal
CN105550330A (en) * 2015-12-21 2016-05-04 北京奇虎科技有限公司 Point of interest (POI) information sorting method and system
CN105608112A (en) * 2015-12-10 2016-05-25 北京奇虎科技有限公司 Method and apparatus for measuring quality of map POI data
CN105786915A (en) * 2014-12-25 2016-07-20 高德软件有限公司 POI importance degree determination method and device
CN107315748A (en) * 2016-04-26 2017-11-03 斑马网络技术有限公司 Electronic map indexing means, device, terminal device and user interface system
CN107315750A (en) * 2016-04-26 2017-11-03 斑马网络技术有限公司 Electronic map figure layer display methods, device, terminal device and user interface system
CN107798018A (en) * 2016-09-06 2018-03-13 高德软件有限公司 A kind of method to set up and device of point of interest display information
CN107918512A (en) * 2017-11-16 2018-04-17 携程旅游信息技术(上海)有限公司 Hotel information display methods, device, electronic equipment, storage medium
CN108984640A (en) * 2018-06-22 2018-12-11 华北电力大学 A kind of geography information acquisition methods excavated based on web data
CN111026937A (en) * 2019-11-13 2020-04-17 百度在线网络技术(北京)有限公司 Method, device and equipment for extracting POI name and computer storage medium
CN111177125A (en) * 2013-03-15 2020-05-19 美国结构数据有限公司 Apparatus, system and method for analyzing characteristics of entities of interest

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1389811A (en) * 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
US7475074B2 (en) * 2005-02-22 2009-01-06 Taiwan Semiconductor Manufacturing Co., Ltd. Web search system and method thereof
CN101000608A (en) * 2006-01-11 2007-07-18 吴风勇 Key word dynamic matching generating based on search engine technology

Cited By (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102667759A (en) * 2009-12-14 2012-09-12 北京友迈在地科技有限公司 Method and system for displaying special symbols in priority order in electronic map
WO2011072411A1 (en) * 2009-12-14 2011-06-23 北京友迈在地科技有限公司 Method and system for displaying special symbols in priority order in electronic map
CN102667759B (en) * 2009-12-14 2014-07-30 北京友迈在地科技有限公司 Method and system for displaying special symbols in priority order in electronic map
CN102541936A (en) * 2010-12-31 2012-07-04 高德软件有限公司 Method and device for acquiring popularity of POI (Point of Interest)
CN103185596A (en) * 2011-12-30 2013-07-03 上海博泰悦臻电子设备制造有限公司 Interest point searching method and interest point searching device
CN103577442B (en) * 2012-07-30 2019-02-05 腾讯科技(深圳)有限公司 A kind of map datum importance calculation method and device
CN103577442A (en) * 2012-07-30 2014-02-12 腾讯科技(深圳)有限公司 Method and device for calculating map data importance
CN102890725B (en) * 2012-11-02 2015-08-19 瑞庭网络技术(上海)有限公司 The result ordering method of search engine
CN102890725A (en) * 2012-11-02 2013-01-23 瑞庭网络技术(上海)有限公司 Result ranking method for search engine
CN111177125B (en) * 2013-03-15 2023-10-31 美国结构数据有限公司 Apparatus, system, and method for analyzing characteristics of an entity of interest
US11762818B2 (en) 2013-03-15 2023-09-19 Foursquare Labs, Inc. Apparatus, systems, and methods for analyzing movements of target entities
CN111177125A (en) * 2013-03-15 2020-05-19 美国结构数据有限公司 Apparatus, system and method for analyzing characteristics of entities of interest
CN104123318B (en) * 2013-04-28 2019-01-15 百度在线网络技术(北京)有限公司 A kind of method and system of map denotation point of interest
CN104123318A (en) * 2013-04-28 2014-10-29 百度在线网络技术(北京)有限公司 Method and system for displaying interest points in map
CN103258057A (en) * 2013-06-03 2013-08-21 北京奇虎科技有限公司 Method and device for displaying point of interest on electronic map interface
CN103258057B (en) * 2013-06-03 2017-06-23 北京奇虎科技有限公司 The method and apparatus for showing point of interest POI in electronic map interface
CN103336807B (en) * 2013-06-25 2018-01-05 百度在线网络技术(北京)有限公司 A kind of method and system for showing point of interest
CN103336807A (en) * 2013-06-25 2013-10-02 百度在线网络技术(北京)有限公司 Method and system for displaying POI (points of interest)
CN104281576A (en) * 2013-07-02 2015-01-14 威盛电子股份有限公司 Display method for landmark data
CN104281577A (en) * 2013-07-02 2015-01-14 威盛电子股份有限公司 Method for ordering data files
CN104281577B (en) * 2013-07-02 2018-11-16 威盛电子股份有限公司 The sort method of data file
CN104281576B (en) * 2013-07-02 2018-08-31 威盛电子股份有限公司 The display methods of landmark data
CN104462143A (en) * 2013-09-24 2015-03-25 高德软件有限公司 Method and device for establishing chain brand word bank and category word bank
CN104462143B (en) * 2013-09-24 2018-01-30 高德软件有限公司 Chain brand word dictionary, classifier dictionary method for building up and device
CN104899200A (en) * 2014-03-04 2015-09-09 高德软件有限公司 POI search feedback method and device
CN104317909B (en) * 2014-10-27 2018-09-28 百度在线网络技术(北京)有限公司 The method of calibration and device of interest point data
CN104317909A (en) * 2014-10-27 2015-01-28 百度在线网络技术(北京)有限公司 Method and device for verifying data of points of interest
CN105786915A (en) * 2014-12-25 2016-07-20 高德软件有限公司 POI importance degree determination method and device
CN105069079A (en) * 2015-07-31 2015-11-18 北京奇虎科技有限公司 Method and device for screening point of interest POI data
CN105222803A (en) * 2015-10-20 2016-01-06 北京百度网讯科技有限公司 Map POI display packing and terminal
WO2017067211A1 (en) * 2015-10-20 2017-04-27 北京百度网讯科技有限公司 Map poi display method and terminal
CN105608112A (en) * 2015-12-10 2016-05-25 北京奇虎科技有限公司 Method and apparatus for measuring quality of map POI data
CN105550330A (en) * 2015-12-21 2016-05-04 北京奇虎科技有限公司 Point of interest (POI) information sorting method and system
CN105550330B (en) * 2015-12-21 2020-09-11 北京奇虎科技有限公司 Method and system for ordering POI (Point of interest) information
CN107315750A (en) * 2016-04-26 2017-11-03 斑马网络技术有限公司 Electronic map figure layer display methods, device, terminal device and user interface system
CN107315748A (en) * 2016-04-26 2017-11-03 斑马网络技术有限公司 Electronic map indexing means, device, terminal device and user interface system
CN107798018B (en) * 2016-09-06 2020-04-10 高德软件有限公司 Method and device for setting display information of interest points
CN107798018A (en) * 2016-09-06 2018-03-13 高德软件有限公司 A kind of method to set up and device of point of interest display information
CN107918512A (en) * 2017-11-16 2018-04-17 携程旅游信息技术(上海)有限公司 Hotel information display methods, device, electronic equipment, storage medium
CN108984640A (en) * 2018-06-22 2018-12-11 华北电力大学 A kind of geography information acquisition methods excavated based on web data
CN111026937A (en) * 2019-11-13 2020-04-17 百度在线网络技术(北京)有限公司 Method, device and equipment for extracting POI name and computer storage medium
US11768892B2 (en) 2019-11-13 2023-09-26 Baidu Online Network Technology (Beijing) Co., Ltd. Method and apparatus for extracting name of POI, device and computer storage medium

Also Published As

Publication number Publication date
CN101350154B (en) 2013-01-30

Similar Documents

Publication Publication Date Title
CN101350154B (en) Method and apparatus for ordering electronic map data
CN100405371C (en) Method and system for abstracting new word
CN101299217B (en) Method, apparatus and system for processing map information
CN1936893B (en) Method and system for generating input-method word frequency base based on internet information
US10445346B2 (en) Custom local search
CN104881488B (en) Configurable information extraction method based on relation table
CN103365924B (en) A kind of method of internet information search, device and terminal
CN104463730A (en) Method and equipment for excavating tour route based on tour destination
CN106682169A (en) Application label mining method and device, and application searching method and server
CN101630314A (en) Semantic query expansion method based on domain knowledge
CN101794277B (en) Method for embedding geographical labels in network character information and system
CN103106287A (en) Processing method and processing system for retrieving sentences by user
CN102541936A (en) Method and device for acquiring popularity of POI (Point of Interest)
CN106682170A (en) Application searching method and device
JP2022532451A (en) How to disambiguate Chinese place name meanings based on encyclopedia knowledge base and word embedding
CN102253972A (en) Web crawler-based geographical name database maintenance method
CN103902521A (en) Chinese statement identification method and device
CN112528639B (en) Object recognition method and device, storage medium and electronic equipment
CN103886020A (en) Quick search method of real estate information
CN112527933A (en) Chinese address association method based on space position and text training
Ahlers et al. Location-based Web search
CN108984640A (en) A kind of geography information acquisition methods excavated based on web data
CN102306182A (en) Method for excavating user interest based on conceptual semantic background image
CN105678383A (en) Mobile knowledge service system on the basis of ontology model
CN106649823A (en) Webpage classification recognition method based on comprehensive subject term vertical search and focused crawler

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: BEIJING SOHU NEW MEDIA INFORMATION TECHNOLOGY CO.,

Free format text: FORMER OWNER: SOGO SCIENCE-TECHNOLOGY DEVELOPMENT CO., LTD., BEIJING

Effective date: 20101020

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100084 ROOM 01, 9/F, SOHU.COM INTERNET PLAZA, BUILDING 9, YARD 1, ZHONGGUANCUN EAST ROAD, HAIDIAN DISTRICT, BEIJING TO: 100084 ROOM 802, 8/F, SOHU.COM INTERNET PLAZA, BUILDING 9, YARD 1, ZHONGGUANCUN EAST ROAD, HAIDIAN DISTRICT, BEIJING

TA01 Transfer of patent application right

Effective date of registration: 20101020

Address after: 100084 Beijing, Zhongguancun East Road, building 1, No. 9, Sohu cyber building, room 8, room, Room 802

Applicant after: Beijing Sohu New Media Information Technology Co., Ltd.

Address before: 100084 Beijing, Zhongguancun East Road, building 1, No. 9, Sohu cyber building, room 9, room, room 01

Applicant before: Sogo Science-Technology Development Co., Ltd., Beijing

C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: SOGO SCIENCE-TECHNOLOGY DEVELOPMENT CO., LTD., BEI

Free format text: FORMER OWNER: BEIJING SOHU NEW MEDIA INFORMATION TECHNOLOGY CO., LTD.

Effective date: 20130902

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20130902

Address after: 100084 Beijing, Zhongguancun East Road, building 1, No. 9, Sohu cyber building, room 9, room, room 01

Patentee after: Sogo Science-Technology Development Co., Ltd., Beijing

Address before: 100084 Beijing, Zhongguancun East Road, building 1, No. 9, Sohu cyber building, room 8, room, Room 802

Patentee before: Beijing Sohu New Media Information Technology Co., Ltd.