CN103577442B - A kind of map datum importance calculation method and device - Google Patents

A kind of map datum importance calculation method and device Download PDF

Info

Publication number
CN103577442B
CN103577442B CN201210266470.3A CN201210266470A CN103577442B CN 103577442 B CN103577442 B CN 103577442B CN 201210266470 A CN201210266470 A CN 201210266470A CN 103577442 B CN103577442 B CN 103577442B
Authority
CN
China
Prior art keywords
place name
data
list
map datum
score value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210266470.3A
Other languages
Chinese (zh)
Other versions
CN103577442A (en
Inventor
程盛远
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Tencent Cloud Computing Beijing Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201210266470.3A priority Critical patent/CN103577442B/en
Publication of CN103577442A publication Critical patent/CN103577442A/en
Application granted granted Critical
Publication of CN103577442B publication Critical patent/CN103577442B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention belongs to Internet technical field more particularly to a kind of map datum importance calculation methods and device.Map datum importance calculation method of the present invention includes: to obtain data from map interest point data base, wherein the data of acquisition include place name list;The frequency that statistically place name occurs in list of file names and/or the number of results occurred in web page search engine;Corresponding score value is converted by the frequency of place name appearance and/or the number of results occurred in web page search engine, importance sorting is carried out according to score value.Map datum importance calculation method and device of the present invention are ranked up by the statistics interest point data base internal map data frequency of occurrences and by the query result number of search engine, and place name Rank table is generated according to ordering score size after being filtered to abnormal sorting data, the coverage rate and accuracy rate of map datum different degree are improved, and improves the correlation of the sequence of map search.

Description

A kind of map datum importance calculation method and device
Technical field
The invention belongs to Internet technical field more particularly to a kind of map datum importance calculation methods and device.
Background technique
The abbreviation of comprehensive POI(" Point of Interest ", point of interest) information be navigation map indispensable information, Timely POI point of interest can remind the branch of user's road conditions and the detailed information of neighboring buildings, include place in each POI data The cartographic informations such as title, classification, longitude and latitude, each place required for facilitating user to find.Currently, in map search In, POI data would generally according to relevance ranking, when specifying when Perimeter or sub-category retrieval are done in some place, by In no query word, can utilize with central point away from discrete data importance sorting.The different degree of POI data generally passes through offline It calculates, it mainly includes two kinds that utilizable different degree, which calculates information: one, different numbers being manually assigned to according to administrative grade height Value, such as the national level different numerical value corresponding with district grade in government bodies' class;Two, according to the quality of data source, to not Same source is assigned to different score values, such as thematic data will be typically higher than the score value of crawl data with the score value for buying data.
The shortcomings that existing different degree calculation is: one, government bodies' class is a part of whole POI data, Its category map data can not determine an accurate administrative grade, such as restaurant class, meanwhile, it is more under the same rank Data different degree also cannot be distinguished, so determining that the method coverage rate of different degree is lower according to administrative grade height;Two, high The data in quality source can also have wrong data, and quality height is two different concepts, quality data with data different degree Not necessarily different degree is higher, and the height of different degree cannot be distinguished in the data of same source, so obtaining weight by quality height Want that the calculation coverage rate of angle value is not high and accuracy is relatively low.
Summary of the invention
The present invention provides a kind of map datum importance calculation method and devices, it is intended to solve map number in the prior art Problem not high according to the calculation coverage rate of different degree and that accuracy is low.
The invention is realized in this way a kind of map datum importance calculation method, comprising:
Data are obtained from map interest point data base, wherein obtaining data includes place name list;
The frequency that statistically place name occurs in list of file names and/or the number of results occurred in web page search engine;
Corresponding score value, root are converted by the frequency of place name appearance and/or the number of results occurred in web page search engine Importance sorting is carried out according to score value.
Another technical solution that the present invention takes are as follows: a kind of map datum different degree computing device, including data acquisition mould Block, data statistics module and standardization processing module, the data acquisition module from map interest point data base for obtaining Data, wherein obtaining data includes place name list;The frequency that the data statistics module occurs for place name in statistically list of file names Rate and/or the number of results occurred in web page search engine;The standardization processing module be used for frequency that place name is occurred with/ Or the number of results occurred in web page search engine is converted into corresponding score value, carries out importance sorting according to score value.
Technical solution of the present invention have the following advantages that or the utility model has the advantages that map datum importance calculation method of the present invention and Device is ranked up by the statistics POI data library internal map data frequency of occurrences and by the query result number of search engine, And place name Rank table is generated according to ordering score size after being filtered to abnormal sorting data, place name Rank table is built into and is searched It indexes the concordance program held up to use for online correlation, improves the coverage rate and accuracy rate of map datum different degree, and mention The high correlation of map search sequence.
Detailed description of the invention
Attached drawing 1 is the flow chart of the map datum importance calculation method of first embodiment of the invention;
Attached drawing 2 is the flow chart of the map datum importance calculation method of second embodiment of the invention;
Attached drawing 3 is the structural schematic diagram of the map datum different degree computing device of first embodiment of the invention;
Attached drawing 4 is the structural schematic diagram of the map datum different degree computing device of second embodiment of the invention.
Specific embodiment
In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to the accompanying drawings and embodiments, right The present invention is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, and It is not used in the restriction present invention.
Referring to Fig. 1, being the flow chart of the map datum importance calculation method of first embodiment of the invention.The present invention the The map datum importance calculation method of one embodiment the following steps are included:
S100: from the abbreviation of map POI(" Point of Interest ", point of interest) two parts are obtained respectively in database Data: ground list of file names and the corresponding relationship list of place name address;
In S100, ground list of file names is used for by generating standard place name after conversion, and the corresponding relationship list of place name address is used for The frequency of occurrence of SS place name.
S110: ground list of file names and the corresponding relationship list of place name address are pre-processed, and generate standard gazetteer;
In S110, pretreatment includes cleaning bracket, carries out complicated and simple conversion, the conversion of full half-angle and/or Chinese figure conversion The processing such as Arabic numerals.Ground list of file names may contain bracket, such as " BJ University of Aeronautics & Astronautics (southwestern door) ", " Tsinghua University (west gate) ", bracket are often annotated content, if it is less than normal directly to go statistics to will cause result, are needed bracket together with inner The content in face is all removed;Ground list of file names and the corresponding relationship list of place name address are using complicated and simple conversion, the conversion of full half-angle, Chinese number Word conversion Arabic numerals etc. after totally four preprocessing process, generate the gazetteer of standard.
S120: according to the place name in standard gazetteer, count what the place name occurred in the corresponding relationship list of place name address Frequency;
In S120, the frequency of occurrences includes the number occurred in ground ranks, the number occurred in address column or line number, No matter the place name ranks on ground or occurs in address column, all shows primary reference, is equivalent to PageRank(Google ranking A part of algorithm (ranking formula) is a kind of method of grade/importance that Google is used to be used to presentation web page) It is directed toward, the number that place name occurs in address or title can directly reflect the reference value of a POI, such as the dining room * *, * * North gate, the number for quoting entity place name * * is more, show the place name be used as terrestrial reference direction chance it is more, have certain " power Prestige ", this is similar with Webpage search PageRank, the difference is that PageRank is referred to by people's (webpage), here LinkRank is referred to by other POI;Wherein, the line number only occurred by place name is it may determine that the size of the frequency of occurrences is suitable Sequence;In embodiments of the present invention, matching way is using exact matching.
S130: the place name frequency of occurrences value of statistics is subjected to standardization processing and is converted into corresponding score value;
In S130, the place name frequency of occurrences value of statistics is subjected to standardization processing and is converted into corresponding score value are as follows: will Long section, dispersion integer value be converted into the short interval value that correlation can be used, such as 0 ~ 1 or 0~10 or 0~100, at this In short interval value, score value size is to represent the size of frequency values, and common conversion method has: linear function or log function etc., can According to frequency and a kind of suitable transfer function of relevance score interval selection.
S140: the place name sequence Rank table of (place name, score value) is generated according to the sequence of score value size.
In S140, place name Rank table can be built into index by the offline concordance program of search engine, and score value part supplies Line correlation uses.It include the information such as place name and its corresponding score value, the high place name, that is, frequency of occurrences of score value in place name Rank table It is higher, it indicates that its different degree is higher, the different degree of map datum is calculated by the frequency of occurrences of place name, improves map search Accuracy rate.In embodiments of the present invention, in order to avoid interfering with each other between different cities, the geographical name data in a city will not It is influenced by the data of the same name in another city, can be limited in the closed data subset in some city and be counted;Separately Outside, since the more usual different degree of the number of results arrived by search engine inquiry can be higher, inquiry knot can further be limited Quality threshold in fruit, as ordering score cannot be too low.The present invention is sorted by the internal statistical in POI data library, can be accurate The significance level for reflecting map datum, improves the sequence correlation of map search, such as: in certain closing POI data library statistics (place name, number of results), first two are national well-known research institutes, and latter two are prefecture-level research institution or company, statistical result It is as follows:
Institute of Automation Research of CAS 13
The Institute of Software, Chinese Academy of Science 4
Beijing aura technical research institute 1
Beijing Control Engineering Inst. 1
Similarly:
The Third Affiliated Hospital of Peking University 24 [front three]
Beijing Haidian hospital 7 [diformazan]
It can be seen that the relative size of score value number embodies the different different degrees of data.
Referring to Fig. 2, being the flow chart of the map datum importance calculation method of second embodiment of the invention.The present invention the The map datum importance calculation methods of two embodiments the following steps are included:
S200: ground list of file names is obtained from map POI data library;
S210: standardization format conversion is carried out by place name lists construction query string, and by query string;
In S210, when due to routine access search engine, the format of query string there are certain requirements, need query string In the punctuation marks such as single quotation marks, TAB be converted into space.
S220: the number of results that every standardization query string of statistics occurs in web page search engine;
In step S220, the result page that every standardization query string occurs in web page search engine can also be counted.
S230: processing is filtered to abnormal number of results;
In S230, since the query string search range that has is than broad, such as " restaurant ", it is this kind of to have more correlation Webpage searching result influences the quality of search result;Even if the relatively determining query string of meaning, the result that search engine provides There can be of low quality or even incoherent situation, so needing to be filtered abnormal number of results and result page, remove knot The too many inquiry of fruit number or the too low page of PageRank score value, to improve the quality of search result.In embodiment of the present invention In, standard score workable for correlation can be corrected by only calculating former pages of result PageRank.In addition, number of results With result page PageRank calculating section, two different dimensions in city and classification, such as root can also be limited according to different cities According to realistic situation, it is normal, and the culture and education class of Lhasa that the culture and education class place name number of results of Beijing, which is higher than 100,000, It is abnormal that name number of results, which is higher than 100,000,.
S240: corresponding score value is converted by filter result number progress standardization processing;
In S240, greatly due to result sum variation space, it is not easy to directly be ranked up, statistical result number is advised Generalized processing is converted into corresponding score value are as follows: the integer value in long section, dispersion is converted into the short area that correlation can be used Between, such as 0 ~ 1,0~10 or 0~100 etc., common conversion method has: linear function or log function etc., can be according to number of results With a kind of suitable transfer function of relevance score interval selection.
S250: place name Rank table is generated according to the score value size sequence after conversion.
In S250, place name Rank table can be built into index by the offline concordance program of search engine, and score value part supplies Line correlation uses.It include the information such as place name and its corresponding score value in place name Rank table, the high place name, that is, number of results of score value is got over Height indicates that its different degree is higher, and the query result number by statistics place name in web page search engine calculates the important of map datum Degree, can accurately reflect the significance level of geographical name data, convenient for the relevance ranking of map search, for example, in certain search engine (place name, the number of results) of statistics, well-known universities and colleges vs. College of Adult Education, statistical result are as follows:
Tsinghua University 1,000,000,000
Beijing City University 2,700,000
The former number of results can correspond to a relatively high importance value, and the different degree score value of the latter is then relatively low.
In an embodiment of the present invention, when carrying out number of results filtering and standardization processing, city, classification two can be limited Different dimensions, such as rule of thumb, it is normal, and the culture of Lhasa that the culture and education class place name number of results of Beijing, which is higher than 100,000, It is abnormal that educational place name number of results, which is higher than 100,000,.
It in an alternative embodiment of the invention, can also be by the internal statistical sequence and the in POI data library in first embodiment The query result number sequence of search engine is combined use according to different applications in two embodiments, improves the row of map search Sequence correlation.
Referring to Fig. 3, being the structural schematic diagram of the device of the map datum different degree calculating of first embodiment of the invention.This The device that the map datum different degree of invention first embodiment calculates includes data acquisition module, data conversion module, data system Count module, standardization processing module and result-generation module, wherein
Data acquisition module is used for from the abbreviation of map POI(" Point of Interest ", point of interest) it obtains in database Take two parts of data: ground list of file names and the corresponding relationship list of place name address;Ground list of file names is used for by generating standard place name after conversion, The corresponding relationship list of place name address is used for the frequency of occurrence of SS place name.
Data conversion module is used for ground list of file names and the corresponding relationship list of place name address according to pre-processing, and generates mark Quasi- gazetteer;Wherein, the pretreatment of data conversion module include cleaning bracket, carry out it is complicated and simple conversion, full half-angle conversion and/or in The processing such as literary number conversion Arabic numerals.Ground list of file names may contain bracket, such as " BJ University of Aeronautics & Astronautics (southwestern door) ", " Tsinghua University (west gate) ", bracket is often annotated content, if it is less than normal directly to go statistics to will cause result, is needed including It number is all removed together with the content of the inside;Ground list of file names and the corresponding relationship list of place name address are also needed by complicated and simple conversion, complete half Angle conversion, Chinese figure conversion Arabic numerals etc. after totally four preprocessing process, generate the gazetteer of standard.
Data statistics module is used to count the place name according to the place name in standard gazetteer and arrange in place name address corresponding relationship The frequency occurred in table;Wherein, the frequency of occurrences includes the number occurred in ground ranks, the number or row that occur in address column Number, no matter the place name ranks on ground or occurs in address column, all shows primary reference, is equivalent to PageRank(Google A part of ranking algorithm (ranking formula) is a kind of side of grade/importance that Google is used to be used to presentation web page Method) direction, the number that place name occurs in address or title can directly reflect the reference value of a POI, such as " * * " Dining room, the north gate " * * ", the number for quoting the entity place name " * * " is more, shows that the place name is used as the chance that terrestrial reference is directed toward and gets over It is more, have certain " authority ", it is similar with Webpage search PageRank, the difference is that PageRank is referred to by people's (webpage), Here LinkRank is referred to by other POI;Wherein, only by place name occur line number it may determine that the frequency of occurrences it is big Small sequence;In embodiments of the present invention, matching way is using exact matching.
Standardization processing module is used to convert the place name frequency of occurrences value progress standardization processing of statistics to corresponding Score value;Wherein, the place name frequency of occurrences value of statistics is carried out standardization processing and is converted into corresponding point by standardization processing module Value are as follows: the integer value in long section, dispersion is converted into the short interval value that correlation can be used, such as 0 ~ 1 or 0~10 or 0~100 Deng in the short interval value, score value size is to represent the sizes of frequency values, and common conversion method has: linear function or log letter Number etc., can be according to frequency and a kind of suitable transfer function of relevance score interval selection.
Result-generation module is used to generate the place name sequence Rank table of (place name, score value) according to the sequence of score value size.It searches Place name Rank table can be built into index by indexing the offline concordance program held up, and score value part is used for online correlation.Wherein, ground It include the information such as place name and its corresponding score value in name Rank table, the high place name, that is, frequency of occurrences of score value is higher, indicates that its is important Degree is higher, and the different degree of map datum is calculated by the frequency of occurrences of place name, improves the accuracy rate of map search.In the present invention In embodiment, in order to avoid interfering with each other between different cities, the geographical name data in a city will not be by another city The influence of data of the same name can be limited in the closed data subset in some city and be counted;In addition, due to by searching Index holds up the more usual different degree of the number of results inquired can be higher, can further limit the quality threshold in query result Value, as ordering score cannot be too low.The present invention is sorted by the internal statistical in POI data library, can accurately reflect geographical name data Significance level, improve the sequence correlation of map search, such as: certain closing POI data library statistics (place name, as a result Number), first two are national well-known research institutes, and latter two are prefecture-level research institution or company, and statistical result is as follows:
Institute of Automation Research of CAS 13
The Institute of Software, Chinese Academy of Science 4
Beijing aura technical research institute 1
Beijing Control Engineering Inst. 1
Similarly:
The Third Affiliated Hospital of Peking University 24 [front three]
Beijing Haidian hospital 7 [diformazan]
It can be seen that the relative size of score value number embodies the different different degrees of data.
Referring to Fig. 4, being the structural schematic diagram of the device of the map datum different degree calculating of second embodiment of the invention.This The device that the map datum different degree of invention second embodiment calculates includes data acquisition module, format converting module, data system Count module, data filtering module, standardization processing module and result-generation module, wherein
Data acquisition module is used to obtain ground list of file names from map POI data library;
Format converting module is used for through place name lists construction query string, and query string is carried out standardization format conversion; When wherein, due to routine access search engine, the format of query string there are certain requirements, need in query string single quotation marks, The punctuation marks such as TAB are converted into space.
Data statistics module is used to count the number of results that every standardization character string occurs in web page search engine one by one;
Data filtering module is used to be filtered processing to abnormal number of results or/and result page;Wherein, since some is looked into String search range is ask than broad, such as " restaurant ", it is this kind of to have more related web page search result, influence search result Quality;Even if the relatively determining query string of meaning, the result that search engine provides can also exist of low quality or even incoherent Situation removes the too many inquiry of number of results or PageRank points so needing to be filtered abnormal number of results and result page It is worth the too low page, to improve the quality of search result.It in embodiments of the present invention, can be by only calculating former pages of knot Fruit PageRank corrects standard score workable for correlation.In addition, number of results and result page PageRank calculating section, also Two different dimensions in city and classification can be limited according to different cities, such as according to realistic situation, the culture and education of Beijing It is normal that class place name number of results, which is higher than 100,000, and it is abnormal that the culture and education class place name number of results of Lhasa, which is higher than 100,000,.
Standardization processing module is used to convert corresponding score value for filter result number progress standardization processing.Wherein, Corresponding score value is converted by statistical result number progress standardization processing are as follows: the integer value of long section, dispersion is converted into phase The short section that closing property can be used, such as 0 ~ 1,0~10 or 0~100 etc., common conversion method has: linear function or log letter Number etc., can be according to number of results and a kind of suitable transfer function of relevance score interval selection.
Result-generation module is used to generate place name Rank(sequence according to the score value size sequence after conversion) table, by place name Rank table is built into the concordance program of search engine, uses for online correlation;Wherein, in place name Rank table include place name and its The information such as corresponding score value, the high place name, that is, number of results of score value is higher, indicates that its different degree is higher, by statistics place name in webpage The query result number of search engine calculates the different degree of map datum, can accurately reflect the significance level of geographical name data, be convenient for The relevance ranking of map search, for example, well-known universities and colleges vs. is at teaching in (place name, the number of results) of certain search engine statistics Institute, statistical result are as follows:
Tsinghua University 1,000,000,000
Beijing City University 2,700,000
The former number of results can correspond to a relatively high importance value, and the different degree score value of the latter is then relatively low.
It in an alternative embodiment of the invention, can also be by the internal statistical sequence and the in POI data library in first embodiment The query result number sequence of search engine is combined use according to different applications in two embodiments, improves the row of map search Sequence correlation.
There is frequency by statistics POI data library internal map data in map datum importance calculation method and device of the present invention Rate is ranked up and is ranked up by the query result number of search engine, and is filtered rear basis to abnormal sorting data Ordering score size generates place name Rank table, and the concordance program that place name Rank table is built into search engine is made for online correlation With improving the sequence correlation of map search, and improve the coverage rate and accuracy rate of map datum different degree;In addition, of the invention Map datum importance calculation method and device are combined uses according to different applications, improve the sequence correlation of map search Property.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention Made any modifications, equivalent replacements, and improvements etc., should all be included in the protection scope of the present invention within mind and principle.

Claims (10)

1. a kind of map datum importance calculation method, comprising:
Data are obtained from map interest point data base, wherein the data of acquisition include the corresponding pass of place name list and place name address Series of tables;
Ground list of file names, the corresponding relationship list of place name address are pre-processed, standard gazetteer is generated;
Count the frequency that the place name occurs in the corresponding relationship list of place name address according to the place name in standard gazetteer, it is described go out Existing frequency include: rank number, the number occurred in address column or the line number of middle appearance, no matter the place name ranks also on ground It is to occur in address column, all shows primary reference;
The frequency translation that place name is occurred is corresponding score value, carries out importance sorting according to score value.
2. map datum importance calculation method according to claim 1, which is characterized in that the pretreatment includes cleaning The bracket that contains in ground list of file names and over the ground list of file names, the corresponding relationship list of place name address carry out complicated and simple conversion, full half-angle turns It changes and/or Chinese figure converts Arabic numerals.
3. map datum importance calculation method according to claim 1, which is characterized in that further include:
The statistically number of results that place name occurs in web page search engine in list of file names;
It specifically includes: by place name lists construction query string, query string being converted into standardization format, statistical specifications query string The number of results occurred in web page search engine.
4. map datum importance calculation method according to claim 1, which is characterized in that from map interest point data In library after acquisition data, further includes:
The statistically number of results that place name occurs in web page search engine in list of file names;
The number of results that place name is occurred in web page search engine is converted into corresponding score value, carries out different degree row according to score value Sequence;
Wherein, after the number of results step that place name occurs in web page search engine in the statistically list of file names further include: to different Normal number of results is filtered processing.
5. map datum importance calculation method according to claim 1 or 4, which is characterized in that described place name occur Frequency translation be corresponding score value step include: that the place name frequency of occurrences of statistics is converted into the short area that correlation can be used Between be worth.
6. map datum importance calculation method according to claim 1, which is characterized in that the frequency for place name occur After rate is converted into corresponding score value step further include: generate place name sequencing table according to the score value size sequence after conversion, and by ground Name sequencing table is built into the concordance program of search engine.
7. a kind of map datum different degree computing device, which is characterized in that including data acquisition module, data conversion module, number Module and standardization processing module according to statistics, the data acquisition module are used to obtain data from map interest point data base, Wherein, obtaining data includes place name list and the corresponding relationship list of place name address;The data conversion module is for ranking ground Table, the corresponding relationship list of place name address are pre-processed, and standard gazetteer is generated;The data statistics module is used for according to standard Place name in gazetteer counts the frequency that the place name occurs in the corresponding relationship list of place name address, and the frequency of occurrences includes: Ground ranks number, the number occurred in address column or the line number of middle appearance, and no matter the place name ranks on ground or in address column Occur, all shows primary reference;It is corresponding score value that the standardization processing module, which is used for the frequency translation for place name occur, Importance sorting is carried out according to score value.
8. map datum different degree computing device according to claim 7, which is characterized in that the data conversion module into Capable pretreatment includes the bracket contained in cleaning ground list of file names and list of file names, the corresponding relationship list of place name address carry out over the ground Complicated and simple conversion, the conversion of full half-angle and/or Chinese figure convert Arabic numerals.
9. map datum different degree computing device according to claim 7, which is characterized in that the data statistics module is also It can be used for the number of results that place name occurs in web page search engine in statistically list of file names, further include format converting module sum number According to filtering module, the format converting module is used for through place name lists construction query string, and query string is converted to standardization Format;The data filtering module is used to be filtered processing to abnormal number of results.
10. map datum different degree computing device according to claim 7, which is characterized in that further include that result generates mould Block, the result-generation module, which is used to be sorted according to score value size, generates place name sequencing table, and place name sequencing table is built into search The concordance program of engine.
CN201210266470.3A 2012-07-30 2012-07-30 A kind of map datum importance calculation method and device Active CN103577442B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210266470.3A CN103577442B (en) 2012-07-30 2012-07-30 A kind of map datum importance calculation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210266470.3A CN103577442B (en) 2012-07-30 2012-07-30 A kind of map datum importance calculation method and device

Publications (2)

Publication Number Publication Date
CN103577442A CN103577442A (en) 2014-02-12
CN103577442B true CN103577442B (en) 2019-02-05

Family

ID=50049247

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210266470.3A Active CN103577442B (en) 2012-07-30 2012-07-30 A kind of map datum importance calculation method and device

Country Status (1)

Country Link
CN (1) CN103577442B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104899200A (en) * 2014-03-04 2015-09-09 高德软件有限公司 POI search feedback method and device
CN103823900B (en) * 2014-03-17 2017-07-21 北京百度网讯科技有限公司 Information point importance determines method and apparatus
CN104462289B (en) * 2014-11-27 2018-11-20 百度在线网络技术(北京)有限公司 The recommended method and device of through number keyword
CN104462533B (en) * 2014-12-23 2018-12-07 北京奇虎科技有限公司 A kind of method and system judging that electronic map is shown based on query inquiry pattern
CN105222803A (en) * 2015-10-20 2016-01-06 北京百度网讯科技有限公司 Map POI display packing and terminal
CN105608112A (en) * 2015-12-10 2016-05-25 北京奇虎科技有限公司 Method and apparatus for measuring quality of map POI data
CN105574259B (en) * 2015-12-14 2017-06-20 华南理工大学 A kind of Urban cognition ground drawing generating method based on internet word frequency
CN105550330B (en) * 2015-12-21 2020-09-11 北京奇虎科技有限公司 Method and system for ordering POI (Point of interest) information
CN110019645B (en) * 2017-09-28 2022-04-19 北京搜狗科技发展有限公司 Index library construction method, search method and device
CN109408819B (en) * 2018-10-16 2023-05-16 吉奥时空信息技术股份有限公司 Core place name extraction method and device based on natural language processing technology

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101350154A (en) * 2008-09-16 2009-01-21 北京搜狗科技发展有限公司 Method and apparatus for ordering electronic map data
CN102541936A (en) * 2010-12-31 2012-07-04 高德软件有限公司 Method and device for acquiring popularity of POI (Point of Interest)

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20080048786A (en) * 2006-11-29 2008-06-03 팅크웨어(주) System and method for providing point of interest in destination around

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101350154A (en) * 2008-09-16 2009-01-21 北京搜狗科技发展有限公司 Method and apparatus for ordering electronic map data
CN102541936A (en) * 2010-12-31 2012-07-04 高德软件有限公司 Method and device for acquiring popularity of POI (Point of Interest)

Also Published As

Publication number Publication date
CN103577442A (en) 2014-02-12

Similar Documents

Publication Publication Date Title
CN103577442B (en) A kind of map datum importance calculation method and device
JP7182585B2 (en) program
CA2640365C (en) Geographic coding for location search queries
EP2631814B1 (en) Method for mapping text phrases to geographical locations
US20150356088A1 (en) Tile-based geocoder
US20070198495A1 (en) Geographic coding for location search queries
CN103605752A (en) Address matching method based on semantic recognition
JP2022532451A (en) How to disambiguate Chinese place name meanings based on encyclopedia knowledge base and word embedding
Huang et al. A natural-language-based visual query approach of uncertain human trajectories
EP2783308B1 (en) Full text search based on interwoven string tokens
CN102385597B (en) The fault-tolerant searching method of a kind of POI
CN101567150A (en) Method for accurately positioning digital map
Laddha et al. Semantic tourism information retrieval interface
David et al. Smart geocoding of objects
Thenmozhi et al. A framework for tourist recommendation system exploiting geo-tagged photos
Venkateswaran et al. Exploring and visualizing differences in geographic and linguistic web coverage
Varriale et al. VTIS: a volunteered travelers information system
Xu et al. Exploring regional variation in spatial language using spatially stratified web-sampled route direction documents
CN104537042B (en) Method and system for determining whether electronic map is displayed or not based on query item
Li et al. Automatic construction and visualization of address models
Wang et al. Construction of Scenic Spots Knowledge Map under the Integration of Culture and Tourism
Meng et al. Three fuzzy concepts and their implications for cartography
Wu et al. Distribution Characteristics and Image Perception Differences of Urban and Rural Tourist Attractions: A Case of Beijing
Zhang Route extraction, road name disambiguation and efficient spatial query processing under location constraints
Gonzalez Problems that arise when providing geographic coordinate information for cataloged maps

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20211009

Address after: 518000 Tencent Building, No. 1 High-tech Zone, Nanshan District, Shenzhen City, Guangdong Province, 35 Floors

Patentee after: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd.

Patentee after: TENCENT CLOUD COMPUTING (BEIJING) Co.,Ltd.

Address before: 2, 518044, East 403 room, SEG science and Technology Park, Zhenxing Road, Shenzhen, Guangdong, Futian District

Patentee before: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd.

TR01 Transfer of patent right