CN103577442A - Method and device for calculating map data importance - Google Patents

Info

Publication number
CN103577442A
Authority
CN
China
Prior art keywords
place name
list
data
search engine
importance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201210266470.3A
Other languages
Chinese (zh)
Other versions
CN103577442B (en)
Inventor
程盛远
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Tencent Cloud Computing Beijing Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201210266470.3A priority Critical patent/CN103577442B/en
Publication of CN103577442A publication Critical patent/CN103577442A/en
Application granted granted Critical
Publication of CN103577442B publication Critical patent/CN103577442B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29 Geographical information databases
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9537 Spatial or temporal dependent retrieval, e.g. spatiotemporal queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention belongs to the technical field of the Internet and in particular relates to a method and a device for calculating map data importance. The method includes: acquiring data from a map point-of-interest database, wherein the data comprises a place name list; counting the frequency with which the place names in the place name list occur and/or the number of results returned for them by a web search engine; and converting the frequency and/or the number of results into corresponding scores and ranking by importance according to the scores. The method and the device rank by counting how often map data occurs inside the point-of-interest database and how many results a search engine returns, filter abnormal ranking data, and then generate a place name Rank table from the ranking scores. This increases the coverage and accuracy of map data importance and improves the relevance of map search ranking.

Description

Method and device for calculating map data importance
Technical field
The invention belongs to the technical field of the Internet and in particular relates to a method and a device for calculating map data importance.
Background
POI (short for "Point of Interest") information is indispensable to a navigation map. POI data can promptly alert the user to road branches and to details of nearby buildings. Each POI record contains map information such as place name, category, and longitude and latitude, which helps users find the places they need. At present, POI data in map search is usually ranked by relevance. When a nearby search or a category search is made around a specified location, there is no query term, so the ranking can instead use the distance from the center point together with data importance. The importance of POI data is generally computed offline, and the information available for computing it falls mainly into two kinds. First, administrative level: different values are assigned manually according to how high the level is, for example different values for national-level and district-level entries in the government agency category. Second, the quality of the data source: different sources are assigned different scores, for example thematic data and purchased data generally score higher than crawled data.
The existing ways of computing importance have the following shortcomings. First, the government agency category is only one part of all POI data. Map data in other categories, such as restaurants, cannot be assigned an accurate administrative level, and the importance of the many records at the same level cannot be distinguished, so determining importance by administrative level has low coverage. Second, data from high-quality sources can also contain errors. Quality and importance are two different concepts: high-quality data is not necessarily more important, and records from the same source cannot be distinguished by importance. Deriving importance values from quality therefore has low coverage and low accuracy.
Summary of the invention
The invention provides a method and a device for calculating map data importance, intended to solve the problem in the prior art that the ways of computing map data importance have low coverage and low accuracy.
The invention is realized as follows. A method for calculating map data importance comprises:
Obtaining data from a map point-of-interest database, wherein the obtained data comprises a place name list;
Counting the frequency with which the place names in the place name list occur and/or the number of results returned for them by a web search engine;
Converting the frequency with which the place names occur and/or the number of results returned by the web search engine into corresponding scores, and ranking by importance according to the scores.
Another technical solution adopted by the invention is a device for calculating map data importance, comprising a data acquisition module, a data statistics module and a standardization processing module. The data acquisition module obtains data from a map point-of-interest database, wherein the obtained data comprises a place name list. The data statistics module counts the frequency with which the place names in the place name list occur and/or the number of results returned for them by a web search engine. The standardization processing module converts the frequency with which the place names occur and/or the number of results returned by the web search engine into corresponding scores, and ranks by importance according to the scores.
The technical solution of the invention has the following advantages or beneficial effects. The method and device rank by counting how often map data occurs inside the POI database and by the number of query results from a search engine, filter abnormal ranking data, and then generate a place name Rank table ordered by ranking score. The search engine's indexing program builds the place name Rank table into the index for use in online relevance computation. This improves the coverage and accuracy of map data importance and the relevance of map search ranking.
Brief description of the drawings
Fig. 1 is a flowchart of the method for calculating map data importance according to the first embodiment of the invention;
Fig. 2 is a flowchart of the method for calculating map data importance according to the second embodiment of the invention;
Fig. 3 is a structural diagram of the device for calculating map data importance according to the first embodiment of the invention;
Fig. 4 is a structural diagram of the device for calculating map data importance according to the second embodiment of the invention.
Detailed description of the embodiments
To make the object, technical solution and advantages of the invention clearer, the invention is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here only explain the invention and do not limit it.
Fig. 1 is a flowchart of the method for calculating map data importance according to the first embodiment of the invention. The method of the first embodiment comprises the following steps:
S100: obtain two data sets from the map POI (short for "Point of Interest") database: a place name list and a place name-address correspondence list;
In S100, the place name list is converted to produce the standard place names, and the place name-address correspondence list is used to count the occurrences of the standard place names.
S110: preprocess the place name list and the place name-address correspondence list to generate a standard place name table;
In S110, preprocessing includes removing brackets, converting between traditional and simplified Chinese, converting between full-width and half-width characters, and/or converting Chinese numerals to Arabic numerals. Entries in the place name list may contain brackets, such as "Beijing University of Aeronautics and Astronautics (southwest gate)" and "Tsinghua University (west gate)". Bracketed text is usually an annotation, and counting such entries directly would make the result too small, so the brackets must be removed together with their contents. The place name list and the place name-address correspondence list then undergo traditional/simplified conversion, full-width/half-width conversion and Chinese-to-Arabic numeral conversion. After these four preprocessing steps in total, the standard place name table is generated.
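The preprocessing in S110 could be sketched roughly as follows. This is an illustration under stated assumptions, not the patent's implementation: the function name `normalize_place_name` and the digit table are hypothetical, full-width/half-width conversion is approximated with Unicode NFKC normalization, the numeral step handles single Chinese digits only, and traditional/simplified conversion is left out because it needs a character mapping table.

```python
import re
import unicodedata

# Hypothetical digit table: covers single Chinese digits only, not
# positional numerals such as 十二 (12).
CHINESE_DIGITS = str.maketrans("〇一二三四五六七八九", "0123456789")

# One bracketed annotation, using ASCII brackets (after NFKC folding).
BRACKETED = re.compile(r"\([^()]*\)")


def normalize_place_name(name: str) -> str:
    """Return a simplified 'standard' form of a raw place name."""
    # Full-width to half-width: NFKC folds full-width letters, digits
    # and brackets into their ASCII forms.
    name = unicodedata.normalize("NFKC", name)
    # Remove brackets together with their contents.
    name = BRACKETED.sub("", name)
    # Chinese digits to Arabic digits (simplified, see note above).
    name = name.translate(CHINESE_DIGITS)
    return name.strip()
```

Folding full-width characters first means a single ASCII bracket pattern covers both bracket styles.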
S120: for each place name in the standard place name table, count how often that place name occurs in the place name-address correspondence list;
In S120, the occurrence frequency includes the number of occurrences in the place name column, the number of occurrences in the address column, or the number of rows. Whether a place name appears in the place name column or in the address column, each appearance represents one citation, comparable to an inbound link in PageRank (Google's ranking algorithm, a method Google uses to express the rank or importance of web pages). The number of times a place name appears in addresses or titles directly reflects the reference value of a POI. For example, given "** Restaurant" and "** North Gate", the more often the entity place name "**" is cited, the more often it serves as a landmark, and so it has a certain "authority". This is similar to PageRank in web search. The difference is that PageRank counts mentions by people (web pages), whereas the LinkRank here counts mentions by other POIs. The number of rows in which a place name appears is by itself enough to determine the ordering by frequency. In this embodiment, exact matching is used.
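A minimal sketch of the row counting in S120 follows. The function name `count_citations`, the two-column row layout and the sample rows are assumptions made for illustration. "Exact matching" is interpreted here as requiring the complete standard name to appear inside the name or address text, which is one reading of the description.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def count_citations(
    standard_names: Iterable[str],
    rows: List[Tuple[str, str]],
) -> Counter:
    """Count, per standard name, the rows whose name or address cites it."""
    counts: Counter = Counter()
    for name in standard_names:
        for poi_name, address in rows:
            # One citation per row, whichever column holds the name.
            if name in poi_name or name in address:
                counts[name] += 1
    return counts


# Invented sample rows: (place name column, address column).
rows = [
    ("Tsinghua University", "Haidian District"),
    ("Tsinghua University West Gate", "Haidian District"),
    ("Campus Cafe", "inside Tsinghua University"),
    ("Corner Shop", "Chaoyang District"),
]
counts = count_citations(["Tsinghua University", "Corner Shop"], rows)
```

The nested loop is quadratic in the data size; a real offline job would likely use an index or a multi-pattern matcher, which is outside what the description specifies.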
S130: standardize the counted place name frequencies and convert them into corresponding scores;
In S130, standardizing the counted place name frequencies into scores means converting discrete integer values that span a long interval into a short interval that relevance computation can use, such as 0~1, 0~10 or 0~100. Within this short interval, the size of the score represents the size of the frequency. Common conversion methods include linear functions and log functions, and a suitable class of conversion function can be chosen according to the frequencies and the relevance score interval.
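The two conversion families named in S130 can be illustrated as below. The function names, the 0~100 target interval and the use of `log1p` (so that a count of zero maps to zero) are assumptions of this sketch; the description does not fix a particular formula.

```python
import math


def linear_score(value: int, max_value: int, top: float = 100.0) -> float:
    """Map a count linearly into the interval [0, top]."""
    if max_value <= 0:
        return 0.0
    return top * value / max_value


def log_score(value: int, max_value: int, top: float = 100.0) -> float:
    """Map a count logarithmically into the interval [0, top]."""
    if max_value <= 0:
        return 0.0
    # log1p(x) = log(1 + x), so a zero count yields a zero score.
    return top * math.log1p(value) / math.log1p(max_value)
```

A log mapping spreads out small counts and compresses very large ones, which may suit data where a few place names dominate.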
S140: sort by score to generate a place name Rank table of (place name, score) pairs.
In S140, the search engine's offline indexing program can build the place name Rank table into the index, and the score part is used for online relevance computation. The place name Rank table contains place names and their corresponding scores. A place name with a high score is one that occurs frequently, which indicates higher importance. Calculating map data importance from the frequency with which place names occur improves the accuracy of map search. In this embodiment, to avoid interference between cities, so that place name data in one city is not affected by data of the same name in another city, the counting can be confined to a closed data subset for a given city. In addition, since a place name with more search engine results is usually more important, a quality threshold can further be imposed on the query results, for example that the ranking score must not be too low. By ranking on statistics gathered inside the POI database, the invention can accurately reflect the importance of map data and improve the ranking relevance of map search. For example, in the (place name, number of results) statistics of a certain closed POI database, the first two entries are nationally known research institutes and the last two are municipal-level research institutions or companies. The statistics are as follows:
Institute of Automation, Chinese Academy of Sciences    13
Institute of Software, Chinese Academy of Sciences      4
Beijing Aura Technical Institute                        1
Beijing Institute of Control Engineering                1
Similarly:
Peking University Third Hospital    24 [Grade 3A hospital]
Beijing Haidian Hospital            7 [Grade 2A hospital]
As can be seen, the relative size of the counts reflects the differing importance of the data.
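The Rank table generation of S140, with counting confined to one city at a time, might look like the sketch below. The function name `build_rank_tables`, the (city, place name, count) tuple layout, the log mapping and the rounding are assumptions. The counts reuse the figures from the example above, and placing them all under "Beijing" is itself an assumption made for the illustration.

```python
import math
from collections import defaultdict
from typing import Dict, List, Tuple


def build_rank_tables(
    counts: List[Tuple[str, str, int]], top: float = 100.0
) -> Dict[str, List[Tuple[str, float]]]:
    """Build one (place name, score) Rank table per city, highest first."""
    by_city: Dict[str, List[Tuple[str, int]]] = defaultdict(list)
    for city, name, count in counts:
        by_city[city].append((name, count))

    tables: Dict[str, List[Tuple[str, float]]] = {}
    for city, items in by_city.items():
        # Scores are scaled within the city only, so a same-named place
        # in another city cannot influence them.
        max_count = max(count for _, count in items)
        denom = math.log1p(max_count) if max_count > 0 else 1.0
        scored = [
            (name, round(top * math.log1p(count) / denom, 2))
            for name, count in items
        ]
        scored.sort(key=lambda pair: pair[1], reverse=True)
        tables[city] = scored
    return tables


tables = build_rank_tables([
    ("Beijing", "Institute of Software, Chinese Academy of Sciences", 4),
    ("Beijing", "Institute of Automation, Chinese Academy of Sciences", 13),
    ("Beijing", "Beijing Institute of Control Engineering", 1),
    ("Beijing", "Beijing Aura Technical Institute", 1),
])
```

Because Python's sort is stable, entries with equal scores keep their input order.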
Fig. 2 is a flowchart of the method for calculating map data importance according to the second embodiment of the invention. The method of the second embodiment comprises the following steps:
S200: obtain the place name list from the map POI database;
S210: construct query strings from the place name list, and convert the query strings into a standardized format;
In S210, because a program that accesses a search engine must meet certain requirements on the format of the query string, punctuation such as single quotation marks and TAB characters in the query string must be converted to spaces.
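The query string conversion in S210 can be sketched as follows. The function name `normalize_query` is hypothetical, only the two characters named in the description (single quotation mark and TAB) are replaced, and collapsing runs of spaces afterwards is an extra step assumed here for tidiness.

```python
import re

# Characters the description says must become spaces.
_SEPARATORS = re.compile(r"['\t]")
# Runs of two or more spaces (collapsing them is an assumption).
_SPACES = re.compile(r" {2,}")


def normalize_query(place_name: str) -> str:
    """Turn a place name into a query string free of quotes and tabs."""
    query = _SEPARATORS.sub(" ", place_name)
    return _SPACES.sub(" ", query).strip()
```

A real search engine may impose further format rules; those would extend the `_SEPARATORS` pattern.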
S220: count the number of results that each standardized query string returns in the web search engine;
In step S220, the result pages returned for each standardized query string in the web search engine can also be collected.
S230: filter out abnormal result counts;
In S230, some query strings have a broad search scope, for example "restaurant". Such strings return a large number of related web search results, which affects the quality of the search results. Even for query strings whose meaning is fairly definite, the results returned by the search engine can be of low quality or even irrelevant. Abnormal result counts and result pages therefore need to be filtered: queries with too many results, or pages whose PageRank score is too low, are removed to improve the quality of the search results. In this embodiment, the standard score used for relevance can be corrected by computing PageRank for only the first few pages of results. In addition, the result count and result page PageRank calculation can be limited along two dimensions, city and category, with different limits for different cities. For example, based on actual conditions, a culture-and-education place name in Beijing with more than 100,000 results is normal, whereas one in Lhasa with more than 100,000 results is abnormal.
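A possible shape for the result count filter in S230, keyed by city and category, is sketched below. The record type, the category label `culture_education`, the default ceiling and the Beijing ceiling are hypothetical values chosen for illustration; only the Lhasa figure of 100,000 follows the example in the description. Filtering result pages by PageRank is not shown.

```python
from typing import Dict, List, NamedTuple, Tuple


class ResultCount(NamedTuple):
    name: str
    city: str
    category: str
    results: int


# Hypothetical ceilings per (city, category); above the ceiling a
# result count is treated as abnormal.
MAX_RESULTS: Dict[Tuple[str, str], int] = {
    ("Beijing", "culture_education"): 5_000_000,  # assumed value
    ("Lhasa", "culture_education"): 100_000,      # from the example
}
DEFAULT_MAX = 1_000_000  # assumed fallback


def filter_abnormal(records: List[ResultCount]) -> List[ResultCount]:
    """Keep only records whose result count is within the ceiling."""
    kept = []
    for record in records:
        limit = MAX_RESULTS.get((record.city, record.category), DEFAULT_MAX)
        if record.results <= limit:
            kept.append(record)
    return kept
```

Keeping the ceilings in a lookup table lets each city and category be tuned separately without changing the filter logic.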
S240: standardize the filtered result counts and convert them into corresponding scores;
In S240, because the total number of results varies over a wide range, it is not convenient to sort on it directly. Standardizing the counts into scores means converting discrete integer values that span a long interval into a short interval that relevance computation can use, such as 0~1, 0~10 or 0~100. Common conversion methods include linear functions and log functions, and a suitable class of conversion function can be chosen according to the result counts and the relevance score interval.
S250: sort by the converted scores to generate the place name Rank table.
In S250, the search engine's offline indexing program can build the place name Rank table into the index, and the score part is used for online relevance computation. The place name Rank table contains place names and their corresponding scores. A place name with a high score is one with more results, which indicates higher importance. Calculating map data importance from the number of query results a place name returns in a web search engine can accurately reflect the importance of place name data and supports relevance ranking in map search. For example, the (place name, number of results) statistics from a certain search engine for a well-known university versus an adult education college are as follows:
Tsinghua University        1,000,000,000
Beijing City University    2,700,000
The former's result count corresponds to a higher importance value, while the latter's importance score is lower.
In an embodiment of the invention, result count filtering and standardization can be limited along two dimensions, city and category. For example, based on experience, a culture-and-education place name in Beijing with more than 100,000 results is normal, whereas one in Lhasa with more than 100,000 results is abnormal.
In another embodiment of the invention, the ranking by statistics inside the POI database from the first embodiment and the ranking by search engine query result counts from the second embodiment can be used in combination, depending on the application, to improve the ranking relevance of map search.
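The description does not say how the two rankings are combined. One simple possibility, shown purely as an assumption, is a weighted sum of the two scores once both are on the same scale; the function name `combined_score` and the default weight are hypothetical.

```python
def combined_score(
    frequency_score: float, result_score: float, weight: float = 0.5
) -> float:
    """Blend the POI-internal score and the search-engine score.

    `weight` is the share given to the POI-internal frequency score;
    a weighted sum is an assumed scheme, not one stated in the source.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return weight * frequency_score + (1.0 - weight) * result_score
```

An application that trusts the POI database more could raise `weight`, while one that relies on web popularity could lower it.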
Fig. 3 is a structural diagram of the device for calculating map data importance according to the first embodiment of the invention. The device of the first embodiment comprises a data acquisition module, a data conversion module, a data statistics module, a standardization processing module and a result generation module, wherein:
The data acquisition module obtains two data sets from the map POI (short for "Point of Interest") database: a place name list and a place name-address correspondence list. The place name list is converted to produce the standard place names, and the place name-address correspondence list is used to count the occurrences of the standard place names.
The data conversion module preprocesses the place name list and the place name-address correspondence list and generates the standard place name table. The preprocessing performed by the data conversion module includes removing brackets, converting between traditional and simplified Chinese, converting between full-width and half-width characters, and/or converting Chinese numerals to Arabic numerals. Entries in the place name list may contain brackets, such as "Beijing University of Aeronautics and Astronautics (southwest gate)" and "Tsinghua University (west gate)". Bracketed text is usually an annotation, and counting such entries directly would make the result too small, so the brackets must be removed together with their contents. The place name list and the place name-address correspondence list also undergo traditional/simplified conversion, full-width/half-width conversion and Chinese-to-Arabic numeral conversion. After these four preprocessing steps in total, the standard place name table is generated.
The data statistics module counts, for each place name in the standard place name table, how often that place name occurs in the place name-address correspondence list. The occurrence frequency includes the number of occurrences in the place name column, the number of occurrences in the address column, or the number of rows. Whether a place name appears in the place name column or in the address column, each appearance represents one citation, comparable to an inbound link in PageRank (Google's ranking algorithm, a method Google uses to express the rank or importance of web pages). The number of times a place name appears in addresses or titles directly reflects the reference value of a POI. For example, given "** Restaurant" and "** North Gate", the more often the entity place name "**" is cited, the more often it serves as a landmark, and so it has a certain "authority". This is similar to PageRank in web search. The difference is that PageRank counts mentions by people (web pages), whereas the LinkRank here counts mentions by other POIs. The number of rows in which a place name appears is by itself enough to determine the ordering by frequency. In this embodiment, exact matching is used.
The standardization processing module standardizes the counted place name frequencies and converts them into corresponding scores. This means converting discrete integer values that span a long interval into a short interval that relevance computation can use, such as 0~1, 0~10 or 0~100. Within this short interval, the size of the score represents the size of the frequency. Common conversion methods include linear functions and log functions, and a suitable class of conversion function can be chosen according to the frequencies and the relevance score interval.
The result generation module sorts by score to generate a place name Rank table of (place name, score) pairs. The search engine's offline indexing program can build the place name Rank table into the index, and the score part is used for online relevance computation. The place name Rank table contains place names and their corresponding scores. A place name with a high score is one that occurs frequently, which indicates higher importance. Calculating map data importance from the frequency with which place names occur improves the accuracy of map search. In this embodiment, to avoid interference between cities, so that place name data in one city is not affected by data of the same name in another city, the counting can be confined to a closed data subset for a given city. In addition, since a place name with more search engine results is usually more important, a quality threshold can further be imposed on the query results, for example that the ranking score must not be too low. By ranking on statistics gathered inside the POI database, the invention can accurately reflect the importance of place name data and improve the ranking relevance of map search. For example, in the (place name, number of results) statistics of a certain closed POI database, the first two entries are nationally known research institutes and the last two are municipal-level research institutions or companies. The statistics are as follows:
Institute of Automation, Chinese Academy of Sciences    13
Institute of Software, Chinese Academy of Sciences      4
Beijing Aura Technical Institute                        1
Beijing Institute of Control Engineering                1
Similarly:
Peking University Third Hospital    24 [Grade 3A hospital]
Beijing Haidian Hospital            7 [Grade 2A hospital]
As can be seen, the relative size of the counts reflects the differing importance of the data.
Fig. 4 is a structural diagram of the device for calculating map data importance according to the second embodiment of the invention. The device of the second embodiment comprises a data acquisition module, a format conversion module, a data statistics module, a data filtering module, a standardization processing module and a result generation module, wherein:
The data acquisition module obtains the place name list from the map POI database;
The format conversion module constructs query strings from the place name list and converts the query strings into a standardized format. Because a program that accesses a search engine must meet certain requirements on the format of the query string, punctuation such as single quotation marks and TAB characters in the query string must be converted to spaces.
The data statistics module counts, one by one, the number of results that each standardized query string returns in the web search engine;
The data filtering module filters abnormal result counts and/or result pages. Some query strings have a broad search scope, for example "restaurant". Such strings return a large number of related web search results, which affects the quality of the search results. Even for query strings whose meaning is fairly definite, the results returned by the search engine can be of low quality or even irrelevant. Abnormal result counts and result pages therefore need to be filtered: queries with too many results, or pages whose PageRank score is too low, are removed to improve the quality of the search results. In this embodiment, the standard score used for relevance can be corrected by computing PageRank for only the first few pages of results. In addition, the result count and result page PageRank calculation can be limited along two dimensions, city and category, with different limits for different cities. For example, based on actual conditions, a culture-and-education place name in Beijing with more than 100,000 results is normal, whereas one in Lhasa with more than 100,000 results is abnormal.
The standardization processing module standardizes the filtered result counts and converts them into corresponding scores. This means converting discrete integer values that span a long interval into a short interval that relevance computation can use, such as 0~1, 0~10 or 0~100. Common conversion methods include linear functions and log functions, and a suitable class of conversion function can be chosen according to the result counts and the relevance score interval.
The result generation module sorts by the converted scores to generate the place name Rank (ranking) table, which the search engine's indexing program builds into the index for online relevance computation. The place name Rank table contains place names and their corresponding scores. A place name with a high score is one with more results, which indicates higher importance. Calculating map data importance from the number of query results a place name returns in a web search engine can accurately reflect the importance of place name data and supports relevance ranking in map search. For example, the (place name, number of results) statistics from a certain search engine for a well-known university versus an adult education college are as follows:
Tsinghua University        1,000,000,000
Beijing City University    2,700,000
The former's result count corresponds to a higher importance value, while the latter's importance score is lower.
In another embodiment of the invention, the ranking by statistics inside the POI database from the first embodiment and the ranking by search engine query result counts from the second embodiment can be used in combination, depending on the application, to improve the ranking relevance of map search.
The method and device for calculating map data importance of the invention rank by counting how often map data occurs inside the POI database and by the number of query results from a search engine, filter abnormal ranking data, and then generate a place name Rank table ordered by ranking score. The search engine's indexing program builds the place name Rank table into the index for online relevance computation, which improves the ranking relevance of map search and the coverage and accuracy of map data importance. In addition, the two ranking approaches of the method and device can be used in combination, depending on the application, to improve the ranking relevance of map search.
The foregoing describes only preferred embodiments of the invention and does not limit the invention. Any modification, equivalent replacement or improvement made within the spirit and principles of the invention shall fall within the protection scope of the invention.

Claims (13)

1. A map data importance calculation method, comprising:
obtaining data from a map point of interest (POI) database, wherein the obtained data comprise a place name list;
counting the frequency with which each place name in the place name list occurs and/or the number of results it returns in a web search engine;
converting the frequency with which the place name occurs and/or the number of results it returns in the web search engine into a corresponding score value, and ranking importance according to the score value.
2. The map data importance calculation method according to claim 1, characterized in that obtaining data from the map POI database further comprises obtaining a place name-address correspondence list, and, before the step of counting the frequency with which place names in the place name list occur, the method comprises: preprocessing the place name list and the place name-address correspondence list to generate a standard place name table.
3. The map data importance calculation method according to claim 2, characterized in that the preprocessing comprises cleaning the brackets contained in the place name list, and performing conversion between traditional and simplified Chinese, conversion between full-width and half-width characters, and/or conversion of Chinese numerals to Arabic numerals on the place name list and the place name-address correspondence list.
4. The map data importance calculation method according to claim 2 or 3, characterized in that the step of counting the frequency with which place names in the place name list occur comprises: for each place name in the standard place name table, counting the frequency with which that place name occurs in the place name-address correspondence list.
5. The map data importance calculation method according to claim 1, characterized in that counting the number of results that place names in the place name list return in the web search engine comprises: constructing a query string from the place name list, converting the query string into a standardized format, and counting the number of results the standardized query string returns in the web search engine.
6. The map data importance calculation method according to claim 1, characterized in that, after the step of counting the number of results that place names in the place name list return in the web search engine, the method further comprises: filtering abnormal result counts.
7. The map data importance calculation method according to claim 1 or 6, characterized in that the step of converting the frequency with which the place name occurs or the number of results it returns in the web search engine into a corresponding score value comprises: converting the counted place name frequency and the number of results returned in the web search engine into a short-interval value usable for relevance computation.
8. The map data importance calculation method according to claim 1, characterized in that, after the step of converting the frequency with which the place name occurs or the number of results it returns in the web search engine into a corresponding score value, the method further comprises: generating a place name ranking table sorted by the converted score values, and building the place name ranking table into the index program of the search engine.
9. A map data importance calculation device, characterized by comprising a data acquisition module, a data statistics module, and a standardization processing module, wherein the data acquisition module is configured to obtain data from a map point of interest (POI) database, the obtained data comprising a place name list; the data statistics module is configured to count the frequency with which each place name in the place name list occurs and/or the number of results it returns in a web search engine; and the standardization processing module is configured to convert the frequency with which the place name occurs and/or the number of results it returns in the web search engine into a corresponding score value and to rank importance according to the score value.
10. The map data importance calculation device according to claim 9, characterized in that the data obtained by the data acquisition module further comprise a place name-address correspondence list; the map data importance calculation device further comprises a data conversion module configured to preprocess the place name list and the place name-address correspondence list to generate a standard place name table; and the data statistics module counts, for each place name in the standard place name table, the frequency with which it occurs in the place name-address correspondence list.
11. The map data importance calculation device according to claim 10, characterized in that the preprocessing performed by the data conversion module comprises cleaning the brackets contained in the place name list, and performing conversion between traditional and simplified Chinese, conversion between full-width and half-width characters, and/or conversion of Chinese numerals to Arabic numerals on the place name list and the place name-address correspondence list.
12. The map data importance calculation device according to claim 9, characterized by further comprising a format conversion module and a data filtering module, wherein the format conversion module is configured to construct a query string from the place name list and convert the query string into a standardized format, and the data filtering module is configured to filter abnormal result counts.
13. The map data importance calculation device according to claim 9, characterized by further comprising a result generation module, wherein the result generation module is configured to generate a place name ranking table sorted by score value and to build the place name ranking table into the index program of the search engine.
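The preprocessing recited in claims 3 and 11 might be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function name `normalize_place_name` is hypothetical, "cleaning brackets" is assumed to mean removing the bracketed text together with its brackets, Chinese numerals are converted digit by digit only (positional forms such as 十 are not handled), and conversion between traditional and simplified Chinese is omitted because it needs a mapping table or third-party library.

```python
import re
import unicodedata

# Bracketed segments such as "(East Gate)" or "【...】". Full-width parentheses
# are already folded to ASCII by NFKC before this pattern is applied.
_BRACKETED = re.compile(r"[(\[【][^)\]】]*[)\]】]")

# Digit-by-digit mapping from Chinese numerals to Arabic numerals (assumption:
# a blanket mapping is acceptable; it would also rewrite numerals that are
# part of an ordinary word).
_CN_DIGITS = str.maketrans("零〇一二三四五六七八九", "00123456789")


def normalize_place_name(name):
    """Return a normalized form of a raw place name string."""
    # NFKC folds full-width letters, digits, and punctuation to half-width.
    text = unicodedata.normalize("NFKC", name)
    # Assumed reading of "cleaning brackets": drop brackets and their content.
    text = _BRACKETED.sub("", text)
    # Convert Chinese digit characters to Arabic digits.
    text = text.translate(_CN_DIGITS)
    return text.strip()


print(normalize_place_name("ＡＢＣ大厦（东门）"))
print(normalize_place_name("第三中学"))
```

Normalizing names this way lets variants of the same place name collapse to one entry before frequencies are counted.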
CN201210266470.3A 2012-07-30 2012-07-30 Map data importance calculation method and device Active CN103577442B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210266470.3A CN103577442B (en) 2012-07-30 2012-07-30 Map data importance calculation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210266470.3A CN103577442B (en) 2012-07-30 2012-07-30 Map data importance calculation method and device

Publications (2)

Publication Number Publication Date
CN103577442A true CN103577442A (en) 2014-02-12
CN103577442B CN103577442B (en) 2019-02-05

Family

ID=50049247

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210266470.3A Active CN103577442B (en) 2012-07-30 2012-07-30 Map data importance calculation method and device

Country Status (1)

Country Link
CN (1) CN103577442B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462289A (en) * 2014-11-27 2015-03-25 百度在线网络技术(北京)有限公司 Direct number keyword recommending method and device
CN104462533A (en) * 2014-12-23 2015-03-25 北京奇虎科技有限公司 Method and system for judging electronic map display based on query style
CN104899200A (en) * 2014-03-04 2015-09-09 高德软件有限公司 POI search feedback method and device
CN105222803A (en) * 2015-10-20 2016-01-06 北京百度网讯科技有限公司 Map POI display packing and terminal
CN105550330A (en) * 2015-12-21 2016-05-04 北京奇虎科技有限公司 Point of interest (POI) information sorting method and system
CN105574259A (en) * 2015-12-14 2016-05-11 华南理工大学 Internet word frequency-based city cognitive map generation method
CN105608112A (en) * 2015-12-10 2016-05-25 北京奇虎科技有限公司 Method and apparatus for measuring quality of map POI data
CN103823900B (en) * 2014-03-17 2017-07-21 北京百度网讯科技有限公司 Information point importance determines method and apparatus
CN109408819A (en) * 2018-10-16 2019-03-01 武大吉奥信息技术有限公司 A kind of core place name extracting method and device based on natural language processing technique
CN110019645A (en) * 2017-09-28 2019-07-16 北京搜狗科技发展有限公司 Index base construction method, searching method and device
CN116109117A (en) * 2023-04-14 2023-05-12 北京科技大学 Method and medium for evaluating importance of data stream of item

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101350154A (en) * 2008-09-16 2009-01-21 北京搜狗科技发展有限公司 Method and apparatus for ordering electronic map data
US20100070165A1 (en) * 2006-11-29 2010-03-18 Kang Jung Min System and method for providing point of interest in destination around
CN102541936A (en) * 2010-12-31 2012-07-04 高德软件有限公司 Method and device for acquiring popularity of POI (Point of Interest)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100070165A1 (en) * 2006-11-29 2010-03-18 Kang Jung Min System and method for providing point of interest in destination around
CN101350154A (en) * 2008-09-16 2009-01-21 北京搜狗科技发展有限公司 Method and apparatus for ordering electronic map data
CN102541936A (en) * 2010-12-31 2012-07-04 高德软件有限公司 Method and device for acquiring popularity of POI (Point of Interest)

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104899200A (en) * 2014-03-04 2015-09-09 高德软件有限公司 POI search feedback method and device
CN103823900B (en) * 2014-03-17 2017-07-21 北京百度网讯科技有限公司 Information point importance determines method and apparatus
CN104462289A (en) * 2014-11-27 2015-03-25 百度在线网络技术(北京)有限公司 Direct number keyword recommending method and device
CN104462289B (en) * 2014-11-27 2018-11-20 百度在线网络技术(北京)有限公司 The recommended method and device of through number keyword
CN104462533A (en) * 2014-12-23 2015-03-25 北京奇虎科技有限公司 Method and system for judging electronic map display based on query style
CN104462533B (en) * 2014-12-23 2018-12-07 北京奇虎科技有限公司 A kind of method and system judging that electronic map is shown based on query inquiry pattern
CN105222803A (en) * 2015-10-20 2016-01-06 北京百度网讯科技有限公司 Map POI display packing and terminal
WO2017067211A1 (en) * 2015-10-20 2017-04-27 北京百度网讯科技有限公司 Map poi display method and terminal
CN105608112A (en) * 2015-12-10 2016-05-25 北京奇虎科技有限公司 Method and apparatus for measuring quality of map POI data
CN105574259A (en) * 2015-12-14 2016-05-11 华南理工大学 Internet word frequency-based city cognitive map generation method
WO2017101277A1 (en) * 2015-12-14 2017-06-22 华南理工大学 City cognitive map generating method based on internet word frequency
CN105574259B (en) * 2015-12-14 2017-06-20 华南理工大学 A kind of Urban cognition ground drawing generating method based on internet word frequency
CN105550330A (en) * 2015-12-21 2016-05-04 北京奇虎科技有限公司 Point of interest (POI) information sorting method and system
CN105550330B (en) * 2015-12-21 2020-09-11 北京奇虎科技有限公司 Method and system for ordering POI (Point of interest) information
CN110019645A (en) * 2017-09-28 2019-07-16 北京搜狗科技发展有限公司 Index base construction method, searching method and device
CN110019645B (en) * 2017-09-28 2022-04-19 北京搜狗科技发展有限公司 Index library construction method, search method and device
CN109408819A (en) * 2018-10-16 2019-03-01 武大吉奥信息技术有限公司 A kind of core place name extracting method and device based on natural language processing technique
CN109408819B (en) * 2018-10-16 2023-05-16 吉奥时空信息技术股份有限公司 Core place name extraction method and device based on natural language processing technology
CN116109117A (en) * 2023-04-14 2023-05-12 北京科技大学 Method and medium for evaluating importance of data stream of item
CN116109117B (en) * 2023-04-14 2024-05-24 北京科技大学 Method and medium for evaluating importance of data stream

Also Published As

Publication number Publication date
CN103577442B (en) 2019-02-05

Similar Documents

Publication Publication Date Title
CN103577442A (en) Method and device for calculating map data importance
Hong et al. Hierarchical community detection and functional area identification with OSM roads and complex graph theory
CN101350012B (en) Method and system for matching address
US20150356088A1 (en) Tile-based geocoder
US7444343B2 (en) Hybrid location and keyword index
US9442905B1 (en) Detecting neighborhoods from geocoded web documents
WO2016150407A1 (en) Address resolution data-based construction land type rapid identification method
CN102163214B (en) Numerical map generation device and method thereof
JP2021009720A (en) Information search device and information search system
EP2836928B1 (en) Full text search using r-trees
CN109933797A (en) Geocoding and system based on Jieba participle and address dictionary
Zhu et al. A similarity-based automatic data recommendation approach for geographic models
Carrion et al. From historical documents to GIS: A spatial database for medieval fiscal data in Southern Italy
CN104199938A (en) RSS-based agricultural land information sending method and system
Zook et al. Cyberspatial proximity metrics: Reconceptualizing distance in the global urban system
US20130031458A1 (en) Hyperlocal content determination
Honarparvar et al. Improvement of a location-aware recommender system using volunteered geographic information
Wang et al. Spatial-temporal characteristics and causes of changes to the county-level administrative toponyms cultural landscape in the eastern plains of China
CN110906942A (en) POI point reminding navigation method, system, storage medium and equipment
CN101567150A (en) Method for accurately positioning digital map
Xiang Region2vec: An Approach for Urban Land Use Detection by Fusing Multiple Features
CN103049442A (en) Method and device for identifying abbreviation-full name conversion of mobile phone network retrieval words
Ballatore et al. A holistic semantic similarity measure for viewports in interactive maps
David et al. Smart geocoding of objects
CN101769752A (en) Search method of intersection

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20211009

Address after: 35th Floor, Tencent Building, No. 1 High-tech Zone, Nanshan District, Shenzhen, Guangdong Province 518000

Patentee after: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd.

Patentee after: TENCENT CLOUD COMPUTING (BEIJING) Co.,Ltd.

Address before: Room 403, East Block 2, SEG Science and Technology Park, Zhenxing Road, Futian District, Shenzhen, Guangdong 518044

Patentee before: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd.