CN103577442B - A kind of map datum importance calculation method and device - Google Patents
A kind of map datum importance calculation method and device Download PDFInfo
- Publication number
- CN103577442B CN103577442B CN201210266470.3A CN201210266470A CN103577442B CN 103577442 B CN103577442 B CN 103577442B CN 201210266470 A CN201210266470 A CN 201210266470A CN 103577442 B CN103577442 B CN 103577442B
- Authority
- CN
- China
- Prior art keywords
- place name
- data
- list
- map datum
- score value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/29—Geographical information databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9537—Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Remote Sensing (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention belongs to Internet technical field more particularly to a kind of map datum importance calculation methods and device.Map datum importance calculation method of the present invention includes: to obtain data from map interest point data base, wherein the data of acquisition include place name list;The frequency that statistically place name occurs in list of file names and/or the number of results occurred in web page search engine;Corresponding score value is converted by the frequency of place name appearance and/or the number of results occurred in web page search engine, importance sorting is carried out according to score value.Map datum importance calculation method and device of the present invention are ranked up by the statistics interest point data base internal map data frequency of occurrences and by the query result number of search engine, and place name Rank table is generated according to ordering score size after being filtered to abnormal sorting data, the coverage rate and accuracy rate of map datum different degree are improved, and improves the correlation of the sequence of map search.
Description
Technical field
The invention belongs to Internet technical field more particularly to a kind of map datum importance calculation methods and device.
Background technique
The abbreviation of comprehensive POI(" Point of Interest ", point of interest) information be navigation map indispensable information,
Timely POI point of interest can remind the branch of user's road conditions and the detailed information of neighboring buildings, include place in each POI data
The cartographic informations such as title, classification, longitude and latitude, each place required for facilitating user to find.Currently, in map search
In, POI data would generally according to relevance ranking, when specifying when Perimeter or sub-category retrieval are done in some place, by
In no query word, can utilize with central point away from discrete data importance sorting.The different degree of POI data generally passes through offline
It calculates, it mainly includes two kinds that utilizable different degree, which calculates information: one, different numbers being manually assigned to according to administrative grade height
Value, such as the national level different numerical value corresponding with district grade in government bodies' class;Two, according to the quality of data source, to not
Same source is assigned to different score values, such as thematic data will be typically higher than the score value of crawl data with the score value for buying data.
The shortcomings that existing different degree calculation is: one, government bodies' class is a part of whole POI data,
Its category map data can not determine an accurate administrative grade, such as restaurant class, meanwhile, it is more under the same rank
Data different degree also cannot be distinguished, so determining that the method coverage rate of different degree is lower according to administrative grade height;Two, high
The data in quality source can also have wrong data, and quality height is two different concepts, quality data with data different degree
Not necessarily different degree is higher, and the height of different degree cannot be distinguished in the data of same source, so obtaining weight by quality height
Want that the calculation coverage rate of angle value is not high and accuracy is relatively low.
Summary of the invention
The present invention provides a kind of map datum importance calculation method and devices, it is intended to solve map number in the prior art
Problem not high according to the calculation coverage rate of different degree and that accuracy is low.
The invention is realized in this way a kind of map datum importance calculation method, comprising:
Data are obtained from map interest point data base, wherein obtaining data includes place name list;
The frequency that statistically place name occurs in list of file names and/or the number of results occurred in web page search engine;
Corresponding score value, root are converted by the frequency of place name appearance and/or the number of results occurred in web page search engine
Importance sorting is carried out according to score value.
Another technical solution that the present invention takes are as follows: a kind of map datum different degree computing device, including data acquisition mould
Block, data statistics module and standardization processing module, the data acquisition module from map interest point data base for obtaining
Data, wherein obtaining data includes place name list;The frequency that the data statistics module occurs for place name in statistically list of file names
Rate and/or the number of results occurred in web page search engine;The standardization processing module be used for frequency that place name is occurred with/
Or the number of results occurred in web page search engine is converted into corresponding score value, carries out importance sorting according to score value.
Technical solution of the present invention have the following advantages that or the utility model has the advantages that map datum importance calculation method of the present invention and
Device is ranked up by the statistics POI data library internal map data frequency of occurrences and by the query result number of search engine,
And place name Rank table is generated according to ordering score size after being filtered to abnormal sorting data, place name Rank table is built into and is searched
It indexes the concordance program held up to use for online correlation, improves the coverage rate and accuracy rate of map datum different degree, and mention
The high correlation of map search sequence.
Detailed description of the invention
Attached drawing 1 is the flow chart of the map datum importance calculation method of first embodiment of the invention;
Attached drawing 2 is the flow chart of the map datum importance calculation method of second embodiment of the invention;
Attached drawing 3 is the structural schematic diagram of the map datum different degree computing device of first embodiment of the invention;
Attached drawing 4 is the structural schematic diagram of the map datum different degree computing device of second embodiment of the invention.
Specific embodiment
In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to the accompanying drawings and embodiments, right
The present invention is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, and
It is not used in the restriction present invention.
Referring to Fig. 1, being the flow chart of the map datum importance calculation method of first embodiment of the invention.The present invention the
The map datum importance calculation method of one embodiment the following steps are included:
S100: from the abbreviation of map POI(" Point of Interest ", point of interest) two parts are obtained respectively in database
Data: ground list of file names and the corresponding relationship list of place name address;
In S100, ground list of file names is used for by generating standard place name after conversion, and the corresponding relationship list of place name address is used for
The frequency of occurrence of SS place name.
S110: ground list of file names and the corresponding relationship list of place name address are pre-processed, and generate standard gazetteer;
In S110, pretreatment includes cleaning bracket, carries out complicated and simple conversion, the conversion of full half-angle and/or Chinese figure conversion
The processing such as Arabic numerals.Ground list of file names may contain bracket, such as " BJ University of Aeronautics & Astronautics (southwestern door) ", " Tsinghua University
(west gate) ", bracket are often annotated content, if it is less than normal directly to go statistics to will cause result, are needed bracket together with inner
The content in face is all removed;Ground list of file names and the corresponding relationship list of place name address are using complicated and simple conversion, the conversion of full half-angle, Chinese number
Word conversion Arabic numerals etc. after totally four preprocessing process, generate the gazetteer of standard.
S120: according to the place name in standard gazetteer, count what the place name occurred in the corresponding relationship list of place name address
Frequency;
In S120, the frequency of occurrences includes the number occurred in ground ranks, the number occurred in address column or line number,
No matter the place name ranks on ground or occurs in address column, all shows primary reference, is equivalent to PageRank(Google ranking
A part of algorithm (ranking formula) is a kind of method of grade/importance that Google is used to be used to presentation web page)
It is directed toward, the number that place name occurs in address or title can directly reflect the reference value of a POI, such as the dining room * *, * *
North gate, the number for quoting entity place name * * is more, show the place name be used as terrestrial reference direction chance it is more, have certain " power
Prestige ", this is similar with Webpage search PageRank, the difference is that PageRank is referred to by people's (webpage), here
LinkRank is referred to by other POI;Wherein, the line number only occurred by place name is it may determine that the size of the frequency of occurrences is suitable
Sequence;In embodiments of the present invention, matching way is using exact matching.
S130: the place name frequency of occurrences value of statistics is subjected to standardization processing and is converted into corresponding score value;
In S130, the place name frequency of occurrences value of statistics is subjected to standardization processing and is converted into corresponding score value are as follows: will
Long section, dispersion integer value be converted into the short interval value that correlation can be used, such as 0 ~ 1 or 0~10 or 0~100, at this
In short interval value, score value size is to represent the size of frequency values, and common conversion method has: linear function or log function etc., can
According to frequency and a kind of suitable transfer function of relevance score interval selection.
S140: the place name sequence Rank table of (place name, score value) is generated according to the sequence of score value size.
In S140, place name Rank table can be built into index by the offline concordance program of search engine, and score value part supplies
Line correlation uses.It include the information such as place name and its corresponding score value, the high place name, that is, frequency of occurrences of score value in place name Rank table
It is higher, it indicates that its different degree is higher, the different degree of map datum is calculated by the frequency of occurrences of place name, improves map search
Accuracy rate.In embodiments of the present invention, in order to avoid interfering with each other between different cities, the geographical name data in a city will not
It is influenced by the data of the same name in another city, can be limited in the closed data subset in some city and be counted;Separately
Outside, since the more usual different degree of the number of results arrived by search engine inquiry can be higher, inquiry knot can further be limited
Quality threshold in fruit, as ordering score cannot be too low.The present invention is sorted by the internal statistical in POI data library, can be accurate
The significance level for reflecting map datum, improves the sequence correlation of map search, such as: in certain closing POI data library statistics
(place name, number of results), first two are national well-known research institutes, and latter two are prefecture-level research institution or company, statistical result
It is as follows:
Institute of Automation Research of CAS 13
The Institute of Software, Chinese Academy of Science 4
Beijing aura technical research institute 1
Beijing Control Engineering Inst. 1
Similarly:
The Third Affiliated Hospital of Peking University 24 [front three]
Beijing Haidian hospital 7 [diformazan]
It can be seen that the relative size of score value number embodies the different different degrees of data.
Referring to Fig. 2, being the flow chart of the map datum importance calculation method of second embodiment of the invention.The present invention the
The map datum importance calculation methods of two embodiments the following steps are included:
S200: ground list of file names is obtained from map POI data library;
S210: standardization format conversion is carried out by place name lists construction query string, and by query string;
In S210, when due to routine access search engine, the format of query string there are certain requirements, need query string
In the punctuation marks such as single quotation marks, TAB be converted into space.
S220: the number of results that every standardization query string of statistics occurs in web page search engine;
In step S220, the result page that every standardization query string occurs in web page search engine can also be counted.
S230: processing is filtered to abnormal number of results;
In S230, since the query string search range that has is than broad, such as " restaurant ", it is this kind of to have more correlation
Webpage searching result influences the quality of search result;Even if the relatively determining query string of meaning, the result that search engine provides
There can be of low quality or even incoherent situation, so needing to be filtered abnormal number of results and result page, remove knot
The too many inquiry of fruit number or the too low page of PageRank score value, to improve the quality of search result.In embodiment of the present invention
In, standard score workable for correlation can be corrected by only calculating former pages of result PageRank.In addition, number of results
With result page PageRank calculating section, two different dimensions in city and classification, such as root can also be limited according to different cities
According to realistic situation, it is normal, and the culture and education class of Lhasa that the culture and education class place name number of results of Beijing, which is higher than 100,000,
It is abnormal that name number of results, which is higher than 100,000,.
S240: corresponding score value is converted by filter result number progress standardization processing;
In S240, greatly due to result sum variation space, it is not easy to directly be ranked up, statistical result number is advised
Generalized processing is converted into corresponding score value are as follows: the integer value in long section, dispersion is converted into the short area that correlation can be used
Between, such as 0 ~ 1,0~10 or 0~100 etc., common conversion method has: linear function or log function etc., can be according to number of results
With a kind of suitable transfer function of relevance score interval selection.
S250: place name Rank table is generated according to the score value size sequence after conversion.
In S250, place name Rank table can be built into index by the offline concordance program of search engine, and score value part supplies
Line correlation uses.It include the information such as place name and its corresponding score value in place name Rank table, the high place name, that is, number of results of score value is got over
Height indicates that its different degree is higher, and the query result number by statistics place name in web page search engine calculates the important of map datum
Degree, can accurately reflect the significance level of geographical name data, convenient for the relevance ranking of map search, for example, in certain search engine
(place name, the number of results) of statistics, well-known universities and colleges vs. College of Adult Education, statistical result are as follows:
Tsinghua University 1,000,000,000
Beijing City University 2,700,000
The former number of results can correspond to a relatively high importance value, and the different degree score value of the latter is then relatively low.
In an embodiment of the present invention, when carrying out number of results filtering and standardization processing, city, classification two can be limited
Different dimensions, such as rule of thumb, it is normal, and the culture of Lhasa that the culture and education class place name number of results of Beijing, which is higher than 100,000,
It is abnormal that educational place name number of results, which is higher than 100,000,.
It in an alternative embodiment of the invention, can also be by the internal statistical sequence and the in POI data library in first embodiment
The query result number sequence of search engine is combined use according to different applications in two embodiments, improves the row of map search
Sequence correlation.
Referring to Fig. 3, being the structural schematic diagram of the device of the map datum different degree calculating of first embodiment of the invention.This
The device that the map datum different degree of invention first embodiment calculates includes data acquisition module, data conversion module, data system
Count module, standardization processing module and result-generation module, wherein
Data acquisition module is used for from the abbreviation of map POI(" Point of Interest ", point of interest) it obtains in database
Take two parts of data: ground list of file names and the corresponding relationship list of place name address;Ground list of file names is used for by generating standard place name after conversion,
The corresponding relationship list of place name address is used for the frequency of occurrence of SS place name.
Data conversion module is used for ground list of file names and the corresponding relationship list of place name address according to pre-processing, and generates mark
Quasi- gazetteer;Wherein, the pretreatment of data conversion module include cleaning bracket, carry out it is complicated and simple conversion, full half-angle conversion and/or in
The processing such as literary number conversion Arabic numerals.Ground list of file names may contain bracket, such as " BJ University of Aeronautics & Astronautics (southwestern door) ",
" Tsinghua University (west gate) ", bracket is often annotated content, if it is less than normal directly to go statistics to will cause result, is needed including
It number is all removed together with the content of the inside;Ground list of file names and the corresponding relationship list of place name address are also needed by complicated and simple conversion, complete half
Angle conversion, Chinese figure conversion Arabic numerals etc. after totally four preprocessing process, generate the gazetteer of standard.
Data statistics module is used to count the place name according to the place name in standard gazetteer and arrange in place name address corresponding relationship
The frequency occurred in table;Wherein, the frequency of occurrences includes the number occurred in ground ranks, the number or row that occur in address column
Number, no matter the place name ranks on ground or occurs in address column, all shows primary reference, is equivalent to PageRank(Google
A part of ranking algorithm (ranking formula) is a kind of side of grade/importance that Google is used to be used to presentation web page
Method) direction, the number that place name occurs in address or title can directly reflect the reference value of a POI, such as " * * "
Dining room, the north gate " * * ", the number for quoting the entity place name " * * " is more, shows that the place name is used as the chance that terrestrial reference is directed toward and gets over
It is more, have certain " authority ", it is similar with Webpage search PageRank, the difference is that PageRank is referred to by people's (webpage),
Here LinkRank is referred to by other POI;Wherein, only by place name occur line number it may determine that the frequency of occurrences it is big
Small sequence;In embodiments of the present invention, matching way is using exact matching.
Standardization processing module is used to convert the place name frequency of occurrences value progress standardization processing of statistics to corresponding
Score value;Wherein, the place name frequency of occurrences value of statistics is carried out standardization processing and is converted into corresponding point by standardization processing module
Value are as follows: the integer value in long section, dispersion is converted into the short interval value that correlation can be used, such as 0 ~ 1 or 0~10 or 0~100
Deng in the short interval value, score value size is to represent the sizes of frequency values, and common conversion method has: linear function or log letter
Number etc., can be according to frequency and a kind of suitable transfer function of relevance score interval selection.
Result-generation module is used to generate the place name sequence Rank table of (place name, score value) according to the sequence of score value size.It searches
Place name Rank table can be built into index by indexing the offline concordance program held up, and score value part is used for online correlation.Wherein, ground
It include the information such as place name and its corresponding score value in name Rank table, the high place name, that is, frequency of occurrences of score value is higher, indicates that its is important
Degree is higher, and the different degree of map datum is calculated by the frequency of occurrences of place name, improves the accuracy rate of map search.In the present invention
In embodiment, in order to avoid interfering with each other between different cities, the geographical name data in a city will not be by another city
The influence of data of the same name can be limited in the closed data subset in some city and be counted;In addition, due to by searching
Index holds up the more usual different degree of the number of results inquired can be higher, can further limit the quality threshold in query result
Value, as ordering score cannot be too low.The present invention is sorted by the internal statistical in POI data library, can accurately reflect geographical name data
Significance level, improve the sequence correlation of map search, such as: certain closing POI data library statistics (place name, as a result
Number), first two are national well-known research institutes, and latter two are prefecture-level research institution or company, and statistical result is as follows:
Institute of Automation Research of CAS 13
The Institute of Software, Chinese Academy of Science 4
Beijing aura technical research institute 1
Beijing Control Engineering Inst. 1
Similarly:
The Third Affiliated Hospital of Peking University 24 [front three]
Beijing Haidian hospital 7 [diformazan]
It can be seen that the relative size of score value number embodies the different different degrees of data.
Referring to Fig. 4, being the structural schematic diagram of the device of the map datum different degree calculating of second embodiment of the invention.This
The device that the map datum different degree of invention second embodiment calculates includes data acquisition module, format converting module, data system
Count module, data filtering module, standardization processing module and result-generation module, wherein
Data acquisition module is used to obtain ground list of file names from map POI data library;
Format converting module is used for through place name lists construction query string, and query string is carried out standardization format conversion;
When wherein, due to routine access search engine, the format of query string there are certain requirements, need in query string single quotation marks,
The punctuation marks such as TAB are converted into space.
Data statistics module is used to count the number of results that every standardization character string occurs in web page search engine one by one;
Data filtering module is used to be filtered processing to abnormal number of results or/and result page;Wherein, since some is looked into
String search range is ask than broad, such as " restaurant ", it is this kind of to have more related web page search result, influence search result
Quality;Even if the relatively determining query string of meaning, the result that search engine provides can also exist of low quality or even incoherent
Situation removes the too many inquiry of number of results or PageRank points so needing to be filtered abnormal number of results and result page
It is worth the too low page, to improve the quality of search result.It in embodiments of the present invention, can be by only calculating former pages of knot
Fruit PageRank corrects standard score workable for correlation.In addition, number of results and result page PageRank calculating section, also
Two different dimensions in city and classification can be limited according to different cities, such as according to realistic situation, the culture and education of Beijing
It is normal that class place name number of results, which is higher than 100,000, and it is abnormal that the culture and education class place name number of results of Lhasa, which is higher than 100,000,.
Standardization processing module is used to convert corresponding score value for filter result number progress standardization processing.Wherein,
Corresponding score value is converted by statistical result number progress standardization processing are as follows: the integer value of long section, dispersion is converted into phase
The short section that closing property can be used, such as 0 ~ 1,0~10 or 0~100 etc., common conversion method has: linear function or log letter
Number etc., can be according to number of results and a kind of suitable transfer function of relevance score interval selection.
Result-generation module is used to generate place name Rank(sequence according to the score value size sequence after conversion) table, by place name
Rank table is built into the concordance program of search engine, uses for online correlation;Wherein, in place name Rank table include place name and its
The information such as corresponding score value, the high place name, that is, number of results of score value is higher, indicates that its different degree is higher, by statistics place name in webpage
The query result number of search engine calculates the different degree of map datum, can accurately reflect the significance level of geographical name data, be convenient for
The relevance ranking of map search, for example, well-known universities and colleges vs. is at teaching in (place name, the number of results) of certain search engine statistics
Institute, statistical result are as follows:
Tsinghua University 1,000,000,000
Beijing City University 2,700,000
The former number of results can correspond to a relatively high importance value, and the different degree score value of the latter is then relatively low.
It in an alternative embodiment of the invention, can also be by the internal statistical sequence and the in POI data library in first embodiment
The query result number sequence of search engine is combined use according to different applications in two embodiments, improves the row of map search
Sequence correlation.
There is frequency by statistics POI data library internal map data in map datum importance calculation method and device of the present invention
Rate is ranked up and is ranked up by the query result number of search engine, and is filtered rear basis to abnormal sorting data
Ordering score size generates place name Rank table, and the concordance program that place name Rank table is built into search engine is made for online correlation
With improving the sequence correlation of map search, and improve the coverage rate and accuracy rate of map datum different degree;In addition, of the invention
Map datum importance calculation method and device are combined uses according to different applications, improve the sequence correlation of map search
Property.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention
Made any modifications, equivalent replacements, and improvements etc., should all be included in the protection scope of the present invention within mind and principle.
Claims (10)
1. a kind of map datum importance calculation method, comprising:
Data are obtained from map interest point data base, wherein the data of acquisition include the corresponding pass of place name list and place name address
Series of tables;
Ground list of file names, the corresponding relationship list of place name address are pre-processed, standard gazetteer is generated;
Count the frequency that the place name occurs in the corresponding relationship list of place name address according to the place name in standard gazetteer, it is described go out
Existing frequency include: rank number, the number occurred in address column or the line number of middle appearance, no matter the place name ranks also on ground
It is to occur in address column, all shows primary reference;
The frequency translation that place name is occurred is corresponding score value, carries out importance sorting according to score value.
2. map datum importance calculation method according to claim 1, which is characterized in that the pretreatment includes cleaning
The bracket that contains in ground list of file names and over the ground list of file names, the corresponding relationship list of place name address carry out complicated and simple conversion, full half-angle turns
It changes and/or Chinese figure converts Arabic numerals.
3. map datum importance calculation method according to claim 1, which is characterized in that further include:
The statistically number of results that place name occurs in web page search engine in list of file names;
It specifically includes: by place name lists construction query string, query string being converted into standardization format, statistical specifications query string
The number of results occurred in web page search engine.
4. map datum importance calculation method according to claim 1, which is characterized in that from map interest point data
In library after acquisition data, further includes:
The statistically number of results that place name occurs in web page search engine in list of file names;
The number of results that place name is occurred in web page search engine is converted into corresponding score value, carries out different degree row according to score value
Sequence;
Wherein, after the number of results step that place name occurs in web page search engine in the statistically list of file names further include: to different
Normal number of results is filtered processing.
5. map datum importance calculation method according to claim 1 or 4, which is characterized in that described place name occur
Frequency translation be corresponding score value step include: that the place name frequency of occurrences of statistics is converted into the short area that correlation can be used
Between be worth.
6. map datum importance calculation method according to claim 1, which is characterized in that the frequency for place name occur
After rate is converted into corresponding score value step further include: generate place name sequencing table according to the score value size sequence after conversion, and by ground
Name sequencing table is built into the concordance program of search engine.
7. a kind of map datum different degree computing device, which is characterized in that including data acquisition module, data conversion module, number
Module and standardization processing module according to statistics, the data acquisition module are used to obtain data from map interest point data base,
Wherein, obtaining data includes place name list and the corresponding relationship list of place name address;The data conversion module is for ranking ground
Table, the corresponding relationship list of place name address are pre-processed, and standard gazetteer is generated;The data statistics module is used for according to standard
Place name in gazetteer counts the frequency that the place name occurs in the corresponding relationship list of place name address, and the frequency of occurrences includes:
Ground ranks number, the number occurred in address column or the line number of middle appearance, and no matter the place name ranks on ground or in address column
Occur, all shows primary reference;It is corresponding score value that the standardization processing module, which is used for the frequency translation for place name occur,
Importance sorting is carried out according to score value.
8. map datum different degree computing device according to claim 7, which is characterized in that the data conversion module into
Capable pretreatment includes the bracket contained in cleaning ground list of file names and list of file names, the corresponding relationship list of place name address carry out over the ground
Complicated and simple conversion, the conversion of full half-angle and/or Chinese figure convert Arabic numerals.
9. map datum different degree computing device according to claim 7, which is characterized in that the data statistics module is also
It can be used for the number of results that place name occurs in web page search engine in statistically list of file names, further include format converting module sum number
According to filtering module, the format converting module is used for through place name lists construction query string, and query string is converted to standardization
Format;The data filtering module is used to be filtered processing to abnormal number of results.
10. map datum different degree computing device according to claim 7, which is characterized in that further include that result generates mould
Block, the result-generation module, which is used to be sorted according to score value size, generates place name sequencing table, and place name sequencing table is built into search
The concordance program of engine.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210266470.3A CN103577442B (en) | 2012-07-30 | 2012-07-30 | A kind of map datum importance calculation method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210266470.3A CN103577442B (en) | 2012-07-30 | 2012-07-30 | A kind of map datum importance calculation method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103577442A CN103577442A (en) | 2014-02-12 |
CN103577442B true CN103577442B (en) | 2019-02-05 |
Family
ID=50049247
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210266470.3A Active CN103577442B (en) | 2012-07-30 | 2012-07-30 | A kind of map datum importance calculation method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103577442B (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104899200A (en) * | 2014-03-04 | 2015-09-09 | 高德软件有限公司 | POI search feedback method and device |
CN103823900B (en) * | 2014-03-17 | 2017-07-21 | 北京百度网讯科技有限公司 | Information point importance determines method and apparatus |
CN104462289B (en) * | 2014-11-27 | 2018-11-20 | 百度在线网络技术(北京)有限公司 | The recommended method and device of through number keyword |
CN104462533B (en) * | 2014-12-23 | 2018-12-07 | 北京奇虎科技有限公司 | A kind of method and system judging that electronic map is shown based on query inquiry pattern |
CN105222803A (en) * | 2015-10-20 | 2016-01-06 | 北京百度网讯科技有限公司 | Map POI display packing and terminal |
CN105608112A (en) * | 2015-12-10 | 2016-05-25 | 北京奇虎科技有限公司 | Method and apparatus for measuring quality of map POI data |
CN105574259B (en) * | 2015-12-14 | 2017-06-20 | 华南理工大学 | A kind of Urban cognition ground drawing generating method based on internet word frequency |
CN105550330B (en) * | 2015-12-21 | 2020-09-11 | 北京奇虎科技有限公司 | Method and system for ordering POI (Point of interest) information |
CN110019645B (en) * | 2017-09-28 | 2022-04-19 | 北京搜狗科技发展有限公司 | Index library construction method, search method and device |
CN109408819B (en) * | 2018-10-16 | 2023-05-16 | 吉奥时空信息技术股份有限公司 | Core place name extraction method and device based on natural language processing technology |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101350154A (en) * | 2008-09-16 | 2009-01-21 | 北京搜狗科技发展有限公司 | Method and apparatus for ordering electronic map data |
CN102541936A (en) * | 2010-12-31 | 2012-07-04 | 高德软件有限公司 | Method and device for acquiring popularity of POI (Point of Interest) |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20080048786A (en) * | 2006-11-29 | 2008-06-03 | 팅크웨어(주) | System and method for providing point of interest in destination around |
-
2012
- 2012-07-30 CN CN201210266470.3A patent/CN103577442B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101350154A (en) * | 2008-09-16 | 2009-01-21 | 北京搜狗科技发展有限公司 | Method and apparatus for ordering electronic map data |
CN102541936A (en) * | 2010-12-31 | 2012-07-04 | 高德软件有限公司 | Method and device for acquiring popularity of POI (Point of Interest) |
Also Published As
Publication number | Publication date |
---|---|
CN103577442A (en) | 2014-02-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103577442B (en) | A kind of map datum importance calculation method and device | |
JP7182585B2 (en) | program | |
CA2640365C (en) | Geographic coding for location search queries | |
EP2631814B1 (en) | Method for mapping text phrases to geographical locations | |
US20150356088A1 (en) | Tile-based geocoder | |
US20070198495A1 (en) | Geographic coding for location search queries | |
CN103605752A (en) | Address matching method based on semantic recognition | |
JP2022532451A (en) | How to disambiguate Chinese place name meanings based on encyclopedia knowledge base and word embedding | |
Huang et al. | A natural-language-based visual query approach of uncertain human trajectories | |
EP2783308B1 (en) | Full text search based on interwoven string tokens | |
CN102385597B (en) | The fault-tolerant searching method of a kind of POI | |
CN101567150A (en) | Method for accurately positioning digital map | |
Laddha et al. | Semantic tourism information retrieval interface | |
David et al. | Smart geocoding of objects | |
Thenmozhi et al. | A framework for tourist recommendation system exploiting geo-tagged photos | |
Venkateswaran et al. | Exploring and visualizing differences in geographic and linguistic web coverage | |
Varriale et al. | VTIS: a volunteered travelers information system | |
Xu et al. | Exploring regional variation in spatial language using spatially stratified web-sampled route direction documents | |
CN104537042B (en) | Method and system for determining whether electronic map is displayed or not based on query item | |
Li et al. | Automatic construction and visualization of address models | |
Wang et al. | Construction of Scenic Spots Knowledge Map under the Integration of Culture and Tourism | |
Meng et al. | Three fuzzy concepts and their implications for cartography | |
Wu et al. | Distribution Characteristics and Image Perception Differences of Urban and Rural Tourist Attractions: A Case of Beijing | |
Zhang | Route extraction, road name disambiguation and efficient spatial query processing under location constraints | |
Gonzalez | Problems that arise when providing geographic coordinate information for cataloged maps |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20211009 Address after: 518000 Tencent Building, No. 1 High-tech Zone, Nanshan District, Shenzhen City, Guangdong Province, 35 Floors Patentee after: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd. Patentee after: TENCENT CLOUD COMPUTING (BEIJING) Co.,Ltd. Address before: 2, 518044, East 403 room, SEG science and Technology Park, Zhenxing Road, Shenzhen, Guangdong, Futian District Patentee before: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd. |
|
TR01 | Transfer of patent right |