CN113673240A - Method and storage medium for inferring textual geographic location from spatial entity relationships - Google Patents

Method and storage medium for inferring textual geographic location from spatial entity relationships Download PDF

Info

Publication number
CN113673240A
CN113673240A CN202110869708.0A CN202110869708A CN113673240A CN 113673240 A CN113673240 A CN 113673240A CN 202110869708 A CN202110869708 A CN 202110869708A CN 113673240 A CN113673240 A CN 113673240A
Authority
CN
China
Prior art keywords
place name
text
place
character string
dictionary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110869708.0A
Other languages
Chinese (zh)
Other versions
CN113673240B (en
Inventor
曾壮
陈仁谣
程旭阳
李圣文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Geosciences
Original Assignee
China University of Geosciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Geosciences filed Critical China University of Geosciences
Priority to CN202110869708.0A priority Critical patent/CN113673240B/en
Publication of CN113673240A publication Critical patent/CN113673240A/en
Application granted granted Critical
Publication of CN113673240B publication Critical patent/CN113673240B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/216Parsing using statistical methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Data Mining & Analysis (AREA)
  • Character Discrimination (AREA)

Abstract

The invention provides a method for deducing text geographic position by integrating spatial entity relationship, which comprises the following steps: preparing a target dictionary and a combined dictionary; completing the weight factor marking of the place name in the dictionary by utilizing different attributes of the place name; putting forward a StringMerging (SM) algorithm to complete the extraction of candidate place names in the target text; filtering the noise place name by using the vector representation corresponding to the place name; providing a place name weight calculation formula, and converting the weight factor of the place name in the target text into the relative weight of the place name in the target text; and multiplying the obtained relative weight by the coordinates of the place name and accumulating to obtain the implicit geographic coordinates of the target text. The technical route provided by the invention can effectively solve the problems that the existing method for acquiring the hidden geographic position of the text is difficult to acquire external knowledge, the granularity of the predicted hidden geographic position is large and the like, and can effectively predict the longitude and latitude coordinates of the target text under the unsupervised condition.

Description

Method and storage medium for inferring textual geographic location from spatial entity relationships
Technical Field
The invention relates to the technical field, in particular to a method and a storage medium for inferring the geographical position of a text by synthesizing spatial entity relations.
Background
Studies have shown that a large part of all data generated today is unstructured data, and that about 60% of the data (text data and other data) can be considered geospatially referenced data. However, since the text directly containing the spatial position information only occupies a small part of the total text quantity, how to accurately extract the spatial position information implied in the remaining text has a very important research value. Existing methods for obtaining the implicit geographic location of the text have problems, such as: difficulty in obtaining external knowledge, large granularity of predicted implied geographic locations, etc.
Disclosure of Invention
The method solves the main problems that the conventional method for acquiring the hidden geographic position of the text has difficulty in acquiring external knowledge and large granularity of the predicted hidden geographic position.
According to one aspect of the invention, the invention provides a method for inferring the geographic location of text from the relationship of spatial entities, comprising:
acquiring a target dictionary by using a standard data set;
labeling a place name in the target dictionary with a weight factor;
performing text word segmentation on the target text by using a word segmentation tool to obtain a word segmentation text;
matching the word segmentation text with place names in the target dictionary by using a String Merging algorithm, further acquiring candidate place names corresponding to character strings in the word segmentation text, and acquiring a Merging dictionary consisting of a set of Merging character strings;
screening and denoising the candidate place names by using the merging dictionary;
converting the weight factor into the relative weight of each place name in the participle text by using a conversion formula, wherein the conversion formula is as follows:
Figure BDA0003188501210000021
wherein N is the number of place names, fnFor each place name corresponding weight factor, LCM (W) is the least common multiple of N place names corresponding weight factors, N refers to each of N place namesPlace name, P (n) is the value of the relative weight;
and multiplying and accumulating the relative weight and the longitude and latitude coordinates of the place name to obtain the geographic coordinates of the word segmentation text.
Further, the weighting factors include different attributes of place names in the target dictionary.
Further, the attributes include a word frequency of place names appearing in the target dictionary, a place name category, and a geographical area corresponding to the place names.
Further, the acquiring the Merging dictionary by using String Merging algorithm comprises:
and acquiring a set of all the merged character strings by using the String Merging algorithm as the merged dictionary.
Further, the target dictionary is an overcomplete dictionary containing all place names in the target text.
Further, matching the segmented text with place names in the target dictionary by using a String Merging algorithm, and further acquiring candidate place names corresponding to character strings in the segmented text comprises:
if the first character string in the word segmentation text is the same as the first place name in the target dictionary, storing the first character string and the first place name;
and if the length of the first character string is greater than 1 and the first character string is contained by the second place name, continuously judging whether the next character string still belongs to the first place name.
If the next character string does not belong to the second place name but the next character belongs to the second place name, merging the first character string, the next character string and the next character string;
if the next character string of the first character string belongs to the stop character, ending the character string combination;
if the two adjacent character strings of the first character string do not belong to the second place name, ending the character string combination;
and repeating the character string merging step, and taking all the obtained merged character strings as candidate character strings so as to obtain all the candidate place names.
Further, the screening and denoising the candidate place names comprises:
calculating the place name S in the acquired character string listaPlace name S corresponding to standard place name listbThe cosine similarity of the place name vectors between the two sets of place name vectors is as follows:
Figure BDA0003188501210000031
wherein S isaFor the place name in the list of character strings,
Figure BDA0003188501210000033
is SaCorresponding place name vector, SbFor another place name in the list of character strings,
Figure BDA0003188501210000034
is SbA corresponding place name vector;
after calculation, if the value of the cosine similarity Cos _ sim is not less than a threshold value K, the place name S is reservedaAnd said place name SbOtherwise, deleting the place name SaAnd said place name Sb
Further, the relative weight is multiplied by the latitude and longitude coordinates of the place name and accumulated to obtain the geographic coordinate of the participle text, and the calculation formula is as follows:
Figure BDA0003188501210000032
wherein P (X, Y) is the predicted longitude and latitude coordinate value of the target text, Pn(Xn,Yn) The longitude and latitude coordinates of the nth place name in the target text.
Furthermore, when the word segmentation tool is used for text word segmentation of the target text, a user-defined dictionary is added to improve the word segmentation accuracy.
According to another aspect of the present invention, the present invention further comprises a storage medium, wherein the storage medium is a computer-readable storage medium, and the computer-readable storage medium stores a method for inferring the geographic location of text from the relationship of spatial entities according to any one of claims 1 to 9.
Selecting a proper custom dictionary, namely a target dictionary, for a target text, segmenting words by using a word segmentation tool, and then merging character segments obtained after word segmentation by using an SM algorithm; filtering the noisy data using a vector representation of the place name; calculating to obtain the relative weight of the place name by using the weight factor of the place name and combining a weight calculation formula; and accumulating and multiplying the longitude and latitude coordinates of the target text by utilizing the longitude and latitude coordinates of the ground and combining the obtained relative weight of the place name.
In the method for inferring the geographic position of the text by integrating the spatial entity relationship, the place name weight factor labeling part acquires the weight factor corresponding to the place name by utilizing the attribute of the place name labeling part, and the weight factor labeling part is combined with the part for calculating the relative weight of the candidate place name in the target text, so that the relative weight of the candidate place name in the target text can be acquired in a simple, direct and effective mode.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description, serve to explain the principles of the invention.
Fig. 1 is a schematic diagram of the UFGLI model in an embodiment of the present invention.
Fig. 2 is a schematic flow chart of the SM algorithm in the embodiment of the present invention.
FIG. 3 is a diagram illustrating a method for inferring a geographic location of a text from a spatial entity relationship according to an embodiment of the present invention.
Detailed Description
Various exemplary embodiments of the present invention will be described in detail below with reference to the accompanying drawings. It should be noted that: the relative arrangement of the components and steps, the numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present invention unless specifically stated otherwise.
Meanwhile, it should be understood that the sizes of the respective portions shown in the drawings are not drawn in an actual proportional relationship for the convenience of description.
The following description of at least one exemplary embodiment is merely illustrative in nature and is in no way intended to limit the invention, its application, or uses.
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to specific embodiments and the accompanying drawings.
Techniques, methods, and apparatus known to those of ordinary skill in the relevant art may not be discussed in detail but are intended to be part of the specification where appropriate.
In all examples shown and discussed herein, any particular value should be construed as merely illustrative, and not limiting. Thus, other examples of the exemplary embodiments may have different values.
It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, further discussion thereof is not required in subsequent figures.
In the first embodiment, as shown in fig. 1, which is a schematic diagram of a UFGLI model, the method for inferring a geographic location of a text from a comprehensive spatial entity relationship proposed in this embodiment includes: dictionary preparation, place name weight factor labeling, candidate place name extraction in the target text, noise place name filtering in the candidate place names, relative weight calculation of the candidate place names and target text implied geographic coordinate prediction.
The UFGLI is a model for obtaining implicit geographic coordinates of an input target text, and an input part of the UFGLI needs to be marked with a location name dictionary of weight factors and longitude and latitude coordinates in addition to the target text. The target text passes through: and the predicted longitude and latitude coordinates can be obtained by four parts of candidate place name extraction in the target text, noise place name filtering in the candidate place names, relative weight calculation of the candidate place names and target text position prediction.
Step one, dictionary preparation:
the present embodiment includes two types of dictionaries: a target dictionary and a merged dictionary. The target dictionary is an overcomplete dictionary containing all possible place names in the target text. The place names in the target dictionary and the latitude and longitude coordinates of the place names are derived from the standard data set. The merging dictionary is a set of all the merging strings automatically acquired by the SM algorithm and used for calculating the cosine similarity.
Step two, labeling a place name weight factor:
and labeling the place names in the target dictionary by a weighting factor, wherein each place name has a plurality of different attributes, so that each different attribute of the place name can be labeled once, and the place name in the target dictionary is: and quantifying attributes such as word frequency, place name category, geographical area corresponding to place names and the like appearing in the text set to serve as weight factors.
In the embodiment, different numbers of text sets are selected for counting word frequency, and the word frequency f of the place name in the target dictionary is obtained by using a small number of text statistics1Using mass text statistics to obtain the word frequency f of place names in the target dictionary2. The invention classifies the place names in the target dictionary, and each place name has a category, such as: roads, cells, schools, etc.; different place names in the dictionary are respectively endowed with different weight factors fc(ii) a The geographic area or the road length corresponding to the place name in the target dictionary is obtained, and the area or the length is set as the weight factor f corresponding to the place namea(ii) a The units of area and road length are further unified and taken as the weighting factor fa2
Step three, extracting candidate place names in the target text:
first, candidate place names contained in the target text are acquired. The method for extracting the candidate place name in the target text comprises the following steps: the target text participle and the candidate place name are matched with two parts.
In some embodiments, the word segmentation method is not fixed, and any word segmentation tool can be selected to perform text word segmentation, and an external dictionary (a custom dictionary) can be added to improve the word segmentation accuracy. The character string merging algorithm SM designed in this embodiment can match the participle text of the target text with the place names in the target dictionary, and further obtain candidate place names corresponding to the character strings in the target text.
As shown in fig. 2, it is a flow chart of the SM algorithm, and its design idea is as follows:
if the current character string a is the same as a place name A in the target dictionary, storing the current character string and the corresponding place name;
if the length of the current character string a is greater than 1 and the current character string a is contained by a place name B, continuously judging whether the next character string a2 still belongs to the place name B; if a2 does not belong to B, but a3 belongs to B, then a, a2, a3 are merged and step 2 is repeated;
if the two adjacent character strings of a do not belong to B or the adjacent character strings of a are stop symbols, ending the character string combination;
and taking the combined character string as a candidate character string and taking the candidate character string as a candidate place name.
Step four, filtering noise place names in the candidate place names:
the obtained candidate place name list in the target text may contain a large amount of noise data, and for this reason, the place names in the place name list need to be screened, firstly, the cosine similarity of the place name vectors between the place names in the character string list sl _ list extracted by the algorithm SM and the corresponding place names in the standard place name list Poi _ list is calculated:
let place name S in sl _ listbThe corresponding place name vector is
Figure BDA0003188501210000061
SbThe corresponding bert vector is
Figure BDA0003188501210000062
Then B isaAnd BbThe calculation formula of the cosine similarity Cos _ sim is as follows:
Figure BDA0003188501210000071
if the value of Cos _ sim is not less than the threshold value K, the current place name S is reservedbAnd SbOtherwise, deleting the current place name SaAnd Sb
Step five, calculating the relative weight of the candidate place name in the target text
In the second step, the weighting factor corresponding to each place name is already marked, but the weighting factor corresponding to the place name needs to be converted through a weighting calculation formula and is finally converted into the relative weight of each place name in the target text.
In some embodiments, various weight calculation formulas are designed, wherein the best effect is a method of normalization by using the least common multiple of each place name corresponding to a weight factor contained in the target text. That is, for N place names contained in each text, the weighting factor corresponding to each place name of the N place names is set as fnThe least common multiple of the weighting factors corresponding to the N place names is LCM (W), and the specific formula for calculating the relative weight P (N) of the nth place name by using the least common multiple is as follows:
Figure BDA0003188501210000072
step six, predicting the hidden geographic coordinates of the target text
For the N place names extracted from the target text according to the foregoing steps, the relative weight p (N) of each place name has been calculated using the weight calculation formula in step five, and position prediction is now performed using the longitude and latitude coordinates, which is the third attribute of the POI point. Namely the relative weight P (n) and the latitude and longitude coordinate P of each place name contained in the target textn(Xn,Yn) Multiplying and accumulating to obtain the final longitude and latitude coordinate P (X, Y) of the target text, wherein the specific formula is as follows:
Figure BDA0003188501210000073
in a second embodiment, as shown in fig. 3, a method for inferring a geographic location of a text from a spatial entity relationship includes the following steps:
after receiving a text signal of 'the grand square garden cell has the singing in the morning and evening every day and hopes the government to actively process', the word segmentation tool is adopted to segment words, and the word segmentation result is 'the grand square garden', 'the cell', 'every day', 'the morning and evening', 'the singing', 'the hope', 'the government', 'the initiative', 'the processing', 'and'. ";
after word segmentation, the SM algorithm mentioned in the previous embodiment is used to merge the character strings to obtain: the method comprises the following steps of [ 'Hongfang garden small area ] ], and place name matching is carried out to obtain a corresponding place name [' Hongfang garden green electric small area ] ];
after noise is filtered by utilizing the vector of the place name, the reserved character string is [ "macro square garden cell" ];
calculating relative weight according to the weight factor of the corresponding place name of the reserved character string, namely the weight factor of [ "macro Fangyuan green electricity cell" ] is 60, and the relative weight is 1;
multiplying the relative weight of the candidate place name and the corresponding longitude and latitude coordinates, wherein the relative weight of the 'macro square root cell' is 1, and the longitude and latitude coordinates are (114.354941, 30.586017);
and finally, obtaining the longitude and latitude coordinates of the target text (114.354941, 30.586017).
In the method for inferring the geographical position of the text by integrating the spatial entity relationship, the place name weighting factor labeling part acquires the weighting factor corresponding to the place name by utilizing the attribute of the place name labeling part, and the weighting factor labeling part is combined with the part for calculating the relative weight of the candidate place name in the target text, so that the relative weight of the candidate place name in the target text can be acquired in a simple, direct and effective mode, and the method for extracting the candidate place name in the target text can meet the requirement of unsupervised acquisition of the candidate place name contained in the target text.
The above description is only exemplary of the present invention and should not be taken as limiting the invention, as any modification, equivalent replacement, or improvement made within the spirit and scope of the present invention should be included in the present invention.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.

Claims (10)

1. A method for inferring textual geolocation through the synthesis of spatial entity relationships, comprising:
acquiring a target dictionary by using a standard data set;
labeling a place name in the target dictionary with a weight factor;
performing text word segmentation on the target text by using a word segmentation tool to obtain a word segmentation text;
matching the word segmentation text with place names in the target dictionary by using a String Merging algorithm, further acquiring candidate place names corresponding to character strings in the word segmentation text, and acquiring a Merging dictionary consisting of a set of Merging character strings;
screening and denoising the candidate place names by using the merging dictionary;
converting the weight factor into the relative weight of each place name in the participle text by using a conversion formula, wherein the conversion formula is as follows:
Figure FDA0003188501200000011
wherein N is the number of place names, fnFor each place name, LCM (W) is the least common multiple of the N place name weight factorsA number, N, denotes each of the N place names, p (N) being a value of relative weight;
and multiplying and accumulating the relative weight and the longitude and latitude coordinates of the place name to obtain the geographic coordinates of the word segmentation text.
2. The method of claim 1, wherein the weighting factors include different attributes of place names in the target dictionary.
3. The method as claimed in claim 2, wherein the attributes include word frequency of place names appearing in the target dictionary, place name category and geographical area corresponding to place names.
4. The method of claim 1, wherein the obtaining a merged dictionary using String Merging algorithm comprises:
and acquiring a set of all the merged character strings by using the String Merging algorithm as the merged dictionary.
5. The method for inferring geographic location of text from integrated spatial entity relationships of claim 1 wherein said target lexicon is an overcomplete lexicon containing all place names in said target text.
6. The method of claim 1, wherein matching the segmented text with place names in the target dictionary by using a String Merging algorithm to obtain candidate place names corresponding to character strings in the segmented text comprises:
if the first character string in the word segmentation text is the same as the first place name in the target dictionary, storing the first character string and the first place name;
if the length of the first character string is larger than 1 and the first character string is contained by the second place name, continuously judging whether the next character string still belongs to the first place name;
if the next character string does not belong to the second place name but the next character belongs to the second place name, merging the first character string, the next character string and the next character string;
if the next character string of the first character string belongs to the stop character, ending the character string combination;
if the two adjacent character strings of the first character string do not belong to the second place name, ending the character string combination;
and repeating the character string merging step, and taking all the obtained merged character strings as candidate character strings so as to obtain all the candidate place names.
7. The method of claim 1, wherein filtering and denoising the candidate place names comprises:
calculating the place name S in the acquired character string listaAnd the place name S corresponding to the standard place name listbThe cosine similarity of the place name vectors between the two sets of place name vectors is as follows:
Figure FDA0003188501200000021
wherein,
Figure FDA0003188501200000032
is SaCorresponding place name vector, SaFor the place name in the list of character strings,
Figure FDA0003188501200000033
is SbCorresponding place name vector, SbIs another place name in the character string list;
after calculation, if the value of the cosine similarity Cos _ sim is not less than the threshold valueK, then the place name S is reservedaAnd said place name SbOtherwise, deleting the place name SaAnd said place name Sb
8. The method of claim 1, wherein the relative weight is multiplied by latitude and longitude coordinates of place names and accumulated to obtain the geographic coordinates of the participled text according to the following formula:
Figure FDA0003188501200000031
wherein P (X, Y) is the predicted longitude and latitude coordinate value of the target text, Pn(Xn,Yn) The longitude and latitude coordinates of the nth place name in the target text.
9. The method as claimed in claim 1, wherein a self-defined dictionary is added to improve the accuracy of word segmentation when a word segmentation tool is used to perform text word segmentation on the target text.
10. A storage medium, wherein the storage medium is a computer-readable storage medium, and wherein the computer-readable storage medium stores a method for inferring the geographic location of text from the relationship of spatial entities according to any of claims 1-9.
CN202110869708.0A 2021-07-30 2021-07-30 Method and storage medium for inferring text geographic location from integrated spatial entity relationships Active CN113673240B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110869708.0A CN113673240B (en) 2021-07-30 2021-07-30 Method and storage medium for inferring text geographic location from integrated spatial entity relationships

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110869708.0A CN113673240B (en) 2021-07-30 2021-07-30 Method and storage medium for inferring text geographic location from integrated spatial entity relationships

Publications (2)

Publication Number Publication Date
CN113673240A true CN113673240A (en) 2021-11-19
CN113673240B CN113673240B (en) 2023-07-04

Family

ID=78540850

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110869708.0A Active CN113673240B (en) 2021-07-30 2021-07-30 Method and storage medium for inferring text geographic location from integrated spatial entity relationships

Country Status (1)

Country Link
CN (1) CN113673240B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110532546A (en) * 2019-07-29 2019-12-03 河北远东通信系统工程有限公司 A kind of automatic delivery method of alert merging geographical location and text similarity
US20210012103A1 (en) * 2019-07-09 2021-01-14 American International Group, Inc., Systems and methods for information extraction from text documents with spatial context
CN112836146A (en) * 2021-03-09 2021-05-25 威创集团股份有限公司 Geographic space coordinate information acquisition method and device based on network message

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210012103A1 (en) * 2019-07-09 2021-01-14 American International Group, Inc., Systems and methods for information extraction from text documents with spatial context
CN110532546A (en) * 2019-07-29 2019-12-03 河北远东通信系统工程有限公司 A kind of automatic delivery method of alert merging geographical location and text similarity
CN112836146A (en) * 2021-03-09 2021-05-25 威创集团股份有限公司 Geographic space coordinate information acquisition method and device based on network message

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
何小波等: "《规则匹配和深度学习结合的文本空间信息识别及定位》", 《地理信息世界》, vol. 27, no. 5 *
宋鹏飞: "《大气污染专题语料库构建与语料空间化方法研究》", 《中国优秀硕士学位论文全文库》 *
张剑;叶远智;翁宝凤;: "面向智慧城市的高精度地名地址匹配方法", 测绘与空间地理信息, no. 11 *
郭丹: "《自然语言空间信息标注及识别》", 《中国优秀硕士学位论文全文库》 *

Also Published As

Publication number Publication date
CN113673240B (en) 2023-07-04

Similar Documents

Publication Publication Date Title
CN111783419B (en) Address similarity calculation method, device, equipment and storage medium
CN110245981B (en) Crowd type identification method based on mobile phone signaling data
WO2023168781A1 (en) Soil cadmium risk prediction method based on spatial-temporal interaction relationship
Tadjudin et al. Robust parameter estimation for mixture model
Jeon et al. Partially supervised classification using weighted unsupervised clustering
CN109165273B (en) General Chinese address matching method facing big data environment
Naboureh et al. A review of land use/land cover change mapping in the China-Central Asia-West Asia economic corridor countries
CN115934990B (en) Remote sensing image recommendation method based on content understanding
Zhang et al. Semantic classification of high-resolution remote-sensing images based on mid-level features
Xu et al. A supervoxel approach to the segmentation of individual trees from LiDAR point clouds
CN107577702B (en) Method for distinguishing traffic information in social media
CN112163101B (en) Geographic entity matching and fusing method for spatial knowledge graph
CN113223042A (en) Intelligent acquisition method and equipment for remote sensing image deep learning sample
CN108763192B (en) Entity relation extraction method and device for text processing
CN111680506A (en) External key mapping method and device of database table, electronic equipment and storage medium
CN106844733B (en) Image retrieval method based on combination of vocabulary tree information fusion and Hausdorff distance
CN105701227B (en) A kind of across media method for measuring similarity and search method based on local association figure
CN110046595A (en) A kind of intensive method for detecting human face multiple dimensioned based on tandem type
CN109582743A (en) A kind of data digging method for the attack of terrorism
CN113095265A (en) Fungal target detection method based on feature fusion and attention
CN113673240A (en) Method and storage medium for inferring textual geographic location from spatial entity relationships
CN116975738A (en) Polynomial naive Bayesian classification method for question intent recognition
CN116186513A (en) Vibration signal identification method based on one-dimensional convolutional neural network
Deng et al. Conceptualization of place via spatial clustering and co-occurrence analysis
CN105260982B (en) Method for analyzing image based on sparse and dense reconstruct

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant