JP2008090334A

JP2008090334A - Location analyzer, location analyzing method, its program, and recording medium

Info

Publication number: JP2008090334A
Application number: JP2006266834A
Authority: JP
Inventors: Toru Hirano; 徹平野; Yoshihiro Matsuo; 義博松尾
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2006-09-29
Filing date: 2006-09-29
Publication date: 2008-04-17
Anticipated expiration: 2026-09-29
Also published as: JP4510792B2

Abstract

PROBLEM TO BE SOLVED: To automatically select and output only one location even when two or more locations correspond to an input address expression.

SOLUTION: A location candidate extraction unit 31 searches a database 1, in which either address information alone or the name of a facility located at a given address together with its address information is registered in each record, using the input address expression as a key, and extracts the address information of the records containing that address expression as location candidates. A fame score calculation unit 32 calculates the geographical distance between the address information of each extracted location candidate and the address information of each record in the database 1, and takes the number of records whose geographical distance is at or below a fixed distance as the fame score of that location candidate. The location candidate with the highest fame score is determined to be the location.

COPYRIGHT: (C)2008,JPO&INPIT

Description

The present invention relates to a technique for obtaining, from an address expression consisting of part of an address or the name of a facility located at a given address, a location, that is, information representing the place corresponding to that address expression (for example, address information or latitude/longitude information).

Conventionally, as a system for obtaining the location corresponding to an address expression, there has been a system that searches a database, in which either address information alone or the name of a facility located at a given address together with its address information is registered in each record, using the input address expression as a key, and outputs as the location the address information of the records containing that address expression (see Patent Document 1).
Patent Document 1: JP 2001-134579 A

Many address expressions, however, have two or more corresponding locations. For example, the address expression "Nihonbashi" (日本橋), which is part of an address, corresponds to locations in both Tokyo and Osaka, and the address expression "Suehirocho" (末広町) corresponds to as many as 126 locations throughout Japan.

In such cases, the conventional system described above merely lists all corresponding locations as candidates, after which the user must choose among them on their own judgment. Particularly when there are many candidates, finding the correct location, that is, the intended location, is difficult or time-consuming.

An object of the present invention is to make it possible to automatically select and output only one location even when two or more locations correspond to the input address expression.

To solve this problem, the present invention is characterized by: referring to a database in which either address information alone or the name of a facility located at a given address together with its address information is registered in each record; outputting, as location candidates, the address information of the records containing the input address expression; outputting, for each location candidate, the number of records containing address information that is identical or geographically close to the candidate's address information as a fame score; and determining, among the output location candidates, the one with the highest fame score to be the location.

Alternatively, to solve this problem, the present invention is characterized by: referring to a database in which either address information alone or the name of a facility located at a given address together with its address information is registered in each record; outputting, as location candidates, the address information of the records containing the input address expression; outputting, for each location candidate, the n-th smallest of the geographical distances between the candidate's address information and the address information of each record in the database as a fame score; and determining, among the output location candidates, the one with the lowest fame score to be the location.
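The second variant, in which the fame score is the n-th smallest distance and the lowest score wins, can be sketched as follows. This is only an illustration under stated assumptions: the function names, the candidate labels "A" and "B", and all coordinates are hypothetical, and the plain Euclidean distance on latitude/longitude values is the one given later in the description.

```python
import math


def nth_nearest_distance(candidate, records, n):
    """Return the n-th smallest distance (n is 1-based) from candidate to the records."""
    distances = sorted(
        math.hypot(candidate[0] - lat, candidate[1] - lon) for lat, lon in records
    )
    return distances[n - 1]


def pick_location(candidates, records, n):
    """Pick the candidate whose n-th nearest record is closest (lowest score wins)."""
    return min(
        candidates,
        key=lambda name: nth_nearest_distance(candidates[name], records, n),
    )


# Hypothetical data: "A" sits in a dense cluster of records, "B" is isolated.
records = [(0.1, 0.0), (0.0, 0.2), (0.3, 0.0), (10.0, 10.5), (20.0, 20.0)]
candidates = {"A": (0.0, 0.0), "B": (10.0, 10.0)}
chosen = pick_location(candidates, records, n=2)
```

A small n makes the score sensitive to a handful of nearby records, while a larger n rewards candidates surrounded by many records; the source does not state which value of n is intended.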

According to the present invention, among the location candidates containing the input address expression, the candidate for which the database contains many records with identical or geographically close address information is determined to be the location, or alternatively the candidate whose n-th smallest geographical distance to the address information of the records in the database is smallest is determined to be the location. As a result, even when two or more locations correspond to the input address expression, only one of them can be automatically selected and output.

Here, the fame score is either the number of records containing not only the candidate's own address information but also geographically close address information, or the n-th smallest of the geographical distances between the candidate's address information and the address information of each record in the database. Compared with the related prior application (Japanese Patent Application No. 2006-137660, unpublished), this allows information that was excluded from the fame score because of differences in administrative division to be reflected, yielding a more accurate measure of fame.

For example, the area around Mitaka Station in Tokyo lies on the boundary between Mitaka City and Musashino City, and the opposite side of a road often belongs to a different administrative division. In such a case, under the prior application, information on the opposite side of the road was not reflected in the fame score because it belonged to a different administrative division, even though it was geographically close. The present invention uses not only the candidate's address information but also geographically close address information, or uses the geographical distance between the candidate's address information and the address information of each record in the database, so a fame score that does not depend on administrative divisions can be obtained.

Embodiments of the present invention are described below with reference to the drawings.

<Embodiment 1>
FIG. 1 shows an example of an embodiment of the location analyzer of the present invention. In the figure, 1 is a database, 2 is an address expression storage unit, 3 is a location candidate generation unit, 4 is a location candidate storage unit, 5 is a location determination unit, and 6 is an address expression-location association unit.

Database 1 consists of at least one database in which either address information and its latitude/longitude information, or the name of a facility located at a given address together with its address information and latitude/longitude information, is registered in each record. Here it consists of an address database 11, a store database 12, a station name database 13, and a landmark database 14.

As shown in FIG. 2, the address database 11 registers address information and its latitude/longitude information in each record, here for every address in Japan. Within each record, the part holding the address information is called the address field, and the part holding the latitude/longitude information is called the latitude/longitude field. The address database 11 may register, with their latitude/longitude information, all addresses in Japan down to the block and house number (for example, "Nihonbashi 1-chome xx-ban 1-go, Chuo-ku, Tokyo" and "Nihonbashi 1-chome xx-ban 2-go, Chuo-ku, Tokyo"). It may also register partial addresses that stop at the chome or at the municipality name (for example, "Nihonbashi, Chuo-ku, Tokyo", "Nihonbashi 1-chome, Chuo-ku, Tokyo", or "Chuo-ku, Tokyo") with their latitude/longitude information.

As shown in FIG. 3, the store database 12 registers, in each record, the name of a facility located at a given address, here a store (store name), together with its address information and latitude/longitude information. Within each record, the part holding the store name is called the store name field, the part holding the address information is called the address field, and the part holding the latitude/longitude information is called the latitude/longitude field.

As shown in FIG. 4, the station name database 13 registers, in each record, the name of a facility located at a given address, here a station (station name), together with its address information and latitude/longitude information. Within each record, the part holding the station name is called the station name field, the part holding the address information is called the address field, and the part holding the latitude/longitude information is called the latitude/longitude field.

As shown in FIG. 5, the landmark database 14 registers, in each record, the name of a facility located at a given address, here a landmark (landmark name), together with its address information and latitude/longitude information. Within each record, the part holding the landmark name is called the landmark name field, the part holding the address information is called the address field, and the part holding the latitude/longitude information is called the latitude/longitude field.
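The record layout shared by the four databases could be modelled roughly as below. This is a sketch, not the layout of FIGS. 2 to 5: the class name, the field names, and the single `name` field standing in for the store name, station name, and landmark name fields are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Record:
    """One row of database 1 (field names are illustrative)."""

    database: str          # which sub-database the row belongs to
    name: Optional[str]    # facility name; None for address-database rows
    address: str           # address field
    lat: float             # latitude part of the latitude/longitude field
    lon: float             # longitude part of the latitude/longitude field


# Address-database row, using values quoted in the description's own example.
nihonbashi = Record("address", None, "東京都中央区日本橋", 35.677, 139.776)

# Station row: the address and coordinates are placeholders, not real data.
station = Record("station", "東京駅", "(placeholder address)", 0.0, 0.0)
```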

The address expression storage unit 2 temporarily stores an address expression that has been entered directly from a keyboard or the like (not shown), read from a storage medium, or received from another device via a communication medium.

The location candidate generation unit 3 refers to database 1, outputs as location candidates the address information of the records containing the address expression input as described above, and outputs, for each location candidate, the number of records containing address information identical or geographically close to the candidate's address information as a fame score. More specifically, it consists of a location candidate extraction unit 31 and a fame score calculation unit 32.

The location candidate extraction unit 31 searches database 1 using the input address expression as a key and extracts, as location candidates, the address information and latitude/longitude information of all records containing that address expression. More specifically, it consists of a database selection unit 311 and a location information acquisition unit 312.

The database selection unit 311 holds in advance a database selection table, as shown in FIG. 6, that lists string rules together with the database name and field name corresponding to each. It takes an address expression as input and checks the string of the input address expression against each string rule in the table. If a rule matches, it reads the corresponding database name and field name from the table and outputs them to the location information acquisition unit 312 as the selected database name and field name.

Here, a string rule is, for example, a rule that judges the input address expression to be a "store name" when its string ends with the character "店" (store), or a rule that judges it to be a "station name" when its string ends with the character "駅" (station). By referring to a database selection table, as shown in FIG. 6, that lists these string rules with their corresponding database names and field names, the database name and field name corresponding to the input address expression can be selected and output.

Note that the database selection unit 311 serves to reduce the number of databases and fields to be searched when extracting location candidates and is not an essential component. At the cost of more computation, all databases and all fields in database 1 may instead be searched using the input address expression as a key.
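A suffix-based selection table of the kind described could look like the sketch below. Only the two rules stated in the text (a trailing "店" and a trailing "駅") are encoded; the identifiers `store_db`, `store_name`, and so on are hypothetical labels, and returning `None` when no rule matches is an assumption standing in for the fall-back of searching every database and field.

```python
# Each entry: (required final character, database name, field name).
SELECTION_TABLE = [
    ("店", "store_db", "store_name"),      # ends with "store"   -> store name
    ("駅", "station_db", "station_name"),  # ends with "station" -> station name
]


def select_database(expression):
    """Return (database, field) for the first matching rule, or None if none match."""
    for suffix, database, field in SELECTION_TABLE:
        if expression.endswith(suffix):
            return database, field
    return None  # caller may then search all databases and fields


store_choice = select_database("ＮＴＴ横須賀店")
station_choice = select_database("東京駅")
no_choice = select_database("日本橋")
```

The rules that route expressions such as "日本橋" or "東京タワー" to the address or landmark database are in FIG. 6, which is not reproduced in the text, so they are deliberately left out here.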

The location information acquisition unit 312 takes as input the address expression that was input as described above and temporarily stored in the address expression storage unit 2, and the database name and field name selected and output by the database selection unit 311. Using the address expression as a key, it searches the field of the database in database 1 that corresponds to the selected database name and field name, obtains the address information and latitude/longitude information of the records containing that address expression as location candidates, and outputs them to the fame score calculation unit 32 and the location candidate storage unit 4. (The location candidates output to the location candidate storage unit 4 may consist of address information only.)
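The search performed by the location information acquisition unit 312 amounts to a substring match on the selected field. A minimal sketch follows, assuming records are held as dictionaries with hypothetical keys `address`, `lat`, and `lon`; the two matching rows reuse the values from the description's FIG. 9 example, and the third row's coordinates are placeholders.

```python
def extract_candidates(expression, records, field):
    """Return (address, lat, lon) for every record whose field contains the expression."""
    return [
        (r["address"], r["lat"], r["lon"])
        for r in records
        if expression in r[field]
    ]


address_db = [
    {"address": "東京都中央区日本橋", "lat": 35.677, "lon": 139.776},
    {"address": "大阪府大阪市浪速区日本橋…", "lat": 34.656, "lon": 135.509},
    # Non-matching row; its coordinates are placeholders, not real data.
    {"address": "東京都中央区", "lat": 0.0, "lon": 0.0},
]

candidates = extract_candidates("日本橋", address_db, "address")
```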

The fame score calculation unit 32 takes as input the address information and latitude/longitude information of the location candidates extracted and output by the location candidate extraction unit 31. For each location candidate, it calculates the geographical distance between the candidate's address information and the address information of each record in database 1 from their latitude/longitude information, and counts the records whose geographical distance is at or below a preset fixed distance. It outputs these counts to the location determination unit 5 as the fame score of each location candidate.

Here, the geographical distance $D$ between the address information of a location candidate and the address information of one record in database 1 is calculated from their latitudes $N_1$, $N_2$ and longitudes $E_1$, $E_2$ as

$$D = \sqrt{(N_1 - N_2)^2 + (E_1 - E_2)^2}$$

Because it suffices to determine whether the distance D is larger or smaller than the preset fixed distance, as described above, and the absolute distance value is not required, the square root (the 1/2 power) in the above formula may be omitted. Instead of this formula, a known formula for calculating the distance between two points from latitude and longitude information may also be used, for example the "formula for converting latitude and longitude to plane rectangular coordinates and calculating the geodesic length" (see B. R. Bowring, "TOTAL INVERSE SOLUTIONS FOR THE GEODESIC AND GREAT ELLIPTIC," Survey Review, Vol. 33, No. 261, July 1996, pp. 461-476 (http://vldb.gsi.go.jp/sokuchi/surveycalc/algorithm/)).
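The comparison without the square root can be sketched as follows. This is a minimal illustration, assuming a non-negative threshold so that comparing squared values gives the same answer as comparing the distances themselves; the function names are hypothetical.

```python
def squared_distance(n1, e1, n2, e2):
    """Squared form of D: (N1 - N2)^2 + (E1 - E2)^2, with no square root taken."""
    return (n1 - n2) ** 2 + (e1 - e2) ** 2


def is_within(n1, e1, n2, e2, limit):
    """True if D <= limit, decided on squared values (limit must be >= 0)."""
    return squared_distance(n1, e1, n2, e2) <= limit ** 2


close_enough = is_within(0.0, 0.0, 3.0, 4.0, 5.0)   # D is exactly 5
too_far = is_within(0.0, 0.0, 3.0, 4.0, 4.9)
```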

Specifically, the fame score calculation unit 32 operates on the latitude/longitude fields of all records in each database. By restricting which databases are included in the calculation, fame scores corresponding to various assumptions can be obtained.

For example, if a location is assumed to be more famous the more stores it has, using only the store database yields a fame score that is higher for location candidates with more stores.

Likewise, if a location is assumed to be more famous the more administrative divisions it has, using only the address database yields a fame score that is higher for location candidates with more administrative divisions.

Furthermore, if a location is assumed to be more famous the more often it is described in web documents, using only a database (not shown in FIG. 1) of the documents hit by web searches for each address yields a fame score that is higher for location candidates described more often in web documents.
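Restricting the calculation to particular databases can be sketched as an optional filter on the count. The function name, the `databases` parameter, the database labels, and all coordinates below are assumptions made for illustration, and the Euclidean distance is the simple formula given in the description.

```python
import math


def fame_score(candidate, records, limit, databases=None):
    """Count records within `limit` of the candidate, optionally only from `databases`."""
    lat, lon = candidate
    return sum(
        1
        for database, rec_lat, rec_lon in records
        if (databases is None or database in databases)
        and math.hypot(lat - rec_lat, lon - rec_lon) <= limit
    )


# Hypothetical records: (database label, latitude, longitude).
records = [
    ("store", 0.0, 0.1),
    ("store", 0.2, 0.0),
    ("address", 0.0, 0.0),
    ("store", 5.0, 5.0),
]

score_all = fame_score((0.0, 0.0), records, limit=1.0)
score_stores = fame_score((0.0, 0.0), records, limit=1.0, databases={"store"})
```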

The fame score calculation unit 32 also need not calculate distances to each record and count the matching records after a location candidate is input. For example, fame scores for every possible location candidate may be calculated in advance and registered in a database, and when a location candidate is input, its fame score may be read from that database and output.
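The precomputation option can be sketched as building a lookup table once and reading from it at query time. An in-memory dictionary stands in for the database of precomputed scores here, which is an assumption; the function name and the sample coordinates are hypothetical.

```python
import math


def build_fame_table(addresses, records, limit):
    """Precompute, for every known address, the number of records within `limit`."""
    table = {}
    for name, (lat, lon) in addresses.items():
        table[name] = sum(
            1
            for rec_lat, rec_lon in records
            if math.hypot(lat - rec_lat, lon - rec_lon) <= limit
        )
    return table


# Hypothetical data: two addresses and three record positions.
addresses = {"A": (0.0, 0.0), "B": (3.0, 3.0)}
records = [(0.0, 0.0), (0.0, 0.5), (3.0, 3.0)]

fame_table = build_fame_table(addresses, records, limit=1.0)
score_for_a = fame_table["A"]  # at query time this is a plain lookup
```

The trade-off is the usual one: the table costs a pass over all address and record pairs up front and must be rebuilt when the databases change, but each query becomes a single lookup.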

The location candidate storage unit 4 temporarily stores the location candidates output from the location candidate generation unit 3 (specifically, from the location information acquisition unit 312 of its location candidate extraction unit 31).

The location determination unit 5 determines, among the location candidates output from the location candidate generation unit 3, the one with the highest fame score to be the location. More specifically, it takes as input the location candidates output from the location candidate extraction unit 31 and temporarily stored in the location candidate storage unit 4, and the fame scores output from the fame score calculation unit 32, determines the location candidate with the highest fame score to be the location, and outputs it to the address expression-location association unit 6.

The address expression-location association unit 6 associates the input address expression with the location determined by the location determination unit 5 and outputs them. More specifically, it takes as input the address expression that was input as described above and temporarily stored in the address expression storage unit 2, and the location output from the location determination unit 5, and outputs the address expression and the location (its address information) in association with each other.

When the address expression and the location need not be output in association with each other, and only the location corresponding to the input address expression is required, the location determined by the location determination unit 5 may be output externally directly from that unit. In this case, the address expression-location association unit 6 is unnecessary.

FIG. 7 shows the flow of processing in the location analyzer of the present invention. Its operation is described in detail below using examples.

The address expression input as described above is stored in the address expression storage unit 2 and is also input to the database selection unit 311. The string of the address expression input to the database selection unit 311 is checked against the string rules in the database selection table held in advance by that unit, the database name and field name corresponding to the matching string rule are selected, and that database name and field name are output to the location information acquisition unit 312 (s1).

For example, as shown in FIG. 8, when the address expression "Nihonbashi" (日本橋) is input, "address database, address field" is output. When the address expression "Tokyo Tower" (東京タワー) is input, "landmark database, landmark name field" is output. When the address expression "NTT Yokosuka-ten" (ＮＴＴ横須賀店, NTT Yokosuka store) is input, "store database, store name field" is output. When the address expression "Tokyo-eki" (東京駅, Tokyo Station) is input, "station name database, station name field" is output.

Next, the location information acquisition unit 312 reads the address expression stored in the address expression storage unit 2 and receives the database name and field name selected and output by the database selection unit 311. Using the address expression as a key, it searches the field of the database in database 1 that corresponds to the selected database name and field name, obtains the address information and latitude/longitude information of the records containing that address expression as location candidates, outputs them to the fame score calculation unit 32, and outputs them to the location candidate storage unit 4, where they are stored (s2). (The location candidates output to and stored in the location candidate storage unit 4 may consist of address information only.)

For example, when the address expression is "Nihonbashi", the selected database name is "address database (DB)", and the selected field name is "address field", then "Nihonbashi, Chuo-ku, Tokyo, 35.677, 139.776" and "Nihonbashi ..., Naniwa-ku, Osaka-shi, Osaka, 34.656, 135.509" are obtained and output as location candidates, as shown in FIG. 9. (Strictly speaking, with the address database 11 shown in FIG. 2, "Nihonbashi 1-chome, Chuo-ku, Tokyo, 35.679, 139.754" would also be obtained and output as a location candidate, but it is omitted here to simplify the explanation.)

The processing up to this point is the "location candidate extraction process".

Next, the fame score calculation unit 32 receives the address information and latitude/longitude information of the location candidates extracted and output by the location candidate extraction unit 31. For each location candidate, the geographical distance between the candidate's address information and the address information of each record in database 1 is calculated from their latitude/longitude information, and the records whose geographical distance is at or below the preset fixed distance are counted. These counts are output to the location determination unit 5 as the fame score of each location candidate (s3).

For example, when the location candidate is "Nihonbashi, Chuo-ku, Tokyo, 35.677, 139.776" and the latitude/longitude information of one record in database 1 is "34.123, 138.654", the geographical distance $D$ between them is calculated as

$$D = \sqrt{(35.677 - 34.123)^2 + (139.776 - 138.654)^2} = 1.916715941$$

This calculated distance D is compared with the preset fixed distance, for example "100". If it is "100" or less, the record is counted (+1) as a matching record, that is, as a record containing address information geographically close to the address information of the location candidate "Nihonbashi, Chuo-ku, Tokyo, 35.677, 139.776". The same distance calculation, comparison, and counting are performed against all records in database 1, and if the final count is 150, "150" is output as the fame score, as shown in FIG. 10. The same processing is performed when the location candidate is "Nihonbashi ..., Naniwa-ku, Osaka-shi, Osaka, 34.656, 135.509", and if the final count is 50, "50" is output as the fame score, as shown in FIG. 10.
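The arithmetic in this example can be checked with a few lines of Python. The coordinates and the threshold "100" are the ones quoted in the text; the variable names are illustrative only.

```python
import math

# Candidate "Nihonbashi, Chuo-ku, Tokyo" and one database record, as quoted in the text.
candidate_lat, candidate_lon = 35.677, 139.776
record_lat, record_lon = 34.123, 138.654

# D = sqrt((N1 - N2)^2 + (E1 - E2)^2)
d = math.hypot(candidate_lat - record_lat, candidate_lon - record_lon)

# The record is counted when D is at or below the preset fixed distance.
threshold = 100
counted = d <= threshold
```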

The processing up to this point is the "location candidate generation process".

Next, the location determination unit 5 reads the location candidates stored in the location candidate storage unit 4 and receives the fame scores calculated and output by the fame score calculation unit 32. The location candidate with the highest fame score is determined to be the location and is output to the address expression-location association unit 6 (s4).

For example, when the location candidates are "Nihonbashi, Chuo-ku, Tokyo" and "Nihonbashi ..., Naniwa-ku, Osaka-shi, Osaka" and their fame scores are "150" and "50", "Nihonbashi, Chuo-ku, Tokyo" is determined to be the location and output, as shown in FIG. 11.
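The decision step reduces to picking the candidate with the maximum score. A minimal sketch with the candidates and scores from this example follows; the function name is hypothetical, and the source does not say how ties are handled, so returning the earliest of equally scored candidates is an arbitrary choice of this sketch.

```python
def decide_location(candidates, scores):
    """Return the candidate with the highest fame score (earliest one on a tie)."""
    best_index = max(range(len(candidates)), key=lambda i: scores[i])
    return candidates[best_index]


candidates = ["東京都中央区日本橋", "大阪府大阪市浪速区日本橋…"]
scores = [150, 50]
location = decide_location(candidates, scores)
```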

Finally, the address expression-location association unit 6 receives the address expression read from the address expression storage unit 2 together with the location determined and output by the location determination unit 5, and outputs the address expression in association with the location (its address information) (s5).

For example, when the address expression is “Nihonbashi” and the location is “Nihonbashi, Chuo-ku, Tokyo”, “Nihonbashi: Nihonbashi, Chuo-ku, Tokyo” is output, as shown in FIG. 12.
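Steps s4 and s5 can be illustrated with a short sketch. The function names `determine_location` and `associate` are hypothetical, and the tie-breaking rule (the earliest candidate wins) is an assumption, since the text does not say how equal scores are handled.

```python
def determine_location(candidates, scores):
    """Return the candidate with the highest fame score (step s4).

    Assumption: on a tie, the earliest candidate is returned; the source
    text does not specify tie handling.
    """
    best = max(range(len(candidates)), key=lambda i: scores[i])
    return candidates[best]


def associate(expression, location):
    """Pair the address expression with the determined location (step s5)."""
    return f"{expression}: {location}"


candidates = [
    "Nihonbashi, Chuo-ku, Tokyo",
    "Nipponbashi, Naniwa-ku, Osaka-shi, Osaka",
]
scores = [150, 50]

location = determine_location(candidates, scores)
print(associate("Nihonbashi", location))  # Nihonbashi: Nihonbashi, Chuo-ku, Tokyo
```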

Thus, according to this embodiment, even when two or more locations correspond to the input address expression, only one of them can be automatically selected and output.

As described above, when the address expression and the location need not be output in association with each other and only the location corresponding to the input address expression is required, the location determined by the location determination processing (s4) may be output directly to the outside. In this case, the address expression-location association processing (s5) is unnecessary.

<Embodiment 2>
In Embodiment 2 of the present invention, as shown in FIG. 13, an address database is used in which address information divided by address level (prefecture; city, ward, or county; town or village; town or oaza; aza or chome) is registered together with its latitude/longitude information for each record. For example, if the address information of one record in the address database described in Embodiment 1 is “東京都中央区日本橋…” (Nihonbashi …, Chuo-ku, Tokyo), the address information of the same record in this embodiment is registered divided by address level as “東京／都，中央／区，日本橋，…” (Tokyo/to, Chuo/ku, Nihonbashi, …). Here “，” is a delimiter symbol, and “／” indicates that the character string from that symbol up to the next delimiter “，” may or may not be present in the address expression.

Then, the location candidate generation unit 3 (specifically, the location candidate acquisition unit 312 of the location candidate extraction unit 31) searches the address field of the address database using the input address expression (here, an address expression consisting of part of the address information) as a key, and acquires the address information of the records containing that address expression. Of these, the address information of the records that also satisfy delimiter matching with the address expression is output, together with the latitude/longitude information, as location candidates.

According to this embodiment, erroneous location candidates produced by a mere match of the character string making up the address expression are reduced. For example, for the input address expression “イラン” (Iran), a record containing the address information “英国ハイランド州…” (Highland …, United Kingdom), in which the characters “イラン” appear only inside “ハイランド”, is less likely to be extracted and generated as a location candidate.
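One possible reading of the delimiter matching in this embodiment is sketched below. It is an interpretation, not the method stated in the source: the sketch assumes that an address expression matches only when it starts and ends on level boundaries of the registered address, with each “/” suffix treated as optional. The function names and the half-width “,” and “/” symbols are assumptions, as is the segmented form given to the Highland record.

```python
def parse_segments(address):
    """Split 'stem/suffix,stem/suffix,...' into (stem, optional_suffix) pairs."""
    segments = []
    for part in address.split(","):
        stem, _, suffix = part.partition("/")
        segments.append((stem, suffix))
    return segments


def matches_on_delimiters(expression, address):
    """True if `expression` is a run of whole address levels of `address`.

    Each level may appear with or without its optional suffix.
    """
    if not expression:
        return False
    segments = parse_segments(address)

    def match_from(index, pos):
        if pos == len(expression):
            return True  # the expression ended exactly on a level boundary
        if index == len(segments):
            return False  # ran out of levels before the expression ended
        stem, suffix = segments[index]
        forms = [stem + suffix, stem] if suffix else [stem]
        return any(
            expression.startswith(form, pos)
            and match_from(index + 1, pos + len(form))
            for form in forms
        )

    # The expression may begin at any address level.
    return any(match_from(start, 0) for start in range(len(segments)))


tokyo_record = "東京/都,中央/区,日本橋"
highland_record = "英国,ハイランド/州"  # hypothetical segmentation of the example

print(matches_on_delimiters("日本橋", tokyo_record))        # True
print(matches_on_delimiters("東京中央区", tokyo_record))    # True
print(matches_on_delimiters("イラン", highland_record))     # False
```

Under this reading, “イラン” is rejected because it sits in the middle of the level “ハイランド” rather than on a level boundary, while a plain substring search would have accepted it.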

<Embodiment 3>
In Embodiment 3 of the present invention, the location candidates that are output from the location candidate generation unit 3 (specifically, the location candidate acquisition unit 312 of the location candidate extraction unit 31), stored in the location candidate storage unit 4, and input to the location determination unit 5 must include latitude/longitude information together with address information. Likewise, the location that is output from the location determination unit 5, input to the address expression-location association unit 6, and output from that unit in association with the address expression must include latitude/longitude information together with address information. For example, when the address expression is “Nihonbashi” and the location is “Nihonbashi, Chuo-ku, Tokyo, 35.677, 139.776”, the address expression-location association unit 6 outputs “Nihonbashi: Nihonbashi, Chuo-ku, Tokyo; 35.677, 139.776” or the like.

According to this embodiment, latitude/longitude information can be output in addition to address information as the location corresponding to the address expression.

<Embodiment 4>
In Embodiment 4 of the present invention, the location candidate generation unit 3 refers to database 1, outputs the address information of the records containing the input address expression as location candidates, and outputs, as the fame score for each location candidate, the n-th smallest (n is a natural number) of the geographical distances between the address information of that location candidate and the address information of each record in database 1. Specifically, the fame score calculation unit 32 receives the address information and latitude/longitude information of the location candidates extracted and output by the location candidate extraction unit 31, and calculates the geographical distance between the address information of each location candidate and the address information of each record in database 1 using their latitude/longitude information. It then sorts the calculated distances in ascending order and takes the n-th (for example, the 100th) distance counted from the smallest. The same processing is performed for each location candidate, and the resulting values are output to the location determination unit 5 as the fame scores of the respective location candidates.

In addition, the location determination unit 5 determines, as the location, the location candidate with the lowest fame score among the location candidates output from the location candidate generation unit 3. More specifically, it receives the location candidates output from the location candidate extraction unit 31 and temporarily stored in the location candidate storage unit 4, together with the fame scores output from the fame score calculation unit 32, determines the location candidate with the lowest fame score as the location, and outputs it to the address expression-location association unit 6.

According to this embodiment, a value greater than “0” is always obtained as the fame score of each location candidate (in the preceding embodiments, the fame score is “0” when all the calculated geographical distances exceed the fixed distance), so all location candidates can be compared relative to one another.
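The scoring of Embodiment 4 can be sketched as follows. This is an illustrative sketch under stated assumptions: the function names, the (latitude, longitude) tuple layout, the sample records, and the choice of n = 3 are all made up for the example, and the Euclidean distance on latitude/longitude values is carried over from the worked calculation in Embodiment 1.

```python
import heapq
import math


def nth_nearest_distance(candidate, records, n):
    """Return the n-th smallest distance from the candidate to the records.

    Assumption: fewer than n records is treated as an error; the source
    text does not cover that case.
    """
    distances = [
        math.hypot(candidate[0] - lat, candidate[1] - lon)
        for lat, lon in records
    ]
    if n < 1 or len(distances) < n:
        raise ValueError("need at least n records and n >= 1")
    return heapq.nsmallest(n, distances)[-1]


def determine_location_lowest(candidates, records, n):
    """Pick the candidate name whose n-th nearest record is closest."""
    return min(
        candidates,
        key=lambda name: nth_nearest_distance(candidates[name], records, n),
    )


# Hypothetical sample data: three records near Tokyo, two near Osaka.
records = [
    (35.680, 139.770), (35.690, 139.700), (35.660, 139.750),
    (34.660, 135.500), (34.700, 135.490),
]
candidates = {
    "Nihonbashi, Chuo-ku, Tokyo": (35.677, 139.776),
    "Nipponbashi, Naniwa-ku, Osaka-shi, Osaka": (34.656, 135.509),
}

print(determine_location_lowest(candidates, records, n=3))
```

In this sample the Tokyo candidate has three records nearby, so its third-nearest distance is small, while the Osaka candidate has to reach as far as the Tokyo records for its third neighbour. A smaller score therefore indicates a denser neighbourhood, which is why this embodiment selects the lowest score.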

The description so far has assumed that the location candidate extraction unit 31 (location candidate extraction processing) extracts two or more location candidates, but some address expressions correspond to only one location. Therefore, a location candidate count determination unit (location candidate count determination processing) that determines whether the number of location candidates is one or greater than one may be provided after the location candidate extraction unit 31 (location candidate extraction processing). If the number of location candidates is “1”, the fame score calculation unit 32 (fame score calculation processing) and the location determination unit 5 (location determination processing) are skipped, and the single extracted location candidate is output as the location as it is, or is output in association with the input address expression by the address expression-location association unit 6 (address expression-location association processing). Needless to say, if the number of location candidates is “0”, that is, if none is obtained, the location candidate extraction unit 31 (location candidate extraction processing) outputs a notification to that effect and the processing ends.
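The branching on the number of location candidates can be sketched as below. The function name `resolve_location`, the use of `None` to signal that no candidate was found, and the `score_fn` callback are assumptions made for illustration; the sketch follows the highest-score rule of the earlier embodiments.

```python
def resolve_location(candidates, score_fn):
    """Choose a location according to the number of extracted candidates.

    0 candidates: return None so that the caller can report the failure.
    1 candidate:  return it directly, skipping scoring and determination.
    2 or more:    score every candidate and return the highest-scoring one.
    """
    if not candidates:
        return None
    if len(candidates) == 1:
        return candidates[0]
    return max(candidates, key=score_fn)


# Hypothetical scores, reusing the values from the example in the text.
sample_scores = {
    "Nihonbashi, Chuo-ku, Tokyo": 150,
    "Nipponbashi, Naniwa-ku, Osaka-shi, Osaka": 50,
}

print(resolve_location([], sample_scores.get))
print(resolve_location(["Nihonbashi, Chuo-ku, Tokyo"], sample_scores.get))
print(resolve_location(list(sample_scores), sample_scores.get))
```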

The present invention can also be realized by installing a program comprising the procedure shown in the flowchart of FIG. 7, via a medium or a communication line, on a well-known computer that includes or is connected to the above-described database.

FIG. 1: Block diagram showing an example of an embodiment of the location analysis apparatus of the present invention
FIG. 2: Explanatory diagram showing an example of the address database
FIG. 3: Explanatory diagram showing an example of the store database
FIG. 4: Explanatory diagram showing an example of the station name database
FIG. 5: Explanatory diagram showing an example of the landmark database
FIG. 6: Explanatory diagram showing an example of the database selection table
FIG. 7: Flowchart of the processing in the location analysis apparatus of the present invention
FIG. 8: Explanatory diagram showing an example of the input/output values of the database selection unit
FIG. 9: Explanatory diagram showing an example of the input/output values of the location candidate acquisition unit
FIG. 10: Explanatory diagram showing an example of the input/output values of the fame score calculation unit
FIG. 11: Explanatory diagram showing an example of the input/output values of the location determination unit
FIG. 12: Explanatory diagram showing an example of the input/output values of the address expression-location association unit
FIG. 13: Explanatory diagram showing another example of the address database

Explanation of symbols

1: database, 2: address expression storage unit, 3: location candidate generation unit, 4: location candidate storage unit, 5: location determination unit, 6: address expression-location association unit, 31: location candidate extraction unit, 32: fame score calculation unit, 311: database selection unit, 312: location candidate acquisition unit, s1: database selection processing, s2: location candidate extraction processing, s3: fame score calculation processing, s4: location determination processing, s5: address expression-location association processing.

Claims

A location analysis apparatus for obtaining, from an address expression consisting of part of address information or of the name of a facility existing at a predetermined address, a location that is information representing the place corresponding to the address expression, the apparatus comprising:
at least one database in which either address information alone, or the name of a facility existing at a predetermined address together with its address information, is registered for each record;
a location candidate generation unit that refers to the database, outputs the address information of the records containing the input address expression as location candidates, and outputs, as a fame score for each location candidate, the number of records containing address information that is identical to, or geographically close to, the address information of that location candidate; and
a location determination unit that determines, as the location, the location candidate with the highest fame score among the location candidates output from the location candidate generation unit.

The location analysis apparatus according to claim 1, wherein the location candidate generation unit comprises:
a location candidate extraction unit that searches the database using the input address expression as a key and extracts the address information of the records containing the address expression as location candidates; and
a fame score calculation unit that calculates the geographical distance between the address information of each location candidate extracted by the location candidate extraction unit and the address information of each record in the database, and counts, as the fame score for each location candidate, the number of records whose geographical distance is equal to or less than a fixed distance.

A location analysis apparatus for obtaining, from an address expression consisting of part of address information or of the name of a facility existing at a predetermined address, a location that is information representing the place corresponding to the address expression, the apparatus comprising:
at least one database in which either address information alone, or the name of a facility existing at a predetermined address together with its address information, is registered for each record;
a location candidate generation unit that refers to the database, outputs the address information of the records containing the input address expression as location candidates, and outputs, as a fame score for each location candidate, the n-th smallest of the geographical distances between the address information of that location candidate and the address information of each record in the database; and
a location determination unit that determines, as the location, the location candidate with the lowest fame score among the location candidates output from the location candidate generation unit.

The location analysis apparatus according to claim 3, wherein the location candidate generation unit comprises:
a location candidate extraction unit that searches the database using the input address expression as a key and extracts the address information of the records containing the address expression as location candidates; and
a fame score calculation unit that calculates the geographical distance between the address information of each location candidate extracted by the location candidate extraction unit and the address information of each record in the database, and determines, as the fame score for each location candidate, the n-th distance counted from the smallest when the geographical distances are arranged in ascending order.

The location analysis apparatus according to claim 2 or 4, wherein
a database having a database name representing the contents of the database and field names representing the contents of its registered information is used, and
the location candidate extraction unit comprises:
a database selection unit that refers to a pre-stored database selection table describing character string rules and the database names and field names corresponding to them, selects the database name and field name corresponding to the character string rule matched by a character string contained in the input address expression, and outputs the selected database name and field name; and
a location candidate acquisition unit that searches, using the input address expression as a key, the field of the database corresponding to the database name and field name selected by the database selection unit, and acquires the address information of the records containing the address expression as location candidates.

The location analysis apparatus according to claim 2 or 4, wherein
at least one database is used in which address information and its latitude/longitude information, or the name of a facility existing at a predetermined address together with its address information and latitude/longitude information, are registered for each record,
the location candidate extraction unit extracts location candidates that include latitude/longitude information together with address information, and
the fame score calculation unit calculates the geographical distance between the address information of each location candidate extracted by the location candidate extraction unit and the address information of each record in the database using their latitude/longitude information.

The location analysis apparatus according to any one of claims 1 to 6, wherein
at least a database is used in which address information divided by address level (prefecture; city, ward, or county; town or village; town or oaza; aza or chome) is registered, and
the location candidate generation unit outputs, as the location candidates, the address information of those records that, among the records containing the input address expression consisting of part of the address information, satisfy delimiter matching with the address expression.

A location analysis method for obtaining, from an address expression consisting of part of address information or of the name of a facility existing at a predetermined address, a location that is information representing the place corresponding to the address expression,
the method using a computer provided with a database in which either address information alone, or the name of a facility existing at a predetermined address together with its address information, is registered for each record,
wherein the computer executes:
a location candidate generation processing step of referring to the database, outputting the address information of the records containing the input address expression as location candidates, and outputting, as a fame score for each location candidate, the number of records containing address information that is identical to, or geographically close to, the address information of that location candidate; and
a location determination processing step of determining, as the location, the location candidate with the highest fame score among the location candidates output in the location candidate generation processing step.

The location analysis method according to claim 8, wherein the location candidate generation processing step comprises:
a location candidate extraction processing step of searching the database using the input address expression as a key and extracting the address information of the records containing the address expression as location candidates; and
a fame score calculation processing step of calculating the geographical distance between the address information of each location candidate extracted in the location candidate extraction processing step and the address information of each record in the database, and counting, as the fame score for each location candidate, the number of records whose geographical distance is equal to or less than a fixed distance.

A location analysis method for obtaining, from an address expression consisting of part of address information or of the name of a facility existing at a predetermined address, a location that is information representing the place corresponding to the address expression,
the method using a computer provided with a database in which either address information alone, or the name of a facility existing at a predetermined address together with its address information, is registered for each record,
wherein the computer executes:
a location candidate generation processing step of referring to the database, outputting the address information of the records containing the input address expression as location candidates, and outputting, as a fame score for each location candidate, the n-th smallest of the geographical distances between the address information of that location candidate and the address information of each record in the database; and
a location determination processing step of determining, as the location, the location candidate with the lowest fame score among the location candidates output in the location candidate generation processing step.

The location analysis method according to claim 10, wherein the location candidate generation processing step comprises:
a location candidate extraction processing step of searching the database using the input address expression as a key and extracting the address information of the records containing the address expression as location candidates; and
a fame score calculation processing step of calculating the geographical distance between the address information of each location candidate extracted in the location candidate extraction processing step and the address information of each record in the database, and determining, as the fame score for each location candidate, the n-th distance counted from the smallest when the geographical distances are arranged in ascending order.

The location analysis method according to claim 9 or 11, wherein
a database having a database name representing the contents of the database and field names representing the contents of its registered information is used, and
the location candidate extraction processing step comprises:
a database selection processing step of referring to a pre-stored database selection table describing character string rules and the database names and field names corresponding to them, selecting the database name and field name corresponding to the character string rule matched by a character string contained in the input address expression, and outputting the selected database name and field name; and
a location candidate acquisition processing step of searching, using the input address expression as a key, the field of the database corresponding to the database name and field name selected in the database selection processing step, and acquiring the address information of the records containing the address expression as location candidates.

The location analysis method according to claim 9 or 11, wherein
at least one database is used in which address information and its latitude/longitude information, or the name of a facility existing at a predetermined address together with its address information and latitude/longitude information, are registered for each record,
the location candidate extraction processing step extracts location candidates that include latitude/longitude information together with address information, and
the fame score calculation processing step calculates the geographical distance between the address information of each location candidate extracted in the location candidate extraction processing step and the address information of each record in the database using their latitude/longitude information.

The location analysis method according to any one of claims 8 to 13, wherein
at least a database is used in which address information divided by address level (prefecture; city, ward, or county; town or village; town or oaza; aza or chome) is registered, and
the location candidate generation processing step outputs, as the location candidates, the address information of those records that, among the records containing the input address expression consisting of part of the address information, satisfy delimiter matching with the address expression.

A location analysis program for causing a computer to execute each processing step of the location analysis method according to claim 8.

A computer-readable recording medium on which the location analysis program according to claim 15 is recorded.