WO2017063531A1 - 一种基于地址信息的账户映射方法及装置 - Google Patents

一种基于地址信息的账户映射方法及装置 Download PDF

Info

Publication number
WO2017063531A1
WO2017063531A1 PCT/CN2016/101639 CN2016101639W WO2017063531A1 WO 2017063531 A1 WO2017063531 A1 WO 2017063531A1 CN 2016101639 W CN2016101639 W CN 2016101639W WO 2017063531 A1 WO2017063531 A1 WO 2017063531A1
Authority
WO
WIPO (PCT)
Prior art keywords
address information
information
address
latitude
longitude
Prior art date
Application number
PCT/CN2016/101639
Other languages
English (en)
French (fr)
Inventor
周扬
熊军
何帝君
Original Assignee
阿里巴巴集团控股有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司 filed Critical 阿里巴巴集团控股有限公司
Priority to JP2018519762A priority Critical patent/JP6752475B2/ja
Priority to SG11201803052QA priority patent/SG11201803052QA/en
Priority to KR1020187013633A priority patent/KR102117773B1/ko
Priority to EP16854910.3A priority patent/EP3364309B1/en
Publication of WO2017063531A1 publication Critical patent/WO2017063531A1/zh
Priority to US15/949,981 priority patent/US10990353B2/en
Priority to US16/720,916 priority patent/US10725737B2/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/06Arrangements for sorting, selecting, merging, or comparing data on individual record carriers
    • G06F7/20Comparing separate sets of record carriers arranged in the same sequence to determine whether at least some of the data in one set is identical with that in the other set or sets
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2291User-Defined Types; Storage management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q20/00Payment architectures, schemes or protocols
    • G06Q20/08Payment architectures
    • G06Q20/14Payment architectures specially adapted for billing systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Definitions

  • the present application relates to the field of network technologies, and in particular, to an address mapping method and apparatus based on address information.
  • the user can generally use the application software provided by the service provider to pay the living fee.
  • the payment account number provided by the charging service party needs to be input, but the user account experience is inconvenient because the payment account number is inconvenient for the user to experience.
  • the application service party usually has its own account system, including application account number and user address information, etc., in order to provide users with a variety of different business services, such as in addition to the living payment service, it can also provide information promotion services and item acquisition services for users. Wait. As a fee-paying party for living payment, it also has its own account system, including the payment account number and user address information. It can be seen that if the relationship between the account system of the application service party and the account system of the charging service party can be established, the application service party can directly provide the payment service to the user through the application account registered by the user.
  • the embodiment of the present application provides an account mapping method and device based on address information, which is used to establish an association relationship between different account systems.
  • An embodiment of the present application provides an account mapping method based on address information, including:
  • the address feature information includes one or more of latitude and longitude information, POI information, and digital substring information;
  • the preset address feature matching condition that is satisfied between any second address information in the second address information candidate set and the first address information includes one or more of the following conditions:
  • the distance between the latitude and longitude corresponding to the second address information and the first address information respectively is less than the set distance threshold
  • the second PO information of the second address information and the first address information respectively are the same;
  • the maximum number of consecutive identical digital substrings between the second address information and the first address information is greater than a set number threshold, wherein each digital substring corresponds to a location range of the address, the same address Different digital substrings correspond to different ranges of addresses.
  • the latitude and longitude information corresponding to the extracted first address information and the second address information is found by querying the latitude and longitude library storing the mapping relationship between the address information and the latitude and longitude. ;
  • the address feature information includes POI information, extracting POI information from the first address information and the second address information according to the keyword information of the preset corresponding POI information;
  • the address feature information includes digital substring information, according to a preset digital substring matching mode, from the The digital substring information is extracted from the first address information and the second address information.
  • extracting the address feature information from each of the extracted first address information and the second address information includes:
  • the second address information is classified into the second address information candidate set that matches the first address information.
  • the selecting, by the determined second address information candidate set, a second address information that matches the first address information includes:
  • the second address information corresponding to the highest matching score is selected as the second address information that matches the first address information.
  • d(c1, c2) represents the distance between the latitude and longitude c1 corresponding to any second address information and the latitude and longitude c2 corresponding to the first address information
  • k is a constant
  • the digital substring matching score S2 between the any second address information and the first address information is determined according to the following formula:
  • N is the number of consecutive maximum number of sub-strings between the second address information and the first address information
  • p is a constant
  • An embodiment of the present application provides an account mapping device based on address information, including:
  • An extracting module configured to respectively extract first address information from a plurality of first account information provided by the first server, and extract second address information from the plurality of second account information provided by the second server;
  • a search module configured to search, for each first address information extracted by the extraction module, the second address information that matches the first address information from each of the extracted second address information;
  • a mapping module configured to: after the searching module finds the second address information that matches the first address information, the first account information to which the first address information belongs, and the second address that is found by the searching module The mapping relationship between the second account information to which the information belongs.
  • the mapping relationship between the account information of different account systems is established by using the address information included in the account information, and the mapping relationship can be used to locate the user based on the account information registered by the user at the first service party.
  • the account information at the second service party so that the service processing related to the second service party can be completed through the window provided by the first service party.
  • FIG. 1 is a flowchart of an address mapping based account mapping method according to Embodiment 1 of the present application;
  • FIG. 2 is a flowchart of an address mapping based account mapping method according to Embodiment 2 of the present application
  • Figure 3 is a latitude and longitude scatter plot
  • FIG. 4 is a schematic diagram of determining a second address information candidate set
  • FIG. 5 is a flowchart of an address mapping based account mapping method according to Embodiment 3 of the present application.
  • FIG. 6 is a schematic diagram of an address matching frame corresponding to FIG. 5;
  • FIG. 7 is a schematic structural diagram of an address mapping based account mapping apparatus according to an embodiment of the present disclosure.
  • the first address information is extracted from the plurality of first account information provided by the first service party, and the second address information is separately extracted from the plurality of second account information provided by the second service party; Extracting each of the first address information, searching for the second address information that matches the first address information from the extracted second address information; after finding the second address information that matches the first address information, Establishing a mapping relationship between the first account information to which the first address information belongs and the second account information to which the found second address information belongs. It can be seen that, in the embodiment of the present application, the mapping relationship between the account information of different account systems is established by using the address information included in the account information, and the mapping relationship can be used to locate the user based on the account information registered by the user at the first service party.
  • the account information at the second service party so that the service processing related to the second service party can be completed through the window provided by the first service party.
  • the first service party is the application service party and the second service party is the charging service party for the living payment (water fee, electricity fee, gas fee, etc.)
  • the second service may be provided by the application software provided by the first service party. The fee provided by the party is paid.
  • a flow chart of an address information-based account mapping method includes the following steps:
  • S101 Extract first address information from the plurality of first account information provided by the first service party, and extract second address information from the plurality of second account information provided by the second service party.
  • the execution subject of the embodiment of the present application may be a server of the first servant.
  • the first service party may be a service party that provides an application service to the user.
  • the first account information of the first service party may include an account registered by the user, and the first address information filled in when the user registers or uses the application service, such as receiving the goods. Address, etc.
  • the second service party may provide the second account information of each user to the first service party.
  • the second account information herein includes the user's account number (such as the payment account number) and the second address information.
  • S102 Search, for each of the extracted first address information, the second address information that matches the first address information from each of the extracted second address information.
  • the matching second address information is searched for the first address information in each first account information.
  • the matching between the first address information and the second address information means that the addresses pointed to by the first address information and the second address information are consistent.
  • the address pointed to by the first address information and the second address information may include the case where the first address information and the second address information are the same, and may also include the case that the addresses pointed to by the two are substantially the same address although the representation forms are different.
  • an implementation manner of how to find the second address information that matches the first address information is provided. For details, refer to the description of the second embodiment.
  • mapping relationship between the two account systems can be established through the address information. That is, in the embodiment of the present application, two accounts that are considered to have the same address information belong to the same user.
  • the service processing request related to the second account information triggered by the user for example, the service processing request here is a living payment request, correspondingly, the second account information is the living payment account information, but it is assumed here that the user does not provide the first
  • the second account information is only triggered by the living payment request, and the second account information corresponding to the first account information is determined according to the established mapping relationship and the first account information of the user, and is executed for the found second account information.
  • the service processes the request.
  • the description of the same address may be different, the content may be different, and the detailed level of the description may be different.
  • address matching if only the first address information and the second address information are identical, If the two are matched, the matching success rate of the address information will be very low. Based on this, a preferred embodiment for performing address matching is given in the second embodiment of the present application.
  • a flowchart of an address information-based account mapping method provided by Embodiment 2 of the present application includes the following steps:
  • S201 extracting first address information from the plurality of first account information provided by the first service party, and extracting second address information from the plurality of second account information provided by the second service party, respectively, and extracting each The first address information is executed in the following S202 to S204.
  • S202 Determine, according to the address feature information corresponding to the first address information, and the address feature information corresponding to each of the second address information, a second address information candidate set that matches the first address information, where the A preset address feature matching condition is satisfied between each second address information in the second address information candidate set and the first address information.
  • the address information is likely to be manually input by the user, the input habits of different users are different, and the usage habits of the full-width pinyin, characters, traditional characters, and special symbols are brought to the address matching. It is very difficult, so before the address matching, these interferences can be removed first, that is, the address information is preprocessed in a format. For example, as shown in Table 1 below, there are several types of preprocessing listed.
  • the address feature information herein may include one or more of latitude and longitude information, POI information of interest points, and digital substring information (such as road number, building number, floor, and room number, etc., which are regarded as digital substring information).
  • Each of the first address information or the second address information may have one or more digital substrings, each of the digital substrings corresponding to a range of addresses of the address, and different digital substrings of the same address correspond to different ranges of addresses, such as roads.
  • the number and building number correspond to different location ranges.
  • the latitude and longitude information corresponding to the first address information and the second address information may be found by querying a latitude and longitude library storing the mapping relationship between the address information and the latitude and longitude.
  • the POI information may be extracted according to the keyword information of the preset corresponding POI information (for example, based on the keyword “road”: No. 256 Tianmushan Road, No. 18 Wantang Road), from the first address information and the second address information. Extract POI information.
  • the digital substring information is extracted according to a preset digital substring matching mode (for example, matching an exit number, a building number, a floor, and a room number in sequence).
  • the first address information is “2nd Floor, MM Building, No. 256 Tianmushan Road, Hangzhou, Zhejiang province”
  • the second address information is “2nd Floor, MM Building, No. 18 Wantang Road, Hangzhou, Zhejiang province”; for example, the first address information is analyzed.
  • the POI information is 256 Tianmushan Road, the latitude and longitude is (30.122, 102.589), and the fuzzy matching is used to extract the road number: 256, building number: no, floor: 2nd floor, room number: none, the number of the first address information
  • the substring information is: 256*2*, where "*" indicates that the location range of the identifier does not have a corresponding digital substring; the second address information is parsed, and the POI information is obtained as No. 18 Wantang Road, with a latitude and longitude of 30.121, 102.583), using fuzzy matching to extract the way number: 18 No., No.: No, Floor: 2nd floor, Room No.: None, the digital substring information of the second address information is 256*2*.
  • the first address information may be searched for and matched with the first address information according to a preset address feature matching condition.
  • the at least one second address information of the address feature matching condition is set to form a second address information candidate set.
  • the preset address feature matching condition that is satisfied between any second address information in the second address information candidate set and the first address information includes one or more of the following three conditions:
  • Condition 1 the distance between the latitude and longitude corresponding to the second address information and the first address information is less than the set distance threshold
  • Condition 2 the POI information corresponding to the second address information and the first address information are the same;
  • Condition 3 The maximum number of consecutive identical digital substrings between the second address information and the first address information is greater than a set number threshold.
  • the first address information is parsed to obtain the latitude and longitude corresponding to the first address information
  • the second address information is parsed to obtain the latitude and longitude corresponding to the second address information.
  • the latitude and longitude obtained by the address information may be based on the existing map. Application to achieve.
  • the k-d tree can be used for searching. Since the embodiment of the present application needs to perform searching based on the latitude and longitude points, a 2-dimensional k-d tree (ie, a 2-d tree) can be used. As shown in FIG. 3, it is a latitude and longitude scatter plot, and each dot represents a latitude and longitude point corresponding to an address information. If the traversal method is used, the distance between the latitude and longitude corresponding to each extracted second address information and the latitude and longitude corresponding to the first address information to be matched is calculated in turn, which wastes a lot of computing resources, and uses a 2-d tree search method. It is possible to select only the latitude and longitude (corresponding to the point in the dotted line frame) of the second address information whose latitude and longitude distance is relatively close to the first address information, so that a large amount of computing resources can be saved.
  • the POI information may be extracted from the first address information and the second address information respectively. If the separately extracted POI information is the same, the POI corresponding to the second address information and the first address information respectively is considered. The information is the same; however, since the same address usually has multiple different description modes, when the two account systems describe the same address in different description manners, the first address information and the second address information corresponding to the same address are extracted. The POI information may be different.
  • the extracted first address information and the POI information in each second address information may be mapped to the standardized POI information in the portal database by querying the stored portal database;
  • the first address information and the second address information are uniformly described by using the description manner specified in the portal database, and after the unified description mode, the first address information and the second address belonging to different account systems are determined. Whether the POI information of the address information is consistent, so that the matching success rate of the POI information can be improved.
  • the POI information obtained by parsing the first address information is 256 Tianmushan Road
  • the POI information obtained by parsing the second address information is No. 18 Wantang Road
  • the Tianmushan Road is confirmed by querying the portal database.
  • the standardization of No. 256 is Huanglong Times Square
  • No. 18 Wantang Road it is also Huanglong Times Square. Therefore, it can be considered that the POI information corresponding to the first address information and the second address information are the same.
  • the regular expression can be used to fuzzyly match the digital substrings of the wayout number, the building number, the floor and the room number from the preprocessed first address information and the second address information, when the first address information and the first address information
  • the address information is equal to at least N of the number substrings of the road number, the floor number, the floor, and the room number
  • the second address information is considered to satisfy the condition three
  • N is the set number threshold.
  • the second address information candidate set may be determined based on any of the foregoing conditions, or two or three of the foregoing conditions one to three may be combined to determine the second address information candidate set.
  • condition 1 and condition 3 are combined: for the first address information "No. 2, Unit 2, Building 2, No. 169 Han'an Avenue, Dongxing District, Neijiang City, Sichuan province", four digital substrings can be fuzzyly matched by regular expressions. : 169, 2, 2, 2 (here, the definition of each digital substring can be preset.
  • the definition of the digital substring is the road number, the building number, and the unit according to the geographic range from large to small. No., room number), at the same time, the latitude and longitude corresponding to the first address information is (29.602631, 105.071571).
  • the second address information classified in the second address information candidate set can be found with a radius of 3 km centering on the point corresponding to the latitude and longitude (29.602631, 105.071571), and only 3 is found.
  • their digital substrings are: 168, 2; 169, 2, 2; 168, 3, wherein the same numbers of the digital substrings 169, 2, 2, and 169, 2, 2, 2
  • the number of substrings ie, the public number string
  • is 3 which is greater than the set number threshold 2, so only the second address information corresponding to the digital substrings 169, 2, 2 can be classified into the second address information candidate set.
  • Second address information is 3 which is greater than the set number threshold 2, so only the second address information corresponding to the digital substrings 169, 2, 2 can be classified into the second address information candidate set.
  • the second address information candidate set is determined in combination with the condition 2, the second address information that is included in the second address information candidate set further satisfies: the standardized POI information in the corresponding door address library corresponds to the first address information.
  • the standardized POI information in the portal database is the same, such as the gate corresponding to the second address information in the second address information candidate set.
  • the standardized POI information in the location database is “169 Han'an Avenue, Dongxing District, Neijiang City, Sichuan province”, and the standardized POI information in the portal database corresponding to the first address information is the same.
  • S203 Select a second address information that matches the first address information from the determined second address information candidate set.
  • the second address information when there is only one second address information in the second address information candidate set, the second address information may be used as the finally selected second address information; if the second address information candidate set is included There is a plurality of second address information, and one selection manner is: the determined second address information candidate set can be presented to the user, and the user selects a best matching second address information from the second address information candidate set. Another selection manner is: calculating a matching score between each second address information in the second address information candidate set and the first address information, and selecting a second matching address with the highest matching degree based on the calculated matching score. For details, see the description of the third embodiment.
  • S204 Establish a mapping relationship between the first account information to which the first address information belongs and the second account information to which the selected second address information belongs based on the selected second address information that matches the first address information. .
  • FIG. 5 it is a flowchart of a method for establishing a mapping relationship provided by Embodiment 3 of the present application
  • FIG. 6 is a schematic diagram of an address matching frame corresponding thereto, including the following steps:
  • S501 Extract first address information from the plurality of first account information provided by the first service party, and extract second address information from the plurality of second account information provided by the second service party.
  • the extracted first address information and the second address information may be pre-processed based on the preset pre-processing type, and the following steps are performed after the interference at the address matching is eliminated.
  • the full-width half-width Chinese numerals to Arabic numerals, traditional to simplified, English uppercase to lowercase.
  • S502 Extract latitude and longitude information, POI information and digital substring information from each of the first address information and the second address information, and perform the following S503 to S508 for each of the extracted first address information.
  • S503 Determine, for each second address information, whether the POI information corresponding to the second address information and the first address information are the same. If yes, proceed to S504. If not, determine that the second address information does not belong to the Determining a second set of address information candidates that match the first address information.
  • the POI information in the first address information and the second address information may be first standardized by the portal database, and then it is determined whether the standardized PIO information is consistent.
  • S504 After determining that the POI information corresponding to the second address information and the first address information are the same, determining the second address information and the first address information based on the latitude and longitude information of the second address information and the first address information. Determining the distance between the latitude and longitude corresponding to the information, and determining the consecutive number of consecutive digital substrings between the second address information and the first address information based on the respective digital substrings of the second address information and the first address information The maximum value.
  • S505 The distance between the latitude and longitude corresponding to the second address information and the first address information is less than a set distance threshold, and the number of consecutive digital substrings between the second address information and the first address information is consecutive When the maximum value is greater than the set number threshold, the second address information is classified into the second address information candidate set that matches the first address information.
  • S506 Determine, according to the address feature information corresponding to the first address information, and the determined address feature information of each second address information in the determined second address information candidate set, each of the second address information candidate sets. A matching score between the second address information and the first address information.
  • a matching score between the second address information and the first address information under each address feature may be separately determined, Then, the matching scores between the second address information and the first address information under different kinds of address characteristics are added as a matching score between the second address information and the first address information.
  • the latitude and longitude matching score and the latitude and longitude matching importance degree between the second address information and the first address information may be determined according to any second address information in the determined second address information candidate set, and Determining a digital substring matching score and a digital substring matching importance between the second address information and the first address information, and determining a matching score between the second address information and the first address information .
  • the importance and number substrings match the importance
  • S1 is the latitude and longitude matching score
  • S2 is the digital substring matching score.
  • it can be set empirically, the values of ⁇ 1 and ⁇ 2 , for example, ⁇ 1 can be 0.3, and ⁇ 2 can be 0.7.
  • the latitude and longitude matching score S1 between the any second address information and the first address information may be determined according to the following formula:
  • d(c1, c2) represents the distance between the latitude and longitude c1 corresponding to any second address information and the latitude and longitude c2 corresponding to the first address information
  • k is a constant
  • the digital substring matching score S2 between any of the second address information and the first address information may be determined according to the following formula:
  • N is the number of consecutive maximum number of sub-strings between the second address information and the first address information
  • p is a constant
  • S507 Select a second address information corresponding to the highest matching score as the second address information that matches the first address information.
  • S508 Establish a mapping relationship between the first account information to which the first address information belongs and the second account information to which the second address information that matches the first address information belongs.
  • the embodiment of the present application further provides a mapping relationship establishing device corresponding to the mapping relationship establishing method, and the principle of the device solving the problem is similar to the mapping relationship establishing method in the embodiment of the present application, so the device is Implementation can refer to the implementation of the method, and the repetition will not be repeated.
  • FIG. 7 is a schematic structural diagram of an address information-based account mapping apparatus according to an embodiment of the present application, including:
  • the extracting module 71 is configured to separately extract first address information from the plurality of first account information provided by the first service party, and extract second address information from the plurality of second account information provided by the second service party;
  • the searching module 72 is configured to search, for each first address information extracted by the extraction module 71, the second address information that matches the first address information from each of the extracted second address information;
  • the mapping module 73 is configured to: after the searching module 72 finds the second address information that matches the first address information, establish first account information to which the first address information belongs, and the first searched by the searching module The mapping relationship between the second account information to which the second address information belongs.
  • the searching module 72 is specifically configured to:
  • the address feature information includes one or more of latitude and longitude information, POI information, and digital substring information;
  • the preset address feature matching condition that is satisfied between any second address information in the second address information candidate set and the first address information includes one or more of the following conditions:
  • the distance between the latitude and longitude corresponding to the second address information and the first address information respectively is less than the set distance threshold
  • the second PO information of the second address information and the first address information respectively are the same;
  • the maximum number of consecutive identical digital substrings between the second address information and the first address information is greater than a set number threshold, wherein each digital substring corresponds to a location range of the address, the same address Different digital substrings correspond to different ranges of addresses.
  • the searching module 72 is specifically configured to:
  • the latitude and longitude information corresponding to the extracted first address information and the second address information is found by querying the latitude and longitude library storing the mapping relationship between the address information and the latitude and longitude;
  • the address feature information includes POI information, extracting POI information from the first address information and the second address information according to the keyword information of the preset corresponding POI information;
  • the digital substring information is extracted from the first address information and the second address information according to a preset digital substring matching mode.
  • the searching module 72 is further configured to:
  • the extraction module 71 is specifically configured to:
  • the searching module 72 is specifically configured to:
  • the searching module 72 is specifically configured to:
  • the searching module 72 is specifically configured to determine a latitude and longitude matching score S1 between the any second address information and the first address information according to the following formula:
  • d(c1, c2) represents the distance between the latitude and longitude c1 corresponding to any second address information and the latitude and longitude c2 corresponding to the first address information
  • k is a constant
  • the searching module 72 is specifically configured to determine a digital substring matching score S2 between the any second address information and the first address information according to the following formula:
  • N is the number of consecutive maximum number of sub-strings between the second address information and the first address information
  • p is a constant
  • embodiments of the present application can be provided as a method, system, or computer program product.
  • the present application can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment in combination of software and hardware.
  • the application can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) including computer usable program code.
  • These computer program instructions can also be stored in a particular computer capable of booting a computer or other programmable data processing device In a computer readable memory that operates in a computer readable memory, causing instructions stored in the computer readable memory to produce an article of manufacture comprising instruction means implemented in a block or in a flow or a flow diagram and/or block diagram of the flowchart The functions specified in the boxes.
  • These computer program instructions can also be loaded onto a computer or other programmable data processing device such that a series of operational steps are performed on a computer or other programmable device to produce computer-implemented processing for execution on a computer or other programmable device.
  • the instructions provide steps for implementing the functions specified in one or more of the flow or in a block or blocks of a flow diagram.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Accounting & Taxation (AREA)
  • Economics (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Remote Sensing (AREA)
  • Development Economics (AREA)
  • Primary Health Care (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Water Supply & Treatment (AREA)
  • Tourism & Hospitality (AREA)
  • Public Health (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Fuzzy Systems (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Telephonic Communication Services (AREA)
  • Navigation (AREA)

Abstract

一种基于地址信息的账户映射方法及装置,用以实现建立不同账户体系之间的关联关系,涉及网络技术领域。提供的基于地址信息的账户映射方法包括:从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息(S101);针对提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息(S102);在查找到与该第一地址信息匹配的第二地址信息后,建立该第一地址信息所属的第一账户信息,和查找到的该第二地址信息所属的第二账户信息之间的映射关系(S103)。

Description

一种基于地址信息的账户映射方法及装置
本申请要求2015年10月14日递交的申请号为201510661892.4、发明名称为“一种基于地址信息的账户映射方法及装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及网络技术领域,尤其涉及一种基于地址信息的账户映射方法及装置。
背景技术
随着互联网信息技术的发展,各个服务方通过互联网为用户提供业务服务变得越来越普遍。比如,用户普遍都需要的生活缴费服务,如缴纳电费、水费等,也已经可以通过互联网来满足用户需求。
目前,用户一般可以通过应用服务方提供的应用软件来进行生活缴费,在缴费过程中,需要输入收费服务方提供的缴费户号,但由于缴费户号不便于用户记忆,导致用户体验较差。
应用服务方通常拥有自己的账户体系,其中包括应用账号及用户地址信息等,以便为用户提供多种不同的业务服务,比如除生活缴费服务外,还可以为用户提供信息推介服务、物品获取服务等。而作为生活缴费的收费服务方也拥有自己的账户体系,其中包括缴费户号和用户地址信息等。可见,如果能建立应用服务方的账户体系和收费服务方的账户体系之间的关联关系,应用服务方就可以直接通过用户注册的应用账号来为用户提供缴费服务。
目前,尚没有相关技术能够实现建立不同账户体系之间的关联关系。
发明内容
本申请实施例提供一种基于地址信息的账户映射方法及装置,用以实现建立不同账户体系之间的关联关系。
本申请实施例提供一种基于地址信息的账户映射方法,包括:
从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息;
针对提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址 信息匹配的第二地址信息;
在查找到与该第一地址信息匹配的第二地址信息后,建立该第一地址信息所属的第一账户信息,和查找到的该第二地址信息所属的第二账户信息之间的映射关系。
可选地,从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息之后,还包括:
从提取的各个第一地址信息和第二地址信息中分别提取地址特征信息;
针对提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息,包括:
根据该第一地址信息对应的地址特征信息,以及所述各个第二地址信息分别对应的地址特征信息,确定出与该第一地址信息匹配的第二地址信息候选集合;其中,所述第二地址信息候选集合中的每个第二地址信息与该第一地址信息之间满足预设的地址特征匹配条件;
从确定的所述第二地址信息候选集合中选择一个与该第一地址信息匹配的第二地址信息。
可选地,所述地址特征信息包括经纬度信息、兴趣点POI信息和数字子串信息中的一种或多种;
所述第二地址信息候选集合中的任一第二地址信息与该第一地址信息之间满足的所述预设的地址特征匹配条件包括以下条件中的一种或多种:
该任一第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值;
该任一第二地址信息与该第一地址信息分别对应的POI信息相同;
该任一第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值,其中,每个数字子串对应地址的一个位置范围,同一地址的不同数字子串对应该地址的不同位置范围。
可选地,若所述地址特征信息包括经纬度信息,则通过查询存储有地址信息与经纬度的映射关系的经纬度库,查找到与提取的所述第一地址信息和第二地址信息对应的经纬度信息;
若所述地址特征信息包括POI信息,则根据预设的对应POI信息的关键字信息,从所述第一地址信息和第二地址信息中提取出POI信息;
若所述地址特征信息包括数字子串信息,则根据预设的数字子串匹配模式,从所述 第一地址信息和第二地址信息中提取出所述数字子串信息。
可选地,根据以下步骤确定所述任一第二地址信息与第一地址信息分别对应的POI信息相同:
当确定所述任一第二地址信息与第一地址信息中存在POI信息后,将所述任一第二地址信息与第一地址信息中的POI信息映射为所述门址库中的标准化POI信息;
当该任一第二地址信息映射的所述门址库中的标准化POI信息与该第一地址信息映射的所述门址库中的标准化POI信息相同时,确定所述任一第二地址信息与第一地址信息分别对应的POI信息相同。
可选地,从提取的各个第一地址信息和第二地址信息中分别提取地址特征信息,包括:
从提取的各个第一地址信息和第二地址信息中分别提取经纬度信息、兴趣点POI信息和数字子串信息;
根据该第一地址信息对应的地址特征信息,以及所述各个第二地址信息分别对应的地址特征信息,确定出与该第一地址信息匹配的第二地址信息候选集合,包括:
针对每个第二地址信息,判断该第二地址信息与该第一地址信息分别对应的POI信息是否相同;
当确定该第二地址信息与第一地址信息分别对应的POI信息相同后,基于该第二地址信息与第一地址信息各自的经纬度信息,确定该第二地址信息与该第一地址信息分别对应的经纬度之间的距离,以及基于该第二地址信息与第一地址信息各自的数字子串信息,确定该第二地址信息与该第一地址信息之间连续相同的数字子串个数的最大值;
当该第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值、且该第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值时,将该第二地址信息归入与该第一地址信息匹配的第二地址信息候选集合中。
可选地,所述从确定的所述第二地址信息候选集合中选择一个与该第一地址信息匹配的第二地址信息,包括:
针对确定出的第二地址信息候选集合中的任一第二地址信息,根据该任一第二地址信息与该第一地址信息之间的经纬度匹配分值和经纬度匹配重要度、以及该任一第二地址信息与该第一地址信息之间的数字子串匹配分值和数字子串匹配重要度,确定该任一第二地址信息与该第一地址信息之间的匹配分值;
选取对应的匹配分值最高的第二地址信息作为与该第一地址信息匹配的第二地址信息。
可选地,根据以下公式确定所述任一第二地址信息与第一地址信息之间的经纬度匹配分值S1:
〖S1=ke〗^(-d(c1,c2))
其中,d(c1,c2)表示该任一第二地址信息对应的经纬度c1和该第一地址信息对应的经纬度c2之间的距离,k为常数。
可选地,根据以下公式确定所述任一第二地址信息与第一地址信息之间的数字子串匹配分值S2:
S2=pN
其中,N为将该任一第二地址信息和该第一地址信息之间连续相同的最大数字子串个数,p为常数。
本申请实施例提供一种基于地址信息的账户映射装置,包括:
提取模块,用于从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息;
查找模块,用于针对所述提取模块提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息;
映射模块,用于在所述查找模块查找到与该第一地址信息匹配的第二地址信息后,建立该第一地址信息所属的第一账户信息,和所述查找模块查找到的第二地址信息所属的第二账户信息之间的映射关系。
可见,本申请实施例通过账户信息中包含的地址信息来建立不同账户体系的账户信息之间的映射关系,通过该映射关系,可以基于用户在第一服务方处注册的账户信息定位到用户在第二服务方处的账户信息,从而可以通过第一服务方提供的窗口来完成与第二服务方相关的业务处理。
附图说明
图1为本申请实施例一提供的基于地址信息的账户映射方法流程图;
图2为本申请实施例二提供的基于地址信息的账户映射方法流程图;
图3为经纬度散点图;
图4为确定第二地址信息候选集合的示意图;
图5为本申请实施例三提供的基于地址信息的账户映射方法流程图;
图6为与图5对应的地址匹配框架示意图;
图7为本申请实施例提供的基于地址信息的账户映射装置结构示意图。
具体实施方式
本申请实施例中,从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息;针对提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息;在查找到与该第一地址信息匹配的第二地址信息后,建立该第一地址信息所属的第一账户信息,和查找到的该第二地址信息所属的第二账户信息之间的映射关系。可见,本申请实施例通过账户信息中包含的地址信息来建立不同账户体系的账户信息之间的映射关系,通过该映射关系,可以基于用户在第一服务方处注册的账户信息定位到用户在第二服务方处的账户信息,从而可以通过第一服务方提供的窗口来完成与第二服务方相关的业务处理。比如,当第一服务方为应用服务方,第二服务方为生活缴费(水费、电费、煤气费等)的收费服务方时,可以通过第一服务方提供的应用软件来为第二服务方提供的收费项目进行缴费。
下面结合说明书附图对本申请实施例作进一步详细描述。
实施例一
如图1所示,为本申请实施例一提供的基于地址信息的账户映射方法流程图,包括以下步骤:
S101:从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息。
本申请实施例的执行主体可以是第一服务方的服务器。第一服务方可以是为用户提供应用服务的服务方,第一服务方的第一账户信息中可以包含用户注册的账号、及用户注册或使用应用服务时填写的第一地址信息,比如收货地址等。
在具体实施中,第二服务方可以将各个用户的第二账户信息提供给第一服务方。这里的第二账户信息中包括用户的账号(比如缴费户号)和第二地址信息等。
S102:针对提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息。
该步骤中,为各个第一账户信息中的第一地址信息,查找匹配的第二地址信息。这 里,第一地址信息与第二地址信息匹配是指第一地址信息和第二地址信息所指向的地址是一致的。第一地址信息和第二地址信息所指向的地址一致可以包括第一地址信息和第二地址信息相同的情况,也可以包括二者虽然表现形式不同,但所指向的地址实质为同一地址的情况,基于第二种情况存在的可能性较大,本申请实施例二中给出了具体如何查找与第一地址信息匹配的第二地址信息的实施方式,详见后续实施例二的描述。
S103:在查找到与该第一地址信息匹配的第二地址信息后,建立该第一地址信息所属的第一账户信息,和查找到的该第二地址信息所属的第二账户信息之间的映射关系。
这里,通过查找到的与每个第一地址信息匹配的第二地址信息后,就可以通过地址信息建立起两个账户体系之间的映射关系了。也即,在本申请实施例中,认为地址信息一致的两个账户属于同一用户。
对上述方法的应用:
当接收到用户触发的与第二账户信息相关的业务处理请求后(比如这里的业务处理请求为生活缴费请求,相应地,第二账户信息为生活缴费账户信息,不过这里假设用户并没有提供第二账户信息,只是触发了生活缴费请求),根据建立的映射关系及该用户的第一账户信息,确定与该第一账户信息对应的第二账户信息,并针对查找到的第二账户信息执行所述业务处理请求。
实施例二
由于在不同账户体系中,对同一地址进行描述的方式、内容可能不同、描述的详细程度也可能不同等,在进行地址匹配时,若只有当第一地址信息和第二地址信息完全相同时,才认为二者匹配,那么将会导致地址信息的匹配成功率非常低,基于此,本申请实施例二给出了进行地址匹配的优选实施方式。
如图2所示,为本申请实施例二提供的基于地址信息的账户映射方法流程图,包括以下步骤:
S201:从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息,并针对提取的每个第一地址信息,执行下述S202~S204。
S202:根据该第一地址信息对应的地址特征信息,以及所述各个第二地址信息分别对应的地址特征信息,确定出与该第一地址信息匹配的第二地址信息候选集合;其中,所述第二地址信息候选集合中的每个第二地址信息与该第一地址信息之间满足预设的地址特征匹配条件。
在具体实施过程中,由于地址信息很可能是由用户自己手工输入的,不同的用户的输入习惯不同,对全角拼音、字符、繁体字、和特殊符号等的使用习惯不同给地址匹配带来了很大困难,因此在进行地址匹配之前,可以先将这些干扰去除,也即对地址信息进行格式上的预处理。比如,如下表一所示,为列举的几种预处理类型。
预处理类型 示例
全角转半角 “【”转为“[”
中文数字转阿拉伯数字 “一号楼”转为“1号楼”
繁体转简体 “一號樓”转为“一号楼”
英文大写转小写 “A区”转为“a区”
特殊符号替换 “III号”转为“3号”
表一
在对各个第一地址信息和第二地址信息进行预处理之后,对各个第一地址信息和第二地址信息进行解析,得到各个第一地址信息和第二地址信息分别对应的地址特征信息。这里的地址特征信息可以包括经纬度信息、兴趣点POI信息和数字子串信息(比如说路号、楼号、楼层和房号等被看作是数字子串信息)中的一种或多种,每个第一地址信息或第二地址信息可能具有一个或多个数字子串,每个数字子串对应地址的一个位置范围,同一地址的不同数字子串对应该地址的不同位置范围,比如路号、楼号对应不同的位置范围。
在具体实施中,可以通过查询存储有地址信息与经纬度的映射关系的经纬度库,查找到所述第一地址信息和第二地址信息对应的经纬度信息。可以根据预设的对应POI信息的关键字信息(比如基于关键字“路”,提取出POI信息:天目山路256号、万塘路18号),从所述第一地址信息和第二地址信息中提取出POI信息。另外,根据预设的数字子串匹配模式(比如依次匹配出路号、楼号、楼层、房号),提取出所述数字子串信息。
以第一地址信息为“浙江省杭州市天目山路256号MM大楼2楼”,第二地址信息为“浙江省杭州市万塘路18号MM大楼2楼”为例;对第一地址信息进行解析,得到POI信息为天目山路256号,经纬度为(30.122、102.589),采用模糊匹配抽取出路号:256号、楼号:无、楼层:2楼、房号:无,该第一地址信息的数字子串信息即为:256*2*,其中“*”表示其标识的位置范围没有对应的数字子串;对第二地址信息进行解析,得到POI信息为万塘路18号,经纬度为(30.121、102.583),采用模糊匹配抽取出路号:18 号、楼号:无、楼层:2楼、房号:无,该第二地址信息的数字子串信息即为256*2*。
在从各个第一地址信息和第二地址信息中解析得到上述地址特征信息后,针对每个第一地址信息,可以根据预设的地址特征匹配条件,查找与该第一地址信息之间满足预设的地址特征匹配条件的至少一个第二地址信息,组成第二地址信息候选集合。第二地址信息候选集合中的任一第二地址信息与该第一地址信息之间满足的所述预设的地址特征匹配条件包括以下三个条件中的一种或多种:
条件一:该任一第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值;
条件二:该任一第二地址信息与该第一地址信息分别对应的POI信息相同;
条件三:该任一第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值。
针对上述条件一,分别对第一地址信息解析得到第一地址信息对应的经纬度,对第二地址信息解析得到第二地址信息对应的经纬度,这里,通过地址信息得到经纬度可以是基于已有的地图应用来实现的。
在具体实施过程中,为了提高查找效率,可以采用k-d树的方式来进行查找,由于本申请实施例需要基于经纬度点来进行查找,因此可以采用2维k-d树(即2-d树)。如图3所示,为经纬度散点图,每一个圆点表示一个地址信息对应的经纬度点。如果采用遍历的方式,依次计算提取的每一个第二地址信息对应的经纬度和待匹配的第一地址信息对应的经纬度之间的距离,就会浪费很多计算资源,采用2-d树搜索方式,可以做到只选择与该第一地址信息对应的经纬度距离较近的第二地址信息的经纬度(对应虚线框中的点)进行计算,这样可以节省大量的计算资源。
针对上述条件二,可以从第一地址信息和第二地址信息中分别提取出POI信息,若分别提取的POI信息相同,则认为该任一第二地址信息与该第一地址信息分别对应的POI信息相同;但是,由于同一地址通常存在多种不同的描述方式,当两个账户体系对同一地址采用不同的描述方式进行描述时,从对应同一地址的第一地址信息和第二地址信息中提取出的POI信息就可能存在不同。
基于此,在S201之后,可以通过查询存储的门址库,将提取的各个第一地址信息和各个第二地址信息中的POI信息映射为所述门址库中的标准化POI信息;
然后,在S202中,根据以下步骤确定所述任一第二地址信息与第一地址信息分别对应的POI信息相同:
当该任一第二地址信息映射的所述门址库中的标准化POI信息与该第一地址信息映射的所述门址库中的标准化POI信息相同时,确定所述任一第二地址信息与第一地址信息分别对应的POI信息相同。
在该方式下,对各个第一地址信息和第二地址信息统一采用门址库中规定的描述方式进行描述,在统一描述方式后,再判断属于不同账户体系中的第一地址信息和第二地址信息的POI信息是否一致,这样可以提高对POI信息的匹配成功率。比如,在上述举例中,对第一地址信息进行解析得到的POI信息为天目山路256号,对第二地址信息进行解析得到的POI信息为万塘路18号,通过查询门址库,确认天目山路256号标准化后为黄龙时代广场,万塘路18号标准化后也为黄龙时代广场,因此可以认为该第一地址信息和第二地址信息分别对应的POI信息相同。
针对上述条件三,可以利用正则表达式从预处理后的各个第一地址信息和第二地址信息中模糊匹配出路号、楼号、楼层和房号这些数字子串,当第一地址信息和第二地址信息在路号、楼号、楼层和房号这些数字子串中的至少N个按照顺序相等时,认为满足条件三,N即为所述设定个数阈值。
在具体实施过程中,可以基于上述任一条件来确定第二地址信息候选集合,也可以将上述条件一~三中的两个或三个进行组合来确定第二地址信息候选集合。
例如:将上述条件一和条件三进行组合:对于第一地址信息“四川省内江市东兴区汉安大道169号2幢2单元2号”,可以通过正则表达式模糊匹配出4个数字子串:169、2、2、2(这里,每个数字子串的定义可以是预先设置的,比如在这里,数字子串定义按照地理范围由大到小的顺序依次为路号、楼号、单元号、房号),同时,该第一地址信息对应的经纬度为(29.602631、105.071571)。假设设置的设定距离阈值为3公里,则可以以经纬度(29.602631、105.071571)对应的点为中心,以半径为3公里寻找归入第二地址信息候选集合中的第二地址信息,发现只有3个,如图4所示,它们的数字子串分别为:168、2;169、2、2;168、3,其中数字子串169、2、2和169、2、2、2的相同数字子串(即公共数字串)个数为3,大于设定个数阈值2,因此只有数字子串169、2、2对应的第二地址信息才是可以归入第二地址信息候选集合中的第二地址信息。如果再结合条件二来确定第二地址信息候选集合,则归入第二地址信息候选集合中的第二地址信息还满足:其对应的门址库中的标准化POI信息与第一地址信息对应的门址库中的标准化POI信息相同,比如这里归入第二地址信息候选集合中的第二地址信息对应的门 址库中的标准化POI信息为“四川省内江市东兴区汉安大道169号”,与第一地址信息对应的门址库中的标准化POI信息相同。
S203:从确定的所述第二地址信息候选集合中选择一个与该第一地址信息匹配的第二地址信息。
在具体实施过程中,当所述第二地址信息候选集合中只有一个第二地址信息时,可以将该第二地址信息作为最终选择的第二地址信息;若所述第二地址信息候选集合中存在多个第二地址信息,一种选择方式是:可以将确定的第二地址信息候选集合呈现给用户,由用户从第二地址信息候选集合中选择一个最匹配的第二地址信息。另一种选择方式是:计算第二地址信息候选集合中的各个第二地址信息分别与第一地址信息之间的匹配分值,基于计算得到的匹配分值,选择匹配程度最高的第二地址信息,详见实施例三的描述。
S204:基于选择的与该第一地址信息匹配的第二地址信息,建立该第一地址信息所属的第一账户信息,和选择的该第二地址信息所属的第二账户信息之间的映射关系。
实施例三
如图5所示,为本申请实施例三提供的映射关系建立方法流程图,图6为与之对应的地址匹配框架示意图,包括以下步骤:
S501:从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息。
在该步骤之后,可以首先基于预设的预处理类型,对提取的所述第一地址信息及第二地址信息进行格式上的预处理,在消除地址匹配时的干扰后再执行下述步骤,比如全角转半角、中文数字转阿拉伯数字、繁体转简体、英文大写转小写等。
S502:从各个第一地址信息和第二地址信息中分别提取经纬度信息、兴趣点POI信息和数字子串信息,并针对提取的每个第一地址信息,执行下述S503~S508。
S503:针对每个第二地址信息,判断该第二地址信息与该第一地址信息分别对应的POI信息是否相同,若相同,则进入S504,若不同,则确定该第二地址信息不属于所述与该第一地址信息匹配的第二地址信息候选集合。
在执行该步骤时,可以首先将第一地址信息及第二地址信息中的POI信息通过门址库进行标准化,然后判断标准化后的PIO信息是否一致。
S504:当确定该第二地址信息与第一地址信息分别对应的POI信息相同后,基于该第二地址信息与第一地址信息各自的经纬度信息,确定该第二地址信息与该第一地址信 息分别对应的经纬度之间的距离,以及基于该第二地址信息与第一地址信息各自的数字子串,确定该第二地址信息与该第一地址信息之间连续相同的数字子串个数的最大值。
S505:当该第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值、且该第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值时,将该第二地址信息归入与该第一地址信息匹配的第二地址信息候选集合中。
S506:根据该第一地址信息对应的地址特征信息,以及确定出的第二地址信息候选集合中的每个第二地址信息的地址特征信息,确定所述第二地址信息候选集合中的每个第二地址信息与该第一地址信息之间的匹配分值。
在具体实施过程中,针对所述第二地址信息候选集合中的每个第二地址信息,可以分别确定该第二地址信息与第一地址信息之间在每种地址特征下的匹配分值,然后将该第二地址信息与第一地址信息之间在不同种地址特征下的匹配分值相加,作为该第二地址信息与该第一地址信息之间的匹配分值。
比如,可以针对确定出的第二地址信息候选集合中的任一第二地址信息,根据该任一第二地址信息与该第一地址信息之间的经纬度匹配分值和经纬度匹配重要度、以及该任一第二地址信息与该第一地址信息之间的数字子串匹配分值和数字子串匹配重要度,确定该任一第二地址信息与该第一地址信息之间的匹配分值。
这里,可以根据公式S=β1S1+β2S2,来确定该任一第二地址信息与该第一地址信息之间的匹配分值S,这里,,β1、β2分别为经纬度匹配重要度和数字子串匹配重要度,S1为经纬度匹配分值,S2为数字子串匹配分值。在实际实施中,可以根据经验设置,β1、β2的取值,比如β1可以取值为0.3,β2可以取值为0.7。
可选地,可以根据以下公式确定所述任一第二地址信息与第一地址信息之间的经纬度匹配分值S1:
S1=ke-d(c1,c2)
其中,d(c1,c2)表示该任一第二地址信息对应的经纬度c1和该第一地址信息对应的经纬度c2之间的距离,k为常数。
可以根据以下公式确定所述任一第二地址信息与第一地址信息之间的数字子串匹配分值S2:
S2=pN
其中,N为将该任一第二地址信息和该第一地址信息之间连续相同的最大数字子串个数,p为常数。
S507:选取对应的匹配分值最高的第二地址信息作为与该第一地址信息匹配的第二地址信息。
S508:建立该第一地址信息所属的第一账户信息,和与该第一地址信息匹配的第二地址信息所属的第二账户信息之间的映射关系。
基于同一发明构思,本申请实施例中还提供了一种与映射关系建立方法对应的映射关系建立装置,由于该装置解决问题的原理与本申请实施例的映射关系建立方法相似,因此该装置的实施可以参见方法的实施,重复之处不再赘述。
如图7所示,为本申请实施例提供的基于地址信息的账户映射装置结构示意图,包括:
提取模块71,用于从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息;
查找模块72,用于针对所述提取模块71提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息;
映射模块73,用于在所述查找模块72查找到与该第一地址信息匹配的第二地址信息后,建立该第一地址信息所属的第一账户信息,和所述查找模块查找到的第二地址信息所属的第二账户信息之间的映射关系。
可选地,所述查找模块72具体用于:
根据该第一地址信息对应的地址特征信息,以及所述各个第二地址信息分别对应的地址特征信息,确定出与该第一地址信息匹配的第二地址信息候选集合;其中,所述第二地址信息候选集合中的每个第二地址信息与该第一地址信息之间满足预设的地址特征匹配条件;从确定的所述第二地址信息候选集合中选择一个与该第一地址信息匹配的第二地址信息。
可选地,所述地址特征信息包括经纬度信息、兴趣点POI信息和数字子串信息中的一种或多种;
所述第二地址信息候选集合中的任一第二地址信息与该第一地址信息之间满足的所述预设的地址特征匹配条件包括以下条件中的一种或多种:
该任一第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值;
该任一第二地址信息与该第一地址信息分别对应的POI信息相同;
该任一第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值,其中,每个数字子串对应地址的一个位置范围,同一地址的不同数字子串对应该地址的不同位置范围。
可选地,所述查找模块72具体用于:
若所述地址特征信息包括经纬度信息,则通过查询存储有地址信息与经纬度的映射关系的经纬度库,查找到与提取的所述第一地址信息和第二地址信息对应的经纬度信息;
若所述地址特征信息包括POI信息,则根据预设的对应POI信息的关键字信息,从所述第一地址信息和第二地址信息中提取出POI信息;
若所述地址特征信息包括数字子串信息,则根据预设的数字子串匹配模式,从所述第一地址信息和第二地址信息中提取出所述数字子串信息。
可选地,所述查找模块72还用于:
当确定所述任一第二地址信息与第一地址信息中存在POI信息后,将所述任一第二地址信息与第一地址信息中的POI信息映射为所述门址库中的标准化POI信息;
当该任一第二地址信息映射的所述门址库中的标准化POI信息与该第一地址信息映射的所述门址库中的标准化POI信息相同时,确定所述任一第二地址信息与第一地址信息分别对应的POI信息相同。
可选地,所述提取模块71具体用于:
从提取的各个第一地址信息和第二地址信息中分别提取经纬度信息、兴趣点POI信息和数字子串信息;
所述查找模块72具体用于:
针对每个第二地址信息,判断该第二地址信息与该第一地址信息分别对应的POI信息是否相同;当确定该第二地址信息与第一地址信息分别对应的POI信息相同后,基于该第二地址信息与第一地址信息各自的经纬度信息,确定该第二地址信息与该第一地址信息分别对应的经纬度之间的距离,以及基于该第二地址信息与第一地址信息各自的数字子串信息,确定该第二地址信息与该第一地址信息之间连续相同的数字子串个数的最大值;当该第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值、且该第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值时,将该第二地址信息归入与该第一地址信息匹配的第二地址信息候选集合中。
可选地,所述查找模块72具体用于:
针对确定出的第二地址信息候选集合中的任一第二地址信息,根据该任一第二地址信息与该第一地址信息之间的经纬度匹配分值和经纬度匹配重要度、以及该任一第二地址信息与该第一地址信息之间的数字子串匹配分值和数字子串匹配重要度,确定该任一第二地址信息与该第一地址信息之间的匹配分值;选取对应的匹配分值最高的第二地址信息作为与该第一地址信息匹配的第二地址信息。
可选地,所述查找模块72具体用于根据以下公式确定所述任一第二地址信息与第一地址信息之间的经纬度匹配分值S1:
S1=ke-d(c1,c2)
其中,d(c1,c2)表示该任一第二地址信息对应的经纬度c1和该第一地址信息对应的经纬度c2之间的距离,k为常数。
可选地,所述查找模块72具体用于根据以下公式确定所述任一第二地址信息与第一地址信息之间的数字子串匹配分值S2:
S2=pN
其中,N为将该任一第二地址信息和该第一地址信息之间连续相同的最大数字子串个数,p为常数。
本领域内的技术人员应明白,本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。
本申请是参照根据本申请实施例的方法、装置(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方 式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。
尽管已描述了本申请的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本申请范围的所有变更和修改。
显然,本领域的技术人员可以对本申请进行各种改动和变型而不脱离本申请的精神和范围。这样,倘若本申请的这些修改和变型属于本申请权利要求及其等同技术的范围之内,则本申请也意图包含这些改动和变型在内。

Claims (18)

  1. 一种基于地址信息的账户映射方法,其特征在于,该方法包括:
    从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息;
    针对提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息;
    在查找到与该第一地址信息匹配的第二地址信息后,建立该第一地址信息所属的第一账户信息,和查找到的该第二地址信息所属的第二账户信息之间的映射关系。
  2. 如权利要求1所述的方法,其特征在于,从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息之后,还包括:
    从提取的各个第一地址信息和第二地址信息中分别提取地址特征信息;
    针对提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息,包括:
    根据该第一地址信息对应的地址特征信息,以及所述各个第二地址信息分别对应的地址特征信息,确定出与该第一地址信息匹配的第二地址信息候选集合;其中,所述第二地址信息候选集合中的每个第二地址信息与该第一地址信息之间满足预设的地址特征匹配条件;
    从确定的所述第二地址信息候选集合中选择一个与该第一地址信息匹配的第二地址信息。
  3. 如权利要求2所述的方法,其特征在于,所述地址特征信息包括经纬度信息、兴趣点POI信息和数字子串信息中的一种或多种;
    所述第二地址信息候选集合中的任一第二地址信息与该第一地址信息之间满足的所述预设的地址特征匹配条件包括以下条件中的一种或多种:
    该任一第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值;
    该任一第二地址信息与该第一地址信息分别对应的POI信息相同;
    该任一第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值,其中,每个数字子串对应地址的一个位置范围,同一地址的不同数字子串对应该地址的不同位置范围。
  4. 如权利要求3所述的方法,其特征在于,根据以下步骤确定所述任一第二地址信息与第一地址信息分别对应的POI信息相同:
    当确定所述任一第二地址信息与第一地址信息中存在POI信息后,将所述任一第二地址信息与第一地址信息中的POI信息映射为所述门址库中的标准化POI信息;
    当该任一第二地址信息映射的所述门址库中的标准化POI信息与该第一地址信息映射的所述门址库中的标准化POI信息相同时,确定所述任一第二地址信息与第一地址信息分别对应的POI信息相同。
  5. 如权利要求2~4任一所述的方法,其特征在于,从提取的各个第一地址信息和第二地址信息中分别提取地址特征信息,包括:
    从提取的各个第一地址信息和第二地址信息中分别提取经纬度信息、兴趣点POI信息和数字子串信息;
    根据该第一地址信息对应的地址特征信息,以及所述各个第二地址信息分别对应的地址特征信息,确定出与该第一地址信息匹配的第二地址信息候选集合,包括:
    针对每个第二地址信息,判断该第二地址信息与该第一地址信息分别对应的POI信息是否相同;
    当确定该第二地址信息与第一地址信息分别对应的POI信息相同后,基于该第二地址信息与第一地址信息各自的经纬度信息,确定该第二地址信息与该第一地址信息分别对应的经纬度之间的距离,以及基于该第二地址信息与第一地址信息各自的数字子串信息,确定该第二地址信息与该第一地址信息之间连续相同的数字子串个数的最大值;
    当该第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值、且该第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值时,将该第二地址信息归入与该第一地址信息匹配的第二地址信息候选集合中。
  6. 如权利要求5所述的方法,其特征在于,所述从确定的所述第二地址信息候选集合中选择一个与该第一地址信息匹配的第二地址信息,包括:
    针对确定出的第二地址信息候选集合中的任一第二地址信息,根据该任一第二地址信息与该第一地址信息之间的经纬度匹配分值和经纬度匹配重要度、以及该任一第二地址信息与该第一地址信息之间的数字子串匹配分值和数字子串匹配重要度,确定该任一第二地址信息与该第一地址信息之间的匹配分值;
    选取对应的匹配分值最高的第二地址信息作为与该第一地址信息匹配的第二地址 信息。
  7. 如权利要求6所述的方法,其特征在于,根据以下公式确定所述任一第二地址信息与第一地址信息之间的经纬度匹配分值S1:
    S1=kg-d(c1,c2)
    其中,d(c1,c2)表示该任一第二地址信息对应的经纬度c1和该第一地址信息对应的经纬度c2之间的距离,k为常数。
  8. 如权利要求6所述的方法,其特征在于,根据以下公式确定所述任一第二地址信息与第一地址信息之间的数字子串匹配分值S2:
    S2=pN
    其中,N为将该任一第二地址信息和该第一地址信息之间连续相同的最大数字子串个数,p为常数。
  9. 如权利要求1所述的方法,其特征在于,针对提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息,包括:
    采用k-d树的方式,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息。
  10. 如权利要求1所述的方法,其特征在于,提取所述第一地址信息及第二地址信息之后,查找与该第一地址信息匹配的第二地址信息之前,还包括:
    根据预设的预处理类型,对提取的所述第一地址信息及第二地址信息进行格式上的预处理。
  11. 一种基于地址信息的账户映射装置,其特征在于,该装置包括:
    提取模块,用于从第一服务方提供的多个第一账户信息中分别提取第一地址信息,以及从第二服务方提供的多个第二账户信息中分别提取第二地址信息;
    查找模块,用于针对所述提取模块提取的每个第一地址信息,从提取的各个第二地址信息中,查找与该第一地址信息匹配的第二地址信息;
    映射模块,用于在所述查找模块查找到与该第一地址信息匹配的第二地址信息后,建立该第一地址信息所属的第一账户信息,和所述查找模块查找到的第二地址信息所属的第二账户信息之间的映射关系。
  12. 如权利要求11所述的装置,其特征在于,所述提取模块还用于:
    从提取的各个第一地址信息和第二地址信息中分别提取地址特征信息;
    所述查找模块具体用于:
    根据该第一地址信息对应的地址特征信息,以及所述各个第二地址信息分别对应的地址特征信息,确定出与该第一地址信息匹配的第二地址信息候选集合;其中,所述第二地址信息候选集合中的每个第二地址信息与该第一地址信息之间满足预设的地址特征匹配条件;从确定的所述第二地址信息候选集合中选择一个与该第一地址信息匹配的第二地址信息。
  13. 如权利要求12所述的装置,其特征在于,所述地址特征信息包括经纬度信息、兴趣点POI信息和数字子串信息中的一种或多种;
    所述第二地址信息候选集合中的任一第二地址信息与该第一地址信息之间满足的所述预设的地址特征匹配条件包括以下条件中的一种或多种:
    该任一第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值;
    该任一第二地址信息与该第一地址信息分别对应的POI信息相同;
    该任一第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值,其中,每个数字子串对应地址的一个位置范围,同一地址的不同数字子串对应该地址的不同位置范围。
  14. 如权利要求13所述的装置,其特征在于,所述查找模块还用于:
    当确定所述任一第二地址信息与第一地址信息中存在POI信息后,将所述任一第二地址信息与第一地址信息中的POI信息映射为所述门址库中的标准化POI信息;
    当该任一第二地址信息映射的所述门址库中的标准化POI信息与该第一地址信息映射的所述门址库中的标准化POI信息相同时,确定所述任一第二地址信息与第一地址信息分别对应的POI信息相同。
  15. 如权利要求12~14任一所述的装置,其特征在于,所述提取模块具体用于:
    从提取的各个第一地址信息和第二地址信息中分别提取经纬度信息、兴趣点POI信息和数字子串信息;
    所述查找模块具体用于:
    针对每个第二地址信息,判断该第二地址信息与该第一地址信息分别对应的POI信息是否相同;当确定该第二地址信息与第一地址信息分别对应的POI信息相同后,基于该第二地址信息与第一地址信息各自的经纬度信息,确定该第二地址信息与该第一地址信息分别对应的经纬度之间的距离,以及基于该第二地址信息与第一地址信息各自的 数字子串信息,确定该第二地址信息与该第一地址信息之间连续相同的数字子串个数的最大值;当该第二地址信息与该第一地址信息分别对应的经纬度之间的距离小于设定距离阈值、且该第二地址信息和该第一地址信息之间连续相同的数字子串个数的最大值大于设定个数阈值时,将该第二地址信息归入与该第一地址信息匹配的第二地址信息候选集合中。
  16. 如权利要求12所述的装置,其特征在于,所述查找模块具体用于:
    针对确定出的第二地址信息候选集合中的任一第二地址信息,根据该任一第二地址信息与该第一地址信息之间的经纬度匹配分值和经纬度匹配重要度、以及该任一第二地址信息与该第一地址信息之间的数字子串匹配分值和数字子串匹配重要度,确定该任一第二地址信息与该第一地址信息之间的匹配分值;选取对应的匹配分值最高的第二地址信息作为与该第一地址信息匹配的第二地址信息。
  17. 如权利要求16所述的装置,其特征在于,所述查找模块具体用于根据以下公式确定所述任一第二地址信息与第一地址信息之间的经纬度匹配分值S1:
    S1=ke-d(c1,c2)
    其中,d(c1,c2)表示该任一第二地址信息对应的经纬度c1和该第一地址信息对应的经纬度c2之间的距离,k为常数。
  18. 如权利要求16所述的装置,其特征在于,所述查找模块具体用于根据以下公式确定所述任一第二地址信息与第一地址信息之间的数字子串匹配分值S2:
    S2=pN
    其中,N为将该任一第二地址信息和该第一地址信息之间连续相同的最大数字子串个数,p为常数。
PCT/CN2016/101639 2015-10-14 2016-10-10 一种基于地址信息的账户映射方法及装置 WO2017063531A1 (zh)

Priority Applications (6)

Application Number Priority Date Filing Date Title
JP2018519762A JP6752475B2 (ja) 2015-10-14 2016-10-10 アドレス情報に基づいたアカウントマッピングの方法及び装置
SG11201803052QA SG11201803052QA (en) 2015-10-14 2016-10-10 Account mapping method and apparatus based on address information
KR1020187013633A KR102117773B1 (ko) 2015-10-14 2016-10-10 어드레스 정보에 기초한 어카운트 맵핑 방법 및 장치
EP16854910.3A EP3364309B1 (en) 2015-10-14 2016-10-10 Account mapping method and device based on address information
US15/949,981 US10990353B2 (en) 2015-10-14 2018-04-10 Address information-based account mapping method and apparatus
US16/720,916 US10725737B2 (en) 2015-10-14 2019-12-19 Address information-based account mapping method and apparatus

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510661892.4 2015-10-14
CN201510661892.4A CN106598965B (zh) 2015-10-14 2015-10-14 一种基于地址信息的账户映射方法及装置

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/949,981 Continuation US10990353B2 (en) 2015-10-14 2018-04-10 Address information-based account mapping method and apparatus

Publications (1)

Publication Number Publication Date
WO2017063531A1 true WO2017063531A1 (zh) 2017-04-20

Family

ID=58517080

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/101639 WO2017063531A1 (zh) 2015-10-14 2016-10-10 一种基于地址信息的账户映射方法及装置

Country Status (7)

Country Link
US (2) US10990353B2 (zh)
EP (1) EP3364309B1 (zh)
JP (1) JP6752475B2 (zh)
KR (1) KR102117773B1 (zh)
CN (1) CN106598965B (zh)
SG (1) SG11201803052QA (zh)
WO (1) WO2017063531A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109491994A (zh) * 2018-11-28 2019-03-19 中国科学院遥感与数字地球研究所 Landsat-8卫星精选遥感数据集最简化筛选方法
CN111984748A (zh) * 2019-05-22 2020-11-24 深圳中兴飞贷金融科技有限公司 地址信息处理方法和装置、存储介质及电子设备

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106598965B (zh) 2015-10-14 2020-03-20 阿里巴巴集团控股有限公司 一种基于地址信息的账户映射方法及装置
CN108090221B (zh) * 2018-01-02 2019-05-10 北京市燃气集团有限责任公司 一种燃气卡数据与用户管理数据的关联方法
CN109741227B (zh) * 2019-01-07 2020-12-08 巩志远 一种基于最近邻算法预测人房一致性处理方法及系统
CN110851661B (zh) * 2019-10-25 2023-04-21 北京深演智能科技股份有限公司 用户id映射关系的确定方法及装置、电子设备
CN112150305A (zh) * 2020-09-14 2020-12-29 深圳供电局有限公司 企业电力用户信息核验方法及其系统、计算机设备、介质
CN112507223B (zh) * 2020-12-10 2023-06-23 北京百度网讯科技有限公司 数据处理方法、装置、电子设备及可读存储介质
US11409660B1 (en) * 2021-11-19 2022-08-09 SafeGraph, Inc. Systems and methods for translating address strings to standardized addresses

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101882163A (zh) * 2010-06-30 2010-11-10 中国科学院地理科学与资源研究所 一种基于匹配规则的模糊中文地址地理赋值方法
CN102932414A (zh) * 2012-09-26 2013-02-13 中国科学院计算技术研究所 一种通讯录与社交网络的融合方法及其系统
WO2014160582A1 (en) * 2013-03-28 2014-10-02 Robert Andrew Eckel System and method for transaction authentication
CN104573057A (zh) * 2015-01-22 2015-04-29 电子科技大学 一种用于跨ugc网站平台的帐户关联方法
CN104767672A (zh) * 2014-01-08 2015-07-08 北大方正集团有限公司 一种账户关系建立方法及设备
CN104954322A (zh) * 2014-03-25 2015-09-30 腾讯科技(深圳)有限公司 一种账号的绑定处理方法、装置及系统

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100684445B1 (ko) 1999-12-31 2007-02-16 주식회사 케이티 이종 데이터베이스 간의 연동을 위한 연동 시스템 및 그방법
US10684350B2 (en) 2000-06-02 2020-06-16 Tracbeam Llc Services and applications for a communications network
US7738878B2 (en) 2000-12-13 2010-06-15 Trimble Navigation Limited Method for obtaining location information of a mobile unit using a wireline telephone number
US7256711B2 (en) * 2003-02-14 2007-08-14 Networks In Motion, Inc. Method and system for saving and retrieving spatial related information
JP2005011049A (ja) * 2003-06-19 2005-01-13 Nec Soft Ltd データベース統合装置
KR101042361B1 (ko) 2004-01-30 2011-06-17 주식회사 케이티 이종 업종간 정보시스템 통합이 가능한 메타 정보 시스템및 그 통합 방법
US20060271281A1 (en) * 2005-05-20 2006-11-30 Myron Ahn Geographic information knowledge systems
US8732091B1 (en) 2006-03-17 2014-05-20 Raj Abhyanker Security in a geo-spatial environment
US9064288B2 (en) * 2006-03-17 2015-06-23 Fatdoor, Inc. Government structures and neighborhood leads in a geo-spatial environment
US20070233658A1 (en) 2006-03-31 2007-10-04 Aol Llc Identifying a result responsive to location data for multiple users
FI20065288A (fi) * 2006-05-03 2007-11-04 Emillion Oy Autentikointi
US20140123247A1 (en) 2006-11-22 2014-05-01 Raj Abhyanker Nextdoor neighbor connect
EP2229650A1 (en) 2007-12-28 2010-09-22 Koninklijke Philips Electronics N.V. Information interchange system and apparatus
JP5113108B2 (ja) * 2008-06-18 2013-01-09 ヤフー株式会社 注記名寄せ装置、注記名寄せ方法、および、注記名寄せプログラム
JP5711750B2 (ja) * 2009-10-06 2015-05-07 コーニンクレッカ フィリップス エヌ ヴェ 異なるエンティティに保管された患者情報記録の自律的な結合
US8438231B2 (en) * 2010-01-08 2013-05-07 Alcatel Lucent Telecommunication messaging through a social networking service
WO2011142984A1 (en) 2010-05-13 2011-11-17 Northwestern University Geographic location system and method
JP2012174074A (ja) * 2011-02-23 2012-09-10 Esri Japan Co Ltd データベース統合装置
JP5802143B2 (ja) * 2012-02-07 2015-10-28 株式会社コナミデジタルエンタテインメント サーバ、制御方法およびプログラム
US9842120B1 (en) 2013-02-15 2017-12-12 United Parcel Service Of America, Inc. Standardization of addresses and location information
US10536428B2 (en) * 2013-07-12 2020-01-14 Skyhook Wireless, Inc. Processing multiple network address observations
WO2015123604A1 (en) 2014-02-17 2015-08-20 Tourmaline Labs, Inc. Systems and methods for estimating movements of a vehicle using a mobile device
CN104899202B (zh) * 2014-03-04 2019-03-19 华为技术有限公司 一种信息处理方法及系统
CN106161372B (zh) * 2015-04-09 2019-05-31 阿里巴巴集团控股有限公司 一种基于地址匹配的风险识别方法及装置
CN104778606A (zh) * 2015-04-10 2015-07-15 北京京东尚科信息技术有限公司 账户结构数据的处理方法和装置
CN106598965B (zh) 2015-10-14 2020-03-20 阿里巴巴集团控股有限公司 一种基于地址信息的账户映射方法及装置

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101882163A (zh) * 2010-06-30 2010-11-10 中国科学院地理科学与资源研究所 一种基于匹配规则的模糊中文地址地理赋值方法
CN102932414A (zh) * 2012-09-26 2013-02-13 中国科学院计算技术研究所 一种通讯录与社交网络的融合方法及其系统
WO2014160582A1 (en) * 2013-03-28 2014-10-02 Robert Andrew Eckel System and method for transaction authentication
CN104767672A (zh) * 2014-01-08 2015-07-08 北大方正集团有限公司 一种账户关系建立方法及设备
CN104954322A (zh) * 2014-03-25 2015-09-30 腾讯科技(深圳)有限公司 一种账号的绑定处理方法、装置及系统
CN104573057A (zh) * 2015-01-22 2015-04-29 电子科技大学 一种用于跨ugc网站平台的帐户关联方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3364309A4 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109491994A (zh) * 2018-11-28 2019-03-19 中国科学院遥感与数字地球研究所 Landsat-8卫星精选遥感数据集最简化筛选方法
CN111984748A (zh) * 2019-05-22 2020-11-24 深圳中兴飞贷金融科技有限公司 地址信息处理方法和装置、存储介质及电子设备

Also Published As

Publication number Publication date
US20180232206A1 (en) 2018-08-16
EP3364309A4 (en) 2018-08-22
JP6752475B2 (ja) 2020-09-09
SG11201803052QA (en) 2018-05-30
CN106598965B (zh) 2020-03-20
EP3364309B1 (en) 2021-06-30
JP2018537760A (ja) 2018-12-20
KR102117773B1 (ko) 2020-06-02
US10725737B2 (en) 2020-07-28
CN106598965A (zh) 2017-04-26
KR20180069869A (ko) 2018-06-25
EP3364309A1 (en) 2018-08-22
US10990353B2 (en) 2021-04-27
US20200125327A1 (en) 2020-04-23

Similar Documents

Publication Publication Date Title
WO2017063531A1 (zh) 一种基于地址信息的账户映射方法及装置
US10366113B2 (en) Method and system for generating a geocode trie and facilitating reverse geocode lookups
US9189746B2 (en) Machine-learning based classification of user accounts based on email addresses and other account information
US10452661B2 (en) Automated database schema annotation
WO2019041521A1 (zh) 用户关键词提取装置、方法及计算机可读存储介质
KR101191172B1 (ko) 이미지 데이터베이스의 이미지들을 관리하는 방법, 장치 및 컴퓨터 판독 가능한 기록 매체
CN102722709B (zh) 一种垃圾图片识别方法和装置
CN107203522B (zh) 一种恶意订单地址的判定方法和装置
CN104866478B (zh) 恶意文本的检测识别方法及装置
EP2695087A1 (en) Processing data in a mapreduce framework
CN111325022B (zh) 识别层级地址的方法和装置
CN107463711B (zh) 一种数据的标签匹配方法及装置
Nesi et al. Geographical localization of web domains and organization addresses recognition by employing natural language processing, Pattern Matching and clustering
US9529823B2 (en) Geo-ontology extraction from entities with spatial and non-spatial attributes
CN103198067A (zh) 一种业务搜索方法及系统
US20240054802A1 (en) System and method for spatial encoding and feature generators for enhancing information extraction
CN105389338B (zh) 一种采购中标数据的解析方法
US10430473B2 (en) Deep mining of network resource references
CN105279249B (zh) 一种网站中兴趣点数据的置信度的判定方法和装置
CN113204613B (zh) 地址生成方法、装置、设备和存储介质
CN105160032B (zh) 一种网站中兴趣点数据的置信度的判定方法和装置
Park et al. Spatio‐temporal query contextualization for microtext retrieval in social media
CN104867056A (zh) 一种基于用户数据分析的智能社交方法
US10204139B2 (en) Systems and methods for processing geographic data
CN114077646A (zh) 信息抽取方法、装置、电子设备及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16854910

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 11201803052Q

Country of ref document: SG

Ref document number: 2018519762

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20187013633

Country of ref document: KR

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2016854910

Country of ref document: EP