CN110413715A - A kind of standardization processing method and device of address - Google Patents

A kind of standardization processing method and device of address Download PDF

Info

Publication number
CN110413715A
CN110413715A CN201910639806.8A CN201910639806A CN110413715A CN 110413715 A CN110413715 A CN 110413715A CN 201910639806 A CN201910639806 A CN 201910639806A CN 110413715 A CN110413715 A CN 110413715A
Authority
CN
China
Prior art keywords
information
administrative area
administrative
area information
address
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910639806.8A
Other languages
Chinese (zh)
Inventor
曾伟雄
莫卉星
纪磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Union Mobile Pay Co Ltd
Original Assignee
Union Mobile Pay Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Union Mobile Pay Co Ltd filed Critical Union Mobile Pay Co Ltd
Priority to CN201910639806.8A priority Critical patent/CN110413715A/en
Publication of CN110413715A publication Critical patent/CN110413715A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/03Credit; Loans; Processing thereof

Abstract

The present invention provides the standardization processing method and device of a kind of address, to solve the problems, such as that address instruction existing in the prior art is not known.It include: the first information for obtaining user, the first information includes the first address and identity information, and the identity information includes the information that at least one has instruction ownership place function;When determining includes at least two administrative area information with first address matching in the first storage organization, the first administrative area information is matched from the information of at least two administrative area according to the identity information;Wherein, it is stored in first storage organization multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;First address is standardized according to first administrative area information.

Description

A kind of standardization processing method and device of address
Technical field
The present invention relates to technical field of data processing more particularly to the standardization processing methods and device of a kind of address.
Background technique
Client is needed to fill in many personal information in credit industry, transacting business, including such as home address, unit The associated address informations such as location.
Currently, being usually written only to the other administrative area of lower levels such as area's grade, at county level when client's fill address, there is writing not For that is, place name has overlapping, address institute is cannot be distinguished in complete situation, lower for different provinces and cities point of area, the county for having a same names Belong to provinces and cities, is relatively also easy to produce and obscures.Address instruction is unclear, and influence related service handles progress.
Summary of the invention
The present invention provides the standardization processing method and device of a kind of address, to solve address existing in the prior art Indicate unclear problem.
In a first aspect, the embodiment of the invention provides a kind of standardization processing methods of address, comprising:
The first information of user is obtained, the first information includes the first address and identity information, the identity information packet Include at least one information with instruction ownership place function;
When determining includes at least two administrative area information with first address matching in the first storage organization, according to The identity information matches the first administrative area information from the information of at least two administrative area;Wherein, first storage It is stored in structure multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;
First address is standardized according to first administrative area information.
In an optional implementation manner, the identity information includes at least two letters with instruction ownership place function Breath;
It is described that first administrative area information is matched from the information of at least two administrative area according to the identity information, packet It includes:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to includes the second row Administrative division information and third administrative area information, and in second administrative area information and at least two administrative area information wherein One matching, when one of matching in third administrative area information and at least two administrative area information, according to institute The weight of the corresponding identity information of the second administrative area information and the weight of the corresponding identity information of third administrative area information are stated, Determine the confidence level of each administrative area information in the information of at least two administrative area;
The maximum administrative area information of confidence level in the information of at least two administrative area is determined as first administrative area Information.
In an optional implementation manner, it is described according to the identity information from the information of at least two administrative area Match the first administrative area information, comprising:
When in the administrative area information in the administrative area that at least two information that the identity information includes are belonged to there is only with When one of them matched fourth line administrative division information in the information of at least two administrative area, by least two administrative area It is determined as first administrative area information with the administrative area information of fourth line administrative division information matches in information.
In an optional implementation manner, it is described according to the identity information from the information of at least two administrative area Match the first administrative area information, comprising:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to is fifth line When the information of administrative division, it will be determined in the information of at least two administrative area with the administrative area information of fifth line administrative division information matches For first administrative area information.
In an optional implementation manner, the identity information includes an information with instruction ownership place function;
The first administrative area information is matched from the information of at least two administrative area according to the identity information, comprising:
Determine the 6th administrative area information in the administrative area that the identity information is belonged to;
The first row with the 6th administrative area information matches is determined from the information of at least two administrative area Administrative division information.
In an optional implementation manner, the information with instruction ownership place function is the ID card No. of user, electricity The location information that words number or user are currently located.
Second aspect, the embodiment of the present invention provide a kind of standardization device of address, comprising:
Module is obtained, for obtaining the first information of user, the first information includes the first address and identity information, institute Stating identity information includes the information that at least one has instruction ownership place function;
Matching module, for working as at least two administration determined include in the first storage organization with first address matching When area's information, the first administrative area information is matched from the information of at least two administrative area according to the identity information;Wherein, It is stored in first storage organization multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;
Processing module, for being standardized according to first administrative area information to first address.
In an optional implementation manner, the identity information includes at least two letters with instruction ownership place function Breath;
The matching module is matching the first row from the information of at least two administrative area according to the identity information Administrative division information, is specifically used for:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to includes the second row Administrative division information and third administrative area information, and in second administrative area information and at least two administrative area information wherein One matching, when one of matching in third administrative area information and at least two administrative area information, according to institute The weight of the corresponding identity information of the second administrative area information and the weight of the corresponding identity information of third administrative area information are stated, Determine the confidence level of each administrative area information in the information of at least two administrative area;
The maximum administrative area information of confidence level in the information of at least two administrative area is determined as first administrative area Information.
In an optional implementation manner, the matching module, it is described according to the identity information from it is described at least The first administrative area information is matched in two administrative area information, is specifically used for:
When in the administrative area information in the administrative area that at least two information that the identity information includes are belonged to there is only with When one of them matched fourth line administrative division information in the information of at least two administrative area, by least two administrative area It is determined as first administrative area information with the administrative area information of fourth line administrative division information matches in information.
In an optional implementation manner, the matching module, it is described according to the identity information from it is described at least The first administrative area information is matched in two administrative area information, is specifically used for:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to is fifth line When the information of administrative division, it will be determined in the information of at least two administrative area with the administrative area information of fifth line administrative division information matches For first administrative area information.
In an optional implementation manner, the identity information includes an information with instruction ownership place function;
The matching module is matching the first row from the information of at least two administrative area according to the identity information Administrative division information, is specifically used for:
Determine the 6th administrative area information in the administrative area that the identity information is belonged to;
The first row with the 6th administrative area information matches is determined from the information of at least two administrative area Administrative division information.
In an optional implementation manner, the information with instruction ownership place function is the ID card No. of user, electricity The location information that words number or user are currently located.
The third aspect, the embodiment of the present invention provide a kind of standardization device of address, comprising:
Memory and processor;
Memory, for storing program instruction;
Processor executes first aspect according to the program of acquisition for calling the program instruction stored in the memory Any implementation described in method.
Fourth aspect, the embodiment of the present invention provide a kind of computer readable storage medium, the computer-readable storage medium Matter is stored with computer instruction, when the computer instruction is run on computers, so that computer executes first aspect Method described in any implementation.
In the embodiment of the present invention, when getting the first address included by the first information of user and identity information, first It is multiple for characterizing the administrative area information of subordinate relation between multistage administrative area based on being stored with, it filters out in the first storage organization Including at least two administrative area information with first address matching, then believed according to identity information from least two administrative areas The first administrative area information is matched in breath, and then the first address is standardized according to the first administrative area information, namely The first address is supplemented completely according to the first administrative area information, avoids influencing doing for related service since address instruction is not known Reason progress can effectively promote business handling efficiency.
Detailed description of the invention
Fig. 1 is a kind of flow diagram of the standardization processing method of address provided in an embodiment of the present invention;
Fig. 2 is a kind of structural schematic diagram of first storage organization provided in an embodiment of the present invention;
Fig. 3 is the flow diagram of the standardization processing method of another address provided in an embodiment of the present invention;
Fig. 4 is a kind of structural block diagram of the standardization device of address provided in an embodiment of the present invention;
Fig. 5 is the structural schematic diagram of the standardization device of another address provided in an embodiment of the present invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.
It should be noted that it is multiple involved in the present invention, refer to two or more."and/or", description association pair The incidence relation of elephant, indicate may exist three kinds of relationships, for example, A and/or B, can indicate: individualism A, exist simultaneously A and These three situations of B, individualism B.Character "/" typicallys represent the relationship that forward-backward correlation object is a kind of "or".Additionally, it should manage Solution, although may describe each data using term first, second etc. in embodiments of the present invention, these data be should not necessarily be limited by These terms.These terms are only used to for each data being distinguished from each other out.
Client is needed to fill in many personal information when client handles some business, including such as home address, unit The associated address informations such as location.Currently, being usually written only to the other administrative area of lower levels such as area's grade, at county level when client's fill address, depositing Incomplete situation is being write, is especially encountering place name overlapping, when there are area, the county of same names in lower point of different provinces and cities, address is endless Whole be relatively also easy to produce is obscured, for example, Beijing and the Changchun City Chaoyang District Jun You, if client's fill address is " Chaoyang District people People masses road 666 ", may result in can not clearly be " Chaoyang District, Beijing City people road 666 " or " Jilin governor The Chaoyang District Chun Shi people road 666 ".Address instruction is unclear, and influence related service handles progress.
Based on this, the embodiment of the present invention provides the standardization processing method and device of a kind of address, to solve existing skill The unclear problem of the instruction of address present in art.Wherein, method and apparatus be based on the same inventive concept, due to method and The principle that device solves the problems, such as is similar, therefore the implementation of method and apparatus can be with cross-reference, and overlaps will not be repeated.
Referring to Fig. 1, the embodiment of the invention provides a kind of standardization processing methods of address, comprising:
Step S101, obtains the first information of user, and the first information includes the first address and identity information, identity information packet Include at least one information with instruction ownership place function.
When it is implemented, can be from user's transacting business when such as name, age, identification card number, home address, the hand filled in It in the personal information such as machine number, business address, determines to fill in imperfect, needs to carry out Address Standardization processing relatively Location information, namely obtain aforementioned first address, such as home address, unit address etc.;And determine have in personal information The information of ownership place function, such as identification card number, cell-phone number are indicated, to acquire the identity information of aforementioned user.
Step S102 includes at least two administrative area information with the first address matching in the first storage organization when determining When, the first administrative area information is matched from least two administrative area information according to identity information.
Wherein, multiple administrative area letters for characterizing subordinate relation between multistage administrative area are stored in the first storage organization Breath;The the first administrative area information matched from least two administrative area information according to identity information and aforementioned identity information institute The ownership place of instruction has incidence relation.
Step S103 is standardized the first address according to the first administrative area information.
In the embodiment of the present invention, when getting the first address included by the first information of user and identity information, first It is multiple for characterizing the administrative area information of subordinate relation between multistage administrative area based on being stored with, it filters out in the first storage organization Including at least two administrative area information with first address matching, then believed according to identity information from least two administrative areas The first administrative area information is matched in breath, and then the first address is standardized according to the first administrative area information, namely The first address is supplemented completely according to the first administrative area information, avoids influencing doing for related service since address instruction is not known Reason progress can effectively promote business handling efficiency.
In a kind of optional embodiment, administrative area rank construction that tree is divided for China can be used can The structure tree for characterizing subordinate relation between multistage administrative area, using the structure tree as the first storage organization.For ease of understanding, referring to Fig. 2, the embodiment of the invention provides a kind of structural schematic diagram of first storage organization 200, which is in tree-shaped Structure shape, first storage organization 200 include by root node to the mulitpath of multiple leaf nodes, the corresponding instruction of each path One administrative area information, indicate between country and its lower point of 3 grades of administrative areas (respectively province/municipality directly under the Central Government, city, area/county) from Category relationship, wherein root node indicates that country, the expression of intermediary tree node save/are directly under the jurisdiction of city-level administrative area or grade administrative area, leaf node Indicate area/administrative areas at the county level.
Specifically, Fig. 2 illustrates the root node with China to indicate country;It is with the Chinese Jilin Province Xia Fen, Beijing It indicates to save/be directly under the jurisdiction of tree node among the level-one in city-level administrative area;It is to indicate city-level administration with the Jilin Province Jilin Xia Fen, Changchun Tree node among the second level in area;Using Beijing Chaoyang District Xia Fen, Haidian District and the Changchun Chaoyang District Xia Fen, Nongan County as table Show area/administrative areas at the county level leaf node." China → Beijing → Chaoyang District " indicates by root node (China) to leaf in Fig. 2 One paths of child node (Chaoyang District).
It should be noted that only being shown in Fig. 2 a kind of using the minimum area/administrative areas at the county level that are divided into as the tree of leaf node The first storage organization for type structure, when it is implemented, township level administrative area can also be assigned to down, or be specifically divided into area/ Street name, number at county level etc. are used as leaf node, are based on this, are written only to street in fill address for some users The case where, such as " people road 666 ", it can also in the manner described above, matching and the " people from the first storage organization The corresponding destination node in road " determines the corresponding father and son's link of destination node, then to obtain its corresponding administrative area information.
In addition, the first address that can also fill according to user updates aforementioned first and deposits in a kind of optional embodiment Storage structure, such as the first address " Haidian District people road 666 " that user A is filled in is got, extract wherein characterization row The target keyword of administrative division: " Haidian District ", " people road " judges that there is only destination node " Haidian in the first storage organization " people road " may be not present in area ", is determining the corresponding administration in the first address according to " Haidian District " corresponding father and son's link After area's information, " people road " can newly be added into the first storage organization the next stage node of " Haidian District ", to realize to the The update of one storage organization operates.
Based on the first storage organization 200 shown in Fig. 2, in a kind of optional embodiment, the first storage knot is being determined When including at least two administrative area information with the first address matching in structure, following manner implementation specifically can refer to:
A1 carries out word segmentation processing to the first address, obtains at least one corresponding keyword of the first address.
When it is implemented, participle tool or participle application can be used, such as jieba participle divides the first address Word processing can carry out word segmentation processing to the first address of different user, it is corresponding at least to obtain each first address after word segmentation processing One keyword, the first address to user A as shown in Table 1 below: after " Haidian District people road 666 " word segmentation processing Obtain 4 keywords " Haidian District ", " people road ", " 666 ", " number ":
Table 1
User First address After participle
A Haidian District people road 666 Haidian District/people road/No. 666/
B Southern exposure people road 666 Towards the sunlight/people road/No. 666/
C Chaoyang District people road 666 Chaoyang District/people road/No. 666/
A2 extracts the target keyword for characterizing administrative area at least one keyword, matches from the first storage organization At least two destination nodes corresponding with target keyword.
When it is implemented, the mode that fuzzy matching can be used matched from the first storage organization it is corresponding with target keyword At least two destination nodes.By taking the corresponding keyword in the first address of user B in table 1 as an example, extracts target therein and close Keyword are as follows: " southern exposure ", can fuzzy matching to two destination nodes in the first storage organization, respectively first path: " China → Leaf node " Chaoyang District " and the second path in Beijing → Chaoyang District ": " China → Jilin Province → Changchun → southern exposure Leaf node " Chaoyang District " in area ".It should be noted that the rank based on administrative area indicated by target keyword is different, The destination node matched is not limited in indicating area/administrative areas at the county level leaf node in Fig. 2, is also possible to more advanced administration Area, tree node among second level corresponding to such as city-level administrative area, is not restricted herein.
A3 determines the corresponding administrative area information of at least two destination nodes institute in the first storage organization.
It, can be shown in Fig. 2 after determining destination node " Chaoyang District " in the example of the first address of above-mentioned user B The first storage organization 200 namely structure tree in, with " Chaoyang District " be starting point traverse up it superior node (also referred to as, father save Point), until traversing root node (China), it may be determined that go out two candidate parent subchains corresponding with " Chaoyang District " namely two A administrative area information is respectively as follows: " China → Beijing " and " China → Jilin Province → Changchun ".
Further, first when matching the first administrative area information from least two administrative area information according to identity information First determine identity information in include the number of species with the information of ownership place function, then according in identity information have ownership The information of ground function matches the first administrative area information from least two administrative area information.
Wherein, the information with instruction ownership place function can be ID card No., telephone number or the user of user The location information being currently located.When practical application, if user is filled out in the paper form that the business hall of transacting business provides Its personal information is write, then the location information that aforementioned user is currently located is geographical location locating for business hall;If user is logical It crosses network progress business handling and fills in its personal information, then the location information that aforementioned user is currently located is that user's used terminal is set Standby IP address.
To be convenient to carry out, the embodiment of the present invention is for the information with instruction ownership place function included in identity information Number of species are different, and the method for dividing briefing to match the first administrative area information from least two administrative area information is as follows:
(1) the first situation: identity information includes at least two information with instruction ownership place function.
In a kind of optional embodiment, the first administration is matched from least two administrative area information according to identity information Area's information, comprising:
B1, according at least two administrative area information, filter out that aforementioned identity information includes N number of has instruction ownership place function K information for meeting first condition in the information of energy, according to the corresponding identity of each administrative area information in K administrative area information The weight of information determines the confidence level of each administrative area information at least two administrative area information;
Wherein, N >=2,1≤K≤N, N, K are natural number, first condition are as follows: have the information institute of instruction ownership place function Indicate that the administrative area of ownership matches with one of them at least two administrative area information.When it is implemented, above-mentioned identity letter The weight of breath can be predefined according to the significance level including the different information with instruction ownership place function, for example be set The weight for setting ID card No. is " 1 ", and the weight for the location information that user is currently located is " 1 ", and the weight of telephone number is " 0.5 " etc..
When it is implemented, the administrative area information in the administrative area for example belonged to when at least two information that identity information includes Including the second administrative area information and third administrative area information, in the second administrative area information and at least two administrative area information wherein One matching can be according to the second administration when one of matching in third administrative area information and at least two administrative area information The weight of the corresponding identity information of area's information and the weight of the corresponding identity information of third administrative area information determine at least two The confidence level of each administrative area information in a administrative area information;Wherein, at least two administrative area information with third administrative area One administrative area information of information matches, and can be same with an administrative area information of fourth line administrative division information matches, It is also possible to different one.
The maximum administrative area information of confidence level at least two administrative area information is determined as the first administrative area letter by B2 Breath.
For ease of understanding, the embodiment of the present invention includes that identification card number, telephone number and user are currently located with identity information Location information, for determining the corresponding first administrative area information in the first address of aforementioned user B, to being provided in above-mentioned B1~B2 Mode be illustrated.
According to identity information from the first storage organization, corresponding two candidate parents in the first address of aforementioned user B are matched Subchain, i.e. two administrative area information, respectively " China → Beijing " and " China → Jilin Province → Changchun ", from the two rows Administrative division information determines the first administrative area information, can implement in the following manner:
C1 parses identification card number, telephone number and user's present position information respectively, obtains each information and return The administrative area information in the administrative area of category is determined full in each information according to the administrative area information in the administrative area that each information is belonged to The identity information of sufficient first condition, as shown in table 2 below:
Table 2
Determine that the identity information for meeting first condition in each information is telephone number, user's present position information.
C2, according to the weight for the identity information for meeting first condition, two administrative areas corresponding to the first address of user B Information is voted, and one administrative area information of who gets the most votes in two administrative area information is determined as to the first ground of the user B The corresponding first administrative area information in location.
When it is implemented, judgement parses the administrative area information and two administrative area information in the administrative area of identity information ownership In which matching, just the corresponding ticket of the weight of the identity information is thrown to corresponding administrative area information.Such as telephone number belongs to The administrative area information in administrative area matched with " China → Beijing ", then according to weight shared by telephone number, for " China → north Jing Shi " record gained vote " 0.5 ";User's present position information ownership administrative area administrative area information also with " China → north Jing Shi " matching is won the vote " 1 ", most then according to weight shared by user's present position information for " China → Beijing " record Determine that the gained vote of " China → Beijing " adds up to " 1.5 " eventually, the gained vote of " China → Jilin Province → Changchun " adds up to " 0 ". It is as shown in table 3 below:
Table 3
Candidate parent subchain Gained vote
" China → Beijing " 1.5(0.5+1)
" China → Jilin Province → Changchun " 0
Wherein, " China → Beijing " who gets the most votes, confidence level is maximum, it is determined that " China → Beijing " is first administrative Area's information.
In another optional embodiment, the first row is matched from least two administrative area information according to identity information Administrative division information, comprising:
When in the administrative area information in the administrative area that at least two information that identity information includes are belonged to there is only at least When one of them matched fourth line administrative division information in two administrative area information, by least two administrative area information with the 4th The administrative area information of administrative area information matches is determined as the first administrative area information.
As the following table 4 determines the example of its corresponding first administrative area information, identity information packet for the first address of user B In the location information that identification card number, telephone number and the user included is currently located, there is only user's present position information to return The administrative area information in the administrative area of category can be matched with " China → Beijing " at least two administrative area information, then will " China → Beijing " is determined as the corresponding first administrative area information in the first address of user B.
Table 4
In another optional embodiment, the first row is matched from least two administrative area information according to identity information Administrative division information, comprising:
When the administrative area information in the administrative area that at least two information that identity information includes are belonged to is fifth line administrative division When information, the administrative area information at least two administrative area information with fifth line administrative division information matches is determined as the first administrative area Information.
As the following table 5 determines the example of its corresponding first administrative area information, identity information packet for the first address of user B The administrative area information in the administrative area that the location information that identification card number, telephone number and the user included is currently located belongs to respectively is wrapped Containing " Beijing ", instruction is fifth line administrative division information " China → Beijing ", can be at least two administrative area information In " China → Beijing " matched, then " China → Beijing " is determined as to the corresponding the first row in the first address of user B Administrative division information.
Table 5
(2) second situation: identity information includes an information with instruction ownership place function.
The first administrative area information is matched from least two administrative area information according to identity information, comprising:
Determine the 6th administrative area information in the administrative area that identity information is belonged to;It is determined from least two administrative area information Out with first administrative area information of the 6th administrative area information matches.
Such as the information with instruction ownership place function that identity information includes is identification card number, the identification card number institute The administrative area information in the administrative area of ownership includes " Beijing " then can be corresponding two from its first address for user B Administrative area information: " China → Beijing " and " China → Jilin Province → Changchun ", from the two administrative areas, information determines One administrative area information is " China → Beijing ".
Further, when in the first storage organization 200 shown in Fig. 2 exist and the first address matching unique administrative area believe Breath, directly can be determined as aforementioned first administrative area information for unique administrative area information.Such as the first address to aforementioned user A After word segmentation processing, obtaining and wherein characterizing the target keyword in administrative area is " Haidian District ", determines the first storage organization shown in Fig. 2 In 200 existence anduniquess description " Haidian District " destination node, then with " Haidian District " be starting point traverse up it superior node ( Claim, father node), until traversing root node (China), it may be determined that go out administrative area corresponding with " Haidian District " information, are as follows: " China → Beijing ", can directly determine " China → Beijing " is aforementioned first administrative area information.
Further, in a kind of optional embodiment, the first address is standardized according to from the first administrative area information Processing, specifically includes:
By the first administrative area information supplement to the first address, so that it is complete, such as " China → Beijing " is added to In the first address of user A, sufficient address is obtained are as follows: " city, BeiJing, China, people road, Haidian District 666 ", alternatively, will The information supplement after national root node information removes is characterized in first administrative area information to the first address, such as to user A's The processing of first Address Standardization, obtains " Haidian District, Beijing City people road 666 ".
For ease of understanding, referring to Fig. 3, the embodiment of the invention also provides a kind of standardization processing methods of address, comprising:
Step S301 obtains the first address and the identity information of user.
Identity information includes the ID card No. filled in when filling in personal information by user's transacting business, telephone number, And obtain the location information of user being currently located certainly according to the place based on user's transacting business, for example, user be Business hall scene transacting business, the then location information that aforementioned user is currently located are the geographical location of business hall;If user is By network transacting business, then the location information that aforementioned user is currently located is the IP address of user's used terminal equipment.
Step S302 obtains the target keyword for characterizing administrative area to the first address dividing.
Step S303 judges to whether there is and the matched tree node of target keyword in structure tree;If so, executing step S304 or step S305, if not, terminating;Wherein, structure tree, that is, aforementioned first storage organization 200 shown in Fig. 2, tree node It can be intermediary tree node, be also possible to leaf node.
Step S304, when there is unique tree node matched with target keyword in structure tree, according to unique tree node The administrative area information of place path instruction is standardized the first address.
Step S305, when in structure tree including at least two administrative areas information matched with target keyword, according to body Part information matches the first administrative area information from least two administrative area information, according to the first administrative area information to the first address It is standardized.
In the present embodiment, matched and the first address from the structure tree for being stored with subordinate relation between multistage administrative area first Corresponding administrative area information, when the administrative area information matched is unique, according to the administrative area information matched to the first address It is standardized;When the administrative area information matched is not unique, when being at least two administrative area information, according to identity information It determines and matches the first administrative area information at least two administrative area information, and then according to the first administrative area information to the first address It is standardized.That is filled in when taking full advantage of user's transacting business has the identity information of instruction ownership place function, Such as ID card No., telephone number, and the identity information that non-user is filled in is obtained, such as user's transacting business used terminal IP address combines various identity informations during standardized address, determines most believable first for standardized address Administrative area information, it is relatively reasonable, the first address is supplemented completely, is able to solve and exists in the prior art what address instruction was not known Problem can effectively promote the efficiency of business handling.
A kind of corresponding above-mentioned standardization processing method of address, referring to fig. 4, the embodiment of the present invention provides a kind of mark of address Standardization processing unit 400, comprising:
Module 401 is obtained, for obtaining the first information of user, the first information includes the first address and identity information, body Part information includes information that at least one has instruction ownership place function.
Matching module 402, for working as at least two administration determined include in the first storage organization with the first address matching When area's information, the first administrative area information is matched from least two administrative area information according to identity information;Wherein, the first storage It is stored in structure multiple for characterizing the administrative area information of subordinate relation between multistage administrative area.
Processing module 403, for being standardized according to the first administrative area information to the first address.
In the embodiment of the present invention, when getting the first address included by the first information of user and identity information, first It is multiple for characterizing the administrative area information of subordinate relation between multistage administrative area based on being stored with, it filters out in the first storage organization Including at least two administrative area information with first address matching, then believed according to identity information from least two administrative areas The first administrative area information is matched in breath, and then the first address is standardized according to the first administrative area information, namely The first address is supplemented completely according to the first administrative area information, avoids influencing doing for related service since address instruction is not known Reason progress can effectively promote business handling efficiency.
In a kind of optional embodiment, identity information includes at least two information with instruction ownership place function;
Matching module 402 is matching the first administrative area information according to identity information from least two administrative area information, It is specifically used for:
When the administrative area information in the administrative area that at least two information that identity information includes are belonged to includes the second administrative area Information and third administrative area information, and one of matching in the second administrative area information and at least two administrative area information, the When one of matching in three administrative area information and at least two administrative area information, according to the corresponding body of the second administrative area information Part weight of information and the weight of the corresponding identity information of third administrative area information, determine at least two administrative area information Each administrative area information confidence level;
The maximum administrative area information of confidence level at least two administrative area information is determined as the first administrative area information.
In a kind of optional embodiment, matching module 402, according to identity information from least two administrative area information In match the first administrative area information, be specifically used for:
When in the administrative area information in the administrative area that at least two information that identity information includes are belonged to there is only at least When one of them matched fourth line administrative division information in two administrative area information, by least two administrative area information with the 4th The administrative area information of administrative area information matches is determined as the first administrative area information.
In a kind of optional embodiment, matching module 402, according to identity information from least two administrative area information In match the first administrative area information, be specifically used for:
When the administrative area information in the administrative area that at least two information that identity information includes are belonged to is fifth line administrative division When information, the administrative area information at least two administrative area information with fifth line administrative division information matches is determined as the first administrative area Information.
In a kind of optional embodiment, identity information includes an information with instruction ownership place function;
Matching module 402 is matching the first administrative area information according to identity information from least two administrative area information, It is specifically used for:
Determine the 6th administrative area information in the administrative area that identity information is belonged to;
The first administrative area information with the 6th administrative area information matches is determined from least two administrative area information.
In a kind of optional embodiment, the above-mentioned information with instruction ownership place function is the identification card number of user The location information that code, telephone number or user are currently located.
The corresponding above method, referring to Fig. 5, the embodiment of the invention provides the standardization device 500 of another address, Include:
Communication interface 501, memory 502 and processor 503;
Wherein, the processor 503 is communicated by the communication interface 501 with other equipment, for example, other equipment Terminal device used when can be aforementioned user by network transacting business.Processor 503 can be obtained by communication interface 501 Take the IP address of aforementioned terminals equipment;Memory 502, for storing program instruction;Processor 503, for calling the storage The program instruction stored in device 502 executes the method in above-described embodiment according to the program of acquisition.
In the embodiment of the present application, processor can be general processor, digital signal processor, specific integrated circuit, Field programmable gate array or other programmable logic device, discrete gate or transistor logic, discrete hardware components, It may be implemented or execute disclosed each method, step and the logic diagram in the embodiment of the present application.General processor can be Microprocessor or any conventional processor etc..The step of method in conjunction with disclosed in the embodiment of the present application, can directly embody Execute completion for hardware processor, or in processor hardware and software module combination execute completion.
In the embodiment of the present application, memory can be nonvolatile memory, such as hard disk (hard disk drive, HDD) or solid state hard disk (solid-state drive, SSD) etc., it can also be volatile memory (volatile ), such as random access memory (random-access memory, RAM) memory.Memory, which can also be, can be used in taking Band or storage have the desired program code of instruction or data structure form and can be by any other Jie of computer access Matter, but not limited to this.Memory in the embodiment of the present application can also be circuit or other arbitrarily can be realized store function Device, for storing program instruction and/or data.Do not limited in the embodiment of the present application above-mentioned communication interface, memory and Specific connection medium between processor, such as bus, bus can be divided into address bus, data/address bus, control bus etc..
Further, the embodiment of the invention provides a kind of computer readable storage medium, the computer readable storage mediums It is stored with computer instruction, when computer instruction is run on computers, so that computer executes the above method.
It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the present invention Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the present invention, which can be used in one or more, The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces The form of product.
The present invention be referring to according to the method for the embodiment of the present invention, the process of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
Although preferred embodiments of the present invention have been described, it is created once a person skilled in the art knows basic Property concept, then additional changes and modifications may be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as It selects embodiment and falls into all change and modification of the scope of the invention.
Obviously, various changes and modifications can be made to the invention without departing from model of the invention by those skilled in the art It encloses.In this way, if these modifications and changes of the present invention is within the scope of the claims of the present invention and its equivalent technology, then The present invention is also intended to include these modifications and variations.

Claims (14)

1. a kind of standardization processing method of address characterized by comprising
The first information of user is obtained, the first information includes the first address and identity information, and the identity information includes extremely Few one has the information of instruction ownership place function;
When determining includes at least two administrative area information with first address matching in the first storage organization, according to described Identity information matches the first administrative area information from the information of at least two administrative area;Wherein, first storage organization In be stored with it is multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;
First address is standardized according to first administrative area information.
2. the method as described in claim 1, which is characterized in that the identity information includes at least two with instruction ownership place The information of function;
It is described that first administrative area information is matched from the information of at least two administrative area according to the identity information, comprising:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to includes the second administrative area Information and third administrative area information, and one of them in second administrative area information and at least two administrative area information Matching, when one of matching in third administrative area information and at least two administrative area information, according to described the The weight of the corresponding identity information of two administrative area information and the weight of the corresponding identity information of third administrative area information determine The confidence level of each administrative area information in the information of at least two administrative area;
The maximum administrative area information of confidence level in the information of at least two administrative area is determined as first administrative area information.
3. method according to claim 2, which is characterized in that described administrative from described at least two according to the identity information The first administrative area information is matched in area's information, comprising:
When in the administrative area information in the administrative area that at least two information that the identity information includes are belonged to there is only with it is described When one of them matched fourth line administrative division information at least two administrative area information, by least two administrative area information In with the administrative area information of fourth line administrative division information matches be determined as first administrative area information.
4. method according to claim 2, which is characterized in that described administrative from described at least two according to the identity information The first administrative area information is matched in area's information, comprising:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to is fifth line administrative division When information, the administrative area information in the information of at least two administrative area with fifth line administrative division information matches is determined as institute State the first administrative area information.
5. the method as described in claim 1, which is characterized in that the identity information, which includes one, to be had the function of to indicate ownership place Information;
The first administrative area information is matched from the information of at least two administrative area according to the identity information, comprising:
Determine the 6th administrative area information in the administrative area that the identity information is belonged to;
First administrative area with the 6th administrative area information matches is determined from the information of at least two administrative area Information.
6. the method according to claim 1 to 5, which is characterized in that the information with instruction ownership place function is user The location information that is currently located of ID card No., telephone number or user.
7. a kind of standardization device of address characterized by comprising
Module is obtained, for obtaining the first information of user, the first information includes the first address and identity information, the body Part information includes information that at least one has instruction ownership place function;
Matching module, for when determine the first storage organization in include and at least two administrative areas of first address matching believe When breath, the first administrative area information is matched from the information of at least two administrative area according to the identity information;Wherein, described It is stored in first storage organization multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;
Processing module, for being standardized according to first administrative area information to first address.
8. device as claimed in claim 7, which is characterized in that the identity information includes at least two with instruction ownership place The information of function;
The matching module is matching the first administrative area from the information of at least two administrative area according to the identity information Information is specifically used for:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to includes the second administrative area Information and third administrative area information, and one of them in second administrative area information and at least two administrative area information Matching, when one of matching in third administrative area information and at least two administrative area information, according to described the The weight of the corresponding identity information of two administrative area information and the weight of the corresponding identity information of third administrative area information determine The confidence level of each administrative area information in the information of at least two administrative area;
The maximum administrative area information of confidence level in the information of at least two administrative area is determined as first administrative area information.
9. device as claimed in claim 8, which is characterized in that the matching module, it is described according to the identity information from The first administrative area information is matched in the information of at least two administrative area, is specifically used for:
When in the administrative area information in the administrative area that at least two information that the identity information includes are belonged to there is only with it is described When one of them matched fourth line administrative division information at least two administrative area information, by least two administrative area information In with the administrative area information of fourth line administrative division information matches be determined as first administrative area information.
10. device as claimed in claim 8, which is characterized in that the matching module, it is described according to the identity information from The first administrative area information is matched in the information of at least two administrative area, is specifically used for:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to is fifth line administrative division When information, the administrative area information in the information of at least two administrative area with fifth line administrative division information matches is determined as institute State the first administrative area information.
11. device as claimed in claim 7, which is characterized in that the identity information, which includes one, has instruction ownership place function The information of energy;
The matching module is matching the first administrative area from the information of at least two administrative area according to the identity information Information is specifically used for:
Determine the 6th administrative area information in the administrative area that the identity information is belonged to;
First administrative area with the 6th administrative area information matches is determined from the information of at least two administrative area Information.
12. such as the described in any item devices of claim 7-11, which is characterized in that the information with instruction ownership place function is to use The location information that ID card No., telephone number or the user at family are currently located.
13. a kind of standardization device of address characterized by comprising
Memory and processor;
Memory, for storing program instruction;
Processor requires 1~6 according to the program execution benefit of acquisition for calling the program instruction stored in the memory Described in any item methods.
14. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage has computer to refer to It enables, when the computer instruction is run on computers, so that computer perform claim requires described in any one of 1~6 Method.
CN201910639806.8A 2019-07-16 2019-07-16 A kind of standardization processing method and device of address Pending CN110413715A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910639806.8A CN110413715A (en) 2019-07-16 2019-07-16 A kind of standardization processing method and device of address

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910639806.8A CN110413715A (en) 2019-07-16 2019-07-16 A kind of standardization processing method and device of address

Publications (1)

Publication Number Publication Date
CN110413715A true CN110413715A (en) 2019-11-05

Family

ID=68361634

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910639806.8A Pending CN110413715A (en) 2019-07-16 2019-07-16 A kind of standardization processing method and device of address

Country Status (1)

Country Link
CN (1) CN110413715A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110060763A1 (en) * 2009-09-09 2011-03-10 Denso Corporation Address search device and method for searching address
CN104572992A (en) * 2015-01-06 2015-04-29 武汉工程大学 Multi-constraint reasoning based standardization method for internet geographical location information
CN105069560A (en) * 2015-07-30 2015-11-18 中国科学院软件研究所 Resume information extraction and characteristic identification analysis system and method based on knowledge base and rule base
CN105335864A (en) * 2015-11-13 2016-02-17 小米科技有限责任公司 Display method, apparatus and system for secondary address information

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110060763A1 (en) * 2009-09-09 2011-03-10 Denso Corporation Address search device and method for searching address
CN104572992A (en) * 2015-01-06 2015-04-29 武汉工程大学 Multi-constraint reasoning based standardization method for internet geographical location information
CN105069560A (en) * 2015-07-30 2015-11-18 中国科学院软件研究所 Resume information extraction and characteristic identification analysis system and method based on knowledge base and rule base
CN105335864A (en) * 2015-11-13 2016-02-17 小米科技有限责任公司 Display method, apparatus and system for secondary address information

Similar Documents

Publication Publication Date Title
CN104050196B (en) A kind of interest point data redundant detecting method and device
CN109033086A (en) A kind of address resolution, matched method and device
CN103678708B (en) Method and device for recognizing preset addresses
CN109598509A (en) The recognition methods of risk clique and device
CN106202028B (en) A kind of address information recognition methods and device
CN106156145A (en) The management method of a kind of address date and device
CN104965876B (en) A kind of method and device carrying out the excavation of user job unit based on location information
CN108304484A (en) Key word matching method and device, electronic equipment and readable storage medium storing program for executing
CN103970747B (en) Data processing method for network side computer to order search results
CN107247791B (en) Parking lot map data generation method and device and machine-readable storage medium
CN102262664A (en) Quality estimating method and quality estimating device
CN106162544A (en) A kind of generation method and apparatus of geography fence
CN107395680A (en) Shop group's information push and output intent and device, equipment
CN105992171A (en) Text information processing method and device
CN107404486A (en) Parse method, apparatus, terminal device and the storage medium of Http data
CN106598946A (en) Content extracting method and device
CN106682100A (en) Data statistical method and system based on Hbase database
CN106503045B (en) A kind of method and device updating template library
CN110516713A (en) A kind of target group's recognition methods, device and equipment
CN107644366A (en) Order fraud recognition methods, system, storage medium and electronic equipment
CN108648017A (en) It is easy to user demand matching process, device, equipment and the storage medium of extension
CN110413715A (en) A kind of standardization processing method and device of address
CN104462392B (en) Share the statistical method and device of capacity of returns
CN110362646A (en) Processing method and processing device, storage medium and the electronic device of address information
CN109582834A (en) Data Risk Forecast Method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191105

RJ01 Rejection of invention patent application after publication