CN110413715A - A kind of standardization processing method and device of address - Google Patents
A kind of standardization processing method and device of address Download PDFInfo
- Publication number
- CN110413715A CN110413715A CN201910639806.8A CN201910639806A CN110413715A CN 110413715 A CN110413715 A CN 110413715A CN 201910639806 A CN201910639806 A CN 201910639806A CN 110413715 A CN110413715 A CN 110413715A
- Authority
- CN
- China
- Prior art keywords
- information
- administrative area
- administrative
- area information
- address
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/29—Geographical information databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/03—Credit; Loans; Processing thereof
Abstract
The present invention provides the standardization processing method and device of a kind of address, to solve the problems, such as that address instruction existing in the prior art is not known.It include: the first information for obtaining user, the first information includes the first address and identity information, and the identity information includes the information that at least one has instruction ownership place function;When determining includes at least two administrative area information with first address matching in the first storage organization, the first administrative area information is matched from the information of at least two administrative area according to the identity information;Wherein, it is stored in first storage organization multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;First address is standardized according to first administrative area information.
Description
Technical field
The present invention relates to technical field of data processing more particularly to the standardization processing methods and device of a kind of address.
Background technique
Client is needed to fill in many personal information in credit industry, transacting business, including such as home address, unit
The associated address informations such as location.
Currently, being usually written only to the other administrative area of lower levels such as area's grade, at county level when client's fill address, there is writing not
For that is, place name has overlapping, address institute is cannot be distinguished in complete situation, lower for different provinces and cities point of area, the county for having a same names
Belong to provinces and cities, is relatively also easy to produce and obscures.Address instruction is unclear, and influence related service handles progress.
Summary of the invention
The present invention provides the standardization processing method and device of a kind of address, to solve address existing in the prior art
Indicate unclear problem.
In a first aspect, the embodiment of the invention provides a kind of standardization processing methods of address, comprising:
The first information of user is obtained, the first information includes the first address and identity information, the identity information packet
Include at least one information with instruction ownership place function;
When determining includes at least two administrative area information with first address matching in the first storage organization, according to
The identity information matches the first administrative area information from the information of at least two administrative area;Wherein, first storage
It is stored in structure multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;
First address is standardized according to first administrative area information.
In an optional implementation manner, the identity information includes at least two letters with instruction ownership place function
Breath;
It is described that first administrative area information is matched from the information of at least two administrative area according to the identity information, packet
It includes:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to includes the second row
Administrative division information and third administrative area information, and in second administrative area information and at least two administrative area information wherein
One matching, when one of matching in third administrative area information and at least two administrative area information, according to institute
The weight of the corresponding identity information of the second administrative area information and the weight of the corresponding identity information of third administrative area information are stated,
Determine the confidence level of each administrative area information in the information of at least two administrative area;
The maximum administrative area information of confidence level in the information of at least two administrative area is determined as first administrative area
Information.
In an optional implementation manner, it is described according to the identity information from the information of at least two administrative area
Match the first administrative area information, comprising:
When in the administrative area information in the administrative area that at least two information that the identity information includes are belonged to there is only with
When one of them matched fourth line administrative division information in the information of at least two administrative area, by least two administrative area
It is determined as first administrative area information with the administrative area information of fourth line administrative division information matches in information.
In an optional implementation manner, it is described according to the identity information from the information of at least two administrative area
Match the first administrative area information, comprising:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to is fifth line
When the information of administrative division, it will be determined in the information of at least two administrative area with the administrative area information of fifth line administrative division information matches
For first administrative area information.
In an optional implementation manner, the identity information includes an information with instruction ownership place function;
The first administrative area information is matched from the information of at least two administrative area according to the identity information, comprising:
Determine the 6th administrative area information in the administrative area that the identity information is belonged to;
The first row with the 6th administrative area information matches is determined from the information of at least two administrative area
Administrative division information.
In an optional implementation manner, the information with instruction ownership place function is the ID card No. of user, electricity
The location information that words number or user are currently located.
Second aspect, the embodiment of the present invention provide a kind of standardization device of address, comprising:
Module is obtained, for obtaining the first information of user, the first information includes the first address and identity information, institute
Stating identity information includes the information that at least one has instruction ownership place function;
Matching module, for working as at least two administration determined include in the first storage organization with first address matching
When area's information, the first administrative area information is matched from the information of at least two administrative area according to the identity information;Wherein,
It is stored in first storage organization multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;
Processing module, for being standardized according to first administrative area information to first address.
In an optional implementation manner, the identity information includes at least two letters with instruction ownership place function
Breath;
The matching module is matching the first row from the information of at least two administrative area according to the identity information
Administrative division information, is specifically used for:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to includes the second row
Administrative division information and third administrative area information, and in second administrative area information and at least two administrative area information wherein
One matching, when one of matching in third administrative area information and at least two administrative area information, according to institute
The weight of the corresponding identity information of the second administrative area information and the weight of the corresponding identity information of third administrative area information are stated,
Determine the confidence level of each administrative area information in the information of at least two administrative area;
The maximum administrative area information of confidence level in the information of at least two administrative area is determined as first administrative area
Information.
In an optional implementation manner, the matching module, it is described according to the identity information from it is described at least
The first administrative area information is matched in two administrative area information, is specifically used for:
When in the administrative area information in the administrative area that at least two information that the identity information includes are belonged to there is only with
When one of them matched fourth line administrative division information in the information of at least two administrative area, by least two administrative area
It is determined as first administrative area information with the administrative area information of fourth line administrative division information matches in information.
In an optional implementation manner, the matching module, it is described according to the identity information from it is described at least
The first administrative area information is matched in two administrative area information, is specifically used for:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to is fifth line
When the information of administrative division, it will be determined in the information of at least two administrative area with the administrative area information of fifth line administrative division information matches
For first administrative area information.
In an optional implementation manner, the identity information includes an information with instruction ownership place function;
The matching module is matching the first row from the information of at least two administrative area according to the identity information
Administrative division information, is specifically used for:
Determine the 6th administrative area information in the administrative area that the identity information is belonged to;
The first row with the 6th administrative area information matches is determined from the information of at least two administrative area
Administrative division information.
In an optional implementation manner, the information with instruction ownership place function is the ID card No. of user, electricity
The location information that words number or user are currently located.
The third aspect, the embodiment of the present invention provide a kind of standardization device of address, comprising:
Memory and processor;
Memory, for storing program instruction;
Processor executes first aspect according to the program of acquisition for calling the program instruction stored in the memory
Any implementation described in method.
Fourth aspect, the embodiment of the present invention provide a kind of computer readable storage medium, the computer-readable storage medium
Matter is stored with computer instruction, when the computer instruction is run on computers, so that computer executes first aspect
Method described in any implementation.
In the embodiment of the present invention, when getting the first address included by the first information of user and identity information, first
It is multiple for characterizing the administrative area information of subordinate relation between multistage administrative area based on being stored with, it filters out in the first storage organization
Including at least two administrative area information with first address matching, then believed according to identity information from least two administrative areas
The first administrative area information is matched in breath, and then the first address is standardized according to the first administrative area information, namely
The first address is supplemented completely according to the first administrative area information, avoids influencing doing for related service since address instruction is not known
Reason progress can effectively promote business handling efficiency.
Detailed description of the invention
Fig. 1 is a kind of flow diagram of the standardization processing method of address provided in an embodiment of the present invention;
Fig. 2 is a kind of structural schematic diagram of first storage organization provided in an embodiment of the present invention;
Fig. 3 is the flow diagram of the standardization processing method of another address provided in an embodiment of the present invention;
Fig. 4 is a kind of structural block diagram of the standardization device of address provided in an embodiment of the present invention;
Fig. 5 is the structural schematic diagram of the standardization device of another address provided in an embodiment of the present invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete
Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on
Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other
Embodiment shall fall within the protection scope of the present invention.
It should be noted that it is multiple involved in the present invention, refer to two or more."and/or", description association pair
The incidence relation of elephant, indicate may exist three kinds of relationships, for example, A and/or B, can indicate: individualism A, exist simultaneously A and
These three situations of B, individualism B.Character "/" typicallys represent the relationship that forward-backward correlation object is a kind of "or".Additionally, it should manage
Solution, although may describe each data using term first, second etc. in embodiments of the present invention, these data be should not necessarily be limited by
These terms.These terms are only used to for each data being distinguished from each other out.
Client is needed to fill in many personal information when client handles some business, including such as home address, unit
The associated address informations such as location.Currently, being usually written only to the other administrative area of lower levels such as area's grade, at county level when client's fill address, depositing
Incomplete situation is being write, is especially encountering place name overlapping, when there are area, the county of same names in lower point of different provinces and cities, address is endless
Whole be relatively also easy to produce is obscured, for example, Beijing and the Changchun City Chaoyang District Jun You, if client's fill address is " Chaoyang District people
People masses road 666 ", may result in can not clearly be " Chaoyang District, Beijing City people road 666 " or " Jilin governor
The Chaoyang District Chun Shi people road 666 ".Address instruction is unclear, and influence related service handles progress.
Based on this, the embodiment of the present invention provides the standardization processing method and device of a kind of address, to solve existing skill
The unclear problem of the instruction of address present in art.Wherein, method and apparatus be based on the same inventive concept, due to method and
The principle that device solves the problems, such as is similar, therefore the implementation of method and apparatus can be with cross-reference, and overlaps will not be repeated.
Referring to Fig. 1, the embodiment of the invention provides a kind of standardization processing methods of address, comprising:
Step S101, obtains the first information of user, and the first information includes the first address and identity information, identity information packet
Include at least one information with instruction ownership place function.
When it is implemented, can be from user's transacting business when such as name, age, identification card number, home address, the hand filled in
It in the personal information such as machine number, business address, determines to fill in imperfect, needs to carry out Address Standardization processing relatively
Location information, namely obtain aforementioned first address, such as home address, unit address etc.;And determine have in personal information
The information of ownership place function, such as identification card number, cell-phone number are indicated, to acquire the identity information of aforementioned user.
Step S102 includes at least two administrative area information with the first address matching in the first storage organization when determining
When, the first administrative area information is matched from least two administrative area information according to identity information.
Wherein, multiple administrative area letters for characterizing subordinate relation between multistage administrative area are stored in the first storage organization
Breath;The the first administrative area information matched from least two administrative area information according to identity information and aforementioned identity information institute
The ownership place of instruction has incidence relation.
Step S103 is standardized the first address according to the first administrative area information.
In the embodiment of the present invention, when getting the first address included by the first information of user and identity information, first
It is multiple for characterizing the administrative area information of subordinate relation between multistage administrative area based on being stored with, it filters out in the first storage organization
Including at least two administrative area information with first address matching, then believed according to identity information from least two administrative areas
The first administrative area information is matched in breath, and then the first address is standardized according to the first administrative area information, namely
The first address is supplemented completely according to the first administrative area information, avoids influencing doing for related service since address instruction is not known
Reason progress can effectively promote business handling efficiency.
In a kind of optional embodiment, administrative area rank construction that tree is divided for China can be used can
The structure tree for characterizing subordinate relation between multistage administrative area, using the structure tree as the first storage organization.For ease of understanding, referring to
Fig. 2, the embodiment of the invention provides a kind of structural schematic diagram of first storage organization 200, which is in tree-shaped
Structure shape, first storage organization 200 include by root node to the mulitpath of multiple leaf nodes, the corresponding instruction of each path
One administrative area information, indicate between country and its lower point of 3 grades of administrative areas (respectively province/municipality directly under the Central Government, city, area/county) from
Category relationship, wherein root node indicates that country, the expression of intermediary tree node save/are directly under the jurisdiction of city-level administrative area or grade administrative area, leaf node
Indicate area/administrative areas at the county level.
Specifically, Fig. 2 illustrates the root node with China to indicate country;It is with the Chinese Jilin Province Xia Fen, Beijing
It indicates to save/be directly under the jurisdiction of tree node among the level-one in city-level administrative area;It is to indicate city-level administration with the Jilin Province Jilin Xia Fen, Changchun
Tree node among the second level in area;Using Beijing Chaoyang District Xia Fen, Haidian District and the Changchun Chaoyang District Xia Fen, Nongan County as table
Show area/administrative areas at the county level leaf node." China → Beijing → Chaoyang District " indicates by root node (China) to leaf in Fig. 2
One paths of child node (Chaoyang District).
It should be noted that only being shown in Fig. 2 a kind of using the minimum area/administrative areas at the county level that are divided into as the tree of leaf node
The first storage organization for type structure, when it is implemented, township level administrative area can also be assigned to down, or be specifically divided into area/
Street name, number at county level etc. are used as leaf node, are based on this, are written only to street in fill address for some users
The case where, such as " people road 666 ", it can also in the manner described above, matching and the " people from the first storage organization
The corresponding destination node in road " determines the corresponding father and son's link of destination node, then to obtain its corresponding administrative area information.
In addition, the first address that can also fill according to user updates aforementioned first and deposits in a kind of optional embodiment
Storage structure, such as the first address " Haidian District people road 666 " that user A is filled in is got, extract wherein characterization row
The target keyword of administrative division: " Haidian District ", " people road " judges that there is only destination node " Haidian in the first storage organization
" people road " may be not present in area ", is determining the corresponding administration in the first address according to " Haidian District " corresponding father and son's link
After area's information, " people road " can newly be added into the first storage organization the next stage node of " Haidian District ", to realize to the
The update of one storage organization operates.
Based on the first storage organization 200 shown in Fig. 2, in a kind of optional embodiment, the first storage knot is being determined
When including at least two administrative area information with the first address matching in structure, following manner implementation specifically can refer to:
A1 carries out word segmentation processing to the first address, obtains at least one corresponding keyword of the first address.
When it is implemented, participle tool or participle application can be used, such as jieba participle divides the first address
Word processing can carry out word segmentation processing to the first address of different user, it is corresponding at least to obtain each first address after word segmentation processing
One keyword, the first address to user A as shown in Table 1 below: after " Haidian District people road 666 " word segmentation processing
Obtain 4 keywords " Haidian District ", " people road ", " 666 ", " number ":
Table 1
User | First address | After participle |
A | Haidian District people road 666 | Haidian District/people road/No. 666/ |
B | Southern exposure people road 666 | Towards the sunlight/people road/No. 666/ |
C | Chaoyang District people road 666 | Chaoyang District/people road/No. 666/ |
A2 extracts the target keyword for characterizing administrative area at least one keyword, matches from the first storage organization
At least two destination nodes corresponding with target keyword.
When it is implemented, the mode that fuzzy matching can be used matched from the first storage organization it is corresponding with target keyword
At least two destination nodes.By taking the corresponding keyword in the first address of user B in table 1 as an example, extracts target therein and close
Keyword are as follows: " southern exposure ", can fuzzy matching to two destination nodes in the first storage organization, respectively first path: " China →
Leaf node " Chaoyang District " and the second path in Beijing → Chaoyang District ": " China → Jilin Province → Changchun → southern exposure
Leaf node " Chaoyang District " in area ".It should be noted that the rank based on administrative area indicated by target keyword is different,
The destination node matched is not limited in indicating area/administrative areas at the county level leaf node in Fig. 2, is also possible to more advanced administration
Area, tree node among second level corresponding to such as city-level administrative area, is not restricted herein.
A3 determines the corresponding administrative area information of at least two destination nodes institute in the first storage organization.
It, can be shown in Fig. 2 after determining destination node " Chaoyang District " in the example of the first address of above-mentioned user B
The first storage organization 200 namely structure tree in, with " Chaoyang District " be starting point traverse up it superior node (also referred to as, father save
Point), until traversing root node (China), it may be determined that go out two candidate parent subchains corresponding with " Chaoyang District " namely two
A administrative area information is respectively as follows: " China → Beijing " and " China → Jilin Province → Changchun ".
Further, first when matching the first administrative area information from least two administrative area information according to identity information
First determine identity information in include the number of species with the information of ownership place function, then according in identity information have ownership
The information of ground function matches the first administrative area information from least two administrative area information.
Wherein, the information with instruction ownership place function can be ID card No., telephone number or the user of user
The location information being currently located.When practical application, if user is filled out in the paper form that the business hall of transacting business provides
Its personal information is write, then the location information that aforementioned user is currently located is geographical location locating for business hall;If user is logical
It crosses network progress business handling and fills in its personal information, then the location information that aforementioned user is currently located is that user's used terminal is set
Standby IP address.
To be convenient to carry out, the embodiment of the present invention is for the information with instruction ownership place function included in identity information
Number of species are different, and the method for dividing briefing to match the first administrative area information from least two administrative area information is as follows:
(1) the first situation: identity information includes at least two information with instruction ownership place function.
In a kind of optional embodiment, the first administration is matched from least two administrative area information according to identity information
Area's information, comprising:
B1, according at least two administrative area information, filter out that aforementioned identity information includes N number of has instruction ownership place function
K information for meeting first condition in the information of energy, according to the corresponding identity of each administrative area information in K administrative area information
The weight of information determines the confidence level of each administrative area information at least two administrative area information;
Wherein, N >=2,1≤K≤N, N, K are natural number, first condition are as follows: have the information institute of instruction ownership place function
Indicate that the administrative area of ownership matches with one of them at least two administrative area information.When it is implemented, above-mentioned identity letter
The weight of breath can be predefined according to the significance level including the different information with instruction ownership place function, for example be set
The weight for setting ID card No. is " 1 ", and the weight for the location information that user is currently located is " 1 ", and the weight of telephone number is
" 0.5 " etc..
When it is implemented, the administrative area information in the administrative area for example belonged to when at least two information that identity information includes
Including the second administrative area information and third administrative area information, in the second administrative area information and at least two administrative area information wherein
One matching can be according to the second administration when one of matching in third administrative area information and at least two administrative area information
The weight of the corresponding identity information of area's information and the weight of the corresponding identity information of third administrative area information determine at least two
The confidence level of each administrative area information in a administrative area information;Wherein, at least two administrative area information with third administrative area
One administrative area information of information matches, and can be same with an administrative area information of fourth line administrative division information matches,
It is also possible to different one.
The maximum administrative area information of confidence level at least two administrative area information is determined as the first administrative area letter by B2
Breath.
For ease of understanding, the embodiment of the present invention includes that identification card number, telephone number and user are currently located with identity information
Location information, for determining the corresponding first administrative area information in the first address of aforementioned user B, to being provided in above-mentioned B1~B2
Mode be illustrated.
According to identity information from the first storage organization, corresponding two candidate parents in the first address of aforementioned user B are matched
Subchain, i.e. two administrative area information, respectively " China → Beijing " and " China → Jilin Province → Changchun ", from the two rows
Administrative division information determines the first administrative area information, can implement in the following manner:
C1 parses identification card number, telephone number and user's present position information respectively, obtains each information and return
The administrative area information in the administrative area of category is determined full in each information according to the administrative area information in the administrative area that each information is belonged to
The identity information of sufficient first condition, as shown in table 2 below:
Table 2
Determine that the identity information for meeting first condition in each information is telephone number, user's present position information.
C2, according to the weight for the identity information for meeting first condition, two administrative areas corresponding to the first address of user B
Information is voted, and one administrative area information of who gets the most votes in two administrative area information is determined as to the first ground of the user B
The corresponding first administrative area information in location.
When it is implemented, judgement parses the administrative area information and two administrative area information in the administrative area of identity information ownership
In which matching, just the corresponding ticket of the weight of the identity information is thrown to corresponding administrative area information.Such as telephone number belongs to
The administrative area information in administrative area matched with " China → Beijing ", then according to weight shared by telephone number, for " China → north
Jing Shi " record gained vote " 0.5 ";User's present position information ownership administrative area administrative area information also with " China → north
Jing Shi " matching is won the vote " 1 ", most then according to weight shared by user's present position information for " China → Beijing " record
Determine that the gained vote of " China → Beijing " adds up to " 1.5 " eventually, the gained vote of " China → Jilin Province → Changchun " adds up to " 0 ".
It is as shown in table 3 below:
Table 3
Candidate parent subchain | Gained vote |
" China → Beijing " | 1.5(0.5+1) |
" China → Jilin Province → Changchun " | 0 |
Wherein, " China → Beijing " who gets the most votes, confidence level is maximum, it is determined that " China → Beijing " is first administrative
Area's information.
In another optional embodiment, the first row is matched from least two administrative area information according to identity information
Administrative division information, comprising:
When in the administrative area information in the administrative area that at least two information that identity information includes are belonged to there is only at least
When one of them matched fourth line administrative division information in two administrative area information, by least two administrative area information with the 4th
The administrative area information of administrative area information matches is determined as the first administrative area information.
As the following table 4 determines the example of its corresponding first administrative area information, identity information packet for the first address of user B
In the location information that identification card number, telephone number and the user included is currently located, there is only user's present position information to return
The administrative area information in the administrative area of category can be matched with " China → Beijing " at least two administrative area information, then will
" China → Beijing " is determined as the corresponding first administrative area information in the first address of user B.
Table 4
In another optional embodiment, the first row is matched from least two administrative area information according to identity information
Administrative division information, comprising:
When the administrative area information in the administrative area that at least two information that identity information includes are belonged to is fifth line administrative division
When information, the administrative area information at least two administrative area information with fifth line administrative division information matches is determined as the first administrative area
Information.
As the following table 5 determines the example of its corresponding first administrative area information, identity information packet for the first address of user B
The administrative area information in the administrative area that the location information that identification card number, telephone number and the user included is currently located belongs to respectively is wrapped
Containing " Beijing ", instruction is fifth line administrative division information " China → Beijing ", can be at least two administrative area information
In " China → Beijing " matched, then " China → Beijing " is determined as to the corresponding the first row in the first address of user B
Administrative division information.
Table 5
(2) second situation: identity information includes an information with instruction ownership place function.
The first administrative area information is matched from least two administrative area information according to identity information, comprising:
Determine the 6th administrative area information in the administrative area that identity information is belonged to;It is determined from least two administrative area information
Out with first administrative area information of the 6th administrative area information matches.
Such as the information with instruction ownership place function that identity information includes is identification card number, the identification card number institute
The administrative area information in the administrative area of ownership includes " Beijing " then can be corresponding two from its first address for user B
Administrative area information: " China → Beijing " and " China → Jilin Province → Changchun ", from the two administrative areas, information determines
One administrative area information is " China → Beijing ".
Further, when in the first storage organization 200 shown in Fig. 2 exist and the first address matching unique administrative area believe
Breath, directly can be determined as aforementioned first administrative area information for unique administrative area information.Such as the first address to aforementioned user A
After word segmentation processing, obtaining and wherein characterizing the target keyword in administrative area is " Haidian District ", determines the first storage organization shown in Fig. 2
In 200 existence anduniquess description " Haidian District " destination node, then with " Haidian District " be starting point traverse up it superior node (
Claim, father node), until traversing root node (China), it may be determined that go out administrative area corresponding with " Haidian District " information, are as follows:
" China → Beijing ", can directly determine " China → Beijing " is aforementioned first administrative area information.
Further, in a kind of optional embodiment, the first address is standardized according to from the first administrative area information
Processing, specifically includes:
By the first administrative area information supplement to the first address, so that it is complete, such as " China → Beijing " is added to
In the first address of user A, sufficient address is obtained are as follows: " city, BeiJing, China, people road, Haidian District 666 ", alternatively, will
The information supplement after national root node information removes is characterized in first administrative area information to the first address, such as to user A's
The processing of first Address Standardization, obtains " Haidian District, Beijing City people road 666 ".
For ease of understanding, referring to Fig. 3, the embodiment of the invention also provides a kind of standardization processing methods of address, comprising:
Step S301 obtains the first address and the identity information of user.
Identity information includes the ID card No. filled in when filling in personal information by user's transacting business, telephone number,
And obtain the location information of user being currently located certainly according to the place based on user's transacting business, for example, user be
Business hall scene transacting business, the then location information that aforementioned user is currently located are the geographical location of business hall;If user is
By network transacting business, then the location information that aforementioned user is currently located is the IP address of user's used terminal equipment.
Step S302 obtains the target keyword for characterizing administrative area to the first address dividing.
Step S303 judges to whether there is and the matched tree node of target keyword in structure tree;If so, executing step
S304 or step S305, if not, terminating;Wherein, structure tree, that is, aforementioned first storage organization 200 shown in Fig. 2, tree node
It can be intermediary tree node, be also possible to leaf node.
Step S304, when there is unique tree node matched with target keyword in structure tree, according to unique tree node
The administrative area information of place path instruction is standardized the first address.
Step S305, when in structure tree including at least two administrative areas information matched with target keyword, according to body
Part information matches the first administrative area information from least two administrative area information, according to the first administrative area information to the first address
It is standardized.
In the present embodiment, matched and the first address from the structure tree for being stored with subordinate relation between multistage administrative area first
Corresponding administrative area information, when the administrative area information matched is unique, according to the administrative area information matched to the first address
It is standardized;When the administrative area information matched is not unique, when being at least two administrative area information, according to identity information
It determines and matches the first administrative area information at least two administrative area information, and then according to the first administrative area information to the first address
It is standardized.That is filled in when taking full advantage of user's transacting business has the identity information of instruction ownership place function,
Such as ID card No., telephone number, and the identity information that non-user is filled in is obtained, such as user's transacting business used terminal
IP address combines various identity informations during standardized address, determines most believable first for standardized address
Administrative area information, it is relatively reasonable, the first address is supplemented completely, is able to solve and exists in the prior art what address instruction was not known
Problem can effectively promote the efficiency of business handling.
A kind of corresponding above-mentioned standardization processing method of address, referring to fig. 4, the embodiment of the present invention provides a kind of mark of address
Standardization processing unit 400, comprising:
Module 401 is obtained, for obtaining the first information of user, the first information includes the first address and identity information, body
Part information includes information that at least one has instruction ownership place function.
Matching module 402, for working as at least two administration determined include in the first storage organization with the first address matching
When area's information, the first administrative area information is matched from least two administrative area information according to identity information;Wherein, the first storage
It is stored in structure multiple for characterizing the administrative area information of subordinate relation between multistage administrative area.
Processing module 403, for being standardized according to the first administrative area information to the first address.
In the embodiment of the present invention, when getting the first address included by the first information of user and identity information, first
It is multiple for characterizing the administrative area information of subordinate relation between multistage administrative area based on being stored with, it filters out in the first storage organization
Including at least two administrative area information with first address matching, then believed according to identity information from least two administrative areas
The first administrative area information is matched in breath, and then the first address is standardized according to the first administrative area information, namely
The first address is supplemented completely according to the first administrative area information, avoids influencing doing for related service since address instruction is not known
Reason progress can effectively promote business handling efficiency.
In a kind of optional embodiment, identity information includes at least two information with instruction ownership place function;
Matching module 402 is matching the first administrative area information according to identity information from least two administrative area information,
It is specifically used for:
When the administrative area information in the administrative area that at least two information that identity information includes are belonged to includes the second administrative area
Information and third administrative area information, and one of matching in the second administrative area information and at least two administrative area information, the
When one of matching in three administrative area information and at least two administrative area information, according to the corresponding body of the second administrative area information
Part weight of information and the weight of the corresponding identity information of third administrative area information, determine at least two administrative area information
Each administrative area information confidence level;
The maximum administrative area information of confidence level at least two administrative area information is determined as the first administrative area information.
In a kind of optional embodiment, matching module 402, according to identity information from least two administrative area information
In match the first administrative area information, be specifically used for:
When in the administrative area information in the administrative area that at least two information that identity information includes are belonged to there is only at least
When one of them matched fourth line administrative division information in two administrative area information, by least two administrative area information with the 4th
The administrative area information of administrative area information matches is determined as the first administrative area information.
In a kind of optional embodiment, matching module 402, according to identity information from least two administrative area information
In match the first administrative area information, be specifically used for:
When the administrative area information in the administrative area that at least two information that identity information includes are belonged to is fifth line administrative division
When information, the administrative area information at least two administrative area information with fifth line administrative division information matches is determined as the first administrative area
Information.
In a kind of optional embodiment, identity information includes an information with instruction ownership place function;
Matching module 402 is matching the first administrative area information according to identity information from least two administrative area information,
It is specifically used for:
Determine the 6th administrative area information in the administrative area that identity information is belonged to;
The first administrative area information with the 6th administrative area information matches is determined from least two administrative area information.
In a kind of optional embodiment, the above-mentioned information with instruction ownership place function is the identification card number of user
The location information that code, telephone number or user are currently located.
The corresponding above method, referring to Fig. 5, the embodiment of the invention provides the standardization device 500 of another address,
Include:
Communication interface 501, memory 502 and processor 503;
Wherein, the processor 503 is communicated by the communication interface 501 with other equipment, for example, other equipment
Terminal device used when can be aforementioned user by network transacting business.Processor 503 can be obtained by communication interface 501
Take the IP address of aforementioned terminals equipment;Memory 502, for storing program instruction;Processor 503, for calling the storage
The program instruction stored in device 502 executes the method in above-described embodiment according to the program of acquisition.
In the embodiment of the present application, processor can be general processor, digital signal processor, specific integrated circuit,
Field programmable gate array or other programmable logic device, discrete gate or transistor logic, discrete hardware components,
It may be implemented or execute disclosed each method, step and the logic diagram in the embodiment of the present application.General processor can be
Microprocessor or any conventional processor etc..The step of method in conjunction with disclosed in the embodiment of the present application, can directly embody
Execute completion for hardware processor, or in processor hardware and software module combination execute completion.
In the embodiment of the present application, memory can be nonvolatile memory, such as hard disk (hard disk drive,
HDD) or solid state hard disk (solid-state drive, SSD) etc., it can also be volatile memory (volatile
), such as random access memory (random-access memory, RAM) memory.Memory, which can also be, can be used in taking
Band or storage have the desired program code of instruction or data structure form and can be by any other Jie of computer access
Matter, but not limited to this.Memory in the embodiment of the present application can also be circuit or other arbitrarily can be realized store function
Device, for storing program instruction and/or data.Do not limited in the embodiment of the present application above-mentioned communication interface, memory and
Specific connection medium between processor, such as bus, bus can be divided into address bus, data/address bus, control bus etc..
Further, the embodiment of the invention provides a kind of computer readable storage medium, the computer readable storage mediums
It is stored with computer instruction, when computer instruction is run on computers, so that computer executes the above method.
It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method, system or computer program
Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the present invention
Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the present invention, which can be used in one or more,
The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces
The form of product.
The present invention be referring to according to the method for the embodiment of the present invention, the process of equipment (system) and computer program product
Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions
The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs
Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce
A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real
The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy
Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates,
Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or
The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting
Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or
The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one
The step of function of being specified in a box or multiple boxes.
Although preferred embodiments of the present invention have been described, it is created once a person skilled in the art knows basic
Property concept, then additional changes and modifications may be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as
It selects embodiment and falls into all change and modification of the scope of the invention.
Obviously, various changes and modifications can be made to the invention without departing from model of the invention by those skilled in the art
It encloses.In this way, if these modifications and changes of the present invention is within the scope of the claims of the present invention and its equivalent technology, then
The present invention is also intended to include these modifications and variations.
Claims (14)
1. a kind of standardization processing method of address characterized by comprising
The first information of user is obtained, the first information includes the first address and identity information, and the identity information includes extremely
Few one has the information of instruction ownership place function;
When determining includes at least two administrative area information with first address matching in the first storage organization, according to described
Identity information matches the first administrative area information from the information of at least two administrative area;Wherein, first storage organization
In be stored with it is multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;
First address is standardized according to first administrative area information.
2. the method as described in claim 1, which is characterized in that the identity information includes at least two with instruction ownership place
The information of function;
It is described that first administrative area information is matched from the information of at least two administrative area according to the identity information, comprising:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to includes the second administrative area
Information and third administrative area information, and one of them in second administrative area information and at least two administrative area information
Matching, when one of matching in third administrative area information and at least two administrative area information, according to described the
The weight of the corresponding identity information of two administrative area information and the weight of the corresponding identity information of third administrative area information determine
The confidence level of each administrative area information in the information of at least two administrative area;
The maximum administrative area information of confidence level in the information of at least two administrative area is determined as first administrative area information.
3. method according to claim 2, which is characterized in that described administrative from described at least two according to the identity information
The first administrative area information is matched in area's information, comprising:
When in the administrative area information in the administrative area that at least two information that the identity information includes are belonged to there is only with it is described
When one of them matched fourth line administrative division information at least two administrative area information, by least two administrative area information
In with the administrative area information of fourth line administrative division information matches be determined as first administrative area information.
4. method according to claim 2, which is characterized in that described administrative from described at least two according to the identity information
The first administrative area information is matched in area's information, comprising:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to is fifth line administrative division
When information, the administrative area information in the information of at least two administrative area with fifth line administrative division information matches is determined as institute
State the first administrative area information.
5. the method as described in claim 1, which is characterized in that the identity information, which includes one, to be had the function of to indicate ownership place
Information;
The first administrative area information is matched from the information of at least two administrative area according to the identity information, comprising:
Determine the 6th administrative area information in the administrative area that the identity information is belonged to;
First administrative area with the 6th administrative area information matches is determined from the information of at least two administrative area
Information.
6. the method according to claim 1 to 5, which is characterized in that the information with instruction ownership place function is user
The location information that is currently located of ID card No., telephone number or user.
7. a kind of standardization device of address characterized by comprising
Module is obtained, for obtaining the first information of user, the first information includes the first address and identity information, the body
Part information includes information that at least one has instruction ownership place function;
Matching module, for when determine the first storage organization in include and at least two administrative areas of first address matching believe
When breath, the first administrative area information is matched from the information of at least two administrative area according to the identity information;Wherein, described
It is stored in first storage organization multiple for characterizing the administrative area information of subordinate relation between multistage administrative area;
Processing module, for being standardized according to first administrative area information to first address.
8. device as claimed in claim 7, which is characterized in that the identity information includes at least two with instruction ownership place
The information of function;
The matching module is matching the first administrative area from the information of at least two administrative area according to the identity information
Information is specifically used for:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to includes the second administrative area
Information and third administrative area information, and one of them in second administrative area information and at least two administrative area information
Matching, when one of matching in third administrative area information and at least two administrative area information, according to described the
The weight of the corresponding identity information of two administrative area information and the weight of the corresponding identity information of third administrative area information determine
The confidence level of each administrative area information in the information of at least two administrative area;
The maximum administrative area information of confidence level in the information of at least two administrative area is determined as first administrative area information.
9. device as claimed in claim 8, which is characterized in that the matching module, it is described according to the identity information from
The first administrative area information is matched in the information of at least two administrative area, is specifically used for:
When in the administrative area information in the administrative area that at least two information that the identity information includes are belonged to there is only with it is described
When one of them matched fourth line administrative division information at least two administrative area information, by least two administrative area information
In with the administrative area information of fourth line administrative division information matches be determined as first administrative area information.
10. device as claimed in claim 8, which is characterized in that the matching module, it is described according to the identity information from
The first administrative area information is matched in the information of at least two administrative area, is specifically used for:
When the administrative area information in the administrative area that at least two information that the identity information includes are belonged to is fifth line administrative division
When information, the administrative area information in the information of at least two administrative area with fifth line administrative division information matches is determined as institute
State the first administrative area information.
11. device as claimed in claim 7, which is characterized in that the identity information, which includes one, has instruction ownership place function
The information of energy;
The matching module is matching the first administrative area from the information of at least two administrative area according to the identity information
Information is specifically used for:
Determine the 6th administrative area information in the administrative area that the identity information is belonged to;
First administrative area with the 6th administrative area information matches is determined from the information of at least two administrative area
Information.
12. such as the described in any item devices of claim 7-11, which is characterized in that the information with instruction ownership place function is to use
The location information that ID card No., telephone number or the user at family are currently located.
13. a kind of standardization device of address characterized by comprising
Memory and processor;
Memory, for storing program instruction;
Processor requires 1~6 according to the program execution benefit of acquisition for calling the program instruction stored in the memory
Described in any item methods.
14. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage has computer to refer to
It enables, when the computer instruction is run on computers, so that computer perform claim requires described in any one of 1~6
Method.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910639806.8A CN110413715A (en) | 2019-07-16 | 2019-07-16 | A kind of standardization processing method and device of address |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910639806.8A CN110413715A (en) | 2019-07-16 | 2019-07-16 | A kind of standardization processing method and device of address |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110413715A true CN110413715A (en) | 2019-11-05 |
Family
ID=68361634
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910639806.8A Pending CN110413715A (en) | 2019-07-16 | 2019-07-16 | A kind of standardization processing method and device of address |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110413715A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110060763A1 (en) * | 2009-09-09 | 2011-03-10 | Denso Corporation | Address search device and method for searching address |
CN104572992A (en) * | 2015-01-06 | 2015-04-29 | 武汉工程大学 | Multi-constraint reasoning based standardization method for internet geographical location information |
CN105069560A (en) * | 2015-07-30 | 2015-11-18 | 中国科学院软件研究所 | Resume information extraction and characteristic identification analysis system and method based on knowledge base and rule base |
CN105335864A (en) * | 2015-11-13 | 2016-02-17 | 小米科技有限责任公司 | Display method, apparatus and system for secondary address information |
-
2019
- 2019-07-16 CN CN201910639806.8A patent/CN110413715A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110060763A1 (en) * | 2009-09-09 | 2011-03-10 | Denso Corporation | Address search device and method for searching address |
CN104572992A (en) * | 2015-01-06 | 2015-04-29 | 武汉工程大学 | Multi-constraint reasoning based standardization method for internet geographical location information |
CN105069560A (en) * | 2015-07-30 | 2015-11-18 | 中国科学院软件研究所 | Resume information extraction and characteristic identification analysis system and method based on knowledge base and rule base |
CN105335864A (en) * | 2015-11-13 | 2016-02-17 | 小米科技有限责任公司 | Display method, apparatus and system for secondary address information |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104050196B (en) | A kind of interest point data redundant detecting method and device | |
CN109033086A (en) | A kind of address resolution, matched method and device | |
CN103678708B (en) | Method and device for recognizing preset addresses | |
CN109598509A (en) | The recognition methods of risk clique and device | |
CN106202028B (en) | A kind of address information recognition methods and device | |
CN106156145A (en) | The management method of a kind of address date and device | |
CN104965876B (en) | A kind of method and device carrying out the excavation of user job unit based on location information | |
CN108304484A (en) | Key word matching method and device, electronic equipment and readable storage medium storing program for executing | |
CN103970747B (en) | Data processing method for network side computer to order search results | |
CN107247791B (en) | Parking lot map data generation method and device and machine-readable storage medium | |
CN102262664A (en) | Quality estimating method and quality estimating device | |
CN106162544A (en) | A kind of generation method and apparatus of geography fence | |
CN107395680A (en) | Shop group's information push and output intent and device, equipment | |
CN105992171A (en) | Text information processing method and device | |
CN107404486A (en) | Parse method, apparatus, terminal device and the storage medium of Http data | |
CN106598946A (en) | Content extracting method and device | |
CN106682100A (en) | Data statistical method and system based on Hbase database | |
CN106503045B (en) | A kind of method and device updating template library | |
CN110516713A (en) | A kind of target group's recognition methods, device and equipment | |
CN107644366A (en) | Order fraud recognition methods, system, storage medium and electronic equipment | |
CN108648017A (en) | It is easy to user demand matching process, device, equipment and the storage medium of extension | |
CN110413715A (en) | A kind of standardization processing method and device of address | |
CN104462392B (en) | Share the statistical method and device of capacity of returns | |
CN110362646A (en) | Processing method and processing device, storage medium and the electronic device of address information | |
CN109582834A (en) | Data Risk Forecast Method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191105 |
|
RJ01 | Rejection of invention patent application after publication |