CN110445783B - Geographic position information standardization method and device based on client and entity - Google Patents

Info

Publication number
CN110445783B
CN110445783B CN201910726746.3A CN201910726746A CN110445783B CN 110445783 B CN110445783 B CN 110445783B CN 201910726746 A CN201910726746 A CN 201910726746A CN 110445783 B CN110445783 B CN 110445783B
Authority
CN
China
Prior art keywords
entity
node
position information
geographical position
information table
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910726746.3A
Other languages
Chinese (zh)
Other versions
CN110445783A (en
Inventor
张顺龙
王占一
卢维清
李从宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qax Technology Group Inc
Original Assignee
Qax Technology Group Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qax Technology Group Inc
Priority to CN201910726746.3A
Publication of CN110445783A
Application granted
Publication of CN110445783B
Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/10 Network architectures or network communication protocols for network security for controlling access to devices or network resources
    • H04L63/107 Network architectures or network communication protocols for network security for controlling access to devices or network resources wherein the security policies are location-dependent, e.g. entities privileges depend on current location or allowing specific operations only from locally connected terminals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/50 Network services
    • H04L67/52 Network services specially adapted for the location of the user terminal

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Hardware Design (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer And Data Communications (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

An embodiment of the invention provides a method and a device for standardizing geographic position information based on clients and entities. The method comprises: acquiring a first geographical position information table for a preset number of clients, and generating a second geographical position information table according to entity behavior information; establishing an interactive relationship graph of client nodes and entity nodes, and determining the connection weights between the client nodes and the entity nodes in the graph according to the entity behavior information; and continuously and alternately updating the two tables according to the first geographical position information table, the second geographical position information table, the connection weights, and an update rule, and, once an update stop condition is met, outputting the updated first and second geographical position information tables as the geographic position information. In this way, an entity relationship graph is constructed entirely from the industry's business log data, and position updating is used to mine the position information of the various entities in the business and to represent it in a unified, standardized form.

Description

Geographic position information standardization method and device based on client and entity
Technical Field
The invention relates to the technical field of information processing, in particular to a geographic position information standardization method and device based on clients and entities.
Background
With the application and development of internet technology across industries, fraud, theft, and intrusion crimes against industry customers have become rampant, and customer information leaks and online financial fraud cases are frequent. In recent years security threats have changed greatly: APT attacks and emerging threats are increasing, posing new challenges for the risk control systems of various industries. The customer's geographic location is a particularly important factor in a risk control system, and many risk factors and machine learning features are extracted from it. However, the customer's geographic location is not directly available; it has to be obtained through location-related entities such as IP addresses, ATMs, POS machines, and industry institution numbers. These entities differ in the obtainability, precision, usability, representation, and degree of standardization of their location information. For example, for a POS machine, an ATM, or an industry institution number, a location code representing the geographic location can be obtained by parsing the corresponding number; this is easy to obtain but imprecise (a POS machine, for instance, may only be accurate to the city level). The physical address of an IP address can be obtained by calling an interface that resolves the IP; the precision is high, but obtainability and usability are poor. This situation greatly limits the use of geographic location information in practical services.
Disclosure of Invention
Aiming at the problems in the prior art, the embodiment of the invention provides a geographic position information standardization method and device based on a client and an entity.
The embodiment of the invention provides a geographic position information standardization method based on clients and entities, which comprises the following steps:
acquiring the identity card information of a preset number of clients, and generating a first geographical position information table according to the identity card information;
acquiring entity behavior information of the client for completing service requirements by using entity equipment within a preset time period, and generating a second geographical position information table according to the entity behavior information;
establishing a corresponding number of client nodes according to a preset number, establishing entity nodes according to entity equipment identifiers, establishing an interactive relationship graph of the client nodes and the entity nodes, and determining the connection weight of the client nodes and the entity nodes in the interactive relationship graph according to the entity behavior information;
and continuously and alternately updating according to the first geographical position information table, the second geographical position information table, the connection weight of the customer node and the entity node in the interactive relation graph and an updating rule until an updating stop condition is met, and outputting the updated first geographical position information table and the updated second geographical position information table as standardized geographical position information.
Optionally, the continuously and alternately updating according to the first geographical location information table, the second geographical location information table, the connection weight between the client node and the entity node in the interaction relationship diagram, and the update rule includes:
updating the second geographical position information table according to the first geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule;
and updating the first geographical position information table according to the updated second geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule.
Optionally, the update rule includes:
the first geographical position information table comprises a customer ID, a customer K-level original geographical position and a customer K-level actual geographical position;
the second geographical position information table comprises an entity ID and an entity K-level geographical position;
determining a target node according to the first geographical position information table and the second geographical position information table, wherein the target node is a client node or an entity node;
when the K-th level geographic position of the target node is to be updated, obtaining a node candidate set of the target node, wherein the node candidate set is drawn from the neighbor nodes of the target node; when the target node is a client node, the neighbor nodes are entity nodes; when the target node is an entity node, the neighbor nodes are client nodes; and the node candidate set consists of those neighbor nodes whose (K-1)-th level geographic position is the same as the updated (K-1)-th level geographic position of the target node;
determining a geographical position candidate set according to the K-th level geographical position of each node in the node candidate set;
calculating, for each geographical position in the geographical position candidate set, the sum of the weights of the nodes in the node candidate set located at that position;
and selecting the geographical position with the largest weight sum as the K-th level geographic position of the target node.
Optionally, the method further comprises: and carrying out availability quantitative analysis processing on the updated first geographical position information table and the updated second geographical position information table to obtain a first geographical address position quantization table and a second geographical position information quantization table, and outputting the first geographical address position quantization table and the second geographical position information quantization table as standardized geographical position information.
The embodiment of the invention provides a geographic position information standardization device based on a client and an entity, which comprises:
a first acquisition module, configured to acquire the identity card information of a preset number of clients and to generate a first geographical position information table according to the identity card information;
a second acquisition module, configured to acquire entity behavior information of the clients completing service requirements by using entity equipment within a preset time period, and to generate a second geographical position information table according to the entity behavior information;
an establishing module, configured to establish a corresponding number of client nodes according to the preset number, establish entity nodes according to entity equipment identifiers, establish an interactive relationship graph of the client nodes and the entity nodes, and determine the connection weights between the client nodes and the entity nodes in the interactive relationship graph according to the entity behavior information;
and a first processing module, configured to continuously and alternately update according to the first geographical position information table, the second geographical position information table, the connection weights between the client nodes and the entity nodes in the interactive relationship graph, and the update rule until the update stop condition is met, and to output the updated first geographical position information table and the updated second geographical position information table as geographical position information.
Optionally, the first processing module is specifically configured to:
updating the second geographical position information table according to the first geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule;
and updating the first geographical position information table according to the updated second geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule.
Optionally, the update rule includes:
the first geographical position information table comprises a customer ID, a customer K-level original geographical position and a customer K-level actual geographical position;
the second geographical position information table comprises an entity ID and an entity K-level geographical position;
determining a target node according to the first geographical position information table and the second geographical position information table, wherein the target node is a client node or an entity node;
when the K-th level geographic position of the target node is to be updated, obtaining a node candidate set of the target node, wherein the node candidate set is drawn from the neighbor nodes of the target node; when the target node is a client node, the neighbor nodes are entity nodes; when the target node is an entity node, the neighbor nodes are client nodes; and the node candidate set consists of those neighbor nodes whose (K-1)-th level geographic position is the same as the updated (K-1)-th level geographic position of the target node;
determining a geographical position candidate set according to the K-th level geographical position of each node in the node candidate set;
calculating, for each geographical position in the geographical position candidate set, the sum of the weights of the nodes in the node candidate set located at that position;
and selecting the geographical position with the largest weight sum as the K-th level geographic position of the target node.
Optionally, the apparatus further comprises a second processing module, configured to: and carrying out availability quantitative analysis processing on the updated first geographical position information table and the updated second geographical position information table to obtain a first geographical address position quantization table and a second geographical position information quantization table, and outputting the first geographical address position quantization table and the second geographical position information quantization table as geographical position information.
An embodiment of the present invention provides an electronic device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the computer program to implement the steps of the above geographic location information standardization method based on clients and entities.
Embodiments of the present invention provide a non-transitory computer readable storage medium having stored thereon a computer program that, when executed by a processor, performs the steps of the above-described method for standardizing geographic location information based on clients and entities.
The invention provides a method and a device for standardizing geographic position information based on clients and entities. A client geographic position information table and an entity geographic position information table are obtained from the industry's business log data; the two tables are continuously and alternately updated, using the interactive relationship graph of client nodes and entity nodes together with an update rule, until an update stop condition is met; and the updated first and second geographical position information tables are output as standardized geographic position information. In this way, an entity relationship graph is constructed entirely from the industry's business log data, and position updating is used to mine the position information of the various entities in the business and to represent it in a unified, standardized form. The method does not depend on the network environment or on a third-party offline IP library, it addresses the difficulty of obtaining position information for some entities and its low precision, and it improves both the usability of position information in the risk control system and the performance of that system.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.
FIG. 1 is a flow chart of an embodiment of a method for standardizing geographic location information based on customers and entities in accordance with the present invention;
FIG. 2 is a diagram of the interaction relationship between a client node and an entity node according to the present invention;
FIG. 3 is a flow chart of another embodiment of a method for standardizing geographic location information based on customers and entities in accordance with the present invention;
FIG. 4 is a block diagram of an embodiment of a geographic location information standardization apparatus based on clients and entities according to the present invention;
FIG. 5 is a block diagram of another embodiment of the geographic location information standardization apparatus based on clients and entities according to the present invention;
FIG. 6 is a block diagram of an embodiment of an electronic device according to the invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 illustrates a method for standardizing geographic location information based on clients and entities according to an embodiment of the present invention, which includes:
S11, acquiring the ID card information of a preset number of clients, and generating a first geographic location information table according to the ID card information.
Regarding step S11, it should be noted that in this embodiment the client's ID card number carries the most readily available and standardized geographic location: its leading digits are the province, city, and district codes, and these codes correspond to a geographic location.
The administrative region code of the geographic location is extracted from the ID card number. According to the coding rule of the ID card number, digits 1-2, 3-4, and 5-6 respectively represent the codes of the province (municipality directly under the central government, autonomous region), the city (prefecture-level city, autonomous prefecture), and the district (district, county, autonomous county, county-level city).
For example, in 110105XXXXXX 2168, 11 represents Beijing, 01 represents the municipal districts, and 05 represents Chaoyang District.
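The digit split described above can be sketched as follows. This is a minimal illustration, assuming the ID card number is available as a string whose first six characters are digits; the names `RegionCode` and `parse_region_code` are hypothetical and do not come from the patent.

```python
from typing import NamedTuple


class RegionCode(NamedTuple):
    province: str  # digits 1-2 of the ID card number
    city: str      # digits 3-4
    district: str  # digits 5-6


def parse_region_code(id_number: str) -> RegionCode:
    """Split the leading six digits of an ID card number into three level codes."""
    prefix = id_number.replace(" ", "")[:6]
    if len(prefix) != 6 or not prefix.isdigit():
        raise ValueError("expected six leading digits")
    return RegionCode(prefix[0:2], prefix[2:4], prefix[4:6])


# The masked example number from the text: 11 / 01 / 05.
print(parse_region_code("110105XXXXXX 2168"))
```

Mapping the codes to place names (for example 11 to Beijing) would additionally need an administrative division lookup table, which is not shown here.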
Usually, the geographic location represented by the ID card number is the client's home location, but the client may work and live elsewhere, so the first geographic location information table needs to include both the client's original geographic location and the client's actual geographic location.
In this embodiment, the first geographic location information table includes a customer ID, the customer's K-level original geographic location, and the customer's K-level actual geographic location. Since the ID card number represents a three-level geographic location, the first geographic location information table of this embodiment may include: client ID (User, a unique number or the ID card number), original first-level geographic location (S1), original second-level geographic location (S2), original third-level geographic location (S3), actual first-level geographic location (L1), actual second-level geographic location (L2), and actual third-level geographic location (L3).
Since the client's actual location cannot yet be determined when the first geographic location information table is initially established, the actual K-level geographic location in the initial table is the same as the original K-level geographic location, as shown in Table 1 below.
Table 1: Initial first geographic location information table
User S1 S2 S3 L1 L2 L3
001 11 01 05 11 01 05
002 11 01 02 11 01 02
003 54 01 01 54 01 01
004 51 10 28 51 10 28
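As a rough sketch of how the initial table might be built, the snippet below copies the original location into the actual location for every client, which is what Table 1 shows. The `ClientRow` structure and the `initial_client_table` helper are hypothetical names, and the six-digit region codes are assumed to have been extracted already.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

Levels = Tuple[str, str, str]


@dataclass
class ClientRow:
    user: str
    original: Levels  # (S1, S2, S3), from the ID card number
    actual: Levels    # (L1, L2, L3), initially equal to the original


def initial_client_table(region_codes: Dict[str, str]) -> List[ClientRow]:
    """Build the initial first table: actual location starts as the original."""
    rows = []
    for user, code in region_codes.items():
        levels = (code[0:2], code[2:4], code[4:6])
        rows.append(ClientRow(user, levels, levels))
    return rows


# The four clients of Table 1.
table1 = initial_client_table(
    {"001": "110105", "002": "110102", "003": "540101", "004": "511028"}
)
for row in table1:
    print(row.user, *row.original, *row.actual)
```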
S12, acquiring entity behavior information of the customer using the entity equipment to complete service requirements in a preset time period, and generating a second geographic location information table according to the entity behavior information.
Regarding step S12, it should be noted that in this embodiment the customer uses entity devices (such as POS machines, ATMs, and terminal devices) to transact business both at the home location and at the place of residence, and each such transaction is recorded as entity behavior information. The entity behavior information represents the history of the customer's interactions with entities. The contents obtainable from it include the client ID (User), the transaction date (Date), and the entity device (Entity). Table 2 below is an entity behavior information table.
Table 2: Entity behavior information for the entity devices used by clients to complete service requirements
User Date Entity
001 2018-07-06 (176.61.X.X,ip)
002 2018-07-08 (1234567890,pos)
002 2019-03-02 (135.121.X.X,ip)
001 2019-04-01 (ATM00002345,atm)
003 2019-04-01 (456456456,bank_id)
Since a client may transact business multiple times, using different entity devices, within a preset time period (e.g., one month, three months, or half a year), the entity behavior information table needs to be deduplicated and a new entity behavior information table established.
Table 3: Client-entity entity behavior information within the preset time period
[Table 3 appears as an image in the original publication and is not reproduced here.]
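One plausible way to derive such a table from the Table 2 records is to count, for each client-entity pair, the number of distinct days with an interaction inside the time window. This is only a sketch: the description of the weight formula mentions a Count column giving the number of interaction days, but since Table 3 itself is an image, its exact layout and the helper name `count_interaction_days` are assumptions.

```python
from collections import defaultdict

# Records from Table 2: (user, date, (entity identifier, entity type)).
records = [
    ("001", "2018-07-06", ("176.61.X.X", "ip")),
    ("002", "2018-07-08", ("1234567890", "pos")),
    ("002", "2019-03-02", ("135.121.X.X", "ip")),
    ("001", "2019-04-01", ("ATM00002345", "atm")),
    ("003", "2019-04-01", ("456456456", "bank_id")),
]


def count_interaction_days(records, start, end):
    """Count distinct interaction days per (user, entity) within [start, end]."""
    days = defaultdict(set)
    for user, date, entity in records:
        # ISO-formatted dates compare correctly as plain strings.
        if start <= date <= end:
            days[(user, entity)].add(date)
    return {pair: len(dates) for pair, dates in days.items()}


counts = count_interaction_days(records, "2018-01-01", "2019-12-31")
for (user, entity), n in counts.items():
    print(user, entity, n)
```

Using a set of dates per pair means that several transactions on the same day count once, which matches the "number of days" wording.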
S13, establishing a corresponding number of client nodes according to a preset number, establishing entity nodes according to entity equipment identifiers, establishing an interactive relationship graph of the client nodes and the entity nodes, and determining the connection weight of the client nodes and the entity nodes in the interactive relationship graph according to the entity behavior information.
Regarding step S13, it should be noted that in this embodiment an interaction relationship graph between clients and entities needs to be established. Each client ID represents a client node, and each entity device identifier (ID) represents an entity node. The interaction relationship graph of client nodes and entity nodes is established according to the entity behavior information. Fig. 2 shows the interaction relationship graph between client nodes and entity nodes; in the figure, u1, u2, u3, u4, ... denote client nodes. The connection weight between a client node and an entity node in the graph is determined according to the entity behavior information.
In this embodiment, the connection weight between a client node and an entity node is determined by a weight calculation formula, as follows:
[The weight calculation formula appears as an image in the original publication and is not reproduced here.]
where w_ij represents the connection weight between client u_i and entity e_j; A(G, u_i) represents the neighbor nodes of client node u_i in the interaction relationship graph of client nodes and entity nodes; and n_ij represents the number of days on which client u_i interacted with entity e_j within the preset time period, i.e., Count in Table 3. β is a hyperparameter that needs to be tuned according to the data size; empirically it is generally set to about 5. The main role of β is to reduce the influence of low-frequency users on subsequent calculations. For example, suppose a user u_i has only one interaction record in the historical data, with entity e_j, and no interaction with any other entity. Without β, the resulting w_ij clearly exaggerates this user's effect on subsequent calculations; with β = 5, w_ij is 0.2 and the user's influence is greatly reduced. U is the set of client nodes.
Note that the neighbor nodes of client node u_i are all the entity nodes that have an interaction relationship with u_i, and n_ik is the number of days within the preset time period on which client node u_i interacted with neighbor node k.
Therefore, from the perspective of the interaction relationship diagram, the neighbor nodes of all the entity nodes are the client nodes, and the neighbor nodes of all the client nodes are the entity nodes.
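Because the formula itself is not reproduced in this text, the sketch below uses an assumed additive-smoothing form, w_ij = n_ij / (β + Σ_k n_ik), with the sum taken over the neighbors of u_i. This form is a guess chosen to be consistent with the symbol definitions above; it is not the patent's confirmed formula. Under it, a user with a single interaction day gets weight 1 without β and 1/6 (about 0.17) with β = 5, which only roughly matches the 0.2 quoted in the description, so the actual formula may differ. The function name `connection_weights` is hypothetical.

```python
def connection_weights(day_counts, beta=5.0):
    """Assumed form: w_ij = n_ij / (beta + sum of n_ik over neighbors k of u_i).

    day_counts maps (user, entity) pairs to the number of interaction days.
    """
    totals = {}
    for (user, _entity), n in day_counts.items():
        totals[user] = totals.get(user, 0) + n
    return {
        (user, entity): n / (beta + totals[user])
        for (user, entity), n in day_counts.items()
    }


# A low-frequency user with a single interaction day on a single entity.
single = {("u1", "e1"): 1}
print(connection_weights(single, beta=0.0))  # no smoothing
print(connection_weights(single, beta=5.0))  # smoothed with beta = 5
```

Whatever the exact form, the intent stated in the description is that β dampens the weights of users with very few interaction days.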
S14, continuously and alternately updating according to the first geographical position information table, the second geographical position information table, the connection weight of the customer node and the entity node in the interactive relation graph and the updating rule until the updating stop condition is met, and outputting the updated first geographical position information table and the updated second geographical position information table as standardized geographical position information.
Regarding step S14, it should be noted that in this embodiment the first and second geographic location information tables contain geographic location information, and the interaction relationship graph contains the connection weights between client nodes and entity nodes; with a preset update rule, the first and second geographic location information tables can therefore be updated alternately. Alternate updating is a repeated update mode, and one round of the update process is as follows:
updating the second geographical position information table according to the first geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule;
and updating the first geographical position information table according to the updated second geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule.
The update process above is repeated alternately until the update stop condition is met, and the updated first geographical position information table and the updated second geographical position information table are output as the geographical position information of the client. The update stop condition may be that a preset number of repetitions has been reached, or that the first geographical location information table and the second geographical location information table no longer change after an update.
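The alternation and the two stop conditions can be sketched as a simple loop. This is an illustrative skeleton only: the tables are assumed to be plain dictionaries, and `update_entities` and `update_clients` are hypothetical callables standing in for the update rule (with the connection weights captured inside them).

```python
def alternate_update(client_table, entity_table,
                     update_entities, update_clients, max_rounds=10):
    """Alternately refresh the entity table and the client table.

    Stops when max_rounds is reached or when neither table changes in a round.
    """
    for _ in range(max_rounds):
        new_entities = update_entities(client_table)
        new_clients = update_clients(new_entities)
        if new_entities == entity_table and new_clients == client_table:
            break  # converged: both tables are unchanged by this round
        client_table, entity_table = new_clients, new_entities
    return client_table, entity_table


# Toy rule for demonstration only: the single entity copies the single
# client's location, and the client then copies it back.
clients, entities = alternate_update(
    {"u1": ("11", "01", "05")},
    {"e1": ("", "", "")},
    update_entities=lambda c: {"e1": c["u1"]},
    update_clients=lambda e: {"u1": e["e1"]},
)
print(clients, entities)
```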
In an embodiment of the present invention, the update rule includes:
the first geographical position information table comprises a customer ID, a customer K-level original geographical position and a customer K-level actual geographical position;
the second geographical position information table comprises an entity ID and an entity K-level geographical position;
determining a target node according to the first geographical position information table and the second geographical position information table, wherein the target node is a client node or an entity node;
when the K-th level geographic position of the target node is to be updated, obtaining a node candidate set of the target node, wherein the node candidate set is drawn from the neighbor nodes of the target node; when the target node is a client node, the neighbor nodes are entity nodes; when the target node is an entity node, the neighbor nodes are client nodes; and the node candidate set consists of those neighbor nodes whose (K-1)-th level geographic position is the same as the updated (K-1)-th level geographic position of the target node;
determining a geographical position candidate set according to the K-th level geographical position of each node in the node candidate set;
calculating, for each geographical position in the geographical position candidate set, the sum of the weights of the nodes in the node candidate set located at that position;
and selecting the geographical position with the largest weight sum as the K-th level geographic position of the target node.
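The rule above amounts to a level-by-level weighted vote among the target's neighbors. The sketch below is one way to express it, under stated assumptions: neighbors are passed as (location tuple, weight) pairs, the candidate set is narrowed cumulatively so that all previously chosen levels match (a slightly stricter reading than matching only the previous level), and ties go to the location seen first. The name `update_target` is hypothetical.

```python
from collections import defaultdict


def update_target(neighbors, levels=3):
    """Choose each level of the target's location by weighted vote.

    neighbors: list of (location, weight), where location is a tuple of
    level codes such as ("11", "01", "05").
    """
    chosen = []
    candidates = list(neighbors)  # level 1 uses all neighbors
    for k in range(levels):
        sums = defaultdict(float)
        for location, weight in candidates:
            sums[location[k]] += weight
        if not sums:
            break  # no neighbor left to vote on this level
        best = max(sums, key=sums.get)
        chosen.append(best)
        # Keep only neighbors that agree with the level just chosen.
        candidates = [(loc, w) for loc, w in candidates if loc[k] == best]
    return tuple(chosen)


print(update_target([
    (("11", "01", "05"), 0.8),
    (("11", "02", "03"), 0.4),
    (("51", "10", "28"), 0.01),
]))
```

The same function can serve both directions of the alternation: for an entity target the neighbors are clients, and for a client target the neighbors are entities.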
The following explains step S14 and the update rule by specific examples:
For clarity, this example considers a single entity node, e1, and the connection weight between each client node and entity node e1 is shown in the first geographic location information table.
First, an initial first geographical location information table is shown in table 4 below.
Table 4 shows the initial first geographical location information table in this example
User S1 S2 S3 L1 L2 L3 wij
u1 11 01 05 11 01 05 0.8
u2 11 01 05 11 01 05 0.8
u3 11 02 03 11 02 03 0.4
u4 51 10 28 51 10 28 0.01
u5 11 01 03 11 01 03 0.2
u6 11 01 05 11 01 05 0.7
All client nodes in Table 4 have an interaction relationship with entity node e1.
Since the geographic location of an entity cannot be determined initially, the initial second geographic location information table is as shown in Table 5 below, where entity node e1 is (176.61.X.X, ip).
Table 5 shows the initial second geographical location information table in this example
Entity L1 L2 L3
(176.61.X.X,ip)
(ATM00002345,atm)
(1234567890,pos)
(456456456,bank_id)
The initial second geographic location information table is updated using the initial first geographic location information table. In terms of the update rule above, entity node e1 is the target node. The update procedure is as follows:
Step 1: update the first-level geographic location in the second geographic location information table. First obtain the node candidate set of entity node e1 for the first-level update. The node candidate set at this time is CV = {u1, u2, u3, u4, u5, u6}.
Determine the geographic location candidate set according to the first-level geographic locations of the nodes in the node candidate set. The geographic location candidate set at this time is CL = {11, 51}.
Calculate, for each geographic location in the candidate set, the sum of the weights of the nodes in the node candidate set at that location. That is, summing the weights of the client nodes in CV over each option in CL gives w(11) = 2.9 and w(51) = 0.01.
Select the geographic location with the largest weight sum as the K-th level geographic location of the target node. That is, L1 for (176.61.X.X, ip) in Table 5 is 11.
Step 2: updating the second-level geographic position in the second geographic position information table to obtain the entity node e1The candidate set of nodes at the time of the second level of geographic location update. The node candidate set CV at this time is { u1, u2, u3, u5, u6 }. As can be seen from the node candidate set in step 1 and the node candidate set in step 2, the node candidate set in step 2 is a node whose first-level geographic location is the same as the updated second-level geographic location of the target node in the neighboring nodes.
And determining a geographical position candidate set according to the second-level geographical positions of the nodes in the node candidate set. The geo-location candidate set CL at this time is {01,02 }.
And calculating the weight sum of each node in the node candidate set at each geographical position in the geographical position candidate set. That is, the sum of the weights of each customer node in the CV on each option in the CL is calculated, where w (01) is 2.5 and w (02) is 0.4.
And selecting the geographic position with the maximum weight and the corresponding geographic position as the second-level geographic position of the target node. That is, L2 corresponding to (176.61.x.x, ip) in table 5 is 01.
And step 3: to pairUpdating the third-level geographic position in the second geographic position information table to obtain the entity node e1At a third level of geographic location update. The node candidate set CV at this time is { u1, u2, u5, u6 }.
And determining a geographical position candidate set according to the third-level geographical positions of the nodes in the node candidate set. The geo-location candidate set CL at this time is {05,03 }.
And calculating the weight sum of each node in the node candidate set at each geographical position in the geographical position candidate set. That is, the sum of the weights of each customer node in the CV on each option in the CL is calculated, w (05) is 2.3, and w (03) is 0.2.
And selecting the geographic position with the maximum weight and the corresponding geographic position as the second-level geographic position of the target node. That is, L3 corresponding to (176.61.x.x, ip) in table 5 is 05.
As can be seen, table 5 is updated to provide table 6 below. Table 6 is a second geographical location information table in which the geographical location of the entity node (176.61.x.x, ip) is updated.
Entity L1 L2 L3
(176.61.X.X,ip) 11 01 05
(ATM00002345,atm)
(1234567890,pos)
(456456456,bank_id)
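The three steps of the worked example can be replayed with a short Python script. This is only a sketch: the dictionary `TABLE_4` copies the rows of Table 4, while the function name `trace` and the tuple layout of its result are assumptions made for illustration.

```python
from collections import defaultdict

# Rows of Table 4: client -> ((L1, L2, L3), connection weight to entity e1)
TABLE_4 = {
    "u1": (("11", "01", "05"), 0.8),
    "u2": (("11", "01", "05"), 0.8),
    "u3": (("11", "02", "03"), 0.4),
    "u4": (("51", "10", "28"), 0.01),
    "u5": (("11", "01", "03"), 0.2),
    "u6": (("11", "01", "05"), 0.7),
}

def trace(table, num_levels=3):
    """Return one (CV, weight sums per position, chosen position) per level."""
    steps, chosen = [], []
    for k in range(num_levels):
        # Node candidate set CV for this level.
        cv = [u for u, (lv, _) in table.items()
              if k == 0 or lv[k - 1] == chosen[k - 1]]
        # Weight sum for each position in the position candidate set CL.
        sums = defaultdict(float)
        for u in cv:
            lv, w = table[u]
            sums[lv[k]] += w
        best = max(sums, key=sums.get)
        chosen.append(best)
        steps.append((cv, {loc: round(s, 2) for loc, s in sums.items()}, best))
    return steps

for cv, sums, best in trace(TABLE_4):
    print(cv, sums, best)
```

The chosen positions per level are 11, 01 and 05, matching the row for (176.61.X.X, ip) in Table 6.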
In the same way, the geographic positions corresponding to the other entities in Table 6, for example (ATM00002345, atm), can be updated; this is not illustrated here.
The above describes updating the second geographical location information table according to the first geographical location information table. The same updating process is used when the first geographical location information table is updated according to the second geographical location information table, and it is not illustrated again here.
And when the updating stop condition is met, outputting the updated first geographical position information table and the updated second geographical position information table as the standardized customer geographical position information.
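The alternating update of the two tables can be sketched as a loop. The sketch below rests on several assumptions that are not taken from the text: the stop condition is "a full round changes nothing, or `max_rounds` is reached", the tables are plain dictionaries, `client_loc` stands for the actual-position columns L1 to L3 of the first table, and the names `vote` and `alternate_update` are hypothetical.

```python
from collections import defaultdict

def vote(neighbors, levels=3):
    """Level-by-level weighted vote over (location, weight) pairs; location may be None."""
    chosen = []
    for k in range(levels):
        sums = defaultdict(float)
        for loc, w in neighbors:
            if loc is None or (k > 0 and loc[k - 1] != chosen[k - 1]):
                continue  # unknown location, or outside the node candidate set
            sums[loc[k]] += w
        if not sums:
            return None  # no neighbor has a usable location
        chosen.append(max(sums, key=sums.get))
    return tuple(chosen)

def alternate_update(client_loc, entity_loc, edges, max_rounds=10):
    """edges maps (client, entity) -> connection weight. Returns updated copies."""
    client_loc, entity_loc = dict(client_loc), dict(entity_loc)
    for _ in range(max_rounds):
        changed = False
        # Update the second (entity) table from the first (client) table.
        for e in entity_loc:
            new = vote([(client_loc[c], w) for (c, e2), w in edges.items() if e2 == e])
            if new is not None and new != entity_loc[e]:
                entity_loc[e] = new
                changed = True
        # Update the first (client) table from the updated second table.
        for c in client_loc:
            new = vote([(entity_loc[e], w) for (c2, e), w in edges.items() if c2 == c])
            if new is not None and new != client_loc[c]:
                client_loc[c] = new
                changed = True
        if not changed:  # assumed stop condition: a full round changed nothing
            break
    return client_loc, entity_loc
```

A bound on the number of rounds is included so that the loop terminates even if the two tables keep influencing each other.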
The client- and entity-based geographic position information standardization method provided by this embodiment of the invention obtains a client geographic position information table and an entity geographic position information table from the industry's own business log data. Using the interaction relationship graph of client nodes and entity nodes together with the update rule, it updates the two tables alternately until the update stop condition is met, and then outputs the updated first and second geographical position information tables as standardized geographic position information. The method therefore relies entirely on the industry's business log data: it constructs an entity relationship graph and uses position updating to mine the position information of the various entities in the business and represent it in a unified, standardized form. It does not depend on the network environment or on a third-party offline IP library, it addresses the difficulty of obtaining position information for some entities and the low precision of that information, and it improves both the availability of position information in a risk control system and the performance of that system.
Fig. 3 illustrates a method for standardizing geographic location information based on clients and entities according to an embodiment of the present invention, which includes:
S21, acquiring the identity card information of a preset number of clients, and generating a first geographical position information table according to the identity card information;
S22, acquiring entity behavior information of the clients using entity equipment to complete service requirements within a preset time period, and generating a second geographical position information table according to the entity behavior information;
S23, establishing a corresponding number of client nodes according to the preset number, establishing entity nodes according to entity equipment identifiers, establishing an interaction relationship graph of the client nodes and the entity nodes, and determining the connection weights of the client nodes and the entity nodes in the interaction relationship graph according to the entity behavior information;
S24, continuously and alternately updating according to the first geographical position information table, the second geographical position information table, the connection weights of the client nodes and the entity nodes in the interaction relationship graph, and the update rule, until the update stop condition is met;
S25, performing availability quantitative analysis on the updated first geographical position information table and the updated second geographical position information table to obtain a first geographic address position quantization table and a second geographic position information quantization table, and outputting the first geographic address position quantization table and the second geographic position information quantization table as standardized geographic position information.
Steps S21 to S24 are the same in principle as steps S11 to S14 of the above embodiment and are not described again here.
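As an illustration of step S21, the following Python sketch derives a three-level position from an identity card number. It assumes that the three levels correspond to the province, prefecture and county codes carried by the first six digits of an 18-character resident identity number, which is consistent with the two-digit codes in Table 4 but is not stated explicitly in this section. The function names, the client IDs `u1`, `u2`, ... and the sample numbers in the usage note are made up for the example.

```python
def id_to_levels(id_number):
    """Split the six-digit address prefix of an 18-character ID number into three codes."""
    if len(id_number) != 18 or not id_number[:6].isdigit():
        raise ValueError("expected an 18-character ID number with a numeric prefix")
    return id_number[0:2], id_number[2:4], id_number[4:6]

def build_first_table(id_numbers):
    """Map hypothetical client IDs u1, u2, ... to original (S) and actual (L) positions."""
    table = {}
    for n, id_number in enumerate(id_numbers, start=1):
        levels = id_to_levels(id_number)
        # As in Table 4, the actual position L initially equals the original position S.
        table[f"u{n}"] = {"S": levels, "L": levels}
    return table
```

For instance, `build_first_table(["110105199001010000"])` yields a single row whose S and L columns are both `("11", "01", "05")`.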
Regarding step S25, it should be noted that, in this embodiment of the invention, the availability quantitative analysis of the updated first and second geographical position information tables consists of calculating an information entropy for each actual geographic position in the tables, using the above-described selection of the node candidate set and the geographical position candidate set, so as to obtain the first geographic address position quantization table and the second geographic position information quantization table. Compared with the geographical position information tables, the quantization tables additionally contain the information entropy corresponding to each geographic position.
In embodiments of the present invention, the availability of an entity's position is described quantitatively by the distribution of the actual geographic positions of the clients associated with it, and the availability of a client's position is described quantitatively by the distribution of the geographic positions of the entities associated with it. When the geographic position of an entity does not reflect the actual geographic position of its clients, this shows up mainly as a relatively disordered distribution of the actual geographic positions of the clients that have historically interacted with it. Information entropy is an effective index for measuring the degree of disorder of data and is calculated as follows:
H(X)=-∑p(x)log(p(x))
where X is the set of events and p(x) is the probability that event x occurs.
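The entropy formula above can be checked numerically with a few lines of Python. The function name `entropy` and the use of the natural logarithm are assumptions; the text does not fix the base of the logarithm.

```python
import math

def entropy(probabilities):
    """Shannon entropy H(X) = -sum p(x) log p(x), skipping zero-probability events."""
    return -sum(p * math.log(p) for p in probabilities if p > 0)

# A concentrated distribution has lower entropy than a spread-out one.
concentrated = entropy([0.97, 0.01, 0.01, 0.01])
spread = entropy([0.25, 0.25, 0.25, 0.25])
```

A low value indicates that the associated positions agree with one another, while a high value indicates the disordered distribution described above.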
In the invention, in order to reduce the influence of low-frequency users on the geographic position information entropy, the connection weight is taken into account when calculating the geographic position information entropy at each level. The information entropy of an entity ei at the k-th level address is expressed as:
(formula image BDA0002159175310000141)
where SUBCL(ei, k) is the k-th level geographical position candidate set of node ei, expressed as follows:
SUBCL(ei, k) = Set{ujk, uj ∈ A(G, ei)}
CL(ei, k, lk) represents the (k+1)-th level geographical position candidate set when the k-th level address is lk, expressed as follows:
CL(ei, k, lk) = Set{uj(k+1), uj ∈ A(G, ei) and ujk == lk}
p(lik) represents the probability that the k-th level geographic position among the neighbor client nodes of entity ei is lik, and p(lik | li(k-1)) represents the conditional probability that the k-th level geographic position is lik among the neighbor client nodes of entity ei whose (k-1)-th level geographic position is li(k-1). They are expressed as follows:
(formula image BDA0002159175310000142)
(formula image BDA0002159175310000143)
Accordingly, the information entropy corresponding to the geographic position at each level can be obtained by the above calculation, and from it the first geographic address position quantization table and the second geographic position information quantization table. No worked example is given here.
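Because the entropy and probability formulas for this step appear only as images, the following Python sketch shows just one plausible reading of the surrounding definitions. It assumes that each probability is the share of connection weight held by a position among the relevant neighbor nodes, and that the conditional form restricts the neighbors to those whose previous-level position equals a given value. The function name and argument layout are hypothetical.

```python
import math
from collections import defaultdict

def weighted_level_entropy(neighbors, k, parent=None):
    """Weight-aware entropy of the level-k position among a node's neighbors.

    neighbors: list of (levels, weight) pairs; k is a 0-based level index.
    parent: if given and k > 0, only neighbors whose level k-1 position
    equals parent are counted (the conditional case).
    """
    sums = defaultdict(float)
    for levels, weight in neighbors:
        if k > 0 and parent is not None and levels[k - 1] != parent:
            continue
        sums[levels[k]] += weight
    total = sum(sums.values())
    if total == 0:
        return 0.0
    # Assumed probability: weight share of each position.
    return -sum((s / total) * math.log(s / total) for s in sums.values() if s > 0)
```

Under this reading, a low-weight neighbor such as u4 in Table 4 contributes very little to the first-level entropy of entity e1, which is the effect the text attributes to including the connection weight.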
The client- and entity-based geographic position information standardization method provided by this embodiment of the invention obtains a client geographic position information table and an entity geographic position information table from the industry's own business log data. Using the interaction relationship graph of client nodes and entity nodes together with the update rule, it updates the two tables alternately until the update stop condition is met. It then performs availability quantitative analysis on the updated first and second geographical position information tables to obtain a first geographic address position quantization table and a second geographic position information quantization table, which are output as standardized geographic position information. The method therefore relies entirely on the industry's business log data: it constructs an entity relationship graph and uses position updating to mine the position information of the various entities in the business and represent it in a unified, standardized form. It does not depend on the network environment or on a third-party offline IP library, it addresses the difficulty of obtaining position information for some entities and the low precision of that information, and it improves both the availability of position information in a risk control system and the performance of that system.
Fig. 4 shows a geographic location information standardization apparatus based on clients and entities, which includes a first obtaining module 41, a second obtaining module 42, an establishing module 43, and a first processing module 44, where:
the first obtaining module 41 is configured to obtain identification card information of a preset number of clients, and generate a first geographical location information table according to the identification card information;
a second obtaining module 42, configured to obtain entity behavior information that the customer uses entity equipment to complete a service requirement within a preset time period, and generate a second geographic location information table according to the entity behavior information;
the establishing module 43 is configured to establish a corresponding number of client nodes according to a preset number, establish entity nodes according to an entity device identifier, establish an interaction relationship graph between the client nodes and the entity nodes, and determine connection weights between the client nodes and the entity nodes in the interaction relationship graph according to the entity behavior information;
and the first processing module 44 is configured to perform continuous alternate updating according to the first geographical location information table, the second geographical location information table, the connection weight between the client node and the entity node in the interaction relationship diagram, and the update rule, until an update stop condition is met, and output the updated first geographical location information table and the updated second geographical location information table as standardized geographical location information.
Since the principle of the apparatus according to this embodiment of the invention is the same as that of the method according to the above embodiment, it is not described in further detail here.
It should be noted that, in this embodiment of the invention, the relevant functional units may be implemented by a hardware processor.
The client- and entity-based geographic position information standardization device obtains a client geographic position information table and an entity geographic position information table from the industry's own business log data. Using the interaction relationship graph of client nodes and entity nodes together with the update rule, it updates the two tables alternately until the update stop condition is met, and then outputs the updated first and second geographical position information tables as standardized geographic position information. The device therefore relies entirely on the industry's business log data: it constructs an entity relationship graph and uses position updating to mine the position information of the various entities in the business and represent it in a unified, standardized form. It does not depend on the network environment or on a third-party offline IP library, it addresses the difficulty of obtaining position information for some entities and the low precision of that information, and it improves both the availability of position information in a risk control system and the performance of that system.
Fig. 5 shows a geographic location information standardization apparatus based on clients and entities, which includes a first obtaining module 41, a second obtaining module 42, an establishing module 43, a first processing module 44, and a second processing module 51, where:
the first obtaining module 41 is configured to obtain identification card information of a preset number of clients, and generate a first geographical location information table according to the identification card information;
a second obtaining module 42, configured to obtain entity behavior information that the customer uses entity equipment to complete a service requirement within a preset time period, and generate a second geographic location information table according to the entity behavior information;
the establishing module 43 is configured to establish a corresponding number of client nodes according to a preset number, establish entity nodes according to an entity device identifier, establish an interaction relationship graph between the client nodes and the entity nodes, and determine connection weights between the client nodes and the entity nodes in the interaction relationship graph according to the entity behavior information;
the first processing module 44 is configured to perform continuous alternate updating according to the first geographical location information table, the second geographical location information table, the connection weight between the client node and the entity node in the interaction relationship diagram, and the updating rule, until the updating stop condition is met, and then stop updating;
the second processing module 51 is configured to perform availability quantization analysis processing on the updated first geographic position information table and the updated second geographic position information table to obtain a first geographic address quantization table and a second geographic position information quantization table, and output the first geographic address quantization table and the second geographic position information quantization table as standardized geographic position information.
Since the principle of the apparatus according to this embodiment of the invention is the same as that of the method according to the above embodiment, it is not described in further detail here.
It should be noted that, in this embodiment of the invention, the relevant functional units may be implemented by a hardware processor.
The client- and entity-based geographic position information standardization device obtains a client geographic position information table and an entity geographic position information table from the industry's own business log data. Using the interaction relationship graph of client nodes and entity nodes together with the update rule, it updates the two tables alternately until the update stop condition is met. It then performs availability quantitative analysis on the updated first and second geographical position information tables to obtain a first geographic address position quantization table and a second geographic position information quantization table, which are output as standardized geographic position information. The device therefore relies entirely on the industry's business log data: it constructs an entity relationship graph and uses position updating to mine the position information of the various entities in the business and represent it in a unified, standardized form. It does not depend on the network environment or on a third-party offline IP library, it addresses the difficulty of obtaining position information for some entities and the low precision of that information, and it improves both the availability of position information in a risk control system and the performance of that system.
Fig. 6 illustrates the physical structure of an electronic device. As shown in Fig. 6, the electronic device may include a processor 61, a communication interface 62, a memory 63 and a communication bus 64, wherein the processor 61, the communication interface 62 and the memory 63 communicate with one another through the communication bus 64. The processor 61 may call logic instructions in the memory 63 to perform the following method: acquiring the identity card information of a preset number of clients, and generating a first geographical position information table according to the identity card information; acquiring entity behavior information of the clients using entity equipment to complete service requirements within a preset time period, and generating a second geographical position information table according to the entity behavior information; establishing a corresponding number of client nodes according to the preset number, establishing entity nodes according to entity equipment identifiers, establishing an interaction relationship graph of the client nodes and the entity nodes, and determining the connection weights of the client nodes and the entity nodes in the interaction relationship graph according to the entity behavior information; and continuously and alternately updating according to the first geographical position information table, the second geographical position information table, the connection weights of the client nodes and the entity nodes in the interaction relationship graph, and the update rule until the update stop condition is met, and outputting the updated first geographical position information table and the updated second geographical position information table as standardized geographic position information.
Furthermore, the logic instructions in the memory 63 may be implemented in the form of software functional units and, when sold or used as a stand-alone product, stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. The aforementioned storage medium includes a USB flash drive, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, an optical disk, and various other media capable of storing program code.
An embodiment of the present invention further provides a non-transitory computer-readable storage medium on which a computer program is stored, where the computer program, when executed by a processor, performs the method provided in the foregoing embodiments, for example: acquiring the identity card information of a preset number of clients, and generating a first geographical position information table according to the identity card information; acquiring entity behavior information of the clients using entity equipment to complete service requirements within a preset time period, and generating a second geographical position information table according to the entity behavior information; establishing a corresponding number of client nodes according to the preset number, establishing entity nodes according to entity equipment identifiers, establishing an interaction relationship graph of the client nodes and the entity nodes, and determining the connection weights of the client nodes and the entity nodes in the interaction relationship graph according to the entity behavior information; and continuously and alternately updating according to the first geographical position information table, the second geographical position information table, the connection weights of the client nodes and the entity nodes in the interaction relationship graph, and the update rule until the update stop condition is met, and outputting the updated first geographical position information table and the updated second geographical position information table as standardized geographic position information.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A method for standardizing geographic location information based on customers and entities, comprising:
acquiring the identity card information of a preset number of clients, and generating a first geographical position information table according to the identity card information;
acquiring entity behavior information of the client for completing service requirements by using entity equipment within a preset time period, and generating a second geographical position information table according to the entity behavior information;
establishing a corresponding number of client nodes according to a preset number, establishing entity nodes according to entity equipment identifiers, establishing an interactive relationship graph of the client nodes and the entity nodes, and determining the connection weight of the client nodes and the entity nodes in the interactive relationship graph according to the entity behavior information;
and continuously and alternately updating according to the first geographical position information table, the second geographical position information table, the connection weight of the customer node and the entity node in the interactive relation graph and an updating rule until an updating stop condition is met, and outputting the updated first geographical position information table and the updated second geographical position information table as standardized geographical position information.
2. The method of claim 1, wherein the continuously and alternately updating according to the first geographical location information table, the second geographical location information table, the connection weights of the client nodes and the entity nodes in the interaction relationship graph and the updating rule comprises:
updating the second geographical position information table according to the first geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule;
and updating the first geographical position information table according to the updated second geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule.
3. The method of claim 2, wherein the update rule comprises:
the first geographical position information table comprises a customer ID, a customer K-level original geographical position and a customer K-level actual geographical position;
the second geographical position information table comprises an entity ID and an entity K-level geographical position;
determining a target node according to the first geographical position information table and the second geographical position information table, wherein the target node is a client node or an entity node;
when the K-th level geographic position of the target node is to be updated, a node candidate set of the target node is obtained, wherein the node candidate set consists of neighbor nodes of the target node; when the target node is a client node, the neighbor nodes are entity nodes, and when the target node is an entity node, the neighbor nodes are client nodes; the node candidate set contains the neighbor nodes whose (K-1)-th level geographic position is the same as the updated (K-1)-th level geographic position of the target node;
determining a geographical position candidate set according to the Kth-level geographical position of each node in the node candidate set;
calculating the weight sum of each node in the node candidate set at each geographical position in the geographical position candidate set;
and selecting the geographic position with the largest weight sum as the K-th level geographic position of the target node.
4. The method of claim 1, further comprising: and carrying out availability quantitative analysis processing on the updated first geographical position information table and the updated second geographical position information table to obtain a first geographical address position quantization table and a second geographical position information quantization table, and outputting the first geographical address position quantization table and the second geographical position information quantization table as standardized geographical position information.
5. An apparatus for standardizing geographic location information based on clients and entities, comprising:
the first acquisition module is used for acquiring the identity card information of a preset number of clients and generating a first geographical position information table according to the identity card information;
the second acquisition module is used for acquiring entity behavior information of the customer for completing business requirements by using entity equipment within a preset time period and generating a second geographic position information table according to the entity behavior information;
the establishing module is used for establishing client nodes with corresponding number according to the preset number, establishing entity nodes according to entity equipment identifiers, establishing an interactive relationship graph of the client nodes and the entity nodes, and determining the connection weight of the client nodes and the entity nodes in the interactive relationship graph according to the entity behavior information;
and the first processing module is used for continuously and alternately updating according to the first geographical position information table, the second geographical position information table, the connection weight between the customer node and the entity node in the interactive relation graph and the updating rule until the updating stop condition is met, and outputting the updated first geographical position information table and the updated second geographical position information table as standardized geographical position information.
6. The client and entity based geographic location information normalization apparatus of claim 5, wherein the first processing module is specifically configured to:
updating the second geographical position information table according to the first geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule;
and updating the first geographical position information table according to the updated second geographical position information table, the connection weight of the client node and the entity node in the interactive relation graph and the updating rule.
7. The client and entity based geographic position information standardization device of claim 6, wherein the update rule comprises:
the first geographic position information table comprises a client ID, a client K-level original geographic position and a client K-level actual geographic position;
the second geographic position information table comprises an entity ID and an entity K-level geographic position;
determining a target node according to the first geographic position information table and the second geographic position information table, wherein the target node is a client node or an entity node;
when the K-th level geographic position of the target node is to be updated, obtaining a node candidate set of the target node, wherein the node candidate set comprises neighbor nodes of the target node, the neighbor nodes being entity nodes when the target node is a client node and client nodes when the target node is an entity node, and the node candidate set consists of those neighbor nodes whose (K-1)-th level geographic position is the same as the updated K-level geographic position of the target node;
determining a geographic position candidate set according to the K-th level geographic position of each node in the node candidate set;
calculating, for each geographic position in the geographic position candidate set, the sum of the weights of the nodes in the node candidate set located at that geographic position;
and selecting the geographic position corresponding to the maximum weight sum as the K-th level geographic position of the target node.
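The update rule for a single target node can be sketched as a weighted vote. The function name `update_level` and the tuple representation of hierarchical positions are hypothetical, and the candidate filter is read here as "neighbors that agree with the target at level K-1", which is an assumption about the intended meaning of the claim wording.

```python
from collections import defaultdict


def update_level(target_prefix, neighbors, level):
    """Pick the level-`level` position of a target node by weighted vote.

    target_prefix: tuple of the target's already-updated positions for
        levels 1..level-1 (empty tuple when level == 1).
    neighbors: list of (position_path, weight) pairs, where position_path
        is a tuple of the neighbor's positions from level 1 downward.
    Returns the winning position, or None if no neighbor qualifies.
    """
    weight_sums = defaultdict(float)
    for path, weight in neighbors:
        if len(path) < level:
            continue  # neighbor has no position at this level
        # Node candidate set (assumed reading): keep only neighbors whose
        # level K-1 position matches the target's updated level K-1 position.
        if level > 1 and path[level - 2] != target_prefix[level - 2]:
            continue
        # Accumulate the weight sum per candidate position at level K.
        weight_sums[path[level - 1]] += weight
    if not weight_sums:
        return None
    # Select the position with the maximum weight sum.
    return max(weight_sums, key=weight_sums.get)


neighbors = [
    (("Beijing", "Haidian"), 3),
    (("Beijing", "Chaoyang"), 2),
    (("Beijing", "Chaoyang"), 2),
    (("Shanghai", "Pudong"), 10),
]
district = update_level(("Beijing",), neighbors, 2)
```

Here the heavily weighted Shanghai neighbor is excluded from the level-2 vote because it disagrees with the target at level 1, so the two Chaoyang neighbors (total weight 4) outvote Haidian (weight 3).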
8. The client and entity based geographic position information standardization device of claim 5, further comprising a second processing module configured to: perform availability quantitative analysis on the updated first geographic position information table and the updated second geographic position information table to obtain a first geographical address position quantization table and a second geographical position information quantization table, and output the first geographical address position quantization table and the second geographical position information quantization table as standardized geographic position information.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor when executing the program performs the steps of the method for normalizing geographical location information based on clients and entities according to any one of claims 1 to 4.
10. A non-transitory computer readable storage medium having stored thereon a computer program, wherein the computer program when executed by a processor implements the steps of the method for standardizing geographic location information based on clients and entities according to any one of claims 1 to 4.
CN201910726746.3A 2019-08-07 2019-08-07 Geographic position information standardization method and device based on client and entity Active CN110445783B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910726746.3A CN110445783B (en) 2019-08-07 2019-08-07 Geographic position information standardization method and device based on client and entity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910726746.3A CN110445783B (en) 2019-08-07 2019-08-07 Geographic position information standardization method and device based on client and entity

Publications (2)

Publication Number Publication Date
CN110445783A CN110445783A (en) 2019-11-12
CN110445783B CN110445783B (en) 2021-08-31

Family

ID=68433770

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910726746.3A Active CN110445783B (en) 2019-08-07 2019-08-07 Geographic position information standardization method and device based on client and entity

Country Status (1)

Country Link
CN (1) CN110445783B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101075247A (en) * 2007-06-22 2007-11-21 上海众恒信息产业有限公司 System and method for allocating service of geographical information system
CN106549902A (en) * 2015-09-16 2017-03-29 阿里巴巴集团控股有限公司 A kind of recognition methods of suspicious user and equipment
CN107683490A (en) * 2015-06-30 2018-02-09 万事达卡国际股份有限公司 fraud control method and system based on geographical position

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2943356A1 (en) * 2014-03-20 2015-09-24 Geocommerce Inc. System and method for identifying users on a network
US20170235792A1 (en) * 2016-02-17 2017-08-17 Www.Trustscience.Com Inc. Searching for entities based on trust score and geography

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
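A Study On Geo-Location Authentication Techniques; Brinda Shivhare; 2014 International Conference on Computational Intelligence and Communication Networks; 20150326; pp. 744-748 *
Research and Implementation of IP Address-Based Geolocation Technology for Network Entities; Zhu Bin; China Master's Theses Full-text Database, Information Science and Technology Series; 20150815; pp. 1-50 *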

Also Published As

Publication number Publication date
CN110445783A (en) 2019-11-12

Similar Documents

Publication Publication Date Title
CN111369342B (en) Loan approval method, device, equipment and storage medium based on machine learning
WO2021114921A1 (en) Method and apparatus for constructing relationship network based on privacy protection
CN110110093A (en) A kind of recognition methods, device, electronic equipment and the storage medium of knowledge based map
CN106034149B (en) A kind of account recognition methods and device
WO2016022720A2 (en) Method and apparatus of identifying a transaction risk
CN113240505B (en) Method, apparatus, device, storage medium and program product for processing graph data
CN110135978B (en) User financial risk assessment method and device, electronic equipment and readable medium
CN111651741B (en) User identity recognition method, device, computer equipment and storage medium
CN112491877A (en) User behavior sequence anomaly detection method, terminal and storage medium
CN110310020B (en) Project scheme management method based on data analysis, related device and storage medium
CN110689084A (en) Abnormal user identification method and device
CN114037514A (en) Method, device, equipment and storage medium for detecting fraud risk of user group
CN111951052A (en) Method and device for acquiring potential customers based on knowledge graph
CN108205575B (en) Data processing method and device
CN117035082A (en) Knowledge graph real-time operation method and system for identifying fraudulent party
CN110445783B (en) Geographic position information standardization method and device based on client and entity
CN113177840A (en) Client risk identification method and device
CN110751354A (en) Abnormal user detection method and device
CN109711984B (en) Pre-loan risk monitoring method and device based on collection urging
CN110830515A (en) Flow detection method and device and electronic equipment
CN110619564B (en) Anti-fraud feature generation method and device
CN114862108B (en) Site selection processing method and system of self-service equipment
CN110347938A (en) Geographic information processing method, apparatus, electronic equipment and medium
US20230087204A1 (en) Systems and methods to screen a predictive model for risks of the predictive model
CN110113708A (en) Localization method and device based on Wi-Fi location fingerprint

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant