CN113761390B - Method and system for analyzing attribute intimacy - Google Patents

Method and system for analyzing attribute intimacy Download PDF

Info

Publication number
CN113761390B
CN113761390B CN202110955230.3A CN202110955230A CN113761390B CN 113761390 B CN113761390 B CN 113761390B CN 202110955230 A CN202110955230 A CN 202110955230A CN 113761390 B CN113761390 B CN 113761390B
Authority
CN
China
Prior art keywords
attribute
analysis
index
rule
affinity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110955230.3A
Other languages
Chinese (zh)
Other versions
CN113761390A (en
Inventor
吴鸿伟
鄢小征
王海滨
林文楷
吴文
王兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Meiya Pico Information Co Ltd
Original Assignee
Xiamen Meiya Pico Information Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen Meiya Pico Information Co Ltd filed Critical Xiamen Meiya Pico Information Co Ltd
Priority to CN202110955230.3A priority Critical patent/CN113761390B/en
Publication of CN113761390A publication Critical patent/CN113761390A/en
Application granted granted Critical
Publication of CN113761390B publication Critical patent/CN113761390B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9536Search customisation based on social or collaborative filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides an analysis method and system for attribute intimacy, which comprises the steps of extracting all associated attribute information in an original data packet, storing the attribute information in a relational database, and storing an analysis index of an attribute association record in an index database, wherein the analysis index comprises a starting attribute, a terminating attribute and an associated edge; traversing the index database, determining whether a record exists in the index database for analyzing the index, if not, executing an unassociated attribute intimacy degree analysis engine, and importing an analysis result into an association edge; if yes, executing the associated attribute intimacy degree adjusting engine, and updating the attribute intimacy degree analysis result to the associated edge; and acquiring all correlation attribute relations of the current analysis index, and updating the attribute affinity analysis result to the correlation edge of the current analysis index if the common correlation attribute exists between the initial attribute and the termination attribute of the current analysis index according to the attribute affinity adjustment rule. The method and the system can quickly analyze the intimacy degree between different attributes and are automatic and accurate.

Description

Method and system for analyzing attribute intimacy
Technical Field
The invention relates to the technical field of data processing, in particular to an analysis method and system for attribute intimacy.
Background
The original data accessed by the big data system is generated by depending on different business requirements and tools, the industry span is large, the unified business data model standard and description specification are lacked, the data has the characteristics of isomerism, ambiguity, large noise and the like, and the data information content and the value density are diluted by mass noise data. The data is full of a large amount of noise, a large amount of isolated and useless entity and attribute relations are generated, the intimacy between different attributes is analyzed by a traditional method and is usually calculated according to a fixed rule, the analysis result of the traditional method is low in accuracy and more in redundant data, the difficulty in business study and judgment is increased, and the main factor for restricting the business study and judgment of a large data system service is formed.
Because the original data accessed by a big data system has the characteristics of isomerism, ambiguity, large noise and the like, the existing attribute affinity analysis method in the market at present carries out calculation according to a fixed rule, and the technologies have the following defects:
1. the attribute intimacy degree analysis range is small, the intimacy degree of the attribute relation can only be calculated and analyzed according to a single condition due to the fact that the conventional method only calculates according to a fixed rule, calculation cannot be automatically carried out according to the service characteristics, the optimal result of the attribute intimacy degree is further obtained, and therefore a plurality of potential valuable clues cannot be automatically mined.
2. The accuracy of attribute intimacy analysis is low, the traditional method can calculate the value of attribute intimacy only after data meeting established rules come in, but in many practical scenes, various original data in new formats need to be accessed, so that the analysis result of attribute intimacy cannot be adjusted in time, the analysis result is repeated or wrong, and the value of researching and judging big data service business is influenced.
Disclosure of Invention
In order to solve a series of technical problems of small attribute intimacy degree analysis range, low accuracy and the like in the prior art, the invention provides an attribute intimacy degree analysis method and system to solve the technical problems.
According to a first aspect of the present invention, there is provided an analysis method for attribute intimacy degree, comprising:
s1: extracting all associated attribute information in an original data packet, storing the attribute information in a relational database, and storing an analysis index of an attribute association record in an index database, wherein the analysis index comprises a starting attribute, a terminating attribute and an associated edge;
s2: traversing the index library, determining whether a record exists in the index library for analyzing the index, if not, executing an unassociated attribute intimacy degree analysis engine, and importing an analysis result into an association edge; if yes, executing the associated attribute intimacy degree adjusting engine, and updating the attribute intimacy degree analysis result to the associated edge; and
s3: traversing the index library, acquiring all the association attribute relations of the current analysis index, and updating the attribute affinity analysis result to the association edge of the current analysis index according to the attribute affinity adjustment rule if the common association attribute exists between the initial attribute and the termination attribute of the current analysis index.
In some specific embodiments, in response to that each attribute association relationship is stored in the relationship database, a new attribute affinity is added to the scheduling task table to calculate the to-be-processed task, and the to-be-processed task is allocated according to the idle condition of the calculation resource. By means of the setting, the influence on the warehousing efficiency of mass data can be avoided.
In some specific embodiments, the starting attribute and the ending attribute in the analysis index are used as arguments, and all attribute affinity analysis rules of the attribute are retrieved from the attribute affinity analysis rule library to form a rule analysis set Pn, where elements of the rule analysis set Pn include a data source, an operation rule, an operation result threshold, a weight, a competition factor, and an interference factor, and the rule analysis set Pn is arranged in a reverse order of the weight.
In some particular embodiments, the unassociated attribute affinity analysis engine includes: traversing the rule analysis set Pn, if the data source of the rule analysis set Pn is the same as the data source in the analysis index, obtaining a result set Mn according to the operation rule, and if the operation result of the result set Mn is greater than the operation result threshold of the rule analysis set Pn, the attribute affinity of the analysis index is the product of the operation result of the result set Mn and the weight. The affinity relationship between the attributes can be quickly analyzed and obtained by utilizing an unassociated attribute affinity analysis engine.
In some specific embodiments, the associated attribute affinity adjustment engine adjusts the associated attribute affinity according to the competition factor of the rule analysis set Pn by performing the associated attribute affinity competition adjustment. Adjusting the associated attribute affinities by the competition factor update may improve the accuracy of the attribute affinities.
In some specific embodiments, the competition factor includes an account opening date, and if the account opening date of the newly added association is later than the old association in the index library, the attribute affinity analysis result is updated to the association edge.
According to a second aspect of the present invention, there is provided a computer readable storage medium having stored thereon one or more computer programs which, when executed by a computer processor, implement the method of any of the above.
According to a third aspect of the present invention, there is provided an analysis system for attribute intimacy, the system comprising:
a database and index database construction unit: the method comprises the steps that configuration is used for extracting all associated attribute information in an original data packet and storing the attribute information in a relational database, and storing an analysis index of an attribute association record in an index database, wherein the analysis index comprises a starting attribute, a terminating attribute and an associated edge;
an intimacy degree analysis adjusting unit: the method comprises the steps that the index library is configured to be traversed, whether records exist in the index library in the analysis indexes or not is determined, if the records do not exist in the index library, an unassociated attribute intimacy degree analysis engine is executed, and an analysis result is led into an association edge; if yes, executing the associated attribute intimacy degree adjusting engine, and updating the attribute intimacy degree analysis result to the associated edge;
an association attribute affinity promotion adjustment unit: and updating the attribute affinity analysis result to the association edge of the current analysis index if the common association attribute exists between the initial attribute and the termination attribute of the current analysis index according to the attribute affinity adjustment rule.
In some specific embodiments, the method further includes: and the configuration is used for responding to the fact that each attribute incidence relation is stored in the relation database, newly adding an attribute intimacy degree in the scheduling task table to calculate the task to be processed, and distributing the task to be processed according to the idle condition of the calculation resources. The arrangement of the scheduling unit can avoid influencing the warehousing efficiency of mass data.
In some specific embodiments, the starting attribute and the ending attribute in the analysis index are used as arguments, and all attribute affinity analysis rules of the attribute are retrieved from the attribute affinity analysis rule library to form a rule analysis set Pn, where elements of the rule analysis set Pn include a data source, an operation rule, an operation result threshold, a weight, a competition factor, and an interference factor, and the rule analysis set Pn is arranged in a reverse order of the weight.
In some specific embodiments, the unassociated attribute affinity analysis engine comprises: traversing the rule analysis set Pn, if the data source of the rule analysis set Pn is the same as the data source in the analysis index, obtaining a result set Mn according to the operation rule, and if the operation result of the result set Mn is greater than the operation result threshold of the rule analysis set Pn, the attribute affinity of the analysis index is the product of the operation result of the result set Mn and the weight. The affinity relationship between the attributes can be quickly analyzed and obtained by utilizing an unassociated attribute affinity analysis engine.
In some specific embodiments, the associated attribute affinity adjustment engine adjusts the associated attribute affinity according to the competition factor of the rule analysis set Pn by performing the associated attribute affinity competition adjustment. Adjusting the associated attribute affinities by the competition factor update may improve the accuracy of the attribute affinities.
In some specific embodiments, the competition factor includes an account opening date, and if the account opening date of the newly added association is later than the old association in the index library, the attribute affinity analysis result is updated to the association edge.
The invention provides an analysis method and a system for attribute intimacy, which mainly utilize two algorithms of unassociated attribute intimacy analysis and associated attribute intimacy adjustment, take pairwise attribute combination as a reference, combine factors such as data elements, sources, operation types, rules, timeliness and the like, quickly analyze intimacy of each group of attribute relationship, find associated attribute nodes and automatically and accurately adjust intimacy between associated attributes. By using two algorithms of correlation attribute intimacy degree competitive adjustment and correlation attribute intimacy degree promoting adjustment, intimacy degree adjustment with the same attribute type and different values is met, and intimacy degree adjustment of correlation attribute relations is met when intimacy degree of a certain pair of attribute relations changes, analysis timeliness of the attribute intimacy degree is greatly improved, and accuracy of business application is guaranteed. The invention accurately adjusts the intimacy of the associated node, each analysis link is asynchronously executed, and the analysis result is backfilled into the relational database, thereby not only utilizing the computing resource to the maximum extent, but also automatically adjusting the attribute intimacy more quickly.
Drawings
The accompanying drawings are included to provide a further understanding of the embodiments and are incorporated in and constitute a part of this specification. The drawings illustrate embodiments and together with the description serve to explain the principles of the invention. Other embodiments and many of the intended advantages of embodiments will be readily appreciated as they become better understood by reference to the following detailed description. Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is a flow chart of an analysis method for attribute intimacy density according to one embodiment of the present application;
FIG. 2 is a schematic diagram of the main steps of an analysis method for attribute intimacy in accordance with a specific embodiment of the present application;
FIG. 3 is a diagram illustrating an attribute association relationship according to a specific embodiment of the present application;
FIG. 4 is a flow chart of an analysis method for attribute intimacy degree according to a specific embodiment of the present application;
FIG. 5 is a block diagram of an analysis system for attribute intimacy degree according to an embodiment of the present application;
FIG. 6 is a schematic block diagram of a computer system suitable for use to implement the electronic device of an embodiment of the present application.
Detailed Description
The present application will be described in further detail with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.
Fig. 1 shows a flowchart of an analysis method for attribute intimacy degree according to an embodiment of the present application. As shown in fig. 1, the method includes:
s101: extracting all associated attribute information in the original data packet, storing the attribute information in a relational database, and storing an analysis index of the attribute association record in an index database, wherein the analysis index comprises a starting attribute, a terminating attribute and an associated edge.
In a specific embodiment, in response to that each attribute incidence relation is stored in the relational database, an attribute affinity degree calculation task to be processed is newly added in the scheduling task table, and the task to be processed is distributed according to the idle condition of the calculation resources, so that the storage efficiency of mass data is prevented from being influenced.
S102: traversing the index database, determining whether a record exists in the index database for analyzing the index, if not, executing an unassociated attribute intimacy degree analysis engine, and importing an analysis result into an association edge; if yes, executing the associated attribute intimacy degree adjusting engine, and updating the attribute intimacy degree analysis result to the associated edge.
In a specific embodiment, the initial attribute and the end attribute in the analysis index are used as parameters, all attribute affinity analysis rules of the attribute are retrieved from an attribute affinity analysis rule library to form a rule analysis set Pn, elements of the rule analysis set Pn include a data source, an operation rule, an operation result threshold, a weight, a competition factor and an interference factor, and the rule analysis set Pn is arranged in a reverse order of the weight. The unassociated attribute affinity analysis engine comprises: traversing the rule analysis set Pn, if the data source of the rule analysis set Pn is the same as the data source in the analysis index, obtaining a result set Mn according to the operation rule, and if the operation result of the result set Mn is greater than the operation result threshold of the rule analysis set Pn, the attribute affinity of the analysis index is the product of the operation result of the result set Mn and the weight. The associated attribute intimacy degree adjusting engine adjusts the associated attribute intimacy degree according to the competitive factor of the rule analysis set Pn through the associated attribute intimacy degree competitive adjustment, in a specific embodiment, the competitive factor may be an account opening date, if the account opening date of the newly added associated relationship is later than the old associated relationship in the index database, the attribute intimacy degree analysis result is updated to the associated edge, and the competitive factor may also be other parameters such as a time limit and the like, and may be used as the intimacy degree basis for judging the newly added associated relationship.
S103: traversing the index library, acquiring all the association attribute relations of the current analysis index, and updating the attribute affinity analysis result to the association edge of the current analysis index according to the attribute affinity adjustment rule if the common association attribute exists between the initial attribute and the termination attribute of the current analysis index. And finding out the attribute relation associated with the attribute, calling a corresponding attribute intimacy degree adjusting engine according to the attribute intimacy degree adjusting rule, deleting the old attribute intimacy degree analysis result on the edges of the two attribute nodes, and adding a new attribute intimacy degree analysis result.
In a specific embodiment, the method aims to construct an attribute relationship and an affinity calculation set of the associated attribute relationship through an analysis program, automatically analyze the affinity of attribute association, adjust the affinity of the associated attribute in time, backfill an analysis result into a relationship database, improve the working efficiency and accuracy of attribute affinity analysis, and support a user to quickly extract a most useful clue of the highest affinity. The attribute intimacy degree analysis process is mainly based on the following two core libraries: attribute affinity analysis rule base: the rule of calculating the intimacy degree associated with different attributes is obtained and is defined as table 1:
TABLE 1 Attribute intimacy analysis rule base
Figure GDA0003695462850000051
Figure GDA0003695462850000061
An attribute intimacy adjustment rule base: the adjustment rule for obtaining the affinity of each associated attribute is defined as table 2:
TABLE 2 Attribute intimacy adjustment rule Table
Attribute name Attribute description Remarks to note
Id Record Id
RuleID Rule ID
CXLS Starting type 1-person, 2-mobile phone, 3-bank card, 4-QQ …
ZXLS Type of termination 1-person, 2-mobile phone, 3-bank card, 4-QQ …
BCL Adjusting rules
zt State of state 0-not available, 1-available
In addition, an attribute affinity level table is also defined to represent the definition of different levels, which is specifically shown in table 3:
TABLE 3 Attribute intimacy level Table
Figure GDA0003695462850000062
Fig. 2 is a schematic diagram illustrating the main steps of an analysis method for attribute intimacy degree according to a specific embodiment of the present invention, as shown in fig. 2, which mainly includes:
step 201: and analyzing and warehousing the data. The method comprises the steps of accessing an original data packet of a big data system, extracting the data packet through an analysis and storage engine, storing all related attribute information contained in the data packet into a relational database, wherein each attribute related record comprises two attribute nodes and a relational edge, the attribute nodes are stored in a Key-Value mode, all keys are coded by adopting Base64, in order to improve retrieval efficiency, the relational edges contain two special attributes of _ from and _ to, and then the two nodes and the edges are used as unique analysis indexes T (a starting attribute type, a terminating attribute type and a related edge) to be stored into an index Base. Example (c): one interface vehicle passing information is to extract attribute information of association relations such as person-vehicle, person-mobile phone, person-ETC, mobile phone-QQ card, etc., and after data analysis and storage, the attribute information is stored in a relational database, as shown in the attribute association relation in FIG. 3.
Step 202: and (4) scheduling tasks. After each attribute incidence relation is stored in the relational database, a task to be processed with attribute affinity calculation is newly added in the scheduling task table, and the operation of the scheduling task is distributed by the platform according to the idle condition of the calculation resources, so that the storage efficiency of mass data is prevented from being influenced.
Step 203: and (6) judging rules. Traversing the index library, and judging whether the T (initial attribute, termination attribute and associated edge) has repeated records in the index library, if the two attributes of the current processing task have no association relationship, the intimacy is low, and if the two attributes have no association relationship, the intimacy indicates that the intimacy of the associated attribute has been stored before.
Step 204: and performing affinity analysis on the unassociated attributes. If the attribute is not associated, executing an unassociated attribute intimacy degree analysis engine, and marking the analysis result on the edges of the two nodes to obtain an intimacy degree analysis result of the two attributes.
Step 205: the associated attribute affinity is adjusted. If the record is old record, analyzing the change factors of the association relation of the current attribute, executing the associated attribute intimacy degree adjusting engine, deleting the old attribute intimacy degree analysis result on the edges of the two attribute nodes, and adding a new attribute intimacy degree analysis result.
Step 206: and adjusting the chain attribute intimacy. Traversing the index library, finding out the attribute relation associated with the attribute, calling a corresponding attribute affinity adjustment engine according to the attribute affinity adjustment rule, deleting the old attribute affinity analysis result on the edges of the two attribute nodes, and adding a new attribute affinity analysis result.
With continued reference to fig. 4, fig. 4 is a flow chart illustrating an analysis method for attribute intimacy degree according to a specific embodiment of the present invention, as shown in fig. 4, the method including:
s401: analyzing and warehousing original data, and storing the original data into a relational database in a Key-Value mode;
s402: task scheduling, wherein after an attribute association relation is stored in a relational database, a task to be processed with attribute affinity calculation is newly added in a scheduling task table, and the operation of the scheduling task is distributed by a platform according to the idle condition of computing resources, so that the storage efficiency of mass data is prevented from being influenced;
s403: rule judgment, namely traversing the index database, and judging whether the T (initial attribute, termination attribute and associated edge) has repeated records in the index database, namely whether the T is a new relation;
s404: judging whether the relationship is a new relationship, if so, performing unassociated attribute affinity analysis in the step S405, and storing the result in a relationship database in the step S401, and if not, performing associated attribute affinity analysis in the step S406, and storing the result in the relationship database in the step S401;
s407: and adjusting the chain attribute intimacy degree, and saving the result to a relational database in S401.
Through 2 algorithms of unassociated attribute intimacy degree analysis and associated attribute intimacy degree adjustment, pairwise attribute combination is taken as a reference, factors such as data elements, sources, operation types, rules and timeliness are combined, intimacy degree of each group of attribute relations is rapidly analyzed, associated attribute nodes are found out, intimacy degree between associated attributes is automatically and accurately adjusted, each analysis link is asynchronously executed, an analysis result is backfilled into a relation database, a user is supported, clues with low values are rapidly filtered, a troubleshooting range is reduced, and troubleshooting work efficiency is improved.
In a specific embodiment, taking attribute affinity analysis of an identity card-mobile phone number as an example, core content of the attribute affinity analysis is described as follows:
the first step is as follows: taking T (initial attribute type and termination attribute type) as a parameter, retrieving all attribute affinity analysis rules of the attribute from an attribute affinity analysis rule library to form a rule analysis set Pn, wherein the elements are as follows: SJLR data source, WSGZ operation rule, WSZGHZ operation result threshold, QZ weight, ZCRZ competition factor and GLRZ interference factor; the analysis sets Pn are arranged in reverse QZ order.
The second step: the unassociated attribute affinity analysis, traverse Pn,
if ([ Pn ]. SJLR ═ t. data source) { obtain operation rule WSGZ, obtain result Mn from WSGZ operation, traverse Mn,
if ([ Mn ]. operation result > [ Pn ]. WSZGHZ)
{ t. attribute affinity ═ Mn. weight ═ Mn. operation result },
};
for example: the analysis rule base of the attribute affinity of the identity card and the mobile phone number has a rule of operation rule (mobile phone association QQ → QQ disassociates the identity card), the threshold value of an operation result is 5, and the weight is 1; the corresponding algorithm is: 1) acquiring last used imsi of a mobile phone, inquiring virtual identity table relationships, inquiring related QQ according to imsi, 2) sorting according to the last time, returning latest ten pieces of data, and acquiring QQ numbers with the number of login times being more than 5, 3) respectively acquiring QQ remarks from friend lists of QQ meeting conditions, and returning remark names and remark numbers of each QQ number, 4) removing father relation real names { "mother", "old woman", "son", "nephel" }; 5) and finally obtaining an attribute affinity result T of the association relationship between the mobile phone number and the name.
The third step: adjusting affinity competition of the associated attributes, namely adjusting the value of the affinity of the associated attributes according to [ Pn ] competition factors, wherein the affinity adjustment of the association relationships with the same attribute type but different values is mainly satisfied, for example: 130 x 5678 originally is the ID card A in use, the intimacy of 130 x 5678-A is high, but a new correlation relationship of 130 x 5678-B is judged according to [ Pn ] competition factors (opening date), the result of the intimacy analysis of 130 x 5678-B is found to be high, and the intimacy of 130 x 5678-A is changed from high to low.
The fourth step: the method mainly satisfies the condition that when the affinity of a certain pair of attribute relations changes, the affinity of the associated attribute relations correspondingly adjusts, obtains the associated attributes influenced by T from an attribute affinity adjusting rule base, and adjusts according to the adjusting rules, for example: 130 × 5678 cannot determine whether the id a is in use, and the attribute affinity of that 130 × 5678-a is low; when the intimacy degree of 130 x 5678-QQ (1 x 4) -remark name (A) is high, the intimacy degree analysis result of 130 x 5678-QQ (1 x 4) is set to be high, and the intimacy degree of 130 x 5678-A is changed from low to high.
By means of an analysis program, aiming at a scene of how to quickly and accurately analyze the attribute intimacy, an intimacy calculation set of the attribute relationship and the associated attribute relationship is constructed, the intimacy of the attribute association relationship is automatically analyzed, the intimacy of the associated attribute is adjusted in time, and the problem of how to quickly and accurately find the most useful clues for early warning according to the intimacy of the attribute relationship is solved. The method provides 2 algorithms of unassociated attribute intimacy degree analysis and associated attribute intimacy degree adjustment, takes pairwise attribute combination as a reference, combines factors such as data elements, sources, operation types, rules and timeliness, rapidly analyzes the intimacy degree of each group of attribute relations, automatically finds out associated attribute nodes, accurately adjusts the intimacy degree of the associated attributes, asynchronously executes each analysis link, backfills an analysis result into a relational database, supports a user to rapidly filter out clues with low values, reduces a troubleshooting range and improves troubleshooting working efficiency. The method is based on the scene of massive original data, the intimacy degree of the attribute relation generated by the massive original data is automatically analyzed, the analysis speed is high, the analysis result is accurate, the intimacy degree of the attribute is automatically analyzed and early warning is timely carried out, the work of manual analysis and judgment is greatly reduced, and the analysis efficiency is improved by more than 5 times compared with the manual analysis.
With continued reference to fig. 5, fig. 5 illustrates a block diagram of an analysis system for attribute intimacy, according to an embodiment of the present invention. The system specifically includes a database, an index base construction unit 501, an affinity analysis adjustment unit 502, and an associated attribute affinity promotion adjustment unit 503.
In a specific embodiment, the database and index base constructing unit 501 is configured to extract all associated attribute information in an original data packet and store the attribute information in a relational database, and store an analysis index of an attribute association record in an index base, where the analysis index includes a start attribute, a termination attribute, and an association edge; the affinity analysis adjusting unit 502 is configured to traverse the index library, determine whether there is a record in the index library for the analysis index, if not, execute the unassociated affinity analysis engine, and import the analysis result into the associated edge; if yes, executing the associated attribute intimacy degree adjusting engine, and updating the attribute intimacy degree analysis result to the associated edge; the correlation attribute affinity promoting and adjusting unit 503 is configured to traverse the index library, obtain all correlation attribute relationships of the current analysis index, and update the attribute affinity analysis result to the correlation edge of the current analysis index if a common correlation attribute exists between the start attribute and the end attribute of the current analysis index according to the attribute affinity adjusting rule.
Referring now to FIG. 6, shown is a block diagram of a computer system 600 suitable for use in implementing the electronic device of an embodiment of the present application. The electronic device shown in fig. 6 is only an example, and should not bring any limitation to the functions and the use range of the embodiment of the present application.
As shown in fig. 6, the computer system 600 includes a Central Processing Unit (CPU)601 that can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM)602 or a program loaded from a storage section 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data necessary for the operation of the system 600 are also stored. The CPU 601, ROM 602, and RAM 603 are connected to each other via a bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
The following components are connected to the I/O interface 605: an input portion 606 including a keyboard, a mouse, and the like; an output portion 607 including a display such as a Liquid Crystal Display (LCD) and a speaker; a storage section 608 including a hard disk and the like; and a communication section 609 including a network interface card such as a LAN card, a modem, or the like. The communication section 609 performs communication processing via a network such as the internet. A driver 610 is also connected to the I/O interface 605 as needed. A removable medium 611 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 610 as necessary, so that the computer program read out therefrom is mounted in the storage section 608 as necessary.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable storage medium, the computer program containing program code for performing the method illustrated by the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network through the communication section 609 and/or installed from the removable medium 611. The computer program performs the above-described functions defined in the method of the present application when executed by a Central Processing Unit (CPU) 601. It should be noted that the computer readable storage medium of the present application can be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this application, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In this application, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable storage medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable storage medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present application may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + +, or the like, as well as conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The modules described in the embodiments of the present application may be implemented by software or hardware.
As another aspect, the present application also provides a computer-readable storage medium, which may be included in the electronic device described in the above embodiments; or may exist separately without being assembled into the electronic device. The computer readable storage medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: extracting all associated attribute information in the original data packet, storing the attribute information in a relational database, and storing an analysis index of the attribute association record in an index database, wherein the analysis index comprises a starting attribute, a terminating attribute and an associated edge; traversing the index library, determining whether a record exists in the index library for analyzing the index, if not, executing an unassociated attribute intimacy degree analysis engine, and importing an analysis result into an association edge; if yes, executing the associated attribute intimacy degree adjusting engine, and updating the attribute intimacy degree analysis result to the associated edge; traversing the index library, acquiring all the association attribute relations of the current analysis index, and updating the attribute affinity analysis result to the association edge of the current analysis index according to the attribute affinity adjustment rule if the common association attribute exists between the initial attribute and the termination attribute of the current analysis index.
The foregoing description is only exemplary of the preferred embodiments of the application and is illustrative of the principles of the technology employed. It will be appreciated by those skilled in the art that the scope of the invention herein disclosed is not limited to the particular combination of features described above, but also encompasses other arrangements formed by any combination of the above features or their equivalents without departing from the spirit of the invention. For example, the above features may be replaced with (but not limited to) features having similar functions disclosed in the present application.

Claims (7)

1. An analytical method for attribute intimacy, comprising:
s1: extracting all associated attribute information in an original data packet, storing the attribute information in a relational database, and storing an analysis index of an attribute association record in an index database, wherein the analysis index comprises a starting attribute, a terminating attribute and an associated edge;
s2: traversing the index library, determining whether the analysis index has a record in the index library, if not, executing an unassociated attribute intimacy degree analysis engine, and importing an analysis result into the association edge; if yes, executing the associated attribute intimacy degree adjusting engine, and updating the attribute intimacy degree analysis result to the associated edge; and
s3: traversing the index library, acquiring all association attribute relations of the current analysis index, and updating an attribute affinity analysis result to the association edge of the current analysis index according to an attribute affinity adjustment rule if a common association attribute exists between the initial attribute and the termination attribute of the current analysis index;
taking the initial attribute and the termination attribute in the analysis index as reference, retrieving all attribute affinity analysis rules of the initial attribute and the termination attribute from an attribute affinity analysis rule library to form a rule analysis set Pn, wherein the elements of the rule analysis set Pn comprise a data source, an operation rule, an operation result threshold, a weight, a competitive factor and an interference factor, and the rule analysis set Pn is arranged in a weight reverse order;
the executing unassociated attribute affinity analysis engine comprises: traversing the rule analysis set Pn, if the data source of the rule analysis set Pn is the same as the data source in the analysis index, obtaining a result set Mn according to an operation rule, and if the operation result of the result set Mn is greater than the operation result threshold of the rule analysis set Pn, the attribute affinity of the analysis index is the product of the operation result of the result set Mn and the weight;
and the associated attribute intimacy degree adjusting engine adjusts the associated attribute intimacy degree according to the competitive factor of the rule analysis set Pn through the associated attribute intimacy degree competitive adjustment.
2. The method of claim 1, wherein in response to each attribute association being stored in the relational database, a new attribute affinity in the scheduling task table is added to calculate a task to be processed, and the task to be processed is allocated according to a computing resource vacancy condition.
3. The method as claimed in claim 1, wherein the competition factor includes an account opening date, and if the account opening date of the new association is later than the old association in the index database, the attribute affinity analysis result is updated to the association edge.
4. A computer-readable storage medium having one or more computer programs stored thereon, which when executed by a computer processor perform the method of any one of claims 1 to 3.
5. An analysis system for attribute intimacy, the system comprising:
a database and index database construction unit: the method comprises the steps that configuration is carried out, all associated attribute information in an original data packet is extracted and stored in a relational database, and an analysis index of attribute association records is stored in an index database, wherein the analysis index comprises a starting attribute, a terminating attribute and an associated edge;
an intimacy degree analysis adjusting unit: the system is configured to traverse the index library, determine whether the analysis index has a record in the index library, if not, execute an unassociated attribute intimacy degree analysis engine, and import an analysis result into the association edge; if yes, executing the associated attribute intimacy degree adjusting engine, and updating the attribute intimacy degree analysis result to the associated edge;
an association attribute affinity promotion adjustment unit: the index database is configured to be traversed to obtain all correlation attribute relations of the current analysis index, and according to an attribute affinity adjustment rule, if a common correlation attribute exists between the initial attribute and the termination attribute of the current analysis index, an attribute affinity analysis result is updated to the correlation edge of the current analysis index;
taking the initial attribute and the termination attribute in the analysis index as reference, retrieving all attribute affinity analysis rules of the initial attribute and the termination attribute from an attribute affinity analysis rule library to form a rule analysis set Pn, wherein the elements of the rule analysis set Pn comprise a data source, an operation rule, an operation result threshold, a weight, a competitive factor and an interference factor, and the rule analysis set Pn is arranged in a weight reverse order;
the executing unassociated attribute affinity analysis engine comprises: traversing the rule analysis set Pn, if the data source of the rule analysis set Pn is the same as the data source in the analysis index, obtaining a result set Mn according to an operation rule, and if the operation result of the result set Mn is greater than the operation result threshold of the rule analysis set Pn, the attribute affinity of the analysis index is the product of the operation result of the result set Mn and the weight;
and the associated attribute intimacy degree adjusting engine adjusts the associated attribute intimacy degree according to the competitive factor of the rule analysis set Pn through the associated attribute intimacy degree competitive adjustment.
6. The analysis system for attribute intimacy degree according to claim 5, further comprising a task scheduling unit: and the configuration is used for responding to the condition that each attribute incidence relation is stored in the relation database, newly adding an attribute affinity in a scheduling task table to calculate the task to be processed, and distributing the task to be processed according to the idle condition of the calculation resources.
7. The system of claim 5, wherein the competition factor comprises an account opening date, and if the account opening date of the new association is later than the old association in the index database, the attribute affinity analysis result is updated to the association edge.
CN202110955230.3A 2021-08-19 2021-08-19 Method and system for analyzing attribute intimacy Active CN113761390B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110955230.3A CN113761390B (en) 2021-08-19 2021-08-19 Method and system for analyzing attribute intimacy

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110955230.3A CN113761390B (en) 2021-08-19 2021-08-19 Method and system for analyzing attribute intimacy

Publications (2)

Publication Number Publication Date
CN113761390A CN113761390A (en) 2021-12-07
CN113761390B true CN113761390B (en) 2022-07-22

Family

ID=78790488

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110955230.3A Active CN113761390B (en) 2021-08-19 2021-08-19 Method and system for analyzing attribute intimacy

Country Status (1)

Country Link
CN (1) CN113761390B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115865712B (en) * 2022-11-15 2024-08-02 北京众鼎景和科技有限公司 Edge data storage method based on aging intimacy model
CN116578652B (en) * 2023-07-13 2024-01-16 中国人民解放军国防科技大学 Multi-table associated data set backfilling system and method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108897780A (en) * 2018-06-06 2018-11-27 山东合天智汇信息技术有限公司 A kind of method and system of analytical calculation personnel cohesion
CN109428928A (en) * 2017-08-31 2019-03-05 腾讯科技(深圳)有限公司 Selection method, device and the equipment of information push object
CN111917574A (en) * 2020-07-21 2020-11-10 上海阿尔卡特网络支援系统有限公司 Social network topology model and construction method thereof, user confidence degree and intimacy degree calculation method and telecommunication fraud intelligent interception system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170301028A1 (en) * 2016-04-13 2017-10-19 Gregory David Strabel Processing system to generate attribute analysis scores for electronic records

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109428928A (en) * 2017-08-31 2019-03-05 腾讯科技(深圳)有限公司 Selection method, device and the equipment of information push object
CN108897780A (en) * 2018-06-06 2018-11-27 山东合天智汇信息技术有限公司 A kind of method and system of analytical calculation personnel cohesion
CN111917574A (en) * 2020-07-21 2020-11-10 上海阿尔卡特网络支援系统有限公司 Social network topology model and construction method thereof, user confidence degree and intimacy degree calculation method and telecommunication fraud intelligent interception system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"群智感知中采用节点社会属性的亲密度量化方法";张文东等;《西安交通大学学报》;20170328;第51卷(第06期);第73-78页 *

Also Published As

Publication number Publication date
CN113761390A (en) 2021-12-07

Similar Documents

Publication Publication Date Title
CN110019218B (en) Data storage and query method and equipment
CN113761390B (en) Method and system for analyzing attribute intimacy
CN111563103B (en) Method and system for detecting data blood relationship
CN107688626B (en) Slow query log processing method and device and electronic equipment
CN111382320A (en) Large-scale data increment processing method for knowledge graph
CN116126901A (en) Data processing method, device, electronic equipment and computer readable storage medium
CN110737727B (en) Data processing method and system
CN111125199A (en) Database access method and device and electronic equipment
CN109189343B (en) Metadata disk-dropping method, device, equipment and computer-readable storage medium
CN117807091A (en) Data association method and device
CN113094444B (en) Data processing method, data processing device, computer equipment and medium
Colosi et al. Time series data management optimized for smart city policy decision
CN113641705B (en) Marketing disposal rule engine method based on calculation engine
CN116010345A (en) Method, device and equipment for realizing table service scheme of flow batch integrated data lake
CN114036160A (en) Real-time data acquisition method and device, electronic equipment and storage medium
CN117472873A (en) Data migration method, device, computing device cluster and storage medium
CN114328606A (en) Method, device and storage medium for improving SQL execution efficiency
CN114048219A (en) Graph database updating method and device
CN114116908A (en) Data management method and device and electronic equipment
CN117421297B (en) Statistical information collection method for dynamic partition table of distributed database
CN114691749B (en) Method for parallel incremental mining of frequent item sets based on sliding window
CN111291040B (en) Data processing method, device, equipment and medium
CN118154139B (en) Intelligent business process monitoring method and system based on AI
CN118427186B (en) Data blood edge tracing method, device, equipment and medium
CN118535584A (en) Data processing method, device, medium and equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant