CN117808601A - Capital tracing method and system based on big data - Google Patents

Publication number
CN117808601A
Authority
CN
China
Prior art keywords
transaction
fund
user
fluctuation
tracing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202410217985.7A
Other languages
Chinese (zh)
Other versions
CN117808601B (en)
Inventor
王执祥
黄光明
李延明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Lingchao Software Technology Co ltd
Original Assignee
Shandong Lingchao Software Technology Co ltd
Application filed by Shandong Lingchao Software Technology Co ltd
Priority to CN202410217985.7A (patent CN117808601B)
Priority claimed from CN202410217985.7A (patent CN117808601B)
Publication of CN117808601A
Application granted
Publication of CN117808601B
Legal status: Active

Landscapes

  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

The invention discloses a fund tracing method and system based on big data, relating in particular to the technical field of big data. The system comprises a multi-source data acquisition module, a data preprocessing module, a fund flow graph database storage module, a data processing module, a fund flow real-time monitoring module, a fund tracing execution module and a tracing result output module. The fund flow graph database storage module stores the preprocessed data and constructs a fund flow graph of each user, which facilitates analysis and tracing of fund paths; the data processing module integrates multiple transaction records and calculates related data, providing insight into user behavior; the fund flow real-time monitoring module calculates a risk fluctuation index and issues alarm information; and the fund tracing execution module receives the alarm information, executes the fund tracing operation and generates a fund tracing report that reveals the complete flow path of the funds and the associated transaction information.

Description

Capital tracing method and system based on big data
Technical Field
The invention relates to the technical field of big data, in particular to a fund tracing method and a fund tracing system based on big data.
Background
In the current financial field, the flow of funds and transaction data is enormous. To ensure the safety and compliance of funds, efficient monitoring and management of such data is required.
Existing fund tracing methods rely mainly on traditional databases and data processing techniques; they are inefficient when processing large volumes of data and struggle to provide real-time monitoring and rapid fund tracing. Fund tracing also commonly suffers from data silos, incomplete information and insufficient analysis capability, so the direction of fund flows cannot be traced quickly and accurately. In addition, existing methods often fail to integrate multi-source data effectively, which leaves fund flows unclear and makes supervision more difficult. A fund tracing method and system based on big data is therefore urgently needed, one that can efficiently process and analyze large volumes of financial data and achieve real-time monitoring and rapid tracing of funds.
Disclosure of Invention
In order to overcome the defects in the prior art, the invention provides a fund tracing method and system based on big data. A multi-source data acquisition module collects transaction data of all users from the banking, securities and insurance fields, providing the system with a comprehensive data set so that all possible fund flow paths can be traced; a data preprocessing module cleans, de-duplicates and formats the collected data and performs feature engineering preprocessing, ensuring data quality and consistency and providing accurate and reliable data for analysis; a fund flow graph database storage module stores the preprocessed data and constructs a fund flow graph of each user, which facilitates analysis and tracing of fund paths and speeds up data retrieval; a data processing module integrates multiple transaction records and calculates related data, providing insight into user behavior and helping to identify abnormal patterns and risky behavior; a fund flow real-time monitoring module monitors fund flows in real time, calculates a risk fluctuation index, compares it with a preset risk fluctuation index threshold and sends alarm information to a fund tracing execution module, so that possible illegal or fraudulent activity is responded to in real time and losses are reduced; the fund tracing execution module, immediately upon receiving the alarm information, executes the fund tracing operation, retrieves data from the fund flow graph database storage module and generates a fund tracing report revealing the complete flow path of the funds and the associated transaction information, providing regulators and financial institutions with a powerful tool against money laundering and other financial crimes; and a tracing result output module displays the tracing result as a network diagram, helping regulators understand complex fund relationship networks, make corresponding decisions and formulate targeted supervision measures. The problems set forth in the background art are thereby solved.
In order to achieve the above purpose, the present invention provides the following technical solutions: a big data based funds traceback system comprising:
a multi-source data acquisition module: for collecting transaction data of all users from the banking, securities and insurance fields, including personal transfer records, stock transaction records, insurance application records and consumption records;
a data preprocessing module: for cleaning, de-duplicating and formatting the collected data and performing feature engineering preprocessing;
a fund flow graph database storage module: for storing the preprocessed data and constructing a fund flow graph of each user;
a data processing module: for performing preliminary processing on the preprocessed data at a local server, integrating multiple transaction records, calculating for each user in a given time window the number of transactions, average transaction amount, transaction time interval, accumulated transaction amount and number of transaction counterparties, as well as the number and frequency of different transaction types of each user in each time window, and transmitting the results to the fund flow real-time monitoring module;
a fund flow real-time monitoring module: comprising a data calculation unit, a risk fluctuation index calculation unit and a risk judgment unit, for monitoring fund flows in real time by real-time data analysis, calculating a risk fluctuation index, comparing it with a preset risk fluctuation index threshold, and sending alarm information to the fund tracing execution module;
a fund tracing execution module: for receiving the alarm information transmitted by the fund flow real-time monitoring module, immediately executing the fund tracing operation, retrieving data from the fund flow graph database storage module, generating a fund tracing report, and revealing the complete flow path of the funds and the associated transaction information;
a tracing result output module: for displaying the tracing result as a network diagram, helping regulators understand complex fund relationship networks and make corresponding decisions.
In a preferred embodiment, the specific processing procedure of the data processing module is as follows:
A1, dividing the preprocessed data into n time windows T, numbered in turn as M_j, j = 1, 2, 3, ..., n;
A2, for the data M_j, merging the personal transfer records, stock transaction records, insurance application records and consumption records of the same participant into one record set, to obtain k transaction records of m participants in the n time windows; each transaction record comprises transaction time, number of transactions, transaction type and transaction amount;
A3, counting the number of transactions of each user in a given time window, recorded as Ca_i;
A4, calculating the average transaction amount Pe_i of each user in a given time window by a formula in which k represents the total number of transaction records, Ze_iv represents the transaction amount of the v-th transaction record of participant i, and Ca_i represents the number of transactions of participant i;
A5, counting the transaction time interval of each user in a given time window, recorded as Ty_i;
A6, calculating the accumulated transaction amount Le_i of each user in a given time window by a formula in which k represents the total number of transaction records, Ze_iv represents the transaction amount of the v-th transaction record of participant i, and Ca_i represents the number of transactions of participant i;
A7, counting the number of transaction counterparties of each user in a given time window, recorded as Ds_i;
A8, counting the number of different transaction types of each user in each time window, recorded as Zs_i, and calculating their frequency by a formula in which Ca_i represents the number of transactions of participant i.
In a preferred embodiment, the specific calculation process of the fund flow real-time monitoring module is as follows:
B1, calculating the transaction activity, transaction diversity index, average transaction size and network connectivity;
B2, calculating the transaction activity fluctuation mean, transaction diversity index fluctuation mean, average transaction size fluctuation mean and network connectivity fluctuation mean of each user over all time windows.
In a preferred embodiment, the transaction activity Hy is calculated by a formula in which Ca_i represents the number of transactions of participant i and Ty_i represents the transaction time interval of participant i;
the transaction diversity index Dy is calculated by a formula in which p_i represents the proportion of the i-th transaction type and M represents the number of transaction type categories;
the average transaction size Gm is calculated by a formula in which Ca_i represents the number of transactions of participant i and Le_i represents the accumulated transaction amount of participant i;
the network connectivity Ld is obtained by constructing a network graph in which the nodes are users and the edges are transaction relations, and calculating Ld from the numbers of nodes and edges by a formula in which S_x represents the number of edges connected to node x and U represents the total number of nodes in the network.
In a preferred embodiment, the transaction activity fluctuation mean PHy is calculated by a formula in which Hy_j represents the user's transaction activity in the j-th time window, Hy_(j+1) represents the user's transaction activity in the (j+1)-th time window, and n represents the number of time windows;
the transaction diversity index fluctuation mean PDy is calculated by a formula in which Dy_j represents the user's transaction diversity index in the j-th time window, Dy_(j+1) represents the user's transaction diversity index in the (j+1)-th time window, and n represents the number of time windows;
the average transaction size fluctuation mean PGm is calculated by a formula in which Gm_j represents the user's average transaction size in the j-th time window, Gm_(j+1) represents the user's average transaction size in the (j+1)-th time window, and n represents the number of time windows;
the network connectivity fluctuation mean PLd is calculated by a formula in which Ld_j represents the user's network connectivity in the j-th time window, Ld_(j+1) represents the user's network connectivity in the (j+1)-th time window, and n represents the number of time windows.
In a preferred embodiment, the risk fluctuation index calculation unit is configured to calculate the risk fluctuation index, and the specific calculation process is:
C1, standardizing the transaction activity fluctuation mean, transaction diversity index fluctuation mean, average transaction size fluctuation mean and network connectivity fluctuation mean;
C2, calculating the risk fluctuation index Qz from the standardized transaction activity fluctuation mean PHy, transaction diversity index fluctuation mean PDy, average transaction size fluctuation mean PGm and network connectivity fluctuation mean PLd by a formula in which γ1, γ2, γ3 and γ4 represent the scaling factors of the respective terms.
In a preferred embodiment, the risk judgment unit is configured to compare the risk fluctuation index with a preset risk fluctuation index threshold and send alarm information to the fund tracing execution module; specifically, the risk fluctuation index Qz is compared with the preset risk fluctuation index threshold Qz_threshold; if Qz exceeds Qz_threshold, the user's behavior carries higher risk and alarm information is sent; otherwise, the user's behavior carries lower risk.
In order to achieve the above purpose, the present invention further provides the following technical solution: a fund tracing method based on big data, comprising the following steps:
Step S1, collecting transaction data of all users from the banking, securities and insurance fields;
Step S2, cleaning, de-duplicating and formatting the collected data, and performing feature engineering preprocessing;
Step S3, storing the preprocessed data and constructing a fund flow graph of each user;
Step S4, performing preliminary processing on the preprocessed data at a local server, integrating multiple transaction records, and calculating for each user in a given time window the number of transactions, average transaction amount, transaction time interval, accumulated transaction amount and number of transaction counterparties, as well as the number and frequency of different transaction types of each user in each time window;
Step S5, calculating a risk fluctuation index, comparing it with a preset risk fluctuation index threshold, and sending alarm information;
Step S6, immediately upon receiving the alarm information, executing the fund tracing operation, retrieving data from the fund flow graph database storage module, and generating a fund tracing report;
Step S7, displaying the tracing result as a network diagram.
The invention has the technical effects and advantages that:
The multi-source data acquisition module collects transaction data of all users from the banking, securities and insurance fields, providing the system with a comprehensive data set and ensuring that all possible fund flow paths can be traced; the data preprocessing module cleans, de-duplicates and formats the collected data and performs feature engineering preprocessing, ensuring data quality and consistency and providing accurate and reliable data for analysis; the fund flow graph database storage module stores the preprocessed data and constructs a fund flow graph of each user, which facilitates analysis and tracing of fund paths and speeds up data retrieval; the data processing module integrates multiple transaction records and calculates related data, providing insight into user behavior and helping to identify abnormal patterns and risky behavior; the fund flow real-time monitoring module monitors fund flows in real time, calculates the risk fluctuation index, compares it with the preset risk fluctuation index threshold and sends alarm information to the fund tracing execution module, so that possible illegal or fraudulent activity is responded to in real time and losses are reduced; the fund tracing execution module, immediately upon receiving the alarm information, executes the fund tracing operation, retrieves data from the fund flow graph database storage module and generates a fund tracing report revealing the complete flow path of the funds and the associated transaction information, providing regulators and financial institutions with a powerful tool against money laundering and other financial crimes; the tracing result output module displays the tracing result as a network diagram, helping regulators understand complex fund relationship networks, make corresponding decisions and formulate targeted supervision measures. The invention can meet the requirements of the modern financial field for fund flow monitoring and tracing, and has high practical value and market prospects.
Drawings
Fig. 1 is a block diagram showing the overall structure of the present invention.
FIG. 2 is a flow chart of the method steps of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
As shown in Figs. 1-2, the invention provides a fund tracing method and system based on big data, the system comprising a multi-source data acquisition module, a data preprocessing module, a fund flow graph database storage module, a data processing module, a fund flow real-time monitoring module, a fund tracing execution module and a tracing result output module;
the multi-source data acquisition module is used for collecting transaction data of all users from the banking, securities and insurance fields, including personal transfer records, stock transaction records, insurance application records and consumption records; the personal transfer record comprises the transaction participants, account number, transfer amount and transfer time; the stock transaction record comprises the transaction price, transaction quantity, transaction time, holding quantity and profit and loss ratio; the insurance application record comprises the name of the applicant, the application date, the insured amount, the premium amount, the claim settlement amount and the claim settlement time; the consumption record comprises the consumption type and consumption amount;
it should be noted in this embodiment that the multi-source data acquisition module acquires data as follows: the sensitive financial data, namely personal transfer records, stock transaction records and insurance application records, are collected by interfacing with the systems of banks, securities firms and insurance institutions; consumption records are obtained through a data interface in cooperation with the relevant consumption record providers;
the data preprocessing module is used for cleaning, de-duplicating and formatting the collected data and performing feature engineering preprocessing, so as to ensure data quality and consistency and provide a reliable data basis for subsequent analysis; cleaning, de-duplication, formatting and feature engineering preprocessing are prior-art techniques and are therefore not described in detail in this embodiment;
the fund flow graph database storage module is used for storing the preprocessed data and constructing a fund flow graph of each user; the fund flow graph is constructed by converting the transaction data into a graph structure in which the nodes represent transaction entities, such as users, bank accounts and securities accounts, and the edges represent fund flows;
it should be noted that the fund flow graph database storage module stores the processed data in structured form to facilitate rapid retrieval and analysis; constructing the fund flow graph helps reveal the flow path of a user's assets, including the source, destination, amount and time of the funds;
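As a non-limiting illustration of the graph construction described above, the following sketch stores each fund flow as a directed edge carrying its amount and time. The source does not name a graph database or a schema, so the in-memory dictionary and the field names `source`, `target`, `amount` and `time` are assumptions made only for this example.

```python
def build_fund_flow_graph(transfers):
    """Return {node: [(target, amount, time), ...]} for a list of transfers.

    Nodes stand for transaction entities (users, bank accounts, securities
    accounts); each directed edge stands for one fund flow.
    """
    graph = {}
    for t in transfers:
        graph.setdefault(t["source"], []).append((t["target"], t["amount"], t["time"]))
        graph.setdefault(t["target"], [])  # make sure pure receivers are nodes too
    return graph


# Hypothetical sample data: user_a pays into acct_1, which pays user_b.
transfers = [
    {"source": "user_a", "target": "acct_1", "amount": 500.0, "time": "2024-01-02T10:00"},
    {"source": "acct_1", "target": "user_b", "amount": 480.0, "time": "2024-01-02T11:30"},
]
graph = build_fund_flow_graph(transfers)
```

Keeping the amount and time on the edge means the source, destination, amount and time of each flow can all be read back from a single traversal.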
the data processing module is used for performing preliminary processing on the preprocessed data at a local server, integrating multiple transaction records, calculating for each user in a given time window the number of transactions, average transaction amount, transaction time interval, accumulated transaction amount and number of transaction counterparties, as well as the number and frequency of different transaction types of each user in each time window, and transmitting the results to the fund flow real-time monitoring module;
it should be noted in this embodiment that the specific processing procedure of the data processing module is as follows:
A1, dividing the preprocessed data into n time windows T, numbered in turn as M_j, j = 1, 2, 3, ..., n;
A2, for the data M_j, merging the personal transfer records, stock transaction records, insurance application records and consumption records of the same participant into one record set, to obtain k transaction records of m participants in the n time windows; each transaction record comprises transaction time, number of transactions, transaction type and transaction amount;
A3, counting the number of transactions of each user in a given time window, recorded as Ca_i;
A4, calculating the average transaction amount Pe_i of each user in a given time window by a formula in which k represents the total number of transaction records, Ze_iv represents the transaction amount of the v-th transaction record of participant i, and Ca_i represents the number of transactions of participant i;
A5, counting the transaction time interval of each user in a given time window, recorded as Ty_i;
A6, calculating the accumulated transaction amount Le_i of each user in a given time window by a formula in which k represents the total number of transaction records, Ze_iv represents the transaction amount of the v-th transaction record of participant i, and Ca_i represents the number of transactions of participant i, so as to analyze the user's transaction growth trend;
A7, counting the number of transaction counterparties of each user in a given time window, recorded as Ds_i, so as to analyze the user's transaction network and associated parties;
A8, counting the number of different transaction types of each user in each time window, recorded as Zs_i, and calculating their frequency by a formula in which Ca_i represents the number of transactions of participant i;
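Steps A1 to A8 can be sketched as follows. The formulas themselves do not survive in this text, so the sketch makes several assumptions:

- Pe_i is the arithmetic mean of the amounts.
- Le_i is the sum of the amounts.
- Ty_i is the span between a user's first and last transaction in the window.
- The frequency of a transaction type is its count divided by Ca_i.
- Windows have a fixed length.

The record field names are hypothetical.

```python
from collections import defaultdict


def window_statistics(records, window_start, window_length):
    """Group records by (user, window number) and compute the A3-A8 statistics."""
    grouped = defaultdict(list)
    for r in records:
        # A1: 1-based window number M_j, assuming fixed-length windows
        j = int((r["time"] - window_start) // window_length) + 1
        grouped[(r["user"], j)].append(r)  # A2: merge all records of one user

    stats = {}
    for key, recs in grouped.items():
        count = len(recs)                               # A3: Ca_i
        total = sum(r["amount"] for r in recs)          # A6: Le_i (assumed: sum)
        times = [r["time"] for r in recs]
        type_counts = defaultdict(int)
        for r in recs:
            type_counts[r["type"]] += 1                 # A8: Zs_i per type
        stats[key] = {
            "count": count,
            "average": total / count,                   # A4: Pe_i (assumed: mean)
            "interval": max(times) - min(times),        # A5: Ty_i (assumed: span)
            "total": total,
            "counterparties": len({r["counterparty"] for r in recs}),  # A7: Ds_i
            "type_counts": dict(type_counts),
            "type_freq": {t: c / count for t, c in type_counts.items()},
        }
    return stats


# Hypothetical records; time is in hours from the start of observation.
records = [
    {"user": "u1", "time": 1, "amount": 100.0, "type": "transfer", "counterparty": "u2"},
    {"user": "u1", "time": 4, "amount": 300.0, "type": "stock", "counterparty": "broker"},
    {"user": "u1", "time": 12, "amount": 50.0, "type": "transfer", "counterparty": "u2"},
]
stats = window_statistics(records, window_start=0, window_length=10)
```

With a window length of 10 hours, the first two records fall in window 1 and the third in window 2, so `stats[("u1", 1)]` summarizes two transactions and `stats[("u1", 2)]` one.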
the fund flow real-time monitoring module comprises a data calculation unit, a risk fluctuation index calculation unit and a risk judgment unit, and is used for monitoring fund flows in real time by real-time data analysis, calculating a risk fluctuation index, comparing it with a preset risk fluctuation index threshold, and sending alarm information to the fund tracing execution module;
it should be noted in this embodiment that the data calculation unit is configured to calculate the transaction activity fluctuation mean, transaction diversity index fluctuation mean, average transaction size fluctuation mean and network connectivity fluctuation mean of each user over all time windows, and the specific calculation process is as follows:
B1, calculating the transaction activity, transaction diversity index, average transaction size and network connectivity;
the transaction activity Hy is calculated by a formula in which Ca_i represents the number of transactions of participant i and Ty_i represents the transaction time interval of participant i; by analyzing individuals' transaction activity, frequently transacting nodes can be identified, which may be important monitoring targets; particularly where funds move rapidly and frequently, abnormally high activity may indicate a risk of money laundering or other illegal fund flows;
the transaction diversity index Dy is calculated by a formula in which p_i represents the proportion of the i-th transaction type and M represents the number of transaction type categories; diversified transaction types may suggest that the user is conducting normal business, whereas an abrupt change or narrowing of transaction types may point to behavior that evades supervision; a lack of transaction diversity may indicate that funds are used for specific illegal purposes or originate from specific criminal activities;
the average transaction size Gm is calculated by a formula in which Ca_i represents the number of transactions of participant i and Le_i represents the accumulated transaction amount of participant i; large transactions may attract attention, particularly when they do not fit the individual's normal transaction pattern; for fund tracing, analyzing the average transaction size helps identify concentrated fund flows and abnormal transfers of large sums;
the network connectivity Ld is obtained by constructing a network graph in which the nodes are users and the edges are transaction relations, and calculating Ld from the numbers of nodes and edges by a formula in which S_x represents the number of edges connected to node x and U represents the total number of nodes in the network; this value measures the number of direct contacts between a node and other nodes and reflects the node's activity and influence in the network; in the context of fund tracing, network connectivity reveals the degree of association between an individual and other individuals, which helps map potential collusion networks or organizational structures; highly connected nodes may play a key role in the fund flow network and are therefore important for monitoring and investigation;
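The four indicators of B1 may be sketched as below. Because the formulas are absent from this text, every functional form here is an assumption chosen to match the variable definitions: Hy as Ca_i divided by Ty_i, Dy as the Shannon entropy of the type proportions p_i, Gm as Le_i divided by Ca_i, and Ld as the degree of node x divided by U - 1 (degree centrality). The patent's actual formulas may differ.

```python
import math


def transaction_activity(count, interval):
    """Hy, assumed to be Ca_i / Ty_i (transactions per unit time)."""
    return count / interval


def diversity_index(type_counts):
    """Dy, assumed to be the Shannon entropy -sum(p_i * ln p_i) over M types."""
    total = sum(type_counts.values())
    return -sum((c / total) * math.log(c / total) for c in type_counts.values() if c > 0)


def average_transaction_size(total_amount, count):
    """Gm, assumed to be Le_i / Ca_i."""
    return total_amount / count


def network_connectivity(edges, node):
    """Ld, assumed to be S_x / (U - 1) on an undirected graph of distinct edges."""
    nodes = {n for edge in edges for n in edge}
    if len(nodes) < 2:
        return 0.0
    degree = sum(1 for edge in edges if node in edge)  # S_x
    return degree / (len(nodes) - 1)


# Hypothetical transaction relations between four users.
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")]
hy = transaction_activity(6, 3.0)
dy = diversity_index({"transfer": 2, "stock": 2})
gm = average_transaction_size(400.0, 2)
ld = network_connectivity(edges, "c")
```

Under these assumptions a user with a single transaction type has a diversity index of zero, and a node linked to every other node has a connectivity of 1.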
B2, calculating the transaction activity fluctuation mean, transaction diversity index fluctuation mean, average transaction size fluctuation mean and network connectivity fluctuation mean of each user over all time windows;
the transaction activity fluctuation mean PHy is calculated by a formula in which Hy_j represents the user's transaction activity in the j-th time window, Hy_(j+1) represents the user's transaction activity in the (j+1)-th time window, and n represents the number of time windows; this value indicates the stability of the user's transaction frequency, and high volatility may indicate a departure from normal business activity;
the transaction diversity index fluctuation mean PDy is calculated by a formula in which Dy_j represents the user's transaction diversity index in the j-th time window, Dy_(j+1) represents the user's transaction diversity index in the (j+1)-th time window, and n represents the number of time windows; this value reflects the diversity and variability of the user's transaction types; if a user who typically makes multiple types of transactions suddenly makes only a few types, this may be evidence of manipulation or fraud;
the average transaction size fluctuation mean PGm is calculated by a formula in which Gm_j represents the user's average transaction size in the j-th time window, Gm_(j+1) represents the user's average transaction size in the (j+1)-th time window, and n represents the number of time windows; this value shows the degree of change in the user's transaction amounts, and a stable transaction size is likely consistent with a conventional business pattern;
the network connectivity fluctuation mean PLd is calculated by a formula in which Ld_j represents the user's network connectivity in the j-th time window, Ld_(j+1) represents the user's network connectivity in the (j+1)-th time window, and n represents the number of time windows; frequently changing connectivity may indicate that the user is trying to avoid being tracked or is conducting suspicious transactions with many different individuals;
it should be noted that these fluctuation means help identify users whose behavior patterns are inconsistent or do not conform to normal business behavior; in a fund tracing system, such inconsistencies may be indicators of illegal activities such as money laundering and fraud; by monitoring fluctuations in these indicators, financial institutions and regulators can identify and prevent potential financial crimes more effectively while improving the transparency and compliance of the system;
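A fluctuation mean over n windows can be illustrated as follows. The formula is not recoverable from this text; the sketch assumes the mean absolute difference between consecutive windows, that is, the sum of |x_(j+1) - x_j| for j = 1 to n - 1 divided by n - 1, which matches the variables named (the value in window j, the value in window j+1, and n) but may not be the form the patent uses.

```python
def fluctuation_mean(values):
    """Assumed form: average of |x_(j+1) - x_j| over consecutive time windows."""
    if len(values) < 2:
        return 0.0  # a single window has no window-to-window change
    diffs = [abs(later - earlier) for earlier, later in zip(values, values[1:])]
    return sum(diffs) / len(diffs)


# Hypothetical per-window transaction activity Hy_1 .. Hy_4 for one user.
phy = fluctuation_mean([2.0, 4.0, 3.0, 3.0])
steady = fluctuation_mean([5.0, 5.0, 5.0])
```

The same helper would apply unchanged to the per-window series of Dy, Gm and Ld to obtain PDy, PGm and PLd.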
in this embodiment, it should be specifically noted that the risk fluctuation index calculation unit is used for calculating the risk fluctuation index, and the specific calculation process is as follows:
C1, standardizing the transaction activity fluctuation mean, the transaction diversity index fluctuation mean, the average transaction size fluctuation mean and the network connectivity fluctuation mean so that they can be compared on the same scale; standardization is a prior-art technique, so this embodiment does not describe it in detail;
C2, calculating the risk fluctuation index Qz from the standardized transaction activity fluctuation mean PHy, transaction diversity index fluctuation mean PDy, average transaction size fluctuation mean PGm and network connectivity fluctuation mean PLd, wherein γ1, γ2, γ3 and γ4 denote the proportionality coefficients of the respective terms; each coefficient is a specific numerical value obtained by quantizing the corresponding parameter to facilitate subsequent comparison, and its magnitude may be chosen freely as long as the proportional relationship between the parameter and its quantized value is not affected;
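Steps C1 and C2 can be sketched as follows. Two assumptions are made because the text does not fix them: standardization is done by min-max scaling across users (any prior-art method could be substituted), and Qz is taken to be a linear weighted sum of the four standardized means. The function names and the equal default coefficients are hypothetical.

```python
from typing import List, Tuple


def min_max_standardize(values: List[float]) -> List[float]:
    """Scale one indicator across all users into [0, 1] (assumed method for C1)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All users share the same value; no spread to scale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def risk_fluctuation_index(
    phy: float,
    pdy: float,
    pgm: float,
    pld: float,
    gammas: Tuple[float, float, float, float] = (0.25, 0.25, 0.25, 0.25),
) -> float:
    """Combine the standardized means into Qz (assumed weighted-sum form for C2)."""
    g1, g2, g3, g4 = gammas
    return g1 * phy + g2 * pdy + g3 * pgm + g4 * pld


# Raw activity fluctuation means for three illustrative users.
raw_phy = [1.0, 2.0, 3.0]
print(min_max_standardize(raw_phy))
print(risk_fluctuation_index(1.0, 0.0, 0.5, 0.5))
```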
in this embodiment, it should be specifically noted that the risk judging unit is used for comparing the risk fluctuation index with a preset risk fluctuation index threshold and sending alarm information to the fund tracing execution module; specifically, the risk fluctuation index Qz is compared with the preset risk fluctuation index threshold Qz_threshold: if Qz exceeds Qz_threshold, the user's behavior carries a higher risk and alarm information is sent; otherwise, the user's behavior carries a lower risk; the preset threshold Qz_threshold can be set according to the specific situation, and this embodiment does not limit its value;
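The judgment made by the risk judging unit can be sketched in a few lines. The `Alarm` record and the `judge_risk` function are hypothetical names, and the use of a strict comparison is an assumption, since the comparison operator is not legible in this text.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Alarm:
    """Hypothetical alarm payload passed to the fund tracing execution module."""
    user_id: str
    qz: float
    threshold: float


def judge_risk(user_id: str, qz: float, qz_threshold: float) -> Optional[Alarm]:
    """Return an alarm when Qz is above the preset threshold, otherwise None.

    ASSUMPTION: a strict '>' comparison is used.
    """
    if qz > qz_threshold:
        return Alarm(user_id=user_id, qz=qz, threshold=qz_threshold)
    return None


print(judge_risk("user-1", 0.8, 0.6))
print(judge_risk("user-2", 0.3, 0.6))
```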
the fund tracing execution module is used for executing the fund tracing operation immediately after receiving the alarm information transmitted by the fund flow real-time monitoring module, retrieving the fund flow from the fund flow chart database storage module, generating a fund tracing report, and revealing the complete flow path of the funds and the associated transaction information; the fund tracing report records the details of each fund transaction, including the amount, the time, both parties to the transaction, and the inflow and outflow nodes of the funds;
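One way to picture the tracing operation is a breadth-first traversal over the stored transfers, starting from the flagged user and following outgoing funds. This is only a sketch under stated assumptions: the text does not specify a traversal algorithm or a graph database API, so an in-memory list stands in for the storage module, and the names `Transfer`, `trace_funds` and `max_hops` are hypothetical.

```python
from collections import deque
from typing import Dict, List, NamedTuple


class Transfer(NamedTuple):
    source: str   # outflow node
    target: str   # inflow node
    amount: float
    time: str


def trace_funds(transfers: List[Transfer], start: str, max_hops: int = 5) -> List[Transfer]:
    """Collect transfers reachable downstream from `start` within `max_hops` hops."""
    outgoing: Dict[str, List[Transfer]] = {}
    for t in transfers:
        outgoing.setdefault(t.source, []).append(t)

    report: List[Transfer] = []
    visited = {start}
    queue = deque([(start, 0)])
    while queue:
        node, hops = queue.popleft()
        if hops >= max_hops:
            continue
        for t in outgoing.get(node, []):
            report.append(t)  # each report entry carries amount, time and both parties
            if t.target not in visited:
                visited.add(t.target)
                queue.append((t.target, hops + 1))
    return report


transfers = [
    Transfer("A", "B", 100.0, "2024-01-01T10:00"),
    Transfer("B", "C", 90.0, "2024-01-01T11:00"),
    Transfer("C", "A", 80.0, "2024-01-01T12:00"),
    Transfer("D", "E", 5.0, "2024-01-02T09:00"),
]
for entry in trace_funds(transfers, "A"):
    print(entry)
```

The visited set keeps the traversal from looping when funds circulate back to an earlier account, which is a common pattern in layered transfers.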
the tracing result output module is used for displaying the tracing results as a network diagram, helping regulatory authorities understand complex fund relationship networks and make corresponding decisions;
in this embodiment, it should be specifically noted that a fund tracing method based on big data comprises the following steps:
step S1, collecting transaction data of all users from the banking, securities and insurance financial fields;
step S2, cleaning, de-duplicating and formatting the acquired data, and performing feature engineering preprocessing;
step S3, storing the preprocessed data and constructing a fund flow chart of the user;
step S4, performing preliminary processing on the preprocessed data at the local server, integrating multiple transaction records, and calculating, for each user in a given time window, the number of transactions, the average transaction amount, the transaction time interval, the cumulative transaction amount and the number of transaction counterparties, as well as the number and frequency of the different transaction types of each user in each time window;
step S5, calculating the risk fluctuation index, comparing it with the preset risk fluctuation index threshold, and sending alarm information;
step S6, executing the fund tracing operation immediately after the alarm information is received, retrieving the fund flow from the fund flow chart database storage module, and generating a fund tracing report;
step S7, displaying the tracing results as a network diagram.
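The per-window statistics of step S4 can be sketched as a simple grouping by user and time window. The record fields, the fixed-length windowing by timestamp, and the definition of type frequency as each type's share of the user's transactions in the window are assumptions made for illustration; the transaction time interval is left out because the text does not define how it is measured.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple, Tuple


class Txn(NamedTuple):
    user: str
    counterparty: str
    amount: float
    kind: str         # e.g. "transfer", "stock", "consumption"
    timestamp: float  # seconds since an arbitrary epoch


def window_stats(txns: Iterable[Txn], window_seconds: float) -> Dict[Tuple[str, int], dict]:
    """Aggregate transactions per (user, window index)."""
    groups: Dict[Tuple[str, int], List[Txn]] = defaultdict(list)
    for t in txns:
        groups[(t.user, int(t.timestamp // window_seconds))].append(t)

    stats: Dict[Tuple[str, int], dict] = {}
    for key, items in groups.items():
        total = sum(t.amount for t in items)
        kind_counts: Dict[str, int] = defaultdict(int)
        for t in items:
            kind_counts[t.kind] += 1
        stats[key] = {
            "count": len(items),                   # number of transactions
            "cumulative_amount": total,            # cumulative transaction amount
            "average_amount": total / len(items),  # average transaction amount
            "counterparties": len({t.counterparty for t in items}),
            "type_frequency": {k: c / len(items) for k, c in kind_counts.items()},
        }
    return stats


txns = [
    Txn("u1", "a", 100.0, "transfer", 10.0),
    Txn("u1", "b", 300.0, "stock", 20.0),
    Txn("u1", "a", 50.0, "transfer", 3700.0),
]
for key, value in sorted(window_stats(txns, window_seconds=3600.0).items()):
    print(key, value)
```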
Finally, it should be noted that the foregoing describes only preferred embodiments of the invention and is not intended to limit the invention; any modifications, equivalents and improvements made within the spirit and principles of the invention are intended to be included within the scope of the invention.
The foregoing is merely specific embodiments of the present application, but the scope of the present application is not limited thereto, and any person skilled in the art can easily think about changes or substitutions within the technical scope of the present application, and the changes and substitutions are intended to be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (8)

1. A big data based funds traceback system comprising:
a multi-source data acquisition module: for collecting transaction data of all users from the banking, securities and insurance financial fields, including personal transfer records, stock transaction records, application records and consumption records;
a data preprocessing module: for cleaning, de-duplicating and formatting the acquired data and performing feature engineering preprocessing operations;
a fund flow chart database storage module: for storing the preprocessed data and constructing a fund flow chart of the user;
a data processing module: for performing preliminary processing on the preprocessed data at the local server, integrating multiple transaction records, calculating, for each user in a given time window, the number of transactions, the average transaction amount, the transaction time interval, the cumulative transaction amount and the number of transaction counterparties, as well as the number and frequency of the different transaction types of each user in each time window, and transmitting the results to the fund flow real-time monitoring module;
a fund flow real-time monitoring module: comprising a data calculation unit, a risk fluctuation index calculation unit and a risk judging unit, for monitoring fund flows in real time by means of real-time data analysis technology, calculating the risk fluctuation index, comparing it with a preset risk fluctuation index threshold, and sending alarm information to the fund tracing execution module;
a fund tracing execution module: for executing the fund tracing operation immediately after receiving the alarm information transmitted by the fund flow real-time monitoring module, retrieving the fund flow from the fund flow chart database storage module, generating a fund tracing report, and revealing the complete flow path of the funds and the associated transaction information;
a tracing result output module: for displaying the tracing results as a network diagram, helping regulatory authorities understand complex fund relationship networks and make corresponding decisions.
2. A big data based funds traceback system as defined in claim 1, wherein:
the specific processing procedure of the data processing module is as follows:
A1, dividing the preprocessed data into n time windows T, numbered in sequence as M_j, j = 1, 2, 3, ..., n;
A2, for the data M_j, merging all transaction records of the same participant, namely the personal transfer records, stock transaction records, application records and consumption records, into a single record, thereby obtaining k transaction records of m participants in the n time windows; each transaction record comprises the transaction time, the number of transactions, the transaction type and the transaction amount;
A3, counting the number of transactions of each user in a given time window, denoted Ca_i;
A4, calculating the average transaction amount Pe_i of each user in a given time window, wherein k denotes the total number of transaction records, Ze_iv denotes the transaction amount of the v-th transaction record of participant i, and Ca_i denotes the number of transactions of participant i;
A5, counting the transaction time interval of each user in a given time window, denoted Ty_i;
A6, calculating the cumulative transaction amount Le_i of each user in a given time window, wherein k denotes the total number of transaction records, Ze_iv denotes the transaction amount of the v-th transaction record of participant i, and Ca_i denotes the number of transactions of participant i;
A7, counting the number of transaction counterparties of each user in a given time window, denoted Ds_i;
A8, counting the number of different transaction types of each user in each time window, denoted Zs_i, and calculating their frequency, wherein Ca_i denotes the number of transactions of participant i.
3. A big data based funds traceback system as defined in claim 1, wherein: the specific calculation process of the fund flow real-time monitoring module is as follows:
B1, calculating the transaction activity, the transaction diversity index, the average transaction size and the network connectivity;
B2, calculating the transaction activity fluctuation mean, the transaction diversity index fluctuation mean, the average transaction size fluctuation mean and the network connectivity fluctuation mean of each user over all time windows.
4. A big data based funds traceback system according to claim 3, wherein: the transaction activity Hy is calculated by a formula in which Ca_i denotes the number of transactions of participant i and Ty_i denotes the transaction time interval of participant i;
the transaction diversity index Dy is calculated by a formula in which p_i denotes the proportion of the i-th transaction type and M denotes the categories of transaction types;
the average transaction size Gm is calculated by a formula in which Ca_i denotes the number of transactions of participant i and Le_i denotes the cumulative transaction amount of participant i;
the network connectivity Ld is obtained by constructing a network graph in which the nodes are users and the edges are transaction relationships, and calculating the network connectivity Ld from the numbers of nodes and edges, wherein S_x denotes the number of edges connected to node x and U denotes the total number of nodes in the network.
5. A big data based funds traceback system according to claim 3, wherein: the transaction activity fluctuation mean PHy is calculated by a formula in which Hy_j denotes the user's transaction activity in the j-th time window, Hy_{j+1} denotes the user's transaction activity in the (j+1)-th time window, and n denotes the number of time windows;
the transaction diversity index fluctuation mean PDy is calculated by a formula in which Dy_j denotes the user's transaction diversity index in the j-th time window, Dy_{j+1} denotes the user's transaction diversity index in the (j+1)-th time window, and n denotes the number of time windows;
the average transaction size fluctuation mean PGm is calculated by a formula in which Gm_j denotes the user's average transaction size in the j-th time window, Gm_{j+1} denotes the user's average transaction size in the (j+1)-th time window, and n denotes the number of time windows;
the network connectivity fluctuation mean PLd is calculated by a formula in which Ld_j denotes the user's network connectivity in the j-th time window, Ld_{j+1} denotes the user's network connectivity in the (j+1)-th time window, and n denotes the number of time windows.
6. A big data based funds traceback system as defined in claim 1, wherein: the risk fluctuation index calculation unit is used for calculating a risk fluctuation index, and the specific calculation process is as follows:
C1, standardizing the transaction activity fluctuation mean, the transaction diversity index fluctuation mean, the average transaction size fluctuation mean and the network connectivity fluctuation mean;
C2, calculating the risk fluctuation index Qz from the standardized transaction activity fluctuation mean PHy, transaction diversity index fluctuation mean PDy, average transaction size fluctuation mean PGm and network connectivity fluctuation mean PLd, wherein γ1, γ2, γ3 and γ4 denote the proportionality coefficients of the respective terms.
7. A big data based funds traceback system as defined in claim 1, wherein: the risk judging unit is used for comparing the risk fluctuation index with a preset risk fluctuation index threshold and sending alarm information to the fund tracing execution module; specifically, the risk fluctuation index Qz is compared with the preset risk fluctuation index threshold Qz_threshold: if Qz exceeds Qz_threshold, the user's behavior carries a higher risk and alarm information is sent; otherwise, the user's behavior carries a lower risk.
8. A big data based funds tracing method for implementing the big data based funds tracing system of any one of claims 1-7, comprising the steps of:
step S1, collecting transaction data of all users from the banking, securities and insurance financial fields;
step S2, cleaning, de-duplicating and formatting the acquired data, and performing feature engineering preprocessing;
step S3, storing the preprocessed data and constructing a fund flow chart of the user;
step S4, performing preliminary processing on the preprocessed data at the local server, integrating multiple transaction records, and calculating, for each user in a given time window, the number of transactions, the average transaction amount, the transaction time interval, the cumulative transaction amount and the number of transaction counterparties, as well as the number and frequency of the different transaction types of each user in each time window;
step S5, calculating the risk fluctuation index, comparing it with the preset risk fluctuation index threshold, and sending alarm information;
step S6, executing the fund tracing operation immediately after the alarm information is received, retrieving the fund flow from the fund flow chart database storage module, and generating a fund tracing report;
step S7, displaying the tracing results as a network diagram.
CN202410217985.7A 2024-02-28 Capital tracing method and system based on big data Active CN117808601B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410217985.7A CN117808601B (en) 2024-02-28 Capital tracing method and system based on big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410217985.7A CN117808601B (en) 2024-02-28 Capital tracing method and system based on big data

Publications (2)

Publication Number Publication Date
CN117808601A true CN117808601A (en) 2024-04-02
CN117808601B CN117808601B (en) 2024-05-24

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030233319A1 (en) * 2001-03-20 2003-12-18 David Lawrence Electronic fund transfer participant risk management clearing
JP2015088037A (en) * 2013-10-31 2015-05-07 株式会社日立ソリューションズ Fund flow analysis device and method
CN104867055A (en) * 2015-06-16 2015-08-26 咸宁市公安局 Financial network doubtable money tracking and identifying method
CN109919608A (en) * 2018-11-28 2019-06-21 阿里巴巴集团控股有限公司 A kind of recognition methods, device and the server of high-risk transaction agent
CN115631039A (en) * 2019-09-26 2023-01-20 支付宝(杭州)信息技术有限公司 Fund tracking method, device and equipment
JP2021149505A (en) * 2020-03-19 2021-09-27 株式会社クリプタクト Information processing system and information providing method
CN114819965A (en) * 2021-01-21 2022-07-29 成都链安科技有限公司 Block chain virtual currency monitoring system
WO2023109116A1 (en) * 2021-12-14 2023-06-22 同济大学 Rapid anti-money laundering detection method based on transaction graph
CN114493864A (en) * 2022-01-04 2022-05-13 中科金审(北京)科技有限公司 Capital big data based anomaly detection system and method
CN114119026A (en) * 2022-01-26 2022-03-01 成都无糖信息技术有限公司 Virtual currency transaction tracking and tracing method and system
CN115631042A (en) * 2022-10-29 2023-01-20 复旦大学 Account model block chain oriented risk transaction detection method
CN115439030A (en) * 2022-11-09 2022-12-06 山东民昊健康科技有限公司 Capital and current information management system based on big data analysis
CN116957598A (en) * 2023-06-02 2023-10-27 华侨大学 Suspicious fund flow direction tracing method and system based on path bundles

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
WU, ZY等: "TRacer: Scalable Graph-Based Transaction Tracing for Account-Based Blockchain Trading Systems", 《IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY》, vol. 18, 12 June 2023 (2023-06-12) *
中亦科技: "图技术赋能资金流向追踪", Retrieved from the Internet <URL:https://zhuanlan.zhihu.com/p/352097596> *
刘璇;张朋柱;李嘉;陈智高;: "商业银行资金异常识别研究", 系统管理学报, no. 03, 15 May 2013 (2013-05-15) *
王达山;: "基于互联网金融的反洗钱模型探索", 金融电子化, no. 03, 15 March 2016 (2016-03-15) *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant