CN113269626A

CN113269626A - Financial manipulation behavior identification method and device, electronic equipment and medium

Info

Publication number: CN113269626A
Application number: CN202110621635.3A
Authority: CN
Inventors: 张军欢; 郑茗译
Original assignee: Beihang University
Current assignee: Beihang University
Priority date: 2021-06-03
Filing date: 2021-06-03
Publication date: 2021-08-17

Abstract

The invention provides a financial manipulation behavior identification method, device, electronic equipment and medium, wherein the financial manipulation behavior identification method includes constructing a financial manipulation behavior identification model; obtaining key characteristic data of the company to be identified; Input to the constructed financial manipulation behavior identification model to identify whether there is financial manipulation in the to-be-identified company. The present invention realizes automatic identification of whether a listed company has financial manipulation by constructing a financial manipulation behavior identification model, which significantly improves the identification efficiency and effect of financial manipulation.

Description

Financial manipulation behavior identification method and device, electronic equipment and medium

Technical Field

The invention relates to the field of big data, in particular to a financial manipulation behavior identification method and device, electronic equipment and a medium.

Background

For the purpose of maintaining stock prices, performance assessment, and funding, listed companies often use various "financial skills" to publish their financial reports, so-called financial manipulations.

Financial manipulations can be broadly divided into two categories: the first category relates to the use of residual power to create accounting rules within the limits permitted by the general accounting system, accounting criteria and related laws in order to intentionally process related accounting data to obtain certain behavioral expectations. It functions within the accounting rules framework and is therefore a legal act on accounting. The second category is protocols that are not subject to accounting rules, where accounting processes performed externally are often manifested as serious violations of current accounting systems, accounting standards, and related legal regulations. This type of accounting process is an illegal accounting process. The provided financial information is not true, i.e., distorted financial information. The term "financial manipulation" as used herein refers to the latter, specifically, the case where no significant matters of the company are revealed in time, no other duties are performed in law, the result of performance prediction is inaccurate or not in time, and false or serious misleading statements are revealed in the information.

Achieving effective identification of financial manipulations of a listed company is crucial to both regulators and large investors. Currently, in order to identify whether financial operations exist in a listed company, a professional having professional financial knowledge and knowing the operation condition of the listed company needs to be engaged, and the financial operations and the operation data can be obtained by performing complicated analysis on the public financial data and the operation data of the listed company. Therefore, there is a need to provide a general financial manipulation behavior recognition model to implement automatic recognition of financial manipulation behaviors of listed companies, so as to finally improve recognition efficiency and recognition effect of financial manipulation and reduce recognition cost.

Disclosure of Invention

In order to achieve the above technical object, a first aspect of the present invention provides a financial manipulation behavior recognition method, which comprises the following detailed technical steps:

a financial manipulation behavior identification method, comprising:

constructing a financial manipulation behavior recognition model;

acquiring key characteristic data of a company to be identified;

and inputting the acquired key characteristic data into the constructed financial manipulation behavior recognition model to recognize whether the to-be-recognized company has financial manipulation behaviors.

In some embodiments, said constructing a financial manipulation behavior recognition model comprises:

determining a candidate feature set, the candidate feature set comprising a number of financial features and a number of non-financial features;

obtaining a sample set, wherein the sample set comprises a positive sample and a negative sample, the positive sample is a company sample with financial manipulation behavior, and the negative sample is a company sample without financial manipulation behavior;

performing a significant difference analysis on each feature in the candidate feature set using the sample set to obtain a number of key features, wherein the key features have significant differences in the positive and negative samples;

and performing logistic regression analysis on the key features to obtain the financial manipulation behavior recognition model.

In some embodiments, said performing a significant difference analysis on features in said candidate set of features using said sample set to obtain a number of key features comprises:

performing significance difference analysis on each feature in the candidate feature set by adopting a single-factor detection method to obtain a first key feature set comprising a first number of candidate features;

performing significance difference analysis on each feature in the candidate feature set by adopting a multivariate logistic regression analysis method to obtain a second key feature set comprising a second number of candidate features;

merging the first and second key feature sets to obtain a third key feature set comprising a third number of the candidate features;

and screening the third key feature set by adopting a factor analysis method to obtain the final key feature.

In some embodiments, the number of positive samples is equal to the number of negative samples. The number of the positive samples and the number of the negative samples are equal and appear in pairs, the corresponding positive samples and the corresponding negative samples belong to the same exchange and the same industry, and the difference of the total market value is within a preset range.

In some embodiments, the single factor detection method is a non-parametric detection method.

In some embodiments, the financial manipulation behavior identification model is a multivariate logistic regression model, represented as follows:

wherein: y is the probability of financial manipulation, X₂Is the flow ratio, X₅Moving the ratio of assets for monetary funds, X₁₄For cash flow to mobile liability ratio, X₁₈Specific gravity, X, of mobile assets for prepaid account₁₉Account specific gravity, X, of mobile assets for accounts receivable₂₂Is the share weight concentration, X₃₄Cash recovery for the entire asset.

A second aspect of the present invention provides a financial manipulation behavior recognition apparatus, comprising:

the modeling module is used for constructing a financial manipulation behavior recognition model;

the acquisition module is used for acquiring key characteristic data of the company to be identified;

and the identification module is used for inputting the acquired key characteristic data into the constructed financial manipulation behavior identification model so as to identify whether the company to be identified has financial manipulation behaviors.

In some embodiments, the modeling module comprises:

a determining sub-module for determining a candidate feature set, the candidate feature set comprising a plurality of financial features and a plurality of non-financial features;

the acquisition sub-module is used for acquiring a sample set, wherein the sample set comprises a positive sample and a negative sample, the positive sample is a company sample with financial manipulation behavior, and the negative sample is a company sample without financial manipulation behavior;

a key feature selection module, configured to perform a significant difference analysis on each feature in the candidate feature set using the sample set to obtain a number of key features, where the key features have significant differences in the positive sample and the negative sample;

and the model training submodule is used for carrying out logistic regression analysis on the key features to obtain the financial manipulation behavior recognition model.

A third aspect of the present invention provides an electronic device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the processor implements the financial manipulation behavior recognition method according to the first aspect of the present invention when executing the program.

A fourth aspect of the present invention provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the financial manipulation behavior recognition method of the first aspect of the present invention.

The invention realizes the automatic identification of whether the financial manipulation exists in the company to be identified by constructing the financial manipulation behavior identification model, obviously improves the identification efficiency and the identification effect of the financial manipulation of the listed company, and obviously reduces the identification cost.

Drawings

FIG. 1 is a flow chart of a financial manipulation behavior identification method according to a first embodiment of the present invention;

FIG. 2 is a flow chart of a financial manipulation behavior identification method according to a first embodiment of the present invention;

fig. 3 is a block diagram showing a financial manipulation behavior recognition apparatus according to a second embodiment of the present invention;

fig. 4 is a block diagram showing a financial manipulation behavior recognition apparatus according to a second embodiment of the present invention;

fig. 5 is a block diagram of an electronic device according to a third embodiment of the present invention.

Detailed Description

In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.

The terms "first," "second," "third," "fourth," and the like in the description and in the claims, as well as in the drawings, if any, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the invention described herein are, for example, capable of operation in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.

Example one

As shown in fig. 1, the financial manipulation behavior recognition method 100 provided by the present embodiment includes the following steps:

and S101, constructing a financial manipulation behavior recognition model.

Specifically, as shown in fig. 2, step S101 includes the following sub-steps:

s1011, determining a candidate feature set, wherein the candidate feature set comprises a plurality of financial features and a plurality of non-financial features.

As will be appreciated by those skilled in the art, the traces of financial manipulations must be reflected in the corporate business and financial data, and therefore we choose to determine a candidate feature set from the corporate business and financial data, namely: each candidate feature in the set of candidate features may have an association with a financial manipulation. Of course, the determination of these candidate features is based on research efforts already in the field.

Candidate features belong to two classes: financial and non-financial indicators. The financial index system is subdivided into several dimensions of repayment ability, profit ability, asset quality, profit quality, operation ability, cash flow, development potential, joint transaction degree and risk level, and the non-financial index system is divided into the following aspects: company governance, company operation, operation risk and audit information.

Optionally, in this embodiment, the candidate features and definitions included in the finally determined candidate feature set are shown in table 1:

TABLE 1 candidate features and definitions

S1012, obtaining a sample set, wherein the sample set comprises a positive sample and a negative sample, the positive sample is a company sample with financial manipulation behavior, and the negative sample is a company sample without financial manipulation behavior.

Alternatively, we affirm the listed a stock financial handling company, which has been certified to officially notify and be subject to administrative sanctioning during 1/2012 to 9/30/2020, as a positive sample, where we obtain 263 positive samples without including general notifications of report criticism, disclosure and repriming, etc. to the financial handling company. The financial data, business data and corporate governance data for the sample company are from the wind database.

Next, we performed further screening of the samples by the following procedure:

1. for a company with financial manipulation behaviors for two or more continuous years, taking data of the first year of occurrence of the main discovered manipulation behaviors as data adopted by research according to the penalty time of the certificate and the supervision;

2. since the evaluation standards of the operation mode and the performance index are different from those of the common company, the marketing companies such as finance, insurance and the like are eliminated;

3. eliminating companies whose financial data are not complete and cannot acquire a complete index system:

4. and eliminating the companies which are in the stop-plate on the transaction days before and after the certificate supervision advising penalty day.

After the screening, 132 samples with financial manipulations are finally obtained. The reason for reporting the penalty is different among the samples, one reason is related to the penalty, and multiple reasons are also related to the penalty, and the invention does not distinguish the samples in particular.

Meanwhile, according to 1: and 1, selecting a non-financial manipulation marketing company which is closest to the total market value of the same exchange, the same year, the same industry and the same financial manipulation sample as a negative sample.

After the selection and screening of the samples are completed, the finally determined sample size is that 132 listed companies with financial manipulation behaviors are used as a positive sample group, 132 listed companies with one-to-one correspondence to the financial manipulation are used as a negative sample group, and the total number of the samples is 264 listed companies. The sample set constructed in the above way can control the influence of factors such as industry, year, scale and the like.

And S1013, performing significant difference analysis on each feature in the candidate feature set by using the sample set to obtain a plurality of key features, wherein the key features have significant differences in the positive samples and the negative samples.

Optionally, the specific implementation steps of step S1013 are as follows:

s10131, performing significance difference analysis on the features in the candidate feature set by adopting a single-factor detection method to obtain a first key feature set comprising a first number of the candidate features.

Before conducting a significant difference analysis on each feature in the candidate feature set, a sample-scale univariate test was first performed to exclude the impact of company scale on both sets of data. Specifically, nonparametric inspection is performed on the total market value as an inspection variable in the positive sample group and the negative sample group, the total market value and the industry are used as inspection variables, whether financial manipulation behaviors exist or not is used as a grouping variable, and 0 and 1 represent that the financial manipulation behaviors exist or do not exist respectively. The purpose of this step is to verify whether the size of the company (total market value) between the positive and negative sample sets is affected.

Results of the Many-Whitney test on the company scale in the sample-scale univariate test. The results show that the distribution of company sizes is approximately the same in the positive and negative sample groups, excluding interference from company size factors. The statistical results are shown in table 2:

TABLE 2 results of the company-scale Mantoux test

The results show progressive significance behavior of 0.950, failing at a significance level of 0.05. The two groups of data have no obvious difference in the total market value, namely, the influence of the company size on the two groups of data can be eliminated.

In this embodiment, a significance difference analysis of each feature in the candidate feature set is implemented, and the specific process is as follows:

and performing nonparametric inspection in the positive sample group and the negative sample group, sequentially taking the candidate features X1-X35 as inspection variables, taking the existence or nonexistence of the manipulation behavior as grouping variables, and defining the grouping variables, wherein the positive sample group is set to be 0, and the negative sample group is set to be 1.

Non-parametric tests performed on the positive sample group and the negative sample group resulted in 35 candidate features, which were significantly different between the positive sample group and the negative sample group, and descriptive statistics of each candidate feature are shown in table 3:

TABLE 3 descriptive statistics for each candidate feature

As can be seen from the data in table 3, the candidate features X2, X5, X7, X10, X14, X18, X19, X22, X34 were passed at the level of 0.05 in both the positive and negative sample groups, indicating that these candidate features were significantly different in both groups of data; the significance of the remaining candidate features were each greater than 0.05, indicating that the distributions of these candidates in the two sets of data were approximately the same. Therefore, after the step is executed, 8 candidate features of X2, X5, X7, X10, X14, X18, X19, X22, and X34 are finally selected as the first key feature set.

S10132, performing significance difference analysis on the features in the candidate feature set by adopting a multivariate logistic regression analysis method to obtain a second key feature set comprising a second number of the candidate features.

And searching candidate characteristics with significant difference in the two groups of samples by a logistic regression method. The logistic regression analysis is selected to research the influence of X on Y, and has no requirement on the data type of X, X can be classified data or quantitative data, but Y is required to be classified data, and a corresponding data analysis method is used according to the option number of Y. Because the dependent variable indexes selected by the user contain both quantitative data and classified data, and the independent variables are classified data, the logistic regression method meets the requirements of the user on data processing.

And (3) taking Y as a dependent variable and X1-X35 as covariates, and searching candidate characteristics with significant difference in two groups of samples by a binary logistic regression method. As shown in table 4 below, the variables eventually entered into the equation include X7, X14, X18, X19, X22, X35 by logistic regression (the way to screen the variables is Forward).

TABLE 4 variables in the logistic regression entry equation

Therefore, we finally select 6 candidate features of X7, X14, X18, X19, X22, and X35 as the second key feature set.

S10133, merging the first key feature set and the second key feature set to obtain a third key feature set including a third number of the candidate features.

And combining the first key feature set obtained by the single-factor detection method, namely X2, X5, X7, X10, X14, X18, X19, X22, X34, and a multivariate logistic regression analysis method to obtain second key feature sets, namely X7, X14, X18, X19, X22 and X35, to obtain a third key feature set, wherein the third key feature set comprises 10 features, namely X2, X5, X7, X10, X14, X18, X19, X22, X34 and X35.

S10134, screening the third key feature set by adopting a factor analysis method to obtain the final key features.

Although each feature in the third key feature set has a large correlation with the financial manipulation, the degree of correlation between the included features may be large, that is, there may be multiple collinearity between the features, and therefore, in order to ensure the effectiveness and interpretability of the subsequent identification model, it is necessary to perform further dimension reduction processing on the third key feature set to remove the multiple collinearity between the features. Optionally, in this embodiment, a multivariate factor regression analysis method is used to perform dimension reduction processing, so that a small number of major factors are finally formed to construct a financial manipulation behavior recognition model.

Through multivariate factor regression analysis, the obtained common influence dimensionality of a plurality of independent variable indexes can be obtained, the decisive index of a common factor is obtained through the rotated factor load matrix, the number of the independent variable indexes is reduced, and the independent variable indexes are better adapted to logistic regression.

As in table 5, we used three different approaches in the common factor selection process. The first is the most common case, only dimensions with eigenvalues greater than 1 are taken into account, i.e. the first four common factors are extracted. The information of the original variable covered by the method can only reach 68.263%, and the level is low. The second method is to extract 5 common factors, wherein four common factor characteristic values are more than 1, and the fifth common factor characteristic value is 0.835 which is also close to 1, so that 77.543% of data information of the original variable can be covered, which indicates that the group of data information can be well interpreted. The third method is that the eigenvalue of the 6 th common factor is 0.810, which is also very close to 1, and the information coverage probability of the extracted original variable can reach 86.541%, but the dimension of the selection of the method is too redundant, and when the control group is subjected to financial manipulation behavior recognition under the method, the accuracy is only 68%, which is lower than the first two methods, and therefore, the method is not selected. In summary, we chose the second method for factor analysis.

TABLE 5 Total variance interpretation of factor analysis

As can be seen from the rotated composition matrix table 6, the dimensions of the five common factors are determined by the following variable indexes: x34, X14, X2, X18, X5, X19, X22.

TABLE 6 composition matrix after rotation

That is, after the dimensionality reduction treatment by the factor analysis method, the finally determined key features are X2, X5, X7, X10, X14, X18, X19, X22 and X34.

And S1014, carrying out logistic regression analysis on the key features to obtain the financial manipulation behavior recognition model.

The data from the sample set in the foregoing is still used to perform logistic regression analysis (the way to screen variables is Forward) on the key features obtained by the factorial analysis.

From table 7 we have derived the following regression equation to examine financial handling behavior:

TABLE 7 variables finally entered into the equation

And S102, acquiring key characteristic data of the company to be identified.

Namely, the following data of 7 key features of the company to be identified is obtained from public data: a flow ratio X2, a monetary funds flow asset ratio X5, a cash flow to flow liability ratio X14, a specific gravity of prepaid accounts in the flow asset X18, a specific gravity of receivables in the flow asset X19, a share concentration X22, and a total asset cash recovery rate X24.

S103, inputting the acquired key feature data into the constructed financial manipulation behavior recognition model to recognize whether the to-be-recognized company has financial manipulation.

Of 3063 listed companies in the a-stock market (the industry removed finance, insurance, etc. and the stocks of ST and ST, etc.) estimated according to the existing model, a total of 1869 companies were examined for evidence of financial manipulation, accounting for 61.02%.

As shown in table 8, the mining industry has the highest percentage of companies with financial management, accounting for 77.78% of the industry, based on the industry classification. Secondly, in the industries of building materials, traffic equipment, light industry manufacturing and catering and tourism, the proportion of listed companies with operation behaviors accounts for more than 70 percent of the proportion of the industries. The industries with the least operation behavior proportion are real estate, ferrous metal, transportation and nonferrous metal industries, and are all below 50 percent. Table 7 below is the company proportion for which financial manipulations exist for each industry:

TABLE 8 proportion of companies with financial manipulations in each industry

It is noteworthy here, however, that we cannot directly ascertain with force that if a company belongs to a certain industry, it is more likely that it will have financial manipulations. The reasons are mainly as follows: the first point is that the selected sample capacity of the forecasting company is small and the distribution of the marketing company is not balanced, so that the companies engaged in mechanical equipment and information service are more than those engaged in the comprehensive class and the transportation class. From the list in table 7, it can be seen that the financial handling behavior proportion of the electronic industry and the comprehensive industry is about 70%, however, the sample capacity of the electronic industry is up to 241 families, while the sample capacity of the comprehensive industry is only 10 families, and the difference of the sample capacity is nearly 24 times, so that the financial handling behavior proportion of the comprehensive industry may have relatively large contingency and inaccuracy. The second point is the issue of threshold setting for financial manipulation behavior identification in the embodiment. Setting the threshold value of this embodiment at 50% yields the results in the table above, with values closer to 1 indicating a greater likelihood of steering behavior in the information disclosure. When the threshold is raised, the handling behavior ratio of the industry must be changed, and further the judgment of whether the whole industry is more inclined to financial handling is influenced.

Example two

The present embodiment provides a financial manipulation behavior recognition apparatus 200, and as shown in fig. 3, the financial manipulation behavior recognition apparatus 200 of the present embodiment includes:

and the modeling module 201 is used for constructing a financial manipulation behavior recognition model.

Optionally, as shown in fig. 4, the modeling module 201 may further include:

a determining sub-module 2011 configured to determine a candidate feature set, where the candidate feature set includes a plurality of financial features and a plurality of non-financial features;

the obtaining sub-module 2012 is configured to obtain a sample set, where the sample set includes a positive sample and a negative sample, the positive sample is a company sample where the financial manipulation behavior exists, and the negative sample is a company sample where the financial manipulation behavior does not exist;

a key feature selection module 2013, configured to perform a significant difference analysis on each feature in the candidate feature set by using the sample set to obtain several key features, where the key features have significant differences in the positive sample and the negative sample;

and the model training submodule 2014 is used for performing logistic regression analysis on the key features to obtain the financial manipulation behavior recognition model.

The obtaining module 202 is configured to obtain key feature data of a company to be identified.

And the identification module 203 is used for inputting the acquired key characteristic data into the constructed financial manipulation behavior identification model so as to identify whether the to-be-identified company has financial manipulation.

Since the processing procedure of each functional module of the financial manipulation behavior recognition device 200 provided in the present embodiment is consistent with the processing procedure of the financial manipulation behavior recognition method 100 in the second embodiment, the processing procedure of each functional module of the financial manipulation behavior recognition device 200 will not be described repeatedly in the present embodiment, and reference may be made to the related description of the first embodiment.

EXAMPLE III

Fig. 5 is a schematic structural diagram of an electronic device 300 according to an embodiment of the present disclosure, and as shown in fig. 5, the electronic device 300 includes a processor 301 and a memory 303, and the processor 301 and the memory 303 are connected, for example, through a bus 302.

The processor 301 may be a CPU, general purpose processor, DSP, ASIC, FPGA or other programmable device, transistor logic device, hardware component, or any combination thereof. Which may implement or perform the various illustrative logical blocks, modules, and circuits described in connection with the disclosure. The processor 301 may also be a combination of implementing computing functionality, e.g., including one or more microprocessors, a combination of DSPs and microprocessors, and the like.

Bus 302 may include a path that transfers information between the above components. The bus 302 may be a PCI bus or an EISA bus, etc. The bus 302 may be divided into an address bus, a data bus, a control bus, and the like. For ease of illustration, only one thick line is shown, but this does not mean only one bus or one type of bus.

Memory 303 may be, but is not limited to, a ROM or other type of static storage device that can store static information and instructions, a RAM or other type of dynamic storage device that can store information and instructions, an EEPROM, a CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer.

The memory 303 is used for storing application program codes of the present application, and is controlled to be executed by the processor 301. The processor 301 is configured to execute application program code stored in the memory 303 to implement the financial manipulation behavior recognition method according to the first embodiment.

Finally, an embodiment of the present application provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the method for identifying financial manipulation behavior according to the first embodiment is implemented.

The invention has been described above with a certain degree of particularity. It will be understood by those of ordinary skill in the art that the description of the embodiments is merely exemplary and that all changes that come within the true spirit and scope of the invention are desired to be protected. The scope of the invention is defined by the appended claims rather than by the foregoing description of the embodiments.

Claims

1. a financial manipulation behavior identification method, is characterized in that, it comprises:

Build a financial manipulation behavior recognition model;

Obtain key characteristic data of the company to be identified;

The acquired key characteristic data is input into the constructed financial manipulation behavior identification model to identify whether the to-be-identified company has financial manipulation behavior.

2. The method for identifying financial manipulation behaviors as claimed in claim 1, wherein the building a financial manipulation behavior identification model comprises:

determining a candidate feature set, the candidate feature set including several financial features and several non-financial features;

obtaining a sample set, the sample set includes positive samples and negative samples, the positive samples are samples of companies with financial manipulation behaviors, and the negative samples are samples of companies without financial manipulation behaviors;

Using the sample set to perform significant difference analysis on each feature in the candidate feature set to obtain several key features, wherein the key features are significantly different between the positive samples and the negative samples;

Logistic regression analysis is performed on the key features to obtain the financial manipulation behavior identification model.

3. The method for identifying financial manipulation behaviors according to claim 2, wherein the use of the sample set to perform significant difference analysis on each feature in the candidate feature set to obtain several key features comprises:

A single-factor detection method is used to perform significant difference analysis on each feature in the candidate feature set, so as to obtain a first key feature set including a first number of the candidate features;

Using multivariate logistic regression analysis method to perform significant difference analysis on each feature in the candidate feature set, to obtain a second key feature set including a second number of the candidate features;

combining the first key feature set and the second key feature set to obtain a third key feature set including a third number of the candidate features;

The third key feature set is screened by factor analysis to obtain the final key feature.

4. The method for identifying financial manipulation behavior according to claim 2, wherein the number of the positive samples and the number of the negative samples are equal and appear in pairs, and the corresponding positive samples and the negative samples belong to The same exchange, the same industry, and the difference in the total market value is within a predetermined range.

5. The financial manipulation behavior identification method according to claim 3, wherein the single-factor detection method is a non-parametric detection method.

6. The financial manipulation behavior identification method as claimed in claim 1, wherein the financial manipulation behavior identification model is a multivariate logistic regression model, which is expressed as follows:

Among them: Y is the probability of financial manipulation, X ₂ is the current ratio, X ₅ is the current asset ratio of monetary funds, X ₁₄ is the ratio of cash flow to current liabilities, X ₁₈ is the proportion of prepaid accounts to current assets, and X ₁₉ is Accounts receivable accounts for the proportion of current assets, X ₂₂ is the equity concentration, and X ₃₄ is the cash recovery rate of all assets.

7. A financial manipulation behavior identification device, characterized in that it comprises:

A modeling module for building a financial manipulation behavior recognition model;

The acquisition module is used to acquire the key characteristic data of the company to be identified;

The identification module is used for inputting the acquired key characteristic data into the constructed financial manipulation behavior identification model, so as to identify whether the to-be-identified company has financial manipulation behavior.

8. The financial manipulation behavior recognition device according to claim 7, wherein the modeling module comprises:

a determination submodule, used for determining a candidate feature set, the candidate feature set includes several financial features and several non-financial features;

an acquisition sub-module, configured to acquire a sample set, the sample set includes a positive sample and a negative sample, the positive sample is a sample of companies with financial manipulation, and the negative sample is a sample of companies without financial manipulation;

A key feature selection module, configured to perform significant difference analysis on each feature in the candidate feature set using the sample set to obtain several key features, wherein the key features exist in the positive samples and the negative samples Significant differences;

The model training submodule is used for performing logistic regression analysis on the key features to obtain the financial manipulation behavior recognition model.

9. An electronic device, comprising a memory, a processor and a computer program stored in the memory and running on the processor, wherein the processor implements any one of claims 1 to 6 when the processor executes the program The described financial manipulation behavior identification method.

10. A computer-readable storage medium, characterized in that, a computer program is stored on the computer-readable storage medium, and when the program is executed by a processor, the financial manipulation behavior identification according to any one of claims 1 to 6 is realized. method.