WO2022041866A1 - Method, apparatus and device for determining causal relationship, and readable storage medium - Google Patents

Method, apparatus and device for determining causal relationship, and readable storage medium Download PDF

Info

Publication number
WO2022041866A1
WO2022041866A1 PCT/CN2021/094934 CN2021094934W WO2022041866A1 WO 2022041866 A1 WO2022041866 A1 WO 2022041866A1 CN 2021094934 W CN2021094934 W CN 2021094934W WO 2022041866 A1 WO2022041866 A1 WO 2022041866A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
causal relationship
determining
conversion
neural network
Prior art date
Application number
PCT/CN2021/094934
Other languages
French (fr)
Chinese (zh)
Inventor
张天豫
范力欣
吴锦和
Original Assignee
深圳前海微众银行股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳前海微众银行股份有限公司 filed Critical 深圳前海微众银行股份有限公司
Publication of WO2022041866A1 publication Critical patent/WO2022041866A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/041Abduction

Definitions

  • the present application relates to the technical field of financial technology (Fintech), and in particular, to a method, apparatus, device and readable storage medium for determining a causal relationship.
  • the main purpose of this application is to provide a method, device, device and readable storage medium for determining a causal relationship, which aims to solve the technical problem of how to infer the causal relationship between data and improve the accuracy of data analysis in the prior art.
  • the present application provides a method for determining a causal relationship, and the method for determining a causal relationship includes the following steps:
  • the corresponding first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient corresponding to the second conversion time determine the causal relationship between the first data and the second data.
  • Two steps of causality between data include:
  • the magnitude relationship is that the first conversion difficulty degree coefficient is greater than the second conversion difficulty degree coefficient, determine the reason why the first data constitutes the causal relationship, and the second data constitutes the result in the causal relationship ;
  • the magnitude relationship is that the first conversion difficulty degree coefficient is smaller than the second conversion difficulty degree coefficient, determine the reason why the second data constitutes the causal relationship, and the first data constitutes the result in the causal relationship .
  • the step of converting the second data based on the preset neural network and the first data to obtain third data matching the divergence of the preset neural network with the first data include:
  • the intermediate data is discriminated, and the divergence value between the first data and the intermediate data is determined;
  • third data matching the divergence of the preset neural network with the first data is determined.
  • the step of discriminating the intermediate data based on a discriminator in a preset neural network, and determining a divergence value between the first data and the intermediate data includes:
  • a second probability distribution of the first data is acquired, and a divergence value between the first data and the intermediate data is generated according to the first probability distribution and the second probability distribution.
  • the step of determining, according to the divergence value, third data that matches the first data on the divergence of the preset neural network includes:
  • the divergence value is not less than the preset threshold, perform the step of converting the second data based on the generator in the preset neural network according to the divergence value, until the divergence The value is less than the preset threshold.
  • the method includes:
  • a first conversion time corresponding to the third data is generated.
  • the method before the step of acquiring the first data and the second data, the method further includes:
  • the present application also provides a device for determining a causal relationship
  • the device for determining a causal relationship includes:
  • the acquisition module is used to acquire the first data and the second data, and based on the preset neural network and the first data, convert the second data to obtain the dispersion of the first data in the preset neural network. degree of matching third data;
  • a conversion module configured to convert the first data based on the preset neural network and the second data to obtain fourth data matching the second data in the divergence of the preset neural network
  • a determination module configured to obtain a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and according to the A first conversion difficulty degree coefficient corresponding to the first conversion time and a second conversion difficulty degree coefficient corresponding to the second conversion time are used to determine a causal relationship between the first data and the second data.
  • the present application also provides a causal relationship determination device, the causal relationship determination device includes a memory, a processor, and a causal relationship stored on the memory and can be run on the processor.
  • a program for determining a relationship when the program for determining a causal relationship is executed by the processor implements the steps of the method for determining a causal relationship as described above.
  • the present application also provides a readable storage medium, on which a program for determining a causal relationship is stored, and the program for determining a causal relationship is executed by a processor to achieve the above-mentioned The steps of the method for determining the causal relationship.
  • the method adopted in the present application is: acquiring the first data and the second data, and using the first data One data is used as a reference, and the second data is converted through a preset neural network to obtain third data matching the divergence of the first data in the preset neural network; Convert the first data to obtain fourth data matching the second data on the divergence of the preset neural network; and then obtain the first conversion time of the conversion between the second data and the third data, and the first data and the The second conversion time for conversion between the fourth data is determined according to the first conversion difficulty degree coefficient corresponding to the first conversion time and the second conversion difficulty degree coefficient corresponding to the second conversion time.
  • the causal relationship between the second data is the length of time for forming the third data
  • the corresponding first conversion difficulty coefficient represents the difficulty of converting the second data into third data matching the first data
  • the second conversion time is The duration of forming the fourth data
  • the corresponding second conversion difficulty coefficient represents the difficulty of converting the first data into fourth data matching the second data.
  • the magnitude relationship between the first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient corresponding to the conversion time and the second conversion time respectively is used to accurately determine the causal relationship between the first data and the second data. It overcomes the defect that it is difficult to infer the causal relationship between data in the prior art, and is beneficial to the improvement of the accuracy of data analysis.
  • FIG. 1 is a schematic structural diagram of a device hardware operating environment involved in a device embodiment solution for determining a causal relationship of the present application
  • FIG. 2 is a schematic flowchart of a first embodiment of a method for determining a causal relationship of the present application
  • FIG. 3 is a schematic flowchart of a third embodiment of a method for determining a causal relationship of the present application
  • FIG. 4 is a schematic diagram of functional modules of a preferred embodiment of an apparatus for determining a causal relationship of the present application.
  • FIG. 1 is a schematic structural diagram of a device hardware operating environment involved in an embodiment solution of the device for determining a causal relationship in the present application.
  • the device for determining the causal relationship may include: a processor 1001 , such as a CPU, a communication bus 1002 , a user interface 1003 , a network interface 1004 , and a memory 1005 .
  • the communication bus 1002 is used to realize the connection and communication between these components.
  • the user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface.
  • the network interface 1004 may include a standard wired interface and a wireless interface (eg, a WI-FI interface).
  • the memory 1005 may be high-speed RAM memory, or may be non-volatile memory, such as disk memory.
  • the memory 1005 may also be a storage device independent of the aforementioned processor 1001 .
  • the hardware structure of the device for determining the causal relationship shown in FIG. 1 does not constitute a limitation on the device for determining the causal relationship, and may include more or less components than those shown in the figure, or combine some components, or a different arrangement of components.
  • the memory 1005 as a readable storage medium may include an operating system, a network communication module, a user interface module and a program for determining a causal relationship.
  • the operating system is a program that manages and controls causal relationship determination equipment and software resources, and supports the operation of network communication modules, user interface modules, causal relationship determination programs, and other programs or software; network communication modules are used to manage and control the network.
  • Interface 1004 the user interface module is used to manage and control the user interface 1003 .
  • the network interface 1004 is mainly used to connect to the background server and perform data communication with the background server;
  • the user interface 1003 is mainly used to connect the client (client) and perform data communication with the client Data communication;
  • the processor 1001 can call the causal relationship determination program stored in the memory 1005, and perform the following operations:
  • the corresponding first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient corresponding to the second conversion time determine the causal relationship between the first data and the second data.
  • the steps for causality between data include:
  • the magnitude relationship is that the first conversion difficulty degree coefficient is greater than the second conversion difficulty degree coefficient, determine the reason why the first data constitutes the causal relationship, and the second data constitutes the result in the causal relationship ;
  • the magnitude relationship is that the first conversion difficulty degree coefficient is smaller than the second conversion difficulty degree coefficient, determine the reason why the second data constitutes the causal relationship, and the first data constitutes the result in the causal relationship .
  • the step of converting the second data based on the preset neural network and the first data to obtain third data matching the divergence of the preset neural network with the first data includes: :
  • the intermediate data is discriminated, and the divergence value between the first data and the intermediate data is determined;
  • third data matching the divergence of the preset neural network with the first data is determined.
  • the step of discriminating the intermediate data based on the discriminator in the preset neural network, and determining the divergence value between the first data and the intermediate data includes:
  • a second probability distribution of the first data is acquired, and a divergence value between the first data and the intermediate data is generated according to the first probability distribution and the second probability distribution.
  • step of determining, according to the divergence value, third data matching the divergence of the preset neural network with the first data includes:
  • the divergence value is not less than the preset threshold, perform the step of converting the second data based on the generator in the preset neural network according to the divergence value, until the divergence The value is less than the preset threshold.
  • the processor 1001 may call the causal relationship determination program stored in the memory 1005, and perform the following operations:
  • a first conversion time corresponding to the third data is generated.
  • the processor 1001 may call the causal relationship determination program stored in the memory 1005, and perform the following operations:
  • the specific implementation of the device for determining the causal relationship of the present application is basically the same as the following embodiments of the method for determining the causal relationship, and details are not repeated here.
  • the present application also provides a method for determining a causal relationship.
  • FIG. 2 is a schematic flowchart of a first embodiment of a method for determining a causal relationship of the present application.
  • This embodiment of the present application provides an embodiment of a method for determining a causal relationship. It should be noted that although a logical sequence is shown in the flowchart, in some cases, the sequence shown here may be performed in a different order. or the described steps. Specifically, the method for determining the causal relationship in this embodiment includes:
  • Step S10 acquiring the first data and the second data, and converting the second data based on the preset neural network and the first data to obtain the divergence of the preset neural network with the first data matching third data;
  • the method for determining a causal relationship in this embodiment is applied to a determining device, and the determining device may be a server or a client.
  • the server it communicates and connects with a plurality of clients that have a causal relationship determination requirement, and this embodiment takes clients as an example for description.
  • the client is deployed with a preset neural network, and the preset neural network is preferably an adversarial neural network (GAN, Generative Neural Network). Adversarial Network), the adversarial neural network includes a generator and a discriminator.
  • GAN Generative Neural Network
  • Adversarial Network Adversarial Network
  • the generator generates simulated information to deceive the discriminator, and the discriminator is used to distinguish the simulated information from the real information; in the end, the generator generates enough information to be "real", and the discriminator is difficult to judge the authenticity of the information generated by the generator. .
  • This implementation analyzes the causal relationship between the data by means of a preset neural network.
  • the data with the causal relationship explanation requirement is used as the first data and the second data, and the second data is converted through a preset neural network and the first data.
  • the generator in the preset neural network converts the second data to generate simulation data similar to the first data.
  • the generated simulation data is discriminated by the discriminator in the preset neural network, and the third data matching the divergence of the preset neural network with the first data is generated according to the difficulty of the discrimination.
  • the divergence is used to compare the closeness of two probability distributions.
  • the divergence of the preset neural network is used to represent the similarity between the simulated data processed by the preset neural network and the real data.
  • the third data matched in divergence indicates that the similarity between the third data generated by the generator and the first data is high, and it is difficult to be discriminated by the discriminator.
  • Step S20 converting the first data based on the preset neural network and the second data to obtain fourth data matching the second data on the divergence of the preset neural network;
  • the preset neural network and the second data are used for conversion.
  • the generator in the preset neural network converts the first data to generate simulation data similar to the second data.
  • the generated simulation data is discriminated by the discriminator in the preset neural network to generate fourth data that matches the second data in the divergence of the preset neural network, reflecting the high similarity with the second data sex.
  • Step S30 Acquire a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and perform the conversion according to the first conversion time.
  • a first conversion difficulty degree coefficient corresponding to a conversion time and a second conversion difficulty degree coefficient corresponding to the second conversion time determine a causal relationship between the first data and the second data.
  • the conversion duration is recorded to obtain the time duration spent generating the third data as the first conversion time between the second data and the third data.
  • the duration is also recorded to obtain the duration of generating the fourth data as the second conversion time between the first data and the fourth data.
  • the difference in duration reflects the difficulty of the conversion. The shorter the conversion time, the easier the conversion, and vice versa.
  • the transformation process reflects the derivation process of the logical relationship between the cause and the result. The easier the transformation, the easier the derivation, reflecting the derivation from the cause to the result. The more difficult the conversion, the more difficult the derivation, reflecting the derivation from the result to the cause.
  • the causal relationship between the data can be reflected by the degree of difficulty represented by the time spent in the conversion. That is, the causal relationship between the first data and the second data is determined by the magnitude relationship between the first conversion time and the second conversion time.
  • the step of determining the causal relationship between the first data and the second data includes:
  • Step S31 comparing the first conversion difficulty coefficient with the second conversion difficulty coefficient to determine the magnitude relationship between the first conversion difficulty coefficient and the second conversion difficulty coefficient ;
  • Step S32 if the magnitude relationship is that the first conversion difficulty degree coefficient is greater than the second conversion difficulty degree coefficient, determine the reason why the first data constitutes the causal relationship, and the second data constitutes the causal relationship result in;
  • Step S33 if the magnitude relationship is that the first conversion difficulty degree coefficient is smaller than the second conversion difficulty degree coefficient, determine the reason why the second data constitutes the causal relationship, and the first data constitutes the causal relationship results in .
  • the first conversion time and the second conversion time are compared to determine the magnitude relationship between the two. If it is determined by comparison that the magnitude relationship is that the first conversion time is greater than the second conversion time, it means that the conversion time of the second data to the first data is long, and the conversion time of the first data to the second data is short, so the first data can be determined.
  • the second data constitutes the cause in the causal relationship, and the second data constitutes the effect in the used relationship.
  • the size relationship is that the first conversion time is less than the second conversion time, it means that the conversion time of the second data to the first data is short, and the conversion time of the first data to the second data is long, so it can be determined that the second data is converted to the second data.
  • the data constitute the cause in the causal relationship, and the first data constitute the effect in the used relationship.
  • a preset time threshold is preset to indicate that the conversion time exceeds the normal conversion time.
  • the causal relationship between the first data and the second data is determined according to the conversion time not greater than the preset time threshold. If the first transition time is greater than the preset time threshold and the second transition time is not greater than the preset interval threshold, the causal relationship between the first data and the second data is determined according to the second transition time. Since the second conversion time is the time when the first data is converted to the second data, and the second conversion time is greater than the preset time threshold, it means that the conversion time of the first data to the second data is short, so the first data constitutes a causal relationship. cause, while the second data constitutes the effect in a causal relationship. In this way, according to the magnitude relationship between the first conversion time and the second conversion time, the causal relationship between the first data and the second data is determined.
  • the method adopted in the present application is: obtaining the first data and the second data, and using the first data as a reference, through the preset neural network
  • the network converts the second data to obtain third data that matches the first data on the divergence of the preset neural network; at the same time, taking the second data as a reference, converts the first data through the preset neural network to obtain the same
  • the second data matches the fourth data on the divergence of the preset neural network; and then obtains the first conversion time of the conversion between the second data and the third data, and the second conversion time of the conversion between the first data and the fourth data.
  • the conversion time is determined, and the causal relationship between the first data and the second data is determined according to the first conversion difficulty degree coefficient corresponding to the first conversion time and the second conversion difficulty degree coefficient corresponding to the second conversion time.
  • the first conversion time is the length of time for forming the third data
  • the corresponding first conversion difficulty coefficient represents the difficulty of converting the second data into third data matching the first data
  • the second conversion time is The duration of forming the fourth data
  • the corresponding second conversion difficulty coefficient represents the difficulty of converting the first data into fourth data matching the second data.
  • the magnitude relationship between the first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient corresponding to the conversion time and the second conversion time respectively is used to accurately determine the causal relationship between the first data and the second data. It overcomes the defect that it is difficult to infer the causal relationship between data in the prior art, and is beneficial to the improvement of the accuracy of data analysis.
  • Step S14 using the first data as a guide, converting the second data based on the generator in the preset neural network to obtain intermediate data;
  • Step S15 based on a discriminator in a preset neural network, discriminate the intermediate data, and determine a divergence value between the first data and the intermediate data;
  • the second data is converted by the preset neural network and the first data, so as to obtain third data matching the divergence of the first data in the preset neural network.
  • the generator in the preset neural network converts the second data to obtain the converted intermediate data.
  • the converted intermediate data is discriminated by a discriminator in a preset neural network, the intermediate data is distinguished from the first data, and a divergence value between the first data and the intermediate data is generated.
  • the intermediate data is discriminated, and the step of determining the divergence value between the first data and the intermediate data includes:
  • Step S151 based on a discriminator in a preset neural network, discriminate the intermediate data, and generate a first probability distribution of the intermediate data;
  • Step S152 Acquire a second probability distribution of the first data, and generate a divergence value between the first data and the intermediate data according to the first probability distribution and the second probability distribution.
  • the discriminator in the preset neural network is invoked to discriminate the intermediate data, generate a first probability distribution of the intermediate data, and describe the information contained in the intermediate data through the first probability distribution.
  • a second probability distribution is generated for the first data, and the information contained in the first data is described by the second probability distribution.
  • the first probability distribution and the second probability distribution are calculated to generate a divergence value between them; wherein, the calculation may be to calculate the expected value of the logarithmic difference between the two, and the obtained expected value result is the divergence value.
  • Step S16 according to the divergence value, determine third data matching the divergence of the preset neural network with the first data.
  • the divergence value is used to represent the similarity between the data converted by the preset neural network and the real data, it is possible to determine the divergence between the first data and the first data by the similarity represented by the divergence value.
  • the matching third data that is, the third data with high similarity with the first data.
  • the step of determining the third data matching the divergence of the preset neural network with the first data includes:
  • Step S161 judging whether the divergence value is less than a preset threshold, and if it is less than a preset threshold, then determining the intermediate data as the third data;
  • Step S162 if the divergence value is not less than the preset threshold, then according to the divergence value, perform the step of converting the second data based on the generator in the preset neural network, until the predetermined threshold is reached.
  • the divergence value is less than the preset threshold.
  • a preset threshold representing the level of similarity is preset, and the generated divergence value is compared with the preset threshold to determine whether the divergence value is smaller than the preset threshold. If it is determined by comparison that the divergence value is smaller than the preset threshold, it means that the probability distribution of the first data and the probability distribution of the intermediate data are close to each other, so that the intermediate data is determined as the first data having a high similarity with the first data. Three data. On the contrary, if it is determined by comparison that the divergence value is not less than the preset threshold, it means that the intermediate data obtained by the preset neural network processing is quite different from the first data, so it is necessary to continue to use the first data through the preset neural network. The data is used as a reference, and the second data is iteratively transformed.
  • the preset neural network converts the second data in the direction of reducing the difference according to the first data, and generates intermediate data again to determine the divergence. value to determine whether it is less than the preset threshold. If it is less than the preset threshold, the intermediate data is determined as the third data. If it is still not less than the preset threshold, continue to convert to obtain new intermediate data; this cycle until the generated divergence value is less than the preset threshold, and the obtained intermediate data is determined to be on the divergence of the preset neural network with the first data. matching third data.
  • the method includes:
  • Step a1 find the first time point of the first conversion to the second data, and determine the second time point of the third data
  • Step a2 generating a first conversion time corresponding to the third data according to the first time point and the second time point.
  • the time point of the conversion is recorded as the first time point.
  • the recorded first time point is searched, and the current time point at which the third data is obtained is recorded as the second time point for generating the third data.
  • the time interval is the time it takes to convert the third data, which is used as the first conversion time corresponding to the third data to be used to determine the difference between the first data and the third data in combination with the second conversion time corresponding to the fourth data.
  • the causal relationship between the two data is the time interval between the two data.
  • the second data is used as a guide, and the first data is converted through a preset neural network to obtain the intermediate data and the second data for discrimination, and generate the divergence between the two. value to determine the fourth data.
  • the specific generation process of the fourth data is similar to the above-mentioned generation process of the third data, and details are not described here.
  • a cyclic update conversion mechanism is set, and for the intermediate data obtained by each conversion, the divergence value calculation is performed with the first data, until the calculated divergence value is greater than the preset threshold, the intermediate data is regarded as close to the first data.
  • the third data of the data Further, the degree of difficulty of conversion is reflected by the length of time spent in the entire process of obtaining the third data, and the accuracy of determining the causal relationship is improved by accurately determining the third data.
  • the difference between the third embodiment of the method for determining a causal relationship and the first or second embodiment of the method for determining a causal relationship is that, before the step of acquiring the first data and the second data, the method further includes:
  • Step S40 obtaining a first data set and a second data set with a preset causal relationship, and processing the first data set and the second data set based on a preset neural network to generate a processing result;
  • Step S50 judging whether the causal relationship in the processing result is consistent with the preset causal relationship, and if so, the first data and the second data are processed based on the preset neural network.
  • the validity of the preset neural network is determined, that is, it is determined whether the preset neural network can effectively determine the causal relationship between data.
  • a first data set and a second data set having a preset causal relationship are acquired, and both the first data set and the second data set contain multiple items of data.
  • the preset causal relationship indicates that a clear causal logical relationship is set between the data in the first data set and the data in the second data set.
  • the data in the first data set is the cause in the causal relationship
  • the data in the second data set is the result in the causal relationship.
  • the preset neural network through the preset neural network, the first data set and the second data set are processed, and a processing result is generated, and the processing result indicates that the first data set and the second data set are judged, and the generated items in the two are generated.
  • the causal relationship between cause and effect between data Then, compare the causal relationship represented by the processing result with the preset causal relationship, and judge whether the two are consistent. If they are consistent, it means that the causal relationship obtained through the preset neural network processing is consistent with the original causal relationship, and the processed results are obtained.
  • the preset neural network can be used to determine the causal relationship, and used to process the first data and the second data to determine the causal relationship between the two.
  • the preset neural network after updating the parameters can accurately obtain the processing result, the preset neural network after updating the parameters is used to process the first data and the second data. Otherwise, continue to update the parameters, and repeat this cycle until an accurate processing result is obtained, which is used to process the first data and the second data to determine the causal relationship between the two.
  • both the first data set and the second data set contain multiple pieces of data with preset causal relationships, and the generated processing results contain causal relationships between multiple pieces of data.
  • the relationship is consistent with the preset causal relationship, its essence is to determine whether the causal relationship between the various data is consistent with the preset causal relationship.
  • a preset threshold such as 95%, is set in advance according to the needs.
  • This embodiment ensures the accuracy of determining the causal relationship between the first data and the second data by determining whether the preset neural network can effectively determine the causal relationship between the data. At the same time, a proportional mechanism is set up to improve the efficiency of judgment while ensuring accurate judgment.
  • the present application also provides a device for determining a causal relationship.
  • FIG. 4 is a schematic diagram of functional modules of a first embodiment of an apparatus for determining a causal relationship of the present application.
  • the means for determining the causal relationship includes:
  • the acquisition module is used to acquire the first data and the second data, and based on the preset neural network and the first data, convert the second data to obtain the dispersion of the first data in the preset neural network. degree of matching third data;
  • a conversion module configured to convert the first data based on the preset neural network and the second data to obtain fourth data matching the second data in the divergence of the preset neural network
  • a determination module configured to obtain a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and according to the A first conversion difficulty degree coefficient corresponding to the first conversion time and a second conversion difficulty degree coefficient corresponding to the second conversion time are used to determine a causal relationship between the first data and the second data.
  • the determining module includes:
  • a first determining unit configured to compare the first conversion difficulty level coefficient with the second conversion difficulty level coefficient, and determine the difference between the first conversion difficulty level coefficient and the second conversion difficulty level coefficient the size relationship between
  • the first constituting unit is configured to determine the reason why the first data constitutes the causal relationship if the magnitude relationship is that the first conversion difficulty degree coefficient is greater than the second conversion difficulty degree coefficient, and the second data constitutes the causal relationship. the outcome of the causal relationship;
  • the second structure unit is configured to determine the reason why the second data constitutes the causal relationship if the magnitude relationship is that the first conversion difficulty degree coefficient is smaller than the second conversion difficulty degree coefficient, and the first data constitutes the causal relationship. result in the causal relationship.
  • the acquisition module includes:
  • a conversion unit configured to use the first data as a guide, and convert the second data based on the generator in the preset neural network to obtain intermediate data
  • a discriminating unit for discriminating the intermediate data based on a discriminator in a preset neural network, and determining a divergence value between the first data and the intermediate data;
  • a determining unit configured to determine, according to the divergence value, third data matching the divergence of the preset neural network with the first data.
  • the discrimination unit is also used for:
  • a second probability distribution of the first data is acquired, and a divergence value between the first data and the intermediate data is generated according to the first probability distribution and the second probability distribution.
  • the determining unit is also used for:
  • the divergence value is not less than the preset threshold, perform the step of converting the second data based on the generator in the preset neural network according to the divergence value, until the divergence The value is less than the preset threshold.
  • the determining unit is also used for:
  • a first conversion time corresponding to the third data is generated.
  • the device for determining the causal relationship also includes:
  • a generating module configured to obtain a first data set and a second data set with a preset causal relationship, and based on a preset neural network, process the first data set and the second data set to generate a processing result
  • the judgment module is configured to judge whether the causal relationship in the processing result is consistent with the preset causal relationship, and if so, process the first data and the second data based on the preset neural network.
  • the specific implementation manner of the device for determining a causal relationship of the present application is basically the same as the embodiments of the above-mentioned methods for determining a causal relationship, and details are not repeated here.
  • an embodiment of the present application also provides a readable storage medium.
  • a program for determining a causal relationship is stored on the readable storage medium, and when the program for determining a causal relationship is executed by a processor, the steps of the method for determining a causal relationship as described above are implemented.
  • the readable storage medium of the present application may be a computer-readable storage medium, and its specific implementation is basically the same as that of the foregoing embodiments of the causal relationship determination method, and details are not described herein again.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • Databases & Information Systems (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A method, apparatus and device for determining a causal relationship, and a readable storage medium. The method comprises: acquiring first data and second data, and converting the second data on the basis of a preset neural network and the first data, so as to obtain third data matching the first data in terms of the divergence of the preset neural network (S10); converting the first data on the basis of the preset neural network and the second data, so as to obtain fourth data matching the second data in terms of the divergence of the preset neural network (S20); and acquiring a first conversion time for conversion between the second data and the third data and a second conversion time for conversion between the first data and the fourth data, and determining a causal relationship between the first data and the second data according to a first conversion difficulty level coefficient corresponding to the first conversion time and a second conversion difficulty level coefficient corresponding to the second conversion time (S30).

Description

因果关系的确定方法、装置、设备及可读存储介质Method, apparatus, device and readable storage medium for determining causal relationship
本申请要求2020年8月27日申请的,申请号为202010891985.7,名称为“因果关系的确定方法、装置、设备及可读存储介质”的中国专利申请的优先权,在此将其全文引入作为参考。This application claims the priority of the Chinese patent application filed on August 27, 2020, the application number is 202010891985.7, and the title is "Method, Apparatus, Equipment and Readable Storage Medium for Determining Causal Relationship", which is hereby incorporated in its entirety as refer to.
技术领域technical field
本申请涉及金融科技(Fintech)技术领域,尤其涉及一种因果关系的确定方法、装置、设备及可读存储介质。The present application relates to the technical field of financial technology (Fintech), and in particular, to a method, apparatus, device and readable storage medium for determining a causal relationship.
背景技术Background technique
随着金融科技(Fintech),尤其是互联网科技金融的不断发展,越来越多的技术(如人工智能、大数据分析、云存储等)应用在金融领域,但金融领域也对各类技术提出了更高的要求,如要求提升数据分析的精度等。With the continuous development of financial technology (Fintech), especially Internet technology finance, more and more technologies (such as artificial intelligence, big data analysis, cloud storage, etc.) are applied in the financial field, but the financial field also proposes various technologies. higher requirements, such as the requirement to improve the accuracy of data analysis, etc.
目前,机器学习算法作为一种有效的数据分析工具广泛的使用在各个领域,但是机器学习算法在数据间因果关系推断方面的能力稍有欠缺,使得数据分析的精度不高,而限制了机器学习算法在对数据分析精度要求高的领域的推广使用。At present, machine learning algorithms are widely used in various fields as an effective data analysis tool, but the ability of machine learning algorithms to infer causal relationships between data is slightly lacking, which makes the accuracy of data analysis not high, and limits machine learning. The popularization and use of the algorithm in the fields that require high data analysis accuracy.
因此,如何实现数据间因果关系的推断,提升数据分析精度,是当前亟待解决的技术问题。Therefore, how to infer the causal relationship between data and improve the accuracy of data analysis is a technical problem that needs to be solved urgently.
技术问题technical problem
本申请的主要目的在于提供一种因果关系的确定方法、装置、设备及可读存储介质,旨在解决现有技术中如何实现数据间因果关系的推断,提升数据分析精度的技术问题。The main purpose of this application is to provide a method, device, device and readable storage medium for determining a causal relationship, which aims to solve the technical problem of how to infer the causal relationship between data and improve the accuracy of data analysis in the prior art.
技术解决方案technical solutions
为实现上述目的,本申请提供一种因果关系的确定方法,所述因果关系的确定方法包括以下步骤:In order to achieve the above object, the present application provides a method for determining a causal relationship, and the method for determining a causal relationship includes the following steps:
获取第一数据和第二数据,并基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据;Obtain the first data and the second data, and convert the second data based on the preset neural network and the first data to obtain the first data matching the divergence of the preset neural network with the first data. three data;
基于所述预设神经网络和第二数据,对所述第一数据进行转换,得到与所述第二数据在所述预设神经网络的散度上匹配的第四数据;Based on the preset neural network and the second data, converting the first data to obtain fourth data matching the divergence of the second data in the preset neural network;
获取所述第二数据与所述第三数据之间转换的第一转换时间,以及所述第一数据与所述第四数据之间转换的第二转换时间,并根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系。Acquiring a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and according to the first conversion time The corresponding first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient corresponding to the second conversion time determine the causal relationship between the first data and the second data.
可选地,所述根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系的步骤包括:Optionally, according to the first conversion difficulty degree coefficient corresponding to the first conversion time and the second conversion difficulty degree coefficient corresponding to the second conversion time, determine the relationship between the first data and the first data Two steps of causality between data include:
将所述第一转换难易程度系数与所述第二转换难易程度系数对比,确定所述第一转换难易程度系数与所述第二转换难易程度系数之间的大小关系;Comparing the first conversion difficulty degree coefficient with the second conversion difficulty degree coefficient, and determining the magnitude relationship between the first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient;
若所述大小关系为第一转换难易程度系数大于第二转换难易程度系数,则确定所述第一数据构成所述因果关系的原因,所述第二数据构成所述因果关系中的结果;If the magnitude relationship is that the first conversion difficulty degree coefficient is greater than the second conversion difficulty degree coefficient, determine the reason why the first data constitutes the causal relationship, and the second data constitutes the result in the causal relationship ;
若所述大小关系为第一转换难易程度系数小于第二转换难易程度系数,则确定所述第二数据构成所述因果关系的原因,所述第一数据构成所述因果关系中的结果。If the magnitude relationship is that the first conversion difficulty degree coefficient is smaller than the second conversion difficulty degree coefficient, determine the reason why the second data constitutes the causal relationship, and the first data constitutes the result in the causal relationship .
可选地,所述基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据的步骤包括:Optionally, the step of converting the second data based on the preset neural network and the first data to obtain third data matching the divergence of the preset neural network with the first data include:
将所述第一数据作为指引,基于所述预设神经网络中的生成器对所述第二数据进行转换,得到中间数据;Using the first data as a guide, convert the second data based on the generator in the preset neural network to obtain intermediate data;
基于预设神经网络中的判别器,对所述中间数据进行判别,确定所述第一数据与所述中间数据之间的散度值;Based on the discriminator in the preset neural network, the intermediate data is discriminated, and the divergence value between the first data and the intermediate data is determined;
根据所述散度值,确定与所述第一数据在所述预设神经网络的散度上匹配的第三数据。According to the divergence value, third data matching the divergence of the preset neural network with the first data is determined.
可选地,所述基于预设神经网络中的判别器,对所述中间数据进行判别,确定所述第一数据与所述中间数据之间的散度值的步骤包括:Optionally, the step of discriminating the intermediate data based on a discriminator in a preset neural network, and determining a divergence value between the first data and the intermediate data includes:
基于预设神经网络中的判别器,对所述中间数据进行判别,生成所述中间数据的第一概率分布;based on the discriminator in the preset neural network, discriminate the intermediate data, and generate a first probability distribution of the intermediate data;
获取所述第一数据的第二概率分布,并根据所述第一概率分布和所述第二概率分布,生成所述第一数据与所述中间数据之间的散度值。A second probability distribution of the first data is acquired, and a divergence value between the first data and the intermediate data is generated according to the first probability distribution and the second probability distribution.
可选地,所述根据所述散度值,确定与所述第一数据在所述预设神经网络的散度上匹配的第三数据的步骤包括:Optionally, the step of determining, according to the divergence value, third data that matches the first data on the divergence of the preset neural network includes:
判断所述散度值是否小于预设阈值,若小于预设阈值,则将所述中间数据确定为所述第三数据;Determine whether the divergence value is less than a preset threshold, and if it is less than a preset threshold, determine the intermediate data as the third data;
若所述散度值不小于所述预设阈值,则根据所述散度值,执行基于所述预设神经网络中的生成器对所述第二数据进行转换的步骤,直到所述散度值小于预设阈值。If the divergence value is not less than the preset threshold, perform the step of converting the second data based on the generator in the preset neural network according to the divergence value, until the divergence The value is less than the preset threshold.
可选地,所述将所述中间数据确定为所述第三数据的步骤之后,所述方法包括:Optionally, after the step of determining the intermediate data as the third data, the method includes:
查找对所述第二数据首次转换的第一时间点,以及确定所述第三数据的第二时间点;Find the first time point of the first conversion of the second data, and determine the second time point of the third data;
根据所述第一时间点和所述第二时间点,生成与所述第三数据对应的第一转换时间。According to the first time point and the second time point, a first conversion time corresponding to the third data is generated.
可选地,所述获取第一数据和第二数据的步骤之前,所述方法还包括:Optionally, before the step of acquiring the first data and the second data, the method further includes:
获取具有预设因果关系的第一数据集和第二数据集,并基于预设神经网络,对所述第一数据集和所述第二数据集进行处理,生成处理结果;acquiring a first data set and a second data set with a preset causal relationship, and processing the first data set and the second data set based on a preset neural network to generate a processing result;
判断所述处理结果中的因果关系,与所述预设因果关系是否一致,若一致,则基于预设神经网络对第一数据和第二数据处理。It is judged whether the causal relationship in the processing result is consistent with the preset causal relationship, and if so, the first data and the second data are processed based on the preset neural network.
进一步地,为实现上述目的,本申请还提供一种因果关系的确定装置,所述因果关系的确定装置包括:Further, in order to achieve the above object, the present application also provides a device for determining a causal relationship, the device for determining a causal relationship includes:
获取模块,用于获取第一数据和第二数据,并基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据;The acquisition module is used to acquire the first data and the second data, and based on the preset neural network and the first data, convert the second data to obtain the dispersion of the first data in the preset neural network. degree of matching third data;
转换模块,用于基于所述预设神经网络和第二数据,对所述第一数据进行转换,得到与所述第二数据在所述预设神经网络的散度上匹配的第四数据;a conversion module, configured to convert the first data based on the preset neural network and the second data to obtain fourth data matching the second data in the divergence of the preset neural network;
确定模块,用于获取所述第二数据与所述第三数据之间转换的第一转换时间,以及所述第一数据与所述第四数据之间转换的第二转换时间,并根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系。A determination module, configured to obtain a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and according to the A first conversion difficulty degree coefficient corresponding to the first conversion time and a second conversion difficulty degree coefficient corresponding to the second conversion time are used to determine a causal relationship between the first data and the second data.
进一步地,为实现上述目的,本申请还提供一种因果关系的确定设备,所述因果关系的确定设备包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的因果关系的确定程序,所述因果关系的确定程序被所述处理器执行时实现如上述所述的因果关系的确定方法的步骤。Further, in order to achieve the above object, the present application also provides a causal relationship determination device, the causal relationship determination device includes a memory, a processor, and a causal relationship stored on the memory and can be run on the processor. A program for determining a relationship, when the program for determining a causal relationship is executed by the processor implements the steps of the method for determining a causal relationship as described above.
进一步地,为实现上述目的,本申请还提供一种可读存储介质,所述可读存储介质上存储有因果关系的确定程序,所述因果关系的确定程序被处理器执行时实现如上所述的因果关系的确定方法的步骤。Further, in order to achieve the above purpose, the present application also provides a readable storage medium, on which a program for determining a causal relationship is stored, and the program for determining a causal relationship is executed by a processor to achieve the above-mentioned The steps of the method for determining the causal relationship.
有益效果beneficial effect
本申请的因果关系的确定方法、装置、设备及可读存储介质,与现有技术难以推断数据间因果关系相比,本申请采用的手段为:获取第一数据和第二数据,并以第一数据为参照,通过预设神经网络对第二数据进行转换,得到与第一数据在预设神经网络的散度上匹配的第三数据;同时以第二数据为参照,通过预设神经网络对第一数据进行转换,得到与第二数据在预设神经网络的散度上匹配的第四数据;进而获取第二数据与第三数据之间转换的第一转换时间,以及第一数据与第四数据之间转换的第二转换时间,并依据第一转换时间对应的第一转换难易程度系数,以及与第二转换时间对应的第二转换难易程度系数,来确定第一数据与第二数据之间的因果关系。其中,第一转换时间为形成第三数据的时长,其对应的第一转换难易程度系数表征将第二数据转换为与第一数据匹配的第三数据的难易程度,第二转换时间为形成第四数据的时长,其对应的第二转换难易程度系数表征将第一数据转换为与第二数据匹配的第四数据的难易程度。时长越短、对应的系数越小,则转换越容易,构成因果关系中的因;时长越长、对应的系数越大,则转换越困难,构成因果关系中的果;以此,通过第一转换时间与第二转换时间分别对应的第一转换难易程度系数和第二转换难易程度系数之间的大小关系,来准确确定第一数据和第二数据之间的因果关系。克服了现有技术中难以推断数据间因果关系的缺陷,有利于数据分析的精度提升。Compared with the method, device, device and readable storage medium for determining the causal relationship of the present application, which is difficult to infer the causal relationship between data in the prior art, the method adopted in the present application is: acquiring the first data and the second data, and using the first data One data is used as a reference, and the second data is converted through a preset neural network to obtain third data matching the divergence of the first data in the preset neural network; Convert the first data to obtain fourth data matching the second data on the divergence of the preset neural network; and then obtain the first conversion time of the conversion between the second data and the third data, and the first data and the The second conversion time for conversion between the fourth data is determined according to the first conversion difficulty degree coefficient corresponding to the first conversion time and the second conversion difficulty degree coefficient corresponding to the second conversion time. The causal relationship between the second data. The first conversion time is the length of time for forming the third data, the corresponding first conversion difficulty coefficient represents the difficulty of converting the second data into third data matching the first data, and the second conversion time is The duration of forming the fourth data, and the corresponding second conversion difficulty coefficient represents the difficulty of converting the first data into fourth data matching the second data. The shorter the time period and the smaller the corresponding coefficient, the easier the conversion is, and the cause in the causal relationship is formed; the longer the time period and the larger the corresponding coefficient, the more difficult the conversion is, and the effect in the causal relationship is formed. The magnitude relationship between the first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient corresponding to the conversion time and the second conversion time respectively is used to accurately determine the causal relationship between the first data and the second data. It overcomes the defect that it is difficult to infer the causal relationship between data in the prior art, and is beneficial to the improvement of the accuracy of data analysis.
附图说明Description of drawings
图1为本申请因果关系的确定设备实施例方案涉及的设备硬件运行环境的结构示意图;1 is a schematic structural diagram of a device hardware operating environment involved in a device embodiment solution for determining a causal relationship of the present application;
图2为本申请因果关系的确定方法第一实施例的流程示意图;2 is a schematic flowchart of a first embodiment of a method for determining a causal relationship of the present application;
图3为本申请因果关系的确定方法第三实施例的流程示意图;3 is a schematic flowchart of a third embodiment of a method for determining a causal relationship of the present application;
图4为本申请因果关系的确定装置较佳实施例的功能模块示意图。FIG. 4 is a schematic diagram of functional modules of a preferred embodiment of an apparatus for determining a causal relationship of the present application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics and advantages of the purpose of the present application will be further described with reference to the accompanying drawings in conjunction with the embodiments.
本发明的实施方式Embodiments of the present invention
应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application.
本申请提供一种因果关系的确定设备,参照图1,图1为本申请因果关系的确定设备实施例方案涉及的设备硬件运行环境的结构示意图。The present application provides a device for determining a causal relationship. Referring to FIG. 1 , FIG. 1 is a schematic structural diagram of a device hardware operating environment involved in an embodiment solution of the device for determining a causal relationship in the present application.
如图1所示,该因果关系的确定设备可以包括:处理器1001,例如CPU,通信总线1002、用户接口1003,网络接口1004,存储器1005。其中,通信总线1002用于实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard),可选用户接口1003还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如WI-FI接口)。存储器1005可以是高速RAM存储器,也可以是稳定的存储器(non-volatile memory),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储设备。As shown in FIG. 1 , the device for determining the causal relationship may include: a processor 1001 , such as a CPU, a communication bus 1002 , a user interface 1003 , a network interface 1004 , and a memory 1005 . Among them, the communication bus 1002 is used to realize the connection and communication between these components. The user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface. Optionally, the network interface 1004 may include a standard wired interface and a wireless interface (eg, a WI-FI interface). The memory 1005 may be high-speed RAM memory, or may be non-volatile memory, such as disk memory. Optionally, the memory 1005 may also be a storage device independent of the aforementioned processor 1001 .
本领域技术人员可以理解,图1中示出的因果关系的确定设备的硬件结构并不构成对因果关系的确定设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the hardware structure of the device for determining the causal relationship shown in FIG. 1 does not constitute a limitation on the device for determining the causal relationship, and may include more or less components than those shown in the figure, or combine some components, or a different arrangement of components.
如图1所示,作为一种可读存储介质的存储器1005中可以包括操作系统、网络通信模块、用户接口模块以及因果关系的确定程序。其中,操作系统是管理和控制因果关系的确定设备与软件资源的程序,支持网络通信模块、用户接口模块、因果关系的确定程序以及其他程序或软件的运行;网络通信模块用于管理和控制网络接口1004;用户接口模块用于管理和控制用户接口1003。As shown in FIG. 1 , the memory 1005 as a readable storage medium may include an operating system, a network communication module, a user interface module and a program for determining a causal relationship. Among them, the operating system is a program that manages and controls causal relationship determination equipment and software resources, and supports the operation of network communication modules, user interface modules, causal relationship determination programs, and other programs or software; network communication modules are used to manage and control the network. Interface 1004 ; the user interface module is used to manage and control the user interface 1003 .
在图1所示的因果关系的确定设备硬件结构中,网络接口1004主要用于连接后台服务器,与后台服务器进行数据通信;用户接口1003主要用于连接客户端(用户端),与客户端进行数据通信;处理器1001可以调用存储器1005中存储的因果关系的确定程序,并执行以下操作:In the hardware structure of the causal relationship determination device shown in FIG. 1 , the network interface 1004 is mainly used to connect to the background server and perform data communication with the background server; the user interface 1003 is mainly used to connect the client (client) and perform data communication with the client Data communication; the processor 1001 can call the causal relationship determination program stored in the memory 1005, and perform the following operations:
获取第一数据和第二数据,并基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据;Obtain the first data and the second data, and convert the second data based on the preset neural network and the first data to obtain the first data matching the divergence of the preset neural network with the first data. three data;
基于所述预设神经网络和第二数据,对所述第一数据进行转换,得到与所述第二数据在所述预设神经网络的散度上匹配的第四数据;Based on the preset neural network and the second data, converting the first data to obtain fourth data matching the divergence of the second data in the preset neural network;
获取所述第二数据与所述第三数据之间转换的第一转换时间,以及所述第一数据与所述第四数据之间转换的第二转换时间,并根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系。Acquiring a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and according to the first conversion time The corresponding first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient corresponding to the second conversion time determine the causal relationship between the first data and the second data.
进一步地,所述根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系的步骤包括:Further, according to the first conversion difficulty degree coefficient corresponding to the first conversion time and the second conversion difficulty degree coefficient corresponding to the second conversion time, determine the first data and the second conversion difficulty degree coefficient. The steps for causality between data include:
将所述第一转换难易程度系数与所述第二转换难易程度系数对比,确定所述第一转换难易程度系数与所述第二转换难易程度系数之间的大小关系;Comparing the first conversion difficulty degree coefficient with the second conversion difficulty degree coefficient, and determining the magnitude relationship between the first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient;
若所述大小关系为第一转换难易程度系数大于第二转换难易程度系数,则确定所述第一数据构成所述因果关系的原因,所述第二数据构成所述因果关系中的结果;If the magnitude relationship is that the first conversion difficulty degree coefficient is greater than the second conversion difficulty degree coefficient, determine the reason why the first data constitutes the causal relationship, and the second data constitutes the result in the causal relationship ;
若所述大小关系为第一转换难易程度系数小于第二转换难易程度系数,则确定所述第二数据构成所述因果关系的原因,所述第一数据构成所述因果关系中的结果。If the magnitude relationship is that the first conversion difficulty degree coefficient is smaller than the second conversion difficulty degree coefficient, determine the reason why the second data constitutes the causal relationship, and the first data constitutes the result in the causal relationship .
进一步地,所述基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据的步骤包括:Further, the step of converting the second data based on the preset neural network and the first data to obtain third data matching the divergence of the preset neural network with the first data includes: :
将所述第一数据作为指引,基于所述预设神经网络中的生成器对所述第二数据进行转换,得到中间数据;Using the first data as a guide, convert the second data based on the generator in the preset neural network to obtain intermediate data;
基于预设神经网络中的判别器,对所述中间数据进行判别,确定所述第一数据与所述中间数据之间的散度值;Based on the discriminator in the preset neural network, the intermediate data is discriminated, and the divergence value between the first data and the intermediate data is determined;
根据所述散度值,确定与所述第一数据在所述预设神经网络的散度上匹配的第三数据。According to the divergence value, third data matching the divergence of the preset neural network with the first data is determined.
进一步地,所述基于预设神经网络中的判别器,对所述中间数据进行判别,确定所述第一数据与所述中间数据之间的散度值的步骤包括:Further, the step of discriminating the intermediate data based on the discriminator in the preset neural network, and determining the divergence value between the first data and the intermediate data includes:
基于预设神经网络中的判别器,对所述中间数据进行判别,生成所述中间数据的第一概率分布;based on the discriminator in the preset neural network, discriminate the intermediate data, and generate a first probability distribution of the intermediate data;
获取所述第一数据的第二概率分布,并根据所述第一概率分布和所述第二概率分布,生成所述第一数据与所述中间数据之间的散度值。A second probability distribution of the first data is acquired, and a divergence value between the first data and the intermediate data is generated according to the first probability distribution and the second probability distribution.
进一步地,所述根据所述散度值,确定与所述第一数据在所述预设神经网络的散度上匹配的第三数据的步骤包括:Further, the step of determining, according to the divergence value, third data matching the divergence of the preset neural network with the first data includes:
判断所述散度值是否小于预设阈值,若小于预设阈值,则将所述中间数据确定为所述第三数据;Determine whether the divergence value is less than a preset threshold, and if it is less than a preset threshold, determine the intermediate data as the third data;
若所述散度值不小于所述预设阈值,则根据所述散度值,执行基于所述预设神经网络中的生成器对所述第二数据进行转换的步骤,直到所述散度值小于预设阈值。If the divergence value is not less than the preset threshold, perform the step of converting the second data based on the generator in the preset neural network according to the divergence value, until the divergence The value is less than the preset threshold.
进一步地,所述将所述中间数据确定为所述第三数据的步骤之后,处理器1001可以调用存储器1005中存储的因果关系的确定程序,并执行以下操作:Further, after the step of determining the intermediate data as the third data, the processor 1001 may call the causal relationship determination program stored in the memory 1005, and perform the following operations:
查找对所述第二数据首次转换的第一时间点,以及确定所述第三数据的第二时间点;Find the first time point of the first conversion of the second data, and determine the second time point of the third data;
根据所述第一时间点和所述第二时间点,生成与所述第三数据对应的第一转换时间。According to the first time point and the second time point, a first conversion time corresponding to the third data is generated.
进一步地,所述获取第一数据和第二数据的步骤之前,处理器1001可以调用存储器1005中存储的因果关系的确定程序,并执行以下操作:Further, before the step of acquiring the first data and the second data, the processor 1001 may call the causal relationship determination program stored in the memory 1005, and perform the following operations:
获取具有预设因果关系的第一数据集和第二数据集,并基于预设神经网络,对所述第一数据集和所述第二数据集进行处理,生成处理结果;acquiring a first data set and a second data set with a preset causal relationship, and processing the first data set and the second data set based on a preset neural network to generate a processing result;
判断所述处理结果中的因果关系,与所述预设因果关系是否一致,若一致,则基于预设神经网络对第一数据和第二数据处理。It is judged whether the causal relationship in the processing result is consistent with the preset causal relationship, and if so, the first data and the second data are processed based on the preset neural network.
本申请因果关系的确定设备的具体实施方式与下述因果关系的确定方法各实施例基本相同,在此不再赘述。The specific implementation of the device for determining the causal relationship of the present application is basically the same as the following embodiments of the method for determining the causal relationship, and details are not repeated here.
本申请还提供一种因果关系的确定方法。The present application also provides a method for determining a causal relationship.
参照图2,图2为本申请因果关系的确定方法第一实施例的流程示意图。Referring to FIG. 2 , FIG. 2 is a schematic flowchart of a first embodiment of a method for determining a causal relationship of the present application.
本申请实施例提供了因果关系的确定方法的实施例,需要说明的是,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。具体地,本实施例中的因果关系的确定方法包括:This embodiment of the present application provides an embodiment of a method for determining a causal relationship. It should be noted that although a logical sequence is shown in the flowchart, in some cases, the sequence shown here may be performed in a different order. or the described steps. Specifically, the method for determining the causal relationship in this embodiment includes:
步骤S10,获取第一数据和第二数据,并基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据;Step S10, acquiring the first data and the second data, and converting the second data based on the preset neural network and the first data to obtain the divergence of the preset neural network with the first data matching third data;
本实施例中因果关系的确定方法应用于确定设备,该确定设备可以是服务器,也可以是客户端。对于服务器,则与多个具有因果关系确定需求的客户端通信连接,本实施例以客户端为例进行说明。其中,客户端部署有预设神经网络,并且该预设神经网络优选为对抗神经网络(GAN,Generative Adversarial Network),对抗神经网络中包括有生成器和判别器。由生成器生成仿真的信息去欺骗判别器,判别器则用以区分仿真的信息和真实的信息;最终实现生成器生成足以“以假乱真”的信息,判别器则难以判断生成器生成信息的真假。本实施借助预设神经网络分析数据之间的因果关系。The method for determining a causal relationship in this embodiment is applied to a determining device, and the determining device may be a server or a client. As for the server, it communicates and connects with a plurality of clients that have a causal relationship determination requirement, and this embodiment takes clients as an example for description. Wherein, the client is deployed with a preset neural network, and the preset neural network is preferably an adversarial neural network (GAN, Generative Neural Network). Adversarial Network), the adversarial neural network includes a generator and a discriminator. The generator generates simulated information to deceive the discriminator, and the discriminator is used to distinguish the simulated information from the real information; in the end, the generator generates enough information to be "real", and the discriminator is difficult to judge the authenticity of the information generated by the generator. . This implementation analyzes the causal relationship between the data by means of a preset neural network.
具体地,将具有因果关系解释需求的数据作为第一数据和第二数据,并通过预设神经网络和第一数据,对第二数据进行转换。将第一数据作为参照,通过预设神经网络中的生成器对第二数据进行转换,生成为与第一数据相似的仿真数据。并且,通过预设神经网络中的判别器对生成的仿真数据进行判别,通过判别的难易程度来生成与第一数据在预设神经网络的散度上匹配的第三数据。其中,散度用于比较两个概率分布的接近程度,本实施例用预设神经网络的散度来表征经预设神经网络处理后的仿真数据与真实数据之间的相似度。散度上匹配的第三数据,表征通过生成器生成的第三数据与第一数据之间的相似度较高,难以通过判别器判别。Specifically, the data with the causal relationship explanation requirement is used as the first data and the second data, and the second data is converted through a preset neural network and the first data. Taking the first data as a reference, the generator in the preset neural network converts the second data to generate simulation data similar to the first data. In addition, the generated simulation data is discriminated by the discriminator in the preset neural network, and the third data matching the divergence of the preset neural network with the first data is generated according to the difficulty of the discrimination. The divergence is used to compare the closeness of two probability distributions. In this embodiment, the divergence of the preset neural network is used to represent the similarity between the simulated data processed by the preset neural network and the real data. The third data matched in divergence indicates that the similarity between the third data generated by the generator and the first data is high, and it is difficult to be discriminated by the discriminator.
步骤S20,基于所述预设神经网络和第二数据,对所述第一数据进行转换,得到与所述第二数据在所述预设神经网络的散度上匹配的第四数据;Step S20, converting the first data based on the preset neural network and the second data to obtain fourth data matching the second data on the divergence of the preset neural network;
进一步地,对于第一数据,则通过预设神经网络和第二数据进行转换。将第二数据作为参照,由预设神经网络中的生成器对第一数据进行转换,生成与第二数据相似的仿真数据。同样地,通过预设神经网络中的判别器对生成的仿真数据进行判别,生成与第二数据在预设神经网络的散度上匹配的第四数据,体现与第二数据之间的高度相似性。Further, for the first data, the preset neural network and the second data are used for conversion. Using the second data as a reference, the generator in the preset neural network converts the first data to generate simulation data similar to the second data. Similarly, the generated simulation data is discriminated by the discriminator in the preset neural network to generate fourth data that matches the second data in the divergence of the preset neural network, reflecting the high similarity with the second data sex.
步骤S30,获取所述第二数据与所述第三数据之间转换的第一转换时间,以及所述第一数据与所述第四数据之间转换的第二转换时间,并根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系。Step S30: Acquire a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and perform the conversion according to the first conversion time. A first conversion difficulty degree coefficient corresponding to a conversion time and a second conversion difficulty degree coefficient corresponding to the second conversion time determine a causal relationship between the first data and the second data.
更进一步地,在对第二数据进行转换的过程中,对转换的时长进行记录,得到生成第三数据所花费的时长,作为第二数据与第三数据之间的第一转换时间。同时,对于第一数据的转换,也记录时长,得到生成第四数据所花费的时长,作为第一数据与第四数据之间的第二转换时间。时长的差异反应了转换的难易程度,转换所花费的时长越短,转换越容易,反之则转换越难。而转换的过程体现了原因和结果之间逻辑关系的推导过程,转换越容易,则推导越容易,体现由原因向结果的推导。转换越难,则推导越不容易,体现由结果向原因的推导。因此,可通过转换所花费的时长所表征的难易程度,来体现数据之间的因果关系。即由第一转换时间和第二转换时间之间的大小关系,来确定第一数据和第二数据之间的因果关系。具体地,根据第一转换时间对应的第一转换难易程度系数,与第二转换时间对应的第二转换难易程度系数,确定第一数据与第二数据之间的因果关系的步骤包括:Furthermore, in the process of converting the second data, the conversion duration is recorded to obtain the time duration spent generating the third data as the first conversion time between the second data and the third data. At the same time, for the conversion of the first data, the duration is also recorded to obtain the duration of generating the fourth data as the second conversion time between the first data and the fourth data. The difference in duration reflects the difficulty of the conversion. The shorter the conversion time, the easier the conversion, and vice versa. The transformation process reflects the derivation process of the logical relationship between the cause and the result. The easier the transformation, the easier the derivation, reflecting the derivation from the cause to the result. The more difficult the conversion, the more difficult the derivation, reflecting the derivation from the result to the cause. Therefore, the causal relationship between the data can be reflected by the degree of difficulty represented by the time spent in the conversion. That is, the causal relationship between the first data and the second data is determined by the magnitude relationship between the first conversion time and the second conversion time. Specifically, according to the first conversion difficulty degree coefficient corresponding to the first conversion time, and the second conversion difficulty degree coefficient corresponding to the second conversion time, the step of determining the causal relationship between the first data and the second data includes:
步骤S31,将所述第一转换难易程度系数与所述第二转换难易程度系数对比,确定所述第一转换难易程度系数与所述第二转换难易程度系数之间的大小关系;Step S31, comparing the first conversion difficulty coefficient with the second conversion difficulty coefficient to determine the magnitude relationship between the first conversion difficulty coefficient and the second conversion difficulty coefficient ;
步骤S32,若所述大小关系为第一转换难易程度系数大于第二转换难易程度系数,则确定所述第一数据构成所述因果关系的原因,所述第二数据构成所述因果关系中的结果;Step S32, if the magnitude relationship is that the first conversion difficulty degree coefficient is greater than the second conversion difficulty degree coefficient, determine the reason why the first data constitutes the causal relationship, and the second data constitutes the causal relationship result in;
步骤S33,若所述大小关系为第一转换难易程度系数小于第二转换难易程度系数,则确定所述第二数据构成所述因果关系的原因,所述第一数据构成所述因果关系中的结果。Step S33, if the magnitude relationship is that the first conversion difficulty degree coefficient is smaller than the second conversion difficulty degree coefficient, determine the reason why the second data constitutes the causal relationship, and the first data constitutes the causal relationship results in .
更进一步地,将第一转换时间和第二转换时间对比,确定两者之间的大小关系。若经对比确定大小关系为第一转换时间大于第二转换时间,则说明第二数据向第一数据转换的时间长,而第一数据向第二数据转换的时间短,故可确定第一数据构成因果关系中的原因,第二数据构成用过关系中的结果。反之若经对比确定大小关系为第一转换时间小于第二转换时间,则说明第二数据向第一数据转换的时间短,而第一数据向第二数据转换的时间长,故可确定第二数据构成因果关系中的原因,第一数据构成用过关系中的结果。Further, the first conversion time and the second conversion time are compared to determine the magnitude relationship between the two. If it is determined by comparison that the magnitude relationship is that the first conversion time is greater than the second conversion time, it means that the conversion time of the second data to the first data is long, and the conversion time of the first data to the second data is short, so the first data can be determined. The second data constitutes the cause in the causal relationship, and the second data constitutes the effect in the used relationship. Conversely, if it is determined by comparison that the size relationship is that the first conversion time is less than the second conversion time, it means that the conversion time of the second data to the first data is short, and the conversion time of the first data to the second data is long, so it can be determined that the second data is converted to the second data. The data constitute the cause in the causal relationship, and the first data constitute the effect in the used relationship.
需要说明的是,对于第一转换时间和第二转换时间的大小关系相同的情况,难以区分第一数据和第二数据之间的因果关系,则输出第一数据和第二数据之间不存在因果关系的提示信息。此外,为了防止转换时间过长,预先设定表征转换时长超出正常转换时长的预设时间阈值,当第一转换时间和第二转换时间中均大于该预设时间阈值,则说明第一数据难以向第二数据转换,且第二数据也难以向第一数据转换,第一数据和第二数据之间不存在原因和结果之间的逻辑关系,故而输出不存在因果关系的提示信息。当第一转换时间和第二转换时间中存在任一项大于预设时间阈值,则根据不大于预设时间阈值的转换时间,确定第一数据和第二数据之间的因果关系。如若第一转换时间大于预设时间阈值,而第二转换时间不大于预设之间阈值,则依据第二转换时间确定第一数据和第二数据之间的因果关系。因第二转换时间为第一数据向第二数据转换的时间,第二转换时间大于预设时间阈值,则说明第一数据向第二数据转换的时间短,故第一数据构成因果关系中的原因,而第二数据构成因果关系中的结果。以此,依据第一转换时间和第二转换时间之间的大小关系,确定第一数据和第二数据之间的因果关系。It should be noted that, for the case where the magnitude relationship between the first conversion time and the second conversion time is the same, it is difficult to distinguish the causal relationship between the first data and the second data, and there is no output between the first data and the second data. Hint information about causality. In addition, in order to prevent the conversion time from being too long, a preset time threshold is preset to indicate that the conversion time exceeds the normal conversion time. When both the first conversion time and the second conversion time are greater than the preset time threshold, it means that the first data is difficult to Converting to the second data, and the second data is also difficult to convert to the first data, there is no logical relationship between the cause and the result between the first data and the second data, so the prompt information that there is no causal relationship is output. When any one of the first conversion time and the second conversion time is greater than the preset time threshold, the causal relationship between the first data and the second data is determined according to the conversion time not greater than the preset time threshold. If the first transition time is greater than the preset time threshold and the second transition time is not greater than the preset interval threshold, the causal relationship between the first data and the second data is determined according to the second transition time. Since the second conversion time is the time when the first data is converted to the second data, and the second conversion time is greater than the preset time threshold, it means that the conversion time of the first data to the second data is short, so the first data constitutes a causal relationship. cause, while the second data constitutes the effect in a causal relationship. In this way, according to the magnitude relationship between the first conversion time and the second conversion time, the causal relationship between the first data and the second data is determined.
本申请的因果关系的确定方法,与现有技术难以推断数据间因果关系相比,本申请采用的手段为:获取第一数据和第二数据,并以第一数据为参照,通过预设神经网络对第二数据进行转换,得到与第一数据在预设神经网络的散度上匹配的第三数据;同时以第二数据为参照,通过预设神经网络对第一数据进行转换,得到与第二数据在预设神经网络的散度上匹配的第四数据;进而获取第二数据与第三数据之间转换的第一转换时间,以及第一数据与第四数据之间转换的第二转换时间,并依据第一转换时间对应的第一转换难易程度系数,以及与第二转换时间对应的第二转换难易程度系数,来确定第一数据与第二数据之间的因果关系。其中,第一转换时间为形成第三数据的时长,其对应的第一转换难易程度系数表征将第二数据转换为与第一数据匹配的第三数据的难易程度,第二转换时间为形成第四数据的时长,其对应的第二转换难易程度系数表征将第一数据转换为与第二数据匹配的第四数据的难易程度。时长越短、对应的系数越小,则转换越容易,构成因果关系中的因;时长越长、对应的系数越大,则转换越困难,构成因果关系中的果;以此,通过第一转换时间与第二转换时间分别对应的第一转换难易程度系数和第二转换难易程度系数之间的大小关系,来准确确定第一数据和第二数据之间的因果关系。克服了现有技术中难以推断数据间因果关系的缺陷,有利于数据分析的精度提升。Compared with the method for determining the causal relationship of the present application, which is difficult to infer the causal relationship between the data in the prior art, the method adopted in the present application is: obtaining the first data and the second data, and using the first data as a reference, through the preset neural network The network converts the second data to obtain third data that matches the first data on the divergence of the preset neural network; at the same time, taking the second data as a reference, converts the first data through the preset neural network to obtain the same The second data matches the fourth data on the divergence of the preset neural network; and then obtains the first conversion time of the conversion between the second data and the third data, and the second conversion time of the conversion between the first data and the fourth data. The conversion time is determined, and the causal relationship between the first data and the second data is determined according to the first conversion difficulty degree coefficient corresponding to the first conversion time and the second conversion difficulty degree coefficient corresponding to the second conversion time. The first conversion time is the length of time for forming the third data, the corresponding first conversion difficulty coefficient represents the difficulty of converting the second data into third data matching the first data, and the second conversion time is The duration of forming the fourth data, and the corresponding second conversion difficulty coefficient represents the difficulty of converting the first data into fourth data matching the second data. The shorter the time period and the smaller the corresponding coefficient, the easier the conversion is, and the cause in the causal relationship is formed; the longer the time period and the larger the corresponding coefficient, the more difficult the conversion is, and the effect in the causal relationship is formed. The magnitude relationship between the first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient corresponding to the conversion time and the second conversion time respectively is used to accurately determine the causal relationship between the first data and the second data. It overcomes the defect that it is difficult to infer the causal relationship between data in the prior art, and is beneficial to the improvement of the accuracy of data analysis.
进一步地,基于本申请因果关系的确定方法的第一实施例,提出本申请因果关系的确定方法第二实施例。Further, based on the first embodiment of the method for determining a causal relationship of the present application, a second embodiment of the method for determining a causal relationship of the present application is proposed.
所述因果关系的确定方法第二实施例与所述因果关系的确定方法第一实施例的区别在于,所述基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据的步骤包括:The difference between the second embodiment of the method for determining a causal relationship and the first embodiment of the method for determining a causal relationship is that the second data is converted based on the preset neural network and the first data to obtain the same The step of matching the third data with the first data on the divergence of the preset neural network includes:
步骤S14,将所述第一数据作为指引,基于所述预设神经网络中的生成器对所述第二数据进行转换,得到中间数据;Step S14, using the first data as a guide, converting the second data based on the generator in the preset neural network to obtain intermediate data;
步骤S15,基于预设神经网络中的判别器,对所述中间数据进行判别,确定所述第一数据与所述中间数据之间的散度值;Step S15, based on a discriminator in a preset neural network, discriminate the intermediate data, and determine a divergence value between the first data and the intermediate data;
本实施例通过预设神经网络和第一数据对第二数据进行转换,得到与第一数据在预设神经网络的散度上匹配的第三数据。具体地,以第一数据作为指引,由预设神经网络中的生成器对第二数据进行转换,得到转换的中间数据。此后,通过预设神经网络中的判别器对转换的中间数据进行判别,将中间数据从第一数据中区分出来,并生成第一数据与中间数据之间的散度值。具体地,基于预设神经网络中的判别器,对中间数据进行判别,确定第一数据与中间数据之间的散度值的步骤包括:In this embodiment, the second data is converted by the preset neural network and the first data, so as to obtain third data matching the divergence of the first data in the preset neural network. Specifically, using the first data as a guide, the generator in the preset neural network converts the second data to obtain the converted intermediate data. Thereafter, the converted intermediate data is discriminated by a discriminator in a preset neural network, the intermediate data is distinguished from the first data, and a divergence value between the first data and the intermediate data is generated. Specifically, based on the discriminator in the preset neural network, the intermediate data is discriminated, and the step of determining the divergence value between the first data and the intermediate data includes:
步骤S151,基于预设神经网络中的判别器,对所述中间数据进行判别,生成所述中间数据的第一概率分布;Step S151, based on a discriminator in a preset neural network, discriminate the intermediate data, and generate a first probability distribution of the intermediate data;
步骤S152,获取所述第一数据的第二概率分布,并根据所述第一概率分布和所述第二概率分布,生成所述第一数据与所述中间数据之间的散度值。Step S152: Acquire a second probability distribution of the first data, and generate a divergence value between the first data and the intermediate data according to the first probability distribution and the second probability distribution.
进一步地,调用预设神经网络中的判别器,对中间数据进行判别,生成中间数据的第一概率分布,通过第一概率分布描述中间数据所含有的信息。同时针对第一数据生成第二概率分布,由第二概率分布描述第一数据所含有的信息。进而对第一概率分布和第二概率分布进行计算,生成两者之间的散度值;其中,计算可以是计算两者对数差的期望值,所得到的期望值结果即为散度值。Further, the discriminator in the preset neural network is invoked to discriminate the intermediate data, generate a first probability distribution of the intermediate data, and describe the information contained in the intermediate data through the first probability distribution. At the same time, a second probability distribution is generated for the first data, and the information contained in the first data is described by the second probability distribution. Then, the first probability distribution and the second probability distribution are calculated to generate a divergence value between them; wherein, the calculation may be to calculate the expected value of the logarithmic difference between the two, and the obtained expected value result is the divergence value.
步骤S16,根据所述散度值,确定与所述第一数据在所述预设神经网络的散度上匹配的第三数据。Step S16, according to the divergence value, determine third data matching the divergence of the preset neural network with the first data.
更进一步地,因散度值用于表征通过预设神经网络转换得到的数据与真实数据之间的相似性,故可通过散度值所表征的相似性高低,确定与第一数据在散度上匹配的第三数据,即与第一数据相似性高的第三数据。具体地,根据散度值,确定与第一数据在预设神经网络的散度上匹配的第三数据的步骤包括:Furthermore, since the divergence value is used to represent the similarity between the data converted by the preset neural network and the real data, it is possible to determine the divergence between the first data and the first data by the similarity represented by the divergence value. The matching third data, that is, the third data with high similarity with the first data. Specifically, according to the divergence value, the step of determining the third data matching the divergence of the preset neural network with the first data includes:
步骤S161,判断所述散度值是否小于预设阈值,若小于预设阈值,则将所述中间数据确定为所述第三数据;Step S161, judging whether the divergence value is less than a preset threshold, and if it is less than a preset threshold, then determining the intermediate data as the third data;
步骤S162,若所述散度值不小于所述预设阈值,则根据所述散度值,执行基于所述预设神经网络中的生成器对所述第二数据进行转换的步骤,直到所述散度值小于预设阈值。Step S162, if the divergence value is not less than the preset threshold, then according to the divergence value, perform the step of converting the second data based on the generator in the preset neural network, until the predetermined threshold is reached. The divergence value is less than the preset threshold.
进一步地,预先设置表征相似性高低的预设阈值,将所生成的散度值和该预设阈值对比,判断散度值是否小于预设阈值。若经对比确定散度值小于预设阈值,则说明第一数据的概率分布和中间数据的概率分布之间接近程度较高,从而将中间数据确定为与第一数据具有较高相似性的第三数据。反之,若经对比确定散度值不小于预设阈值,则说明经预设神经网络处理得到的中间数据,与第一数据之间的差别较大,故而需要通过预设神经网络继续以第一数据为参考,对第二数据迭代转换。Further, a preset threshold representing the level of similarity is preset, and the generated divergence value is compared with the preset threshold to determine whether the divergence value is smaller than the preset threshold. If it is determined by comparison that the divergence value is smaller than the preset threshold, it means that the probability distribution of the first data and the probability distribution of the intermediate data are close to each other, so that the intermediate data is determined as the first data having a high similarity with the first data. Three data. On the contrary, if it is determined by comparison that the divergence value is not less than the preset threshold, it means that the intermediate data obtained by the preset neural network processing is quite different from the first data, so it is necessary to continue to use the first data through the preset neural network. The data is used as a reference, and the second data is iteratively transformed.
更进一步地,在对第二数据迭代转换过程中,将散度值作为参考,由预设神经网络依据第一数据将第二数据朝向缩小差别的方向转换,再次生成中间数据进行判别得到散度值,确定是否小于预设阈值。若小于预设阈值,则将该中间数据确定为第三数据。若仍不小于预设阈值,则继续转换得到新的中间数据;如此循环,直到所生成的散度值小于预设阈值,得到中间数据确定为与第一数据在预设神经网络的散度上匹配的第三数据。Further, in the iterative conversion process of the second data, the divergence value is used as a reference, the preset neural network converts the second data in the direction of reducing the difference according to the first data, and generates intermediate data again to determine the divergence. value to determine whether it is less than the preset threshold. If it is less than the preset threshold, the intermediate data is determined as the third data. If it is still not less than the preset threshold, continue to convert to obtain new intermediate data; this cycle until the generated divergence value is less than the preset threshold, and the obtained intermediate data is determined to be on the divergence of the preset neural network with the first data. matching third data.
进一步地,所述将所述中间数据确定为所述第三数据的步骤之后,所述方法包括:Further, after the step of determining the intermediate data as the third data, the method includes:
步骤a1,查找对所述第二数据首次转换的第一时间点,以及确定所述第三数据的第二时间点;Step a1, find the first time point of the first conversion to the second data, and determine the second time point of the third data;
步骤a2,根据所述第一时间点和所述第二时间点,生成与所述第三数据对应的第一转换时间。Step a2, generating a first conversion time corresponding to the third data according to the first time point and the second time point.
本实施例在开始对第二数据进行首次转换时,对转换的时间点进行记录,记录为第一时间点。在得到第三数据后,查找该记录的第一时间点,同时记录当前得到第三数据的时间点,作为生成第三数据的第二时间点。在第一时间点和第二时间点之间对比,得到两者之间的时间间隔。该时间间隔即为转换得到第三数据所花费的时间,将其作为与第三数据对应的第一转换时间,以用于结合第四数据对应的第二转换时间,来确定第一数据与第二数据之间的因果关系。In this embodiment, when the second data is converted for the first time, the time point of the conversion is recorded as the first time point. After the third data is obtained, the recorded first time point is searched, and the current time point at which the third data is obtained is recorded as the second time point for generating the third data. Compare between the first time point and the second time point to get the time interval between the two. The time interval is the time it takes to convert the third data, which is used as the first conversion time corresponding to the third data to be used to determine the difference between the first data and the third data in combination with the second conversion time corresponding to the fourth data. The causal relationship between the two data.
需要说明的是,在生成第四数据时,以第二数据作为指引,并通过预设神经网络对第一数据进行转换,得到中间数据和第二数据进行判别,生成两者之间的散度值,以此确定第四数据。其中,该第四数据的具体生成过程,与上述第三数据的生成过程具有相似性,在此不做赘述。It should be noted that when generating the fourth data, the second data is used as a guide, and the first data is converted through a preset neural network to obtain the intermediate data and the second data for discrimination, and generate the divergence between the two. value to determine the fourth data. The specific generation process of the fourth data is similar to the above-mentioned generation process of the third data, and details are not described here.
本实施例设置循环更新转换机制,对于每次转换得到的中间数据,均和第一数据进行散度值计算,直到所计算得到的散度值大于预设阈值后,将中间数据作为接近第一数据的第三数据。进而由得到第三数据整个过程所花费的时间长短体现转换的难易程度,通过准确确定第三数据来提高因果关系确定的准确性。In this embodiment, a cyclic update conversion mechanism is set, and for the intermediate data obtained by each conversion, the divergence value calculation is performed with the first data, until the calculated divergence value is greater than the preset threshold, the intermediate data is regarded as close to the first data. The third data of the data. Further, the degree of difficulty of conversion is reflected by the length of time spent in the entire process of obtaining the third data, and the accuracy of determining the causal relationship is improved by accurately determining the third data.
进一步地,请参照图3,基于本申请因果关系的确定方法的第一或第二实施例,提出本申请因果关系的确定方法第三实施例。Further, referring to FIG. 3 , based on the first or second embodiment of the method for determining the causal relationship of the present application, a third embodiment of the method for determining the causal relationship of the present application is proposed.
所述因果关系的确定方法第三实施例与所述因果关系的确定方法第一或第二实施例的区别在于,所述获取第一数据和第二数据的步骤之前,所述方法还包括:The difference between the third embodiment of the method for determining a causal relationship and the first or second embodiment of the method for determining a causal relationship is that, before the step of acquiring the first data and the second data, the method further includes:
步骤S40,获取具有预设因果关系的第一数据集和第二数据集,并基于预设神经网络,对所述第一数据集和所述第二数据集进行处理,生成处理结果;Step S40, obtaining a first data set and a second data set with a preset causal relationship, and processing the first data set and the second data set based on a preset neural network to generate a processing result;
步骤S50,判断所述处理结果中的因果关系,与所述预设因果关系是否一致,若一致,则基于预设神经网络对第一数据和第二数据处理。Step S50, judging whether the causal relationship in the processing result is consistent with the preset causal relationship, and if so, the first data and the second data are processed based on the preset neural network.
本实施例对预设神经网络的有效性进行确定,即确定预设神经网络是否可有效对数据间的因果关系进行判定。具体地,获取具有预设因果关系的第一数据集和第二数据集,第一数据集和第二数据集中均包含有多项数据。预设因果关系表征第一数据集中数据和第二数据集中数据之间设定有明确的因果逻辑关系。如第一数据集中的数据为因果关系中的原因,第二数据集中的数据为因果关系中的结果。In this embodiment, the validity of the preset neural network is determined, that is, it is determined whether the preset neural network can effectively determine the causal relationship between data. Specifically, a first data set and a second data set having a preset causal relationship are acquired, and both the first data set and the second data set contain multiple items of data. The preset causal relationship indicates that a clear causal logical relationship is set between the data in the first data set and the data in the second data set. For example, the data in the first data set is the cause in the causal relationship, and the data in the second data set is the result in the causal relationship.
进一步地,通过预设神经网络,对第一数据集和第二数据集进行处理,生成处理结果,该处理结果表征对第一数据集和第二数据集进行判断,所生成两者中各项数据之间原因和结果的因果关系。进而将处理结果所表征的因果关系和预设因果关系对比,判断两者是否一致,若一致则说明通过预设神经网络处理所得到的因果关系,与原本的因果关系一致,所处理得到的结果准确,可将预设神经网络用于因果关系的判定,用于对第一数据和第二数据进行处理,以确定两者之间的因果关系。Further, through the preset neural network, the first data set and the second data set are processed, and a processing result is generated, and the processing result indicates that the first data set and the second data set are judged, and the generated items in the two are generated. The causal relationship between cause and effect between data. Then, compare the causal relationship represented by the processing result with the preset causal relationship, and judge whether the two are consistent. If they are consistent, it means that the causal relationship obtained through the preset neural network processing is consistent with the original causal relationship, and the processed results are obtained. Precisely, the preset neural network can be used to determine the causal relationship, and used to process the first data and the second data to determine the causal relationship between the two.
反之,若处理所得到的因果关系与原本的因果关系不一致,则说明处理结果不准确,不能将预设神经网络用于因果关系的判定,而需要重新设定更新参数进行判定。若更新参数后的预设神经网络可准确得出处理结果,则用更新参数后的预设神经网络对第一数据和第二数据进行处理。否则继续更新参数,如此循环,直到得到准确的处理结果后,用于对第一数据和第二数据进行处理,判断两者之间的因果关系。Conversely, if the causal relationship obtained by processing is inconsistent with the original causal relationship, it means that the processing result is inaccurate, and the preset neural network cannot be used to determine the causal relationship, and the update parameters need to be reset for determination. If the preset neural network after updating the parameters can accurately obtain the processing result, the preset neural network after updating the parameters is used to process the first data and the second data. Otherwise, continue to update the parameters, and repeat this cycle until an accurate processing result is obtained, which is used to process the first data and the second data to determine the causal relationship between the two.
需要说明的是,第一数据集和第二数据集中均包含多项具有预设因果关系的数据,所生成的处理结果中则包含多项数据之间的因果关系,在确定处理结果中的因果关系与预设因果关系是否一致时,其实质为确定各项数据之间的因果关系是否与预设因果关系一致。考虑到处理过程中外界环境因素的影响,难以到达各项因果关系均与预设因果关系一致的情形,故预先依据需求设定预设阈值,如95%。当各项因果关系中与预设因果关系一致的比例大于预设阈值,则判定为一致,反之则判定为不一致而需要更新参数重新处理,直到比例大于预设阈值。It should be noted that both the first data set and the second data set contain multiple pieces of data with preset causal relationships, and the generated processing results contain causal relationships between multiple pieces of data. When the relationship is consistent with the preset causal relationship, its essence is to determine whether the causal relationship between the various data is consistent with the preset causal relationship. Considering the influence of external environmental factors in the processing process, it is difficult to reach a situation where each causal relationship is consistent with the preset causal relationship, so a preset threshold, such as 95%, is set in advance according to the needs. When the proportion of each causal relationship that is consistent with the preset causal relationship is greater than the preset threshold, it is determined to be consistent; otherwise, it is determined to be inconsistent and parameters need to be updated for reprocessing until the proportion is greater than the preset threshold.
本实施例通过确定预设神经网络是否可有效对数据间的因果关系进行判定,来确保对第一数据和第二数据之间因果关系判定的准确性。同时设置比例机制,在确保准确判定的同时提升判定效率。This embodiment ensures the accuracy of determining the causal relationship between the first data and the second data by determining whether the preset neural network can effectively determine the causal relationship between the data. At the same time, a proportional mechanism is set up to improve the efficiency of judgment while ensuring accurate judgment.
本申请还提供一种因果关系的确定装置。The present application also provides a device for determining a causal relationship.
参照图4,图4为本申请因果关系的确定装置第一实施例的功能模块示意图。所述因果关系的确定装置包括:Referring to FIG. 4 , FIG. 4 is a schematic diagram of functional modules of a first embodiment of an apparatus for determining a causal relationship of the present application. The means for determining the causal relationship includes:
获取模块,用于获取第一数据和第二数据,并基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据;The acquisition module is used to acquire the first data and the second data, and based on the preset neural network and the first data, convert the second data to obtain the dispersion of the first data in the preset neural network. degree of matching third data;
转换模块,用于基于所述预设神经网络和第二数据,对所述第一数据进行转换,得到与所述第二数据在所述预设神经网络的散度上匹配的第四数据;a conversion module, configured to convert the first data based on the preset neural network and the second data to obtain fourth data matching the second data in the divergence of the preset neural network;
确定模块,用于获取所述第二数据与所述第三数据之间转换的第一转换时间,以及所述第一数据与所述第四数据之间转换的第二转换时间,并根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系。A determination module, configured to obtain a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and according to the A first conversion difficulty degree coefficient corresponding to the first conversion time and a second conversion difficulty degree coefficient corresponding to the second conversion time are used to determine a causal relationship between the first data and the second data.
进一步地,所述确定模块包括:Further, the determining module includes:
第一确定单元,用于将所述第一转换难易程度系数与所述第二转换难易程度系数对比,确定所述第一转换难易程度系数与所述第二转换难易程度系数之间的大小关系;a first determining unit, configured to compare the first conversion difficulty level coefficient with the second conversion difficulty level coefficient, and determine the difference between the first conversion difficulty level coefficient and the second conversion difficulty level coefficient the size relationship between
第一构成单元,用于若所述大小关系为第一转换难易程度系数大于第二转换难易程度系数,则确定所述第一数据构成所述因果关系的原因,所述第二数据构成所述因果关系中的结果;The first constituting unit is configured to determine the reason why the first data constitutes the causal relationship if the magnitude relationship is that the first conversion difficulty degree coefficient is greater than the second conversion difficulty degree coefficient, and the second data constitutes the causal relationship. the outcome of the causal relationship;
第二构成单元,用于若所述大小关系为第一转换难易程度系数小于第二转换难易程度系数,则确定所述第二数据构成所述因果关系的原因,所述第一数据构成所述因果关系中的结果。The second structure unit is configured to determine the reason why the second data constitutes the causal relationship if the magnitude relationship is that the first conversion difficulty degree coefficient is smaller than the second conversion difficulty degree coefficient, and the first data constitutes the causal relationship. result in the causal relationship.
进一步地,所述获取模块包括:Further, the acquisition module includes:
转换单元,用于将所述第一数据作为指引,基于所述预设神经网络中的生成器对所述第二数据进行转换,得到中间数据;a conversion unit, configured to use the first data as a guide, and convert the second data based on the generator in the preset neural network to obtain intermediate data;
判别单元,用于基于预设神经网络中的判别器,对所述中间数据进行判别,确定所述第一数据与所述中间数据之间的散度值;A discriminating unit for discriminating the intermediate data based on a discriminator in a preset neural network, and determining a divergence value between the first data and the intermediate data;
确定单元,用于根据所述散度值,确定与所述第一数据在所述预设神经网络的散度上匹配的第三数据。A determining unit, configured to determine, according to the divergence value, third data matching the divergence of the preset neural network with the first data.
进一步地,所述判别单元还用于:Further, the discrimination unit is also used for:
基于预设神经网络中的判别器,对所述中间数据进行判别,生成所述中间数据的第一概率分布;based on the discriminator in the preset neural network, discriminate the intermediate data, and generate a first probability distribution of the intermediate data;
获取所述第一数据的第二概率分布,并根据所述第一概率分布和所述第二概率分布,生成所述第一数据与所述中间数据之间的散度值。A second probability distribution of the first data is acquired, and a divergence value between the first data and the intermediate data is generated according to the first probability distribution and the second probability distribution.
进一步地,所述确定单元还用于:Further, the determining unit is also used for:
判断所述散度值是否小于预设阈值,若小于预设阈值,则将所述中间数据确定为所述第三数据;Determine whether the divergence value is less than a preset threshold, and if it is less than a preset threshold, determine the intermediate data as the third data;
若所述散度值不小于所述预设阈值,则根据所述散度值,执行基于所述预设神经网络中的生成器对所述第二数据进行转换的步骤,直到所述散度值小于预设阈值。If the divergence value is not less than the preset threshold, perform the step of converting the second data based on the generator in the preset neural network according to the divergence value, until the divergence The value is less than the preset threshold.
进一步地,所述确定单元还用于:Further, the determining unit is also used for:
查找对所述第二数据首次转换的第一时间点,以及确定所述第三数据的第二时间点;Find the first time point of the first conversion of the second data, and determine the second time point of the third data;
根据所述第一时间点和所述第二时间点,生成与所述第三数据对应的第一转换时间。According to the first time point and the second time point, a first conversion time corresponding to the third data is generated.
进一步地,所述因果关系的确定装置还包括:Further, the device for determining the causal relationship also includes:
生成模块,用于获取具有预设因果关系的第一数据集和第二数据集,并基于预设神经网络,对所述第一数据集和所述第二数据集进行处理,生成处理结果;a generating module, configured to obtain a first data set and a second data set with a preset causal relationship, and based on a preset neural network, process the first data set and the second data set to generate a processing result;
判断模块,用于判断所述处理结果中的因果关系,与所述预设因果关系是否一致,若一致,则基于预设神经网络对第一数据和第二数据处理。The judgment module is configured to judge whether the causal relationship in the processing result is consistent with the preset causal relationship, and if so, process the first data and the second data based on the preset neural network.
本申请因果关系的确定装置具体实施方式与上述因果关系的确定方法各实施例基本相同,在此不再赘述。The specific implementation manner of the device for determining a causal relationship of the present application is basically the same as the embodiments of the above-mentioned methods for determining a causal relationship, and details are not repeated here.
此外,本申请实施例还提出一种可读存储介质。In addition, an embodiment of the present application also provides a readable storage medium.
可读存储介质上存储有因果关系的确定程序,因果关系的确定程序被处理器执行时实现如上所述的因果关系的确定方法的步骤。A program for determining a causal relationship is stored on the readable storage medium, and when the program for determining a causal relationship is executed by a processor, the steps of the method for determining a causal relationship as described above are implemented.
本申请可读存储介质可以是计算机可读存储介质,其具体实施方式与上述因果关系的确定方法各实施例基本相同,在此不再赘述。The readable storage medium of the present application may be a computer-readable storage medium, and its specific implementation is basically the same as that of the foregoing embodiments of the causal relationship determination method, and details are not described herein again.
上面结合附图对本申请的实施例进行了描述,但是本申请并不局限于上述的具体实施方式,上述的具体实施方式仅仅是示意性的,而不是限制性的,本领域的普通技术人员在本申请的启示下,在不脱离本申请宗旨和权利要求所保护的范围情况下,还可做出很多形式,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,这些均属于本申请的保护之内。The embodiments of the present application have been described above in conjunction with the accompanying drawings, but the present application is not limited to the above-mentioned specific embodiments, which are merely illustrative rather than restrictive. Under the inspiration of this application, without departing from the scope of protection of the purpose of this application and the claims, many forms can be made. Directly or indirectly applied in other related technical fields, these all fall within the protection of this application.

Claims (20)

  1. 一种因果关系的确定方法,其中,所述因果关系的确定方法包括以下步骤:A method for determining a causal relationship, wherein the method for determining a causal relationship comprises the following steps:
    获取第一数据和第二数据,并基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据;Obtain the first data and the second data, and convert the second data based on the preset neural network and the first data to obtain the first data matching the divergence of the preset neural network with the first data. three data;
    基于所述预设神经网络和第二数据,对所述第一数据进行转换,得到与所述第二数据在所述预设神经网络的散度上匹配的第四数据;Based on the preset neural network and the second data, converting the first data to obtain fourth data matching the divergence of the second data in the preset neural network;
    获取所述第二数据与所述第三数据之间转换的第一转换时间,以及所述第一数据与所述第四数据之间转换的第二转换时间,并根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系。Acquiring a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and according to the first conversion time The corresponding first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient corresponding to the second conversion time determine the causal relationship between the first data and the second data.
  2. 如权利要求1所述的因果关系的确定方法,其中,所述根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系的步骤包括:The method for determining a causal relationship according to claim 1, wherein the first conversion difficulty degree coefficient corresponding to the first conversion time is based on the second conversion difficulty degree coefficient corresponding to the second conversion time. , the step of determining the causal relationship between the first data and the second data includes:
    将所述第一转换难易程度系数与所述第二转换难易程度系数对比,确定所述第一转换难易程度系数与所述第二转换难易程度系数之间的大小关系;Comparing the first conversion difficulty degree coefficient with the second conversion difficulty degree coefficient, and determining the magnitude relationship between the first conversion difficulty degree coefficient and the second conversion difficulty degree coefficient;
    若所述大小关系为第一转换难易程度系数大于第二转换难易程度系数,则确定所述第一数据构成所述因果关系的原因,所述第二数据构成所述因果关系中的结果;If the magnitude relationship is that the first conversion difficulty degree coefficient is greater than the second conversion difficulty degree coefficient, determine the reason why the first data constitutes the causal relationship, and the second data constitutes the result in the causal relationship ;
    若所述大小关系为第一转换难易程度系数小于第二转换难易程度系数,则确定所述第二数据构成所述因果关系的原因,所述第一数据构成所述因果关系中的结果。If the magnitude relationship is that the first conversion difficulty degree coefficient is smaller than the second conversion difficulty degree coefficient, determine the reason why the second data constitutes the causal relationship, and the first data constitutes the result in the causal relationship .
  3. 如权利要求1所述的因果关系的确定方法,其中,所述基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据的步骤包括:The method for determining a causal relationship according to claim 1, wherein the second data is converted based on a preset neural network and the first data to obtain the first data in the preset neural network. The steps of matching the third data on the divergence include:
    将所述第一数据作为指引,基于所述预设神经网络中的生成器对所述第二数据进行转换,得到中间数据;Using the first data as a guide, convert the second data based on the generator in the preset neural network to obtain intermediate data;
    基于预设神经网络中的判别器,对所述中间数据进行判别,确定所述第一数据与所述中间数据之间的散度值;Based on the discriminator in the preset neural network, the intermediate data is discriminated, and the divergence value between the first data and the intermediate data is determined;
    根据所述散度值,确定与所述第一数据在所述预设神经网络的散度上匹配的第三数据。According to the divergence value, third data matching the divergence of the preset neural network with the first data is determined.
  4. 如权利要求3所述的因果关系的确定方法,其中,所述基于预设神经网络中的判别器,对所述中间数据进行判别,确定所述第一数据与所述中间数据之间的散度值的步骤包括:The method for determining a causal relationship according to claim 3, wherein the intermediate data is discriminated based on a discriminator in a preset neural network, and the dispersion between the first data and the intermediate data is determined. The steps of the degree value include:
    基于预设神经网络中的判别器,对所述中间数据进行判别,生成所述中间数据的第一概率分布;based on the discriminator in the preset neural network, discriminate the intermediate data, and generate a first probability distribution of the intermediate data;
    获取所述第一数据的第二概率分布,并根据所述第一概率分布和所述第二概率分布,生成所述第一数据与所述中间数据之间的散度值。A second probability distribution of the first data is acquired, and a divergence value between the first data and the intermediate data is generated according to the first probability distribution and the second probability distribution.
  5. 如权利要求3所述的因果关系的确定方法,其中,所述根据所述散度值,确定与所述第一数据在所述预设神经网络的散度上匹配的第三数据的步骤包括:The method for determining a causal relationship according to claim 3, wherein the step of determining, according to the divergence value, third data matching the divergence of the preset neural network with the first data comprises the following steps: :
    判断所述散度值是否小于预设阈值,若小于预设阈值,则将所述中间数据确定为所述第三数据;Determine whether the divergence value is less than a preset threshold, and if it is less than a preset threshold, determine the intermediate data as the third data;
    若所述散度值不小于所述预设阈值,则根据所述散度值,执行基于所述预设神经网络中的生成器对所述第二数据进行转换的步骤,直到所述散度值小于预设阈值。If the divergence value is not less than the preset threshold, perform the step of converting the second data based on the generator in the preset neural network according to the divergence value, until the divergence The value is less than the preset threshold.
  6. 如权利要求5所述的因果关系的确定方法,其中,所述将所述中间数据确定为所述第三数据的步骤之后,所述方法包括:The method for determining a causal relationship according to claim 5, wherein after the step of determining the intermediate data as the third data, the method comprises:
    查找对所述第二数据首次转换的第一时间点,以及确定所述第三数据的第二时间点;Find the first time point of the first conversion of the second data, and determine the second time point of the third data;
    根据所述第一时间点和所述第二时间点,生成与所述第三数据对应的第一转换时间。According to the first time point and the second time point, a first conversion time corresponding to the third data is generated.
  7. 如权利要求1-6任一项所述的因果关系的确定方法,其中,所述获取第一数据和第二数据的步骤之前,所述方法还包括:The method for determining a causal relationship according to any one of claims 1-6, wherein, before the step of acquiring the first data and the second data, the method further comprises:
    获取具有预设因果关系的第一数据集和第二数据集,并基于预设神经网络,对所述第一数据集和所述第二数据集进行处理,生成处理结果;acquiring a first data set and a second data set with a preset causal relationship, and processing the first data set and the second data set based on a preset neural network to generate a processing result;
    判断所述处理结果中的因果关系,与所述预设因果关系是否一致,若一致,则基于预设神经网络对第一数据和第二数据处理。It is judged whether the causal relationship in the processing result is consistent with the preset causal relationship, and if so, the first data and the second data are processed based on the preset neural network.
  8. 一种因果关系的确定装置,其中,所述因果关系的确定装置包括:A device for determining a causal relationship, wherein the device for determining a causal relationship includes:
    获取模块,用于获取第一数据和第二数据,并基于预设神经网络和第一数据,对所述第二数据进行转换,得到与所述第一数据在所述预设神经网络的散度上匹配的第三数据;The acquisition module is used to acquire the first data and the second data, and based on the preset neural network and the first data, convert the second data to obtain the dispersion of the first data in the preset neural network. degree of matching third data;
    转换模块,用于基于所述预设神经网络和第二数据,对所述第一数据进行转换,得到与所述第二数据在所述预设神经网络的散度上匹配的第四数据;a conversion module, configured to convert the first data based on the preset neural network and the second data to obtain fourth data matching the second data in the divergence of the preset neural network;
    确定模块,用于获取所述第二数据与所述第三数据之间转换的第一转换时间,以及所述第一数据与所述第四数据之间转换的第二转换时间,并根据所述第一转换时间对应的第一转换难易程度系数,与所述第二转换时间对应的第二转换难易程度系数,确定所述第一数据与所述第二数据之间的因果关系。A determination module, configured to obtain a first conversion time for conversion between the second data and the third data, and a second conversion time for conversion between the first data and the fourth data, and according to the A first conversion difficulty degree coefficient corresponding to the first conversion time and a second conversion difficulty degree coefficient corresponding to the second conversion time are used to determine a causal relationship between the first data and the second data.
  9. 一种因果关系的确定设备,其中,所述因果关系的确定设备包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的因果关系的确定程序,所述因果关系的确定程序被所述处理器执行时实现如权利要求1所述因果关系的确定方法的步骤。A device for determining a causal relationship, wherein the device for determining a causal relationship includes a memory, a processor, and a program for determining a causal relationship that is stored on the memory and can be run on the processor. When the determination program is executed by the processor, the steps of implementing the method for determining a causal relationship according to claim 1 are implemented.
  10. 一种因果关系的确定设备,其中,所述因果关系的确定设备包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的因果关系的确定程序,所述因果关系的确定程序被所述处理器执行时实现如权利要求2所述因果关系的确定方法的步骤。A device for determining a causal relationship, wherein the device for determining a causal relationship includes a memory, a processor, and a program for determining a causal relationship that is stored on the memory and can be run on the processor. When the determination program is executed by the processor, the steps of implementing the method for determining a causal relationship as claimed in claim 2 are implemented.
  11. 一种因果关系的确定设备,其中,所述因果关系的确定设备包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的因果关系的确定程序,所述因果关系的确定程序被所述处理器执行时实现如权利要求3所述因果关系的确定方法的步骤。A device for determining a causal relationship, wherein the device for determining a causal relationship includes a memory, a processor, and a program for determining a causal relationship that is stored on the memory and can be run on the processor. When the determination program is executed by the processor, the steps of implementing the method for determining a causal relationship as claimed in claim 3 are implemented.
  12. 一种因果关系的确定设备,其中,所述因果关系的确定设备包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的因果关系的确定程序,所述因果关系的确定程序被所述处理器执行时实现如权利要求4所述因果关系的确定方法的步骤。A device for determining a causal relationship, wherein the device for determining a causal relationship includes a memory, a processor, and a program for determining a causal relationship that is stored on the memory and can be run on the processor. When the determination program is executed by the processor, the steps of implementing the method for determining a causal relationship as claimed in claim 4 are implemented.
  13. 一种因果关系的确定设备,其中,所述因果关系的确定设备包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的因果关系的确定程序,所述因果关系的确定程序被所述处理器执行时实现如权利要求5所述因果关系的确定方法的步骤。A device for determining a causal relationship, wherein the device for determining a causal relationship includes a memory, a processor, and a program for determining a causal relationship that is stored on the memory and can be run on the processor. When the determination program is executed by the processor, the steps of implementing the method for determining a causal relationship as claimed in claim 5 are implemented.
  14. 一种因果关系的确定设备,其中,所述因果关系的确定设备包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的因果关系的确定程序,所述因果关系的确定程序被所述处理器执行时实现如权利要求6所述因果关系的确定方法的步骤。A device for determining a causal relationship, wherein the device for determining a causal relationship includes a memory, a processor, and a program for determining a causal relationship that is stored on the memory and can be run on the processor. When the determination program is executed by the processor, the steps of implementing the method for determining a causal relationship as claimed in claim 6 are implemented.
  15. 一种可读存储介质,其中,所述可读存储介质上存储有因果关系的确定程序,所述因果关系的确定程序被处理器执行时实现如权利要求1所述因果关系的确定方法的步骤。A readable storage medium, wherein a program for determining a causal relationship is stored on the readable storage medium, and when the program for determining a causal relationship is executed by a processor, the steps of the method for determining a causal relationship according to claim 1 are implemented .
  16. 一种可读存储介质,其中,所述可读存储介质上存储有因果关系的确定程序,所述因果关系的确定程序被处理器执行时实现如权利要求2所述因果关系的确定方法的步骤。A readable storage medium, wherein a program for determining a causal relationship is stored on the readable storage medium, and when the program for determining a causal relationship is executed by a processor, the steps of the method for determining a causal relationship according to claim 2 are implemented .
  17. 一种可读存储介质,其中,所述可读存储介质上存储有因果关系的确定程序,所述因果关系的确定程序被处理器执行时实现如权利要求3所述因果关系的确定方法的步骤。A readable storage medium, wherein a program for determining a causal relationship is stored on the readable storage medium, and when the program for determining a causal relationship is executed by a processor, the steps of the method for determining a causal relationship according to claim 3 are implemented .
  18. 一种可读存储介质,其中,所述可读存储介质上存储有因果关系的确定程序,所述因果关系的确定程序被处理器执行时实现如权利要求4所述因果关系的确定方法的步骤。A readable storage medium, wherein a program for determining a causal relationship is stored on the readable storage medium, and when the program for determining a causal relationship is executed by a processor, the steps of the method for determining a causal relationship according to claim 4 are implemented .
  19. 一种可读存储介质,其中,所述可读存储介质上存储有因果关系的确定程序,所述因果关系的确定程序被处理器执行时实现如权利要求5所述因果关系的确定方法的步骤。A readable storage medium, wherein a program for determining a causal relationship is stored on the readable storage medium, and when the program for determining a causal relationship is executed by a processor, the steps of the method for determining a causal relationship according to claim 5 are implemented .
  20. 一种可读存储介质,其中,所述可读存储介质上存储有因果关系的确定程序,所述因果关系的确定程序被处理器执行时实现如权利要求6所述因果关系的确定方法的步骤。A readable storage medium, wherein a program for determining a causal relationship is stored on the readable storage medium, and when the program for determining a causal relationship is executed by a processor, the steps of the method for determining a causal relationship according to claim 6 are implemented .
PCT/CN2021/094934 2020-08-27 2021-05-20 Method, apparatus and device for determining causal relationship, and readable storage medium WO2022041866A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010891985.7 2020-08-27
CN202010891985.7A CN112000856A (en) 2020-08-27 2020-08-27 Method, device and equipment for determining causal relationship and readable storage medium

Publications (1)

Publication Number Publication Date
WO2022041866A1 true WO2022041866A1 (en) 2022-03-03

Family

ID=73465986

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/094934 WO2022041866A1 (en) 2020-08-27 2021-05-20 Method, apparatus and device for determining causal relationship, and readable storage medium

Country Status (2)

Country Link
CN (1) CN112000856A (en)
WO (1) WO2022041866A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112000856A (en) * 2020-08-27 2020-11-27 深圳前海微众银行股份有限公司 Method, device and equipment for determining causal relationship and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101408769A (en) * 2008-11-21 2009-04-15 冶金自动化研究设计院 On-line energy forecasting system and method based on product ARIMA model
US20110167031A1 (en) * 2008-05-21 2011-07-07 New York University Method, system, and computer-accessible medium for inferring and/or determining causation in time course data with temporal logic
US20190294671A1 (en) * 2018-03-20 2019-09-26 Wipro Limited Method and device for extracting causal from natural language sentences for intelligent systems
CN111488740A (en) * 2020-03-27 2020-08-04 北京百度网讯科技有限公司 Causal relationship judging method and device, electronic equipment and storage medium
CN112000856A (en) * 2020-08-27 2020-11-27 深圳前海微众银行股份有限公司 Method, device and equipment for determining causal relationship and readable storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110167031A1 (en) * 2008-05-21 2011-07-07 New York University Method, system, and computer-accessible medium for inferring and/or determining causation in time course data with temporal logic
CN101408769A (en) * 2008-11-21 2009-04-15 冶金自动化研究设计院 On-line energy forecasting system and method based on product ARIMA model
US20190294671A1 (en) * 2018-03-20 2019-09-26 Wipro Limited Method and device for extracting causal from natural language sentences for intelligent systems
CN111488740A (en) * 2020-03-27 2020-08-04 北京百度网讯科技有限公司 Causal relationship judging method and device, electronic equipment and storage medium
CN112000856A (en) * 2020-08-27 2020-11-27 深圳前海微众银行股份有限公司 Method, device and equipment for determining causal relationship and readable storage medium

Also Published As

Publication number Publication date
CN112000856A (en) 2020-11-27

Similar Documents

Publication Publication Date Title
CN107608877B (en) Automatic application program interface testing method and system based on machine learning
US20200019816A1 (en) Classification method and classification device of indoor scene
EP3311311A1 (en) Automatic entity resolution with rules detection and generation system
CN112884016B (en) Cloud platform credibility assessment model training method and cloud platform credibility assessment method
CN109766259B (en) Classifier testing method and system based on composite metamorphic relation
CN111340233B (en) Training method and device of machine learning model, and sample processing method and device
CN111444094B (en) Test data generation method and system
US11914507B2 (en) Software test apparatus and software test method
CN112559538A (en) Incidence relation generation method and device, computer equipment and storage medium
WO2022041866A1 (en) Method, apparatus and device for determining causal relationship, and readable storage medium
WO2020173270A1 (en) Method and device used for parsing data and computer storage medium
CN113541985A (en) Internet of things fault diagnosis method, training method of model and related device
WO2021228152A1 (en) Classification model information output
CN111783812A (en) Method and device for identifying forbidden images and computer readable storage medium
CN113657538A (en) Model training method, data classification method, device, equipment, storage medium and product
Kusa et al. Vombat: A tool for visualising evaluation measure behaviour in high-recall search tasks
US11868440B1 (en) Statistical model training systems
US20240095581A1 (en) Processing method, processing system, and processing program
CN115016950A (en) Data analysis method and system based on multithreading model
Farhad et al. Keep your distance: determining sampling and distance thresholds in machine learning monitoring
CN109446325B (en) Natural language classifier system with high accuracy
JPWO2021038840A5 (en)
CN111178443B (en) Model parameter selection, image classification and information identification methods, devices and equipment
CN117743568B (en) Content generation method and system based on fusion of resource flow and confidence
CN115391084B (en) Intelligent solution method and system for cloud game abnormity

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21859711

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21859711

Country of ref document: EP

Kind code of ref document: A1