CN110532302B - Audit method, system and readable storage medium - Google Patents

Publication number
CN110532302B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910815699.XA
Other languages
Chinese (zh)
Other versions
CN110532302A (en)
Inventor
黄楚维
谢志林
冯挺
闭秀萍
韦海玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanning Power Supply Bureau of Guangxi Power Grid Co Ltd
Original Assignee
Nanning Power Supply Bureau of Guangxi Power Grid Co Ltd

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465 Query processing support for facilitating data mining operations in structured databases
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25 Integrating or interfacing systems involving database management systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management

Abstract

The invention provides an auditing method, system and readable storage medium. The method comprises: obtaining an audit data text; performing word segmentation on the audit data text to obtain a word segmentation result; obtaining a data set to be checked according to the word segmentation result, wherein the data set to be checked comprises at least one group of data to be checked, and each group of data to be checked comprises an audit data type and its corresponding number; determining a corresponding calculation formula according to the audit data type; and applying the calculation formula to the number corresponding to the audit data type to obtain an audit result.

Description

Audit method, system and readable storage medium
Technical Field
The invention relates to the technical field of auditing, in particular to an auditing method, an auditing system and a readable storage medium.
Background
In recent years, with the rapid development of computer technology and informatization, economic management activities have become informatized in both breadth and depth. Auditing serves to supervise, evaluate and verify economic activities, and conventional manual auditing can no longer meet auditing requirements under informatized conditions. The informatization of audit objects and the development of auditing itself both require auditing practices to keep pace with the times and adjust accordingly. It is therefore necessary to follow the informatization trend, update audit supervision concepts, and innovate auditing methods.
How to extract audit data from text containing it and perform the corresponding calculations, so as to assist traditional manual auditing and improve audit practice and efficiency, is a problem to be solved at present.
Disclosure of Invention
To solve at least one of the above technical problems, the present invention proposes an auditing method, system and readable storage medium.
To achieve the above object, a first aspect of the present invention proposes an auditing method, the method comprising:
obtaining an audit data text;
performing word segmentation on the audit data text to obtain a word segmentation result; obtaining a data set to be checked according to the word segmentation result; the data set to be checked comprises: at least one group of data to be checked; the data to be checked comprises: an audit data type and its corresponding number;
determining a corresponding calculation formula according to the audit data type; and carrying out corresponding calculation according to the numbers corresponding to the audit data types by using the calculation formula to obtain an audit result.
Further, after the obtaining the data set to be examined, the method further includes:
determining a preset auditing rule, inquiring the data set to be audited according to the auditing rule, determining data to be audited conforming to the auditing rule, and determining a target data set to be audited according to the data to be audited conforming to the auditing rule for auditing.
Further, the determining the data to be inspected, which accords with the auditing rule, includes:
determining characteristic data of the data to be checked in the data set to be checked;
matching the feature data of the data to be checked with the matching features in the auditing rules, and determining the matching degree of the feature data of the data to be checked and the matching features;
and when the matching degree is higher than a preset threshold value, determining that the data to be checked conforms to the auditing rule.
Further, the determining the matching degree of the feature data of the data to be checked and the matching feature includes:
calculating the matching degree of the feature data of the data to be checked and the matching feature by using the following formula:
wherein m is the same character matching degree between the feature data of the data to be checked and the matching feature, t is the editing distance between the matching sequence of the feature data of the data to be checked and the matching sequence of the matching feature, and SA and SB are the feature data of the data to be checked and the character string length of the matching feature respectively;
wherein, the calculation formula of m is as follows:
wherein N represents the total number of characters successfully matched between the feature data of the data to be checked and the matching feature, and Δ (a, i+1, i, b) represents the position interval between the (i+1) th character and the corresponding character in the matching feature in the matching sequence of the feature data of the data to be checked.
Further, before the applying the calculation formula and performing corresponding calculation according to the number corresponding to the audit data type, the method further includes:
determining a first format of the number;
determining a second format of the number corresponding to the calculation formula;
judging whether the first format and the second format are the same, if not, converting the first format into the second format;
correspondingly, the applying the calculation formula, and performing corresponding calculation according to the numbers corresponding to the audit data types, includes:
and carrying out corresponding calculation according to the numbers in the second format corresponding to the audit data type by using the calculation formula.
The second aspect of the present invention also proposes an auditing system, including: the system comprises a memory and a processor, wherein the memory comprises an auditing method program which is executed by the processor to realize the following steps:
obtaining an audit data text;
performing word segmentation on the audit data text to obtain a word segmentation result; obtaining a data set to be checked according to the word segmentation result; the data set to be checked comprises: at least one group of data to be checked; the data to be checked comprises: an audit data type and its corresponding number;
determining a corresponding calculation formula according to the audit data type; and carrying out corresponding calculation according to the numbers corresponding to the audit data types by using the calculation formula to obtain an audit result.
Further, after the obtaining the data set to be examined, the method further includes:
determining a preset auditing rule, inquiring the data set to be audited according to the auditing rule, determining data to be audited conforming to the auditing rule, and determining a target data set to be audited according to the data to be audited conforming to the auditing rule for auditing.
Further, the determining the data to be inspected, which accords with the auditing rule, includes:
determining characteristic data of the data to be checked in the data set to be checked;
matching the feature data of the data to be checked with the matching features in the auditing rules, and determining the matching degree of the feature data of the data to be checked and the matching features;
and when the matching degree is higher than a preset threshold value, determining that the data to be checked conforms to the auditing rule.
Further, the determining the matching degree of the feature data of the data to be checked and the matching feature includes:
calculating the matching degree of the feature data of the data to be checked and the matching feature by using the following formula:
wherein m is the same character matching degree between the feature data of the data to be checked and the matching feature, t is the editing distance between the matching sequence of the feature data of the data to be checked and the matching sequence of the matching feature, and SA and SB are the feature data of the data to be checked and the character string length of the matching feature respectively;
wherein, the calculation formula of m is as follows:
wherein N represents the total number of characters successfully matched between the feature data of the data to be checked and the matching feature, and Δ (a, i+1, i, b) represents the position interval between the (i+1) th character and the corresponding character in the matching feature in the matching sequence of the feature data of the data to be checked.
The third aspect of the present invention also proposes a computer-readable storage medium, in which an auditing method program is included, which, when executed by a processor, implements the steps of an auditing method as described in any of the above.
The embodiment of the invention provides an auditing method, an auditing system and a storage medium, which are used for acquiring audit data text; word segmentation processing is carried out on the audit data text, and word segmentation results are obtained; obtaining a data set to be checked according to the word segmentation result; the data set to be examined comprises: at least one set of data to be reviewed; the data to be examined comprises: audit data type and corresponding number; determining a corresponding calculation formula according to the audit data type; and carrying out corresponding calculation according to the numbers corresponding to the audit data types by using the calculation formula to obtain an audit result. By the auditing method, the extraction of relevant data from the auditing data text is realized, corresponding calculation is carried out according to the type of the relevant data, and an auditing result is obtained, so that manual auditing is facilitated, the labor cost is saved, and the auditing efficiency is improved.
Additional aspects and advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.
Drawings
FIG. 1 is a schematic flow chart of an auditing method according to an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of an audit system according to an embodiment of the present invention.
Detailed Description
In order that the above-recited objects, features and advantages of the present invention will be more clearly understood, a more particular description of the invention will be rendered by reference to the appended drawings and appended detailed description. It should be noted that, in the case of no conflict, the embodiments of the present application and the features in the embodiments may be combined with each other.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than those described herein, and therefore the scope of the present invention is not limited to the specific embodiments disclosed below.
FIG. 1 is a schematic flow chart of an auditing method according to an embodiment of the present invention. As shown in FIG. 1, the method can be applied to intelligent electronic devices, such as servers and computers, on which an auditing system is installed. The method comprises the following steps:
Step 101, obtaining an audit data text.
Step 102, performing word segmentation on the audit data text to obtain a word segmentation result; obtaining a data set to be checked according to the word segmentation result; the data set to be checked comprises: at least one group of data to be checked; the data to be checked comprises: an audit data type and its corresponding number.
Step 103, determining a corresponding calculation formula according to the audit data type; and applying the calculation formula to the number corresponding to the audit data type to obtain an audit result.
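Steps 101 to 103 can be sketched end to end as follows. This is only an illustrative sketch under stated assumptions: the keyword table `KEYWORD_TO_TYPE`, the formula table `FORMULAS`, the function names, and the use of simple regular-expression tokenisation in place of real word segmentation are all hypothetical and are not specified by the embodiment.

```python
import re
from typing import Callable, Dict, List

# Hypothetical keyword -> audit data type table (assumption, not from the patent).
KEYWORD_TO_TYPE: Dict[str, str] = {
    "travel": "travel_expense_reimbursement",
    "salary": "salary_payment",
}

# Hypothetical per-type formulas; each one simply totals the extracted numbers.
FORMULAS: Dict[str, Callable[[List[float]], float]] = {
    "travel_expense_reimbursement": sum,
    "salary_payment": sum,
}


def segment(text: str) -> List[str]:
    """Step 102 stand-in: split into alphabetic words and decimal numbers."""
    return re.findall(r"\d+(?:\.\d+)?|[A-Za-z]+", text)


def extract_records(tokens: List[str]) -> Dict[str, List[float]]:
    """Group each number under the most recently seen audit data type keyword."""
    records: Dict[str, List[float]] = {}
    current_type = None
    for token in tokens:
        key = token.lower()
        if key in KEYWORD_TO_TYPE:
            current_type = KEYWORD_TO_TYPE[key]
            records.setdefault(current_type, [])
        elif current_type is not None and token[0].isdigit():
            records[current_type].append(float(token))
    return records


def audit(text: str) -> Dict[str, float]:
    """Step 103: pick the formula by audit data type and apply it to the numbers."""
    records = extract_records(segment(text))
    return {audit_type: FORMULAS[audit_type](numbers)
            for audit_type, numbers in records.items()}


result = audit("Travel 120.50 and 80; salary 3000")
```

Keeping the keyword table and the formula table as separate lookups mirrors the two stored correspondences the description relies on: keyword to audit data type, and audit data type to calculation formula.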
Specifically, after the data set to be checked is obtained in step 102, the method further includes:
determining a preset auditing rule, querying the data set to be checked according to the auditing rule, determining the data to be checked that conforms to the auditing rule, and determining a target data set to be checked from that data for auditing.
Correspondingly, in step 103, determining the calculation formula according to the audit data type and applying it to the corresponding number to obtain the audit result are performed on the target data set to be checked.
The target data set to be checked comprises at least one group of data to be checked; the data to be checked comprises: an audit data type and its corresponding number.
The at least one group of data to be checked in the target data set belongs to the data set to be checked.
These steps screen the data in the data set to be checked so that only the data the user needs is audited, which improves working efficiency.
The preset audit rule may be preset and stored by the user.
Specifically, the determining the data to be inspected, which accords with the auditing rule, includes:
determining characteristic data of the data to be checked in the data set to be checked;
matching the feature data of the data to be checked with the matching features in the auditing rules, and determining the matching degree of the feature data of the data to be checked and the matching features;
and when the matching degree is higher than a preset threshold value, determining that the data to be checked conforms to the auditing rule.
The matching features in the auditing rule may be preset and saved by the user. They may comprise characters, identifiers that refer to specific content, and the like; any form is acceptable as long as the data to be checked can be screened based on the matching features, and no limitation is imposed here.
Correspondingly, the feature data of the data to be checked may also comprise characters, identifiers that refer to specific content, and the like, as long as the data to be checked can be screened in combination with the matching features. The feature data of the data to be checked may be determined based on at least one word in the word segmentation result, or based on a combination of several words in the word segmentation result.
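The threshold-based screening described above can be sketched as follows. The similarity used here is `difflib.SequenceMatcher.ratio()` from the Python standard library, chosen only as a stand-in that returns a value between 0 and 1; it is not the matching-degree formula of this embodiment. The `Record` type, the `screen` function and the threshold value are hypothetical.

```python
from difflib import SequenceMatcher
from typing import List, NamedTuple


class Record(NamedTuple):
    """One group of data to be checked (hypothetical layout)."""
    feature: str      # feature data taken from the word segmentation result
    audit_type: str   # audit data type
    number: float     # corresponding number


def matching_degree(feature: str, matching_feature: str) -> float:
    # Stand-in similarity in [0, 1]; NOT the patent's own formula.
    return SequenceMatcher(None, feature, matching_feature).ratio()


def screen(records: List[Record], matching_feature: str, threshold: float) -> List[Record]:
    """Keep only records whose matching degree is higher than the threshold."""
    return [r for r in records
            if matching_degree(r.feature, matching_feature) > threshold]


data_set = [
    Record("travel expense", "travel", 120.5),
    Record("travel expenses", "travel", 80.0),
    Record("salary", "salary", 3000.0),
]
target_set = screen(data_set, "travel expense", threshold=0.8)
```

Using a strict "greater than" comparison follows the wording "higher than a preset threshold value".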
Specifically, the determining the matching degree of the feature data of the data to be checked and the matching feature includes:
calculating the matching degree of the feature data of the data to be checked and the matching feature by using the following formula:
wherein m is the same character matching degree between the feature data of the data to be checked and the matching feature, t is the editing distance between the matching sequence of the feature data of the data to be checked and the matching sequence of the matching feature, and SA and SB are the feature data of the data to be checked and the character string length of the matching feature respectively;
wherein, the calculation formula of m is as follows:
wherein N represents the total number of characters successfully matched between the feature data of the data to be checked and the matching feature, and Δ (a, i+1, i, b) represents the position interval between the (i+1) th character and the corresponding character in the matching feature in the matching sequence of the feature data of the data to be checked.
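The quantity t is described as an edit distance between two matching sequences. As a point of reference, the standard Levenshtein edit distance between two strings can be computed as in the sketch below. Whether the embodiment uses exactly this variant, and how t is combined with m, SA and SB, is not stated in this text, so the function is an assumption-labelled illustration only.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character insertions,
    deletions and substitutions needed to turn a into b (assumed variant)."""
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute or keep
            ))
        previous = current
    return previous[-1]


distance = edit_distance("kitten", "sitting")
```

Only two rows of the dynamic-programming table are kept at a time, so memory use grows with the length of the second string rather than with the product of both lengths.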
Specifically, in step 102, the word segmentation may be performed on the audit data text by any word segmentation method, for example the existing forward maximum matching method: assuming that the longest entry in the dictionary contains n Chinese characters, the first n characters of the character string to be processed are taken as the matching field and looked up in the dictionary. The word segmentation methods provided in this embodiment, described below, may also be used.
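A minimal sketch of forward maximum matching follows. It takes the longest candidate starting at the current position, shortens it one character at a time until a dictionary entry is found, and falls back to a single character when nothing matches. The function name and the sample dictionary are illustrative assumptions.

```python
from typing import List, Set


def forward_max_match(text: str, dictionary: Set[str]) -> List[str]:
    """Segment text greedily from left to right, preferring the longest dictionary word."""
    # n in the description: length of the longest entry in the dictionary.
    max_len = max((len(word) for word in dictionary), default=1)
    words: List[str] = []
    i = 0
    while i < len(text):
        # Try the longest matching field first, then shorten it by one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words


sample_dictionary = {"研究", "研究生", "生命", "的", "起源"}
segments = forward_max_match("研究生命的起源", sample_dictionary)
```

The sample input illustrates the known weakness of the greedy strategy: "研究生" is matched first, so "生命" can no longer be recovered. This kind of ambiguity is what more elaborate segmentation methods aim to resolve.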
Specifically, the word segmentation method may adopt any one of the following:
converting the text to be segmented into a character sequence; matching the character strings meeting the preset length contained in the character sequence with standard words in a pre-built dictionary, determining a matched character string matched with the standard words, and respectively distributing corresponding dictionary labels for each character of the matched character string and each character except the matched character string in the character sequence to obtain a dictionary label sequence; determining at least one word segmentation label corresponding to each character in the character sequence to obtain a plurality of word segmentation label sequences; determining the conditional probability that the character sequence is marked as each word segmentation tag sequence according to the character sequence, the dictionary tag sequence and a pre-trained conditional probability prediction model; determining a word segmentation tag sequence corresponding to the conditional probability meeting the preset condition as a target word segmentation tag sequence, and performing word segmentation processing on the text to be segmented based on the target word segmentation tag sequence.
Selecting, for a text to be segmented, at least one preset combination of word segmentation methods, wherein the combination comprises a basic word segmentation method and at least one of a word segmentation disambiguation optimization method, a single-word optimization method and a proper-noun optimization method; and segmenting the text to be segmented using the selected combination. When the selected combination comprises the basic word segmentation method and the word segmentation disambiguation optimization method, segmenting the text comprises: segmenting the text with the basic word segmentation method to obtain a basic word segmentation result; obtaining the intersecting token groups and the non-intersecting token groups contained in the basic word segmentation result; determining the non-intersecting token combinations within each intersecting token group; determining the unambiguous tokens according to the word-forming probability of each token in the non-intersecting token combinations; and taking the tokens in the non-intersecting token groups together with the unambiguous tokens as the word segmentation result of the text to be segmented.
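For the first of the two methods above, the step that assigns a dictionary label to every character can be sketched as follows. The label scheme (B, M, E for the beginning, middle and end of a matched word, S for a matched single character, O for characters outside any match) and the greedy longest-match strategy are assumptions made for illustration; the description does not fix either. The conditional probability prediction model is not sketched.

```python
from typing import List, Set


def dictionary_label_sequence(text: str, dictionary: Set[str], max_len: int) -> List[str]:
    """Label each character according to whether it falls inside a dictionary match.

    max_len plays the role of the preset length for candidate character strings.
    """
    labels = ["O"] * len(text)
    i = 0
    while i < len(text):
        matched = 0
        # Look for the longest dictionary word starting at position i.
        for size in range(min(max_len, len(text) - i), 0, -1):
            if text[i:i + size] in dictionary:
                matched = size
                break
        if matched == 0:
            i += 1                      # character stays labelled "O"
        elif matched == 1:
            labels[i] = "S"
            i += 1
        else:
            labels[i] = "B"
            for j in range(i + 1, i + matched - 1):
                labels[j] = "M"
            labels[i + matched - 1] = "E"
            i += matched
    return labels


labels = dictionary_label_sequence("电费缴纳单", {"电费", "缴纳"}, max_len=2)
```

The resulting label sequence has one label per character, so it can be paired position by position with the character sequence as input to a sequence-labelling model.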
Specifically, in step 103, before the applying the calculation formula and performing corresponding calculation according to the number corresponding to the audit data type, the method further includes:
determining a first format of the number;
determining a second format of the number corresponding to the calculation formula;
judging whether the first format and the second format are the same, if not, converting the first format into the second format;
correspondingly, the applying the calculation formula, and performing corresponding calculation according to the numbers corresponding to the audit data types, includes:
and carrying out corresponding calculation according to the numbers in the second format corresponding to the audit data type by using the calculation formula.
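The format check and conversion can be sketched as follows. The three format names, the detection rules and the choice of a plain decimal as the format expected by the formula are all hypothetical; the description only requires that the number's first format be compared with the formula's second format and converted when they differ.

```python
from decimal import Decimal

# Assumed second format: the plain decimal form expected by the calculation formula.
SECOND_FORMAT = "plain"


def detect_format(raw: str) -> str:
    """Determine the first format of a number as it appears in the text."""
    if raw.endswith("%"):
        return "percent"
    if "," in raw:
        return "thousands_separated"
    return "plain"


def to_second_format(raw: str) -> Decimal:
    """Convert the number to the second format only when the formats differ."""
    first_format = detect_format(raw)
    if first_format == SECOND_FORMAT:
        return Decimal(raw)
    if first_format == "percent":
        return Decimal(raw[:-1].replace(",", "")) / Decimal(100)
    # thousands_separated: drop the grouping commas.
    return Decimal(raw.replace(",", ""))


converted = [to_second_format(value) for value in ["1,234.50", "12%", "42"]]
```

`Decimal` is used rather than `float` so that monetary amounts are not subject to binary rounding error.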
Specifically, the audit data types include financial-domain business and cross-business-domain business, and the business data corresponding to different businesses may differ. For example, financial-domain business data may include daily expense reimbursement, travel expense reimbursement, engineering payments, electricity fee payments, salary payments and the like; cross-business-domain data may include engineering projects, engineering contracts, material contracts, engineering budgets, material receipt and issue lists, project settlement reports, marketing-finance reconciliation and electricity fee collection.
It should be noted that various audit data types may correspond to different keywords, the correspondence between the keywords and the audit data types may be preset and stored, and the audit data types may be determined according to the keywords by querying the correspondence. The keyword can be obtained according to the audit data text and can be one of word segmentation results.
Specifically, keywords are obtained through the word segmentation process, the corresponding relation is queried according to the keywords, and the audit data type corresponding to the keywords can be determined.
Here, the operations of step 103, namely determining a corresponding calculation formula according to the audit data type and applying it to the number corresponding to the audit data type to obtain an audit result, are further explained.
Formulas corresponding to the audit data types are stored on the server, so once the audit data type is determined, the corresponding calculation formula can be looked up and the calculation performed directly.
Before the calculation, numbers in a different format may not be usable in the formula and therefore need to be converted; the conversion method described above is used and is not repeated here.
After the conversion is completed, corresponding calculation can be performed according to the data and the formula, and a result is obtained.
By the method, manual audit is assisted, so that the working efficiency is improved, and the labor cost is saved.
In this embodiment, several methods for obtaining audit data text are provided, specifically, the method for obtaining audit data text includes at least one of the following:
transmitting a data acquisition instruction to at least one database corresponding to at least one audit object, and receiving audit data text transmitted by the at least one database;
sending an access instruction to at least one database corresponding to at least one audit object, and after receiving an approval message sent by the at least one database, accessing the at least one database using a WebService or HTTP service method to obtain the audit data text;
sending a data acquisition instruction to a central database, and receiving the audit data text sent by the central database; the central database is used for periodically acquiring the audit data text from at least one database corresponding to the at least one audit object;
and sending an access instruction to the central database, and after receiving the approval message sent by the central database, accessing the central database using a WebService or HTTP service method to obtain the audit data text.
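The "access instruction, approval message, then access" variants above can be sketched over plain HTTP as follows. The endpoint paths `/access-request` and `/audit-text`, the literal approval reply `APPROVED`, and the function names are hypothetical; the description does not define the message formats. The transport is passed in as a parameter so the flow can be exercised without a real database service.

```python
from typing import Callable, Optional
from urllib.request import urlopen


def http_get(url: str) -> str:
    """Plain HTTP GET using the Python standard library."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8")


def fetch_audit_text(base_url: str,
                     get: Callable[[str], str] = http_get) -> Optional[str]:
    """Send an access request and read the audit data text only if it is approved."""
    # Hypothetical endpoint standing in for the access instruction.
    reply = get(f"{base_url}/access-request")
    if reply.strip() != "APPROVED":
        return None  # no approval message received
    # Hypothetical endpoint that returns the audit data text.
    return get(f"{base_url}/audit-text")


# Usage with an in-memory stand-in for the database service.
fake_service = {
    "http://db.example/access-request": "APPROVED\n",
    "http://db.example/audit-text": "travel 120.50",
}
text = fetch_audit_text("http://db.example", get=fake_service.__getitem__)
```

The same function can be pointed at either an individual audit object's database or the central database, since the two variants differ only in which service is addressed.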
Fig. 2 is a block diagram of an audit system according to an embodiment of the present invention. As shown in fig. 2, the second aspect of the present invention further proposes an auditing system 2, the auditing system 2 comprising: the device comprises a memory 21 and a processor 22, wherein the memory 21 comprises an auditing method program which realizes the following steps when being executed by the processor 22:
obtaining an audit data text;
performing word segmentation on the audit data text to obtain a word segmentation result; obtaining a data set to be checked according to the word segmentation result; the data set to be checked comprises: at least one group of data to be checked; the data to be checked comprises: an audit data type and its corresponding number;
determining a corresponding calculation formula according to the audit data type; and carrying out corresponding calculation according to the numbers corresponding to the audit data types by using the calculation formula to obtain an audit result.
It should be noted that the system of the present invention may run on a terminal device such as a server, a PC, a mobile phone or a PAD.
It should be noted that the processor may be a Central Processing Unit (CPU), another general-purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field-Programmable Gate Array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, etc. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor or the like.
Further, after the obtaining the data set to be examined, the method further includes:
determining a preset auditing rule, inquiring the data set to be audited according to the auditing rule, determining data to be audited conforming to the auditing rule, and determining a target data set to be audited according to the data to be audited conforming to the auditing rule for auditing.
Further, the determining the data to be inspected, which accords with the auditing rule, includes:
determining characteristic data of the data to be checked in the data set to be checked;
matching the feature data of the data to be checked with the matching features in the auditing rules, and determining the matching degree of the feature data of the data to be checked and the matching features;
and when the matching degree is higher than a preset threshold value, determining that the data to be checked conforms to the auditing rule.
Further, the determining the matching degree of the feature data of the data to be checked and the matching feature includes:
calculating the matching degree of the feature data of the data to be checked and the matching feature by using the following formula:
wherein m is the same character matching degree between the feature data of the data to be checked and the matching feature, t is the editing distance between the matching sequence of the feature data of the data to be checked and the matching sequence of the matching feature, and SA and SB are the feature data of the data to be checked and the character string length of the matching feature respectively;
wherein, the calculation formula of m is as follows:
wherein N represents the total number of characters successfully matched between the feature data of the data to be checked and the matching feature, and Δ (a, i+1, i, b) represents the position interval between the (i+1) th character and the corresponding character in the matching feature in the matching sequence of the feature data of the data to be checked.
Further, before the applying the calculation formula and performing corresponding calculation according to the number corresponding to the audit data type, the method further includes:
determining a first format of the number;
determining a second format of the number corresponding to the calculation formula;
judging whether the first format and the second format are the same, if not, converting the first format into the second format;
correspondingly, the applying the calculation formula, and performing corresponding calculation according to the numbers corresponding to the audit data types, includes:
and carrying out corresponding calculation according to the numbers in the second format corresponding to the audit data type by using the calculation formula.
The third aspect of the present invention also proposes a computer-readable storage medium, in which an auditing method program is included, which, when executed by a processor, implements the steps of an auditing method as described above.
In the several embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. The device embodiments described above are only illustrative. For example, the division into units is only a division by logical function, and other divisions are possible in practice: multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the coupling, direct coupling or communication connection between the components shown or discussed may be through some interfaces, and the indirect coupling or communication connection between devices or units may be electrical, mechanical or in other forms.
The units described above as separate components may or may not be physically separate, and components shown as units may or may not be physical units; can be located in one place or distributed to a plurality of network units; some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in each embodiment of the present invention may be integrated in one processing unit, or each unit may be separately used as one unit, or two or more units may be integrated in one unit; the integrated units may be implemented in hardware or in hardware plus software functional units.
Those of ordinary skill in the art will appreciate that all or part of the steps of the above method embodiments may be implemented by hardware controlled by program instructions; the program may be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes any medium that can store program code, such as a removable storage device, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
Alternatively, if the above-described integrated units of the present invention are implemented in the form of software functional modules and sold or used as separate products, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solutions of the embodiments of the present invention, in essence or in the part that contributes over the prior art, may be embodied in the form of a software product that is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or part of the methods described in the embodiments of the present invention. The storage medium includes a removable storage device, ROM, RAM, a magnetic disk, an optical disk, or any other medium capable of storing program code.
The foregoing is merely illustrative of the present invention, and the protection scope of the present invention is not limited thereto; any variation or substitution that a person skilled in the art could readily conceive of falls within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (3)

1. An auditing method, the method comprising:
obtaining an audit data text;
performing word segmentation on the audit data text to obtain a word segmentation result; obtaining a data set to be checked according to the word segmentation result, wherein the data set to be checked comprises at least one group of data to be checked, and the data to be checked comprises an audit data type and a corresponding number;
determining a corresponding calculation formula according to the audit data type; applying the calculation formula to the number corresponding to the audit data type to obtain an audit result;
after obtaining the data set to be checked, the method further comprises:
determining a preset auditing rule, querying the data set to be checked according to the auditing rule, determining the data to be checked that conforms to the auditing rule, and determining, from the conforming data, a target data set to be checked for auditing;
wherein determining the data to be checked that conforms to the auditing rule comprises:
determining feature data of the data to be checked in the data set to be checked;
matching the feature data of the data to be checked with the matching features in the auditing rule, and determining the matching degree of the feature data of the data to be checked and the matching features;
when the matching degree is higher than a preset threshold, determining that the data to be checked conforms to the auditing rule;
the determining the matching degree of the feature data of the data to be checked and the matching feature comprises the following steps:
calculating the matching degree of the feature data of the data to be checked and the matching feature by using the following formula:
wherein m is the degree of matching of identical characters between the feature data of the data to be checked and the matching feature, t is the edit distance between the matching sequence of the feature data of the data to be checked and the matching sequence of the matching feature, and SA and SB are the character string lengths of the feature data of the data to be checked and of the matching feature, respectively;
wherein, the calculation formula of m is as follows:
wherein N represents the total number of characters successfully matched between the feature data of the data to be checked and the matching feature, and Δ (a, i+1, i, b) represents the position interval between the (i+1) th character and the corresponding character in the matching feature in the matching sequence of the feature data of the data to be checked;
before the calculation formula is applied to the number corresponding to the audit data type, the method further comprises:
determining a first format of the number;
determining a second format, namely the number format required by the calculation formula;
judging whether the first format and the second format are the same and, if not, converting the number from the first format into the second format;
correspondingly, applying the calculation formula to the number corresponding to the audit data type comprises:
applying the calculation formula to the number in the second format corresponding to the audit data type.
2. An auditing system, comprising a memory and a processor, wherein the memory stores an auditing method program which, when executed by the processor, implements the following steps:
obtaining an audit data text;
performing word segmentation on the audit data text to obtain a word segmentation result; obtaining a data set to be checked according to the word segmentation result, wherein the data set to be checked comprises at least one group of data to be checked, and the data to be checked comprises an audit data type and a corresponding number;
determining a corresponding calculation formula according to the audit data type; applying the calculation formula to the number corresponding to the audit data type to obtain an audit result;
after obtaining the data set to be checked, the steps further comprise:
determining a preset auditing rule, querying the data set to be checked according to the auditing rule, determining the data to be checked that conforms to the auditing rule, and determining, from the conforming data, a target data set to be checked for auditing;
wherein determining the data to be checked that conforms to the auditing rule comprises:
determining feature data of the data to be checked in the data set to be checked;
matching the feature data of the data to be checked with the matching features in the auditing rule, and determining the matching degree of the feature data of the data to be checked and the matching features;
when the matching degree is higher than a preset threshold, determining that the data to be checked conforms to the auditing rule;
the determining the matching degree of the feature data of the data to be checked and the matching feature comprises the following steps:
calculating the matching degree of the feature data of the data to be checked and the matching feature by using the following formula:
wherein m is the degree of matching of identical characters between the feature data of the data to be checked and the matching feature, t is the edit distance between the matching sequence of the feature data of the data to be checked and the matching sequence of the matching feature, and SA and SB are the character string lengths of the feature data of the data to be checked and of the matching feature, respectively;
wherein, the calculation formula of m is as follows:
wherein N represents the total number of characters successfully matched between the feature data of the data to be checked and the matching feature, and Δ (a, i+1, i, b) represents the position interval between the (i+1) th character and the corresponding character in the matching feature in the matching sequence of the feature data of the data to be checked;
before the calculation formula is applied to the number corresponding to the audit data type, the steps further comprise:
determining a first format of the number;
determining a second format, namely the number format required by the calculation formula;
judging whether the first format and the second format are the same and, if not, converting the number from the first format into the second format;
correspondingly, applying the calculation formula to the number corresponding to the audit data type comprises:
applying the calculation formula to the number in the second format corresponding to the audit data type.
3. A computer-readable storage medium storing an auditing method program which, when executed by a processor, implements the steps of the auditing method as claimed in claim 1.
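The main flow recited in claim 1 (segment the audit text, pair each audit data type with its number, look up a formula by type, compute a result) can be sketched as follows. Everything concrete here is an assumption made for illustration: the regex tokenizer stands in for a real word segmenter, and the type names `depreciation` and `surcharge`, their rates, and the function names `segment`, `build_dataset`, and `audit` do not come from the patent.

```python
import re
from decimal import Decimal

# Hypothetical formulas keyed by audit data type; the rates are invented.
FORMULAS = {
    "depreciation": lambda cost: cost * Decimal("0.10"),
    "surcharge": lambda base: base * Decimal("0.05"),
}

NUMBER = r"\d+(?:\.\d+)?"


def segment(text):
    """Split text into word and number tokens (stand-in for word segmentation)."""
    return re.findall(rf"[A-Za-z_]+|{NUMBER}", text)


def build_dataset(tokens):
    """Pair each recognised audit data type with the number that follows it."""
    records = []
    for tok, nxt in zip(tokens, tokens[1:]):
        if tok in FORMULAS and re.fullmatch(NUMBER, nxt):
            records.append((tok, Decimal(nxt)))
    return records


def audit(text):
    """Apply the formula selected by each record's type to its number."""
    return [(kind, FORMULAS[kind](num)) for kind, num in build_dataset(segment(text))]
```

For example, `audit("depreciation 2000, surcharge 300")` yields one result per recognised type; tokens that are not known audit data types are ignored.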
CN201910815699.XA 2019-08-30 2019-08-30 Audit method, system and readable storage medium Active CN110532302B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910815699.XA CN110532302B (en) 2019-08-30 2019-08-30 Audit method, system and readable storage medium

Publications (2)

Publication Number Publication Date
CN110532302A CN110532302A (en) 2019-12-03
CN110532302B true CN110532302B (en) 2024-01-19

Family

ID=68665619

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910815699.XA Active CN110532302B (en) 2019-08-30 2019-08-30 Audit method, system and readable storage medium

Country Status (1)

Country Link
CN (1) CN110532302B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106446076A (en) * 2016-09-07 2017-02-22 南京理工大学 Hierarchical clustering-based log audit method
CN106503102A (en) * 2016-10-17 2017-03-15 汉蓝(北京)科技有限公司 A kind of search engine formula audit analysis method
CN109598484A (en) * 2018-12-04 2019-04-09 广东电网有限责任公司 A kind of project under construction turns fixed assets number auditing method and device
CN109726272A (en) * 2018-12-20 2019-05-07 杭州数梦工场科技有限公司 Audit regulation recommended method and device
CN109741029A (en) * 2018-12-27 2019-05-10 广东电网有限责任公司 The building method and device in a kind of power grid enterprises' audit regulation storehouse

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7627878B2 (en) * 2005-12-23 2009-12-01 Eloda Inc. Method and System for automated auditing of advertising

Also Published As

Publication number Publication date
CN110532302A (en) 2019-12-03

Similar Documents

Publication Publication Date Title
AU2022204197B2 (en) Security weakness and infiltration detection and repair in obfuscated website content
CA3087534C (en) System and method for information extraction with character level features
CN110020422B (en) Feature word determining method and device and server
CN108572990B (en) Information pushing method and device
CN110532301B (en) Audit method, system and readable storage medium
CN111352907A (en) Method and device for analyzing pipeline file, computer equipment and storage medium
CN111753171A (en) Malicious website identification method and device
CN111859093A (en) Sensitive word processing method and device and readable storage medium
CN113381963A (en) Domain name detection method, device and storage medium
CN110543565A (en) Auditing method, system and readable storage medium based on convolutional neural network model
CN114386408A (en) Government affair sensitive information identification method, device, equipment, medium and program product
CN113010785A (en) User recommendation method and device
CN110532302B (en) Audit method, system and readable storage medium
CN109359274B (en) Method, device and equipment for identifying character strings generated in batch
CN110705258A (en) Text entity identification method and device
CN110941713A (en) Self-optimization financial information plate classification method based on topic model
CN113743982A (en) Advertisement putting scheme recommendation method and device, computer equipment and storage medium
CN114840872A (en) Secret text desensitization method and device, computer equipment and readable storage medium
CN108711073B (en) User analysis method, device and terminal
CN115840808B (en) Technological project consultation method, device, server and computer readable storage medium
CN113362151B (en) Data processing method and device for financial business, electronic equipment and storage medium
CN112783501A (en) Hot spot compiling unit determining method and device and server
CN113239194B (en) Patent evaluation method, system, storage medium and electronic device
CN112989810B (en) Text information identification method and device, server and storage medium
CN112015888B (en) Abstract information extraction method and abstract information extraction system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant