CN110532302A - Auditing method, system, and readable storage medium
- Publication number
- CN110532302A (CN201910815699.XA)
- Authority
- CN
- China
- Prior art keywords
- data
- unexamined
- characteristic
- audit
- matching
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2465—Query processing support for facilitating data mining operations in structured databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- General Engineering & Computer Science (AREA)
- Strategic Management (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Human Resources & Organizations (AREA)
- Entrepreneurship & Innovation (AREA)
- Mathematical Physics (AREA)
- Molecular Biology (AREA)
- Probability & Statistics with Applications (AREA)
- Biophysics (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Fuzzy Systems (AREA)
- Economics (AREA)
- Marketing (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The present invention provides an auditing method, a system, and a readable storage medium. The method comprises: obtaining audit data text; performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result, the unexamined data set comprising at least one group of unexamined data, and the unexamined data comprising an audit data type and its corresponding number; determining a corresponding calculation formula according to the audit data type; and using the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type, obtaining an audit result.
Description
Technical field
The present invention relates to the field of auditing technology, and in particular to an auditing method, a system, and a readable storage medium.
Background
In recent years, with the rapid development of computer technology and informatization, the scope and depth of informatization in economic management activities have kept growing. Auditing, which supervises, evaluates, and attests economic activities, faces unprecedented challenges: traditional manual auditing cannot meet audit demands under informatized conditions, and both the informatization of audit targets and the development of auditing itself require audit work to keep pace and adjust accordingly. It is therefore urgent to follow the informatization trend, update audit supervision concepts, and innovate auditing methods.
How to provide a method that extracts audit data from text containing audit data and performs the corresponding calculations to assist traditional manual auditing, thereby improving audit measures and audit efficiency, is a problem that currently needs to be solved.
Summary of the invention
To solve at least one of the above technical problems, the present invention proposes an auditing method, a system, and a readable storage medium.
To achieve the above object, a first aspect of the present invention proposes an auditing method, the method comprising:
Obtaining audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result; the unexamined data set comprises at least one group of unexamined data; the unexamined data comprises an audit data type and its corresponding number;
Determining a corresponding calculation formula according to the audit data type; using the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type, obtaining an audit result.
Further, after the unexamined data set is obtained, the method further comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that conforms to the audit rule, and determining, from the conforming unexamined data, a target unexamined data set to be audited.
Further, determining the unexamined data that conforms to the audit rule comprises:
Determining feature data of the unexamined data in the unexamined data set;
Matching the feature data of the unexamined data against the matching feature in the audit rule to determine the matching degree between the feature data and the matching feature;
When the matching degree is higher than a preset threshold, determining that the unexamined data conforms to the audit rule.
Further, determining the matching degree between the feature data of the unexamined data and the matching feature comprises:
Calculating the matching degree between the feature data of the unexamined data and the matching feature with the following formula:
[formula omitted in source]
Where m is the identical-character matching degree between the feature data of the unexamined data and the matching feature; t is the edit distance between the matching sequence of the feature data and the matching sequence of the matching feature; and |SA| and |SB| are the string lengths of the feature data and of the matching feature, respectively;
The calculation formula for m is as follows:
[formula omitted in source]
Where N denotes the total number of characters successfully matched between the feature data of the unexamined data and the matching feature, and Δ(A, i+1, i, B) denotes the position interval, within the matching feature, between the characters corresponding to the (i+1)-th and the i-th characters of the matching sequence of the feature data.
Further, before the calculation formula is used to perform the corresponding calculation on the number corresponding to the audit data type, the method further comprises:
Determining a first format of the number;
Determining a second format, which is the number format required by the calculation formula;
Judging whether the first format and the second format are the same and, if they are not, converting the number from the first format to the second format;
Correspondingly, using the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type comprises:
Using the calculation formula to perform the corresponding calculation on the number, in the second format, corresponding to the audit data type.
A second aspect of the present invention further proposes an auditing system. The auditing system comprises a memory and a processor; the memory contains an auditing method program which, when executed by the processor, implements the following steps:
Obtaining audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result; the unexamined data set comprises at least one group of unexamined data; the unexamined data comprises an audit data type and its corresponding number;
Determining a corresponding calculation formula according to the audit data type; using the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type, obtaining an audit result.
Further, after the unexamined data set is obtained, the method further comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that conforms to the audit rule, and determining, from the conforming unexamined data, a target unexamined data set to be audited.
Further, determining the unexamined data that conforms to the audit rule comprises:
Determining feature data of the unexamined data in the unexamined data set;
Matching the feature data of the unexamined data against the matching feature in the audit rule to determine the matching degree between the feature data and the matching feature;
When the matching degree is higher than a preset threshold, determining that the unexamined data conforms to the audit rule.
Further, determining the matching degree between the feature data of the unexamined data and the matching feature comprises:
Calculating the matching degree between the feature data of the unexamined data and the matching feature with the following formula:
[formula omitted in source]
Where m is the identical-character matching degree between the feature data of the unexamined data and the matching feature; t is the edit distance between the matching sequence of the feature data and the matching sequence of the matching feature; and |SA| and |SB| are the string lengths of the feature data and of the matching feature, respectively;
The calculation formula for m is as follows:
[formula omitted in source]
Where N denotes the total number of characters successfully matched between the feature data of the unexamined data and the matching feature, and Δ(A, i+1, i, B) denotes the position interval, within the matching feature, between the characters corresponding to the (i+1)-th and the i-th characters of the matching sequence of the feature data.
A third aspect of the present invention further proposes a computer-readable storage medium. The computer-readable storage medium contains an auditing method program; when the auditing method program is executed by a processor, the steps of the auditing method described in any of the above items are implemented.
Embodiments of the present invention provide an auditing method, a system, and a storage medium that: obtain audit data text; perform word segmentation on the audit data text to obtain a word segmentation result; obtain an unexamined data set according to the word segmentation result, the unexamined data set comprising at least one group of unexamined data, and the unexamined data comprising an audit data type and its corresponding number; determine a corresponding calculation formula according to the audit data type; and use the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type, obtaining an audit result. With this auditing method, relevant data is extracted from the audit data text and the corresponding calculation is performed according to the type of that data to obtain an audit result, which assists manual auditing, saves labor costs, and improves audit efficiency.
Additional aspects and advantages of the present invention will be set forth in part in the description that follows, and in part will become apparent from that description or be learned through practice of the invention.
Brief description of the drawings
Fig. 1 is a schematic flowchart of an auditing method provided by an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of an auditing system provided by an embodiment of the present invention.
Detailed description of embodiments
So that the above objects, features, and advantages of the present invention can be understood more clearly, the present invention is described in further detail below with reference to the accompanying drawings and specific embodiments. It should be noted that, where no conflict arises, the embodiments of the present application and the features in the embodiments may be combined with one another.
Many specific details are set forth in the following description to facilitate a full understanding of the present invention. However, the present invention may also be implemented in ways other than those described here, so its scope of protection is not limited by the specific embodiments described below.
Fig. 1 is a schematic flowchart of an auditing method provided by an embodiment of the present invention. As shown in Fig. 1, the method may be applied to intelligent electronic devices, such as servers and computers, on which the auditing system is installed. The method comprises:
Step 101: obtain audit data text.
Step 102: perform word segmentation on the audit data text to obtain a word segmentation result; obtain an unexamined data set according to the word segmentation result. The unexamined data set comprises at least one group of unexamined data; the unexamined data comprises an audit data type and its corresponding number.
Step 103: determine a corresponding calculation formula according to the audit data type; use the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type, obtaining an audit result.
Specifically, after the unexamined data set is obtained in step 102, the method further comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that conforms to the audit rule, and determining, from the conforming unexamined data, a target unexamined data set to be audited.
That is, in step 103, determining the corresponding calculation formula according to the audit data type and using that formula to perform the corresponding calculation on the number corresponding to the audit data type, obtaining an audit result, are carried out on the above target unexamined data set.
The target unexamined data set comprises at least one group of unexamined data; the unexamined data comprises an audit data type and its corresponding number.
The at least one group of unexamined data in the target unexamined data set belongs to the unexamined data set.
The above steps screen the data in the unexamined data set, so that only the unexamined data the user requires is audited, which improves working efficiency.
The preset audit rule may be set and saved in advance by the user.
Specifically, determining the unexamined data that conforms to the audit rule comprises:
Determining feature data of the unexamined data in the unexamined data set;
Matching the feature data of the unexamined data against the matching feature in the audit rule to determine the matching degree between the feature data and the matching feature;
When the matching degree is higher than a preset threshold, determining that the unexamined data conforms to the audit rule.
That is, the matching feature in the audit rule described above may be set and saved in advance by the user. The matching feature may include text or an identifier that refers to particular content. No limitation is imposed here; it suffices that the unexamined data can be screened according to the matching feature.
Correspondingly, the feature data of the unexamined data may also include text, an identifier, or the like that refers to particular content, provided that it can be combined with the matching feature to screen the unexamined data. The feature data may be determined from at least one word in the word segmentation result, or from a combination of multiple words in the word segmentation result.
Specifically, determining the matching degree between the feature data of the unexamined data and the matching feature comprises:
Calculating the matching degree between the feature data of the unexamined data and the matching feature with the following formula:
[formula omitted in source]
Where m is the identical-character matching degree between the feature data of the unexamined data and the matching feature; t is the edit distance between the matching sequence of the feature data and the matching sequence of the matching feature; and |SA| and |SB| are the string lengths of the feature data and of the matching feature, respectively;
The calculation formula for m is as follows:
[formula omitted in source]
Where N denotes the total number of characters successfully matched between the feature data of the unexamined data and the matching feature, and Δ(A, i+1, i, B) denotes the position interval, within the matching feature, between the characters corresponding to the (i+1)-th and the i-th characters of the matching sequence of the feature data.
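The matching-degree formula itself is not available in this text, so the sketch below does not implement it. As a stand-in, it uses a generic normalized Levenshtein similarity purely to illustrate how a matching degree and a preset threshold could screen unexamined data. The function names, the `feature` key, the sample records, and the 0.6 threshold are all hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Classic Levenshtein distance, computed with a rolling row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def similarity(feature: str, matching_feature: str) -> float:
    """Stand-in matching degree in [0, 1]; NOT the patent's formula."""
    longest = max(len(feature), len(matching_feature))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(feature, matching_feature) / longest


def filter_by_rule(records, matching_feature, threshold=0.6):
    """Keep records whose feature data scores above the preset threshold."""
    return [r for r in records
            if similarity(r["feature"], matching_feature) > threshold]
```

Any similarity measure that maps two strings to a score could be dropped into `similarity` without changing the threshold logic, which is the only part taken directly from the description.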
Specifically, in step 102, the word segmentation of the audit data text may use any segmentation method; the higher its accuracy, the better. Existing methods include the following.
Forward maximum matching: assume the longest entry in the dictionary contains n Chinese characters, and take the first n characters of the string to be processed as the matching field. If the dictionary contains that field, the match succeeds and the field is split off as a word; the next field of n characters, starting from character n+1 of the string, is then matched against the dictionary. If the match fails, the last character of the n-character field is removed and the remaining n-1 characters are matched against the dictionary, and so on until segmentation succeeds.
Reverse maximum matching: this differs from forward maximum matching in that, when a match fails, the frontmost character of the field is removed instead.
The segmentation methods provided in this embodiment, described below, may also be used.
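The two dictionary-based methods described above can be sketched as follows. This is a minimal illustration rather than the patent's implementation: the function names and the sample dictionary are hypothetical, unmatched single characters are emitted as one-character tokens, and the reverse variant scans from the end of the string, which is the usual formulation of reverse maximum matching.

```python
def forward_maximum_matching(text, dictionary, max_len=None):
    """Segment text left to right, always trying the longest field first."""
    if max_len is None:
        max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Shrink the field from the right until it is a dictionary word;
        # a single character is accepted as a fallback token.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


def reverse_maximum_matching(text, dictionary, max_len=None):
    """Segment text right to left, dropping the frontmost character on failure."""
    if max_len is None:
        max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    end = len(text)
    while end > 0:
        for size in range(min(max_len, end), 0, -1):
            candidate = text[end - size:end]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                end -= size
                break
    tokens.reverse()
    return tokens
```

With the hypothetical dictionary `{"研究", "研究生", "生命", "起源"}` and the text `"研究生命起源"`, the forward variant yields `["研究生", "命", "起源"]` while the reverse variant yields `["研究", "生命", "起源"]`, which shows why the choice of direction can affect accuracy.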
Specifically, the segmentation method may be either of the following:
Convert the text to be segmented into a character sequence. Match the strings of a preset length contained in the character sequence against the standard words in a pre-built dictionary, and determine the matched strings that match the standard words. Assign a corresponding dictionary label to each character of the matched strings in the character sequence and to each character outside the matched strings, obtaining a dictionary label sequence. Determine at least one segmentation label for each character in the character sequence, obtaining multiple segmentation label sequences. Using the character sequence, the dictionary label sequence, and a pre-trained conditional probability prediction model, determine the conditional probability that the character sequence is labeled with each segmentation label sequence. Take the segmentation label sequence whose conditional probability satisfies a preset condition as the target segmentation label sequence, and segment the text to be segmented based on the target segmentation label sequence.
For the text to be segmented, select at least one preset segmentation method combination, where the combination includes a basic segmentation method and at least one of a segmentation disambiguation optimization method, a single-character optimization method, and a proper-noun optimization method. Segment the text using the selected preset combination. Where the preset combination includes the basic segmentation method and the segmentation disambiguation optimization method, segmenting the text with the selected combination comprises: segmenting the text with the basic segmentation method to obtain a basic segmentation result; obtaining the intersecting token groups and non-intersecting token groups contained in the basic segmentation result; for each intersecting token group, determining the non-intersecting token combinations within that group; determining the unambiguous tokens according to the word-formation probability of each token in the non-intersecting token combinations; and taking the tokens in the non-intersecting token groups together with the unambiguous tokens as the segmentation result of the text.
Specifically, in step 103, before the calculation formula is used to perform the corresponding calculation on the number corresponding to the audit data type, the method further comprises:
Determining a first format of the number;
Determining a second format, which is the number format required by the calculation formula;
Judging whether the first format and the second format are the same and, if they are not, converting the number from the first format to the second format;
Correspondingly, using the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type comprises:
Using the calculation formula to perform the corresponding calculation on the number, in the second format, corresponding to the audit data type.
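The format check and conversion can be sketched as below. The description does not say which number formats occur, so the three formats here ("plain", "thousands", "percent"), the function names, and the choice of `Decimal` are assumptions made only for illustration.

```python
from decimal import Decimal


def detect_format(raw: str) -> str:
    """Classify a number string into one of three hypothetical formats."""
    text = raw.strip()
    if text.endswith("%"):
        return "percent"
    if "," in text:
        return "thousands"
    return "plain"


def to_plain(raw: str) -> str:
    """Convert a number string from its first format to the plain format."""
    text = raw.strip()
    fmt = detect_format(text)
    if fmt == "percent":
        return str(Decimal(text[:-1]) / Decimal(100))
    if fmt == "thousands":
        return text.replace(",", "")
    return text


def prepare_number(raw: str, required_format: str = "plain") -> Decimal:
    """Return the number in the format the formula requires (plain only here)."""
    if required_format != "plain":
        raise ValueError("this sketch only converts to the plain format")
    if detect_format(raw) != required_format:
        raw = to_plain(raw)  # first format differs from second: convert
    return Decimal(raw.strip())
```

`Decimal` is used instead of `float` so that monetary amounts are not affected by binary rounding.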
Specifically, the audit data types include financial-domain business and cross-domain business. Different businesses may correspond to different business data. For example, financial-domain business data may include cost reimbursement, travel expense reimbursement, project payments, electricity fee payments, salary payments, and similar business data; cross-domain business data may include engineering projects, engineering contracts, materials contracts, preliminary engineering budgets, materials inbound and outbound documents, final project accounts, marketing financial accounting, and electricity fees receivable and received.
It should be noted that each audit data type may correspond to different keywords. The correspondence between keywords and audit data types is set and saved in advance, and the audit data type can be determined from a keyword by querying this correspondence. The keyword may be obtained from the audit data text and may be one of the words in the word segmentation result.
Specifically, a keyword is obtained through the word segmentation, and the correspondence is queried with that keyword to determine the audit data type corresponding to the keyword.
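The keyword lookup described above amounts to querying a saved mapping. A minimal sketch follows; the mapping contents and the function name are hypothetical, and a real system would load the correspondence that was set and saved in advance.

```python
# Hypothetical keyword table standing in for the preset, saved correspondence.
KEYWORD_TO_AUDIT_TYPE = {
    "travel": "travel expense reimbursement",
    "salary": "salary payment",
    "electricity": "electricity fee payment",
}


def find_audit_data_types(tokens, mapping=KEYWORD_TO_AUDIT_TYPE):
    """Return (keyword, audit data type) pairs for tokens found in the mapping."""
    return [(token, mapping[token]) for token in tokens if token in mapping]
```

Tokens that are not keywords are simply skipped, so the word segmentation result can be passed in unfiltered.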
Here, step 103, in which the corresponding calculation formula is determined according to the audit data type and used to perform the corresponding calculation on the number corresponding to the audit data type to obtain an audit result, is explained further.
Once the audit data type is determined, the calculation can be performed directly with the formula that the server has saved for that audit data type; this is how the corresponding calculation formula can be determined according to the audit data type.
Before the calculation, however, numbers in different formats cannot necessarily be applied to the formula, so the number needs to be converted. The conversion method described above is used and is not repeated here.
After the conversion, the corresponding calculation can be performed with the data and the formula to obtain the result.
The above method assists manual auditing, saving labor costs and improving working efficiency.
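Selecting a saved formula by audit data type can be sketched as a dictionary of callables. The description does not disclose the actual formulas, so `total` and `average`, the type names, and the `(audit_type, numbers)` record shape are hypothetical placeholders.

```python
from decimal import Decimal


def total(numbers):
    """Hypothetical formula: sum of all amounts."""
    return sum(numbers, Decimal("0"))


def average(numbers):
    """Hypothetical formula: mean of all amounts."""
    return total(numbers) / len(numbers)


# Hypothetical table standing in for the formulas saved on the server.
FORMULAS = {
    "travel expense reimbursement": total,
    "salary payment": average,
}


def run_audit(unexamined_data_set, formulas=FORMULAS):
    """Apply the formula saved for each audit data type to its numbers."""
    results = []
    for audit_type, numbers in unexamined_data_set:
        formula = formulas.get(audit_type)
        if formula is None:
            continue  # no saved formula for this type: skip the group
        results.append((audit_type, formula(numbers)))
    return results
```

Numbers are assumed to have already been converted to the format the formula requires before `run_audit` is called.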
This embodiment provides several ways of obtaining the audit data text. Specifically, obtaining the audit data text comprises at least one of the following:
Sending a data acquisition instruction to at least one database corresponding to at least one audit target, and receiving the audit data text sent by the at least one database;
Sending an access instruction to at least one database corresponding to at least one audit target and, after receiving the acceptance message sent by the at least one database, accessing the at least one database through a WebService or HTTP service to obtain the audit data text;
Sending a data acquisition instruction to a central database, and receiving the audit data text sent by the central database, where the central database periodically obtains the audit data text from the at least one database corresponding to the at least one audit target;
Sending an access instruction to the central database and, after receiving the acceptance message sent by the central database, accessing the central database through a WebService or HTTP service to obtain the audit data text.
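The access-then-fetch variants above could look roughly like the following over plain HTTP. The endpoint paths `/access` and `/audit-text`, the JSON `accepted` field, and the function name are assumptions for illustration; the description does not specify any wire format. The `opener` parameter defaults to the standard library's `urlopen` and can be replaced for testing.

```python
import json
from urllib.request import Request, urlopen


def fetch_audit_text(base_url, opener=urlopen, timeout=10.0):
    """Send an access instruction, wait for acceptance, then fetch the text."""
    # Step 1: access instruction (hypothetical endpoint and payload).
    access_request = Request(
        base_url + "/access",
        data=b"{}",
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with opener(access_request, timeout=timeout) as response:
        reply = json.loads(response.read().decode("utf-8"))
    if not reply.get("accepted"):
        raise PermissionError("database did not accept the access instruction")

    # Step 2: retrieve the audit data text (hypothetical endpoint).
    with opener(Request(base_url + "/audit-text"), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

The same function would serve either an audit target's own database or the central database, since only `base_url` differs.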
Fig. 2 is a block diagram of an auditing system provided by an embodiment of the present invention. As shown in Fig. 2, a second aspect of the present invention further proposes an auditing system 2. The auditing system 2 comprises a memory 21 and a processor 22; the memory 21 contains an auditing method program which, when executed by the processor 22, implements the following steps:
Obtaining audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result; the unexamined data set comprises at least one group of unexamined data; the unexamined data comprises an audit data type and its corresponding number;
Determining a corresponding calculation formula according to the audit data type; using the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type, obtaining an audit result.
It should be noted that the system of the present invention can run on terminal devices such as servers, PCs, mobile phones, and PADs.
It should be noted that the processor may be a central processing unit (CPU), another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, and so on. The general-purpose processor may be a microprocessor or any conventional processor.
Further, after the unexamined data acquisition system of acquisition, the method also includes:
It determines preset audit regulation, inquires the unexamined data acquisition system according to the audit regulation, be determined for compliance with careful
The unexamined data for counting rule, determine the unexamined data acquisition system of target according to the unexamined data for meeting audit regulation, use
To audit.
Further, determining the unexamined data that conforms to the audit rule comprises:
Determining the characteristic of the unexamined data in the unexamined data set;
Matching the characteristic of the unexamined data against the matching characteristic in the audit rule, to determine the matching degree between the characteristic of the unexamined data and the matching characteristic;
When the matching degree is higher than a preset threshold, determining that the unexamined data conforms to the audit rule.
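A minimal sketch of this threshold-based screening is shown below. It assumes each record carries a string characteristic under a hypothetical `"characteristic"` key, and it uses the ratio from Python's `difflib.SequenceMatcher` as a stand-in similarity measure; that measure is an assumption for illustration and is not the matching-degree formula of the invention. The function names and the default threshold of 0.8 are likewise hypothetical.

```python
from difflib import SequenceMatcher


def matching_degree(characteristic, matching_characteristic):
    """Stand-in similarity in [0, 1]; NOT the matching-degree formula of the patent."""
    return SequenceMatcher(None, characteristic, matching_characteristic).ratio()


def select_target_set(unexamined_set, matching_characteristic, threshold=0.8):
    """Keep only records whose matching degree is strictly higher than the threshold."""
    return [
        record
        for record in unexamined_set
        if matching_degree(record["characteristic"], matching_characteristic) > threshold
    ]


unexamined_set = [
    {"characteristic": "travel expense", "number": 1200},
    {"characteristic": "office rent", "number": 5000},
]
target_set = select_target_set(unexamined_set, "travel expenses")
```

Here only the first record survives the filter, because "travel expense" differs from the rule's "travel expenses" by a single character while "office rent" shares little with it.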
Further, determining the matching degree between the characteristic of the unexamined data and the matching characteristic comprises:
Calculating the matching degree between the characteristic of the unexamined data and the matching characteristic with the following formula:
[formula not reproduced in the source text]
Where m is the identical-character matching degree between the characteristic of the unexamined data and the matching characteristic; t is the edit distance between the matching sequence of the characteristic of the unexamined data and the matching sequence of the matching characteristic; and |SA| and |SB| are the string lengths of the characteristic of the unexamined data and of the matching characteristic, respectively;
The calculation formula for m is as follows:
[formula not reproduced in the source text]
Where N is the total number of successfully matched characters between the characteristic of the unexamined data and the matching characteristic, and Δ(A, i+1, i, B) is the position interval, within the matching characteristic, between the characters corresponding to the (i+1)-th character and the i-th character of the matching sequence of the characteristic of the unexamined data.
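The formula itself does not appear in this text, so it cannot be reproduced here. The quantities it uses (a count of matched characters, a correction term t, and the two string lengths |SA| and |SB|) resemble those of the standard Jaro similarity, so the sketch below implements plain Jaro as a stand-in. This is an assumption: in Jaro, m is simply the number of matched characters and t is half the number of out-of-order matches, whereas the text above derives m from N and the position intervals Δ and defines t as an edit distance. Neither of those refinements is modelled.

```python
def jaro_similarity(s_a, s_b):
    """Standard Jaro similarity in [0, 1]; a stand-in, not the patent's formula."""
    if s_a == s_b:
        return 1.0
    len_a, len_b = len(s_a), len(s_b)
    if len_a == 0 or len_b == 0:
        return 0.0

    # Characters match only if equal and within this distance of each other.
    window = max(max(len_a, len_b) // 2 - 1, 0)
    matched_a = [False] * len_a
    matched_b = [False] * len_b
    m = 0
    for i, ch in enumerate(s_a):
        for j in range(max(0, i - window), min(len_b, i + window + 1)):
            if not matched_b[j] and s_b[j] == ch:
                matched_a[i] = matched_b[j] = True
                m += 1
                break
    if m == 0:
        return 0.0

    # Matching sequences: the matched characters of each string, in order.
    seq_a = [c for c, flag in zip(s_a, matched_a) if flag]
    seq_b = [c for c, flag in zip(s_b, matched_b) if flag]
    # t is half the number of positions where the two sequences disagree.
    t = sum(x != y for x, y in zip(seq_a, seq_b)) / 2

    return (m / len_a + m / len_b + (m - t) / m) / 3


score = jaro_similarity("MARTHA", "MARHTA")
```

For the classic pair "MARTHA" and "MARHTA", all six characters match and one transposition is counted, giving a score of about 0.944.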
Further, before the corresponding calculation is performed with the calculation formula on the number corresponding to the audit data type, the method further comprises:
Determining a first format of the number;
Determining a second format, which is the number format required by the calculation formula;
Judging whether the first format and the second format are the same and, if they are not, converting the first format into the second format;
Correspondingly, performing the corresponding calculation with the calculation formula on the number corresponding to the audit data type comprises:
Performing the corresponding calculation with the calculation formula on the number, in the second format, corresponding to the audit data type.
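The format check and conversion can be illustrated with a small sketch. It assumes, purely for illustration, that the calculation formula expects `decimal.Decimal` values (the "second format") while incoming numbers may be strings with thousands separators, plain strings, or integers (the "first format"). The format labels and function names are hypothetical.

```python
from decimal import Decimal


def detect_format(number):
    """Return a hypothetical label describing the format of the number."""
    if isinstance(number, Decimal):
        return "decimal"
    if isinstance(number, str):
        return "grouped_string" if "," in number else "plain_string"
    return type(number).__name__


def to_second_format(number, second_format="decimal"):
    """Convert the number only when its first format differs from the second format."""
    first_format = detect_format(number)
    if first_format == second_format:
        return number  # formats already agree, no conversion needed
    if second_format != "decimal":
        raise ValueError(f"unsupported target format: {second_format}")
    if first_format == "grouped_string":
        return Decimal(number.replace(",", ""))
    return Decimal(str(number))


converted = to_second_format("1,234.50")
```

Converting once, before the formula runs, keeps the formulas themselves free of parsing logic and avoids mixing string and numeric operands in a calculation.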
A third aspect of the present invention further provides a computer-readable storage medium. The computer-readable storage medium includes an auditing method program, and when the auditing method program is executed by a processor, the steps of the auditing method described above are implemented.
In the several embodiments provided in this application, it should be understood that the disclosed devices and methods may be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division of the units is only a division by logical function, and other divisions are possible in actual implementation: multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. In addition, the coupling, direct coupling, or communication connection between the components shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in other forms.
The units described above as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may all be integrated into one processing unit, each unit may serve individually as a unit, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
Those of ordinary skill in the art will appreciate that all or part of the steps of the above method embodiments may be completed by hardware under the control of program instructions. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes removable storage devices, read-only memory (ROM), random access memory (RAM), magnetic disks, optical discs, and other media that can store program code.
Alternatively, if the integrated unit of the present invention is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the embodiments of the present invention, in essence or in the part that contributes to the prior art, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or part of the methods of the embodiments of the present invention. The storage medium includes removable storage devices, ROM, RAM, magnetic disks, optical discs, and other media that can store program code.
The above is merely a specific embodiment of the present invention, but the scope of protection of the present invention is not limited to it. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the scope of protection of the present invention. Therefore, the scope of protection of the present invention shall be determined by the scope of protection of the claims.
Claims (10)
1. An auditing method, characterized in that the method comprises:
Obtaining an audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result, wherein the unexamined data set comprises at least one group of unexamined data, and the unexamined data comprises an audit data type and its corresponding number;
Determining a corresponding calculation formula according to the audit data type; and using the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type, to obtain an audit result.
2. The auditing method according to claim 1, characterized in that, after the unexamined data set is obtained, the method further comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that conforms to the audit rule, and determining a target unexamined data set from the unexamined data that conforms to the audit rule, for use in the audit.
3. The auditing method according to claim 2, characterized in that determining the unexamined data that conforms to the audit rule comprises:
Determining the characteristic of the unexamined data in the unexamined data set;
Matching the characteristic of the unexamined data against the matching characteristic in the audit rule, to determine the matching degree between the characteristic of the unexamined data and the matching characteristic;
When the matching degree is higher than a preset threshold, determining that the unexamined data conforms to the audit rule.
4. The auditing method according to claim 3, characterized in that determining the matching degree between the characteristic of the unexamined data and the matching characteristic comprises:
Calculating the matching degree between the characteristic of the unexamined data and the matching characteristic with the following formula:
[formula not reproduced in the source text]
Where m is the identical-character matching degree between the characteristic of the unexamined data and the matching characteristic; t is the edit distance between the matching sequence of the characteristic of the unexamined data and the matching sequence of the matching characteristic; and |SA| and |SB| are the string lengths of the characteristic of the unexamined data and of the matching characteristic, respectively;
The calculation formula for m is as follows:
[formula not reproduced in the source text]
Where N is the total number of successfully matched characters between the characteristic of the unexamined data and the matching characteristic, and Δ(A, i+1, i, B) is the position interval, within the matching characteristic, between the characters corresponding to the (i+1)-th character and the i-th character of the matching sequence of the characteristic of the unexamined data.
5. The auditing method according to claim 1, characterized in that, before the corresponding calculation is performed with the calculation formula on the number corresponding to the audit data type, the method further comprises:
Determining a first format of the number;
Determining a second format, which is the number format required by the calculation formula;
Judging whether the first format and the second format are the same and, if they are not, converting the first format into the second format;
Correspondingly, performing the corresponding calculation with the calculation formula on the number corresponding to the audit data type comprises:
Performing the corresponding calculation with the calculation formula on the number, in the second format, corresponding to the audit data type.
6. An auditing system, characterized in that the auditing system comprises a memory and a processor, the memory includes an auditing method program, and the auditing method program, when executed by the processor, implements the following steps:
Obtaining an audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result, wherein the unexamined data set comprises at least one group of unexamined data, and the unexamined data comprises an audit data type and its corresponding number;
Determining a corresponding calculation formula according to the audit data type; and using the calculation formula to perform the corresponding calculation on the number corresponding to the audit data type, to obtain an audit result.
7. The auditing system according to claim 6, characterized in that, after the unexamined data set is obtained, the method further comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that conforms to the audit rule, and determining a target unexamined data set from the unexamined data that conforms to the audit rule, for use in the audit.
8. The auditing system according to claim 7, characterized in that determining the unexamined data that conforms to the audit rule comprises:
Determining the characteristic of the unexamined data in the unexamined data set;
Matching the characteristic of the unexamined data against the matching characteristic in the audit rule, to determine the matching degree between the characteristic of the unexamined data and the matching characteristic;
When the matching degree is higher than a preset threshold, determining that the unexamined data conforms to the audit rule.
9. The auditing system according to claim 8, characterized in that determining the matching degree between the characteristic of the unexamined data and the matching characteristic comprises:
Calculating the matching degree between the characteristic of the unexamined data and the matching characteristic with the following formula:
[formula not reproduced in the source text]
Where m is the identical-character matching degree between the characteristic of the unexamined data and the matching characteristic; t is the edit distance between the matching sequence of the characteristic of the unexamined data and the matching sequence of the matching characteristic; and |SA| and |SB| are the string lengths of the characteristic of the unexamined data and of the matching characteristic, respectively;
The calculation formula for m is as follows:
[formula not reproduced in the source text]
Where N is the total number of successfully matched characters between the characteristic of the unexamined data and the matching characteristic, and Δ(A, i+1, i, B) is the position interval, within the matching characteristic, between the characters corresponding to the (i+1)-th character and the i-th character of the matching sequence of the characteristic of the unexamined data.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium includes an auditing method program, and when the auditing method program is executed by a processor, the steps of the auditing method according to any one of claims 1 to 5 are implemented.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910815699.XA CN110532302B (en) | 2019-08-30 | 2019-08-30 | Audit method, system and readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910815699.XA CN110532302B (en) | 2019-08-30 | 2019-08-30 | Audit method, system and readable storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110532302A (en) | 2019-12-03 |
CN110532302B (en) | 2024-01-19 |
Family
ID=68665619
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910815699.XA Active CN110532302B (en) | 2019-08-30 | 2019-08-30 | Audit method, system and readable storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110532302B (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070157224A1 (en) * | 2005-12-23 | 2007-07-05 | Jean-Francois Pouliot | Method and system for automated auditing of advertising |
CN106446076A (en) * | 2016-09-07 | 2017-02-22 | 南京理工大学 | Hierarchical clustering-based log audit method |
CN106503102A (en) * | 2016-10-17 | 2017-03-15 | 汉蓝(北京)科技有限公司 | A kind of search engine formula audit analysis method |
CN109598484A (en) * | 2018-12-04 | 2019-04-09 | 广东电网有限责任公司 | A kind of project under construction turns fixed assets number auditing method and device |
CN109726272A (en) * | 2018-12-20 | 2019-05-07 | 杭州数梦工场科技有限公司 | Audit regulation recommended method and device |
CN109741029A (en) * | 2018-12-27 | 2019-05-10 | 广东电网有限责任公司 | The building method and device in a kind of power grid enterprises' audit regulation storehouse |
Also Published As
Publication number | Publication date |
---|---|
CN110532302B (en) | 2024-01-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110020422B (en) | Feature word determining method and device and server | |
CN108170792B (en) | Question and answer guiding method and device based on artificial intelligence and computer equipment | |
WO2019085236A1 (en) | Search intention recognition method and apparatus, and electronic device and readable storage medium | |
US8949204B2 (en) | Efficient development of a rule-based system using crowd-sourcing | |
CN106919575B (en) | Application program searching method and device | |
CN104298679A (en) | Application service recommendation method and device | |
CN110909540B (en) | Method and device for identifying new words of short message spam and electronic equipment | |
CN104199965A (en) | Semantic information retrieval method | |
CN103279478A (en) | Method for extracting features based on distributed mutual information documents | |
CN105389341A (en) | Text clustering and analysis method for repeating caller work orders of customer service calls | |
CN104866511A (en) | Method and equipment for adding multi-media files | |
CN109740642A (en) | Invoice category recognition methods, device, electronic equipment and readable storage medium storing program for executing | |
CN111061837A (en) | Topic identification method, device, equipment and medium | |
CN113268615A (en) | Resource label generation method and device, electronic equipment and storage medium | |
CN111782793A (en) | Intelligent customer service processing method, system and equipment | |
CN106919588A (en) | A kind of application program search system and method | |
CN113449753B (en) | Service risk prediction method, device and system | |
CN116151220A (en) | Word segmentation model training method, word segmentation processing method and device | |
CN112287111B (en) | Text processing method and related device | |
CN109697224B (en) | Bill message processing method, device and storage medium | |
CN114461783A (en) | Keyword generation method and device, computer equipment, storage medium and product | |
CN114328800A (en) | Text processing method and device, electronic equipment and computer readable storage medium | |
CN113505117A (en) | Data quality evaluation method, device, equipment and medium based on data indexes | |
CN113392920A (en) | Method, apparatus, device, medium, and program product for generating cheating prediction model | |
CN110399617A (en) | Audit data processing method, system and readable storage medium storing program for executing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||