CN110532302A - Auditing method, system, and readable storage medium - Google Patents

Auditing method, system, and readable storage medium

Info

Publication number
CN110532302A
Authority
CN
China
Prior art keywords
data
unexamined
characteristic
audit
matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910815699.XA
Other languages
Chinese (zh)
Other versions
CN110532302B (en)
Inventor
黄楚维
谢志林
冯挺
闭秀萍
韦海玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanning Power Supply Bureau of Guangxi Power Grid Co Ltd
Original Assignee
Nanning Power Supply Bureau of Guangxi Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanning Power Supply Bureau of Guangxi Power Grid Co Ltd
Priority to CN201910815699.XA
Publication of CN110532302A
Application granted
Publication of CN110532302B
Current legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465 Query processing support for facilitating data mining operations in structured databases
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25 Integrating or interfacing systems involving database management systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Fuzzy Systems (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides an auditing method, a system, and a readable storage medium. The method comprises: obtaining an audit data text; performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result, the unexamined data set comprising at least one group of unexamined data, and the unexamined data comprising an audit data type and its corresponding number; determining a corresponding calculation formula according to the audit data type; and, using the calculation formula, performing the corresponding calculation on the number corresponding to the audit data type to obtain an audit result.

Description

Auditing method, system, and readable storage medium
Technical field
The present invention relates to the technical field of auditing, and in particular to an auditing method, a system, and a readable storage medium.
Background
In recent years, with the rapid development of computer technology and informatization, the scope and depth of informatization in economic management activities have kept growing. Auditing, which supervises, evaluates, and attests to economic activity, faces unprecedented challenges: traditional manual auditing cannot meet audit demands under informatized conditions, and the informatization of both the audit target and auditing itself requires the mode of audit work to keep pace and adjust accordingly. Updating the concept of audit supervision and innovating auditing methods in line with the informatization trend is therefore urgent.
The problem to be solved is how to provide a method that extracts audit data from text containing audit data and performs the corresponding calculation to assist traditional manual auditing, thereby improving audit means and audit efficiency.
Summary of the invention
To solve at least one of the above technical problems, the present invention proposes an auditing method, a system, and a readable storage medium.
To achieve the above object, a first aspect of the present invention proposes an auditing method, the method comprising:
Obtaining an audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result, the unexamined data set comprising at least one group of unexamined data, and the unexamined data comprising an audit data type and its corresponding number;
Determining a corresponding calculation formula according to the audit data type; and, using the calculation formula, performing the corresponding calculation on the number corresponding to the audit data type to obtain an audit result.
Further, after the unexamined data set is obtained, the method also comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that meets the audit rule, and determining a target unexamined data set from the unexamined data that meets the audit rule, for use in auditing.
Further, determining the unexamined data that meets the audit rule comprises:
Determining the characteristic data of the unexamined data in the unexamined data set;
Matching the characteristic data of the unexamined data against the matching characteristic in the audit rule to determine the matching degree between the characteristic data and the matching characteristic;
When the matching degree is higher than a preset threshold, determining that the unexamined data meets the audit rule.
Further, determining the matching degree between the characteristic data of the unexamined data and the matching characteristic comprises:
The matching degree between the characteristic data of the unexamined data and the matching characteristic is calculated with the following formula:
[Formula not reproduced in the source text.]
Where m is the identical-character matching degree between the characteristic data of the unexamined data and the matching characteristic; t is the edit distance between the matching sequence of the characteristic data and the matching sequence of the matching characteristic; and |SA| and |SB| are the string lengths of the characteristic data and of the matching characteristic, respectively.
The calculation formula for m is as follows:
[Formula not reproduced in the source text.]
Where N is the total number of characters successfully matched between the characteristic data of the unexamined data and the matching characteristic, and Δ(A, i+1, i, B) is the position interval, within the matching characteristic, between the characters that correspond to the (i+1)-th and the i-th characters of the matching sequence of the characteristic data.
Further, before the corresponding calculation is performed on the number corresponding to the audit data type using the calculation formula, the method also comprises:
Determining a first format, namely the format of the number;
Determining a second format, namely the number format required by the calculation formula;
Judging whether the first format and the second format are the same and, if they are not, converting the number from the first format to the second format;
Correspondingly, performing the corresponding calculation on the number corresponding to the audit data type using the calculation formula comprises:
Using the calculation formula, performing the corresponding calculation on the number in the second format corresponding to the audit data type.
A second aspect of the present invention also proposes an auditing system. The auditing system comprises a memory and a processor; the memory contains an auditing method program which, when executed by the processor, implements the following steps:
Obtaining an audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result, the unexamined data set comprising at least one group of unexamined data, and the unexamined data comprising an audit data type and its corresponding number;
Determining a corresponding calculation formula according to the audit data type; and, using the calculation formula, performing the corresponding calculation on the number corresponding to the audit data type to obtain an audit result.
Further, after the unexamined data set is obtained, the method also comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that meets the audit rule, and determining a target unexamined data set from the unexamined data that meets the audit rule, for use in auditing.
Further, determining the unexamined data that meets the audit rule comprises:
Determining the characteristic data of the unexamined data in the unexamined data set;
Matching the characteristic data of the unexamined data against the matching characteristic in the audit rule to determine the matching degree between the characteristic data and the matching characteristic;
When the matching degree is higher than a preset threshold, determining that the unexamined data meets the audit rule.
Further, determining the matching degree between the characteristic data of the unexamined data and the matching characteristic comprises:
The matching degree between the characteristic data of the unexamined data and the matching characteristic is calculated with the following formula:
[Formula not reproduced in the source text.]
Where m is the identical-character matching degree between the characteristic data of the unexamined data and the matching characteristic; t is the edit distance between the matching sequence of the characteristic data and the matching sequence of the matching characteristic; and |SA| and |SB| are the string lengths of the characteristic data and of the matching characteristic, respectively.
The calculation formula for m is as follows:
[Formula not reproduced in the source text.]
Where N is the total number of characters successfully matched between the characteristic data of the unexamined data and the matching characteristic, and Δ(A, i+1, i, B) is the position interval, within the matching characteristic, between the characters that correspond to the (i+1)-th and the i-th characters of the matching sequence of the characteristic data.
A third aspect of the present invention also proposes a computer-readable storage medium. The computer-readable storage medium contains an auditing method program which, when executed by a processor, implements the steps of the auditing method described in any of the above.
The embodiments of the present invention provide an auditing method, a system, and a storage medium: an audit data text is obtained; word segmentation is performed on the audit data text to obtain a word segmentation result; an unexamined data set is obtained according to the word segmentation result, the unexamined data set comprising at least one group of unexamined data, and the unexamined data comprising an audit data type and its corresponding number; a corresponding calculation formula is determined according to the audit data type; and, using the calculation formula, the corresponding calculation is performed on the number corresponding to the audit data type to obtain an audit result. The above auditing method extracts the relevant data from the audit data text and performs the corresponding calculation according to the type of that data to obtain an audit result, which assists manual auditing, saves labor cost, and improves audit efficiency.
Additional aspects and advantages of the present invention will be given in part in the following description, and in part will become apparent from the description or be learned through practice of the present invention.
Brief description of the drawings
Fig. 1 is a flow diagram of an auditing method provided by an embodiment of the present invention;
Fig. 2 is a structural diagram of an auditing system provided by an embodiment of the present invention.
Detailed description of the embodiments
To make the above objects, features, and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings and specific embodiments. It should be noted that, where no conflict arises, the embodiments of the present application and the features in the embodiments may be combined with one another.
Many specific details are set forth in the following description to facilitate a full understanding of the present invention; however, the present invention may also be implemented in ways other than those described here, so the scope of protection of the present invention is not limited by the specific embodiments disclosed below.
Fig. 1 is a flow diagram of an auditing method provided by an embodiment of the present invention. As shown in Fig. 1, the method can be applied to intelligent electronic devices, such as servers and computers, on which the auditing system is loaded. The method comprises:
Step 101: obtain an audit data text.
Step 102: perform word segmentation on the audit data text to obtain a word segmentation result; obtain an unexamined data set according to the word segmentation result. The unexamined data set comprises at least one group of unexamined data; the unexamined data comprises an audit data type and its corresponding number.
Step 103: determine a corresponding calculation formula according to the audit data type; using the calculation formula, perform the corresponding calculation on the number corresponding to the audit data type to obtain an audit result.
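As an illustration of how step 102 might turn a word segmentation result into an unexamined data set, the following minimal sketch pairs each recognized audit data type with the next numeric token. The type names, the `Record` structure, and the pairing rule are all assumptions made for this example; the patent does not specify them.

```python
import re
from typing import NamedTuple

# Hypothetical audit data types; the patent does not enumerate token forms.
AUDIT_TYPES = {"travel reimbursement", "electricity fee payment"}

# Matches plain integers and decimals such as "800" or "1200.50".
NUMBER_RE = re.compile(r"^\d+(?:\.\d+)?$")


class Record(NamedTuple):
    """One group of unexamined data: a type and its corresponding number."""
    audit_type: str
    number: str


def build_unexamined_set(tokens):
    """Pair each audit-type token with the next numeric token after it."""
    records = []
    pending = None  # most recent type token still waiting for a number
    for tok in tokens:
        if tok in AUDIT_TYPES:
            pending = tok
        elif pending is not None and NUMBER_RE.match(tok):
            records.append(Record(pending, tok))
            pending = None
    return records


tokens = ["travel reimbursement", "of", "1200.50", "and",
          "electricity fee payment", "800"]
records = build_unexamined_set(tokens)
```

The numbers are kept as strings here on purpose, since the method converts number formats in a separate step before any calculation.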
Specifically, after the unexamined data set is obtained in step 102, the method also comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that meets the audit rule, and determining a target unexamined data set from the unexamined data that meets the audit rule, for use in auditing.
That is, step 103 (determining the corresponding calculation formula according to the audit data type and, using the calculation formula, performing the corresponding calculation on the number corresponding to the audit data type to obtain an audit result) is carried out on the target unexamined data set.
The target unexamined data set includes at least one group of unexamined data; the unexamined data comprises an audit data type and its corresponding number.
The at least one group of unexamined data in the target unexamined data set belongs to the unexamined data set.
The above steps screen the data in the unexamined data set, so that only the unexamined data the user needs is audited, which improves working efficiency.
The preset audit rule may be set and saved in advance by the user.
Specifically, determining the unexamined data that meets the audit rule comprises:
Determining the characteristic data of the unexamined data in the unexamined data set;
Matching the characteristic data of the unexamined data against the matching characteristic in the audit rule to determine the matching degree between the characteristic data and the matching characteristic;
When the matching degree is higher than a preset threshold, determining that the unexamined data meets the audit rule.
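The threshold-based screening above can be sketched as follows. The scoring function is a stand-in: it uses `difflib.SequenceMatcher.ratio()` from the Python standard library, which is not the matching-degree formula of this patent. The record layout and the threshold value 0.8 are likewise assumptions for illustration.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Stand-in score in [0, 1]; NOT the patent's matching-degree formula."""
    return SequenceMatcher(None, a, b).ratio()


def select_matching(records, matching_characteristic, threshold=0.8):
    """Keep records whose characteristic data scores above the threshold."""
    return [
        r for r in records
        if similarity(r["characteristic"], matching_characteristic) > threshold
    ]


# Hypothetical unexamined data set.
records = [
    {"characteristic": "travel reimbursement", "number": "1200.50"},
    {"characteristic": "salary payment", "number": "9000"},
]

# Target unexamined data set: only the records that meet the audit rule.
target_set = select_matching(records, "travel reimbursements", threshold=0.8)
```

Because the score is passed through a single function, a different matching-degree calculation can be swapped in without changing the screening logic.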
That is, the matching characteristic in the audit rule described above may be set and saved in advance by the user. The matching characteristic in the audit rule may include text or an identifier that refers to particular content; no limitation is imposed here, provided that the unexamined data can be screened according to the matching characteristic.
Correspondingly, the characteristic data of the unexamined data may also include text, an identifier that refers to particular content, and so on; it only needs to allow the unexamined data to be screened in combination with the matching characteristic. The characteristic data of the unexamined data may be determined from at least one word in the word segmentation result, or from a combination of multiple words in the word segmentation result.
Specifically, determining the matching degree between the characteristic data of the unexamined data and the matching characteristic comprises:
The matching degree between the characteristic data of the unexamined data and the matching characteristic is calculated with the following formula:
[Formula not reproduced in the source text.]
Where m is the identical-character matching degree between the characteristic data of the unexamined data and the matching characteristic; t is the edit distance between the matching sequence of the characteristic data and the matching sequence of the matching characteristic; and |SA| and |SB| are the string lengths of the characteristic data and of the matching characteristic, respectively.
The calculation formula for m is as follows:
[Formula not reproduced in the source text.]
Where N is the total number of characters successfully matched between the characteristic data of the unexamined data and the matching characteristic, and Δ(A, i+1, i, B) is the position interval, within the matching characteristic, between the characters that correspond to the (i+1)-th and the i-th characters of the matching sequence of the characteristic data.
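The two formulas themselves do not appear in this text, so they cannot be reproduced here. The symbols m, t, |SA|, and |SB| resemble the quantities used by the well-known Jaro similarity, so the sketch below implements the standard Jaro similarity purely as a point of comparison. It is an assumption that the patent's formula is related at all: in the patent, m depends on the position intervals Δ and t is described as an edit distance, neither of which is true of standard Jaro.

```python
def jaro(s1, s2):
    """Standard Jaro similarity in [0, 1] (illustrative, not the patent's)."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0

    # Characters match only if equal and within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2

    m = 0  # number of matching characters
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                m += 1
                break
    if m == 0:
        return 0.0

    # Matched characters in order of appearance in each string.
    seq1 = [s1[i] for i in range(len1) if matched1[i]]
    seq2 = [s2[j] for j in range(len2) if matched2[j]]
    # t: half the number of positions where the matched sequences differ.
    t = sum(a != b for a, b in zip(seq1, seq2)) / 2

    return (m / len1 + m / len2 + (m - t) / m) / 3


score = jaro("MARTHA", "MARHTA")
```

For the classic pair "MARTHA" and "MARHTA", all six characters match and one transposition is counted, giving a score of about 0.944.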
Specifically, the word segmentation performed on the audit data text in step 102 may use any existing segmentation method, and the higher its accuracy, the better. One example is the forward maximum matching method: assuming the longest entry in the dictionary contains n Chinese characters, take the first n characters of the string being processed as the matching field. If the dictionary contains this field, the match succeeds and the field is split off as a word; the next n characters, starting from the (n+1)-th character of the string, are then taken as the new matching field and matched against the dictionary. If the match fails, the last character of the n-character field is removed and the field formed by the remaining n-1 characters is matched against the dictionary; this continues until the segmentation succeeds. Another example is the reverse maximum matching method, which differs from the forward method in that the frontmost character is removed when a match fails. The segmentation methods provided in this embodiment, described below, may also be used.
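The two dictionary-based methods can be sketched as below. This is a minimal illustration with a toy dictionary, not a production segmenter. Two conventions are assumptions of the sketch: an unmatched single character is emitted as its own token, and the reverse variant scans from the end of the string, as the method is conventionally implemented.

```python
def forward_max_match(text, dictionary, max_len=None):
    """Forward maximum matching: longest dictionary word first, left to right."""
    if max_len is None:
        max_len = max((len(w) for w in dictionary), default=1)
    tokens, i = [], 0
    while i < len(text):
        # Try the longest candidate, then drop its last character on failure.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                i += size
                break
    return tokens


def reverse_max_match(text, dictionary, max_len=None):
    """Reverse maximum matching: scan from the end, drop the front character."""
    if max_len is None:
        max_len = max((len(w) for w in dictionary), default=1)
    tokens, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                j -= size
                break
    tokens.reverse()  # tokens were collected from right to left
    return tokens


# Toy dictionary for a well-known ambiguous sentence.
dictionary = {"研究", "研究生", "生命", "命", "的", "起源"}
forward = forward_max_match("研究生命的起源", dictionary)
backward = reverse_max_match("研究生命的起源", dictionary)
```

On this input the forward pass yields 研究生 / 命 / 的 / 起源, while the reverse pass yields 研究 / 生命 / 的 / 起源, which shows why the two directions can disagree on ambiguous text.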
Specifically, the segmentation method may be either of the following:
Convert the text to be segmented into a character sequence; match the character strings of a preset length contained in the character sequence against the standard words in a pre-built dictionary to determine the matched character strings that match the standard words; assign a corresponding dictionary label to each character of the matched character strings and to each character outside them, obtaining a dictionary label sequence; determine at least one segmentation label for each character in the character sequence, obtaining multiple segmentation label sequences; according to the character sequence, the dictionary label sequence, and a pre-trained conditional probability prediction model, determine the conditional probability that the character sequence is labeled with each segmentation label sequence; take the segmentation label sequence whose conditional probability meets a preset condition as the target segmentation label sequence, and segment the text based on the target segmentation label sequence.
For the text to be segmented, select at least one preset segmentation method combination, where the combination includes a basic segmentation method and at least one of a segmentation disambiguation optimization method, a single-character optimization method, and a proper-noun optimization method; segment the text using the selected preset combination. Where the preset combination includes the basic segmentation method and the segmentation disambiguation optimization method, segmenting the text with the selected combination comprises: segmenting the text with the basic segmentation method to obtain a basic segmentation result; obtaining the intersecting lemma groups and non-intersecting lemma groups contained in the basic segmentation result; for each intersecting lemma group, determining the non-intersecting lemma combinations within that group; determining the lemmas without ambiguity according to the word-formation probability of each lemma in the non-intersecting lemma combinations; and taking the lemmas in the non-intersecting lemma groups together with the lemmas without ambiguity as the segmentation result of the text.
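For the first of the two methods above, the step that builds the dictionary label sequence might look like the sketch below. The B/M/E/S/O label scheme, the greedy longest-match rule, and the `max_len` parameter are assumptions for illustration; the patent does not name the labels. The conditional probability prediction model is outside the scope of this sketch.

```python
def dictionary_label_sequence(text, dictionary, max_len=4):
    """Label each character against a dictionary.

    B/M/E mark the beginning, middle, and end of a matched word, S marks a
    matched single character, and O marks characters outside any match.
    """
    labels = ["O"] * len(text)
    i = 0
    while i < len(text):
        step = 1
        # Greedy: try the longest string of at most max_len characters first.
        for size in range(min(max_len, len(text) - i), 0, -1):
            if text[i:i + size] in dictionary:
                if size == 1:
                    labels[i] = "S"
                else:
                    labels[i] = "B"
                    labels[i + size - 1] = "E"
                    for k in range(i + 1, i + size - 1):
                        labels[k] = "M"
                step = size
                break
        i += step
    return labels


# Toy dictionary of standard words.
labels = dictionary_label_sequence("审计工程合同额", {"审计", "工程合同"})
```

In a full implementation, this label sequence would be supplied to the trained model alongside the character sequence as an additional feature.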
Specifically, in step 103, before the corresponding calculation is performed on the number corresponding to the audit data type using the calculation formula, the method also comprises:
Determining a first format, namely the format of the number;
Determining a second format, namely the number format required by the calculation formula;
Judging whether the first format and the second format are the same and, if they are not, converting the number from the first format to the second format;
Correspondingly, performing the corresponding calculation on the number corresponding to the audit data type using the calculation formula comprises:
Using the calculation formula, performing the corresponding calculation on the number in the second format corresponding to the audit data type.
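A minimal sketch of the format check and conversion follows. The three format names ("plain", "grouped", "percent") and the choice of a plain decimal as the second format are assumptions for this example; the patent does not list concrete formats.

```python
from decimal import Decimal


def detect_format(raw):
    """Classify the first format of a number string (hypothetical formats)."""
    if raw.endswith("%"):
        return "percent"
    if "," in raw:
        return "grouped"
    return "plain"


def to_second_format(raw):
    """Convert a number string to the plain decimal the formula expects."""
    fmt = detect_format(raw)
    if fmt == "percent":
        # "12%" becomes the fraction 0.12.
        return Decimal(raw[:-1]) / Decimal(100)
    if fmt == "grouped":
        # "1,234.50" becomes 1234.50 by stripping thousands separators.
        return Decimal(raw.replace(",", ""))
    # Already in the second format: no conversion needed.
    return Decimal(raw)


converted = [to_second_format(s) for s in ["1,234.50", "12%", "800"]]
```

`Decimal` is used rather than `float` so that monetary amounts are not subject to binary rounding error.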
Specifically, the audit data types include financial-domain business and cross-domain business. The business data corresponding to different businesses may differ. For example, financial-domain business data may include running cost reimbursement, travel expense reimbursement, engineering payment, electricity fee payment, salary payment, and similar business data; cross-domain business data may include engineering projects, engineering contracts, materials contracts, engineering budget estimates, materials inbound and outbound orders, project final accounts reports, marketing financial accounting, and electricity fees receivable and received.
It should be noted that each audit data type may correspond to different keywords. The correspondence between keywords and audit data types is preset and saved; by querying this correspondence, the audit data type can be determined from a keyword. The keyword may be obtained from the audit data text and may be one of the words in the word segmentation result.
Specifically, a keyword is obtained through the word segmentation, and the correspondence is queried with the keyword to determine the audit data type corresponding to the keyword.
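The preset keyword-to-type correspondence can be held in a simple lookup table, as in this sketch. The specific keywords and their English spellings are illustrative assumptions based on the examples in the text.

```python
# Hypothetical preset correspondence between keywords and audit data types.
KEYWORD_TO_TYPE = {
    "travel reimbursement": "financial-domain business",
    "electricity fee payment": "financial-domain business",
    "engineering contract": "cross-domain business",
}


def lookup_types(tokens):
    """Map each token that is a known keyword to its audit data type."""
    return {tok: KEYWORD_TO_TYPE[tok] for tok in tokens if tok in KEYWORD_TO_TYPE}


found = lookup_types(["the", "travel reimbursement", "engineering contract"])
```

Tokens that are not known keywords are simply ignored, so the lookup can be run directly over the full word segmentation result.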
Here, step 103 (determining the corresponding calculation formula according to the audit data type and, using the calculation formula, performing the corresponding calculation on the number corresponding to the audit data type to obtain an audit result) is explained further.
After the audit data type is determined, the calculation can be performed directly with the formula that the server has saved for each audit data type; the corresponding calculation formula can therefore be determined from the audit data type.
Before the calculation, since a number in a different format cannot necessarily be used in the formula, the number must first be converted, using the conversion method described above, which is not repeated here.
After conversion, the corresponding calculation can be performed with the data and the formula to obtain the result.
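One way to picture the saved per-type formulas is a table of callables keyed by audit data type, as sketched below. The two formulas (a sum and a max-minus-min spread) and the type names are invented for illustration; the patent does not disclose concrete formulas.

```python
from decimal import Decimal

# Hypothetical formulas saved per audit data type.
FORMULAS = {
    "travel reimbursement": lambda values: sum(values, Decimal(0)),
    "electricity fee payment": lambda values: max(values) - min(values),
}


def audit(records):
    """Group numbers by audit data type and apply the saved formula."""
    grouped = {}
    for audit_type, number in records:
        grouped.setdefault(audit_type, []).append(Decimal(number))
    # Types without a saved formula are skipped rather than guessed.
    return {t: FORMULAS[t](vals) for t, vals in grouped.items() if t in FORMULAS}


results = audit([
    ("travel reimbursement", "1200.50"),
    ("travel reimbursement", "300"),
    ("electricity fee payment", "800"),
    ("electricity fee payment", "650"),
])
```

The numbers passed in are assumed to have been converted to the format the formula expects before this step.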
With the above method, manual auditing is assisted, which saves labor cost and improves working efficiency.
This embodiment provides several ways of obtaining the audit data text. Specifically, obtaining the audit data text comprises at least one of the following:
Sending a data acquisition instruction to at least one database corresponding to at least one audit target, and receiving the audit data text sent by the at least one database;
Sending an access instruction to at least one database corresponding to at least one audit target and, after receiving the acceptance message sent by the at least one database, accessing the at least one database through a WebService or HTTP service to obtain the audit data text;
Sending a data acquisition instruction to a central database, and receiving the audit data text sent by the central database, where the central database periodically obtains the audit data text from the at least one database corresponding to the at least one audit target;
Sending an access instruction to the central database and, after receiving the acceptance message sent by the central database, accessing the central database through a WebService or HTTP service to obtain the audit data text.
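The central-database variant can be sketched with in-memory SQLite databases standing in for the real systems. The table name `audit_text`, its columns, and the source names are assumptions for this example. The sketch performs a single collection cycle; a real deployment would schedule it periodically and avoid copying the same rows twice.

```python
import sqlite3


def make_source(rows):
    """Create an in-memory source database holding audit data texts."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE audit_text (id INTEGER PRIMARY KEY, body TEXT)")
    conn.executemany("INSERT INTO audit_text (body) VALUES (?)",
                     [(r,) for r in rows])
    return conn


def sync_to_central(central, sources):
    """One collection cycle: copy every source row into the central table."""
    central.execute(
        "CREATE TABLE IF NOT EXISTS audit_text (source TEXT, body TEXT)")
    for name, conn in sources.items():
        rows = conn.execute("SELECT body FROM audit_text ORDER BY id").fetchall()
        central.executemany("INSERT INTO audit_text VALUES (?, ?)",
                            [(name, body) for (body,) in rows])
    central.commit()


def fetch_audit_texts(central):
    """Answer a data acquisition request from the central database."""
    cursor = central.execute("SELECT body FROM audit_text ORDER BY rowid")
    return [body for (body,) in cursor]


# Hypothetical source databases for two audit targets.
src_a = make_source(["Travel reimbursement 1200.50"])
src_b = make_source(["Electricity fee payment 800", "Salary payment 9000"])
central = sqlite3.connect(":memory:")
sync_to_central(central, {"bureau_a": src_a, "bureau_b": src_b})
texts = fetch_audit_texts(central)
```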
Fig. 2 is a block diagram of an auditing system provided by an embodiment of the present invention. As shown in Fig. 2, a second aspect of the present invention also proposes an auditing system 2. The auditing system 2 comprises a memory 21 and a processor 22; the memory 21 contains an auditing method program which, when executed by the processor 22, implements the following steps:
Obtaining an audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result, the unexamined data set comprising at least one group of unexamined data, and the unexamined data comprising an audit data type and its corresponding number;
Determining a corresponding calculation formula according to the audit data type; and, using the calculation formula, performing the corresponding calculation on the number corresponding to the audit data type to obtain an audit result.
It should be noted that the system of the present invention can operate on terminal devices such as servers, PCs, mobile phones, and tablets.
It should be noted that the processor may be a central processing unit (CPU), or another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, and so on. The general-purpose processor may be a microprocessor or any conventional processor.
Further, after the unexamined data set is obtained, the method also comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that meets the audit rule, and determining a target unexamined data set from the unexamined data that meets the audit rule, for use in auditing.
Further, determining the unexamined data that meets the audit rule comprises:
Determining the characteristic data of the unexamined data in the unexamined data set;
Matching the characteristic data of the unexamined data against the matching characteristic in the audit rule to determine the matching degree between the characteristic data and the matching characteristic;
When the matching degree is higher than a preset threshold, determining that the unexamined data meets the audit rule.
Further, determining the matching degree between the characteristic data of the unexamined data and the matching characteristic comprises:
The characteristic of the unexamined data and the matching degree of the matching characteristic are calculated with following formula:
Wherein, m is the identical characters matching degree between the characteristic and the matching characteristic of the unexamined data, t For matching sequence and the editing distance of the matching characteristic matched between sequence of the characteristic of the unexamined data, | SA |, | SB | it is respectively characteristic, the string length of the matching characteristic of the unexamined data;
Wherein, the calculation formula of the m is as follows:
Wherein, N indicates that the character of successful match between the characteristic and the matching characteristic of the unexamined data is total Number, Δ (A, i+1, i, B) indicate the i+1 character and i-th in the matching sequence of the characteristic of the unexamined data A character corresponds to the location interval between character in the matching characteristic.
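The formulas themselves do not survive in this text, so only the term t can be illustrated with confidence. Assuming that "edit distance" means the standard Levenshtein distance (an assumption; the text does not name a variant), a minimal sketch is:

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: the minimum number of single-character
    insertions, deletions, and substitutions that turn a into b."""
    # prev[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]
```

The two-row table keeps memory proportional to the length of the second string, which is sufficient when only the distance, and not the edit script, is needed.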
Further, before the corresponding calculation is performed on the numerical value corresponding to the audit data type using the calculation formula, the method further includes:
Determining a first format of the numerical value;
Determining a second format, which is the format of numerical values required by the calculation formula;
Judging whether the first format and the second format are the same and, if they are not, converting the first format into the second format;
Correspondingly, performing the corresponding calculation on the numerical value corresponding to the audit data type using the calculation formula includes:
Using the calculation formula, performing the corresponding calculation on the numerical value in the second format corresponding to the audit data type.
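A minimal sketch of the format check and conversion, under stated assumptions: the "second format" is taken to be a Python `float`, and the "first format" may be an integer or a string with thousands separators or a percent sign. Neither format is specified in the text, and the function name `to_second_format` is hypothetical.

```python
from typing import Union


def to_second_format(value: Union[str, int, float]) -> float:
    """Convert a value to the format assumed to be required by the formula."""
    if isinstance(value, float):
        return value            # formats already match, nothing to convert
    if isinstance(value, int):
        return float(value)
    text = value.strip().replace(",", "")  # drop thousands separators
    if text.endswith("%"):
        return float(text[:-1]) / 100      # percent string -> fraction
    return float(text)
```

Converting once, before the formula is applied, keeps the formulas themselves free of parsing logic.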
A third aspect of the present invention further provides a computer-readable storage medium. The computer-readable storage medium includes an auditing method program which, when executed by a processor, implements the steps of the auditing method described above.
In the several embodiments provided in this application, it should be understood that the disclosed device and method may be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division of the units is only a division by logical function; in actual implementation there may be other ways of dividing them, for example: multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. In addition, the coupling, direct coupling, or communication connection between the components shown or discussed may be indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in other forms.
The units described above as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may all be integrated into one processing unit, or each unit may serve as a separate unit, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
Those of ordinary skill in the art will understand that all or part of the steps of the above method embodiments may be completed by hardware related to program instructions. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes various media that can store program code, such as a removable storage device, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), a magnetic disk, or an optical disc.
Alternatively, if the integrated unit of the present invention is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium. Based on this understanding, the technical solutions of the embodiments of the present invention, in essence or in the part that contributes to the prior art, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or part of the methods described in the embodiments of the present invention. The storage medium includes various media that can store program code, such as a removable storage device, ROM, RAM, a magnetic disk, or an optical disc.
The above is merely a specific embodiment of the present invention, but the protection scope of the present invention is not limited to it. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. An auditing method, characterized in that the method comprises:
Obtaining audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result, the unexamined data set comprising at least one group of unexamined data, and the unexamined data comprising an audit data type and its corresponding numerical value;
Determining a corresponding calculation formula according to the audit data type; and, using the calculation formula, performing the corresponding calculation on the numerical value corresponding to the audit data type to obtain an audit result.
2. The auditing method according to claim 1, characterized in that, after the unexamined data set is obtained, the method further comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that meets the audit rule, and determining a target unexamined data set from the unexamined data that meets the audit rule, for use in the audit.
3. The auditing method according to claim 2, characterized in that determining the unexamined data that meets the audit rule comprises:
Determining the characteristic of the unexamined data in the unexamined data set;
Matching the characteristic of the unexamined data against the matching characteristic in the audit rule, and determining the matching degree between the characteristic of the unexamined data and the matching characteristic;
When the matching degree is determined to be higher than a preset threshold, determining that the unexamined data meets the audit rule.
4. The auditing method according to claim 3, characterized in that determining the matching degree between the characteristic of the unexamined data and the matching characteristic comprises:
Calculating the matching degree between the characteristic of the unexamined data and the matching characteristic with the following formula:
Wherein, m is the matching degree of identical characters between the characteristic of the unexamined data and the matching characteristic; t is the edit distance between the matching sequence of the characteristic of the unexamined data and the matching sequence of the matching characteristic; |SA| and |SB| are the string lengths of the characteristic of the unexamined data and of the matching characteristic, respectively;
Wherein, the calculation formula of m is as follows:
Wherein, N is the total number of characters successfully matched between the characteristic of the unexamined data and the matching characteristic, and Δ(A, i+1, i, B) is the position interval, within the matching characteristic, between the characters corresponding to the (i+1)-th character and the i-th character of the matching sequence of the characteristic of the unexamined data.
5. The auditing method according to claim 1, characterized in that, before the corresponding calculation is performed on the numerical value corresponding to the audit data type using the calculation formula, the method further comprises:
Determining a first format of the numerical value;
Determining a second format, which is the format of numerical values required by the calculation formula;
Judging whether the first format and the second format are the same and, if they are not, converting the first format into the second format;
Correspondingly, performing the corresponding calculation on the numerical value corresponding to the audit data type using the calculation formula comprises:
Using the calculation formula, performing the corresponding calculation on the numerical value in the second format corresponding to the audit data type.
6. An auditing system, characterized in that the auditing system comprises a memory and a processor, the memory includes an auditing method program, and the auditing method program, when executed by the processor, implements the following steps:
Obtaining audit data text;
Performing word segmentation on the audit data text to obtain a word segmentation result; obtaining an unexamined data set according to the word segmentation result, the unexamined data set comprising at least one group of unexamined data, and the unexamined data comprising an audit data type and its corresponding numerical value;
Determining a corresponding calculation formula according to the audit data type; and, using the calculation formula, performing the corresponding calculation on the numerical value corresponding to the audit data type to obtain an audit result.
7. The auditing system according to claim 6, characterized in that, after the unexamined data set is obtained, the method further comprises:
Determining a preset audit rule, querying the unexamined data set according to the audit rule, determining the unexamined data that meets the audit rule, and determining a target unexamined data set from the unexamined data that meets the audit rule, for use in the audit.
8. The auditing system according to claim 7, characterized in that determining the unexamined data that meets the audit rule comprises:
Determining the characteristic of the unexamined data in the unexamined data set;
Matching the characteristic of the unexamined data against the matching characteristic in the audit rule, and determining the matching degree between the characteristic of the unexamined data and the matching characteristic;
When the matching degree is determined to be higher than a preset threshold, determining that the unexamined data meets the audit rule.
9. The auditing system according to claim 8, characterized in that determining the matching degree between the characteristic of the unexamined data and the matching characteristic comprises:
Calculating the matching degree between the characteristic of the unexamined data and the matching characteristic with the following formula:
Wherein, m is the matching degree of identical characters between the characteristic of the unexamined data and the matching characteristic; t is the edit distance between the matching sequence of the characteristic of the unexamined data and the matching sequence of the matching characteristic; |SA| and |SB| are the string lengths of the characteristic of the unexamined data and of the matching characteristic, respectively;
Wherein, the calculation formula of m is as follows:
Wherein, N is the total number of characters successfully matched between the characteristic of the unexamined data and the matching characteristic, and Δ(A, i+1, i, B) is the position interval, within the matching characteristic, between the characters corresponding to the (i+1)-th character and the i-th character of the matching sequence of the characteristic of the unexamined data.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium includes an auditing method program which, when executed by a processor, implements the steps of the auditing method according to any one of claims 1 to 5.
CN201910815699.XA 2019-08-30 2019-08-30 Audit method, system and readable storage medium Active CN110532302B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910815699.XA CN110532302B (en) 2019-08-30 2019-08-30 Audit method, system and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910815699.XA CN110532302B (en) 2019-08-30 2019-08-30 Audit method, system and readable storage medium

Publications (2)

Publication Number Publication Date
CN110532302A (en) 2019-12-03
CN110532302B (en) 2024-01-19

Family

ID=68665619

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910815699.XA Active CN110532302B (en) 2019-08-30 2019-08-30 Audit method, system and readable storage medium

Country Status (1)

Country Link
CN (1) CN110532302B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070157224A1 (en) * 2005-12-23 2007-07-05 Jean-Francois Pouliot Method and system for automated auditing of advertising
CN106446076A (en) * 2016-09-07 2017-02-22 南京理工大学 Hierarchical clustering-based log audit method
CN106503102A (en) * 2016-10-17 2017-03-15 汉蓝(北京)科技有限公司 A kind of search engine formula audit analysis method
CN109598484A (en) * 2018-12-04 2019-04-09 广东电网有限责任公司 A kind of project under construction turns fixed assets number auditing method and device
CN109726272A (en) * 2018-12-20 2019-05-07 杭州数梦工场科技有限公司 Audit regulation recommended method and device
CN109741029A (en) * 2018-12-27 2019-05-10 广东电网有限责任公司 The building method and device in a kind of power grid enterprises' audit regulation storehouse

Also Published As

Publication number Publication date
CN110532302B (en) 2024-01-19

Similar Documents

Publication Publication Date Title
CN110020422B (en) Feature word determining method and device and server
CN108170792B (en) Question and answer guiding method and device based on artificial intelligence and computer equipment
WO2019085236A1 (en) Search intention recognition method and apparatus, and electronic device and readable storage medium
US8949204B2 (en) Efficient development of a rule-based system using crowd-sourcing
CN106919575B (en) Application program searching method and device
CN104298679A (en) Application service recommendation method and device
CN110909540B (en) Method and device for identifying new words of short message spam and electronic equipment
CN104199965A (en) Semantic information retrieval method
CN103279478A (en) Method for extracting features based on distributed mutual information documents
CN105389341A (en) Text clustering and analysis method for repeating caller work orders of customer service calls
CN104866511A (en) Method and equipment for adding multi-media files
CN109740642A (en) Invoice category recognition methods, device, electronic equipment and readable storage medium storing program for executing
CN111061837A (en) Topic identification method, device, equipment and medium
CN113268615A (en) Resource label generation method and device, electronic equipment and storage medium
CN111782793A (en) Intelligent customer service processing method, system and equipment
CN106919588A (en) A kind of application program search system and method
CN113449753B (en) Service risk prediction method, device and system
CN116151220A (en) Word segmentation model training method, word segmentation processing method and device
CN112287111B (en) Text processing method and related device
CN109697224B (en) Bill message processing method, device and storage medium
CN114461783A (en) Keyword generation method and device, computer equipment, storage medium and product
CN114328800A (en) Text processing method and device, electronic equipment and computer readable storage medium
CN113505117A (en) Data quality evaluation method, device, equipment and medium based on data indexes
CN113392920A (en) Method, apparatus, device, medium, and program product for generating cheating prediction model
CN110399617A (en) Audit data processing method, system and readable storage medium storing program for executing

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant