CN117455699A - Automatic identification billing algorithm model based on machine learning - Google Patents

Automatic identification billing algorithm model based on machine learning Download PDF

Info

Publication number
CN117455699A
CN117455699A CN202311453325.0A CN202311453325A CN117455699A CN 117455699 A CN117455699 A CN 117455699A CN 202311453325 A CN202311453325 A CN 202311453325A CN 117455699 A CN117455699 A CN 117455699A
Authority
CN
China
Prior art keywords
billing
bill
automatic identification
model
identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311453325.0A
Other languages
Chinese (zh)
Inventor
陈煜明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Technical Institute of Electronics and Information
Original Assignee
Shanghai Technical Institute of Electronics and Information
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Technical Institute of Electronics and Information filed Critical Shanghai Technical Institute of Electronics and Information
Priority to CN202311453325.0A priority Critical patent/CN117455699A/en
Publication of CN117455699A publication Critical patent/CN117455699A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/12Accounting
    • G06Q40/125Finance or payroll
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Finance (AREA)
  • Human Resources & Organizations (AREA)
  • Data Mining & Analysis (AREA)
  • Accounting & Taxation (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Software Systems (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Computational Linguistics (AREA)
  • Operations Research (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Databases & Information Systems (AREA)
  • Technology Law (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Development Economics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to the technical field of automatic identification and accounting, and discloses an automatic identification and accounting algorithm model based on machine learning.

Description

Automatic identification billing algorithm model based on machine learning
Technical Field
The invention relates to the technical field of automatic identification and billing, in particular to an automatic identification and billing algorithm model based on machine learning.
Background
With the development of scientific technology, the identification billing converts text or image information provided by a user into structured billing data, and the text or the provided bill pictures input by the user are analyzed and identified through text analysis, pattern matching, computer vision and image processing technologies, so that the workload of manual billing is reduced, the billing efficiency is improved, and human errors and omission are reduced.
The application publication number CN111428599A discloses a bill identification method, a bill identification device and bill identification equipment, and the bill identification method in the scheme comprises the following steps: acquiring an image of a bill to be identified; inputting the image of the bill to be identified into a pre-trained bill classification model to obtain the type of the bill to be identified; acquiring a corresponding relation between the type of at least one bill stored in advance and a corresponding bill model; searching a bill model corresponding to the type of the bill to be identified from the corresponding relation; identifying the image of the bill to be identified based on the found bill model to obtain bill information of the bill to be identified; the method avoids possible billing errors caused by manual billing, saves the input time and improves the accuracy and the working efficiency.
The application publication number CN115578192a discloses a refund billing method, a processor, a computer device and a machine-readable storage medium, the method comprising: acquiring fund information of a client remittance; determining a target client to which the funds belong according to the fund information; obtaining a repayment model of a target customer; determining an order repayment plan, order overdue data and order repayment priority of a target customer according to the repayment model; distributing and confirming remittance funds of the target clients according to the order repayment plan, the order overdue data and the order repayment priority, and generating a repayment decomposition list; generating a financial accounting voucher according to the refund decomposition list to finish refund accounting; the method for billing the refund is systematic, online, automatic and transparent; the fund can automatically identify the affiliated clients, so that the fund recognition efficiency is improved; financial accounting is carried out automatically, so that the working efficiency is improved; and the fund use and the optimal allocation detail are automatically determined through the repayment model, so that manual checking operation is reduced, and the working quality is improved.
The problems presented in the background art exist in the above patents: only the diversity of bill identification is researched, the data dependency in the identification process is not considered, and the constructed various model decision processes are difficult to explain, so that the trust degree of users on the results is reduced. In order to solve the problem, the invention provides an automatic identification billing algorithm model based on machine learning.
Disclosure of Invention
This section is intended to outline some aspects of embodiments of the invention and to briefly introduce some preferred embodiments. Some simplifications or omissions may be made in this section as well as in the description summary and in the title of the application, to avoid obscuring the purpose of this section, the description summary and the title of the invention, which should not be used to limit the scope of the invention.
The present invention has been made in view of the above-mentioned problems with the prior art implementation of automatic recognition billing algorithm model based on machine learning.
It is therefore an object of the present invention to provide a machine learning based implementation of an automatic identification billing algorithm model.
In order to solve the technical problems, the invention provides the following technical scheme: collecting and collating billing data;
extracting features from the billing data;
stage classification is carried out on the characteristics, and an automatic identification billing model is constructed;
evaluating the automatically identified billing model;
perfecting the automatic identification billing model;
the automatic identification billing model is monitored periodically.
As a preferred solution for implementing the automatic identification billing algorithm model based on machine learning according to the present invention, the preferred solution is as follows: the billing process interaction flow of the billing data is as follows:
uploading the bills singly or in batches, and carrying out compliance examination on the bills;
if the bill is not compliant, returning the bill;
if the bill is compliant, carrying out authenticity verification;
if the bill information is not complete, prompting filling information, and carrying out authenticity verification after compliance;
if the bill is not true, returning the bill;
if the bill is true, checking the bill again;
if the bill is repeated, returning the bill;
if the bill is not repeated, comparing the bill with a bank statement;
if the bank statement has a balance record, carrying out grading calculation according to the record of the statement;
if the bank has no balance record on the bill, the bank automatically recognizes the bill to generate the balance record, and then carries out grading calculation according to the record of the bill.
As a preferred solution for implementing the automatic identification billing algorithm model based on machine learning according to the present invention, the preferred solution is as follows: and extracting different features according to the data name, wherein the extraction rule is as follows:
if the data name is a bank transaction class, extracting the data name as a bank statement;
if the data name is a value-added tax invoice class, extracting the data name as a value-added tax invoice;
and if the data name is other special bill types, extracting the data name as other bill types.
As a preferred solution for implementing the automatic identification billing algorithm model based on machine learning according to the present invention, the preferred solution is as follows: the rules of the phase classification are as follows:
if the characteristic is a value-added tax invoice, direction one identification, bill name identification and semantic one identification are carried out;
if the characteristics are other types of notes, direction one recognition and note name recognition are carried out;
wherein the direction one identification comprises an entry bill and a pin bill, and the semantic one identification comprises a name and a remark;
if the direction I is identified as an entry bill, department identification and enterprise type identification are carried out;
if the direction I is identified as the sales item bill, carrying out enterprise type identification;
the department identification comprises a production department, a technical department, a management department and a business department, and the enterprise type identification comprises a general tax payer and a small-scale tax payer;
if the characteristic is a bank statement, performing direction two recognition and semantic two recognition;
the second direction identification comprises a borrower and a lender, and the second semantic identification comprises a purpose, a abstract and a remark.
As a preferred solution for implementing the automatic identification billing algorithm model based on machine learning according to the present invention, the preferred solution is as follows: according to the stage classification, the constructed automatic identification billing model is as follows:
wherein KA represents an automatic recognition billing model function, c represents a constant, h n A fitting function representing the phase classification, λ representing the optimal coefficients of the fitting function, N representing the billing data, N representing the total number of billing data;
wherein, the function expression of the fitting function is as follows:
h n =min[min∑(n-a 1 ) 2 +min∑(n-a 2 ) 2 +min∑(n-a 3 ) 2 ];
in the formula, h n Fitting functions representing the phase classifications, n representing the billing data, a 1 Representing the value added tax invoice, a 2 Representing said other types of notes, a 3 Representing the bank statement.
As a preferred solution for implementing the automatic identification billing algorithm model based on machine learning according to the present invention, the preferred solution is as follows: and carrying out evaluation calculation on the automatic identification billing model, wherein the calculation formula is as follows:
wherein AS represents the automatic identification billing modelEvaluation result, a 1 Representing the value added tax invoice, a 2 Representing said other types of notes, a 3 Representing the bank statement, KA represents an automatically identifying billing model function, and n represents the billing data.
As a preferred solution for implementing the automatic identification billing algorithm model based on machine learning according to the present invention, the preferred solution is as follows: and perfecting the automatic identification billing model to enhance the interpretability of the model, wherein the perfected function expression is as follows:
ξ(n)=∑κ n (KA-KA * ) 2
where ζ (n) represents the refined result of the auto-id billing model, n represents the billing data, κ n Indicating the proximity of the classification result and the actual result of the billing data stage, KA indicates an automatic identification billing model function, KA * An interpretable auto-id billing model function is represented.
As a preferred solution for implementing the automatic identification billing algorithm model based on machine learning according to the present invention, the preferred solution is as follows: monitoring the automatic identification billing model, wherein the monitored function expression is as follows:
M=argmax[ρ(KA)];
wherein M represents the monitoring result of the automatic identification billing model, ρ (KA) represents the online learning function of the automatic identification billing model, and KA represents the automatic identification billing model function.
A computer device comprising, a memory for storing instructions; and a processor for executing the instructions to cause the device to perform implementing an automatic identification billing algorithm model based on machine learning.
A computer readable storage medium having stored thereon a computer program which, when executed by the processor, implements a machine learning based implementation of an automatic identification billing algorithm model.
The invention has the beneficial effects that: according to the invention, by collecting and arranging the billing data, solving the data dependency, ensuring the effectiveness of data collection, extracting the characteristics of the billing data, classifying the characteristics in stages, constructing the automatic identification billing model, realizing the accurate identification and automatic identification billing of the billing data, evaluating the automatic identification billing model, verifying the generalization capability of the automatic identification billing model, improving the performance of the automatic identification billing model, perfecting the automatic identification billing model, enhancing the interpretability of the automatic identification billing model, periodically monitoring the automatic identification billing model, finding out the condition of the performance degradation of the automatic identification billing model in time and making corrective measures in time.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the description of the embodiments will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art. Wherein:
FIG. 1 is a diagram of steps in a method for implementing an automatic identification billing algorithm model based on machine learning in accordance with the present invention;
FIG. 2 is a flow chart of the billing process interaction described by the automatic identification billing algorithm model based on machine learning implementation of the present invention.
Detailed Description
In order that the above-recited objects, features and advantages of the present invention will become more readily apparent, a more particular description of the invention will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, but the present invention may be practiced in other ways other than those described herein, and persons skilled in the art will readily appreciate that the present invention is not limited to the specific embodiments disclosed below.
Further, reference herein to "one embodiment" or "an embodiment" means that a particular feature, structure, or characteristic can be included in at least one implementation of the invention. The appearances of the phrase "in one embodiment" in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments.
Further, in describing the embodiments of the present invention in detail, the cross-sectional view of the device structure is not partially enlarged to a general scale for convenience of description, and the schematic is only an example, which should not limit the scope of protection of the present invention. In addition, the three-dimensional dimensions of length, width and depth should be included in actual fabrication.
Example 1
In this embodiment, an overall structure schematic diagram of an automatic identification billing algorithm model based on machine learning is provided, as shown in fig. 1, and the automatic identification billing algorithm model based on machine learning includes:
s1, collecting and sorting billing data.
Collecting and sorting data comprising bank statements and notes, ensuring the quality and integrity of the data according to an accounting processing interaction flow, solving the data dependence, ensuring the effectiveness of data collection, wherein the accounting processing interaction flow is shown in fig. 2, and the specific flow steps are as follows:
uploading the bills singly or in batches, and carrying out compliance examination on the bills;
if the bill is not compliant, returning the bill;
if the bill is qualified, carrying out authenticity verification;
if the bill information is not complete, prompting filling information, and carrying out authenticity verification after compliance;
if the bill is not true, returning the bill;
if the bill is true, checking the bill again;
if the bill is repeated, returning the bill;
if the bill is not repeated, comparing the bill with the bank statement;
if the bank has a balance record on the bill, carrying out grading calculation according to the record of the bill;
if the bank has no balance record on the bill, firstly automatically identifying the bill to generate the balance record, and then carrying out grading calculation according to the record of the bill;
the compliance examination is to check whether the ticket information is complete, whether the ticket name is consistent with the enterprise name, whether the invoice is provided with a seal, whether the invoice is out of date, and whether the invoice is not aligned; the authenticity check refers to calling an invoice verification API interface, checking whether the ticket information is consistent with verification data, and returning inconsistent, and not allowing accounting; checking the bill according to four factors of bill code, bill number and amount according to the date of making out the bill, checking whether all the bills in the database have repeated account, and accumulating automatically when the bills are not repeated but are the same as a plurality of bills in a unit; the bank account bill comparison means that all bank transaction records of a database are checked according to three elements of the name of an issuing unit, the name of a reimbursement account person and the name of an item, and whether the bank account bill has the same unit, the same name and the same item name; the hierarchical calculation means calculation according to different levels, the levels including that the total amount of the ticket is equal to the pre-receipt prepaid balance, the total amount of the ticket is greater than the pre-receipt prepaid balance, and the total amount of the ticket is less than the pre-receipt prepaid balance; the automatic identification and accounting means that natural language processing identification is carried out according to five major elements of direction, bill name, semantics, department and special case;
in the specific application, 2000 pieces of data are collected and arranged, compliance examination, authenticity verification, bill check and bank account checking comparison are respectively carried out on the 2000 pieces of data according to the accounting processing interaction flow, the 23 pieces of data with incomplete information are manually filled in, and 6 pieces of data of the same company are automatically accumulated.
S2, extracting features of the billing data.
And extracting different characteristics according to the data name, wherein the extraction rule is as follows:
if the data name is a bank transaction class, extracting the data name as a bank statement;
if the data name is a value-added tax invoice, extracting the data name as a value-added tax invoice;
if the data name is other special bill types, extracting the data name as other bill types;
extracting features from the data, wherein all field information of each banking transaction is one piece of data, all field information of each bill is one piece of data, and each piece of data is bound with a data name;
the data names are bank transaction types including bank statement collection, bank statement payment, initial balance of the home period and end balance of the home period, value tax invoice types including value tax general invoice and value tax special invoice, and other special bill types including plane ticket, train ticket, bus ticket, ship ticket, taxi ticket, road bridge ticket, fuel ticket, business acceptance draft, bank acceptance draft, rating invoice, administrative department legal ticket, administrative department commodity inspection receipt, administrative department fine receipt, donation unified ticket, administrative department non-tax admission ticket, administrative department unified charge receipt, administrative department school education charge receipt, administrative department tax receipt, administrative hospital charge ticket, property maintenance fund special ticket, administrative fund bill, motor vehicle sales unified invoice, second hand vehicle sales unified invoice, insurance fund special invoice, utility ticket, scenic region entrance ticket and business sales invoice;
in the specific application, 2000 pieces of data are subjected to feature extraction, wherein 1160 pieces of banking transaction data, 480 pieces of value-added tax invoice and 360 pieces of other special notes are subjected to conversion according to the requirements, and the final required result is output.
S3, classifying the characteristics in stages, and constructing an automatic identification billing model.
The rules of the phase classification are as follows:
if the characteristic is a value-added tax invoice, direction one identification, bill name identification and semantic one identification are carried out;
if the characteristics are other types of notes, direction one recognition and note name recognition are carried out;
the direction one identification comprises an entry bill and a sales bill, and the semantic one identification comprises a name and remarks;
if the direction one is identified as an entry bill, department identification and enterprise type identification are carried out;
if the direction one is identified as the sales item bill, carrying out enterprise type identification;
the department identification comprises a production department, a technical department, a management department and a business department, and the enterprise type identification comprises a general tax payer and a small-scale tax payer;
if the characteristics are bank statement, direction two recognition and semantic two recognition are carried out;
the direction two identification comprises a borrower and a lender, and the semantic two identification comprises a use, a abstract and a remark;
the ticket name identification comprises the classification of the data names, and is sequentially value-added tax common invoice, value-added tax special invoice, plane ticket, train ticket, automobile ticket, ship ticket, taxi ticket, road and bridge ticket, fuel ticket, business acceptance draft, bank acceptance draft, rating invoice, administrative court litigation ticket, administrative commodity inspection receipt, administrative fine non-fine receipt, donation unified ticket, administrative non-tax income ticket, administrative unified charge receipt, administrative school education charge receipt, administrative tax fine receipt, administrative hospital charge ticket, property maintenance fund special ticket, administrative fund exchange ticket, motor vehicle sales unified invoice, second hand vehicle sales unified invoice, insurance fund special invoice, public utility payment ticket, scenic spot ticket and business sales invoice;
according to the stage classification, the automatic identification billing model is constructed as follows:
wherein KA represents an automatic recognition billing model function, c represents a constant, h n A fitting function representing the phase classification, λ representing the optimal coefficient of the fitting function, N representing billing data, N representing the total number of billing data;
wherein, the function expression of the fitting function is as follows:
h n =min[min∑(n-a 1 ) 2 +min∑(n-a 2 ) 2 +min∑(n-a 3 ) 2 ];
in the formula, h n Fitting function representing phase classification, n representing billing data, a 1 Representing value-added tax invoice, a 2 Representing other kinds of notes, a 3 Representing a bank statement;
according to the stage classification, the billing data are classified, a decision tree is constructed, and accurate identification and automatic identification billing of the billing data are realized by using a powerful classifier of the decision tree.
In the specific application, there are 1160 bank statements, 480 value-added tax invoices and 360 other kinds of invoices, a decision tree is constructed through classification rules, and other billing data are automatically identified and billed by using the decision tree.
S4, evaluating an automatic identification billing model.
And (3) carrying out evaluation calculation on the automatic identification billing model, wherein the calculation formula is as follows:
wherein AS represents the evaluation result of the automatic identification billing model, a 1 Representing value-added tax invoice, a 2 Representing other kinds of notes, a 3 Representing a bank statement, KA represents an automatic identification billing model function, and n represents billing data;
after the constructed automatic identification billing model is trained by utilizing a large amount of data, model evaluation is needed, so that the effects of verifying the generalization capability of the model, improving the model performance and feeding back the data quality can be achieved, the evaluation result of the automatic identification billing model is calculated, the value range is [0,1], and the larger the value is, the more the classification result is matched with the real situation;
in the specific application, the evaluation result of the automatic identification billing model is calculated to be 0.92, and the classification result is proved to be very consistent with the real situation, and meanwhile, the classification result is checked, so that the real situation is basically consistent.
S5, perfecting an automatic identification billing model.
The automatic identification billing model is perfected, the interpretability of the model is enhanced, and a perfected functional expression is as follows:
ξ(n)=∑κ n (KA-KA * ) 2
where ζ (n) represents the perfected result of the automatic identification billing model, n represents billing data, κ n Indicating the proximity of the classification result to the actual result in the billing data stage, KA indicating an automatic recognition billing model function, KA * Representing an interpretable auto-recognition billing model function;
the interpretation and the credibility of the automatic identification billing model need to be interpreted by using a related algorithm, the automatic identification billing model is interpreted by using the formula, the interpretation and the credibility of the automatic identification billing model are enhanced, the decision reasons of stage classification are given for each billing data, the acceptability and the applicability of the model are enhanced, and the effect and the sustainable development of machine learning in practical application are improved;
s6, periodically monitoring and automatically identifying an accounting model.
Monitoring the automatic identification billing model, wherein the monitored function expression is as follows:
M=argmax[ρ(KA)];
wherein M represents a monitoring result of the automatic identification billing model, ρ (KA) represents an online learning function of the automatic identification billing model, and KA represents an automatic identification billing model function;
the performance of the automatic identification billing model in actual application changes along with the change of time, the condition of the performance reduction of the automatic identification billing model can be timely found by periodically monitoring the automatic identification billing model, corresponding measures are taken to correct and optimize the model, feedback information of a user can be collected, the problem of the automatic identification billing model under specific conditions is found, timely repair and improvement are carried out, the accuracy and stability of the automatic identification billing model are improved, and meanwhile, the automatic identification billing model can be analyzed for controllability and interpretability;
in the specific application, when the evaluation result of the automatic identification billing model is lower than 0.7, a manager is reminded to correct and optimize the automatic identification billing model, meanwhile, the automatic identification billing model is interpreted by combining the perfection result of the automatic identification billing model, and the interpretability and the credibility of the automatic identification billing model are enhanced.
Example 2
In this embodiment, a computer device is provided that includes a memory for storing instructions and a processor for executing the instructions to cause the computer device to perform steps for implementing an automatic identification billing algorithm model based on machine learning as described above.
Example 3
In this embodiment, a computer readable storage medium is provided, on which a computer program is stored, which when executed, implements a method for implementing an automatic identification billing algorithm model based on machine learning as described above.
The computer readable storage medium may include: various media capable of storing program codes, such as a U disk, a mobile hard disk, a read-only memory, a random access memory, a magnetic disk or an optical disk.
It should be noted that the above embodiments are only for illustrating the technical solution of the present invention and not for limiting the same, and although the present invention has been described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that the technical solution of the present invention may be modified or substituted without departing from the spirit and scope of the technical solution of the present invention, which is intended to be covered in the scope of the claims of the present invention.

Claims (10)

1. An automatic identification billing algorithm model based on machine learning is realized, which is characterized in that: comprising the steps of (a) a step of,
collecting and collating billing data;
extracting features from the billing data;
stage classification is carried out on the characteristics, and an automatic identification billing model is constructed;
evaluating the automatically identified billing model;
perfecting the automatic identification billing model;
the automatic identification billing model is monitored periodically.
2. The machine-learning based implementation of an automatic identification billing algorithm model of claim 1 wherein: the billing process interaction flow of the billing data is as follows:
uploading the bills singly or in batches, and carrying out compliance examination on the bills;
if the bill is not compliant, returning the bill;
if the bill is compliant, carrying out authenticity verification;
if the bill information is not complete, prompting filling information, and carrying out authenticity verification after compliance;
if the bill is not true, returning the bill;
if the bill is true, checking the bill again;
if the bill is repeated, returning the bill;
if the bill is not repeated, comparing the bill with a bank statement;
if the bank statement has a balance record, carrying out grading calculation according to the record of the statement;
if the bank has no balance record on the bill, the bank automatically recognizes the bill to generate the balance record, and then carries out grading calculation according to the record of the bill.
3. The machine-learning based implementation of an automatic identification billing algorithm model of claim 2 wherein: and extracting different features according to the data name, wherein the extraction rule is as follows:
if the data name is a bank transaction class, extracting the data name as a bank statement;
if the data name is a value-added tax invoice class, extracting the data name as a value-added tax invoice;
and if the data name is other special bill types, extracting the data name as other bill types.
4. A machine learning based implementation of an automatic identification billing algorithm model of claim 3 wherein: the rules of the phase classification are as follows:
if the characteristic is a value-added tax invoice, direction one identification, bill name identification and semantic one identification are carried out;
if the characteristics are other types of notes, direction one recognition and note name recognition are carried out;
wherein the direction one identification comprises an entry bill and a pin bill, and the semantic one identification comprises a name and a remark;
if the direction I is identified as an entry bill, department identification and enterprise type identification are carried out;
if the direction I is identified as the sales item bill, carrying out enterprise type identification;
the department identification comprises a production department, a technical department, a management department and a business department, and the enterprise type identification comprises a general tax payer and a small-scale tax payer;
if the characteristic is a bank statement, performing direction two recognition and semantic two recognition;
the second direction identification comprises a borrower and a lender, and the second semantic identification comprises a purpose, a abstract and a remark.
5. The machine-learning based implementation of an automatic identification billing algorithm model of claim 4 wherein: according to the stage classification, the constructed automatic identification billing model is as follows:
wherein KA represents an automatic recognition billing model function, c represents a constant, h n A fitting function representing the phase classification, λ representing the optimal coefficients of the fitting function, N representing the billing data, N representing the total number of billing data;
wherein, the function expression of the fitting function is as follows:
h n =min[min∑(n-a 1 ) 2 +min∑(n-a 2 ) 2 +min∑(n-a 3 ) 2 ];
in the formula, h n Fitting functions representing the phase classifications, n representing the billing data, a 1 Representing the value added tax invoice, a 2 Representing said other types of notes, a 3 Representing the bank statement.
6. The machine-learning based implementation of an automatic identification billing algorithm model of claim 5 wherein: and carrying out evaluation calculation on the automatic identification billing model, wherein the calculation formula is as follows:
wherein AS represents the evaluation result of the automatic identification billing model, a 1 Representing the value added tax invoice, a 2 Representing said other types of notes, a 3 Representing the bank statement, KA represents an automatically identifying billing model function, and n represents the billing data.
7. The machine-learning based implementation of an automatic identification billing algorithm model of claim 6 wherein: and perfecting the automatic identification billing model to enhance the interpretability of the model, wherein the perfected function expression is as follows:
ξ(n)=∑κ n (KA-KA * ) 2
where ζ (n) represents the refined result of the auto-id billing model, n represents the billing data, κ n Indicating the proximity of the classification result and the actual result of the billing data stage, KA indicates an automatic identification billing model function, KA * An interpretable auto-id billing model function is represented.
8. The machine-learning based implementation of an automatic identification billing algorithm model of claim 7 wherein: monitoring the automatic identification billing model, wherein the monitored function expression is as follows:
M=argmax[ρ(KA)];
wherein M represents the monitoring result of the automatic identification billing model, ρ (KA) represents the online learning function of the automatic identification billing model, and KA represents the automatic identification billing model function.
9. A computer device, characterized by: comprising the steps of (a) a step of,
a memory for storing instructions;
a processor for executing the instructions to cause the device to perform the steps of implementing a machine learning based implementation of an automatic identification billing algorithm model as claimed in any of claims 1 to 8.
10. A computer-readable storage medium having stored thereon a computer program, characterized by: the computer program, when executed by said processor, implements a method for implementing an automatic identification billing algorithm model based on machine learning as claimed in any one of claims 1 to 8.
CN202311453325.0A 2023-11-02 2023-11-02 Automatic identification billing algorithm model based on machine learning Pending CN117455699A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311453325.0A CN117455699A (en) 2023-11-02 2023-11-02 Automatic identification billing algorithm model based on machine learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311453325.0A CN117455699A (en) 2023-11-02 2023-11-02 Automatic identification billing algorithm model based on machine learning

Publications (1)

Publication Number Publication Date
CN117455699A true CN117455699A (en) 2024-01-26

Family

ID=89583255

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311453325.0A Pending CN117455699A (en) 2023-11-02 2023-11-02 Automatic identification billing algorithm model based on machine learning

Country Status (1)

Country Link
CN (1) CN117455699A (en)

Similar Documents

Publication Publication Date Title
Bellini IFRS 9 and CECL Credit Risk Modelling and Validation: A Practical Guide with Examples Worked in R and SAS
Finlay Credit scoring, response modeling, and insurance rating: a practical guide to forecasting consumer behavior
US20120011041A1 (en) Post bankruptcy pattern and transaction detection and recovery apparatus and method
CN109739844B (en) Data classification method based on attenuation weight
KR102246782B1 (en) System for Debt Repayment Capability Evaluation Of Corporation Including Evaluation Model Based On AI Using Commerce Data
Cao et al. Bond rating using support vector machine
Ruyu et al. A comparison of credit rating classification models based on spark-evidence from lending-club
Barua et al. Swindle: Predicting the probability of loan defaults using catboost algorithm
CN112037039A (en) Loan assessment method and device
Ahmed Robust drivers of Bitcoin price movements: an extreme bounds analysis
Hu Predicting and improving invoice-to-cash collection through machine learning
CN117114812A (en) Financial product recommendation method and device for enterprises
CN117455699A (en) Automatic identification billing algorithm model based on machine learning
CN115827798A (en) Bill management method established based on industry characteristics
Del Pilar et al. Towards the Improvement of Credit Card Approval Process Using Classification Algorithm
JP2015125534A (en) Transaction history data analysis system and transaction history data analysis method
CN114331105A (en) Electronic draft processing system, method, electronic device and storage medium
Zand Towards intelligent risk-based customer segmentation in banking
US20180137578A1 (en) System and method for prediction of deduction claim success based on an analysis of electronic documents
CN111932368B (en) Credit card issuing system and construction method and device thereof
KR102249015B1 (en) Calculation System for Corporate Debt Payment Capability
Zaytsev Selection and evaluation of relevant predictors for credit scoring in peer-to-peer lending with random forest based methods
CN113888143B (en) Bill data processing method and device and storage medium
Budd Modelling credit card usage for individual card-holders
Plaksiy et al. The National Research Nuclear University MEPHI (Moscow Engineering Physics Institute), 31 Kashirskoye shosse, Moscow, Russia kirillplaksiy@ gmail. com, andreinikiforov993@ gmail. com, NGMiloslavskaya@ mephi. ru

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication