CN108960223B - Method for automatically generating voucher based on intelligent bill identification - Google Patents

Method for automatically generating voucher based on intelligent bill identification Download PDF

Info

Publication number
CN108960223B
CN108960223B CN201810483413.8A CN201810483413A CN108960223B CN 108960223 B CN108960223 B CN 108960223B CN 201810483413 A CN201810483413 A CN 201810483413A CN 108960223 B CN108960223 B CN 108960223B
Authority
CN
China
Prior art keywords
voucher
template
bill
temporary
key information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810483413.8A
Other languages
Chinese (zh)
Other versions
CN108960223A (en
Inventor
张欢欢
尚友新
张浦铭
郑红伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dajingfang Network Technology Co.,Ltd.
Original Assignee
Beijing Dazhangfang Network Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Dazhangfang Network Technology Co ltd filed Critical Beijing Dazhangfang Network Technology Co ltd
Priority to CN201810483413.8A priority Critical patent/CN108960223B/en
Publication of CN108960223A publication Critical patent/CN108960223A/en
Application granted granted Critical
Publication of CN108960223B publication Critical patent/CN108960223B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/12Accounting
    • G06Q40/123Tax preparation or submission
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Finance (AREA)
  • Theoretical Computer Science (AREA)
  • Accounting & Taxation (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Marketing (AREA)
  • Human Resources & Organizations (AREA)
  • Development Economics (AREA)
  • Operations Research (AREA)
  • Tourism & Hospitality (AREA)
  • Data Mining & Analysis (AREA)
  • Quality & Reliability (AREA)
  • Multimedia (AREA)
  • Technology Law (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
  • Character Input (AREA)

Abstract

The invention provides a method for automatically generating a voucher based on intelligent identification of a bill, which comprises the following steps: s1, uploading the bill for intelligent recognition, extracting key information, classifying the bill according to the extracted key information, and associating and storing the key information and the bill; s2, presetting a voucher template database in the voucher template system, wherein the sources of the voucher template mainly comprise manual generation, system generation and temporary template conversion generation; s3, matching the templates to generate a certificate; and S4, manually processing the temporary voucher. The invention sorts and stores the electronic data acquired through intelligent recognition, establishes an index relationship, generates accurate and effective voucher information according to classification and matching voucher templates, and associates the voucher information with the original bill, thereby reducing the workload of accountants, improving the working efficiency and freeing the accountants from complicated invoice oceans.

Description

Method for automatically generating voucher based on intelligent bill identification
Technical Field
The invention relates to the field of intelligent recognition of bills, in particular to a method for automatically generating a voucher based on intelligent recognition of the bills.
Background
In the existing electronic information system, the method for processing the bills to generate the vouchers is to manually arrange the original bills through accounting and then compile the vouchers in the system, so that the method has the disadvantages of large workload, easy error occurrence, need of multiple checking by accounting and great influence on the working efficiency. Especially in some middle or small enterprises, accounting document handling problems are very likely to occur due to lack of excellent accounting.
With the continuous innovation and popularization of internet technology, artificial intelligence gains more and more attention in the computer field, and more industries begin to use artificial intelligence to create value for enterprises and get more and more extensive application in bill identification. The automatic identification of the bill layout can be realized by the code scanning identification of the bill based on the artificial intelligence technology (intelligent identification), so that the breakthrough progress is achieved, the change of the bill content into electronic data is possible, and the purpose of intelligent accounting is achieved. However, how to convert the identified electronic data into accurate and effective financial data becomes a new problem for processing the identified electronic data.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a method for automatically generating a voucher based on intelligent identification of the voucher, which arranges and stores electronic data acquired through intelligent identification, establishes an index relationship, generates accurate and effective voucher information according to classification and matching voucher templates, and associates the voucher information with an original voucher, thereby reducing the workload of accountants, improving the working efficiency and freeing the accountants from complicated invoice oceans.
Specifically, the invention provides a method for automatically generating a voucher based on intelligent identification of a bill, which comprises the following steps:
s1, in the voucher template system, presetting a voucher template to form a voucher template database, wherein the source of the voucher template mainly comprises system generation, manual generation and temporary template conversion, the method for generating the voucher template by the system is to create thousands of voucher templates according to the types, commodity attributes, abstracts and coins and introduce the voucher templates into the voucher template system, the method for manually generating the voucher template is to supplement and add special voucher templates by users according to own business rules during the use process, and the method for generating the voucher template by the temporary template conversion is to convert the temporary template during the use process into the voucher template;
s2, the bills are uploaded to be intelligently identified, key information is extracted, the bills are classified according to the extracted key information, the key information is related to the bills and stored in a voucher template system to form a key information database, and the specific method for associating the key information with the bills comprises the following steps: uploading an invoice, generating a unique bill identification code in the uploading process, intelligently identifying the invoice, extracting key information from the invoice, corresponding the key information and the unique bill identification code one by one, and storing the key information in a voucher template, wherein the key information comprises bill types, a payee, a payer, invoice detail and amount;
s3, matching the template to generate a certificate, and specifically comprising the following steps:
matching the electronic data of the bill with the voucher template, and generating a final voucher through the voucher template: firstly, extracting key information according to point data of a bill, wherein the key information comprises a bill type and company information, then determining the type of the bill and the data environment of the bill according to the key information, carrying out retrieval comparison on the key information database generated in the step S1 based on the type of the bill and the data environment of the bill, and selecting a corresponding voucher template from a voucher template database in the step S2 as a final voucher according to a comparison result;
s4, manual processing and manual intervention: manually processing the temporary certificate, and if the certificate information is correct, adjusting the temporary certificate into a formal certificate; if the certificate information is not accurate, the certificate is modified to generate a formal certificate. After generating the formal voucher, generating a new voucher template according to the electronic information of the voucher and the subject of the voucher;
and S5, marking the ticket generating the voucher template as identified.
Preferably, the credential template database is preset with a first similarity threshold and a second similarity threshold.
Preferably, the searching and comparing in step S3 specifically includes the following steps:
s31, matching similarity of characters: performing character matching on the extracted key information, enabling the key information to form a new character string, comparing the new character string with existing key information in a key information database of the certificate template system to confirm the similarity of the new character string and selecting the certificate template corresponding to the key information with the highest similarity stored in the system as a standby template;
s32, comparing the similarity obtained in step S31 with a first similarity threshold and a second similarity threshold, if the similarity is higher than the first similarity threshold, selecting the standby template as the final certificate,
if the similarity is lower than the first similarity threshold and higher than or equal to the second similarity threshold, generating a temporary template and a temporary voucher from the electronic data, classifying the temporary template and the temporary voucher as a first type, and proceeding to step S4;
if the similarity is below the second similarity threshold, a temporary template and temporary voucher, which need to be marked, are also generated from the electronic data, and are classified as a second type of temporary template and temporary voucher, and the process proceeds to step 4.
Preferably, the character matching rule in step S31 is to convert each chinese character into a new character string according to pinyin, quadrangle code, font and stroke number.
Preferably, the specific method for generating the credential template by the system in step S2 is as follows: the system generates a voucher template according to the bill type, the commodity attribute, the abstract and the currency, extracts the information of the voucher template to be matched with the information of the voucher template in the system after generating a formal voucher template, ignores the voucher template generated by the information if the matching is successful, and generates a new voucher template for storage if the matching is unsuccessful.
Preferably, the concrete method for generating the voucher template by the temporary voucher template in the step S2 is as follows: the temporary certificate template is derived from the temporary certificate, the temporary certificate is manually adjusted, a formal certificate is generated after the adjustment is finished, whether the temporary certificate template is deleted or not is inquired to a user, if yes, the temporary template is deleted, and if not, the temporary template is converted into a new certificate template to be stored according to the generated formal certificate.
Preferably, the information of the voucher template comprises subject information, summary information and ticket information.
Preferably, in step S3, the specific method for extracting the key information according to the point data of the bill is to classify the bill by intelligent identification, and assemble the key information corresponding to the classification according to the classification information defined by the system.
Preferably, the first similarity threshold is 85-95% and the second similarity threshold is 65-75% in step S32.
Compared with the prior art, the invention has the following beneficial effects:
the invention sorts and stores the electronic data acquired through intelligent recognition, establishes an index relationship, generates accurate and effective voucher information according to classification and matching voucher templates, and associates the voucher information with the original bill, thereby reducing the workload of accountants, improving the working efficiency and freeing the accountants from complicated invoice oceans.
The accountant can generate a voucher template in the process of identifying the bill, directly call the voucher template aiming at the bill type in the later work, continuously learn to generate a new voucher template in the process of identifying the bill and expand the voucher template in the system template library.
The invention provides a new method idea for generating a certificate, which solves the barriers bothering technical personnel in the field, adds artificial intelligence into a method for accounting personnel to make the certificate, and increases the working efficiency.
Drawings
FIG. 1 is a schematic flow chart of the present invention;
FIG. 2 is a schematic illustration of ticket identification categories in an embodiment of the present invention;
FIG. 3 is a diagram of a matched credential template in an embodiment of the present invention;
FIG. 4 is a schematic illustration of a marked invoice in an embodiment of the invention;
fig. 5 is a schematic diagram of storing ticket information in an embodiment of the invention.
Detailed Description
Exemplary embodiments, features and aspects of the present invention will be described in detail below with reference to the accompanying drawings. In the drawings, like reference numbers can indicate functionally identical or similar elements. While the various aspects of the embodiments are presented in drawings, the drawings are not necessarily drawn to scale unless specifically indicated.
Specifically, the present invention provides a method for automatically generating a voucher based on intelligent ticket identification, as shown in fig. 1, comprising the following steps:
and S1, in the voucher template system, presetting a voucher template to form a voucher template database, wherein the sources of the voucher template mainly comprise system generation, manual generation and temporary template conversion.
The method for generating the voucher template by the system is to create thousands of voucher templates according to bill types, commodity attributes, abstracts and coins and introduce the voucher templates into the voucher template system, the method for manually generating the voucher templates is to supplement and add special voucher templates according to own business rules by general users in the using process, and the method for generating the voucher templates by the temporary template conversion is to convert the temporary templates in the using process into the voucher templates.
Preferably, the specific method for generating the credential template by the system in step S1 is as follows: the system generates a voucher template according to the bill type, the commodity attribute, the abstract and the currency, extracts the information of the voucher template to be matched with the information of the voucher template in the system after generating a formal voucher template, ignores the voucher template generated by the bill if the matching is successful, and generates a new voucher template for storage if the matching is unsuccessful.
Preferably, the information of the voucher template comprises subject information, summary information and ticket information. The voucher template can be a sales voucher, a purchase voucher or other voucher, etc. required by the accountant.
Preferably, the concrete method for generating the voucher template by the temporary voucher template in the step S1 is as follows: the temporary certificate template is derived from the temporary certificate, the temporary certificate is manually adjusted, a formal certificate is generated after the adjustment is finished, whether a user deletes the temporary certificate template or not is inquired, if yes, the temporary template is deleted, if not, the temporary template can be converted into a new certificate template according to the generated formal certificate and stored, the step of generating the new certificate template is the same as the step of generating the certificate template by the system, after the formal certificate template is generated, the information of the certificate template is extracted and matched with the information of the certificate template in the system, if the matching is successful, the certificate template generated by the system is ignored, and if the matching is unsuccessful, a new certificate template is generated and stored.
And S2, uploading the bills for intelligent recognition, extracting key information, classifying the bills according to the extracted key information, and enabling the key information to be associated with the bills and stored in the voucher template system to form a key information database.
The specific method for associating the key information with the bill is as follows: uploading an invoice, generating a unique bill identification code in the uploading process, intelligently identifying the invoice, extracting key information from the invoice, carrying out one-to-one correspondence with the unique bill identification code, and storing the key information and the unique bill identification code in a voucher template corresponding to the bill type.
The key information includes the type of the ticket, the payee, the payer, the invoice particulars and the amount.
When the intelligent bill identification system is used specifically, bills are uploaded to the system, intelligent identification is conducted on the bills after the bills are uploaded, the key information of the bills can be obtained in the intelligent bill identification process, the key information is classified according to the key information of the bills obtained through identification, the bills are classified, the key information and the bills are associated and stored in the voucher template system, and a key information database is formed. The credential template may be a sales template or an entry template, etc.
The intelligent recognition of the bill can adopt a mode of mixed scanning of the bill to extract key information of the bill, and the steps of mixed scanning are as follows:
1. after learning various types of bills, the intelligent identification system stores key information of the various types of bills, identifies different key information of the various types of bills and defines key words for bank bills, machine-issued bills and quota invoices, establishes a bill key information database through continuous learning and storage in the bill scanning process, wherein the bill key information database comprises an identification sequence list, a key word list, a key information list and a corresponding bill type list, and the key word list, the key information list and the corresponding bill type list are in one-to-one correspondence.
Specifically, the bill key information database is described in the following table:
Figure BDA0001666191430000051
the specific learning process is to scan a large number of bills, distinguish the key information of the bills, associate the key information of the bills with the actual bill types, and define keywords for certain specific invoices, such as bank bills, machine-issued bills and quota invoices, wherein the keywords are defined in the learning process of the invoices of the types, correspond the keywords with the key information, and when the invoices are identified, the required key information can be extracted from the keywords as long as the keywords can be scanned and picked up. In other words, the keywords defined for some tickets contain the required keyword information, and as long as the keywords can be scanned, the keyword information contained in the keywords can be obtained. The learning of the database is based on a large number of scans, and in practical application, the list can be directly defined and embedded into the database or more types of invoice types can be added into the database.
2. Various mixed bills are scanned into electronic images through a scanner, the electronic images are uploaded to an intelligent recognition system to obtain keywords, and the intelligent recognition system automatically recognizes and corrects inclined and rotating pictures. The electronic version image may be a color image or a black and white image.
3. Comparing the obtained electronic domain image with stored key information or key words according to the scanned information to obtain the bill type of the bill, wherein the comparison sequence is performed according to the sequence of the identification sequence list, if the bill type is a value-added tax invoice, the inspection is performed, if the inspection is successful, the inspection result is returned to the intelligent identification terminal to be displayed, and if the inspection is failed, the invoice is classified as an inspection error class; and if the bill type is other than the value-added tax invoice, directly returning the invoice type of the invoice to the intelligent identification terminal for displaying, and if the invoice type of the invoice cannot be identified, classifying the invoice with the unidentifiable invoice type into an unidentifiable invoice and returning an identification result. The value-added tax invoice comprises a value-added tax common invoice, a roll invoice, a motor vehicle invoice, a value-added tax common electronic invoice and a value-added tax special invoice.
The method comprises the steps of scanning an obtained electronic domain image to obtain information, wherein the information obtained by scanning is a previously defined keyword or key information, positioning a two-dimensional code of a scanned invoice, analyzing the content stored in the two-dimensional code to obtain the information hidden in the two-dimensional code, comparing the information according to a corresponding sequence, and judging the invoice type of the invoice.
Preferably, step 3 specifically comprises the following steps:
31. extracting key information directly from the obtained electronic domain image, if the key information can be extracted directly, comparing the scanned key information with key information columns of value-added tax ordinary invoices, roll invoices, motor vehicle invoices or value-added tax special invoices in a key information list stored in a bill key information database, if the invoices belong to the value-added tax ordinary invoices, the roll invoices, the motor vehicle invoices or the value-added tax special invoices, checking, if the checking is successful, returning the invoice type and the key information corresponding to the invoice type, and if the checking is failed, classifying the invoice as a checking error class and returning an identification result; if the key information cannot be directly extracted, extracting the key words, acquiring the key information corresponding to the key words according to the extracted key words, and entering the step 32;
32. comparing the extracted keywords with the keyword column of the bank bill in the keyword list stored in the bill keyword information database, if the invoice belongs to the bank bill, identifying the keyword information contained in the keywords according to the keywords, returning the bill type and the corresponding keyword information, and if the invoice does not belong to the bank bill, entering step 33;
33. comparing the extracted keywords with the keyword column of the machine invoice stored in the keyword list in the bill keyword information database, if the invoice belongs to the machine invoice, identifying the keyword information contained in the keywords according to the keywords, returning the bill type and the corresponding keyword information, and if the invoice does not belong to the machine invoice, entering step 34;
34. comparing the extracted keywords with the keyword column of the quota invoice in the keyword list stored in the bill keyword information database, if the invoice belongs to the quota invoice, identifying the keyword information contained in the keywords according to the keywords, returning the bill type and the corresponding keyword information, and if the invoice does not belong to the quota invoice, entering step 35;
35. if the invoice type of the invoice cannot be identified, classifying the invoice with the invoice type not identified into a non-identifiable class and returning an identification result.
4. The invoices which cannot be identified or are checked by the tax administration wrongly are subjected to secondary identification after image processing, and the image processing method is determined according to the specific reasons which cannot be identified and specifically comprises the steps of locking the position of key information, and cutting blocks, eliminating red marks, removing lines or performing machine learning training on incomplete numbers according to the coordinates of pixel points.
S3, matching the template to generate a certificate, and specifically comprising the following steps:
matching the electronic data of the bill with the voucher template, and generating a final voucher through the voucher template: firstly, key information is extracted according to point data of the bill, the point data of the bill is obtained by scanning a two-dimensional code of the bill, the key information comprises the bill type and company information, then the type of the bill and the data environment of the bill are determined according to the key information, retrieval and comparison are carried out on the key information database generated in the step S1 according to the type of the bill and the data environment of the bill, and a corresponding voucher template is selected from the voucher template database in the step S2 as a final voucher according to a comparison result.
Preferably, in step S3, the specific method for extracting the key information according to the point data of the bill is to classify the bill by intelligent identification, and assemble the key information corresponding to the classification according to the classification information defined by the system.
In the intelligent identification process of the bill, the data classification and the key information of the data can be determined. With this information the template is then matched.
Preferably, the searching and comparing in step S3 specifically includes the following steps:
s31, matching similarity of characters: and performing character matching on the extracted key information, forming a new character string by the key information, comparing the new character string with the existing key information in a key information database of the certificate template system to confirm the similarity of the new character string and selecting the certificate template corresponding to the key information with the highest similarity stored in the system as a standby template.
And S32, comparing the similarity obtained in the step S31 with a first similarity threshold and a second similarity threshold, and if the similarity is higher than the first similarity threshold, selecting the standby template as the final certificate.
If the electronic data is lower than the first similarity threshold and higher than or equal to the second similarity threshold, a provisional template and a provisional voucher, which are classified as a first type provisional template and a provisional voucher, are generated from the electronic data, and the process proceeds to step S4.
If the similarity is below the second similarity threshold, a temporary template and temporary voucher, which need to be marked, are also generated from the electronic data, and are classified as a second type of temporary template and temporary voucher, and the process proceeds to step 4.
Preferably, the character matching rule in step S31 is to convert each chinese character into a new character string according to pinyin, quadrangle code, font and stroke number.
Preferably, the first similarity threshold is 85-95% and the second similarity threshold is 65-75% in step S32. When the system is initially used, the first similarity threshold is 85%, the second similarity threshold is 65%, and the system continuously learns in the process of continuously identifying the generated certificates, and the first similarity threshold and the second similarity threshold also increase along with the generated number of the certificates.
S4, manual processing and manual intervention: manually processing the temporary certificate, and if the certificate information is correct, adjusting the temporary certificate into a formal certificate; if the certificate information is not accurate, the certificate is modified to generate a formal certificate. And after the formal voucher is generated, generating a new voucher template according to the electronic information of the voucher and the subject of the voucher.
The first type of temporary template and the temporary voucher are complete, the accuracy of matching information such as subjects and abstracts is high, and a user can modify a small part of information. The accuracy of the second type of temporary template and temporary voucher information is generally low, and more places need to be modified, so that the user needs to adjust subject, abstract and even amount information.
And S5, marking the ticket generating the voucher template as identified.
Detailed description of the preferred embodiment
Firstly, uploading bills, such as third-class bank receipt bills, through an online accounting voucher;
and step two, after the bill uploading is finished, browsing and uploading the bill in the system.
And step three, identifying the bill information through intelligent identification. Classification is already completed in the identification process, and specific identified categories can be referred to in fig. 2.
Step four, the generated certificate data is matched with the certificate template through the identified classification information, and the matched certificate template is shown in figure 3.
And step five, identifying the bill after generating the voucher as recognized, and marking the result as shown in figure 4.
And step six, extracting key information from the invoice, performing one-to-one correspondence between the key information and the unique identification code of the bill, and storing the key information and the unique identification code of the bill in a voucher template corresponding to the bill type, as shown in fig. 5.
The invention sorts and stores the electronic data acquired through intelligent recognition, establishes an index relationship, generates accurate and effective voucher information according to classification and matching voucher templates, and associates the voucher information with the original bill, thereby reducing the workload of accountants, improving the working efficiency and freeing the accountants from complicated invoice oceans.
The accountant can generate a voucher template in the process of identifying the bill, directly call the voucher template aiming at the bill type in the later work, continuously learn to generate a new voucher template in the process of identifying the bill and expand the voucher template in the system template library.
Finally, it should be noted that: the above-mentioned embodiments are only used for illustrating the technical solution of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (8)

1. A method for automatically generating a voucher based on intelligent identification of a bill is characterized in that: which comprises the following steps:
s1, in the voucher template system, presetting a voucher template to form a voucher template database, wherein the source of the voucher template mainly comprises system generation, manual generation and temporary template conversion, the method for generating the voucher template by the system is to create thousands of voucher templates according to bill types, commodity attributes, abstracts and coins and introduce the voucher templates into the voucher template system, the method for manually generating the voucher template is to supplement and add special voucher templates according to own business rules during the use process and store the special voucher templates in the voucher template system, and the method for generating the voucher template by the temporary template conversion is to convert the temporary template during the use process into the voucher template and store the voucher template in the voucher template system;
s2, the bills are uploaded to be intelligently identified, key information is extracted, the bills are classified according to the extracted key information, the key information is related to the bills and stored in a voucher template system to form a key information database, and the specific method for associating the key information with the bills comprises the following steps: uploading an invoice, generating a unique bill identification code in the uploading process, intelligently identifying the invoice, extracting key information from the invoice, enabling the key information and the unique bill identification code to be in one-to-one correspondence, and directly nesting and storing the key information and the unique bill identification code in a voucher template, wherein the key information comprises bill types, a payee, a payer, invoice drawing details and money amount;
s3, matching the template to generate a certificate, and specifically comprising the following steps:
matching the electronic data of the bill with the voucher template, and generating a final voucher through the voucher template: extracting key information according to point data of a bill, wherein the point data of the bill is obtained by scanning a two-dimensional code of the bill, the key information extracted according to the point data of the bill comprises a bill type and company information, then determining the type of the bill and the data environment of the bill according to the key information, retrieving and comparing the key information database generated in the step S2 according to the type of the bill and the data environment of the bill, and selecting a corresponding voucher template from the voucher template database in the step S1 as a final voucher according to a comparison result;
the retrieval comparison specifically comprises the following steps:
s31, matching similarity of characters: performing character matching on the extracted key information, enabling the key information to form a new character string, comparing the new character string with existing key information in a key information database of the certificate template system to confirm the similarity of the new character string and selecting the certificate template corresponding to the key information with the highest similarity stored in the system as a standby template;
s32, comparing the similarity obtained in step S31 with a first similarity threshold and a second similarity threshold, if the similarity is higher than the first similarity threshold, selecting the standby template as the final certificate,
if the similarity is lower than the first similarity threshold and higher than or equal to the second similarity threshold, generating a temporary template and a temporary voucher from the electronic data, classifying the temporary template and the temporary voucher as a first type, and proceeding to step S4;
if the similarity is lower than the second similarity threshold, generating a temporary template and a temporary certificate for the electronic data, but marking the temporary template and the temporary certificate, classifying the temporary template and the temporary certificate into a second type of temporary template and temporary certificate, and entering step S4;
s4, manual processing and manual intervention: manually processing the temporary certificate, and if the certificate information is correct, adjusting the temporary certificate into a formal certificate; if the certificate information is not accurate, modifying the certificate to generate a formal certificate; after generating the formal voucher, generating a new voucher template according to the electronic information of the voucher and the subject of the voucher;
and S5, marking the ticket generating the voucher template as identified.
2. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: a first similarity threshold and a second similarity threshold are preset in the voucher template database.
3. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the character matching rule in step S31 is to convert each chinese character into a new character string according to pinyin, four-corner code, font and stroke number.
4. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the specific method for generating the voucher template by the system in the step S1 is as follows: the system generates a voucher template according to the bill type, the commodity attribute, the abstract and the currency, extracts the information of the voucher template to be matched with the information of the voucher template in the system after generating a formal voucher template, ignores the voucher template generated by the information if the matching is successful, and generates a new voucher template for storage if the matching is unsuccessful.
5. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the concrete method for generating the voucher template by the temporary voucher template in the step S1 is as follows: the temporary certificate template is derived from the temporary certificate, the temporary certificate is manually adjusted, a formal certificate is generated after the adjustment is finished, whether the temporary certificate template is deleted or not is inquired to a user, if yes, the temporary template is deleted, and if not, the temporary template is converted into a new certificate template to be stored according to the generated formal certificate.
6. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the information of the voucher template comprises subject information, summary information and ticket information.
7. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the specific method for extracting the key information according to the point data of the bill in the step S3 is to classify the bill by intelligently identifying the obtained information, and assemble the key information corresponding to the classification according to the classification information defined by the system.
8. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: in step S32, the first similarity threshold is 85% to 95%, and the second similarity threshold is 65% to 75%.
CN201810483413.8A 2018-05-18 2018-05-18 Method for automatically generating voucher based on intelligent bill identification Active CN108960223B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810483413.8A CN108960223B (en) 2018-05-18 2018-05-18 Method for automatically generating voucher based on intelligent bill identification

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810483413.8A CN108960223B (en) 2018-05-18 2018-05-18 Method for automatically generating voucher based on intelligent bill identification

Publications (2)

Publication Number Publication Date
CN108960223A CN108960223A (en) 2018-12-07
CN108960223B true CN108960223B (en) 2020-10-30

Family

ID=64499292

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810483413.8A Active CN108960223B (en) 2018-05-18 2018-05-18 Method for automatically generating voucher based on intelligent bill identification

Country Status (1)

Country Link
CN (1) CN108960223B (en)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109636557A (en) * 2018-12-11 2019-04-16 厦门商集网络科技有限责任公司 A kind of intelligent classification bookkeeping methods and equipment based on bank slip recognition
CN109858016A (en) * 2018-12-20 2019-06-07 航天信息股份有限公司 A kind of business credential match method
CN109783790A (en) * 2019-01-23 2019-05-21 国网山东省电力公司济宁供电公司 One kind is secondary to pacify ticket generation method and the system of arranging
CN110245656B (en) * 2019-05-10 2021-02-02 上海果藤互联网金融信息服务有限公司 Bill operation management method and system
CN110210470B (en) * 2019-06-05 2023-06-23 复旦大学 Commodity information image recognition system
CN110399463A (en) * 2019-07-29 2019-11-01 国网河北省电力有限公司 The Similarity Match Method and device of work ticket
CN110399851B (en) * 2019-07-30 2022-02-15 广东工业大学 Image processing device, method, equipment and readable storage medium
CN110516664A (en) * 2019-08-16 2019-11-29 咪咕数字传媒有限公司 Bill identification method and device, electronic equipment and storage medium
CN110765749A (en) * 2019-09-12 2020-02-07 深圳微企宝计算机系统有限公司 Method for intelligently generating certificate
CN110851677A (en) * 2019-11-18 2020-02-28 深圳春沐源控股有限公司 Reimbursement certificate processing method, device, terminal and computer readable storage medium
CN111178238B (en) * 2019-12-25 2024-09-20 未鲲(上海)科技服务有限公司 Certificate testing method, device, equipment and computer readable storage medium
CN111210328A (en) * 2019-12-31 2020-05-29 航天信息股份有限公司 Voucher generation method and device, storage medium and electronic equipment
CN111275037B (en) * 2020-01-09 2021-06-08 上海知达教育科技有限公司 Bill identification method and device
CN111401002A (en) * 2020-03-11 2020-07-10 山东浪潮通软信息科技有限公司 Method, device and computer storage medium for automatically identifying PDF electronic receipt information
CN111666885A (en) * 2020-06-08 2020-09-15 成都知识视觉科技有限公司 Template construction and matching method for medical document structured knowledge extraction
CN111680983A (en) * 2020-06-15 2020-09-18 山东理工职业学院 Automatic accounting document generating device for database
CN111783703B (en) * 2020-07-07 2024-01-26 常州市第三人民医院 Intelligent recognition and electronic certificate automatic generation system and method for paper bill
JP2022127766A (en) * 2021-02-22 2022-09-01 京セラドキュメントソリューションズ株式会社 Information generating system, workflow system, information generating program, and workflow program
CN115311651B (en) * 2022-10-12 2023-08-08 泰安协同软件有限公司 Real estate certificate data acquisition and arrangement method
CN116561602B (en) * 2023-07-10 2023-09-19 三峡高科信息技术有限责任公司 Automatic sales material matching method for sales cost transfer

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014052572A1 (en) * 2012-09-28 2014-04-03 Order Inn, Inc. Method and system for offering combinations of goods and services for purchase and controlling expenses
CN103761599A (en) * 2013-12-23 2014-04-30 远光软件股份有限公司 Method and device for generating compensating vouchers of internal transaction businesses
CN105095842A (en) * 2014-05-22 2015-11-25 阿里巴巴集团控股有限公司 Method and device for identifying information of bill
CN107423731A (en) * 2017-04-06 2017-12-01 云南小鹰科技有限公司 The data processing method and system of aviation document

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020143621A1 (en) * 2001-03-30 2002-10-03 Donnelly Dennis P. System and method for transferring credits as an incentive for prompt payment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014052572A1 (en) * 2012-09-28 2014-04-03 Order Inn, Inc. Method and system for offering combinations of goods and services for purchase and controlling expenses
CN103761599A (en) * 2013-12-23 2014-04-30 远光软件股份有限公司 Method and device for generating compensating vouchers of internal transaction businesses
CN105095842A (en) * 2014-05-22 2015-11-25 阿里巴巴集团控股有限公司 Method and device for identifying information of bill
CN107423731A (en) * 2017-04-06 2017-12-01 云南小鹰科技有限公司 The data processing method and system of aviation document

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
The Research of Output and Authentication Methods of Digital Bills Based on the Fusion of Identity Information;Wang Hong等;《IEEE》;20141208;第1084-1088页 *
具有QoS约束的语义Web服务发现的研究;李春梅等;《计算机科学》;20070630;第34卷(第6期);第116-121页 *

Also Published As

Publication number Publication date
CN108960223A (en) 2018-12-07

Similar Documents

Publication Publication Date Title
CN108960223B (en) Method for automatically generating voucher based on intelligent bill identification
CN107067044B (en) Financial reimbursement complete ticket intelligent auditing system
US9639751B2 (en) Property record document data verification systems and methods
US8897563B1 (en) Systems and methods for automatically processing electronic documents
Embley et al. Table-processing paradigms: a research survey
CN101292259B (en) Method and system for image matching in a mixed media environment
CN106485243B (en) A kind of bank slip recognition error correction method and device
US8064703B2 (en) Property record document data validation systems and methods
US20050289182A1 (en) Document management system with enhanced intelligent document recognition capabilities
US8108764B2 (en) Document recognition using static and variable strings to create a document signature
US20070065011A1 (en) Method and system for collecting data from a plurality of machine readable documents
US11436852B2 (en) Document information extraction for computer manipulation
CN110334640A (en) A kind of ticket processing method and system
JP2004334339A (en) Information processor, information processing method, and storage medium, and program
CN115828874A (en) Industry table digital processing method based on image recognition technology
CN116092231A (en) Ticket identification method, ticket identification device, terminal equipment and storage medium
CN111860450A (en) Ticket recognition device and ticket information management system
CN112668335B (en) Method for identifying and extracting business license structured information by using named entity
US20070217691A1 (en) Property record document title determination systems and methods
CN111241955B (en) Bill information extraction method and system
Scius-Bertrand et al. Annotation-free keyword spotting in historical Vietnamese manuscripts using graph matching
US20240143632A1 (en) Extracting information from documents using automatic markup based on historical data
Xu et al. A mobile financial reimbursement information collection system based on OCR
JP2005208872A (en) Image processing system
Li et al. Steel Delivery Order Recognition Based on Deep Learning and Posterior Error Correction Technology

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address

Address after: 501-018, floor 5, No. 15, wanquanzhuang Road, Haidian District, Beijing 100089

Patentee after: Dajingfang Network Technology Co.,Ltd.

Address before: 100000 405, No. 15, wanquanzhuang Road, Haidian District, Beijing

Patentee before: BEIJING DAZHANGFANG NETWORK TECHNOLOGY Co.,Ltd.

CP03 Change of name, title or address