CN108960223B - Method for automatically generating voucher based on intelligent bill identification - Google Patents
Method for automatically generating voucher based on intelligent bill identification Download PDFInfo
- Publication number
- CN108960223B CN108960223B CN201810483413.8A CN201810483413A CN108960223B CN 108960223 B CN108960223 B CN 108960223B CN 201810483413 A CN201810483413 A CN 201810483413A CN 108960223 B CN108960223 B CN 108960223B
- Authority
- CN
- China
- Prior art keywords
- voucher
- template
- bill
- temporary
- key information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/12—Accounting
- G06Q40/123—Tax preparation or submission
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Strategic Management (AREA)
- General Physics & Mathematics (AREA)
- Finance (AREA)
- Theoretical Computer Science (AREA)
- Accounting & Taxation (AREA)
- Physics & Mathematics (AREA)
- Economics (AREA)
- General Business, Economics & Management (AREA)
- Entrepreneurship & Innovation (AREA)
- Marketing (AREA)
- Human Resources & Organizations (AREA)
- Development Economics (AREA)
- Operations Research (AREA)
- Tourism & Hospitality (AREA)
- Data Mining & Analysis (AREA)
- Quality & Reliability (AREA)
- Multimedia (AREA)
- Technology Law (AREA)
- Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
- Character Input (AREA)
Abstract
The invention provides a method for automatically generating a voucher based on intelligent identification of a bill, which comprises the following steps: s1, uploading the bill for intelligent recognition, extracting key information, classifying the bill according to the extracted key information, and associating and storing the key information and the bill; s2, presetting a voucher template database in the voucher template system, wherein the sources of the voucher template mainly comprise manual generation, system generation and temporary template conversion generation; s3, matching the templates to generate a certificate; and S4, manually processing the temporary voucher. The invention sorts and stores the electronic data acquired through intelligent recognition, establishes an index relationship, generates accurate and effective voucher information according to classification and matching voucher templates, and associates the voucher information with the original bill, thereby reducing the workload of accountants, improving the working efficiency and freeing the accountants from complicated invoice oceans.
Description
Technical Field
The invention relates to the field of intelligent recognition of bills, in particular to a method for automatically generating a voucher based on intelligent recognition of the bills.
Background
In the existing electronic information system, the method for processing the bills to generate the vouchers is to manually arrange the original bills through accounting and then compile the vouchers in the system, so that the method has the disadvantages of large workload, easy error occurrence, need of multiple checking by accounting and great influence on the working efficiency. Especially in some middle or small enterprises, accounting document handling problems are very likely to occur due to lack of excellent accounting.
With the continuous innovation and popularization of internet technology, artificial intelligence gains more and more attention in the computer field, and more industries begin to use artificial intelligence to create value for enterprises and get more and more extensive application in bill identification. The automatic identification of the bill layout can be realized by the code scanning identification of the bill based on the artificial intelligence technology (intelligent identification), so that the breakthrough progress is achieved, the change of the bill content into electronic data is possible, and the purpose of intelligent accounting is achieved. However, how to convert the identified electronic data into accurate and effective financial data becomes a new problem for processing the identified electronic data.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a method for automatically generating a voucher based on intelligent identification of the voucher, which arranges and stores electronic data acquired through intelligent identification, establishes an index relationship, generates accurate and effective voucher information according to classification and matching voucher templates, and associates the voucher information with an original voucher, thereby reducing the workload of accountants, improving the working efficiency and freeing the accountants from complicated invoice oceans.
Specifically, the invention provides a method for automatically generating a voucher based on intelligent identification of a bill, which comprises the following steps:
s1, in the voucher template system, presetting a voucher template to form a voucher template database, wherein the source of the voucher template mainly comprises system generation, manual generation and temporary template conversion, the method for generating the voucher template by the system is to create thousands of voucher templates according to the types, commodity attributes, abstracts and coins and introduce the voucher templates into the voucher template system, the method for manually generating the voucher template is to supplement and add special voucher templates by users according to own business rules during the use process, and the method for generating the voucher template by the temporary template conversion is to convert the temporary template during the use process into the voucher template;
s2, the bills are uploaded to be intelligently identified, key information is extracted, the bills are classified according to the extracted key information, the key information is related to the bills and stored in a voucher template system to form a key information database, and the specific method for associating the key information with the bills comprises the following steps: uploading an invoice, generating a unique bill identification code in the uploading process, intelligently identifying the invoice, extracting key information from the invoice, corresponding the key information and the unique bill identification code one by one, and storing the key information in a voucher template, wherein the key information comprises bill types, a payee, a payer, invoice detail and amount;
s3, matching the template to generate a certificate, and specifically comprising the following steps:
matching the electronic data of the bill with the voucher template, and generating a final voucher through the voucher template: firstly, extracting key information according to point data of a bill, wherein the key information comprises a bill type and company information, then determining the type of the bill and the data environment of the bill according to the key information, carrying out retrieval comparison on the key information database generated in the step S1 based on the type of the bill and the data environment of the bill, and selecting a corresponding voucher template from a voucher template database in the step S2 as a final voucher according to a comparison result;
s4, manual processing and manual intervention: manually processing the temporary certificate, and if the certificate information is correct, adjusting the temporary certificate into a formal certificate; if the certificate information is not accurate, the certificate is modified to generate a formal certificate. After generating the formal voucher, generating a new voucher template according to the electronic information of the voucher and the subject of the voucher;
and S5, marking the ticket generating the voucher template as identified.
Preferably, the credential template database is preset with a first similarity threshold and a second similarity threshold.
Preferably, the searching and comparing in step S3 specifically includes the following steps:
s31, matching similarity of characters: performing character matching on the extracted key information, enabling the key information to form a new character string, comparing the new character string with existing key information in a key information database of the certificate template system to confirm the similarity of the new character string and selecting the certificate template corresponding to the key information with the highest similarity stored in the system as a standby template;
s32, comparing the similarity obtained in step S31 with a first similarity threshold and a second similarity threshold, if the similarity is higher than the first similarity threshold, selecting the standby template as the final certificate,
if the similarity is lower than the first similarity threshold and higher than or equal to the second similarity threshold, generating a temporary template and a temporary voucher from the electronic data, classifying the temporary template and the temporary voucher as a first type, and proceeding to step S4;
if the similarity is below the second similarity threshold, a temporary template and temporary voucher, which need to be marked, are also generated from the electronic data, and are classified as a second type of temporary template and temporary voucher, and the process proceeds to step 4.
Preferably, the character matching rule in step S31 is to convert each chinese character into a new character string according to pinyin, quadrangle code, font and stroke number.
Preferably, the specific method for generating the credential template by the system in step S2 is as follows: the system generates a voucher template according to the bill type, the commodity attribute, the abstract and the currency, extracts the information of the voucher template to be matched with the information of the voucher template in the system after generating a formal voucher template, ignores the voucher template generated by the information if the matching is successful, and generates a new voucher template for storage if the matching is unsuccessful.
Preferably, the concrete method for generating the voucher template by the temporary voucher template in the step S2 is as follows: the temporary certificate template is derived from the temporary certificate, the temporary certificate is manually adjusted, a formal certificate is generated after the adjustment is finished, whether the temporary certificate template is deleted or not is inquired to a user, if yes, the temporary template is deleted, and if not, the temporary template is converted into a new certificate template to be stored according to the generated formal certificate.
Preferably, the information of the voucher template comprises subject information, summary information and ticket information.
Preferably, in step S3, the specific method for extracting the key information according to the point data of the bill is to classify the bill by intelligent identification, and assemble the key information corresponding to the classification according to the classification information defined by the system.
Preferably, the first similarity threshold is 85-95% and the second similarity threshold is 65-75% in step S32.
Compared with the prior art, the invention has the following beneficial effects:
the invention sorts and stores the electronic data acquired through intelligent recognition, establishes an index relationship, generates accurate and effective voucher information according to classification and matching voucher templates, and associates the voucher information with the original bill, thereby reducing the workload of accountants, improving the working efficiency and freeing the accountants from complicated invoice oceans.
The accountant can generate a voucher template in the process of identifying the bill, directly call the voucher template aiming at the bill type in the later work, continuously learn to generate a new voucher template in the process of identifying the bill and expand the voucher template in the system template library.
The invention provides a new method idea for generating a certificate, which solves the barriers bothering technical personnel in the field, adds artificial intelligence into a method for accounting personnel to make the certificate, and increases the working efficiency.
Drawings
FIG. 1 is a schematic flow chart of the present invention;
FIG. 2 is a schematic illustration of ticket identification categories in an embodiment of the present invention;
FIG. 3 is a diagram of a matched credential template in an embodiment of the present invention;
FIG. 4 is a schematic illustration of a marked invoice in an embodiment of the invention;
fig. 5 is a schematic diagram of storing ticket information in an embodiment of the invention.
Detailed Description
Exemplary embodiments, features and aspects of the present invention will be described in detail below with reference to the accompanying drawings. In the drawings, like reference numbers can indicate functionally identical or similar elements. While the various aspects of the embodiments are presented in drawings, the drawings are not necessarily drawn to scale unless specifically indicated.
Specifically, the present invention provides a method for automatically generating a voucher based on intelligent ticket identification, as shown in fig. 1, comprising the following steps:
and S1, in the voucher template system, presetting a voucher template to form a voucher template database, wherein the sources of the voucher template mainly comprise system generation, manual generation and temporary template conversion.
The method for generating the voucher template by the system is to create thousands of voucher templates according to bill types, commodity attributes, abstracts and coins and introduce the voucher templates into the voucher template system, the method for manually generating the voucher templates is to supplement and add special voucher templates according to own business rules by general users in the using process, and the method for generating the voucher templates by the temporary template conversion is to convert the temporary templates in the using process into the voucher templates.
Preferably, the specific method for generating the credential template by the system in step S1 is as follows: the system generates a voucher template according to the bill type, the commodity attribute, the abstract and the currency, extracts the information of the voucher template to be matched with the information of the voucher template in the system after generating a formal voucher template, ignores the voucher template generated by the bill if the matching is successful, and generates a new voucher template for storage if the matching is unsuccessful.
Preferably, the information of the voucher template comprises subject information, summary information and ticket information. The voucher template can be a sales voucher, a purchase voucher or other voucher, etc. required by the accountant.
Preferably, the concrete method for generating the voucher template by the temporary voucher template in the step S1 is as follows: the temporary certificate template is derived from the temporary certificate, the temporary certificate is manually adjusted, a formal certificate is generated after the adjustment is finished, whether a user deletes the temporary certificate template or not is inquired, if yes, the temporary template is deleted, if not, the temporary template can be converted into a new certificate template according to the generated formal certificate and stored, the step of generating the new certificate template is the same as the step of generating the certificate template by the system, after the formal certificate template is generated, the information of the certificate template is extracted and matched with the information of the certificate template in the system, if the matching is successful, the certificate template generated by the system is ignored, and if the matching is unsuccessful, a new certificate template is generated and stored.
And S2, uploading the bills for intelligent recognition, extracting key information, classifying the bills according to the extracted key information, and enabling the key information to be associated with the bills and stored in the voucher template system to form a key information database.
The specific method for associating the key information with the bill is as follows: uploading an invoice, generating a unique bill identification code in the uploading process, intelligently identifying the invoice, extracting key information from the invoice, carrying out one-to-one correspondence with the unique bill identification code, and storing the key information and the unique bill identification code in a voucher template corresponding to the bill type.
The key information includes the type of the ticket, the payee, the payer, the invoice particulars and the amount.
When the intelligent bill identification system is used specifically, bills are uploaded to the system, intelligent identification is conducted on the bills after the bills are uploaded, the key information of the bills can be obtained in the intelligent bill identification process, the key information is classified according to the key information of the bills obtained through identification, the bills are classified, the key information and the bills are associated and stored in the voucher template system, and a key information database is formed. The credential template may be a sales template or an entry template, etc.
The intelligent recognition of the bill can adopt a mode of mixed scanning of the bill to extract key information of the bill, and the steps of mixed scanning are as follows:
1. after learning various types of bills, the intelligent identification system stores key information of the various types of bills, identifies different key information of the various types of bills and defines key words for bank bills, machine-issued bills and quota invoices, establishes a bill key information database through continuous learning and storage in the bill scanning process, wherein the bill key information database comprises an identification sequence list, a key word list, a key information list and a corresponding bill type list, and the key word list, the key information list and the corresponding bill type list are in one-to-one correspondence.
Specifically, the bill key information database is described in the following table:
the specific learning process is to scan a large number of bills, distinguish the key information of the bills, associate the key information of the bills with the actual bill types, and define keywords for certain specific invoices, such as bank bills, machine-issued bills and quota invoices, wherein the keywords are defined in the learning process of the invoices of the types, correspond the keywords with the key information, and when the invoices are identified, the required key information can be extracted from the keywords as long as the keywords can be scanned and picked up. In other words, the keywords defined for some tickets contain the required keyword information, and as long as the keywords can be scanned, the keyword information contained in the keywords can be obtained. The learning of the database is based on a large number of scans, and in practical application, the list can be directly defined and embedded into the database or more types of invoice types can be added into the database.
2. Various mixed bills are scanned into electronic images through a scanner, the electronic images are uploaded to an intelligent recognition system to obtain keywords, and the intelligent recognition system automatically recognizes and corrects inclined and rotating pictures. The electronic version image may be a color image or a black and white image.
3. Comparing the obtained electronic domain image with stored key information or key words according to the scanned information to obtain the bill type of the bill, wherein the comparison sequence is performed according to the sequence of the identification sequence list, if the bill type is a value-added tax invoice, the inspection is performed, if the inspection is successful, the inspection result is returned to the intelligent identification terminal to be displayed, and if the inspection is failed, the invoice is classified as an inspection error class; and if the bill type is other than the value-added tax invoice, directly returning the invoice type of the invoice to the intelligent identification terminal for displaying, and if the invoice type of the invoice cannot be identified, classifying the invoice with the unidentifiable invoice type into an unidentifiable invoice and returning an identification result. The value-added tax invoice comprises a value-added tax common invoice, a roll invoice, a motor vehicle invoice, a value-added tax common electronic invoice and a value-added tax special invoice.
The method comprises the steps of scanning an obtained electronic domain image to obtain information, wherein the information obtained by scanning is a previously defined keyword or key information, positioning a two-dimensional code of a scanned invoice, analyzing the content stored in the two-dimensional code to obtain the information hidden in the two-dimensional code, comparing the information according to a corresponding sequence, and judging the invoice type of the invoice.
Preferably, step 3 specifically comprises the following steps:
31. extracting key information directly from the obtained electronic domain image, if the key information can be extracted directly, comparing the scanned key information with key information columns of value-added tax ordinary invoices, roll invoices, motor vehicle invoices or value-added tax special invoices in a key information list stored in a bill key information database, if the invoices belong to the value-added tax ordinary invoices, the roll invoices, the motor vehicle invoices or the value-added tax special invoices, checking, if the checking is successful, returning the invoice type and the key information corresponding to the invoice type, and if the checking is failed, classifying the invoice as a checking error class and returning an identification result; if the key information cannot be directly extracted, extracting the key words, acquiring the key information corresponding to the key words according to the extracted key words, and entering the step 32;
32. comparing the extracted keywords with the keyword column of the bank bill in the keyword list stored in the bill keyword information database, if the invoice belongs to the bank bill, identifying the keyword information contained in the keywords according to the keywords, returning the bill type and the corresponding keyword information, and if the invoice does not belong to the bank bill, entering step 33;
33. comparing the extracted keywords with the keyword column of the machine invoice stored in the keyword list in the bill keyword information database, if the invoice belongs to the machine invoice, identifying the keyword information contained in the keywords according to the keywords, returning the bill type and the corresponding keyword information, and if the invoice does not belong to the machine invoice, entering step 34;
34. comparing the extracted keywords with the keyword column of the quota invoice in the keyword list stored in the bill keyword information database, if the invoice belongs to the quota invoice, identifying the keyword information contained in the keywords according to the keywords, returning the bill type and the corresponding keyword information, and if the invoice does not belong to the quota invoice, entering step 35;
35. if the invoice type of the invoice cannot be identified, classifying the invoice with the invoice type not identified into a non-identifiable class and returning an identification result.
4. The invoices which cannot be identified or are checked by the tax administration wrongly are subjected to secondary identification after image processing, and the image processing method is determined according to the specific reasons which cannot be identified and specifically comprises the steps of locking the position of key information, and cutting blocks, eliminating red marks, removing lines or performing machine learning training on incomplete numbers according to the coordinates of pixel points.
S3, matching the template to generate a certificate, and specifically comprising the following steps:
matching the electronic data of the bill with the voucher template, and generating a final voucher through the voucher template: firstly, key information is extracted according to point data of the bill, the point data of the bill is obtained by scanning a two-dimensional code of the bill, the key information comprises the bill type and company information, then the type of the bill and the data environment of the bill are determined according to the key information, retrieval and comparison are carried out on the key information database generated in the step S1 according to the type of the bill and the data environment of the bill, and a corresponding voucher template is selected from the voucher template database in the step S2 as a final voucher according to a comparison result.
Preferably, in step S3, the specific method for extracting the key information according to the point data of the bill is to classify the bill by intelligent identification, and assemble the key information corresponding to the classification according to the classification information defined by the system.
In the intelligent identification process of the bill, the data classification and the key information of the data can be determined. With this information the template is then matched.
Preferably, the searching and comparing in step S3 specifically includes the following steps:
s31, matching similarity of characters: and performing character matching on the extracted key information, forming a new character string by the key information, comparing the new character string with the existing key information in a key information database of the certificate template system to confirm the similarity of the new character string and selecting the certificate template corresponding to the key information with the highest similarity stored in the system as a standby template.
And S32, comparing the similarity obtained in the step S31 with a first similarity threshold and a second similarity threshold, and if the similarity is higher than the first similarity threshold, selecting the standby template as the final certificate.
If the electronic data is lower than the first similarity threshold and higher than or equal to the second similarity threshold, a provisional template and a provisional voucher, which are classified as a first type provisional template and a provisional voucher, are generated from the electronic data, and the process proceeds to step S4.
If the similarity is below the second similarity threshold, a temporary template and temporary voucher, which need to be marked, are also generated from the electronic data, and are classified as a second type of temporary template and temporary voucher, and the process proceeds to step 4.
Preferably, the character matching rule in step S31 is to convert each chinese character into a new character string according to pinyin, quadrangle code, font and stroke number.
Preferably, the first similarity threshold is 85-95% and the second similarity threshold is 65-75% in step S32. When the system is initially used, the first similarity threshold is 85%, the second similarity threshold is 65%, and the system continuously learns in the process of continuously identifying the generated certificates, and the first similarity threshold and the second similarity threshold also increase along with the generated number of the certificates.
S4, manual processing and manual intervention: manually processing the temporary certificate, and if the certificate information is correct, adjusting the temporary certificate into a formal certificate; if the certificate information is not accurate, the certificate is modified to generate a formal certificate. And after the formal voucher is generated, generating a new voucher template according to the electronic information of the voucher and the subject of the voucher.
The first type of temporary template and the temporary voucher are complete, the accuracy of matching information such as subjects and abstracts is high, and a user can modify a small part of information. The accuracy of the second type of temporary template and temporary voucher information is generally low, and more places need to be modified, so that the user needs to adjust subject, abstract and even amount information.
And S5, marking the ticket generating the voucher template as identified.
Detailed description of the preferred embodiment
Firstly, uploading bills, such as third-class bank receipt bills, through an online accounting voucher;
and step two, after the bill uploading is finished, browsing and uploading the bill in the system.
And step three, identifying the bill information through intelligent identification. Classification is already completed in the identification process, and specific identified categories can be referred to in fig. 2.
Step four, the generated certificate data is matched with the certificate template through the identified classification information, and the matched certificate template is shown in figure 3.
And step five, identifying the bill after generating the voucher as recognized, and marking the result as shown in figure 4.
And step six, extracting key information from the invoice, performing one-to-one correspondence between the key information and the unique identification code of the bill, and storing the key information and the unique identification code of the bill in a voucher template corresponding to the bill type, as shown in fig. 5.
The invention sorts and stores the electronic data acquired through intelligent recognition, establishes an index relationship, generates accurate and effective voucher information according to classification and matching voucher templates, and associates the voucher information with the original bill, thereby reducing the workload of accountants, improving the working efficiency and freeing the accountants from complicated invoice oceans.
The accountant can generate a voucher template in the process of identifying the bill, directly call the voucher template aiming at the bill type in the later work, continuously learn to generate a new voucher template in the process of identifying the bill and expand the voucher template in the system template library.
Finally, it should be noted that: the above-mentioned embodiments are only used for illustrating the technical solution of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.
Claims (8)
1. A method for automatically generating a voucher based on intelligent identification of a bill is characterized in that: which comprises the following steps:
s1, in the voucher template system, presetting a voucher template to form a voucher template database, wherein the source of the voucher template mainly comprises system generation, manual generation and temporary template conversion, the method for generating the voucher template by the system is to create thousands of voucher templates according to bill types, commodity attributes, abstracts and coins and introduce the voucher templates into the voucher template system, the method for manually generating the voucher template is to supplement and add special voucher templates according to own business rules during the use process and store the special voucher templates in the voucher template system, and the method for generating the voucher template by the temporary template conversion is to convert the temporary template during the use process into the voucher template and store the voucher template in the voucher template system;
s2, the bills are uploaded to be intelligently identified, key information is extracted, the bills are classified according to the extracted key information, the key information is related to the bills and stored in a voucher template system to form a key information database, and the specific method for associating the key information with the bills comprises the following steps: uploading an invoice, generating a unique bill identification code in the uploading process, intelligently identifying the invoice, extracting key information from the invoice, enabling the key information and the unique bill identification code to be in one-to-one correspondence, and directly nesting and storing the key information and the unique bill identification code in a voucher template, wherein the key information comprises bill types, a payee, a payer, invoice drawing details and money amount;
s3, matching the template to generate a certificate, and specifically comprising the following steps:
matching the electronic data of the bill with the voucher template, and generating a final voucher through the voucher template: extracting key information according to point data of a bill, wherein the point data of the bill is obtained by scanning a two-dimensional code of the bill, the key information extracted according to the point data of the bill comprises a bill type and company information, then determining the type of the bill and the data environment of the bill according to the key information, retrieving and comparing the key information database generated in the step S2 according to the type of the bill and the data environment of the bill, and selecting a corresponding voucher template from the voucher template database in the step S1 as a final voucher according to a comparison result;
the retrieval comparison specifically comprises the following steps:
s31, matching similarity of characters: performing character matching on the extracted key information, enabling the key information to form a new character string, comparing the new character string with existing key information in a key information database of the certificate template system to confirm the similarity of the new character string and selecting the certificate template corresponding to the key information with the highest similarity stored in the system as a standby template;
s32, comparing the similarity obtained in step S31 with a first similarity threshold and a second similarity threshold, if the similarity is higher than the first similarity threshold, selecting the standby template as the final certificate,
if the similarity is lower than the first similarity threshold and higher than or equal to the second similarity threshold, generating a temporary template and a temporary voucher from the electronic data, classifying the temporary template and the temporary voucher as a first type, and proceeding to step S4;
if the similarity is lower than the second similarity threshold, generating a temporary template and a temporary certificate for the electronic data, but marking the temporary template and the temporary certificate, classifying the temporary template and the temporary certificate into a second type of temporary template and temporary certificate, and entering step S4;
s4, manual processing and manual intervention: manually processing the temporary certificate, and if the certificate information is correct, adjusting the temporary certificate into a formal certificate; if the certificate information is not accurate, modifying the certificate to generate a formal certificate; after generating the formal voucher, generating a new voucher template according to the electronic information of the voucher and the subject of the voucher;
and S5, marking the ticket generating the voucher template as identified.
2. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: a first similarity threshold and a second similarity threshold are preset in the voucher template database.
3. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the character matching rule in step S31 is to convert each chinese character into a new character string according to pinyin, four-corner code, font and stroke number.
4. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the specific method for generating the voucher template by the system in the step S1 is as follows: the system generates a voucher template according to the bill type, the commodity attribute, the abstract and the currency, extracts the information of the voucher template to be matched with the information of the voucher template in the system after generating a formal voucher template, ignores the voucher template generated by the information if the matching is successful, and generates a new voucher template for storage if the matching is unsuccessful.
5. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the concrete method for generating the voucher template by the temporary voucher template in the step S1 is as follows: the temporary certificate template is derived from the temporary certificate, the temporary certificate is manually adjusted, a formal certificate is generated after the adjustment is finished, whether the temporary certificate template is deleted or not is inquired to a user, if yes, the temporary template is deleted, and if not, the temporary template is converted into a new certificate template to be stored according to the generated formal certificate.
6. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the information of the voucher template comprises subject information, summary information and ticket information.
7. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: the specific method for extracting the key information according to the point data of the bill in the step S3 is to classify the bill by intelligently identifying the obtained information, and assemble the key information corresponding to the classification according to the classification information defined by the system.
8. The method for automatically generating the voucher based on intelligent recognition of a ticket according to claim 1, wherein: in step S32, the first similarity threshold is 85% to 95%, and the second similarity threshold is 65% to 75%.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810483413.8A CN108960223B (en) | 2018-05-18 | 2018-05-18 | Method for automatically generating voucher based on intelligent bill identification |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810483413.8A CN108960223B (en) | 2018-05-18 | 2018-05-18 | Method for automatically generating voucher based on intelligent bill identification |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108960223A CN108960223A (en) | 2018-12-07 |
CN108960223B true CN108960223B (en) | 2020-10-30 |
Family
ID=64499292
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810483413.8A Active CN108960223B (en) | 2018-05-18 | 2018-05-18 | Method for automatically generating voucher based on intelligent bill identification |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108960223B (en) |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109636557A (en) * | 2018-12-11 | 2019-04-16 | 厦门商集网络科技有限责任公司 | A kind of intelligent classification bookkeeping methods and equipment based on bank slip recognition |
CN109858016A (en) * | 2018-12-20 | 2019-06-07 | 航天信息股份有限公司 | A kind of business credential match method |
CN109783790A (en) * | 2019-01-23 | 2019-05-21 | 国网山东省电力公司济宁供电公司 | One kind is secondary to pacify ticket generation method and the system of arranging |
CN110245656B (en) * | 2019-05-10 | 2021-02-02 | 上海果藤互联网金融信息服务有限公司 | Bill operation management method and system |
CN110210470B (en) * | 2019-06-05 | 2023-06-23 | 复旦大学 | Commodity information image recognition system |
CN110399463A (en) * | 2019-07-29 | 2019-11-01 | 国网河北省电力有限公司 | The Similarity Match Method and device of work ticket |
CN110399851B (en) * | 2019-07-30 | 2022-02-15 | 广东工业大学 | Image processing device, method, equipment and readable storage medium |
CN110516664A (en) * | 2019-08-16 | 2019-11-29 | 咪咕数字传媒有限公司 | Bill identification method and device, electronic equipment and storage medium |
CN110765749A (en) * | 2019-09-12 | 2020-02-07 | 深圳微企宝计算机系统有限公司 | Method for intelligently generating certificate |
CN110851677A (en) * | 2019-11-18 | 2020-02-28 | 深圳春沐源控股有限公司 | Reimbursement certificate processing method, device, terminal and computer readable storage medium |
CN111178238B (en) * | 2019-12-25 | 2024-09-20 | 未鲲(上海)科技服务有限公司 | Certificate testing method, device, equipment and computer readable storage medium |
CN111210328A (en) * | 2019-12-31 | 2020-05-29 | 航天信息股份有限公司 | Voucher generation method and device, storage medium and electronic equipment |
CN111275037B (en) * | 2020-01-09 | 2021-06-08 | 上海知达教育科技有限公司 | Bill identification method and device |
CN111401002A (en) * | 2020-03-11 | 2020-07-10 | 山东浪潮通软信息科技有限公司 | Method, device and computer storage medium for automatically identifying PDF electronic receipt information |
CN111666885A (en) * | 2020-06-08 | 2020-09-15 | 成都知识视觉科技有限公司 | Template construction and matching method for medical document structured knowledge extraction |
CN111680983A (en) * | 2020-06-15 | 2020-09-18 | 山东理工职业学院 | Automatic accounting document generating device for database |
CN111783703B (en) * | 2020-07-07 | 2024-01-26 | 常州市第三人民医院 | Intelligent recognition and electronic certificate automatic generation system and method for paper bill |
JP2022127766A (en) * | 2021-02-22 | 2022-09-01 | 京セラドキュメントソリューションズ株式会社 | Information generating system, workflow system, information generating program, and workflow program |
CN115311651B (en) * | 2022-10-12 | 2023-08-08 | 泰安协同软件有限公司 | Real estate certificate data acquisition and arrangement method |
CN116561602B (en) * | 2023-07-10 | 2023-09-19 | 三峡高科信息技术有限责任公司 | Automatic sales material matching method for sales cost transfer |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014052572A1 (en) * | 2012-09-28 | 2014-04-03 | Order Inn, Inc. | Method and system for offering combinations of goods and services for purchase and controlling expenses |
CN103761599A (en) * | 2013-12-23 | 2014-04-30 | 远光软件股份有限公司 | Method and device for generating compensating vouchers of internal transaction businesses |
CN105095842A (en) * | 2014-05-22 | 2015-11-25 | 阿里巴巴集团控股有限公司 | Method and device for identifying information of bill |
CN107423731A (en) * | 2017-04-06 | 2017-12-01 | 云南小鹰科技有限公司 | The data processing method and system of aviation document |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020143621A1 (en) * | 2001-03-30 | 2002-10-03 | Donnelly Dennis P. | System and method for transferring credits as an incentive for prompt payment |
-
2018
- 2018-05-18 CN CN201810483413.8A patent/CN108960223B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014052572A1 (en) * | 2012-09-28 | 2014-04-03 | Order Inn, Inc. | Method and system for offering combinations of goods and services for purchase and controlling expenses |
CN103761599A (en) * | 2013-12-23 | 2014-04-30 | 远光软件股份有限公司 | Method and device for generating compensating vouchers of internal transaction businesses |
CN105095842A (en) * | 2014-05-22 | 2015-11-25 | 阿里巴巴集团控股有限公司 | Method and device for identifying information of bill |
CN107423731A (en) * | 2017-04-06 | 2017-12-01 | 云南小鹰科技有限公司 | The data processing method and system of aviation document |
Non-Patent Citations (2)
Title |
---|
The Research of Output and Authentication Methods of Digital Bills Based on the Fusion of Identity Information;Wang Hong等;《IEEE》;20141208;第1084-1088页 * |
具有QoS约束的语义Web服务发现的研究;李春梅等;《计算机科学》;20070630;第34卷(第6期);第116-121页 * |
Also Published As
Publication number | Publication date |
---|---|
CN108960223A (en) | 2018-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108960223B (en) | Method for automatically generating voucher based on intelligent bill identification | |
CN107067044B (en) | Financial reimbursement complete ticket intelligent auditing system | |
US9639751B2 (en) | Property record document data verification systems and methods | |
US8897563B1 (en) | Systems and methods for automatically processing electronic documents | |
Embley et al. | Table-processing paradigms: a research survey | |
CN101292259B (en) | Method and system for image matching in a mixed media environment | |
CN106485243B (en) | A kind of bank slip recognition error correction method and device | |
US8064703B2 (en) | Property record document data validation systems and methods | |
US20050289182A1 (en) | Document management system with enhanced intelligent document recognition capabilities | |
US8108764B2 (en) | Document recognition using static and variable strings to create a document signature | |
US20070065011A1 (en) | Method and system for collecting data from a plurality of machine readable documents | |
US11436852B2 (en) | Document information extraction for computer manipulation | |
CN110334640A (en) | A kind of ticket processing method and system | |
JP2004334339A (en) | Information processor, information processing method, and storage medium, and program | |
CN115828874A (en) | Industry table digital processing method based on image recognition technology | |
CN116092231A (en) | Ticket identification method, ticket identification device, terminal equipment and storage medium | |
CN111860450A (en) | Ticket recognition device and ticket information management system | |
CN112668335B (en) | Method for identifying and extracting business license structured information by using named entity | |
US20070217691A1 (en) | Property record document title determination systems and methods | |
CN111241955B (en) | Bill information extraction method and system | |
Scius-Bertrand et al. | Annotation-free keyword spotting in historical Vietnamese manuscripts using graph matching | |
US20240143632A1 (en) | Extracting information from documents using automatic markup based on historical data | |
Xu et al. | A mobile financial reimbursement information collection system based on OCR | |
JP2005208872A (en) | Image processing system | |
Li et al. | Steel Delivery Order Recognition Based on Deep Learning and Posterior Error Correction Technology |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP03 | Change of name, title or address |
Address after: 501-018, floor 5, No. 15, wanquanzhuang Road, Haidian District, Beijing 100089 Patentee after: Dajingfang Network Technology Co.,Ltd. Address before: 100000 405, No. 15, wanquanzhuang Road, Haidian District, Beijing Patentee before: BEIJING DAZHANGFANG NETWORK TECHNOLOGY Co.,Ltd. |
|
CP03 | Change of name, title or address |