CN109219809A - The method and system for automatically generating data reporting based on electronic document - Google Patents

The method and system for automatically generating data reporting based on electronic document Download PDF

Info

Publication number
CN109219809A
CN109219809A CN201780027071.2A CN201780027071A CN109219809A CN 109219809 A CN109219809 A CN 109219809A CN 201780027071 A CN201780027071 A CN 201780027071A CN 109219809 A CN109219809 A CN 109219809A
Authority
CN
China
Prior art keywords
data
transaction
electronic document
template
reporting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201780027071.2A
Other languages
Chinese (zh)
Inventor
N·古兹曼
I·萨夫特
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vatbox Ltd
Original Assignee
Vatbox Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US15/361,934 external-priority patent/US20170154385A1/en
Application filed by Vatbox Ltd filed Critical Vatbox Ltd
Publication of CN109219809A publication Critical patent/CN109219809A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/12Accounting
    • G06Q40/123Tax preparation or submission
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/04Billing or invoicing

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Engineering & Computer Science (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Technology Law (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A kind of system and method that data reporting is automatically generated based on electronic document.This method includes at least one parameter transaction for analyzing electronic document to determine transaction, and wherein electronic document includes at least partly unstructured data;Template of the creation for transaction, wherein the template is the structured data sets for including at least one identified parameter transaction;Multiple reporting requirements are obtained based on the template created;And based on the template and multiple reporting requirements obtained created, generate qualified data reporting.

Description

The method and system for automatically generating data reporting based on electronic document
Cross reference to related applications
This application claims the equity for the U.S. Provisional Application 62/307,497 that on March 13rd, 2016 submits.This application is also The part continuation application for the U.S. Patent application 15/361,934 applied that on November 28th, 2016 submits.It is above-mentioned multiple The content of application is hereby incorporated by reference.
Technical field
The disclosure relates generally to data analyses, in particular to the report generation based on the analysis of non-structural data.
Background technique
With commercially gradually relying on the technology for runing in terms of related data management, reasonably maintenance and data reporting Suitable system have become successful key factor.For large enterprise, the business data amount used daily can It can be huge.Thus, manually examine and report that such data are unpractical.Other than normal sales data, answer More data can be also collected and use with the enterprise of the country /region of value-added tax, it is therefore desirable to additional report.
Value-added tax (VAT) is a broad-based consumption tax, which carried out according to commodity and the value added of service Assessment.Specific VAT is suitable for purchase or the most of commodity sold and service in particular community.When a people is in external trip When row does shopping and needs to pay VAT, the subsequent expenses of taxation that the people may have the right to obtain the secondary purchase are returned.Under specific circumstances, together Sample can return other expenses of taxation for being suitable for purchase.In addition, the seller can sell under certain places and specific condition for purchase Product discount offered.The following procedure that this refund of purchase price can be established by refund entity is returned (reclaim)。
The laws and regulations of many countries assign overseas tourist's compensation or return the right of a certain tax category, are e.g. overseas Commodity and/or service and the VAT paid.Since such laws and regulations are different because of country variant, it is thus determined that the reality obtained of having the right VAT refund in border usually requires that the claimer of refund possesses the knowledge in a large amount of overseas tax law fields.In addition, traveller is having no right to obtain VAT may be also returned in continuation application in the case where refund, to spend time and efforts and make a futile effort.Furthermore, if can It may be also because that the type of purchase and qualification VAT bill whether there is is different to obtain VAT refund.
The program claimed for tax refund is to fill up a form in person close to the customs officer on airport, and submit during the visit The corresponding original receipt of the expense of generation.This program should register or boarding to next destination before execute.In addition, Especially for the commodity in foreign procurement, it is desirable that the program of refund may require requestee and be not used to customs officer's displaying Commodity, to verify whether export goods is consistent with the cargo of the paid VAT of requestee.
Since traveller is unfamiliar with the specific laws and rules of requirement refund, traveller is although in the case where being not eligible for Refund may also be submitted to apply.If traveller finally learns that he or she haves no right to be refunded, which can also be unnecessarily It wastes time.Therefore it provides a kind of solution for overcoming prior art defect is advantageous, the program is effective by providing Mode to handle VAT refund electronicly, is preferably handled by internet.
Seek to refund, the client institute facing challenges for especially seeking VAT refund may make its discouraged and can not be with Into to be refunded.When client is the employee of enterprise, this problem is more complicated, because client is not directly from refund It is benefited.In addition, employee can submit incoherent or duplicate information, these information are unnecessary for seeking refund.It crosses Filter these unnecessary information may be it is time-consuming, it is with high costs and influenced by significantly mistake.
In addition, the employee of enterprise must retain the transaction record of payment VAT in foreign countries' shopping, it is both used for accounting purpose, It is also used for seeking to return.Manual report based on such record is labor-intensive, and there are mistakes.In addition, existing Have and manual input data are usually required based on this solution reported automatically that records, this equally exists mistake.
Therefore it provides a kind of solution for overcoming prior art defect will be beneficial.
Summary of the invention
It is the general introduction of several example embodiments of the disclosure below.These embodiments are provided substantially in order to facilitate reader Understand, and this general introduction provided not fully limits the scope of the invention.The not all contemplated embodiments of the general introduction it is extensive general It states, and is neither intended to the key or important element for identifying all embodiments, be not intended to the model in terms of describing any or all It encloses.Its sole purpose is that some concepts of one or more embodiments are presented in simplified form, more detailed as what is presented later The preamble of description.For convenience, term " some embodiments " can be used herein to refer to single embodiment of the invention or more A embodiment.
Certain embodiments of the disclosure include a kind of method for generating data reporting based on electronic document.The method packet Include: analysis electronic document is to determine at least one parameter transaction in transaction, wherein the electronic document includes at least partly Non-structured data;Template of the creation for transaction, wherein the template is institutional data set, which includes institute At least one determining parameter transaction;Based on the template created, multiple reporting requirements are obtained;And based on the template that is created and Acquired reporting requirement generates qualified data reporting.
Certain embodiments of the disclosure further include non-transitory computer-readable medium, and being stored thereon with, which can be used for making, locates The instruction that circuit executes step is managed, which includes: to analyze electronic document to determine at least one parameter transaction in transaction, Described in electronic document include at least partly non-structured data;Template of the creation for transaction, wherein the template is Institutional data set, the data set include at least one identified parameter transaction;Based on the template created, obtain multiple Reporting requirement;And based on the template and acquired reporting requirement created, qualified data reporting is generated.
Certain embodiments of the disclosure further include a kind of system for generating data reporting based on electronic document.The system packet It includes: processing circuit;Memory, the memory include that processing circuit configures the instruction of the system when executing: analysis electronic document with At least one parameter transaction in transaction is determined, wherein the electronic document includes at least partly non-structured data;Wound The template for transaction is built, wherein the template is institutional data set, which includes at least one identified friendship Easy parameter;Based on the template created, multiple reporting requirements are obtained;And it is wanted based on the template and acquired report created It asks, generates qualified data reporting.
Detailed description of the invention
It is particularly pointed out in the claim summarized by specification and is distinctly claimed presently disclosed subject matter.Pass through Below in conjunction with the detailed description of attached drawing, the foregoing and other objects, features and advantages of the disclosed embodiments be will be evident.
Fig. 1 is the network to describe multiple open embodiments.
Fig. 2 is the schematic diagram according to the data integrity manager of one embodiment.
Fig. 3 is to illustrate a kind of flow chart for automatically generating data reporting method according to one embodiment.
Fig. 4 is to illustrate a kind of flow chart that data set is generated based at least one electronic document according to one embodiment.
Specific embodiment
Want it is important to note that, embodiment disclosed herein is only the example of many advantageous uses of this paper innovative teachings.Generally For, the made statement in the description of the present application unnecessarily limits any multiple embodiments to be protected.In addition, some Statement is likely to be suited for certain inventive features and is not suitable for other features.In general, unless otherwise stated, singular elements can To be plural number, otherwise without loss of generality.In the accompanying drawings, in several views, identical label indicates identical part.
Various disclosed embodiments include the method and system that data reporting is automatically generated based on electronic document.In a reality It applies in example, data set is created based on electronic document.At least one electronic document is at least partly non-structured.Creation transaction The template of attribute.The requirement of report is obtained based on template.Based on template and reporting requirement, friendship included in electronic document is determined It is easily whether qualified, if it is, generating qualified data reporting.Qualified data reporting may include for example complete value-added tax (VAT) table is returned, at least one transaction e proves document, or both.The return on qualification data of generation can be sent to Such as report authorization server.
Fig. 1 shows an exemplary network diagram 100 to describe various disclosed embodiments.In illustrative network In schematic diagram 100, Report Builder 120, business system 130, database 140, multiple network source 150-1 to 150-N are (below only Individually and with being referred to as it is described as network source 150, merely for succinct intention) it is communicatedly connected by network 110.Network 110 can be but not limited to wireless network, cellular network or cable network, local area network (LAN), wide area network (WAN), Metropolitan Area Network (MAN) (MAN), similar network such as internet, WWW (WWW) and combinations thereof.
Business system 130 is associated with enterprise, can store relevant data of buying to enterprise or enterprise's representative generation and The relevant data of enterprise itself.The enterprise can be but not limited to, and employee may purchase in overseas needs to pay VAT's The enterprise of commodity or service.Business system 130 can be but not limited to server, database, enterprise resource planning, client The system of relationship management system or other storage related datas.
The data being stored in business system 130 can include but is not limited to electronic document (for example, display such as invoice scans The image file of part, text document, electronic form document etc.).It include that data in each electronic document can be structuring , it is semi-structured, it is non-structured or combinations thereof.Structuring or partly-structured data may be with cannot be by Report Builders The format storage of 120 identifications, thus can only be handled in a manner of unstructured data.
Database 140 at least stores the data of confirmation transaction.Such data may include but be not limited to comprising transaction phase The electronic identification document of pass.The electronic identification document may include but be not limited to invoice, receipt and other similar bill.
Network source 150 at least stores the requirement (report and application of e.g. VAT refund) of data report.The requirement It can be stored such as the mode of rule.Network source 150 can also store the data for being used as and generating report, and the report is not limited to could fill out Report form (e.g., could fill out VAT return application form).Different network sources 150 can store different reporting requirement and table Lattice (e.g., for the reporting requirement of country variant and table).As a non-limitative example, network source 140-1 can store needle Regulatory requirements are reported to the VAT of France.And in another non-limitative example, network source 140-8 can store VAT and return table, The table is used to return report for the VAT of Italy.
In one embodiment, Report Builder 120 is configurable to generate the template based on parameter transaction, the parameter transaction It is identified in an electronic document by machine vision.In one further embodiment, Report Builder 120 is configured as The template and at least one reporting requirement are compared, to determine whether qualified (e.g., the needle of transaction representated by the data in the template VAT is returned).In a further embodiment, Report Builder 120 is configured as when confirming the transaction qualification, is based on mould Plate generates qualified data reporting.It is (e.g., including complete that the data reporting of qualification generated may include but be not limited to electronic document VAT return table electronic document), transaction confirmation data (e.g., including receipt relevant with the transaction, invoice electronics text Both shelves) or above-mentioned.Qualified data reporting can transmit to such as report organ (for example, suitable tax authority).
In one embodiment, Report Builder 120 is configured as based on (such as non-including at least partly unstructured data Structural data, semi-structured data or the structural data with unknown structure) electronic document generate data set.For This, Report Builder 120 can also be configured to determine electricity using optical character identification (OCR) or other image processing Data in subdocument.
In one embodiment, Report Builder 120 is configured as analyzing created data set to determine and transaction phase The parameter transaction of pass is indicated in an electronic document.In another embodiment, Report Builder 120 can be configured to be based on Whether data set meets at least one predetermined constraint, to determine whether created data set is suitable for returning.
In one embodiment, Report Builder 120 is configured as based on the data set created come drawing template establishment.The mould Plate be include identified parameter transaction structuring data set.The template created is used as potential report template.
It should be noted that for purposes of simplicity without limiting disclosed embodiment, the above-mentioned embodiment about Fig. 1 In only relate to a business system 130.Without departing from the scope of the disclosure, can comparably be using multiple enterprises System.
Fig. 2 is the example schematic diagram according to the Report Builder 120 of one embodiment.Report Builder 120 includes connection In the processing circuit 410 of memory 215, memory 220, optical character identification (OCR) processor 230 and socket 240. In one embodiment, the component of Report Builder 120 can communicatedly be connected by bus 250.
Processing circuit 210 can be realized by one or more hardware logic components and circuit.Such as, but not limited to, it can use Hardware logic component type to illustrate includes, programmable gate array (FPGA), specific integrated circuit (ASIC), application specific standard produce Product (ASSP), system on a chip (SOC), general purpose microprocessor, microcontroller, digital signal processor (DSP) or other classes As device or other executable calculate or hardware logic components of processing information.
Memory 215 can be (for example, the RAM etc.) of volatibility, non-volatile (for example, ROM, flash memory etc.) or its In conjunction with.In one configuration, the computer-readable instruction for executing one or more embodiment of the present disclosure is storable in memory 220 In.
In another embodiment, memory 215 is configured as storage software.The software should be broadly interpreted as any The instruction of type, either software, firmware, middleware, microcode, hardware description language etc..Instruction may include code (e.g., source Code format, binary code form, executable code format or any other suitable code format).Work as one or more When processing circuit 210 executes described instruction, processing circuit 210 executes various steps described herein.Particularly, described instruction quilt When execution, processing circuit 210 executes the step of automatically generating data reporting based on electronic document, as described in this in this way.
Memory 220 can be magnetic storage, optical memory etc., and may, for example, be by flash memory or other storages The mode of device technology, CD-ROM, digital versatile disc (DVD) or any other medium is realized, can be used for storing information needed.
OCR processor 230 may include but be not limited to, feature and/or figure recognizing unit (RU) 235, and figure identification is single Member is configured as the figure of identification unstructured data collection form, feature or both.Particularly, in one embodiment, Optical character identification (OCR) processor 230 is configured as at least identifying character in unstructured data.The character identified can As creation validation data set, which includes data required for verifying is traded.
Socket 240 allow Report Builder 120 and business system 130, database 140, network source 150 or they In conjunction with being communicated, such as to retrieve data, storing data etc..
It should be appreciated that embodiment described herein is not limited to specific structure shown in Fig. 2, and do not departing from Other structures can be comparably used in the case where the range of disclosed embodiment.
Fig. 3 shows a kind of the exemplary of the method for generation data reporting based on electronic document according to one embodiment Flow chart 300.In one embodiment, the method can be performed by Report Builder 120.
Step S310 creates the data set based on the electronic document for including transaction related information.The electronic document can wrap It includes but is not limited to unstructured data, semi-structured data, do not expect or unauthorized structural data or above-mentioned knot It closes.In one embodiment, step S310 further includes analyzing electronic document using optical character identification (OCR) to determine electricity The data of subdocument identify the critical field in data, identify value or above-mentioned combination in data.Below according to Fig. 4 to base It is further described in electronic document creation data set.
Step S320 analyzes created data set.In one embodiment, analyzing the data set may include but unlimited In determining that parameter transaction such as, but not limited to determines at least one group's identifier (e.g., consumer's enterprise identifier, businessman Enterprise identifier, or both), transaction relevant information (e.g., date, time, price, the type for selling commodity or service Deng), or both.In another embodiment, analyzing the data set may also include collection identification friendship based on the data Easily.
Optional step S330, is based on the analysis, the step determine created data set if appropriate for for reporting, If so, then continuing to execute step S340, otherwise, program interrupt.In one embodiment, step S330 may include that determination is created Data set whether meet at least one pre-determined constraint.For example, if data set to meet at least one pre-determined about Beam, then the data set is suitable for report.Constraint determined by advance may include but be not limited to, to information type in verification process Requirement, accuracy requirement, or both combination.For example, if in electronic document not including enterprise of manufacturer in transaction If the price of country or transaction, cannot successfully it report.During determining whether transaction is suitable for report, by making The use of computing resource can be reduced with the report for only meeting minimum requirements.
In another embodiment, step S330, which may also include, determines at least one constraint based on the data set created. In a further embodiment, determine that at least one constraint may include searching at least one data based on the data set created Library (e.g., uses the position of the enterprise of manufacturer pointed out in created data set).In a further embodiment, step S330 is also May include the reporting requirement (e.g., a VAT return table) for analyzing at least one electronic document, with determine it is described at least one about Beam.Described analyze executes OCR or other image processing on the electronic document that may additionally include each request for a report.For example, base There is field " price " in one, the VAT of " commodities purchased " and " position " returns the analysis of table, at least one described constraint can It requires to include, price, the data set of at least one commodity or service and position is eligible.
In another embodiment, when determining that the data are not suitable for report, additional data, replacement data or above-mentioned two Person can retrieve from least one data source and be contained in created data set.As a non-limitative example, if A buying is implemented in some country, needs the title of enterprise of manufacturer to carry out VAT return in state buying, but on invoice It include the title of the enterprise of manufacturer, then the title of the manufacturer can be based on the other information on invoice from the database of government Middle extraction.In a further embodiment, when executing retrieval alternative information, step S340 will be continued to execute.In another reality It applies in example, when executing retrieval alternative information, need to determine whether the data set created with alternative information is suitable, if so, Then continue to execute step S340, otherwise, program interrupt.
In step S340, the template based on analyzed data set is created.The template can be but not limited to, including multiple words The data structure of section.The field may include identified parameter transaction.The field can be with predefined.
By the attribute of the structuring of the template created, drawing template establishment allows quickly to handle from electronic document.Example Such as, it compared to the data set of not structuring, is lined up and processing operation can more efficiently carry out in the data set of structuring. It further, is the data set of structuring by information tissue from electronic document, storage is deposited comprising information in an electronic document The quantity of reservoir can greatly reduce.The usually electronic document of image is needed comprising identical information than data set Want more memory spaces.For example, the data set of the electronic document of 1000,000 image of characterization can be in the form of data record It is stored in text document.The size of text document in this way is by the size of far smaller than 1000,000 pictures.
Step S350 obtains multiple reporting requirements based on the template.In one embodiment, step S350 may include to A data source is selected less, and the reporting requirement can be obtained from the data source.In a further embodiment, which can base In the template.As a non-limitative example, the VAT based on the buying in Europe is returned, and the seller in transaction must be On the white list of European organization.Thus, it selectes and inquires the network source for being stored with white list.In a further embodiment, institute Stating at least one reporting requirement may include one or more rule, and the rule is for determining potential Report Parameters.As one Non-limitative example, at least one described reporting requirement may include one based on one or more parameter transaction returned to calculate VAT The rule of quantity also.
In another embodiment, step S350 includes that at least one reporting requirement is retrieved from least one data source (for example, the related VAT that regulatory agency establishes returns desired database).In a further embodiment, it is described at least one Reporting requirement can be retrieved based at least one portion of the template.Each potential reporting requirement parameter can be with It is the parameter of request or other modes report.As a non-limitative example, if " position " this word in the template The position indicated in section is France, then the requirement of report can be obtained from the server of the French tax authority.
In still another embodiment, step S350 may include being retrieved from least one report electronic document, institute It states report electronic document to be such as, but not limited to, the electronic document of request table is returned comprising VAT.In a further embodiment, Step S350 includes at least one the report electronic document retrieved by machine image analysing computer, to determine that at least one report is wanted It asks.
Step S360 determines that transaction pointed in template is based on acquired reporting requirement and the template created No qualification continues to execute step S370 if qualified;Otherwise, program interrupt.In one embodiment, step S360 includes using The data in data and reporting requirement in the template compare.If each reporting requirement all meets, which is to close Lattice.As a non-limitative example, the transaction of a reporting requirement based on the German tax authority, if including in the template Position " Germany ", and the commodity purchased belong to the VAT that meets identified in advance and return in defined product list, and should The country origin for purchasing this is not " Germany ", then this transaction is qualified.
Step S370 then generates qualified data reporting when transaction is confirmed as qualification.In one embodiment, it walks Rapid S370 includes the electronic report document for generating the data comprising meeting acquired reporting requirement.In further embodiment In, step S370 may also comprise the electronic document that retrieval is near completion.As a non-limitative example, step S370 can be wrapped It includes, the template returning table using the VAT for including a blank and being created, generates a complete VAT and return table.It is examined The electronic document (e.g., the VAT of the blank returns table) of rope can carry out structuring and handle to be inserted into detailed information.It can be based on The structure completes retrieved electronic document.
In another embodiment, step S370 may also include, and obtain at least one electronics confirmation text relevant to transaction Shelves.In a further embodiment, step S370 further includes, using the data of the template, inquiring at least one and being stored with friendship The data source of easy relevant information.As a non-limitative example, the electronic document including purchase receipts can be from the factory of transaction In the server of quotient, inquired by using the transaction identifiers in the template.
Fig. 4 is according to a kind of exemplary process diagram based on electronic document creation data set shown in one embodiment S310。
Step S410 obtains electronic document.Obtaining the electronic document may include but be not limited to, from consumer's business system Middle reception electronic document (as received scan image) or the retrieval electronic document are (such as from consumer business system, enterprise of manufacturer Electronic document is retrieved in system or database).
Step S420 analyzes the electronic document.The analysis may include but be not limited to, and use optical character identification (OCR) character in the electronic document is determined.
Step S430 is based on the analysis, identifies critical field and value in electronic document.The critical field may include But it is not limited to title and the address, date, currency, the commodity of sale or service, transaction identifiers, invoice number etc. of manufacturer.Electricity It may include some non-essential details in subdocument, these details will be not as key value.For example, the mark of manufacturer may not It is necessary, then not being just a key value.It in one embodiment, can be with the list of predefined critical field, with institute The critical field data segment that matches is stated to be extracted.Then, cleanup step is executed to guarantee that information is accurately presented.For example, if The data that OCR is identified are " 1211212005 ", then this data is converted to 12/12/2005 by cleanup step.In another example In son, if title is identified as " Mo $ den ", which will be converted into " Mosden ".Cleanup step will use external information Source, such as dictionary, calendar.
In a further embodiment, check whether extracted data segment is complete.For example, if the title of manufacturer can be known Not but address lacks, then the critical field of the address of the manufacturer is exactly incomplete.Complete this of supplement will be attempted at this time to lack The primary key value of mistake.The trial may include inquiring external system and database, associated with the invoice information of previous analysis, Or both combination.The example of external system and database includes CompanyAddress, Universal Product Code (UPC), and package is thrown Pass with tracking system, etc..In one embodiment, step S340 obtain predefined critical field and its analog value it is complete Whole group is closed.
Step S440 generates the data set of structuring.Data set generated includes identified critical field and value.
Various embodiments disclosed herein can be implemented as hardware, firmware, software or any combination thereof.In addition, software is excellent Selection of land is embodied as the computer for being tangibly embodied in program storage unit (PSU) or being combined by part or certain equipment and/or equipment group Application program on readable medium.Application program can upload to the machine including any suitable architecture and be executed by it.It is preferred that Ground, the machine have such as one or more central processing unit (" CPU "), the hardware of memory and input/output interface Computer platform on realize.Computer platform can also include operating system and micro-instruction code.Various mistakes described herein Journey and function can be a part or any combination of them of a part of micro-instruction code perhaps application program, can To be executed by CPU, regardless of whether explicitly showing such computer or processor.In addition, various other peripheral cells can To be connected to computer platform, such as additional-data storage unit and print unit.In addition, non-transitory computer-readable medium It is any computer-readable medium other than temporary transmitting signal.
It should be appreciated that the titles such as " first " used herein, " second " do not limit any reference of element generally The quantity or sequence of these elements.On the contrary, these titles are typically used as distinguishing two or more members of element herein The facilitated method of element or example.Therefore, being not meant to the reference of the first and second elements there only can be using two Element or first element must be in some way before second elements.Moreover, a unless otherwise stated, set of pieces Including one or more elements.
As it is used herein, phrase "at least one" followed by bulleted list mean to can be used alone any list Project, or can use two or more any combination in listed item.For example, if system is described as including " at least one of A, B and C ", then system may include only A;Only B;Only C;A and B combination;B and C in combination;A and C in combination;Or A, B and C in combination use.
Herein cited all examples and conditional statement are intended for teaching purpose to help reader to understand disclosed implementation The further field of principle and inventor of example and the concept that provides, and should be to be construed as being without limitation of that these specifically quote shows Example and condition.In addition, describing the principle of disclosed embodiment, all statements and its specific example of aspect and embodiment here It is intended to comprising its structure and function equivalent.In addition, these equivalents are intended to include currently known equivalent and open in the future The equivalent of hair, that is, exploitation execution identical function any element, but regardless of structure how.

Claims (21)

1. a kind of method for automatically generating data reporting based on electronic document, comprising:
Electronic document is analyzed to determine at least one parameter transaction in transaction, wherein the electronic document includes at least partly Non-structured data;
Template of the creation for transaction, wherein the template is institutional data set, the data set include it is identified at least One parameter transaction;
Based on the template created, multiple reporting requirements are obtained;And
Based on the template and acquired reporting requirement created, qualified data reporting is generated.
2. according to the method described in claim 1, further include:
Created template and the multiple reporting requirement are compared, whether can be used to report qualifiedly with determining transaction, wherein institute State the generation when the transaction determination can be used to report qualifiedly of qualified data reporting.
3. according to the method described in claim 1, wherein generating qualified data reporting further include:
Based on the template created, electronic report document is generated, wherein the data reporting of the qualification includes electronics generated Report file.
4. according to the method described in claim 1, wherein generating qualified data reporting further include:
Retrieving at least one based on the template created can prove that the electronic identification document of the transaction, wherein the qualification Data reporting includes at least one the electronic identification document retrieved.
5. according to the method described in claim 1, wherein analyzing electronic document further include:
In the electronic document, at least one critical field and at least one value are verified;
Based on the electronic document, data set is created, wherein the data set created includes at least one critical field and at least One value;And
Analyze created data set, wherein at least one parameter transaction is determined based on the analysis.
6. according to the method described in claim 1, wherein verifying at least one critical field and at least one value further include:
The electronic document is analyzed, to determine data in the electronic document;And
Based on pre-determined critical field list, a part of the data of at least described determination is extracted, it is wherein at least described true A part of fixed data matches at least one critical field in pre-determined critical field list.
7. according to the method described in claim 6, wherein analyzing the electronic document further include:
Optical character identification is executed on the electronic document.
8. according to the method described in claim 6, further include:
Cleanup step is executed at least part of the data of the determination of the extraction.
9. according to the method described in claim 6, further include:
Check whether at least part of the data of the determination of the extraction is complete;And
To each incomplete data, executes at least one of following: inquiring at least one external source;By identified data with The data in electronic document analyzed before at least one are associated.
10. according to the method described in claim 1, wherein the transaction is at least financial transaction, wherein the data reporting is extremely It is few related to the data that value-added tax is returned.
It, should 11. a kind of non-transitory computer-readable medium is stored thereon with the instruction for making processing circuit execute a process Process includes:
Electronic document is analyzed to determine at least one parameter transaction in transaction, wherein the electronic document includes at least partly Non-structured data;
Template of the creation for transaction, wherein the template is institutional data set, the data set include it is identified at least One parameter transaction;
Based on the template created, multiple reporting requirements are obtained;And
Based on the template and acquired reporting requirement created, qualified data reporting is generated.
12. a kind of system for the transaction that confirmation electronic document is presented, comprising:
Processing circuit;And
Memory, the memory include instruction, and instruction circuit processed configures the system execution when executing:
Electronic document is analyzed to determine at least one parameter transaction in transaction, wherein the electronic document includes at least partly Non-structured data;
Template of the creation for transaction, wherein the template is institutional data set, the data set include it is identified at least One parameter transaction;
Based on the template created, multiple reporting requirements are obtained;And
Based on the template and acquired reporting requirement created, qualified data reporting is generated.
13. system according to claim 12, wherein the system is also configured to
Created template and the multiple reporting requirement are compared, whether can be used to report qualifiedly with determining transaction, wherein institute State the generation when the transaction determination can be used to report qualifiedly of qualified data reporting.
14. system according to claim 12, wherein the system is also configured to
Based on the template created, electronic report document is generated, wherein the data reporting of the qualification includes electronics generated Report file.
15. system according to claim 12, wherein the system is also configured to
Retrieving at least one based on the template created can prove that the electronic identification document of the transaction, wherein the qualification Data reporting includes at least one the electronic identification document retrieved.
16. system according to claim 12, wherein the system is also configured to
In the electronic document, at least one critical field and at least one value are verified;
Based on the electronic document, data set is created, wherein the data set created includes at least one critical field and at least One value;And
Analyze created data set, wherein at least one parameter transaction is determined based on the analysis.
17. system according to claim 12, wherein the system is also configured to
The electronic document is analyzed, to determine data in the electronic document;And
Based on pre-determined critical field list, a part of the data of at least described determination is extracted, it is wherein at least described true A part of fixed data matches at least one critical field in pre-determined critical field list.
18. system according to claim 17, wherein the system is also configured to
Optical character identification is executed on the electronic document.
19. system according to claim 17, wherein the system is also configured to
Cleanup step is executed at least part of the data of the determination of the extraction.
20. system according to claim 17, wherein the system is also configured to
Check whether at least part of the data of the determination of the extraction is complete;And
To each incomplete data, executes at least one of following: at least one external source is inquired, by the data of the determination It is associated with the data in the electronic document analyzed before at least one.
21. system according to claim 12, wherein the transaction is at least financial transaction, wherein the data reporting is extremely It is few related to the data that value-added tax is returned.
CN201780027071.2A 2016-03-13 2017-01-25 The method and system for automatically generating data reporting based on electronic document Pending CN109219809A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201662307497P 2016-03-13 2016-03-13
US62/307,497 2016-03-13
US15/361,934 US20170154385A1 (en) 2015-11-29 2016-11-28 System and method for automatic validation
US15/361,934 2016-11-28
PCT/US2017/014874 WO2017160403A1 (en) 2016-03-13 2017-01-25 System and method for automatically generating reporting data based on electronic documents

Publications (1)

Publication Number Publication Date
CN109219809A true CN109219809A (en) 2019-01-15

Family

ID=59850546

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201780027071.2A Pending CN109219809A (en) 2016-03-13 2017-01-25 The method and system for automatically generating data reporting based on electronic document

Country Status (3)

Country Link
EP (1) EP3430540A4 (en)
CN (1) CN109219809A (en)
WO (1) WO2017160403A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110516219A (en) * 2019-08-27 2019-11-29 上海美吉生物医药科技有限公司 A kind of method and system based on the production report of product collection
CN111107154A (en) * 2019-12-23 2020-05-05 南京医康科技有限公司 Data reporting method and device
CN111340038A (en) * 2020-05-20 2020-06-26 四川新网银行股份有限公司 Disposable image data acquisition method for MOCK test

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040267620A1 (en) * 2003-06-30 2004-12-30 Yuliya Feldman Method and system for assessing and reporting VAT charges for network-based marketplace services
CN1892642A (en) * 2005-07-06 2007-01-10 国际商业机器公司 Method and system for processing forms
US20070168382A1 (en) * 2006-01-03 2007-07-19 Michael Tillberg Document analysis system for integration of paper records into a searchable electronic database
CN102654874A (en) * 2011-03-02 2012-09-05 顾菊林 Bill data management method and system
CN103121324A (en) * 2013-02-06 2013-05-29 心医国际数字医疗系统(大连)有限公司 Medical image centralized printing system
US20140079294A1 (en) * 2009-02-10 2014-03-20 Kofax, Inc. Systems, methods and computer program products for determining document validity
CN105243117A (en) * 2015-09-28 2016-01-13 四川长虹电器股份有限公司 Data processing system and method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040267620A1 (en) * 2003-06-30 2004-12-30 Yuliya Feldman Method and system for assessing and reporting VAT charges for network-based marketplace services
CN1892642A (en) * 2005-07-06 2007-01-10 国际商业机器公司 Method and system for processing forms
US20070168382A1 (en) * 2006-01-03 2007-07-19 Michael Tillberg Document analysis system for integration of paper records into a searchable electronic database
US20140079294A1 (en) * 2009-02-10 2014-03-20 Kofax, Inc. Systems, methods and computer program products for determining document validity
CN102654874A (en) * 2011-03-02 2012-09-05 顾菊林 Bill data management method and system
CN103121324A (en) * 2013-02-06 2013-05-29 心医国际数字医疗系统(大连)有限公司 Medical image centralized printing system
CN105243117A (en) * 2015-09-28 2016-01-13 四川长虹电器股份有限公司 Data processing system and method

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110516219A (en) * 2019-08-27 2019-11-29 上海美吉生物医药科技有限公司 A kind of method and system based on the production report of product collection
CN111107154A (en) * 2019-12-23 2020-05-05 南京医康科技有限公司 Data reporting method and device
CN111107154B (en) * 2019-12-23 2022-12-09 医渡云(北京)技术有限公司 Data reporting method and device
CN111340038A (en) * 2020-05-20 2020-06-26 四川新网银行股份有限公司 Disposable image data acquisition method for MOCK test
CN111340038B (en) * 2020-05-20 2020-08-21 四川新网银行股份有限公司 Disposable image data acquisition method for MOCK test

Also Published As

Publication number Publication date
EP3430540A1 (en) 2019-01-23
EP3430540A4 (en) 2019-10-09
WO2017160403A1 (en) 2017-09-21

Similar Documents

Publication Publication Date Title
US10546351B2 (en) System and method for automatic generation of reports based on electronic documents
US11062132B2 (en) System and method for identification of missing data elements in electronic documents
US20170193608A1 (en) System and method for automatically generating reporting data based on electronic documents
US20170323006A1 (en) System and method for providing analytics in real-time based on unstructured electronic documents
US11138372B2 (en) System and method for reporting based on electronic documents
US20190236127A1 (en) Generating a modified evidencing electronic document including missing elements
US20180011846A1 (en) System and method for matching transaction electronic documents to evidencing electronic documents
CN109219809A (en) The method and system for automatically generating data reporting based on electronic document
CN109791537A (en) Electronic document is supplemented into complete system and method
US20170169518A1 (en) System and method for automatically tagging electronic documents
US20170323157A1 (en) System and method for determining an entity status based on unstructured electronic documents
US20180025225A1 (en) System and method for generating consolidated data for electronic documents
US20180046663A1 (en) System and method for completing electronic documents
CN109154949A (en) Analysis is provided in real time based on non-structured electronic document
US20170161315A1 (en) System and method for maintaining data integrity
CN108713198A (en) Automatic checking request based on electronic document
US20180024984A1 (en) System and method for obtaining reissues of electronic documents lacking required data
CN109791540A (en) The system and method reported based on electronic document
US20170169519A1 (en) System and method for automatically verifying transactions based on electronic documents
CN110023970A (en) System and method for verifying non-structured Enterprise Resources Plan data
CN109791643A (en) System and method for generating the merging data of electronic document
CN109791548A (en) Match trading electronic document and proof electronic document
CN109983489A (en) Electronic document is proved based on non-structured data search
CN109313765A (en) The System and method for of automatic verifying transaction is carried out based on electronic document
WO2017142624A1 (en) System and method for automatically tagging electronic documents

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190115