CN110023970A - System and method for verifying non-structured Enterprise Resources Plan data - Google Patents

System and method for verifying non-structured Enterprise Resources Plan data Download PDF

Info

Publication number
CN110023970A
CN110023970A CN201780071509.7A CN201780071509A CN110023970A CN 110023970 A CN110023970 A CN 110023970A CN 201780071509 A CN201780071509 A CN 201780071509A CN 110023970 A CN110023970 A CN 110023970A
Authority
CN
China
Prior art keywords
electronic document
transaction
data
template
matched
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201780071509.7A
Other languages
Chinese (zh)
Inventor
N·古兹曼
I·萨夫特
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vatbox Ltd
Original Assignee
Vatbox Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US15/361,934 external-priority patent/US20170154385A1/en
Application filed by Vatbox Ltd filed Critical Vatbox Ltd
Publication of CN110023970A publication Critical patent/CN110023970A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q20/00Payment architectures, schemes or protocols
    • G06Q20/04Payment circuits
    • G06Q20/047Payment circuits using payment protocols involving electronic receipts
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q20/00Payment architectures, schemes or protocols
    • G06Q20/38Payment protocols; Details thereof
    • G06Q20/389Keeping log of transactions for guaranteeing non-repudiation of a transaction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/04Billing or invoicing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/12Accounting
    • G06Q40/123Tax preparation or submission

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • General Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Marketing (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • General Engineering & Computer Science (AREA)
  • Educational Administration (AREA)
  • Game Theory and Decision Science (AREA)
  • Technology Law (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

System and method for verifying non-structured Enterprise Resources Plan data.This method includes at least one parameter transaction for analyzing the first electronic document to determine transaction, wherein the first electronic document includes at least partly non-structured data;For transaction creation template, wherein template is the structured data sets for including at least one determining parameter transaction;In enterprise resource planning, based on matched second electronic document of template search created;And when finding matched second electronic document, the transaction is verified.

Description

System and method for verifying non-structured Enterprise Resources Plan data
Cross reference to related applications
This application claims submitted on October 9th, 2016 application No. is the power of 62/405,921 U.S. Provisional Application Benefit.The application simultaneously be also on November 28th, 2016 it is submitting, currently examine in, application No. is 15/361,934 beauty The part continuation application of state's patent application.The content of above-mentioned application is incorporated herein by reference in their entirety.
Technical field
Generally, this disclosure relates to verify business system, and relate more specifically to verify in enterprise resource planning Non-structured data.
Background technique
Enterprise Resources Plan (ERP) is business management software, commonly used in collecting, storage, manages and explains from various The data of business activity, such as the spending of enterprise staff.ERP system usually collects the business activity phase in enterprise with various departments The data of pass.Data collected in this way can come from different data sources, and can be different formats.ERP system mentions It for the integrated view of the business activity data, and is further able to generate expenditure report, this report can be sent to phase later The tax authority of pass.
Particularly in large enterprise, employee is engaged in a large amount of business activity.Such business activity may further result in The a large amount of business of tax authority's report is paid.Report that such business expenditure can bring deductions and exemptions of taxes and refund.For this purpose, employee is logical Receipt according to the expenditure occurred is often provided, and usually requires to indicate the type of such expenditure.Based on the instruction, ERP system Report can be generated in system, and this report provides any received receipt to the relevant tax authority.
In addition, according to data relevant to business activity are managed, ERP system must be associated with and be tracked between managed collection Relationship.For example, relevant to the tax affairs report of receipt information must be saved and be associated with receipt itself.Between data set Any mistake in association can lead to the report of mistake, caused by this can then be caused by unsuccessful redemption and exempt from taxation Loss of income, and do not meet laws and rules.Therefore, accurate data management is most important for ERP system.
When the data of part are unstructured, additional challenge can be brought by tracking such data.For example, there is also with chase after Track is stored as the relevant difficulty of expenditure receipt of image file.Existing solution design for these challenges is based on user The file extension of offer identifies the content of the file comprising unstructured data.This solution is limited to mistake (for example, wrong word, file content of mistake etc.), and possibly content therein can not all be described.These disadvantages may be into One step leads to the inaccuracy in ERP system.
The amount of receipt of the employee obtained in business process may be very huge.This large amount of receipt causes to be supplied to The data of ERP system significantly increase, so as to cause being difficult to manage the data in such ERP system.Specifically, existing solution party Case faces the challenge on safeguarding the correct association in managed data.These difficulties may cause mistake and mismatch.Work as mistake It may be mistake with multiple proofs or the related result of other incorrect reports when being captured not in time with mismatch.Manually It is time and effort consuming that whether verifying report matches with receipt, and is limited to mistake.Further, this manual authentication sheet Body can not correct the problem of closed tube reason data.
In addition, the existing solution for verifying transaction automatically is utilizing the electricity comprising at least partly unstructured data It faces the challenge when subfile.Specifically, this solution can identify transaction data in the receipt of scanning and other Non-structured data, but when utilizing identified transaction data, it may be possible to it is inefficient and inaccurate.
Therefore it provides the technical solution of many disadvantages of the prior art is overcome to be advantageous.
Summary of the invention
Several exemplary embodiments of the disclosure are summarized as follows.There is provided general introduction is in order to facilitate reader, offer pair The basic comprehension of such embodiment and not exclusively limit disclosed range.The not all contemplated embodiments of the general introduction it is extensive It summarizes, and is neither intended to the key or important element for identifying all embodiments, be not intended in terms of describing any or all Range.Its sole purpose is that some concepts of one or more embodiments are presented in simplified form, more detailed as what is presented later The preamble carefully described.For convenience, term " some embodiments " or " some embodiments " Lai Zhidai disclosure can be used herein Single embodiment or multiple embodiments.
Some embodiments disclosed herein include the method for verifying non-structured Enterprise Resources Plan data.The party Method includes: at least one parameter transaction for analyzing the first electronic document to determine transaction, wherein the first electronic document includes at least The non-structured data in part;For transaction creation template, wherein template is the structuring for including at least one determining parameter transaction Data set;In enterprise resource planning, based on matched second electronic document of template search created;And when lookup When to matched second electronic document, the transaction is verified.
Some embodiments disclosed herein further include non-transitory computer-readable medium, are stored on it so that handling The program that circuit executes, the program include: at least one parameter transaction for analyzing the first electronic document to determine transaction, wherein the One electronic document includes at least partly non-structured data;For transaction creation template, wherein template be include determining at least one The data set of the structuring of a parameter transaction;In enterprise resource planning, based on the template search created matched Two electronic documents;And when finding matched second electronic document, the transaction is verified.
Some embodiments disclosed herein further include the system for verifying non-structured Enterprise Resources Plan data.It should System includes: processing circuit;And memory, which includes instruction, when the instruction is executed by processing circuit, configuration system System are as follows: the first electronic document of analysis is to determine at least one parameter transaction traded, wherein the first electronic document includes at least portion Divide non-structured data;For transaction creation template, wherein template is the structuring for including at least one determining parameter transaction Data set;In enterprise resource planning, based on matched second electronic document of template search created;And works as and find When matched second electronic document, the transaction is verified.
Detailed description of the invention
It is particularly pointed out in claims at specification ending and is distinctly claimed presently disclosed subject matter. By detailed description with the accompanying drawing below, foregoing end other objects, the feature and advantage of disclosed embodiment will be aobvious and easy See.
Fig. 1 is the network for describing various open embodiments.
Fig. 2 is to show the flow chart of the method according to the embodiment for being used to verify Enterprise Resources Plan data.
Fig. 3 is to show the flow chart of the method according to the embodiment for drawing template establishment.
Fig. 4 is the block diagram according to the validator of embodiment.
Specific embodiment
It is important that, it should be noted that embodiment disclosed herein is only showing for many advantageous uses of the innovative teachings of this paper Example.Generally, the statement made in the description of the present application not necessarily limits the embodiment of any various requirement protection.This Outside, some statements are likely to be suited for certain inventive features and are not suitable for other features.In general, unless otherwise stated, single Number elements can be plural number, vice versa and without loss of generality.In the accompanying drawings, similar labelled notation indicates in several views Similar component.
Various disclosed embodiments include the format by the way that at least partly non-structured data to be converted to structuring, are used In the system and method for verifying Enterprise Resources Plan data.To report electronic document drawing template establishment for the first of verifying.Report Electronic document includes at least partly non-structured data of the instruction for the parameter transaction of transaction.Pass through report electronic document Analysis is based on critical field and value, drawing template establishment.It is used to report the metadata of electronic document based on the template generation created. Using metadata, report is verified by searching for matched second proof electronic document in the memory of enterprise resource planning Accuse electronic document.If not finding matched proof electronic document (that is, report electronic document is invalidated), may search for One or more data sources are to retrieve the matched proof electronic document for verifying report electronic document.
Fig. 1 shows example network Figure 100 for describing various open embodiments.Network 100 includes and passes through net Validator 120 that network 110 communicatedly connects, business system 130,140 user equipment 150 of database.Network can be but unlimited In wireless network, honeycomb or cable network, local area network (LAN), wide area network (WAN), Metropolitan Area Network (MAN) (MAN), internet, WWW (WWW), similar network and any combination.
Business system 130 is associated with enterprise, and can store and represent that carry out transaction related by enterprise or enterprise Data, and instruction enterprise feature enterprise characteristic parameter, the enterprise characteristic parameter be such as, but not limited to formed country, Income data, structured data etc..Enterprise, which may be, but not limited to, its employee, can represent the enterprise of enterprise's purchase commodity and service Industry.Business system 130 can be but not limited to server, database, enterprise resource planning, CRM system or Store any other system of related data.In an exemplary embodiment, business system 130 is storage report electronics text The enterprise resource planning of part, proof electronic document or both.
Database 140, which at least stores, proves electronic document.In the exemplary embodiment, database 140 can by with enterprise 130 associated enterprise operations or associated with it of industry system.Therefore, database 140, which can store, is not stored in business system Proof electronic document in 130, for example, not uploading to the proof electronic document of business system 130.When based on be stored in enterprise system System 130 in proofs electronic document can not verify report electronic document in indicated by transaction when, can inquire database 140 with Determine whether database 140 stores suitable proof electronic document.
User equipment 150 may be, but not limited to, personal computer, laptop computer, tablet computer, smart phone, Wearable computing machine equipment or any other equipment that can grab, store and send non-structured data set.As Non-limiting example, user equipment 150 can be the smart phone including camera.User equipment 150 can by for example with enterprise The employee of the associated tissue of industry system 130 uses.
In one embodiment, validator 120 includes optical recognition process device (for example, the optical recognition process device in Fig. 4 430).Optical recognition process device is configured at least identification data, the character especially in non-structured data.Validator 120 are configured to receive the first report electronic document from business system 130.Report electronic document is at least partly non-structured electricity Subfile, including but not limited to non-structured data, partly-structured data lack known format (for example, by validator The predefined formats of 120 identifications) data of structuring or combinations thereof.
It is usually from the received report electronic document of business system 130, but is not limited to electronic document, which can be with Such as by employee's hand filling (inputting information for example, by typewriting or other modes).In the exemplary embodiment, report electricity Subfile can be the image for showing report on expenses, or non-structuring or the semi-structured text text of the text including report on expenses Part.Report electronic document indicates and one or more information relevant with transaction.As non-limiting embodiment, electronics is reported File includes the row filled in by employee, wherein explanation: " 60 Euros of taxi, 10 Euros of the@each run in Paris ", actually Refer to 10 Euros every time of No. 6 taxi strokes, i.e. 6 different transaction.In this case, in order to verify expense, by 6 phases Corresponding proof electronic document matches with report on expenses.
Report electronic document can upload to business system 130 by the user of such as user equipment 150.For example, user sets Standby 150 user can shoot the image of report on expenses by the camera (not shown) of user equipment 150, and image is sent To business system 130.
In one embodiment, validator 120 is configured at least partly non-structured report electronic document of analysis.Analysis can To include, but are not limited to identify the member shown at least partly non-structured electronic document by computer vision technique Element, and the template based on the element creation transaction attribute identified.This computer vision technique may further include image Identification, pattern-recognition, signal processing, character recognition etc..
Each created template is the data set of structuring, including the identified parameter transaction for transaction.Tool Body, template includes the field of one or more classifications for representing transaction data, wherein each field includes suitable transaction ginseng Number.The creation of the data set template of structuring is discussed further below.
Based on the template created, validator 120 is configured to generate at least partly non-structured report electronic document The metadata of each transaction (for example, the business activity such as paid) of middle instruction.The metadata of transaction can indicate one A or multiple parameter transactions indicated in report electronic document, and can be about one or more fields of corresponding template It generates.
The field for generating metadata can be predefined field, be chosen to uniquely identify the Transaction Information of transaction, make It obtains and provides the proof of transaction with the proof electronic document (for example, receipt) of meta data match.As non-limiting example, for producing The purchase activity of raw expenditure, metadata may include generating the position (indicating in " position " field) of expenditure, generate expenditure (example Such as, as indicated in " merchandise news " field) commodity place feature (for example, the type of commodity, sell product type Deng), the time (for example, as indicated in " time " field) of expenditure is generated, the amount of money is (for example, the currency indicated in respective field Value or quantity), and combinations thereof etc..
Validator 120 is configured to metadata generated of just trading, and passes through matched card in searching enterprise system 130 Bright electronic document verifies each transaction indicated in report electronic document.Specifically, it can generate and inquire according to metadata, And inquiry can be used to search for matched proof electronic document in business system 130.Matched proof electronic document can With associated with metadata, electronic document is generated, metadata phases more than predefined thresholds with report for the metadata Match.
If it is determined that the metadata of transaction matches with the metadata of corresponding proof electronic document, then completion report is tested Card.It mismatches if it is determined that existing (that is, identifying the structural data and any proof being stored in business system electricity of report The metadata of subfile mismatches), then it can be generated about unmatched notice, and be sent to such as user equipment 150.Substitution Ground or jointly, when the mismatch identified so that one or more transaction are not verified, validator 120 be configurable to for Each not verified matched proof electronic document of Trading Research in database 140.If finding matching in the database, Validator 120 is configurable to for matched proof electronic document being stored in business system 130, and determines and have verified that accordingly Transaction.
It is than such as directly using non-structured data more effective that business data is verified using the template of structuring With accurate method of determination.Specifically, it can be based on template generation metadata relative to specific field, so that metadata more has Effect and the parameter for more accurately showing unique identification transaction.Therefore, metadata can be used for correctly searching for matched proof electricity Subfile, while reducing processing capacity and the time for being related to comparing metadata.Furthermore it is possible to store mould in business system 130 Plate rather than report electronic document, and therefore compared with storing electronic document itself, it is possible to reduce the use of memory, especially It is when other digital representations that electronic document is image or vision data, because such visual representation is usually than the base of structuring It needs in the file of text using more memories.
The processing circuit that validator 120 generally includes to be coupled to memory (for example, memory 415 of Fig. 4) is (for example, Fig. 4 In processing circuit 410).Processing circuit may include or for processor (not shown) component, or be coupled to the place of memory Manage device array.Memory includes the instruction that can be executed by processing circuit.When processing circuit executes the instruction, instruction configuration Processing circuit is to execute various functions described herein.
It should be appreciated that presently disclosed embodiment is not limited to specific structure shown in Fig. 1, and this public affairs is not being departed from Other structures can be equally applicable in the case where opening the range of embodiment.Specifically, validator 120 may reside within cloud computing Platform, data center etc..In addition, in some embodiments, there may be multiple validators of operation as described above, and It is configured to have one as backup, with load sharing between them, or is divided into different functions between them.
It shall yet further be noted that some embodiments about Fig. 1 discussion are described as only interacting with a business system 130, this is only It is rather than the limitation for the disclosure for purposes of simplicity.Data from additional enterprise resource planning can be by Validator 120 is verified, without departing from the range of disclosed embodiment.In addition, database 140 can equally be another data Source, for example, may have access to the server of one or more database.It, can be in addition, without departing from the scope of the disclosure Use multiple databases.
Fig. 2 is to show example flow Figure 200 of the method according to the embodiment for being used to verify the data in business system. In embodiment, this method can be executed by validator (for example, validator 120).
In S210, receives or electronic document is reported in retrieval first.Report electronic document includes being related to one or more friendships Easy at least partly non-structured data.At least partly non-structured data include, but are not limited to non-structured data, Partly-structured data or lack known format structuring data.Transaction e file can be from such as Enterprise Resources Plan (ERP) system (for example, business system 130 of Fig. 1), or can be from such as user equipment (for example, user equipment 150 of Fig. 1) It receives.
In the exemplary embodiment, transaction e file can be image, show such as relevant to business activity one A or multiple expenditure reports.It, can be as operated by the employee of the tissue of shooting report on expenses table as non-limiting example Mobile device captures image.
In S220, for each transaction creation template indicated in report electronic document.In embodiment, pass through optical character Identification (OCR) processor can analyze transaction e file.The analysis can also include that at least portion is identified using machine vision Divide element, cleaning or the elimination data and generation structural data in non-structured data, which includes extremely The non-structured data of small part identify in key character and value.As an example, for the image of receipt, machine vision can With relevant to the transaction recorded in receipt information, such as price, position, date, buyer, the seller etc. for identification.
In some embodiments, ambiguity eliminate may include identified in a group field and key value in template it is multiple Transaction.The identification of multiple transaction can more transaction identifications rule based on one or more.For example, such rule can be based on total Valence, for example, can determine that total price represents multiple transaction if total price is higher than threshold value.It can be each of multiple transaction Drawing template establishment.Ambiguity elimination during drawing template establishment is further described below with reference to Fig. 3.
In S230, based on one in the template created, metadata is generated for corresponding transaction.It can be based on unique mark The value in the field of transaction is known to generate metadata.As non-limiting example, for including field " date ", " price ", " number The metadata for indicating the value in those fields can be generated in the template of amount " and " project name ".
In S240, corresponding proof electronic document is searched for verify transaction.In one embodiment, S240 may include being based on Metadata generates inquiry, and searches for matched proof electronics text by one or more data sources using inquiry generated Part.In the exemplary embodiment, data source includes enterprise resource planning.
In embodiment, if not finding matched proof electronic document in the first data source, it may search for Two groups of data sources are to find matched proof electronic document.If finding matched proof electronics text in the second data source Part can then be retrieved and be stored it in the first data source.
In optional S250, notice can be generated.Whether notice can indicate whether transaction is verified, that is, can be the friendship Easy-to-search is to matched proof electronic document.
It in S260, checks whether additional transaction will be verified, also, if it is, continue to execute S230, otherwise executes end Only.
Fig. 3 is to show according to the embodiment based on the electronic document including at least partly non-structured data, is used for The example flow diagram S220 of the method for drawing template establishment.
In S310, electronic document is obtained.Obtaining electronic document may include, but be not limited to, reception electronic document (for example, Receive the image of scanning) or electronic document is retrieved (for example, examining from Client Enterprise system, businessman's business system or database Rope).
In S320, electronic document is analyzed to identify the element at least partly non-structured data.Analysis may include But it is not limited to using optical character identification (OCR) to determine the character in electronic document.
Element can include but is not limited to character relevant to transaction, character string or both.As non-limiting example, member Element may include the print data appeared in expenditure receipt relevant to business activity.Such print data may include but It is not limited to date, time, quantity, seller name, the type of seller business, value-added tax value, the type of bought product, payer Method number of registration etc..
In S330, it is based on the analysis, identifies critical field and value in electronic document.Critical field may include but unlimited In the title of businessman and address, date, currency, the commodity of sale or service, transaction identifiers, invoice number etc..Electronic document can To include the unnecessary details for not being considered as key value.As an example, may not be needed the mark of businessman, therefore it is not to close Key assignments.In embodiment, can predefined key value list, and can extract and the matched data slot of critical field. Then, liquidation procedures is executed to ensure that information is accurately presented.For example, if OCR will lead to data and be shown as " 1211212005 ", then this data can be converted to 12/12/2005 by liquidation procedures.Another example, if title is shown as " Mo $ Den " will be then changed to " Mosden ".It can use the external informations resource such as dictionary, calendar and execute liquidation procedures.
In another embodiment, check whether the data slot of extraction is complete.For example, if can identify the name of businessman Claim, but lose its address, then the critical field of seller addresses is imperfect.Execute the trial for improving the primary key value of missing. The trial may include inquiring the correlation of external system and database, inquiry and previous analyzed invoice, or combinations thereof.It is external System and the example of database may include enterprise content, Universial Product Code (Universal Product Code, UPC) number According to library, package delivery and tracking system etc..In one embodiment, S430 generate one group completely predefined critical field and its Respective value.
In another embodiment, S330 can also include the ambiguity for eliminating non-structured data.Ambiguity elimination can be with base In but file name, dictionary, algorithm, the synonym etc. that are not limited to non-structured data set.Ambiguity elimination may be implemented more The identification accurately traded.Ambiguity elimination can be based on but be not limited to, and the structure of data is (for example, the number in field " destination " According to disambiguation can be carried out with location-based title), dictionary, algorithm, synonym etc..In some embodiments, if ambiguity Eliminate it is unsuccessful, then can be generated notify and be sent to user (such as user of user equipment 150), prompt user provide into one The explanation of step.
As non-limiting example, for the image in the file of entitled " purchase receipt ", can use and character string " total price " is located at character string " 300.00 " character in a line the value determined to include in " purchasing price " field 300.00.As another example, can based on dictionary eliminate character string " Drance " ambiguity to generate metadata, the metadata Indicate that position associated with non-structured data set is France.As another example, in field relevant to charge type, The data of the structuring of field can be " Paris taxi ", and the value of the field can be " 60 Euros ".Based on maximum The one or more rule of taxi price can determine that " 60 Euros " are too high for multiple taxi fares use, and because This field corresponds to multistage taxi stroke.
In S340, the data set of structuring is generated.The data set of generation includes identified field and value.
It should be noted that embodiment relevant to data in ERP system as described above, be described as the number of structuring According to, person just for the sake of simplified purpose, rather than the limitation to the disclosed embodiments.The scope of the present disclosure is not being departed from In the case of, it can equally use partly-structured data.In addition, data can store any database or with except ERP system Other storage units connected to system communication other than system.It should also be noted that only for the purposes of illustration, rather than to all public affairs The limitation for the embodiment opened, above with reference to Fig. 2 and 3 described embodiments with reference to Fig. 1 discussion.
Fig. 4 is the schematic block diagram according to the validator 120 of embodiment.Validator 120 include be coupled to memory 415, The processing circuit 410 of reservoir 420 and network interface 440.In embodiment, validator 120 may include optical character identification (OCR) processor 430.In another embodiment, the component of validator 120 can communicatedly be connected by bus 450.
Processing circuit 410 can be implemented as one or more hardware logic components and circuit.Such as rather than limit, can make The exemplary types of hardware logic component include field programmable gate array (field programmable gate Array, FPGA), specific integrated circuit (application-specific integrated circuit, ASIC), dedicated mark Quasi- product (Application-specific standard products, ASSP), system level chip system (system-on- A-chip system, SOC), general purpose microprocessor, microcontroller, digital signal processor (digital signal Processor, DSP) etc., or it is able to carry out other any hardware logic components of calculating or other information processing.
Memory 415 can be volatibility (such as RAM), non-volatile (such as ROM, flash memory) or combinations thereof.At one In configuration, the computer-readable instruction for executing one or more embodiments as described herein is stored in reservoir 420.
In another embodiment, memory 415 is configured to storage software.Software is broadly interpreted that any type of finger Enable, no matter refer to software, firmware, middleware, microcode, hardware description language or other.Instruction may include code (example Such as, source code format, binary code form, executable code format or other any code formats appropriate).When by one Or multiple processors, when executing the instruction, the instruction is so that processing circuit 410 executes various processes described herein.Specifically, As discussed herein, instruction when being executed, makes processing circuit 410 be based on electronic document and generates merging data.
Reservoir 420 can be magnetic reservoir, optical storage device etc., and may be embodied as such as flash memory or other storages Technology, CD-ROM, digital versatile disc (DVD) or other any media that can be used in storing desired information.
Reservoir 420, which can also be stored, generates first number to the analysis of non-structured data based on OCR processor 430 According to.In another embodiment, reservoir 420 can also be stored based on metadata inquiry generated.
OCR processor 430 can include but is not limited to be configured to identify mode in non-structured data set, feature or The feature and/or pattern recognition processor (recognition processor, RP) 435 of both.Specifically, implement one In example, OCR processor 430 is configured at least identify the character in non-structured data.It can use identified character wound It builds including the data set for data needed for checking request.
Network interface 440 allow validator 120 and business system 130, database 140, user equipment 150 or combinations thereof into Row communication notifies for example to receive electronic document, to send, searches for electronic document, storing data etc..
It should be appreciated that embodiment as described herein is not limited to specific structure shown in Fig. 4, and the disclosure is not being departed from Other structures can be equally used in the case where the range of embodiment.
It should be noted that the various implementations about the proof electronic document for verifying single deals match of discussion described herein Example, it is only for simplify purpose rather than the limitation to the disclosed embodiments.The case where not departing from the scope of the present disclosure Under, it can serially or parallelly verify the multiple transaction indicated in report electronic document.As non-limiting example, report electricity Subfile can be the report on expenses of the multiple transaction of instruction that are being made by employee.
Implementable various embodiments disclosed herein is hardware, firmware, software or any combination thereof.In addition, software is preferred Ground be embodied as visibly realizing on program storage unit (PSU) or the computer-readable medium that is made of component on or certain equipment And/or the combination of equipment.Application program can upload to the machine including any suitable architecture and be executed by it.Preferably, should Machine is in the computer platform with such as one or more central processing unit (" CPU "), memory and input/output structure Upper implementation.Computer platform can also include operating system and micro-instruction code.Various processes and function described herein can be with It is a part of micro-instruction code either application program, or is any combination of them, can be executed by CPU, nothing By whether explicitly showing such computer or processor.In addition, various other peripheral cells may be coupled to computer Platform, such as additional-data storage unit and print unit.In addition, non-transitory computer-readable medium is to propagate letter except temporary Any computer-readable medium except number.
All examples as described herein and conditional statement are intended for instructing purpose, to help reader to understand disclosed reality The principle and inventor of applying example promote the concept that this field is contributed, and should be understood as not to it is such specifically quote show Example and condition make limitation.In addition, record herein the principle of embodiment of the disclosure, aspect and embodiment and its specifically show All statements of example, it is intended to including its structural and functional equivalent.In addition, such equivalent includes currently known etc. Jljl and in the future exploitation equivalent, that is, exploitation execution identical function any element, but regardless of structure how.

Claims (17)

1. the method for verifying non-structured Enterprise Resources Plan data, comprising:
The first electronic document is analyzed to determine at least one parameter transaction of transaction, wherein the first electronic document includes at least partly Non-structured data;
For transaction creation template, wherein the template be include at least one identified parameter transaction structuring data Collection;
In enterprise resource planning, based on matched second electronic document of template search created;And
When finding matched second electronic document, the transaction is verified.
2. according to the method described in claim 1, wherein determining at least one parameter transaction further include:
At least one critical field and at least one value are identified in the first electronic document;
Based on the first electronic document, create data set, wherein the data set created include at least one described critical field and At least one described value;And
Created data set is analyzed, wherein determining at least one described parameter transaction based on analysis.
3. according to the method described in claim 2, wherein identifying at least one described critical field and at least one described value, also Include:
The first electronic document is analyzed to determine the data in the first electronic document;And
Based on the list of predefined critical field, at least part of identified data is extracted, wherein identified data At least part matched at least one critical field in predefined critical field list.
4. according to the method described in claim 3, wherein analyzing the first electronic document, further includes:
Optical character identification is executed to the first electronic document.
5. according to the method described in claim 1, further include:
Based on the template generation inquiry created, wherein described search includes inquiring corporate resources using inquiry generated Planning system.
6. according to the method described in claim 1, further include:
Based on template, the metadata for transaction is generated, wherein inquiry is generated based on the metadata, wherein second electronics File is associated with metadata, wherein the metadata of matched second electronic document with it is more than predefined threshold value generated Metadata matches.
7. according to the method described in claim 1, wherein drawing template establishment further include:
Ambiguity elimination is carried out at least partly non-structured data.
8. according to the method described in claim 1, further include:
When not finding matched second electronic document in enterprise resource planning, matched second is searched in the database Electronic document;And
When finding the second electronic document of matching in the database, the second electronic document is stored in enterprise resource planning In.
9. a kind of non-transitory computer-readable medium is stored thereon with instruction, for executing one or more processing units For verifying the program of non-structured Enterprise Resources Plan data, described program includes:
The first electronic document is analyzed to determine at least one parameter transaction of transaction, wherein the first electronic document includes at least partly Non-structured data;
For transaction creation template, wherein the template is the structured data sets for including at least one identified parameter transaction;
In enterprise resource planning, matched second electronic document is searched based on the template created;And
When finding the second electronic document of matching, the transaction is verified.
10. the system for verifying non-structured Enterprise Resources Plan data, comprising:
Processing circuit;And
Memory, the memory includes instruction, when executing described instruction by processing circuit, the system configuration are as follows:
The first electronic document is analyzed to determine at least one parameter transaction of transaction, wherein the first electronic document includes at least partly Non-structured data;
For transaction creation template, wherein the template be include at least one identified parameter transaction structuring data Collection;
In enterprise resource planning, matched second electronic document is searched based on the template created;And
When finding the second electronic document of matching, the transaction is verified.
11. system according to claim 10, wherein the system is additionally configured to:
At least one critical field and at least one value are identified in the first electronic document;
Based on the first electronic document, create data set, wherein the data set created include at least one described critical field and At least one described value;And
Created data set is analyzed, wherein determining at least one described parameter transaction based on analysis.
12. system according to claim 11, wherein the system is additionally configured to:
The first electronic document is analyzed to determine the data in the first electronic document;And
Based on the list of predefined critical field, at least part of identified data is extracted, wherein identified data At least part matched at least one critical field in predefined critical field list.
13. according to the method for claim 12, wherein the system is additionally configured to:
Optical character identification is executed to the first electronic document.
14. according to the method described in claim 10, wherein the system is additionally configured to:
Based on the template generation inquiry created, wherein described search includes inquiring corporate resources using inquiry generated Planning system.
15. according to the method described in claim 10, wherein the system is additionally configured to:
Based on template, the metadata for transaction is generated, wherein inquiry is generated based on the metadata, wherein second electronics File is associated with metadata, wherein the metadata of matched second electronic document with it is more than predefined threshold value generated Metadata matches.
16. according to the method described in claim 10, wherein drawing template establishment further include:
Ambiguity elimination is carried out at least partly non-structured data.
17. according to the method described in claim 10, wherein the system is additionally configured to:
When not finding matched second electronic document in enterprise resource planning, matched second is searched in the database Electronic document, and
When finding matched second electronic document in the database, the second electronic document is stored in enterprise resource planning In.
CN201780071509.7A 2016-10-09 2017-10-04 System and method for verifying non-structured Enterprise Resources Plan data Pending CN110023970A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201662405921P 2016-10-09 2016-10-09
US62/405,921 2016-10-09
US15/361,934 2016-11-28
US15/361,934 US20170154385A1 (en) 2015-11-29 2016-11-28 System and method for automatic validation
PCT/US2017/055135 WO2018067698A1 (en) 2016-10-09 2017-10-04 System and method for verifying unstructured enterprise resource planning data

Publications (1)

Publication Number Publication Date
CN110023970A true CN110023970A (en) 2019-07-16

Family

ID=61832191

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201780071509.7A Pending CN110023970A (en) 2016-10-09 2017-10-04 System and method for verifying non-structured Enterprise Resources Plan data

Country Status (3)

Country Link
EP (1) EP3523771A4 (en)
CN (1) CN110023970A (en)
WO (1) WO2018067698A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023279037A1 (en) * 2021-06-30 2023-01-05 Pricewaterhousecoopers Llp Ai-augmented auditing platform including techniques for automated assessment of vouching evidence

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3225912B2 (en) * 1998-01-08 2001-11-05 日本電気株式会社 Information retrieval apparatus, method and recording medium
US20030212617A1 (en) * 2002-05-13 2003-11-13 Stone James S. Accounts payable process
US20100161616A1 (en) * 2008-12-16 2010-06-24 Carol Mitchell Systems and methods for coupling structured content with unstructured content
US8774516B2 (en) * 2009-02-10 2014-07-08 Kofax, Inc. Systems, methods and computer program products for determining document validity

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023279037A1 (en) * 2021-06-30 2023-01-05 Pricewaterhousecoopers Llp Ai-augmented auditing platform including techniques for automated assessment of vouching evidence

Also Published As

Publication number Publication date
EP3523771A1 (en) 2019-08-14
WO2018067698A1 (en) 2018-04-12
EP3523771A4 (en) 2020-04-29

Similar Documents

Publication Publication Date Title
US10614527B2 (en) System and method for automatic generation of reports based on electronic documents
CN108985912B (en) Data reconciliation
US11138372B2 (en) System and method for reporting based on electronic documents
US20170323006A1 (en) System and method for providing analytics in real-time based on unstructured electronic documents
US20170169292A1 (en) System and method for automatically verifying requests based on electronic documents
US20180011846A1 (en) System and method for matching transaction electronic documents to evidencing electronic documents
EP3494495A1 (en) System and method for completing electronic documents
CN110023970A (en) System and method for verifying non-structured Enterprise Resources Plan data
US20180046663A1 (en) System and method for completing electronic documents
US20170169518A1 (en) System and method for automatically tagging electronic documents
US10558880B2 (en) System and method for finding evidencing electronic documents based on unstructured data
US10387561B2 (en) System and method for obtaining reissues of electronic documents lacking required data
US20180096435A1 (en) System and method for verifying unstructured enterprise resource planning data
EP3494496A1 (en) System and method for reporting based on electronic documents
US20180025224A1 (en) System and method for identifying unclaimed electronic documents
CN109983489A (en) Electronic document is proved based on non-structured data search
WO2017201012A1 (en) Providing analytics in real-time based on unstructured electronic documents
CN108713198A (en) Automatic checking request based on electronic document
US20200118122A1 (en) Techniques for completing missing and obscured transaction data items
US20170323395A1 (en) System and method for creating historical records based on unstructured electronic documents
EP3491554A1 (en) Matching transaction electronic documents to evidencing electronic
WO2018027133A1 (en) Obtaining reissues of electronic documents lacking required data
WO2017201013A1 (en) System and method for creating historical records based on unstructured electronic documents
WO2017142624A1 (en) System and method for automatically tagging electronic documents
CN109313765A (en) The System and method for of automatic verifying transaction is carried out based on electronic document

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190716

WD01 Invention patent application deemed withdrawn after publication