CN110023970A - System and method for verifying non-structured Enterprise Resources Plan data - Google Patents
System and method for verifying non-structured Enterprise Resources Plan data Download PDFInfo
- Publication number
- CN110023970A CN110023970A CN201780071509.7A CN201780071509A CN110023970A CN 110023970 A CN110023970 A CN 110023970A CN 201780071509 A CN201780071509 A CN 201780071509A CN 110023970 A CN110023970 A CN 110023970A
- Authority
- CN
- China
- Prior art keywords
- electronic document
- transaction
- data
- template
- matched
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0631—Resource planning, allocation, distributing or scheduling for enterprises or organisations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q20/00—Payment architectures, schemes or protocols
- G06Q20/04—Payment circuits
- G06Q20/047—Payment circuits using payment protocols involving electronic receipts
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q20/00—Payment architectures, schemes or protocols
- G06Q20/38—Payment protocols; Details thereof
- G06Q20/389—Keeping log of transactions for guaranteeing non-repudiation of a transaction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/04—Billing or invoicing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/12—Accounting
- G06Q40/123—Tax preparation or submission
Landscapes
- Business, Economics & Management (AREA)
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Strategic Management (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Accounting & Taxation (AREA)
- General Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Economics (AREA)
- Finance (AREA)
- Development Economics (AREA)
- Entrepreneurship & Innovation (AREA)
- Marketing (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Tourism & Hospitality (AREA)
- Quality & Reliability (AREA)
- Operations Research (AREA)
- General Engineering & Computer Science (AREA)
- Educational Administration (AREA)
- Game Theory and Decision Science (AREA)
- Technology Law (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
System and method for verifying non-structured Enterprise Resources Plan data.This method includes at least one parameter transaction for analyzing the first electronic document to determine transaction, wherein the first electronic document includes at least partly non-structured data;For transaction creation template, wherein template is the structured data sets for including at least one determining parameter transaction;In enterprise resource planning, based on matched second electronic document of template search created;And when finding matched second electronic document, the transaction is verified.
Description
Cross reference to related applications
This application claims submitted on October 9th, 2016 application No. is the power of 62/405,921 U.S. Provisional Application
Benefit.The application simultaneously be also on November 28th, 2016 it is submitting, currently examine in, application No. is 15/361,934 beauty
The part continuation application of state's patent application.The content of above-mentioned application is incorporated herein by reference in their entirety.
Technical field
Generally, this disclosure relates to verify business system, and relate more specifically to verify in enterprise resource planning
Non-structured data.
Background technique
Enterprise Resources Plan (ERP) is business management software, commonly used in collecting, storage, manages and explains from various
The data of business activity, such as the spending of enterprise staff.ERP system usually collects the business activity phase in enterprise with various departments
The data of pass.Data collected in this way can come from different data sources, and can be different formats.ERP system mentions
It for the integrated view of the business activity data, and is further able to generate expenditure report, this report can be sent to phase later
The tax authority of pass.
Particularly in large enterprise, employee is engaged in a large amount of business activity.Such business activity may further result in
The a large amount of business of tax authority's report is paid.Report that such business expenditure can bring deductions and exemptions of taxes and refund.For this purpose, employee is logical
Receipt according to the expenditure occurred is often provided, and usually requires to indicate the type of such expenditure.Based on the instruction, ERP system
Report can be generated in system, and this report provides any received receipt to the relevant tax authority.
In addition, according to data relevant to business activity are managed, ERP system must be associated with and be tracked between managed collection
Relationship.For example, relevant to the tax affairs report of receipt information must be saved and be associated with receipt itself.Between data set
Any mistake in association can lead to the report of mistake, caused by this can then be caused by unsuccessful redemption and exempt from taxation
Loss of income, and do not meet laws and rules.Therefore, accurate data management is most important for ERP system.
When the data of part are unstructured, additional challenge can be brought by tracking such data.For example, there is also with chase after
Track is stored as the relevant difficulty of expenditure receipt of image file.Existing solution design for these challenges is based on user
The file extension of offer identifies the content of the file comprising unstructured data.This solution is limited to mistake
(for example, wrong word, file content of mistake etc.), and possibly content therein can not all be described.These disadvantages may be into
One step leads to the inaccuracy in ERP system.
The amount of receipt of the employee obtained in business process may be very huge.This large amount of receipt causes to be supplied to
The data of ERP system significantly increase, so as to cause being difficult to manage the data in such ERP system.Specifically, existing solution party
Case faces the challenge on safeguarding the correct association in managed data.These difficulties may cause mistake and mismatch.Work as mistake
It may be mistake with multiple proofs or the related result of other incorrect reports when being captured not in time with mismatch.Manually
It is time and effort consuming that whether verifying report matches with receipt, and is limited to mistake.Further, this manual authentication sheet
Body can not correct the problem of closed tube reason data.
In addition, the existing solution for verifying transaction automatically is utilizing the electricity comprising at least partly unstructured data
It faces the challenge when subfile.Specifically, this solution can identify transaction data in the receipt of scanning and other
Non-structured data, but when utilizing identified transaction data, it may be possible to it is inefficient and inaccurate.
Therefore it provides the technical solution of many disadvantages of the prior art is overcome to be advantageous.
Summary of the invention
Several exemplary embodiments of the disclosure are summarized as follows.There is provided general introduction is in order to facilitate reader, offer pair
The basic comprehension of such embodiment and not exclusively limit disclosed range.The not all contemplated embodiments of the general introduction it is extensive
It summarizes, and is neither intended to the key or important element for identifying all embodiments, be not intended in terms of describing any or all
Range.Its sole purpose is that some concepts of one or more embodiments are presented in simplified form, more detailed as what is presented later
The preamble carefully described.For convenience, term " some embodiments " or " some embodiments " Lai Zhidai disclosure can be used herein
Single embodiment or multiple embodiments.
Some embodiments disclosed herein include the method for verifying non-structured Enterprise Resources Plan data.The party
Method includes: at least one parameter transaction for analyzing the first electronic document to determine transaction, wherein the first electronic document includes at least
The non-structured data in part;For transaction creation template, wherein template is the structuring for including at least one determining parameter transaction
Data set;In enterprise resource planning, based on matched second electronic document of template search created;And when lookup
When to matched second electronic document, the transaction is verified.
Some embodiments disclosed herein further include non-transitory computer-readable medium, are stored on it so that handling
The program that circuit executes, the program include: at least one parameter transaction for analyzing the first electronic document to determine transaction, wherein the
One electronic document includes at least partly non-structured data;For transaction creation template, wherein template be include determining at least one
The data set of the structuring of a parameter transaction;In enterprise resource planning, based on the template search created matched
Two electronic documents;And when finding matched second electronic document, the transaction is verified.
Some embodiments disclosed herein further include the system for verifying non-structured Enterprise Resources Plan data.It should
System includes: processing circuit;And memory, which includes instruction, when the instruction is executed by processing circuit, configuration system
System are as follows: the first electronic document of analysis is to determine at least one parameter transaction traded, wherein the first electronic document includes at least portion
Divide non-structured data;For transaction creation template, wherein template is the structuring for including at least one determining parameter transaction
Data set;In enterprise resource planning, based on matched second electronic document of template search created;And works as and find
When matched second electronic document, the transaction is verified.
Detailed description of the invention
It is particularly pointed out in claims at specification ending and is distinctly claimed presently disclosed subject matter.
By detailed description with the accompanying drawing below, foregoing end other objects, the feature and advantage of disclosed embodiment will be aobvious and easy
See.
Fig. 1 is the network for describing various open embodiments.
Fig. 2 is to show the flow chart of the method according to the embodiment for being used to verify Enterprise Resources Plan data.
Fig. 3 is to show the flow chart of the method according to the embodiment for drawing template establishment.
Fig. 4 is the block diagram according to the validator of embodiment.
Specific embodiment
It is important that, it should be noted that embodiment disclosed herein is only showing for many advantageous uses of the innovative teachings of this paper
Example.Generally, the statement made in the description of the present application not necessarily limits the embodiment of any various requirement protection.This
Outside, some statements are likely to be suited for certain inventive features and are not suitable for other features.In general, unless otherwise stated, single
Number elements can be plural number, vice versa and without loss of generality.In the accompanying drawings, similar labelled notation indicates in several views
Similar component.
Various disclosed embodiments include the format by the way that at least partly non-structured data to be converted to structuring, are used
In the system and method for verifying Enterprise Resources Plan data.To report electronic document drawing template establishment for the first of verifying.Report
Electronic document includes at least partly non-structured data of the instruction for the parameter transaction of transaction.Pass through report electronic document
Analysis is based on critical field and value, drawing template establishment.It is used to report the metadata of electronic document based on the template generation created.
Using metadata, report is verified by searching for matched second proof electronic document in the memory of enterprise resource planning
Accuse electronic document.If not finding matched proof electronic document (that is, report electronic document is invalidated), may search for
One or more data sources are to retrieve the matched proof electronic document for verifying report electronic document.
Fig. 1 shows example network Figure 100 for describing various open embodiments.Network 100 includes and passes through net
Validator 120 that network 110 communicatedly connects, business system 130,140 user equipment 150 of database.Network can be but unlimited
In wireless network, honeycomb or cable network, local area network (LAN), wide area network (WAN), Metropolitan Area Network (MAN) (MAN), internet, WWW
(WWW), similar network and any combination.
Business system 130 is associated with enterprise, and can store and represent that carry out transaction related by enterprise or enterprise
Data, and instruction enterprise feature enterprise characteristic parameter, the enterprise characteristic parameter be such as, but not limited to formed country,
Income data, structured data etc..Enterprise, which may be, but not limited to, its employee, can represent the enterprise of enterprise's purchase commodity and service
Industry.Business system 130 can be but not limited to server, database, enterprise resource planning, CRM system or
Store any other system of related data.In an exemplary embodiment, business system 130 is storage report electronics text
The enterprise resource planning of part, proof electronic document or both.
Database 140, which at least stores, proves electronic document.In the exemplary embodiment, database 140 can by with enterprise
130 associated enterprise operations or associated with it of industry system.Therefore, database 140, which can store, is not stored in business system
Proof electronic document in 130, for example, not uploading to the proof electronic document of business system 130.When based on be stored in enterprise system
System 130 in proofs electronic document can not verify report electronic document in indicated by transaction when, can inquire database 140 with
Determine whether database 140 stores suitable proof electronic document.
User equipment 150 may be, but not limited to, personal computer, laptop computer, tablet computer, smart phone,
Wearable computing machine equipment or any other equipment that can grab, store and send non-structured data set.As
Non-limiting example, user equipment 150 can be the smart phone including camera.User equipment 150 can by for example with enterprise
The employee of the associated tissue of industry system 130 uses.
In one embodiment, validator 120 includes optical recognition process device (for example, the optical recognition process device in Fig. 4
430).Optical recognition process device is configured at least identification data, the character especially in non-structured data.Validator
120 are configured to receive the first report electronic document from business system 130.Report electronic document is at least partly non-structured electricity
Subfile, including but not limited to non-structured data, partly-structured data lack known format (for example, by validator
The predefined formats of 120 identifications) data of structuring or combinations thereof.
It is usually from the received report electronic document of business system 130, but is not limited to electronic document, which can be with
Such as by employee's hand filling (inputting information for example, by typewriting or other modes).In the exemplary embodiment, report electricity
Subfile can be the image for showing report on expenses, or non-structuring or the semi-structured text text of the text including report on expenses
Part.Report electronic document indicates and one or more information relevant with transaction.As non-limiting embodiment, electronics is reported
File includes the row filled in by employee, wherein explanation: " 60 Euros of taxi, 10 Euros of the@each run in Paris ", actually
Refer to 10 Euros every time of No. 6 taxi strokes, i.e. 6 different transaction.In this case, in order to verify expense, by 6 phases
Corresponding proof electronic document matches with report on expenses.
Report electronic document can upload to business system 130 by the user of such as user equipment 150.For example, user sets
Standby 150 user can shoot the image of report on expenses by the camera (not shown) of user equipment 150, and image is sent
To business system 130.
In one embodiment, validator 120 is configured at least partly non-structured report electronic document of analysis.Analysis can
To include, but are not limited to identify the member shown at least partly non-structured electronic document by computer vision technique
Element, and the template based on the element creation transaction attribute identified.This computer vision technique may further include image
Identification, pattern-recognition, signal processing, character recognition etc..
Each created template is the data set of structuring, including the identified parameter transaction for transaction.Tool
Body, template includes the field of one or more classifications for representing transaction data, wherein each field includes suitable transaction ginseng
Number.The creation of the data set template of structuring is discussed further below.
Based on the template created, validator 120 is configured to generate at least partly non-structured report electronic document
The metadata of each transaction (for example, the business activity such as paid) of middle instruction.The metadata of transaction can indicate one
A or multiple parameter transactions indicated in report electronic document, and can be about one or more fields of corresponding template
It generates.
The field for generating metadata can be predefined field, be chosen to uniquely identify the Transaction Information of transaction, make
It obtains and provides the proof of transaction with the proof electronic document (for example, receipt) of meta data match.As non-limiting example, for producing
The purchase activity of raw expenditure, metadata may include generating the position (indicating in " position " field) of expenditure, generate expenditure (example
Such as, as indicated in " merchandise news " field) commodity place feature (for example, the type of commodity, sell product type
Deng), the time (for example, as indicated in " time " field) of expenditure is generated, the amount of money is (for example, the currency indicated in respective field
Value or quantity), and combinations thereof etc..
Validator 120 is configured to metadata generated of just trading, and passes through matched card in searching enterprise system 130
Bright electronic document verifies each transaction indicated in report electronic document.Specifically, it can generate and inquire according to metadata,
And inquiry can be used to search for matched proof electronic document in business system 130.Matched proof electronic document can
With associated with metadata, electronic document is generated, metadata phases more than predefined thresholds with report for the metadata
Match.
If it is determined that the metadata of transaction matches with the metadata of corresponding proof electronic document, then completion report is tested
Card.It mismatches if it is determined that existing (that is, identifying the structural data and any proof being stored in business system electricity of report
The metadata of subfile mismatches), then it can be generated about unmatched notice, and be sent to such as user equipment 150.Substitution
Ground or jointly, when the mismatch identified so that one or more transaction are not verified, validator 120 be configurable to for
Each not verified matched proof electronic document of Trading Research in database 140.If finding matching in the database,
Validator 120 is configurable to for matched proof electronic document being stored in business system 130, and determines and have verified that accordingly
Transaction.
It is than such as directly using non-structured data more effective that business data is verified using the template of structuring
With accurate method of determination.Specifically, it can be based on template generation metadata relative to specific field, so that metadata more has
Effect and the parameter for more accurately showing unique identification transaction.Therefore, metadata can be used for correctly searching for matched proof electricity
Subfile, while reducing processing capacity and the time for being related to comparing metadata.Furthermore it is possible to store mould in business system 130
Plate rather than report electronic document, and therefore compared with storing electronic document itself, it is possible to reduce the use of memory, especially
It is when other digital representations that electronic document is image or vision data, because such visual representation is usually than the base of structuring
It needs in the file of text using more memories.
The processing circuit that validator 120 generally includes to be coupled to memory (for example, memory 415 of Fig. 4) is (for example, Fig. 4
In processing circuit 410).Processing circuit may include or for processor (not shown) component, or be coupled to the place of memory
Manage device array.Memory includes the instruction that can be executed by processing circuit.When processing circuit executes the instruction, instruction configuration
Processing circuit is to execute various functions described herein.
It should be appreciated that presently disclosed embodiment is not limited to specific structure shown in Fig. 1, and this public affairs is not being departed from
Other structures can be equally applicable in the case where opening the range of embodiment.Specifically, validator 120 may reside within cloud computing
Platform, data center etc..In addition, in some embodiments, there may be multiple validators of operation as described above, and
It is configured to have one as backup, with load sharing between them, or is divided into different functions between them.
It shall yet further be noted that some embodiments about Fig. 1 discussion are described as only interacting with a business system 130, this is only
It is rather than the limitation for the disclosure for purposes of simplicity.Data from additional enterprise resource planning can be by
Validator 120 is verified, without departing from the range of disclosed embodiment.In addition, database 140 can equally be another data
Source, for example, may have access to the server of one or more database.It, can be in addition, without departing from the scope of the disclosure
Use multiple databases.
Fig. 2 is to show example flow Figure 200 of the method according to the embodiment for being used to verify the data in business system.
In embodiment, this method can be executed by validator (for example, validator 120).
In S210, receives or electronic document is reported in retrieval first.Report electronic document includes being related to one or more friendships
Easy at least partly non-structured data.At least partly non-structured data include, but are not limited to non-structured data,
Partly-structured data or lack known format structuring data.Transaction e file can be from such as Enterprise Resources Plan
(ERP) system (for example, business system 130 of Fig. 1), or can be from such as user equipment (for example, user equipment 150 of Fig. 1)
It receives.
In the exemplary embodiment, transaction e file can be image, show such as relevant to business activity one
A or multiple expenditure reports.It, can be as operated by the employee of the tissue of shooting report on expenses table as non-limiting example
Mobile device captures image.
In S220, for each transaction creation template indicated in report electronic document.In embodiment, pass through optical character
Identification (OCR) processor can analyze transaction e file.The analysis can also include that at least portion is identified using machine vision
Divide element, cleaning or the elimination data and generation structural data in non-structured data, which includes extremely
The non-structured data of small part identify in key character and value.As an example, for the image of receipt, machine vision can
With relevant to the transaction recorded in receipt information, such as price, position, date, buyer, the seller etc. for identification.
In some embodiments, ambiguity eliminate may include identified in a group field and key value in template it is multiple
Transaction.The identification of multiple transaction can more transaction identifications rule based on one or more.For example, such rule can be based on total
Valence, for example, can determine that total price represents multiple transaction if total price is higher than threshold value.It can be each of multiple transaction
Drawing template establishment.Ambiguity elimination during drawing template establishment is further described below with reference to Fig. 3.
In S230, based on one in the template created, metadata is generated for corresponding transaction.It can be based on unique mark
The value in the field of transaction is known to generate metadata.As non-limiting example, for including field " date ", " price ", " number
The metadata for indicating the value in those fields can be generated in the template of amount " and " project name ".
In S240, corresponding proof electronic document is searched for verify transaction.In one embodiment, S240 may include being based on
Metadata generates inquiry, and searches for matched proof electronics text by one or more data sources using inquiry generated
Part.In the exemplary embodiment, data source includes enterprise resource planning.
In embodiment, if not finding matched proof electronic document in the first data source, it may search for
Two groups of data sources are to find matched proof electronic document.If finding matched proof electronics text in the second data source
Part can then be retrieved and be stored it in the first data source.
In optional S250, notice can be generated.Whether notice can indicate whether transaction is verified, that is, can be the friendship
Easy-to-search is to matched proof electronic document.
It in S260, checks whether additional transaction will be verified, also, if it is, continue to execute S230, otherwise executes end
Only.
Fig. 3 is to show according to the embodiment based on the electronic document including at least partly non-structured data, is used for
The example flow diagram S220 of the method for drawing template establishment.
In S310, electronic document is obtained.Obtaining electronic document may include, but be not limited to, reception electronic document (for example,
Receive the image of scanning) or electronic document is retrieved (for example, examining from Client Enterprise system, businessman's business system or database
Rope).
In S320, electronic document is analyzed to identify the element at least partly non-structured data.Analysis may include
But it is not limited to using optical character identification (OCR) to determine the character in electronic document.
Element can include but is not limited to character relevant to transaction, character string or both.As non-limiting example, member
Element may include the print data appeared in expenditure receipt relevant to business activity.Such print data may include but
It is not limited to date, time, quantity, seller name, the type of seller business, value-added tax value, the type of bought product, payer
Method number of registration etc..
In S330, it is based on the analysis, identifies critical field and value in electronic document.Critical field may include but unlimited
In the title of businessman and address, date, currency, the commodity of sale or service, transaction identifiers, invoice number etc..Electronic document can
To include the unnecessary details for not being considered as key value.As an example, may not be needed the mark of businessman, therefore it is not to close
Key assignments.In embodiment, can predefined key value list, and can extract and the matched data slot of critical field.
Then, liquidation procedures is executed to ensure that information is accurately presented.For example, if OCR will lead to data and be shown as
" 1211212005 ", then this data can be converted to 12/12/2005 by liquidation procedures.Another example, if title is shown as " Mo $
Den " will be then changed to " Mosden ".It can use the external informations resource such as dictionary, calendar and execute liquidation procedures.
In another embodiment, check whether the data slot of extraction is complete.For example, if can identify the name of businessman
Claim, but lose its address, then the critical field of seller addresses is imperfect.Execute the trial for improving the primary key value of missing.
The trial may include inquiring the correlation of external system and database, inquiry and previous analyzed invoice, or combinations thereof.It is external
System and the example of database may include enterprise content, Universial Product Code (Universal Product Code, UPC) number
According to library, package delivery and tracking system etc..In one embodiment, S430 generate one group completely predefined critical field and its
Respective value.
In another embodiment, S330 can also include the ambiguity for eliminating non-structured data.Ambiguity elimination can be with base
In but file name, dictionary, algorithm, the synonym etc. that are not limited to non-structured data set.Ambiguity elimination may be implemented more
The identification accurately traded.Ambiguity elimination can be based on but be not limited to, and the structure of data is (for example, the number in field " destination "
According to disambiguation can be carried out with location-based title), dictionary, algorithm, synonym etc..In some embodiments, if ambiguity
Eliminate it is unsuccessful, then can be generated notify and be sent to user (such as user of user equipment 150), prompt user provide into one
The explanation of step.
As non-limiting example, for the image in the file of entitled " purchase receipt ", can use and character string
" total price " is located at character string " 300.00 " character in a line the value determined to include in " purchasing price " field
300.00.As another example, can based on dictionary eliminate character string " Drance " ambiguity to generate metadata, the metadata
Indicate that position associated with non-structured data set is France.As another example, in field relevant to charge type,
The data of the structuring of field can be " Paris taxi ", and the value of the field can be " 60 Euros ".Based on maximum
The one or more rule of taxi price can determine that " 60 Euros " are too high for multiple taxi fares use, and because
This field corresponds to multistage taxi stroke.
In S340, the data set of structuring is generated.The data set of generation includes identified field and value.
It should be noted that embodiment relevant to data in ERP system as described above, be described as the number of structuring
According to, person just for the sake of simplified purpose, rather than the limitation to the disclosed embodiments.The scope of the present disclosure is not being departed from
In the case of, it can equally use partly-structured data.In addition, data can store any database or with except ERP system
Other storage units connected to system communication other than system.It should also be noted that only for the purposes of illustration, rather than to all public affairs
The limitation for the embodiment opened, above with reference to Fig. 2 and 3 described embodiments with reference to Fig. 1 discussion.
Fig. 4 is the schematic block diagram according to the validator 120 of embodiment.Validator 120 include be coupled to memory 415,
The processing circuit 410 of reservoir 420 and network interface 440.In embodiment, validator 120 may include optical character identification
(OCR) processor 430.In another embodiment, the component of validator 120 can communicatedly be connected by bus 450.
Processing circuit 410 can be implemented as one or more hardware logic components and circuit.Such as rather than limit, can make
The exemplary types of hardware logic component include field programmable gate array (field programmable gate
Array, FPGA), specific integrated circuit (application-specific integrated circuit, ASIC), dedicated mark
Quasi- product (Application-specific standard products, ASSP), system level chip system (system-on-
A-chip system, SOC), general purpose microprocessor, microcontroller, digital signal processor (digital signal
Processor, DSP) etc., or it is able to carry out other any hardware logic components of calculating or other information processing.
Memory 415 can be volatibility (such as RAM), non-volatile (such as ROM, flash memory) or combinations thereof.At one
In configuration, the computer-readable instruction for executing one or more embodiments as described herein is stored in reservoir 420.
In another embodiment, memory 415 is configured to storage software.Software is broadly interpreted that any type of finger
Enable, no matter refer to software, firmware, middleware, microcode, hardware description language or other.Instruction may include code (example
Such as, source code format, binary code form, executable code format or other any code formats appropriate).When by one
Or multiple processors, when executing the instruction, the instruction is so that processing circuit 410 executes various processes described herein.Specifically,
As discussed herein, instruction when being executed, makes processing circuit 410 be based on electronic document and generates merging data.
Reservoir 420 can be magnetic reservoir, optical storage device etc., and may be embodied as such as flash memory or other storages
Technology, CD-ROM, digital versatile disc (DVD) or other any media that can be used in storing desired information.
Reservoir 420, which can also be stored, generates first number to the analysis of non-structured data based on OCR processor 430
According to.In another embodiment, reservoir 420 can also be stored based on metadata inquiry generated.
OCR processor 430 can include but is not limited to be configured to identify mode in non-structured data set, feature or
The feature and/or pattern recognition processor (recognition processor, RP) 435 of both.Specifically, implement one
In example, OCR processor 430 is configured at least identify the character in non-structured data.It can use identified character wound
It builds including the data set for data needed for checking request.
Network interface 440 allow validator 120 and business system 130, database 140, user equipment 150 or combinations thereof into
Row communication notifies for example to receive electronic document, to send, searches for electronic document, storing data etc..
It should be appreciated that embodiment as described herein is not limited to specific structure shown in Fig. 4, and the disclosure is not being departed from
Other structures can be equally used in the case where the range of embodiment.
It should be noted that the various implementations about the proof electronic document for verifying single deals match of discussion described herein
Example, it is only for simplify purpose rather than the limitation to the disclosed embodiments.The case where not departing from the scope of the present disclosure
Under, it can serially or parallelly verify the multiple transaction indicated in report electronic document.As non-limiting example, report electricity
Subfile can be the report on expenses of the multiple transaction of instruction that are being made by employee.
Implementable various embodiments disclosed herein is hardware, firmware, software or any combination thereof.In addition, software is preferred
Ground be embodied as visibly realizing on program storage unit (PSU) or the computer-readable medium that is made of component on or certain equipment
And/or the combination of equipment.Application program can upload to the machine including any suitable architecture and be executed by it.Preferably, should
Machine is in the computer platform with such as one or more central processing unit (" CPU "), memory and input/output structure
Upper implementation.Computer platform can also include operating system and micro-instruction code.Various processes and function described herein can be with
It is a part of micro-instruction code either application program, or is any combination of them, can be executed by CPU, nothing
By whether explicitly showing such computer or processor.In addition, various other peripheral cells may be coupled to computer
Platform, such as additional-data storage unit and print unit.In addition, non-transitory computer-readable medium is to propagate letter except temporary
Any computer-readable medium except number.
All examples as described herein and conditional statement are intended for instructing purpose, to help reader to understand disclosed reality
The principle and inventor of applying example promote the concept that this field is contributed, and should be understood as not to it is such specifically quote show
Example and condition make limitation.In addition, record herein the principle of embodiment of the disclosure, aspect and embodiment and its specifically show
All statements of example, it is intended to including its structural and functional equivalent.In addition, such equivalent includes currently known etc.
Jljl and in the future exploitation equivalent, that is, exploitation execution identical function any element, but regardless of structure how.
Claims (17)
1. the method for verifying non-structured Enterprise Resources Plan data, comprising:
The first electronic document is analyzed to determine at least one parameter transaction of transaction, wherein the first electronic document includes at least partly
Non-structured data;
For transaction creation template, wherein the template be include at least one identified parameter transaction structuring data
Collection;
In enterprise resource planning, based on matched second electronic document of template search created;And
When finding matched second electronic document, the transaction is verified.
2. according to the method described in claim 1, wherein determining at least one parameter transaction further include:
At least one critical field and at least one value are identified in the first electronic document;
Based on the first electronic document, create data set, wherein the data set created include at least one described critical field and
At least one described value;And
Created data set is analyzed, wherein determining at least one described parameter transaction based on analysis.
3. according to the method described in claim 2, wherein identifying at least one described critical field and at least one described value, also
Include:
The first electronic document is analyzed to determine the data in the first electronic document;And
Based on the list of predefined critical field, at least part of identified data is extracted, wherein identified data
At least part matched at least one critical field in predefined critical field list.
4. according to the method described in claim 3, wherein analyzing the first electronic document, further includes:
Optical character identification is executed to the first electronic document.
5. according to the method described in claim 1, further include:
Based on the template generation inquiry created, wherein described search includes inquiring corporate resources using inquiry generated
Planning system.
6. according to the method described in claim 1, further include:
Based on template, the metadata for transaction is generated, wherein inquiry is generated based on the metadata, wherein second electronics
File is associated with metadata, wherein the metadata of matched second electronic document with it is more than predefined threshold value generated
Metadata matches.
7. according to the method described in claim 1, wherein drawing template establishment further include:
Ambiguity elimination is carried out at least partly non-structured data.
8. according to the method described in claim 1, further include:
When not finding matched second electronic document in enterprise resource planning, matched second is searched in the database
Electronic document;And
When finding the second electronic document of matching in the database, the second electronic document is stored in enterprise resource planning
In.
9. a kind of non-transitory computer-readable medium is stored thereon with instruction, for executing one or more processing units
For verifying the program of non-structured Enterprise Resources Plan data, described program includes:
The first electronic document is analyzed to determine at least one parameter transaction of transaction, wherein the first electronic document includes at least partly
Non-structured data;
For transaction creation template, wherein the template is the structured data sets for including at least one identified parameter transaction;
In enterprise resource planning, matched second electronic document is searched based on the template created;And
When finding the second electronic document of matching, the transaction is verified.
10. the system for verifying non-structured Enterprise Resources Plan data, comprising:
Processing circuit;And
Memory, the memory includes instruction, when executing described instruction by processing circuit, the system configuration are as follows:
The first electronic document is analyzed to determine at least one parameter transaction of transaction, wherein the first electronic document includes at least partly
Non-structured data;
For transaction creation template, wherein the template be include at least one identified parameter transaction structuring data
Collection;
In enterprise resource planning, matched second electronic document is searched based on the template created;And
When finding the second electronic document of matching, the transaction is verified.
11. system according to claim 10, wherein the system is additionally configured to:
At least one critical field and at least one value are identified in the first electronic document;
Based on the first electronic document, create data set, wherein the data set created include at least one described critical field and
At least one described value;And
Created data set is analyzed, wherein determining at least one described parameter transaction based on analysis.
12. system according to claim 11, wherein the system is additionally configured to:
The first electronic document is analyzed to determine the data in the first electronic document;And
Based on the list of predefined critical field, at least part of identified data is extracted, wherein identified data
At least part matched at least one critical field in predefined critical field list.
13. according to the method for claim 12, wherein the system is additionally configured to:
Optical character identification is executed to the first electronic document.
14. according to the method described in claim 10, wherein the system is additionally configured to:
Based on the template generation inquiry created, wherein described search includes inquiring corporate resources using inquiry generated
Planning system.
15. according to the method described in claim 10, wherein the system is additionally configured to:
Based on template, the metadata for transaction is generated, wherein inquiry is generated based on the metadata, wherein second electronics
File is associated with metadata, wherein the metadata of matched second electronic document with it is more than predefined threshold value generated
Metadata matches.
16. according to the method described in claim 10, wherein drawing template establishment further include:
Ambiguity elimination is carried out at least partly non-structured data.
17. according to the method described in claim 10, wherein the system is additionally configured to:
When not finding matched second electronic document in enterprise resource planning, matched second is searched in the database
Electronic document, and
When finding matched second electronic document in the database, the second electronic document is stored in enterprise resource planning
In.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662405921P | 2016-10-09 | 2016-10-09 | |
US62/405,921 | 2016-10-09 | ||
US15/361,934 | 2016-11-28 | ||
US15/361,934 US20170154385A1 (en) | 2015-11-29 | 2016-11-28 | System and method for automatic validation |
PCT/US2017/055135 WO2018067698A1 (en) | 2016-10-09 | 2017-10-04 | System and method for verifying unstructured enterprise resource planning data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110023970A true CN110023970A (en) | 2019-07-16 |
Family
ID=61832191
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201780071509.7A Pending CN110023970A (en) | 2016-10-09 | 2017-10-04 | System and method for verifying non-structured Enterprise Resources Plan data |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP3523771A4 (en) |
CN (1) | CN110023970A (en) |
WO (1) | WO2018067698A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023279037A1 (en) * | 2021-06-30 | 2023-01-05 | Pricewaterhousecoopers Llp | Ai-augmented auditing platform including techniques for automated assessment of vouching evidence |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3225912B2 (en) * | 1998-01-08 | 2001-11-05 | 日本電気株式会社 | Information retrieval apparatus, method and recording medium |
US20030212617A1 (en) * | 2002-05-13 | 2003-11-13 | Stone James S. | Accounts payable process |
US20100161616A1 (en) * | 2008-12-16 | 2010-06-24 | Carol Mitchell | Systems and methods for coupling structured content with unstructured content |
US8774516B2 (en) * | 2009-02-10 | 2014-07-08 | Kofax, Inc. | Systems, methods and computer program products for determining document validity |
-
2017
- 2017-10-04 CN CN201780071509.7A patent/CN110023970A/en active Pending
- 2017-10-04 WO PCT/US2017/055135 patent/WO2018067698A1/en active Application Filing
- 2017-10-04 EP EP17859117.8A patent/EP3523771A4/en not_active Withdrawn
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023279037A1 (en) * | 2021-06-30 | 2023-01-05 | Pricewaterhousecoopers Llp | Ai-augmented auditing platform including techniques for automated assessment of vouching evidence |
Also Published As
Publication number | Publication date |
---|---|
EP3523771A1 (en) | 2019-08-14 |
WO2018067698A1 (en) | 2018-04-12 |
EP3523771A4 (en) | 2020-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10614527B2 (en) | System and method for automatic generation of reports based on electronic documents | |
CN108985912B (en) | Data reconciliation | |
US11138372B2 (en) | System and method for reporting based on electronic documents | |
US20170323006A1 (en) | System and method for providing analytics in real-time based on unstructured electronic documents | |
US20170169292A1 (en) | System and method for automatically verifying requests based on electronic documents | |
US20180011846A1 (en) | System and method for matching transaction electronic documents to evidencing electronic documents | |
EP3494495A1 (en) | System and method for completing electronic documents | |
CN110023970A (en) | System and method for verifying non-structured Enterprise Resources Plan data | |
US20180046663A1 (en) | System and method for completing electronic documents | |
US20170169518A1 (en) | System and method for automatically tagging electronic documents | |
US10558880B2 (en) | System and method for finding evidencing electronic documents based on unstructured data | |
US10387561B2 (en) | System and method for obtaining reissues of electronic documents lacking required data | |
US20180096435A1 (en) | System and method for verifying unstructured enterprise resource planning data | |
EP3494496A1 (en) | System and method for reporting based on electronic documents | |
US20180025224A1 (en) | System and method for identifying unclaimed electronic documents | |
CN109983489A (en) | Electronic document is proved based on non-structured data search | |
WO2017201012A1 (en) | Providing analytics in real-time based on unstructured electronic documents | |
CN108713198A (en) | Automatic checking request based on electronic document | |
US20200118122A1 (en) | Techniques for completing missing and obscured transaction data items | |
US20170323395A1 (en) | System and method for creating historical records based on unstructured electronic documents | |
EP3491554A1 (en) | Matching transaction electronic documents to evidencing electronic | |
WO2018027133A1 (en) | Obtaining reissues of electronic documents lacking required data | |
WO2017201013A1 (en) | System and method for creating historical records based on unstructured electronic documents | |
WO2017142624A1 (en) | System and method for automatically tagging electronic documents | |
CN109313765A (en) | The System and method for of automatic verifying transaction is carried out based on electronic document |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20190716 |
|
WD01 | Invention patent application deemed withdrawn after publication |