CN110362802A - Method, apparatus, computing device, and medium for entering document information into a system - Google Patents

Info

Publication number
CN110362802A
Authority
CN
China
Prior art keywords
field
document information
entry
information
document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910649744.9A
Other languages
Chinese (zh)
Inventor
张振师
冯强
刘英华
丛芝芳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Industrial and Commercial Bank of China Ltd ICBC
Original Assignee
Industrial and Commercial Bank of China Ltd ICBC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Industrial and Commercial Bank of China Ltd (ICBC)
Priority to CN201910649744.9A
Publication of CN110362802A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00: Handling natural language data
    • G06F40/10: Text processing
    • G06F40/166: Editing, e.g. inserting or deleting
    • G06F40/177: Editing, e.g. inserting or deleting of tables; using ruled lines
    • G06F40/18: Editing, e.g. inserting or deleting of tables; using ruled lines of spreadsheets
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00: Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40: Document-oriented image-based pattern recognition
    • G06V30/41: Analysis of document content
    • G06V30/413: Classification of content, e.g. text, photographs or tables
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00: Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10: Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Character Input (AREA)

Abstract

The present disclosure provides a method for entering document information into a system. The method comprises: obtaining a document image; processing the document image to obtain document information, the document information including a first field and a field value corresponding to the first field; obtaining a target page in the system, the target page including a second field and an entry region corresponding to the second field; obtaining an entry rule, the entry rule including a correspondence between the first field and the second field; and entering, based on the entry rule, the field value corresponding to the first field into the entry region. The present disclosure also provides an apparatus, a computing device, and a medium for entering document information into a system.

Description

Method, apparatus, computing device, and medium for entering document information into a system
Technical Field
The present disclosure relates to the field of computer technology, and more particularly to a method, an apparatus, a computing device, and a medium for entering document information into a system.
Background
At present, document information is generally entered into a system manually, field by field, and the entered information is then checked for correctness by manual verification. In practice, however, manual operation has many problems. First, manual entry is inefficient: as the number of documents accompanying each transaction grows, a single transaction may involve dozens or even hundreds of documents, so handling a transaction takes a great deal of time. Second, manual entry has low accuracy: omissions and erroneous entries occur frequently, and manual verification cannot detect entry errors in a timely manner.
Summary
One aspect of the present disclosure provides a method for entering document information into a system. The method comprises: obtaining a document image; processing the document image to obtain document information, the document information including a first field and a field value corresponding to the first field; obtaining a target page in the system, the target page including a second field and an entry region corresponding to the second field; obtaining an entry rule, the entry rule including a correspondence between the first field and the second field; and entering, based on the entry rule, the field value corresponding to the first field into the entry region.
Optionally, the method further comprises: exporting an entry result from the target page, the entry result including the second field and the field value in the entry region; comparing the entry result with the document information according to the entry rule to obtain a comparison result; in response to the comparison result indicating that the entry result is inconsistent with the document information, modifying the field value in the entry region based on the document information; and updating the entry rule based on the comparison result.
Optionally, processing the document image to obtain the document information comprises: determining a preset number of times for recognizing the document image; repeatedly recognizing the document image the preset number of times to obtain the preset number of initial recognition results; and processing the preset number of initial recognition results to obtain the document information.
Optionally, processing the preset number of initial recognition results to obtain the document information comprises: comparing the preset number of initial recognition results with one another to obtain difference information between the initial recognition results; and obtaining the document information based on the difference information and the initial recognition results.
Optionally, obtaining the document information based on the difference information and the initial recognition results comprises: determining, based on a preset rule, first target information in the difference information; determining the initial recognition result corresponding to the first target information as a target recognition result; and using the target recognition result as the document information.
Optionally, obtaining the document information based on the difference information and the initial recognition results comprises: processing the difference information to obtain second target information; and processing the initial recognition results based on the second target information to obtain the document information.
Optionally, the method further comprises: updating the preset rule based on the difference information.
Another aspect of the present disclosure provides an apparatus for entering document information into a system. The apparatus comprises a first obtaining module, a processing module, a second obtaining module, a third obtaining module, and an entry module. The first obtaining module obtains a document image. The processing module processes the document image to obtain document information, the document information including a first field and a field value corresponding to the first field. The second obtaining module obtains a target page in the system, the target page including a second field and an entry region corresponding to the second field. The third obtaining module obtains an entry rule, the entry rule including a correspondence between the first field and the second field. The entry module enters, based on the entry rule, the field value corresponding to the first field into the entry region.
Another aspect of the present disclosure provides a computing device comprising: one or more processors; and a memory for storing one or more programs, wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method described above.
Another aspect of the present disclosure provides a non-volatile readable storage medium storing computer-executable instructions that, when executed, implement the method described above.
Another aspect of the present disclosure provides a computer program comprising computer-executable instructions that, when executed, implement the method described above.
Brief Description of the Drawings
For a more complete understanding of the present disclosure and its advantages, reference is now made to the following description taken in conjunction with the accompanying drawings, in which:
Fig. 1 schematically illustrates an application scenario of the method and apparatus for entering document information into a system according to an embodiment of the present disclosure;
Fig. 2 schematically illustrates a flowchart of the method for entering document information into a system according to an embodiment of the present disclosure;
Fig. 3 schematically illustrates a flowchart of the method for entering document information into a system according to another embodiment of the present disclosure;
Fig. 4 schematically illustrates a flowchart of processing a document image according to an embodiment of the present disclosure;
Fig. 5 schematically illustrates a block diagram of the apparatus for entering document information into a system according to an embodiment of the present disclosure;
Fig. 6 schematically illustrates a block diagram of the apparatus for entering document information into a system according to another embodiment of the present disclosure;
Fig. 7 schematically illustrates a block diagram of the processing module according to an embodiment of the present disclosure; and
Fig. 8 schematically illustrates a block diagram of a computer system for entering document information into a system according to an embodiment of the present disclosure.
Detailed Description
Hereinafter, embodiments of the present disclosure are described with reference to the accompanying drawings. It should be understood, however, that these descriptions are merely exemplary and are not intended to limit the scope of the present disclosure. In the following detailed description, numerous specific details are set forth for ease of explanation, to provide a thorough understanding of the embodiments of the present disclosure. It will be apparent, however, that one or more embodiments may also be practiced without these specific details. In addition, descriptions of well-known structures and technologies are omitted below to avoid unnecessarily obscuring the concepts of the present disclosure.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to limit the present disclosure. The terms "include", "comprise", and the like, as used herein, indicate the presence of the stated features, steps, operations, and/or components, but do not preclude the presence or addition of one or more other features, steps, operations, or components.
All terms used herein (including technical and scientific terms) have the meanings commonly understood by those skilled in the art, unless otherwise defined. It should be noted that terms used herein should be interpreted as having meanings consistent with the context of this specification, and should not be interpreted in an idealized or overly rigid manner.
Where an expression similar to "at least one of A, B, and C, etc." is used, it should generally be interpreted in accordance with the meaning commonly understood by those skilled in the art (for example, "a system having at least one of A, B, and C" includes, but is not limited to, systems having A alone, B alone, C alone, A and B, A and C, B and C, and/or A, B, and C). Where an expression similar to "at least one of A, B, or C, etc." is used, it should likewise be interpreted in accordance with the meaning commonly understood by those skilled in the art (for example, "a system having at least one of A, B, or C" includes, but is not limited to, systems having A alone, B alone, C alone, A and B, A and C, B and C, and/or A, B, and C).
Some block diagrams and/or flowcharts are shown in the drawings. It should be understood that some blocks in the block diagrams and/or flowcharts, or combinations thereof, may be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general-purpose computer, a special-purpose computer, or another programmable control apparatus, so that the instructions, when executed by the processor, create means for implementing the functions/operations illustrated in the block diagrams and/or flowcharts.
Accordingly, the technology of the present disclosure may be implemented in the form of hardware and/or software (including firmware, microcode, etc.). In addition, the technology of the present disclosure may take the form of a computer program product on a computer-readable medium storing instructions, the computer program product being for use by or in connection with an instruction execution system. In the context of the present disclosure, a computer-readable medium may be any medium that can contain, store, communicate, propagate, or transport instructions. For example, a computer-readable medium may include, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium. Specific examples of computer-readable media include: magnetic storage devices, such as magnetic tape or hard disk drives (HDD); optical storage devices, such as compact discs (CD-ROM); memory, such as random access memory (RAM) or flash memory; and/or wired/wireless communication links.
Embodiments of the present disclosure provide a method for entering document information into a system. The method comprises: obtaining a document image; processing the document image to obtain document information, the document information including a first field and a field value corresponding to the first field; obtaining a target page in the system, the target page including a second field and an entry region corresponding to the second field; obtaining an entry rule, the entry rule including a correspondence between the first field and the second field; and entering, based on the entry rule, the field value corresponding to the first field into the entry region.
Fig. 1 schematically illustrates an application scenario of the method and apparatus for entering document information into a system according to an embodiment of the present disclosure. It should be noted that Fig. 1 is only an example of a scenario to which the embodiments of the present disclosure may be applied, intended to help those skilled in the art understand the technical content of the present disclosure; it does not mean that the embodiments of the present disclosure cannot be used in other devices, systems, environments, or scenarios.
As shown in Fig. 1, the application scenario 100 includes, for example, a document image 110 and a target page 120 of the system.
According to an embodiment of the present disclosure, documents may include, for example, export licenses, customs declarations, packing documents, and other related bills. Each time a transaction is completed, at least one document may be generated. To enter the document information into the system, the document image 110 of each document may be obtained, image recognition may be performed on the document image 110 to obtain the document information, and the document information may be entered automatically into the target page 120 of the system.
For example, taking a customs declaration as the document, after the document image 110 is obtained, image recognition may be performed on it to obtain document information such as: the operating unit is "XXX company", the total price is "100 yuan", and the import date is "2019-06-24". The obtained document information is then entered automatically into the target page 120 of the system.
Fig. 2 schematically illustrates a flowchart of the method for entering document information into a system according to an embodiment of the present disclosure.
As shown in Fig. 2, the method includes operations S210~S250.
In operation S210, a document image is obtained.
According to an embodiment of the present disclosure, the document image may be obtained in various ways, for example by photographing the document with a camera or by scanning the document with a scanner.
In operation S220, the document image is processed to obtain document information, the document information including a first field and a field value corresponding to the first field.
According to an embodiment of the present disclosure, the information in the document may be obtained, for example, by performing image recognition on the document image. The document information includes, for example, a plurality of first fields. Taking a customs declaration as the document, the first fields may include the operating unit, the total price, the import date, and so on. Each first field has a corresponding field value: for example, the field value corresponding to the operating unit is "XXX company", the field value corresponding to the total price is "100 yuan", and the field value corresponding to the import date is "2019-06-24".
In operation S230, a target page in the system is obtained, the target page including a second field and an entry region corresponding to the second field.
According to an embodiment of the present disclosure, the target page in the system may be, for example, a data form that includes a plurality of second fields, such as "operating unit", "amount", and "date". The target page further includes an entry region corresponding to each second field: for example, a region corresponding to "operating unit" for entering "XXX company", a region corresponding to "amount" for entering "100 yuan", and a region corresponding to "date" for entering "2019-06-24".
In operation S240, an entry rule is obtained, the entry rule including a correspondence between the first field and the second field.
According to an embodiment of the present disclosure, a first field and its corresponding second field may be worded differently. For example, the first field is "total price" and the corresponding second field is "amount": the two fields express the same meaning but differ in wording. Therefore, to enter the field value corresponding to the first field into the entry region corresponding to the second field, the correspondence between the first field and the second field must first be obtained.
In operation S250, the field value corresponding to the first field is entered into the entry region based on the entry rule.
According to an embodiment of the present disclosure, the field value corresponding to the first field may be entered into the entry region corresponding to the second field according to the correspondence between the first field and the second field. For example, "100 yuan" is entered into the entry region corresponding to the second field "amount".
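The rule-driven entry of operation S250 can be sketched as follows. This is a minimal illustration only, assuming the document information, the entry rule, and the target page are each represented as a plain dictionary; the function name `enter_fields` and this data model are hypothetical and are not specified by the disclosure.

```python
# Hypothetical data model: the disclosure does not prescribe one.
def enter_fields(document_info, entry_rule, target_page):
    """Copy each recognized field value into the entry region of its mapped second field."""
    for first_field, value in document_info.items():
        second_field = entry_rule.get(first_field)
        if second_field is None or second_field not in target_page:
            continue  # no rule or no matching entry region: leave untouched
        target_page[second_field] = value
    return target_page


# First fields and values recognized from a customs declaration.
document_info = {
    "operating unit": "XXX company",
    "total price": "100 yuan",
    "import date": "2019-06-24",
}
# Entry rule: first field -> second field (wording may differ).
entry_rule = {
    "operating unit": "operating unit",
    "total price": "amount",
    "import date": "date",
}
# Target page: second field -> entry region content (initially empty).
target_page = {"operating unit": "", "amount": "", "date": ""}

filled = enter_fields(document_info, entry_rule, target_page)
```

Keeping the rule as data rather than code means a changed correspondence only requires editing the mapping.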
In an embodiment of the present disclosure, the field value of a first field recognized from the document image sometimes cannot be entered directly into the entry region corresponding to the second field. In that case, the field value of the first field is converted, and the converted field value is entered into the entry region corresponding to the second field. For example, suppose the document includes a first field "type" whose field value is "usance letter of credit", and the first field "type" corresponds to the second field "type" in the system. Before the value is entered into the entry region corresponding to the second field "type", the field value "usance letter of credit" is converted to "usance", and "usance" is entered into that entry region.
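One way to picture this conversion step is a lookup table consulted before entry. The sketch below is an assumption for illustration: the table `VALUE_CONVERSIONS` and the function `convert_value` are hypothetical names, and the disclosure does not state how conversions are stored.

```python
# Hypothetical conversion table: (second field, recognized value) -> value the system accepts.
VALUE_CONVERSIONS = {
    ("type", "usance letter of credit"): "usance",
}


def convert_value(second_field, value, conversions=VALUE_CONVERSIONS):
    """Return the converted value if a conversion is defined, else the value unchanged."""
    return conversions.get((second_field, value), value)


converted = convert_value("type", "usance letter of credit")
```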
In an embodiment of the present disclosure, the entry rule may include, for example, a correspondence between one first field and a plurality of second fields. For example, the first field is "description of goods" and its field value is "30 goods, material is metal". This first field corresponds to a plurality of second fields, for example "quantity" and "material". In this case, the field value "30 goods, material is metal" is split into a plurality of field values, for example "30" and "metal"; the field value "30" is entered into the entry region corresponding to the second field "quantity", and the field value "metal" is entered into the entry region corresponding to the second field "material".
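The one-to-many case can be sketched with a small parser. This is an illustrative assumption, not the disclosed implementation: the function name and the regular expressions are hypothetical and only fit the English wording of the example value used here.

```python
import re


def split_goods_description(value):
    """Split a 'description of goods' value into values for 'quantity' and 'material'."""
    quantity = re.search(r"\d+", value)                # first run of digits
    material = re.search(r"material is (\w+)", value)  # word following 'material is'
    return {
        "quantity": quantity.group() if quantity else None,
        "material": material.group(1) if material else None,
    }


parts = split_goods_description("30 goods, material is metal")
```

A value that does not match either pattern yields `None` for that second field, so the caller can leave the region empty instead of entering a guess.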
In an embodiment of the present disclosure, the entry rule may also include, for example, a correspondence between a plurality of first fields and one second field. For example, the first fields include "buyer", "supplier", and "contract value"; the field value of "buyer" is "Company A", the field value of "supplier" is "Company B", and the field value of "contract value" is "100 yuan". These first fields correspond to a single second field, for example "contract notes". In this case, the field values of the first fields are combined to obtain the field value of the second field "contract notes". For example, combining "Company A", "Company B", and "100 yuan" yields "The contracting parties are Company A and Company B; the contract value is 100 yuan", and this combined field value is entered into the entry region corresponding to the second field "contract notes".
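The many-to-one case amounts to filling a template from several first fields. The sketch below assumes a fixed sentence template; the function name `combine_contract_notes` and the exact template wording are hypothetical.

```python
def combine_contract_notes(document_info):
    """Combine 'buyer', 'supplier', and 'contract value' into one 'contract notes' value."""
    return (
        f"The contracting parties are {document_info['buyer']} and "
        f"{document_info['supplier']}; the contract value is "
        f"{document_info['contract value']}"
    )


notes = combine_contract_notes(
    {"buyer": "Company A", "supplier": "Company B", "contract value": "100 yuan"}
)
```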
According to the technical solution of the embodiments of the present disclosure, a document image is obtained, image recognition is performed on it to obtain document information, and the document information is entered automatically according to the entry rule. This improves the efficiency of document entry, avoids the errors of manual entry, and reduces labor cost.
Fig. 3 schematically illustrates a flowchart of the method for entering document information into a system according to another embodiment of the present disclosure.
As shown in Fig. 3, the method includes operations S210~S250 and operations S310~S340. Operations S210~S250 are the same as or similar to the operations described above with reference to Fig. 2 and are not described again here.
In operation S310, an entry result is exported from the target page, the entry result including the second field and the field value in the entry region.
According to an embodiment of the present disclosure, because the document image may be unclear or the image recognition may contain errors, the document information entered into the target page may be wrong. Therefore, after the document information has been entered into the target page, the entry result in the target page may be exported so that it can be compared back against the document information to verify the accuracy of the entry. For example, taking the second field "amount", the entry result includes "amount" and the field value corresponding to it. If that field value is "2019-06-24", then the field value "2019-06-24" corresponding to "amount" is wrongly entered information in the target page.
In operation S320, the entry result is compared with the document information according to the entry rule to obtain a comparison result.
According to an embodiment of the present disclosure, the entry rule includes the correspondence between "total price" in the document information and "amount" in the target page. Therefore, after the entry result is exported, "2019-06-24" corresponding to "amount" in the entry result may be compared with "100 yuan" corresponding to "total price" in the document information.
In operation S330, in response to the comparison result indicating that the entry result is inconsistent with the document information, the field value in the entry region is modified based on the document information.
According to an embodiment of the present disclosure, when the comparison result is inconsistent, for example "2019-06-24" does not match "100 yuan", the "2019-06-24" in the entry region is modified based on the "100 yuan" in the document information; that is, "2019-06-24" in the entry region is changed to "100 yuan".
In operation S340, the entry rule is updated based on the comparison result.
According to an embodiment of the present disclosure, taking the second field "amount" as an example, the entry rule includes the correspondence between "total price" and "amount". An inconsistent comparison result may indicate that the correspondence between the first field and the second field used during entry was wrong, for example "import date" was mapped to "amount". In this case, the wrong entry rule may be updated, for example so that "total price" corresponds to "amount".
In the embodiments of the present disclosure, the entered field values are checked by comparing the entry result back against the document information. If an entered field value is found to be inconsistent with the field value in the document information, the entered field value is modified based on the field value in the document information and the corresponding entry rule is updated, so that entry accuracy improves step by step.
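Operations S310~S340 can be sketched as a compare-and-correct pass followed by a rule update. This is a minimal sketch under assumptions: dictionaries stand in for the exported entry result and the rule store, and the names `verify_and_correct` and `update_rule` are hypothetical. How the faulty mapping is identified is not detailed in the text, so `update_rule` simply takes the correct first field as an argument.

```python
def verify_and_correct(document_info, entry_rule, entered):
    """Compare the exported entry result with the document information (S320)
    and overwrite mismatching regions with the document value (S330).
    Returns {second_field: (entered_value, expected_value)} for each mismatch."""
    mismatches = {}
    for first_field, second_field in entry_rule.items():
        expected = document_info.get(first_field)
        actual = entered.get(second_field)
        if expected is not None and actual != expected:
            mismatches[second_field] = (actual, expected)
            entered[second_field] = expected
    return mismatches


def update_rule(rule, second_field, correct_first_field):
    """Return a copy of the rule in which only correct_first_field maps to second_field (S340)."""
    updated = {f: s for f, s in rule.items() if s != second_field}
    updated[correct_first_field] = second_field
    return updated


document_info = {"total price": "100 yuan", "import date": "2019-06-24"}
entered = {"amount": "2019-06-24"}  # wrongly entered value exported from the page
mismatches = verify_and_correct(document_info, {"total price": "amount"}, entered)

faulty_rule = {"import date": "amount"}  # mapping that caused the wrong entry
fixed_rule = update_rule(faulty_rule, "amount", "total price")
```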
Fig. 4 schematically illustrates a flowchart of processing a document image according to an embodiment of the present disclosure.
As shown in Fig. 4, operation S220 includes operations S221~S223.
In operation S221, a preset number of times for recognizing the document image is determined.
According to an embodiment of the present disclosure, to improve the accuracy of image recognition, image recognition may be performed multiple times on one document image. For example, the number of times recognition is repeated for one document image may be determined first; this number is, for example, a preset number N.
In operation S222, the document image is recognized repeatedly the preset number of times to obtain the preset number of initial recognition results. For example, one document image is recognized N times to obtain N initial recognition results.
In operation S223, the preset number of initial recognition results are processed to obtain the document information. For example, the N initial recognition results are compared with one another to obtain the difference information between them, and the document information is obtained based on the difference information and the initial recognition results. Specifically, obtaining the document information based on the difference information and the initial recognition results includes at least one of the following modes (1) and (2).
(1) preset rules are based on, determine the first object information in different information, determination is corresponding with first object information Initial recognition result is as target identification as a result, using target identification result as document information.
According to the embodiment of the present disclosure, preset rules for example may include the historical difference information being stored in image rule base And the corresponding modification mode of the historical difference information.For example, the preset rules may is that when N number of initial recognition result occurs When different information, using the initial recognition result to occupy the majority in N number of initial recognition result as document information.For example, when N is 3, Multiple initial recognition results include initial recognition result 1, initial recognition result 2, initial recognition result 3.When initial recognition result 1 It is consistent with the different information in initial recognition result 2, and the different information in initial recognition result 3 and initial recognition result 1 When inconsistent with the different information of initial recognition result 2, by the different information in initial recognition result 1 and initial recognition result 2 As first object information, and will be any in the corresponding initial recognition result 1 of the first object information and initial recognition result 2 One as target identification as a result, the target identification result can be used as document information.
(2) processing difference information obtains the second target information, is based on the second target information processing initial recognition result, obtains Document information.
According to the embodiment of the present disclosure, for example, when N is 3, when initial recognition result 1, initial recognition result 2, initial identification As a result 3 it is inconsistent when, respectively obtain initial recognition result 1, initial recognition result 2, the difference letter in initial recognition result 3 Breath.There are errors by taking initial recognition result 1 as an example, such as since document image has unclear or image recognition Situation, when obtaining the different information of initial recognition result 1 by image recognition are as follows: field values corresponding with the first field are " 10O member ", wherein " O " is letter.Then handling the different information and obtaining the second target information is " 100 yuan ", and based on this " 100 yuan " modification initial recognition results 1 of two target informations obtain document information, such as by " the 10O member " in initial recognition result 1 It is revised as " 100 yuan ".
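Modes (1) and (2) can be illustrated with a minimal Python sketch. It assumes each initial recognition result is a dictionary mapping a field name to a field value, and that all results share the same fields. The function names and the `CONFUSABLE` table are hypothetical; the disclosure only gives the letter "O" read in place of the digit "0" as an example, and it does not prescribe how the second target information is computed.

```python
from collections import Counter

# Hypothetical table of characters that recognition may confuse inside a
# numeric amount. The disclosure only mentions the letter "O" for digit "0".
CONFUSABLE = {"O": "0"}


def majority_value(values):
    """Mode (1): return the value held by a strict majority, else None."""
    value, count = Counter(values).most_common(1)[0]
    return value if count > len(values) / 2 else None


def repair_value(value):
    """Mode (2): replace a confusable character that sits next to a digit."""
    chars = list(value)
    for i, ch in enumerate(chars):
        near_digit = (i > 0 and value[i - 1].isdigit()) or (
            i + 1 < len(value) and value[i + 1].isdigit()
        )
        if ch in CONFUSABLE and near_digit:
            chars[i] = CONFUSABLE[ch]
    return "".join(chars)


def reconcile(results):
    """Merge N initial recognition results into one document information dict."""
    merged = {}
    for field in results[0]:
        values = [result[field] for result in results]
        winner = majority_value(values)
        # Fall back to repairing the first result when no majority exists.
        merged[field] = winner if winner is not None else repair_value(values[0])
    return merged
```

With three results of which two read "100 yuan" and one reads "10O yuan", `reconcile` keeps the majority value. When all three disagree, it falls back to repairing the first result, which is one possible choice among many.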
According to an embodiment of the present disclosure, the preset rule may also be updated based on the difference information. For example, the difference information and the way it was processed are stored in the image recognition library to update the library. For example, the difference information "10O yuan" and the modification of "10O yuan" to "100 yuan" are stored in the image recognition library, which enriches the difference information and processing modes held in the library and improves the accuracy of subsequent image recognition.
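The update described above can be sketched as a small lookup structure. This is only an illustration: `RuleLibrary` is a hypothetical in-memory stand-in for the image recognition library, whose real storage format the disclosure does not specify.

```python
class RuleLibrary:
    """Hypothetical in-memory stand-in for the image recognition library."""

    def __init__(self):
        # Maps misrecognized text to the text it should be modified to.
        self._fixes = {}

    def record(self, wrong, right):
        """Store difference information together with its processing mode."""
        self._fixes[wrong] = right

    def apply(self, value):
        """Return the stored correction for a value, or the value unchanged."""
        return self._fixes.get(value, value)


library = RuleLibrary()
library.record("10O yuan", "100 yuan")
```

Once the pair is recorded, a later recognition that yields "10O yuan" can be corrected by a direct lookup instead of being reprocessed.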
Fig. 5 schematically shows a block diagram of a device for entering document information into a system according to an embodiment of the present disclosure.
As shown in Fig. 5, the device 500 for entering document information into a system includes a first acquisition module 510, a processing module 520, a second acquisition module 530, a third acquisition module 540, and a recording module 550.
The first acquisition module 510 may be used to acquire a document image. According to an embodiment of the present disclosure, the first acquisition module 510 may, for example, perform operation S210 described above with reference to Fig. 2, which is not repeated here.
The processing module 520 may be used to process the document image to obtain document information, the document information including a first field and a field value corresponding to the first field. According to an embodiment of the present disclosure, the processing module 520 may, for example, perform operation S220 described above with reference to Fig. 2, which is not repeated here.
The second acquisition module 530 may be used to acquire a target page in the system, the target page including a second field and a typing region corresponding to the second field. According to an embodiment of the present disclosure, the second acquisition module 530 may, for example, perform operation S230 described above with reference to Fig. 2, which is not repeated here.
The third acquisition module 540 may be used to acquire a typing rule, the typing rule including the correspondence between the first field and the second field. According to an embodiment of the present disclosure, the third acquisition module 540 may, for example, perform operation S240 described above with reference to Fig. 2, which is not repeated here.
The recording module 550 may be used to enter the field value corresponding to the first field into the typing region based on the typing rule. According to an embodiment of the present disclosure, the recording module 550 may, for example, perform operation S250 described above with reference to Fig. 2, which is not repeated here.
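The cooperation of the five modules can be sketched in Python under simplifying assumptions: the target page is modelled as a dictionary from second field to the current content of its typing region, the typing rule as a dictionary from first field to second field, and image recognition as a caller-supplied function. All names here are hypothetical and are not taken from the disclosure.

```python
def enter_document(image, recognize, target_page, typing_rule):
    """Sketch of operations S210 to S250.

    recognize:   callable mapping an image to {first_field: field_value}
    target_page: {second_field: current content of its typing region}
    typing_rule: {first_field: second_field}
    """
    document_info = recognize(image)   # processing module (S220)
    filled = dict(target_page)         # copy so the caller's page is untouched
    for first_field, value in document_info.items():
        second_field = typing_rule.get(first_field)
        # Only fill typing regions that actually exist on the target page.
        if second_field in filled:
            filled[second_field] = value
    return filled
```

Fields of the document that have no counterpart in the typing rule are skipped rather than raising an error, which is a design choice of this sketch rather than something the disclosure states.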
Fig. 6 schematically shows a block diagram of a device for entering document information into a system according to another embodiment of the present disclosure.
As shown in Fig. 6, the device 600 for entering document information into a system includes a first acquisition module 510, a processing module 520, a second acquisition module 530, a third acquisition module 540, a recording module 550, an output module 610, a comparison module 620, a modification module 630, and a first update module 640. The first acquisition module 510, the processing module 520, the second acquisition module 530, the third acquisition module 540, and the recording module 550 are the same as or similar to the modules described above with reference to Fig. 5, and are not described again here.
The output module 610 may be used to output the input result in the target page, the input result including the second field and the field value in the typing region. According to an embodiment of the present disclosure, the output module 610 may, for example, perform operation S310 described above with reference to Fig. 3, which is not repeated here.
The comparison module 620 may be used to compare the input result with the document information according to the typing rule to obtain a comparison result. According to an embodiment of the present disclosure, the comparison module 620 may, for example, perform operation S320 described above with reference to Fig. 3, which is not repeated here.
The modification module 630 may be used to modify the field value in the typing region based on the document information, in response to the comparison result indicating that the input result is inconsistent with the document information. According to an embodiment of the present disclosure, the modification module 630 may, for example, perform operation S330 described above with reference to Fig. 3, which is not repeated here.
The first update module 640 may be used to update the typing rule based on the comparison result. According to an embodiment of the present disclosure, the first update module 640 may, for example, perform operation S340 described above with reference to Fig. 3, which is not repeated here.
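The comparison and modification steps can be sketched as follows, again modelling the input result and the document information as dictionaries and the typing rule as a mapping from first field to second field. The function name and return shape are assumptions of this sketch. How the typing rule itself is updated in operation S340 is not detailed in this passage, so the sketch only returns the mismatched fields that such an update could draw on.

```python
def verify_and_correct(input_result, document_info, typing_rule):
    """Sketch of S320 and S330: compare the input result, fix mismatches.

    Returns the corrected input result and the list of second fields whose
    typing region did not match the document information.
    """
    corrected = dict(input_result)
    mismatches = []
    for first_field, second_field in typing_rule.items():
        expected = document_info.get(first_field)
        if expected is not None and corrected.get(second_field) != expected:
            mismatches.append(second_field)
            corrected[second_field] = expected   # modify the typing region
    return corrected, mismatches
```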
Fig. 7 schematically shows a block diagram of the processing module according to an embodiment of the present disclosure.
As shown in Fig. 7, the processing module 520 includes a determination submodule 521, a recognition submodule 522, and a processing submodule 523.
The determination submodule 521 may be used to determine a preset number used for recognizing the document image. According to an embodiment of the present disclosure, the determination submodule 521 may, for example, perform operation S221 described above with reference to Fig. 4, which is not repeated here.
The recognition submodule 522 may be used to repeatedly recognize the document image according to the preset number, obtaining the preset number of initial recognition results. According to an embodiment of the present disclosure, the recognition submodule 522 may, for example, perform operation S222 described above with reference to Fig. 4, which is not repeated here.
The processing submodule 523 may be used to process the preset number of initial recognition results to obtain the document information. According to an embodiment of the present disclosure, the processing submodule 523 may, for example, perform operation S223 described above with reference to Fig. 4, which is not repeated here.
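Repeated recognition and the first step of processing, finding where the results differ, can be sketched as below. The recognizer is passed in as a callable because the disclosure does not name a recognition engine; `recognize_once`, the default of 3, and the dictionary shape of each result are assumptions of this sketch.

```python
def recognize_repeatedly(image, recognize_once, preset_number=3):
    """S221 and S222: run recognition preset_number times on the same image."""
    if preset_number < 1:
        raise ValueError("preset_number must be at least 1")
    return [recognize_once(image) for _ in range(preset_number)]


def difference_fields(results):
    """Start of S223: list the fields whose values differ across results."""
    return sorted(
        field
        for field in results[0]
        if len({result.get(field) for result in results}) > 1
    )
```

The fields returned by `difference_fields` correspond to the difference information; fields on which all results agree need no further processing.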
According to an embodiment of the present disclosure, processing the preset number of initial recognition results to obtain the document information includes: comparing the preset number of initial recognition results with one another to obtain difference information between the initial recognition results; and obtaining the document information based on the difference information and the initial recognition results.
According to an embodiment of the present disclosure, obtaining the document information based on the difference information and the initial recognition results includes: determining, based on a preset rule, first target information in the difference information; determining the initial recognition result corresponding to the first target information as a target recognition result; and using the target recognition result as the document information.
According to an embodiment of the present disclosure, obtaining the document information based on the difference information and the initial recognition results includes: processing the difference information to obtain second target information; and processing the initial recognition results based on the second target information to obtain the document information.
According to an embodiment of the present disclosure, the device for entering document information into a system further includes a second update module, which is used to update the preset rule based on the difference information.
According to embodiments of the present disclosure, any number of the modules, submodules, units, and subunits, or at least part of the functions of any number of them, may be implemented in one module. Any one or more of the modules, submodules, units, and subunits according to embodiments of the present disclosure may be split into multiple modules. Any one or more of the modules, submodules, units, and subunits according to embodiments of the present disclosure may be implemented at least partially as a hardware circuit, such as a field programmable gate array (FPGA), a programmable logic array (PLA), a system on chip, a system on substrate, a system in package, or an application-specific integrated circuit (ASIC); or may be implemented by hardware or firmware in any other reasonable manner of integrating or packaging a circuit; or may be implemented in any one of the three forms of software, hardware, and firmware, or in an appropriate combination of any of them. Alternatively, one or more of the modules, submodules, units, and subunits according to embodiments of the present disclosure may be implemented at least partially as a computer program module which, when run, performs the corresponding function.
For example, any number of the first acquisition module 510, the processing module 520, the second acquisition module 530, the third acquisition module 540, the recording module 550, the output module 610, the comparison module 620, the modification module 630, the first update module 640, the determination submodule 521, the recognition submodule 522, and the processing submodule 523 may be combined and implemented in one module, or any one of them may be split into multiple modules. Alternatively, at least part of the functions of one or more of these modules may be combined with at least part of the functions of other modules and implemented in one module. According to an embodiment of the present disclosure, at least one of the first acquisition module 510, the processing module 520, the second acquisition module 530, the third acquisition module 540, the recording module 550, the output module 610, the comparison module 620, the modification module 630, the first update module 640, the determination submodule 521, the recognition submodule 522, and the processing submodule 523 may be implemented at least partially as a hardware circuit, such as a field programmable gate array (FPGA), a programmable logic array (PLA), a system on chip, a system on substrate, a system in package, or an application-specific integrated circuit (ASIC); or may be implemented by hardware or firmware in any other reasonable manner of integrating or packaging a circuit; or may be implemented in any one of the three forms of software, hardware, and firmware, or in an appropriate combination of any of them.
Alternatively, at least one of the first acquisition module 510, the processing module 520, the second acquisition module 530, the third acquisition module 540, the recording module 550, the output module 610, the comparison module 620, the modification module 630, the first update module 640, the determination submodule 521, the recognition submodule 522, and the processing submodule 523 may be implemented at least partially as a computer program module which, when run, performs the corresponding function.
Fig. 8 schematically shows a block diagram of a computer system for entering document information into a system according to an embodiment of the present disclosure. The computer system shown in Fig. 8 is only an example and should not impose any limitation on the functions or scope of use of the embodiments of the present disclosure.
As shown in Fig. 8, the computer system 800 includes a processor 801 and a computer-readable storage medium 802. The system 800 can perform the method according to the embodiments of the present disclosure.
Specifically, the processor 801 may include, for example, a general-purpose microprocessor, an instruction set processor and/or a related chipset, and/or a special-purpose microprocessor (for example, an application-specific integrated circuit (ASIC)), and so on. The processor 801 may also include onboard memory for caching purposes. The processor 801 may be a single processing unit or multiple processing units for performing the different actions of the method flow according to the embodiments of the present disclosure.
The computer-readable storage medium 802 may be, for example, any medium that can contain, store, communicate, propagate, or transport instructions. For example, the readable storage medium may include, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium. Specific examples of the readable storage medium include: a magnetic storage device, such as a magnetic tape or a hard disk (HDD); an optical storage device, such as an optical disc (CD-ROM); a memory, such as a random access memory (RAM) or a flash memory; and/or a wired/wireless communication link.
The computer-readable storage medium 802 may include a computer program 803. The computer program 803 may include code/computer-executable instructions which, when executed by the processor 801, cause the processor 801 to perform the method according to the embodiments of the present disclosure or any variation thereof.
The computer program 803 may be configured to have computer program code that includes, for example, computer program modules. For example, in an exemplary embodiment, the code in the computer program 803 may include one or more program modules, for example module 803A, module 803B, and so on. It should be noted that the division and number of the modules are not fixed. Those skilled in the art may use suitable program modules or combinations of program modules according to the actual situation. When these program modules or combinations are executed by the processor 801, the processor 801 performs the method according to the embodiments of the present disclosure or any variation thereof.
According to an embodiment of the present disclosure, at least one of the first acquisition module 510, the processing module 520, the second acquisition module 530, the third acquisition module 540, the recording module 550, the output module 610, the comparison module 620, the modification module 630, the first update module 640, the determination submodule 521, the recognition submodule 522, and the processing submodule 523 may be implemented as a computer program module described with reference to Fig. 8 which, when executed by the processor 801, implements the corresponding operations described above.
The present disclosure also provides a computer-readable medium. The computer-readable medium may be included in the equipment/device/system described in the above embodiments, or it may exist alone without being assembled into that equipment/device/system. The computer-readable medium carries one or more programs which, when executed, implement the above method.
According to an embodiment of the present disclosure, the computer-readable medium may be a computer-readable signal medium, a computer-readable storage medium, or any combination of the two. The computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples of the computer-readable storage medium may include, but are not limited to: an electrical connection with one or more wires, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program, and the program may be used by or in connection with an instruction execution system, apparatus, or device. In the present disclosure, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. Such a propagated data signal may take various forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. The computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, and that computer-readable medium can send, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. The program code contained on the computer-readable medium may be transmitted over any suitable medium, including but not limited to wireless, wired, optical cable, radio frequency signals, and the like, or any suitable combination of the above.
The flowcharts and block diagrams in the accompanying drawings illustrate the architecture, functions, and operations of possible implementations of systems, methods, and computer program products according to various embodiments of the present disclosure. In this regard, each box in a flowchart or block diagram may represent a module, a program segment, or a part of code, and that module, program segment, or part of code contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions marked in the boxes may occur in an order different from that marked in the drawings. For example, two boxes shown in succession may in fact be executed substantially in parallel, and they may sometimes be executed in the reverse order, depending on the functions involved. It should also be noted that each box in a block diagram or flowchart, and any combination of boxes in a block diagram or flowchart, may be implemented by a dedicated hardware-based system that performs the specified functions or operations, or by a combination of dedicated hardware and computer instructions.
Those skilled in the art will understand that the features recited in the various embodiments and/or claims of the present disclosure may be combined and/or incorporated in various ways, even if such combinations or incorporations are not expressly recited in the present disclosure. In particular, the features recited in the various embodiments and/or claims of the present disclosure may be combined and/or incorporated in various ways without departing from the spirit and teaching of the present disclosure. All such combinations and/or incorporations fall within the scope of the present disclosure.
Although the present disclosure has been shown and described with reference to certain exemplary embodiments thereof, those skilled in the art should understand that various changes in form and detail may be made to the present disclosure without departing from the spirit and scope of the present disclosure as defined by the appended claims and their equivalents. Therefore, the scope of the present disclosure should not be limited to the above embodiments, but should be determined not only by the appended claims but also by their equivalents.

Claims (10)

1. A method for entering document information into a system, the method comprising:
acquiring a document image;
processing the document image to obtain document information, the document information including a first field and a field value corresponding to the first field;
acquiring a target page in the system, the target page including a second field and a typing region corresponding to the second field;
acquiring a typing rule, the typing rule including the correspondence between the first field and the second field; and
entering, based on the typing rule, the field value corresponding to the first field into the typing region.
2. The method according to claim 1, further comprising:
outputting the input result in the target page, the input result including the second field and the field value in the typing region;
comparing the input result with the document information according to the typing rule to obtain a comparison result;
in response to the comparison result indicating that the input result is inconsistent with the document information, modifying the field value in the typing region based on the document information; and
updating the typing rule based on the comparison result.
3. The method according to claim 1, wherein processing the document image to obtain the document information comprises:
determining a preset number used for recognizing the document image;
repeatedly recognizing the document image according to the preset number to obtain the preset number of initial recognition results; and
processing the preset number of initial recognition results to obtain the document information.
4. The method according to claim 3, wherein processing the preset number of initial recognition results to obtain the document information comprises:
comparing the preset number of initial recognition results with one another to obtain difference information between the initial recognition results; and
obtaining the document information based on the difference information and the initial recognition results.
5. The method according to claim 4, wherein obtaining the document information based on the difference information and the initial recognition results comprises:
determining, based on a preset rule, first target information in the difference information;
determining the initial recognition result corresponding to the first target information as a target recognition result; and
using the target recognition result as the document information.
6. The method according to claim 5, wherein obtaining the document information based on the difference information and the initial recognition results comprises:
processing the difference information to obtain second target information; and
processing the initial recognition results based on the second target information to obtain the document information.
7. The method according to claim 6, further comprising:
updating the preset rule based on the difference information.
8. A device for entering document information into a system, the device comprising:
a first acquisition module, which acquires a document image;
a processing module, which processes the document image to obtain document information, the document information including a first field and a field value corresponding to the first field;
a second acquisition module, which acquires a target page in the system, the target page including a second field and a typing region corresponding to the second field;
a third acquisition module, which acquires a typing rule, the typing rule including the correspondence between the first field and the second field; and
a recording module, which enters, based on the typing rule, the field value corresponding to the first field into the typing region.
9. A computing device, comprising:
one or more processors; and
a memory for storing one or more programs,
wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method according to any one of claims 1 to 7.
10. A computer-readable storage medium storing computer-executable instructions which, when executed, implement the method according to any one of claims 1 to 7.
CN201910649744.9A 2019-07-18 2019-07-18 For by the method, apparatus of document information input system, calculate equipment, medium Pending CN110362802A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910649744.9A CN110362802A (en) 2019-07-18 2019-07-18 For by the method, apparatus of document information input system, calculate equipment, medium

Publications (1)

Publication Number Publication Date
CN110362802A true CN110362802A (en) 2019-10-22

Family

ID=68220086

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910649744.9A Pending CN110362802A (en) 2019-07-18 2019-07-18 For by the method, apparatus of document information input system, calculate equipment, medium

Country Status (1)

Country Link
CN (1) CN110362802A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090226090A1 (en) * 2008-03-06 2009-09-10 Okita Kunio Information processing system, information processing apparatus, information processing method, and storage medium
CN107688772A (en) * 2017-06-23 2018-02-13 平安科技(深圳)有限公司 Method, apparatus, computer equipment and the storage medium of policy information typing
CN109558440A (en) * 2018-10-18 2019-04-02 平安科技(深圳)有限公司 Batch data processing method, device, computer equipment and storage medium
CN109829444A (en) * 2019-02-28 2019-05-31 广州达安临床检验中心有限公司 Document input method, device, computer equipment and storage medium

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112861497A (en) * 2019-11-27 2021-05-28 贝壳技术有限公司 Contract template generation method and system
CN111180082A (en) * 2019-12-30 2020-05-19 泰康保险集团股份有限公司 Medical information system data initialization method, system, device and storage medium
CN111767818A (en) * 2020-06-23 2020-10-13 北京思特奇信息技术股份有限公司 Method and device for intelligently accepting service
CN111767818B (en) * 2020-06-23 2024-04-26 北京思特奇信息技术股份有限公司 Method and device for intelligently accepting business
CN112967121A (en) * 2021-02-04 2021-06-15 金蝶软件(中国)有限公司 Image retrieval method and device and computer storage medium
CN113296613A (en) * 2021-03-12 2021-08-24 阿里巴巴新加坡控股有限公司 Customs clearance information processing method and device and electronic equipment
CN113553826A (en) * 2021-06-17 2021-10-26 北京来也网络科技有限公司 Information input method and device combining RPA and AI and electronic equipment
CN113449496A (en) * 2021-06-25 2021-09-28 北京京东振世信息技术有限公司 Method and device for automatically generating maintenance document
CN113449496B (en) * 2021-06-25 2024-05-17 北京京东振世信息技术有限公司 Method and device for automatically generating maintenance bill

Similar Documents

Publication Publication Date Title
CN110362802A (en) For by the method, apparatus of document information input system, calculate equipment, medium
KR102263985B1 (en) Method and system for providing validated, auditable, and immutable inputs to a smart contract
CN108171276B (en) Method and apparatus for generating information
CN113792159A (en) Knowledge graph data fusion method and system
US20220114821A1 (en) Methods, systems, articles of manufacture and apparatus to categorize image text
CN107644286A (en) Workflow processing method and device
CN108805637A (en) Auto-associating bin simultaneously confirms point method and apparatus broadcast
CN108228463A (en) For detecting the method and apparatus of initial screen time
CN111562965B (en) Page data verification method and device based on decision tree
CN110166463A (en) A kind of message transmissions conversion method and device
US20220051140A1 (en) Model creation method, model creation apparatus, and program
CN107315729A (en) For the data processing method of chart, medium, device and computing device
CN109299096A (en) A kind of processing method of pipelined data, device and equipment
CN109948762A (en) Method and apparatus for generating two dimensional code
US11520620B2 (en) Electronic device and non-transitory storage medium implementing test path coordination method
CN109614327A (en) Method and apparatus for output information
CN108694194A (en) A kind of method and apparatus of construction data object
CN108416408A (en) Methods, devices and systems for asset management
CN108334335A (en) A kind of software source code version determines method and device
CN116863116A (en) Image recognition method, device, equipment and medium based on artificial intelligence
CN113380363B (en) Medical data quality evaluation method and system based on artificial intelligence
CN109657073A (en) Method and apparatus for generating information
CN104423964A (en) Method and system used for determining visualization credibility
CN114219310A (en) Order auditing method, system, electronic equipment and storage medium
CN114721943A (en) Method and device for determining test range

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination