CN103870826B - The method and system that a kind of electronic record scanning recognition is filed - Google Patents

The method and system that a kind of electronic record scanning recognition is filed Download PDF

Info

Publication number
CN103870826B
CN103870826B CN201410125970.4A CN201410125970A CN103870826B CN 103870826 B CN103870826 B CN 103870826B CN 201410125970 A CN201410125970 A CN 201410125970A CN 103870826 B CN103870826 B CN 103870826B
Authority
CN
China
Prior art keywords
map file
electronic record
scanning
filed
ocr
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410125970.4A
Other languages
Chinese (zh)
Other versions
CN103870826A (en
Inventor
鲁淳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Travel Polytron Technologies Inc
Original Assignee
Shenzhen Travel Polytron Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Travel Polytron Technologies Inc filed Critical Shenzhen Travel Polytron Technologies Inc
Priority to CN201410125970.4A priority Critical patent/CN103870826B/en
Publication of CN103870826A publication Critical patent/CN103870826A/en
Application granted granted Critical
Publication of CN103870826B publication Critical patent/CN103870826B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The present invention provides a kind of method that electronic record scanning recognition is filed, including:Step 1, scanning files compress the assigned catalogue to assigned catalogue;Compressed package is transferred to by map file server by procotol after the completion of step 2, compression;Step 3, map file server decompress the compressed file received automatically, and carry out OCR identifications to the map file after decompression, are automatically associated to map file in corresponding archives catalog by rule by extracting the text information in map file.The present invention also provides the system that a kind of electronic record scanning recognition is filed.The method and system that a kind of electronic record scanning recognition provided by the present invention is filed, realize a key operation, paper document is converted into electronic record, the efficiency that more traditional file scanning uploads the steps such as filing is substantially improved, by integrated OCR pictographs identification technology, associating for electronic record and Business Entity is realized, without human users, human cost is substantially reduced, building time is reduced.

Description

The method and system that a kind of electronic record scanning recognition is filed
Technical field
The present invention relates to automatic officeization field, the method that more particularly to a kind of quick scanning recognition of electronic record is filed And system.
Background technology
At present, application systems software is by development for many years, and many electronic archive systems occurs in industry, is directed to solving paper Matter archives are more, cumbersome, and it is difficult to consult, and borrow difficulty, the problem of security difficult to govern control.But traditional electronic archive system, is required for one The individual process that paper document is converted into electronic document, generally requires to put into huge human cost, and wastes time and energy, easily Error.Electronic record later stage cost in use has often been even more than the cost of Software Construction.
It is therefore desirable to propose a kind of new mode, on traditional archives economy, quick electronic record scanning is realized The method filed is recognized, so as to realize with minimum cost, quickly realizes that archives of paper quality is converted into the function of electronic record.
The content of the invention
It is an object of the invention to provide the method and system that a kind of electronic record scanning recognition is filed, realize that a key is grasped Make, paper document is converted into electronic record, the efficiency that more traditional file scanning uploads the steps such as filing is substantially improved, passes through Integrated OCR(Optical Character Recognition, optical character identification)Pictograph identification technology, realizes electronics Archives are associated with Business Entity, without human users, substantially reduce human cost, reduce building time.
To solve above technical problem, the present invention provides a kind of method that electronic record scanning recognition is filed, including:
Step 1, scanning files compress the assigned catalogue to assigned catalogue;
Compressed package is transferred to by map file server by procotol after the completion of step 2, compression;
Step 3, map file server decompress the compressed file received automatically, and carry out OCR knowledges to the map file after decompression Not, map file is automatically associated in corresponding archives catalog by rule by extracting the text information in map file.
Further, the step 1 is specifically included:
The browser of step 1.1, startup with ActiveX plug-in units;
Step 1.2, ActiveX plug-in units control scanner scanning files, and the electronic record completed will be scanned Store assigned catalogue;
Step 1.3, the ActiveX plug-in units compress the assigned catalogue automatically after the completion of the scanning of whole files.
Further, the step 3 is specifically included:
Step 3.1, map file server decompress the compressed file received automatically;
Step 3.2, the text information carried out to map file in OCR Text regions, extraction map file;
The map file is automatically associated to corresponding archives catalog by the text information that step 3.3, basis are extracted by rule In.
Further, the rule is:The electronic record title arrived by OCR Text regions, with current archives catalog Title is compared, and character is identical then to assert Current electronic Archiveownership to the archives catalog.
To solve above technical problem, the present invention also provides the system that a kind of electronic record scanning recognition is filed, including:Visitor Family machine, scanner, interchanger, map file server, wherein:
The client computer, including the browser with ActiveX plug-in units, the ActiveX plug-in units can control scanner to sweep Files are retouched, the electronic record storage completed will be scanned and referred to assigned catalogue, and after the completion of whole archives scans to described Determine catalogue to be compressed;
The scanner, for the ActiveX plug-in unit instruction scan files according to client computer;
The interchanger, for compressed package to be transferred into map file server by procotol;
The map file server set decompresses the compressed file received automatically into OCR Text regions, and to decompression after Map file carries out OCR identifications, and corresponding archives are automatically associated to by rule by extracting the text information in map file by drawing files In catalogue.
Further, the rule is:The electronic record title arrived by OCR Text regions, with current archives catalog mark Topic is compared, and character is identical then to assert Current electronic Archiveownership to the archives catalog.
Compared with conventional art, the present invention provides the method and system that a kind of electronic record scanning recognition is filed, and can pass through Common IE browser, realizes a key operation, directly manipulates scanner, and paper document batch scanning is converted into electronic record, Server can be automatically uploaded to after scanning, manually selecting file without user is uploaded.Pressure can be taken during upload The technology of contracting, is compressed into zip bags by electronic record automatically, realizes and uploads, so that maximized raising network performance.Upload to clothes The file system of business device is decompressed automatically, then by server set into OCR(Optical Character Recognition, Optical character identification)Picture character identification technology, the Text region on picture is come out, so as to be automatically associated to corresponding archives Under catalogue, associating for electronic record and Business Entity is realized by the scanning of system, realization is filed.Whole process be it is complete, Continuously, full process automatization is completed, and is not required to human users, substantially reduces human cost, reduces building time.
Brief description of the drawings
Accompanying drawing described herein is used for providing a further understanding of the present invention, constitutes the part of the present invention, this hair Bright schematic description and description is used to explain the present invention, does not constitute inappropriate limitation of the present invention.In the accompanying drawings:
Fig. 1 is the flow chart that the present invention provides a kind of method that electronic record scanning recognition is filed.
Fig. 2 is the physical schematic that the present invention provides the system that a kind of electronic record scanning recognition is filed.
Embodiment
In order that technical problems, technical solutions and advantages to be solved are clearer, clear, tie below Drawings and examples are closed, the present invention will be described in further detail.It should be appreciated that specific embodiment described herein is only used To explain the present invention, it is not intended to limit the present invention.
As shown in figure 1, the present invention provides a kind of method that electronic record scanning recognition is filed, including:
Step 1, scan files and be automatically stored to assigned catalogue, and compress the assigned catalogue;
Step 1.1, startup scanning:The present invention is that basic browser is scanned, and user opens browser, phase relation of entering System webpage, clicks scan button, starts scanning.
Step 1.2, scanning files to assigned catalogue:
System controls scanner, instruction scan instrument scanning files using ActiveX plug-in units, and can will scan what is completed Assigned catalogue is arrived in electronic record storage.
The assigned catalogue is compressed automatically after the completion of step 1.3, scanning:System ActiveX plug-in units are in whole archives scans After the completion of trigger event the assigned catalogue is compressed.
Compressed package is transferred to by map file server by procotol after the completion of step 2, compression;
Step 3, map file server are decompressed automatically to the compressed file received, and carry out OCR to the map file after decompression Map file, is automatically associated in corresponding archives catalog by identification by extracting the text information in map file by rule, is realized quick Filing.
Step 3.1, map file server decompress the compressed file received automatically;
Step 3.2, the text information carried out to the map file after decompression in OCR identifications, extraction map file:Map file server set into OCR Text regions carry out Text region to map file, extract the text information in map file.
Map file is automatically associated in corresponding archives catalog by the text information that step 3.3, basis are extracted by rule.
Wherein, the rule is as follows:The electronic record title arrived by OCR Text regions, the mark with current archives catalog Topic is compared, and character is identical then to assert Current electronic Archiveownership to the archives catalog.Here, described " character is identical " not It is required that character is identical, when character match degree is in certain proportion(This ratio can be configured, and such as 80%)Or more phase With then it is considered that character is identical.For example, the title of archives catalog is " enterprise's business license ", but OCR identifications scan what is come Heading character is " Mode of Enterprises in Shenzhen business license ", although both are not completely the same, but overall consistent, and also will be considered that both is word Symbol is identical.
As shown in Fig. 2 the present invention provides the system that a kind of electronic record scanning recognition is filed, including:Client computer 10, scanning Instrument 20, interchanger 30, map file server 40, wherein:
The client computer 10, including the browser with ActiveX plug-in units, the ActiveX plug-in units can control scanner, Instruction scan instrument scans files, and the electronic record storage for scanning completion is arrived into assigned catalogue, and complete in whole archives scans The assigned catalogue is compressed into rear.
The scanner 20, for the ActiveX plug-in unit instruction scan files according to client computer 10.
The interchanger 30, for the compressed package after the completion of compression to be transferred into map file server 40 by procotol;
The integrated OCR Text regions of map file server 40, are decompressed automatically for the compressed file to receiving, and right Map file after decompression carries out OCR identifications, by extract the text information in map file map file is automatically associated to by rule it is corresponding In archives catalog.
Describe the implementation of the present invention in detail below in conjunction with specific case study on implementation, the present invention how should whereby Practical business is solved the problems, such as with technological means.
In the case study on implementation of the present invention, certain corporate history is managed by certain tax bureau and paid taxes exemplified by archives, need to be by taxpayer's The original paper paper archives such as business license, contract, status of a legal person card, inventory of paying taxes are converted into electronic record and are put in storage filing.
A kind of method that quick scanning recognition of electronic record is filed is provided according to the present invention, including:
Archives of paper quality data is put on scanner by the first step, layer Info person, can random order discharge.
Second step:IE browser is opened, login system inquires the client that pays taxes, click scan button.
3rd step:Subsequent step is automatically performed by system entirely.
a)ActiveX plug-in units can direct access scan instrument, transmission instruction, control scanner progress batch scanning operation.
b)The file of scanning is automatically stored to client computer assigned catalogue, after the completion of scanning, by electronic record compressing file Into zip bags.
c)Map file server is transferred to by procotol.
d)Map file server is received and decompressed after file.
e)To every part of map file, OCR pictograph identifications are carried out, the meeting of identity card class is referred under identity card class, contract class Meeting be referred under contract class, business license class can be referred under business license class.
f)After the completion of processing, user is pointed out to operate successfully.
The present invention provides the method and system that a kind of electronic record scanning recognition is filed, can by common IE browser, A key operation is realized, scanner is directly manipulated, paper document batch scanning is converted into electronic record, can be uploaded automatically after scanning To server, manually select file without user and uploaded.The technology of compression can be taken during upload, automatically by electricity Sub-file is compressed into zip bags, realizes and uploads, so that maximized raising network performance.The file system uploaded onto the server is certainly Dynamic decompression, is then come out the Text region on picture into OCR picture character identification technologies by server set, so that from It is dynamic to be associated with corresponding archives catalog, associating for electronic record and Business Entity is realized by the scanning of system, realization is filed. Whole process is complete, continuous, and full process automatization is completed, and is not required to human users, is substantially reduced human cost, reduction is built If the time.
A preferred embodiment of the present invention has shown and described in described above, but as previously described, it should be understood that the present invention Be not limited to form disclosed herein, be not to be taken as the exclusion to other embodiment, and available for various other combinations, Modification and environment, and above-mentioned teaching or the technology or knowledge of association area can be passed through in invention contemplated scope described herein It is modified., then all should be in this hair and the change and change that those skilled in the art are carried out do not depart from the spirit and scope of the present invention In the protection domain of bright appended claims.

Claims (2)

1. a kind of method that electronic record scanning recognition is filed, it is characterised in that including:
Step 1, scanning files compress the assigned catalogue to assigned catalogue;Including:Step 1.1, startup have The browser of ActiveX plug-in units;Step 1.2, ActiveX plug-in units control scanner scanning files, and will scan through Into electronic record storage arrive assigned catalogue;Step 1.3, the ActiveX plug-in units whole files scanning after the completion of from The dynamic pressure contracting assigned catalogue;
Compressed package is transferred to by map file server by procotol after the completion of step 2, compression;
Step 3, map file server set decompress the compressed file received automatically into OCR Text regions, and to the figure after decompression Shelves carry out OCR identifications, are automatically associated to map file in corresponding archives catalog by rule by extracting the text information in map file, The rule is:The electronic record title arrived by OCR Text regions, is compared, character with the title of current archives catalog It is identical then to assert Current electronic Archiveownership to the archives catalog.
2. the system that a kind of electronic record scanning recognition is filed, it is characterised in that including:Client computer, scanner, interchanger, figure Shelves server, wherein:
The client computer, including the browser with ActiveX plug-in units, the ActiveX plug-in units can control scanner scanning shelves Case file, will scan the electronic record storage completed to assigned catalogue, and to the specified mesh after the completion of whole archives scans Record is compressed;
The scanner, for the ActiveX plug-in unit instruction scan files according to client computer;
The interchanger, for compressed package to be transferred into map file server by procotol;
The map file server set decompresses the compressed file received automatically into OCR Text regions, and to the map file after decompression OCR identifications are carried out, corresponding archives catalog is automatically associated to by rule by extracting the text information in map file by drawing files In, the rule is the electronic record title arrived by OCR Text regions, is compared with the title of current archives catalog, word Symbol is identical then to assert Current electronic Archiveownership to the archives catalog.
CN201410125970.4A 2014-03-31 2014-03-31 The method and system that a kind of electronic record scanning recognition is filed Active CN103870826B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410125970.4A CN103870826B (en) 2014-03-31 2014-03-31 The method and system that a kind of electronic record scanning recognition is filed

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410125970.4A CN103870826B (en) 2014-03-31 2014-03-31 The method and system that a kind of electronic record scanning recognition is filed

Publications (2)

Publication Number Publication Date
CN103870826A CN103870826A (en) 2014-06-18
CN103870826B true CN103870826B (en) 2017-10-13

Family

ID=50909342

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410125970.4A Active CN103870826B (en) 2014-03-31 2014-03-31 The method and system that a kind of electronic record scanning recognition is filed

Country Status (1)

Country Link
CN (1) CN103870826B (en)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104217290A (en) * 2014-09-01 2014-12-17 南通北城科技创业管理有限公司 An archive management system
CN105512197A (en) * 2015-11-27 2016-04-20 广州宝钢南方贸易有限公司 Digitized archiving device of documents and archiving and searching device thereof
CN106874278A (en) * 2015-12-11 2017-06-20 中国电信股份有限公司 A kind of online method for previewing, device, system and cloud storage platform
CN105654273A (en) * 2015-12-29 2016-06-08 中国科学院信息工程研究所 Barcode technology-based electronic document management system and method
CN105760554A (en) * 2016-03-31 2016-07-13 华律网络科技(武汉)有限公司 Automatic filing system and method for lawsuit electronic files
CN106156325A (en) * 2016-07-05 2016-11-23 浪潮软件集团有限公司 File comparison method and device
TWI665632B (en) * 2017-07-18 2019-07-11 兆豐國際商業銀行股份有限公司 Picture uploading system
CN107809555A (en) * 2017-09-22 2018-03-16 苏州大成有方数据科技有限公司 A kind of high security company information includes filing system
CN107809553A (en) * 2017-09-22 2018-03-16 苏州大成有方数据科技有限公司 It is a kind of can be with the information management system of automatic arranging file
CN107809554A (en) * 2017-09-22 2018-03-16 苏州大成有方数据科技有限公司 A kind of efficiently company information includes filing system
CN109962958B (en) * 2017-12-26 2022-05-03 阿里巴巴(中国)有限公司 Document processing method and device
CN108681597A (en) * 2018-05-18 2018-10-19 苏州吉成智能科技有限公司 Archive management method
CN108984670A (en) * 2018-06-29 2018-12-11 郑州中博奥信息技术有限公司 A kind of method of cross-platform electronic record batch mounting
CN109359878B (en) * 2018-10-26 2021-02-02 珠海市时杰信息科技有限公司 Archive data processing method, computer device and computer readable storage medium
CN109658062A (en) * 2018-12-13 2019-04-19 广州华资软件技术有限公司 A kind of electronic record intelligent processing method based on deep learning
CN110232046A (en) * 2019-05-27 2019-09-13 武汉市润普网络科技有限公司 A kind of electronics folder is with case production method
CN110233996A (en) * 2019-06-25 2019-09-13 艺生活(北京)电子商务有限公司 The true true evidence collection method of the art work
CN110555410B (en) * 2019-09-04 2021-11-02 青岛大学 Automatic paper file digitalizing method
CN110674091A (en) * 2019-09-30 2020-01-10 深圳前海环融联易信息科技服务有限公司 File uploading method and system based on artificial intelligence and storage medium
CN111126952A (en) * 2019-12-16 2020-05-08 深圳供电局有限公司 Electronic file filing processing system and method
CN112836073A (en) * 2021-02-02 2021-05-25 嘉应学院 Historical literature digitization method, system, device and storage medium
CN114363472A (en) * 2021-12-10 2022-04-15 航天信息股份有限公司 Method and device for realizing image acquisition based on http protocol adaptive scanner
CN113947389B (en) * 2021-12-20 2022-04-22 佛山众陶联供应链服务有限公司 Digitization method and digitization system for balance sheet of ceramic supply chain system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201570028U (en) * 2009-12-16 2010-09-01 东莞市万维网络科技信息有限公司 System for electronic file filing management
CN102299953A (en) * 2011-06-24 2011-12-28 浪潮齐鲁软件产业有限公司 Image acquisition cloud processing method for taxpayer data
CN203093298U (en) * 2012-09-05 2013-07-31 浙江华丽达包装有限公司 Paper delivery mechanism of offset printing press

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008083856A (en) * 2006-09-26 2008-04-10 Toshiba Corp Information processor, information processing method and information processing program
CN103093298B (en) * 2012-06-18 2015-12-02 北京航星永志科技有限公司 The multi version digital archives management of a kind of image or image file and application process

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201570028U (en) * 2009-12-16 2010-09-01 东莞市万维网络科技信息有限公司 System for electronic file filing management
CN102299953A (en) * 2011-06-24 2011-12-28 浪潮齐鲁软件产业有限公司 Image acquisition cloud processing method for taxpayer data
CN203093298U (en) * 2012-09-05 2013-07-31 浙江华丽达包装有限公司 Paper delivery mechanism of offset printing press

Also Published As

Publication number Publication date
CN103870826A (en) 2014-06-18

Similar Documents

Publication Publication Date Title
CN103870826B (en) The method and system that a kind of electronic record scanning recognition is filed
US7930226B1 (en) User-driven document-based data collection
CN107665233A (en) Database data processing method, device, computer equipment and storage medium
CN110490721B (en) Financial voucher generating method and related product
US20210279667A1 (en) Method and computer readable storage medium for agent matching in remote interview signature
AU2017302245B2 (en) Optical character recognition utilizing hashed templates
US11743216B2 (en) Digital file recognition and deposit system
EP2668571A1 (en) Document workflow architecture
US20150278248A1 (en) Personal Information Management Service System
US20180107686A1 (en) Search method and apparatus
KR20210047350A (en) Attendance management system, method and electronic device
CN106412360A (en) Apparatus and method for applying settings, and computer-readable storage medium for computer program
CN112560411A (en) Intelligent personnel information input method and system
US20140029854A1 (en) Metadata supersets for matching images
CN116383693A (en) Data issuing method based on data security automatic classification grading result
JP2019124981A (en) Cooperation system, information processing apparatus, information registration method and program
CN111598128B (en) Control state identification and control method, device, equipment and medium of user interface
US20150086122A1 (en) Image processing system, image processing method, and medium
EP3188036B1 (en) A method and a system for providing an extract document
WO2021081705A1 (en) Method and device for payment platform management, payment platform, and computer storage medium
JP6899123B2 (en) Business card management program, business card management system, business card management server
US20220076208A1 (en) Methods and systems for processing training records and documents of employees
CN115019325A (en) Service processing method and device based on image recognition and storage medium
JP2014215983A (en) Information processing device, condition display method, and program
CN107239534B (en) Bar code scanning method, bar code scanning device, mobile terminal and computer readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 518000 Guangdong city of Shenzhen province Nanshan District South Road seven No. 002 Shenzhen Digital Technology Park B1 building 6 floor A District No. 1

Applicant after: Shenzhen travel Polytron Technologies Inc

Address before: 518000 Guangdong city of Shenzhen province Nanshan District South Road seven No. 002 Shenzhen Digital Technology Park B1 building 6 floor A District No. 1

Applicant before: Shenzhen Vispractice Technology Corporation

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant