CN108985187A - Method for automatic quality inspection of digital archives through self-verification - Google Patents
- Publication number
- CN108985187A (application number CN201810680619.XA)
- Authority
- CN
- China
- Prior art keywords
- data
- verification
- quality inspection
- item
- picture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Multimedia (AREA)
- General Factory Administration (AREA)
Abstract
The invention discloses a method for automatic quality inspection of digital archives through self-verification, comprising the following steps: S1: set the data storage format; S2: set the verification parameters and rules; S3: select the batch of archives to be inspected; S4: verify; S5: generate the analysis log; S6: quality inspection; S7: software system detection; S8: data judgment; S9: generate data. Using the configured data storage format, the verification rules for archive data content items, and the verification parameters, the invention verifies archive data items with respect to text correctness, data duplication, picture integrity, and picture similarity, and automatically generates a detailed analysis log for abnormal data found during verification. The configuration is flexible and accommodates different types of data verification. The process is fully automatic and requires no manual participation, so data can be checked comprehensively rather than sampled, large volumes of data can be handled, and both the efficiency and the quality of inspection improve.
Description
Technical field
The present invention relates to the technical field of digital archive quality inspection, and specifically to a method for automatic quality inspection of digital archives through self-verification.
Background
The output of archive digitization generally consists of archive catalog data and content data. Quality checks on this output generally take the "Technical Specification for Digitization of Paper Archives" (DA/T 31-2005) as the main inspection standard, and manual inspection has always been the main inspection method. Long-term experience in checking digitization work has revealed many deficiencies in manual inspection. Attempts to address them by adding staff, improving inspectors' skills, and strengthening management have raised the quality of the final output somewhat, but never with marked effect, and the drawbacks of manual inspection have become increasingly prominent. Analysis shows that the most prominent problems are the following:
First, the error rate is high. For example, when checking catalog data, inspectors find it hard to spot wrongly written characters, which undermines the standardization and accuracy required of the digitization output. Likewise, when checking scanned result files it is hard to detect missing pages, and when checking the association between catalog entries and computer files it is hard to identify errors, so the completeness of the digitization output cannot be guaranteed.
Second, comprehensive inspection cannot be guaranteed. Relying solely on the conscientiousness of inspectors easily leads to missed checks. There is no assurance that every piece of material and every mandatory catalog data item has been inspected, which lowers the credibility of the entire digitization output.
Third, efficiency is low. For example, a page count check requires first recounting the pages of each piece of material and computing the total, then counting the total pages in the scanned result files (a single-page file counts as one page, while for a multi-page file the file's total page count must be obtained), and finally checking the page count registered in the catalog in advance. Only when the three page counts agree exactly is the page count information considered correct. This one check alone takes a great deal of time, and it demands intense concentration throughout to keep the page counts accurate; otherwise even more effort goes into rework.
Fourth, manual inspection cannot cope with massive data volumes. As archive informatization develops, more and more archives are digitized and the resulting output is enormous, easily reaching tens of millions of items. Manual inspection alone cannot check all of it, so the output can only be checked by sampling. According to the data acceptance requirements of the "Technical Specification for Digitization of Paper Archives" (DA/T 31-2005), the sampling ratio should generally be no less than 5%. If inspection is carried out at a 5% ratio, then even if every sampled item passes, the remaining 95% of the data has never been checked, and its quality is hard to guarantee.
Summary of the invention
The purpose of the present invention is to provide a method for automatic quality inspection of digital archives through self-verification. The method is flexibly configurable, accommodates different types of data verification, is fully automatic and needs no manual participation, checks data comprehensively rather than by sampling, and can handle massive data volumes, improving inspection efficiency and quality and thereby solving the problems raised in the background above.
To achieve this purpose, the invention provides the following technical solution: a method for automatic quality inspection of digital archives through self-verification, comprising the following steps:
S1: Set the data storage format: the data produced by digitization is stored in an Excel spreadsheet, and the scanned pictures of each piece of material are stored in a folder;
S2: Set the verification parameters and rules: add detection libraries according to the inspection requirements and provide a detection settings interface in which users can add and delete check items as needed;
S3: Select the batch of archives to be inspected: select the IP address, port information, and database user information of the client's production database to connect to, which is used to detect whether the archive data under inspection duplicates data already stored in that database; at the same time select the inspection paths to specify the basic data of the archives under inspection and the scanned picture data; finally set the output path for the quality inspection report and for the analysis log of abnormal conditions;
S4: Verify: according to the verification rules and parameters, verify the archives with respect to text, data items, and scanned pictures respectively;
S5: Generate the analysis log: generate a detailed analysis log of the abnormal data found during verification, for use in checking and repairing the data;
S6: Quality inspection: once the configuration and data of the steps through S5 are complete, start the quality inspection work;
S7: Software system detection: for the items detected automatically by the software system, compare the MD5 and SHA1 codes computed from each scanned picture with the MD5 and SHA1 codes recorded when scanning was completed, to judge whether the picture is complete and whether it has been modified;
S8: Data judgment: based on the material category number, page count, and numbering data in the material data, the software system checks whether every scanned file corresponding to each record of the digitization output actually exists;
S9: Generate data: generate the data quality inspection report from the inspection data.
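The existence check of step S8 can be pictured with a minimal Python sketch. The patent does not state how scanned files are named, so the folder layout used here (`<root>/<material number>/<page>.jpg` with three-digit page numbers) and the function name `missing_scans` are assumptions made purely for illustration.

```python
from pathlib import Path


def missing_scans(root, materials, suffix=".jpg"):
    """List expected page files that are absent under `root`.

    `materials` maps a material number to its registered page count.
    Assumed (hypothetical) layout: <root>/<material number>/<page:03d><suffix>
    """
    root = Path(root)
    missing = []
    for number, pages in materials.items():
        for page in range(1, pages + 1):
            expected = root / number / f"{page:03d}{suffix}"
            if not expected.is_file():
                # Report paths relative to the root, with forward slashes.
                missing.append(expected.relative_to(root).as_posix())
    return missing
```

An empty result means every registered page has a corresponding file; any returned path could be written to the analysis log as an abnormal item.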
Preferably, the detection library in step S2 contains at least one group of detection options, and each detection option comprises a verification rule and verification parameters.
Preferably, the verification in step S4 covers text correctness, data duplication, picture integrity, and picture similarity.
Preferably, during the quality inspection of step S6, the inspection items include archive information items, material information items, picture information items, catalog information items, and category information items, and the corresponding inspection parameters are set for each item according to the detection rules required.
Preferably, the detection rules include ID card number verification, date verification, integer verification, function verification, and regular-expression verification.
Preferably, the automatic detection by the software system in step S7 includes detection of picture quality, resolution, and file size.
Technical effects and advantages of the invention: compared with the prior art, the method proposed by the invention for automatic quality inspection of digital archives through self-verification has the following advantages:
Using a self-defined standard data storage format, verification rules for archive data content items, and verification parameters, the invention verifies archive data items with respect to text correctness, data duplication, picture integrity, and picture similarity, and automatically generates a detailed analysis log for abnormal data found during verification. The configuration is flexible and accommodates different types of data verification. The process is fully automatic and requires no manual participation, so data can be checked comprehensively rather than sampled, massive data volumes can be handled, and both the efficiency and the quality of inspection improve.
Brief description of the drawings
Fig. 1 is a process flow chart of the invention.
Detailed description
The technical solutions in the embodiments of the invention are described clearly and completely below with reference to the drawings. The described embodiments are only some of the embodiments of the invention, not all of them. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the invention.
Referring to Fig. 1, the invention provides a technical solution: a method for automatic quality inspection of digital archives through self-verification, comprising the following steps:
S1: Set the data storage format: the data produced by digitization is stored in an Excel spreadsheet, and the scanned pictures of each piece of material are stored in a folder;
S2: Set the verification parameters and rules: add detection libraries according to the inspection requirements and provide a detection settings interface in which users can add and delete check items as needed;
S3: Select the batch of archives to be inspected: select the IP address, port information, and database user information of the client's production database to connect to, which is used to detect whether the archive data under inspection duplicates data already stored in that database; at the same time select the inspection paths to specify the basic data of the archives under inspection and the scanned picture data; finally set the output path for the quality inspection report and for the analysis log of abnormal conditions;
S4: Verify: according to the verification rules and parameters, verify the archives with respect to text, data items, and scanned pictures respectively;
S5: Generate the analysis log: generate a detailed analysis log of the abnormal data found during verification, for use in checking and repairing the data;
S6: Quality inspection: once the configuration and data of the steps through S5 are complete, start the quality inspection work;
S7: Software system detection: for the items detected automatically by the software system, compare the MD5 and SHA1 codes computed from each scanned picture with the MD5 and SHA1 codes recorded when scanning was completed, to judge whether the picture is complete and whether it has been modified;
S8: Data judgment: based on the material category number, page count, and numbering data in the material data, the software system checks whether every scanned file corresponding to each record of the digitization output actually exists;
S9: Generate data: generate the data quality inspection report from the inspection data.
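The digest comparison in step S7 can be illustrated with a short Python sketch. It assumes that the MD5 and SHA1 values recorded when scanning finished are available as hexadecimal strings; the function names `file_digests` and `is_unmodified` are hypothetical and do not come from the patent.

```python
import hashlib


def file_digests(path, chunk_size=1 << 20):
    """Return (md5_hex, sha1_hex) for the file at `path`, read in chunks."""
    md5 = hashlib.md5()
    sha1 = hashlib.sha1()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large scans need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
            sha1.update(chunk)
    return md5.hexdigest(), sha1.hexdigest()


def is_unmodified(path, recorded_md5, recorded_sha1):
    """True only if both current digests match those recorded at scan time."""
    md5_hex, sha1_hex = file_digests(path)
    return (md5_hex == recorded_md5.lower()
            and sha1_hex == recorded_sha1.lower())
```

A mismatch in either digest indicates that the file is truncated, corrupted, or altered since scanning; the sketch does not try to distinguish between those cases.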
Specifically, the detection library in step S2 contains at least one group of detection options, and each detection option comprises a verification rule and verification parameters.
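One way to picture a detection library is as a collection of options, each pairing a rule with its parameters. The following Python sketch is an assumption about structure rather than the patent's implementation; `DetectionOption`, `DetectionLibrary`, `in_range`, and the sample field name `pages` are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class DetectionOption:
    """One check item: a verification rule plus the parameters it runs with."""
    name: str
    field_name: str                      # data item the rule applies to
    rule: Callable[..., bool]            # returns True when the value passes
    params: Dict[str, Any] = field(default_factory=dict)


@dataclass
class DetectionLibrary:
    """A configurable set of check items that users can add to or delete from."""
    options: List[DetectionOption] = field(default_factory=list)

    def add(self, option: DetectionOption) -> None:
        self.options.append(option)

    def delete(self, name: str) -> None:
        self.options = [o for o in self.options if o.name != name]

    def check(self, record: Dict[str, Any]) -> List[str]:
        """Return the names of all check items that the record fails."""
        return [o.name for o in self.options
                if not o.rule(record.get(o.field_name), **o.params)]


def in_range(value, low, high):
    """Example rule: value must be an integer within [low, high]."""
    return isinstance(value, int) and low <= value <= high


# Usage: register one option, then check a record against the library.
library = DetectionLibrary()
library.add(DetectionOption("page_count_range", "pages", in_range,
                            {"low": 1, "high": 500}))
failed = library.check({"pages": 0})
```

Keeping the rule and its parameters together in one option is what lets the same rule be reused on different data items with different limits.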
Specifically, the verification in step S4 covers text correctness, data duplication, picture integrity, and picture similarity.
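Of these four checks, data duplication is the simplest to sketch. The Python example below assumes each record carries a unique key such as an archive number, and that the keys already held in the production database have been fetched beforehand; the function name `find_duplicates` and the sample keys are hypothetical.

```python
from collections import Counter


def find_duplicates(batch_keys, stored_keys=()):
    """Return (keys repeated within the batch, keys already in storage)."""
    counts = Counter(batch_keys)
    within_batch = sorted(k for k, n in counts.items() if n > 1)
    stored = set(stored_keys)
    already_stored = sorted(k for k in counts if k in stored)
    return within_batch, already_stored


# Usage: "A-1" appears twice in the batch, "B-7" is already stored.
repeated, existing = find_duplicates(["A-1", "A-2", "A-1", "B-7"],
                                     ["B-7", "C-3"])
```

Picture similarity would need an image-processing library and is left out of this sketch.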
Specifically, during the quality inspection of step S6, the inspection items include archive information items, material information items, picture information items, catalog information items, and category information items, and the corresponding inspection parameters are set for each item according to the detection rules required.
Specifically, the detection rules include ID card number verification, date verification, integer verification, function verification, and regular-expression verification.
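Several of these rule types can be sketched as small Python validators. The patent does not define them, so everything here is an assumption: the ID card rule assumes the 18-character Chinese resident identity number with its mod-11 check character (GB 11643-1999) and does not validate the embedded region or birth date, and the function names and default date format are hypothetical. A function rule could be any user-supplied callable that returns True or False.

```python
import re
from datetime import datetime

_ID_WEIGHTS = (7, 9, 10, 5, 8, 4, 2, 1, 6, 3, 7, 9, 10, 5, 8, 4, 2)
_ID_CHECK_CHARS = "10X98765432"


def check_id_card(value):
    """18-character ID: 17 digits followed by a mod-11 check character."""
    if not isinstance(value, str):
        return False
    if not re.fullmatch(r"[0-9]{17}[0-9Xx]", value):
        return False
    total = sum(int(d) * w for d, w in zip(value[:17], _ID_WEIGHTS))
    return _ID_CHECK_CHARS[total % 11] == value[17].upper()


def check_date(value, fmt="%Y-%m-%d"):
    """True if `value` parses as a real calendar date in format `fmt`."""
    try:
        datetime.strptime(value, fmt)
        return True
    except (TypeError, ValueError):
        return False


def check_integer(value, minimum=None, maximum=None):
    """True if `value` is an integer (or integer text) within optional bounds."""
    try:
        number = int(str(value).strip())
    except ValueError:
        return False
    if minimum is not None and number < minimum:
        return False
    if maximum is not None and number > maximum:
        return False
    return True


def check_regex(value, pattern):
    """True if the whole of `value` matches the regular expression `pattern`."""
    return isinstance(value, str) and re.fullmatch(pattern, value) is not None
```

Each validator returns a plain boolean so that it can be plugged into a check item together with its parameters.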
Specifically, the automatic detection by the software system in step S7 includes detection of picture quality, resolution, and file size.
In summary, using a self-defined standard data storage format, verification rules for archive data content items, and verification parameters, the invention verifies archive data items with respect to text correctness, data duplication, picture integrity, and picture similarity, and automatically generates a detailed analysis log for abnormal data found during verification. The configuration is flexible and accommodates different types of data verification. The process is fully automatic and requires no manual participation, so data can be checked comprehensively rather than sampled, massive data volumes can be handled, and both the efficiency and the quality of inspection improve.
Although embodiments of the invention have been shown and described, those of ordinary skill in the art will understand that various changes, modifications, substitutions, and variations can be made to these embodiments without departing from the principles and spirit of the invention. The scope of the invention is defined by the appended claims.
Claims (6)
1. A method for automatic quality inspection of digital archives through self-verification, characterized by comprising the following steps:
S1: Set the data storage format: the data produced by digitization is stored in an Excel spreadsheet, and the scanned pictures of each piece of material are stored in a folder;
S2: Set the verification parameters and rules: add detection libraries according to the inspection requirements and provide a detection settings interface in which users can add and delete check items as needed;
S3: Select the batch of archives to be inspected: select the IP address, port information, and database user information of the client's production database to connect to, which is used to detect whether the archive data under inspection duplicates data already stored in that database; at the same time select the inspection paths to specify the basic data of the archives under inspection and the scanned picture data; finally set the output path for the quality inspection report and for the analysis log of abnormal conditions;
S4: Verify: according to the verification rules and parameters, verify the archives with respect to text, data items, and scanned pictures respectively;
S5: Generate the analysis log: generate a detailed analysis log of the abnormal data found during verification, for use in checking and repairing the data;
S6: Quality inspection: once the configuration and data of the steps through S5 are complete, start the quality inspection work;
S7: Software system detection: for the items detected automatically by the software system, compare the MD5 and SHA1 codes computed from each scanned picture with the MD5 and SHA1 codes recorded when scanning was completed, to judge whether the picture is complete and whether it has been modified;
S8: Data judgment: based on the material category number, page count, and numbering data in the material data, the software system checks whether every scanned file corresponding to each record of the digitization output actually exists;
S9: Generate data: generate the data quality inspection report from the inspection data.
2. The method for automatic quality inspection of digital archives through self-verification according to claim 1, characterized in that the detection library in step S2 contains at least one group of detection options, and each detection option comprises a verification rule and verification parameters.
3. The method for automatic quality inspection of digital archives through self-verification according to claim 1, characterized in that the verification in step S4 covers text correctness, data duplication, picture integrity, and picture similarity.
4. The method for automatic quality inspection of digital archives through self-verification according to claim 1, characterized in that, during the quality inspection of step S6, the inspection items include archive information items, material information items, picture information items, catalog information items, and category information items, and the corresponding inspection parameters are set for each item according to the detection rules required.
5. The method for automatic quality inspection of digital archives through self-verification according to claim 4, characterized in that the detection rules include ID card number verification, date verification, integer verification, function verification, and regular-expression verification.
6. The method for automatic quality inspection of digital archives through self-verification according to claim 1, characterized in that the automatic detection by the software system in step S7 includes detection of picture quality, resolution, and file size.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810680619.XA CN108985187A (en) | 2018-06-27 | 2018-06-27 | Method for automatic quality inspection of digital archives through self-verification |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108985187A (en) | 2018-12-11 |
Family
ID=64538541
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810680619.XA Pending CN108985187A (en) | 2018-06-27 | 2018-06-27 | Method for automatic quality inspection of digital archives through self-verification |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108985187A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109919435A (en) * | 2019-01-29 | 2019-06-21 | 国网物资有限公司 | Call for bid the automatic screening system and method for technical parameter |
CN111325460A (en) * | 2020-02-18 | 2020-06-23 | 深圳中兴网信科技有限公司 | Archive quality evaluation method, evaluation device, and computer-readable storage medium |
CN112416864A (en) * | 2020-11-18 | 2021-02-26 | 广东电网有限责任公司佛山供电局 | Automatic quality inspection method for digital files |
CN112883139A (en) * | 2021-03-15 | 2021-06-01 | 国家海洋信息中心 | Automatic checking method for spatial data and service data based on GIS vector calculation |
CN113379254A (en) * | 2021-06-15 | 2021-09-10 | 深圳市聚赢档案管理有限公司 | Automatic quality inspection system for notarization archives |
CN116976735A (en) * | 2023-08-01 | 2023-10-31 | 深圳市畅飞扬信息系统有限公司 | File digital quality detection and improvement method and system |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105589837A (en) * | 2014-10-22 | 2016-05-18 | 北京广利核系统工程有限公司 | Automatic electronic document checking method |
CN105634841A (en) * | 2014-10-29 | 2016-06-01 | 任子行网络技术股份有限公司 | Method and device for decreasing redundant logs of network auditing system |
CN107194659A (en) * | 2017-04-26 | 2017-09-22 | 珠海泰坦软件系统有限公司 | A kind of archival digitalization copy quality automated detection method |
CN107665399A (en) * | 2017-09-06 | 2018-02-06 | 北京联合大学 | A kind of personal file storage based on digital signature technology and credible management of electronic documents method |
CN107909380A (en) * | 2017-12-15 | 2018-04-13 | 定远县网萌电子商务有限公司 | Archival digitalization processes after-sale service management platform |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108985187A (en) | A kind of method that automatic quality inspection is realized in self verification of digital archive | |
CN113434485B (en) | Data quality health degree analysis method and system based on multidimensional analysis technology | |
CN109934554A (en) | A kind of method, electric terminal and storage medium for examining invoice | |
CN112800044B (en) | Data quality judging and monitoring method, management system, storage medium and terminal | |
CN109598228A (en) | Paper document electronization is recorded to the method and system of filing | |
CN113298497A (en) | Drawing review method and system based on building information model | |
CN110046789B (en) | Automatic generation method and system for student information literacy assessment test paper | |
CN111427928A (en) | Data quality detection method and device | |
CN116012178B (en) | Automatic financial accounting method based on artificial intelligence | |
CN111311120A (en) | Self-evaluation method and system for enterprise declared science and technology project | |
CN107194659A (en) | A kind of archival digitalization copy quality automated detection method | |
CN112486841A (en) | Method and device for checking data collected by buried point | |
CN115795319A (en) | Test item detection method and related device based on CNAS detection laboratory | |
CN113791980B (en) | Conversion analysis method, device and equipment for test cases and storage medium | |
CN113220594B (en) | Automatic test method, device, equipment and storage medium | |
CN115423421A (en) | Method and device for automatically auditing process of inspection report, electronic equipment and medium | |
CN112465456A (en) | Engineering evaluation data information management method, system and electronic equipment | |
CN114648256A (en) | Data security check method, system and equipment | |
CN113327023A (en) | Traversal test method and device, electronic equipment and computer readable storage medium | |
CN109446192B (en) | Data testing method and device | |
CN116303104B (en) | Automated process defect screening management method, system and readable storage medium | |
Avlokulov et al. | GUIDELINES FOR THE APPLICATION OF SAMPLING METHODS IN THE GATHERING OF AUDIT EVIDENCE | |
CN117726300B (en) | Automatic intelligent processing system for verifying bidding agency business data | |
CN118154113A (en) | Merchant information examination method, device, electronic equipment and readable storage medium | |
Wittorf et al. | Automated Image Metadata Verification |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20181211 |