CN108985187A - A kind of method that automatic quality inspection is realized in self verification of digital archive - Google Patents

A kind of method that automatic quality inspection is realized in self verification of digital archive Download PDF

Info

Publication number
CN108985187A
CN108985187A CN201810680619.XA CN201810680619A CN108985187A CN 108985187 A CN108985187 A CN 108985187A CN 201810680619 A CN201810680619 A CN 201810680619A CN 108985187 A CN108985187 A CN 108985187A
Authority
CN
China
Prior art keywords
data
verification
quality inspection
item
picture
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810680619.XA
Other languages
Chinese (zh)
Inventor
崔晓明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Southern Human Resources Evaluation Center Co Ltd
Original Assignee
Guangzhou Southern Human Resources Evaluation Center Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Southern Human Resources Evaluation Center Co Ltd filed Critical Guangzhou Southern Human Resources Evaluation Center Co Ltd
Priority to CN201810680619.XA priority Critical patent/CN108985187A/en
Publication of CN108985187A publication Critical patent/CN108985187A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • General Factory Administration (AREA)

Abstract

The invention discloses a kind of methods of self automatic quality inspection of verification realization of digital archive, comprising the following steps: S1: setting data memory format;S2: setting checking parameter and rule;S3: the batch of archives to be detected is selected;S4: verification;S5: analysis log is generated;S6: quality inspection;S7: software systems detection;S8: data judgement;S9: data are generated.The present invention passes through data memory format, the verification rule of file data content item, the parameter of verification, pass through the correctness from text, the repeatability of data, the integrality of picture, the verification to file data item respectively of the similitude of picture, detailed analysis log is automatically generated to there are abnormal data in checking procedure, present invention configuration is flexible, it can satisfy different types of data check, it is fully automated, and it does not need manually to participate in, data can be checked comprehensively rather than be inspected by random samples, the inspection of mass data can be coped with completely, improve quality inspection efficiency and quality inspection quality.

Description

A kind of method that automatic quality inspection is realized in self verification of digital archive
Technical field
The present invention relates to digital archive quality inspection technical field, self verification of specially a kind of digital archive is realized automatic The method of quality inspection.
Background technique
Archival digitalization processing achievement is generally made of archives catalog data and content data, is processed into archival digitalization The quality examination of fruit, generally using " archives of paper quality digitizing technique specification " (DA/T 31-2005) as prevailing quality inspection mark Standard is all using artificial quality inspection as main quality detecting method all the time.It is checked in work in long-term archival digitalization processing, Having found artificial quality detecting method, there are many deficiencies, in order to solve these deficiencies, also once throw manpower using increasing, promote quality inspection personnel Technical ability, the methods of strengthen management, final quality of achievement is also promoted, but is not had obvious effects on always, and these people Drawback in working medium detecting method becomes increasingly conspicuous, and summarizes by analysis, compares distinct issues mainly and has and is following:
First, error rate is high.For example, quality inspection personnel when checking catalogue data, is difficult to find mistake therein not Word affects the normalization and accuracy requirement of archival digitalization processing achievement;For another example, it is checked to scanning result file When, it is difficult to the case where page, is leaked in discovery, detects to the relevance of catalogue and computer documents, it is also difficult to identify mistake therein, The integrality of archival digitalization processing achievement is not can guarantee.
Second, it is comprehensive not can guarantee inspection.The conscious degree of subjectivity for relying only on quality inspection personnel, is easy to appear the feelings of missing inspection Condition, is unable to ensure every a material, each single item must examine catalogue data all and have passed through inspection, and entire archival digitalization is caused to be processed into Fruit it is with a low credibility.
Third, low efficiency.For example, when carrying out number of pages inspection it may first have to count the page of relevant each part material again Number, then calculates overall result, then count again in scanning achievement file total page number (one file of single page file is one page, The file of multipage then needs to obtain the total page number of this document), finally check that the catalogue number of pages registered in advance, three number of pages information are necessary It is completely the same to indicate that number of pages information is correct.The inspection work for only completing this, must just take a substantial amount of time, and In checking process must energy high concentration just can guarantee the accurate of number of pages data, otherwise can only put into more energy for returning Work inspection.
4th, the inspection of mass data can not be coped with.With the development of Information Construction of Archive, carries out digitlization and add The archives quantity of work is more and more, and the archival digitalization of generation processes achievement enormous amount, archival digitalization processing capacity easily on Ten million, the method for only relying on artificial quality inspection is unable to complete and all carries out quality examination to all files Digital manufacturing achievement and appoint Business can only take the method for sampling observation to carry out quality examination to archival digitalization processing achievement.According to " archives of paper quality digitlization rule Model " requirement in (DA/T 31-2005) to data acceptance, as soon as the sampling observation ratio of archives need to be not less than 5% in general archive, if It is inspected by random samples by 5% sampling observation ratio, even if all data of sampling observation are all qualified, but still there are also 95% data not to be detected It looked into, the quality of these data will be difficult to ensure.
Summary of the invention
The purpose of the present invention is to provide a kind of method of self automatic quality inspection of verification realization of digital archive, configuration spirits It is living, it can satisfy different types of data check, it is fully automated, and do not need manually to participate in, data can be carried out comprehensive It checks rather than inspects by random samples, the inspection of mass data can be coped with completely, improve quality inspection efficiency and quality inspection quality, it is above-mentioned to solve The problem of being proposed in background technique.
To achieve the above object, the invention provides the following technical scheme: a kind of self verification of digital archive is realized automatically The method of quality inspection, comprising the following steps:
S1: data memory format is arranged in setting data memory format, and the data after Digital manufacturing are stored in Excel table In lattice, the scanned picture of each material is stored in file;
S2: setting checking parameter and rule increases detection library newly according to the demand of quality inspection, provides detection, set interface, User can needing to be added and check item and delete check item according to oneself;
S3: selecting the batch of archives to be detected, selects the IP address and port information in be articulated to client's actual data library And whether database user information repeats for detecting file data to be detected with the data for being put in storage actual data library, together When selection quality inspection path the basic data of archives to be detected and the image data of scanning machining are set, finally be arranged quality inspection report with The outgoing route of the analysis log information of abnormal conditions;
S4: verification carries out school from text, data item and scanned picture to archives according to the rule of verification and parameter respectively It tests;
S5: generating analysis log, generates detailed analysis log by there are abnormal data in checking procedure, for pair Data are checked and are repaired;
S6: quality inspection after the completion of the above configuration data of step S5, starts to carry out quality inspection work;
S7: software systems detection, the item detected automatically by software systems pass through the MD5 calculated scanned picture Code and SHA1 code and MD5 code when scanning completion compare with SHA1 code judge picture whether completely with whether repaired by others Change;
S8: data judgement, for software systems according to the material class number in material data, number of pages, number data, intelligent measurement is every The whether whole necessary beings of scanning file corresponding to the data of one archival digitalization processing achievement;
S9: generating data, and the report of data quality checking is generated according to the data of quality inspection.
It preferably, include at least one set of detection option in detection library in the step S2, each detection option includes Verification rule and checking parameter.
Preferably, the content of the verification of the step S4 include the correctness of text, the repeatability of data, picture it is complete The similitude of property and picture.
Preferably, when the step S6 quality inspection, quality inspection item includes the letter of the item of information of archives, the item of information of material, picture Cease item, the item of information of catalogue and the item of information of classification, and each corresponding inspection parameter of rule setting for detecting as needed.
Preferably, the rule of the detection includes identity card verification, date verification, integer verification, function verification and canonical Verification.
Preferably, the automatic detection of software systems includes picture quality, resolution ratio and file size inspection in the step S7 It surveys.
A kind of technical effect and advantage of the invention: self automatic quality inspection of verification realization of digital archive proposed by the present invention Method have the advantage that compared with prior art
The present invention by the standard set data memory format that oneself is arranged, the verification rule of file data content item, The parameter of verification, by the repeatability, the integrality of picture, the similitude of picture of correctness, data from text respectively to shelves The verification of case data item automatically generates detailed analysis log to there are abnormal data in checking procedure, and the present invention configures spirit It is living, it can satisfy different types of data check, it is fully automated, and do not need manually to participate in, data can be carried out comprehensive It checks rather than inspects by random samples, the inspection of mass data can be coped with completely, improve quality inspection efficiency and quality inspection quality.
Detailed description of the invention
Fig. 1 is process flow chart of the invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.
Referring to Fig. 1, the present invention provides a kind of technical solution: automatic quality inspection is realized in a kind of self verification of digital archive Method, comprising the following steps:
S1: data memory format is arranged in setting data memory format, and the data after Digital manufacturing are stored in Excel table In lattice, the scanned picture of each material is stored in file;
S2: setting checking parameter and rule increases detection library newly according to the demand of quality inspection, provides detection, set interface, User can needing to be added and check item and delete check item according to oneself;
S3: selecting the batch of archives to be detected, selects the IP address and port information in be articulated to client's actual data library And whether database user information repeats for detecting file data to be detected with the data for being put in storage actual data library, together When selection quality inspection path the basic data of archives to be detected and the image data of scanning machining are set, finally be arranged quality inspection report with The outgoing route of the analysis log information of abnormal conditions;
S4: verification carries out school from text, data item and scanned picture to archives according to the rule of verification and parameter respectively It tests;
S5: generating analysis log, generates detailed analysis log by there are abnormal data in checking procedure, for pair Data are checked and are repaired;
S6: quality inspection after the completion of the above configuration data of step S5, starts to carry out quality inspection work;
S7: software systems detection, the item detected automatically by software systems pass through the MD5 calculated scanned picture Code and SHA1 code and MD5 code when scanning completion compare with SHA1 code judge picture whether completely with whether repaired by others Change;
S8: data judgement, for software systems according to the material class number in material data, number of pages, number data, intelligent measurement is every The whether whole necessary beings of scanning file corresponding to the data of one archival digitalization processing achievement;
S9: generating data, and the report of data quality checking is generated according to the data of quality inspection.
Specifically, including at least one set of detection option in detection library in the step S2, each detection option includes Verification rule and checking parameter.
Specifically, the content of the verification of the step S4 include the correctness of text, the repeatability of data, picture it is complete The similitude of property and picture.
Specifically, quality inspection item includes the letter of the item of information of archives, the item of information of material, picture when the step S6 quality inspection Cease item, the item of information of catalogue and the item of information of classification, and each corresponding inspection parameter of rule setting for detecting as needed.
Specifically, the rule of the detection includes identity card verification, date verification, integer verification, function verification and canonical Verification.
Specifically, the automatic detection of software systems includes picture quality, resolution ratio and file size inspection in the step S7 It surveys.
In conclusion the present invention is by the standard set data memory format that oneself is arranged, file data content item Verification rule, the parameter of verification, pass through the repeatability, the integrality of picture, the similitude of picture of correctness, data from text Respectively to the verification of file data item, detailed analysis log, this hair are automatically generated to there are abnormal data in checking procedure Bright configuration flexibly, can satisfy different types of data check, fully automated, and not need manually to participate in, can be to data It is checked rather than is inspected by random samples comprehensively, the inspection of mass data can be coped with completely, improve quality inspection efficiency and quality inspection quality.
It although an embodiment of the present invention has been shown and described, for the ordinary skill in the art, can be with A variety of variations, modification, replacement can be carried out to these embodiments without departing from the principles and spirit of the present invention by understanding And modification, the scope of the present invention is defined by the appended.

Claims (6)

1. the method that automatic quality inspection is realized in a kind of self verification of digital archive, it is characterised in that: the following steps are included:
S1: data memory format is arranged in setting data memory format, and the data after Digital manufacturing are stored in Excel table, The scanned picture of each material is stored in file;
S2: setting checking parameter and rule increase detection library newly according to the demand of quality inspection, provide detection, set interface, user It can needing to be added and check item and delete check item according to oneself;
S3: selecting the batch of archives to be detected, select be articulated to client's actual data library IP address and port information and Database user information is used to detect file data to be detected whether repeat with the data for being put in storage actual data library, selects simultaneously It selects quality inspection path and the basic data of archives to be detected and the image data of scanning machining is set, quality inspection is finally set and is reported and exception The outgoing route of the analysis log information of situation;
S4: verification verifies archives from text, data item and scanned picture according to the rule of verification and parameter respectively;
S5: generating analysis log, detailed analysis log is generated by there are abnormal data in checking procedure, for data It is checked and is repaired;
S6: quality inspection after the completion of the above configuration data of step S5, starts to carry out quality inspection work;
S7: software systems detection, the item detected automatically by software systems, by the MD5 code that calculates scanned picture and MD5 code when SHA1 code and scanning are completed compares judge whether picture is complete with SHA1 code, if is modified by others;
S8: data judgement, software systems are according to the material class number in material data, number of pages, number data, intelligent measurement each Archival digitalization processes the whether whole necessary beings of scanning file corresponding to the data of achievement;
S9: generating data, and the report of data quality checking is generated according to the data of quality inspection.
2. the method that automatic quality inspection is realized in a kind of self verification of digital archive according to claim 1, it is characterised in that: It include at least one set of detection option in detection library in the step S2, each detection option includes verification rule and verification ginseng Number.
3. the method that automatic quality inspection is realized in a kind of self verification of digital archive according to claim 1, it is characterised in that: The content of the verification of the step S4 includes the similar of the correctness of text, the repeatability of data, the integrality of picture and picture Property.
4. the method that automatic quality inspection is realized in a kind of self verification of digital archive according to claim 1, it is characterised in that: When the step S6 quality inspection, quality inspection item include the item of information of archives, the item of information of material, the item of information of picture, catalogue information And classification item of information, and each corresponding inspection parameter of rule setting for detecting as needed.
5. the method that automatic quality inspection is realized in a kind of self verification of digital archive according to claim 4, it is characterised in that: The rule of the detection includes identity card verification, date verification, integer verification, function verification and canonical verification.
6. the method that automatic quality inspection is realized in a kind of self verification of digital archive according to claim 1, it is characterised in that: The automatic detection of software systems includes picture quality, resolution ratio and file size detection in the step S7.
CN201810680619.XA 2018-06-27 2018-06-27 A kind of method that automatic quality inspection is realized in self verification of digital archive Pending CN108985187A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810680619.XA CN108985187A (en) 2018-06-27 2018-06-27 A kind of method that automatic quality inspection is realized in self verification of digital archive

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810680619.XA CN108985187A (en) 2018-06-27 2018-06-27 A kind of method that automatic quality inspection is realized in self verification of digital archive

Publications (1)

Publication Number Publication Date
CN108985187A true CN108985187A (en) 2018-12-11

Family

ID=64538541

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810680619.XA Pending CN108985187A (en) 2018-06-27 2018-06-27 A kind of method that automatic quality inspection is realized in self verification of digital archive

Country Status (1)

Country Link
CN (1) CN108985187A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109919435A (en) * 2019-01-29 2019-06-21 国网物资有限公司 Call for bid the automatic screening system and method for technical parameter
CN111325460A (en) * 2020-02-18 2020-06-23 深圳中兴网信科技有限公司 Archive quality evaluation method, evaluation device, and computer-readable storage medium
CN112416864A (en) * 2020-11-18 2021-02-26 广东电网有限责任公司佛山供电局 Automatic quality inspection method for digital files
CN112883139A (en) * 2021-03-15 2021-06-01 国家海洋信息中心 Automatic checking method for spatial data and service data based on GIS vector calculation
CN113379254A (en) * 2021-06-15 2021-09-10 深圳市聚赢档案管理有限公司 Automatic quality inspection system for notarization archives
CN116976735A (en) * 2023-08-01 2023-10-31 深圳市畅飞扬信息系统有限公司 File digital quality detection and improvement method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105589837A (en) * 2014-10-22 2016-05-18 北京广利核系统工程有限公司 Automatic electronic document checking method
CN105634841A (en) * 2014-10-29 2016-06-01 任子行网络技术股份有限公司 Method and device for decreasing redundant logs of network auditing system
CN107194659A (en) * 2017-04-26 2017-09-22 珠海泰坦软件系统有限公司 A kind of archival digitalization copy quality automated detection method
CN107665399A (en) * 2017-09-06 2018-02-06 北京联合大学 A kind of personal file storage based on digital signature technology and credible management of electronic documents method
CN107909380A (en) * 2017-12-15 2018-04-13 定远县网萌电子商务有限公司 Archival digitalization processes after-sale service management platform

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105589837A (en) * 2014-10-22 2016-05-18 北京广利核系统工程有限公司 Automatic electronic document checking method
CN105634841A (en) * 2014-10-29 2016-06-01 任子行网络技术股份有限公司 Method and device for decreasing redundant logs of network auditing system
CN107194659A (en) * 2017-04-26 2017-09-22 珠海泰坦软件系统有限公司 A kind of archival digitalization copy quality automated detection method
CN107665399A (en) * 2017-09-06 2018-02-06 北京联合大学 A kind of personal file storage based on digital signature technology and credible management of electronic documents method
CN107909380A (en) * 2017-12-15 2018-04-13 定远县网萌电子商务有限公司 Archival digitalization processes after-sale service management platform

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109919435A (en) * 2019-01-29 2019-06-21 国网物资有限公司 Call for bid the automatic screening system and method for technical parameter
CN111325460A (en) * 2020-02-18 2020-06-23 深圳中兴网信科技有限公司 Archive quality evaluation method, evaluation device, and computer-readable storage medium
CN112416864A (en) * 2020-11-18 2021-02-26 广东电网有限责任公司佛山供电局 Automatic quality inspection method for digital files
CN112883139A (en) * 2021-03-15 2021-06-01 国家海洋信息中心 Automatic checking method for spatial data and service data based on GIS vector calculation
CN113379254A (en) * 2021-06-15 2021-09-10 深圳市聚赢档案管理有限公司 Automatic quality inspection system for notarization archives
CN116976735A (en) * 2023-08-01 2023-10-31 深圳市畅飞扬信息系统有限公司 File digital quality detection and improvement method and system

Similar Documents

Publication Publication Date Title
CN108985187A (en) A kind of method that automatic quality inspection is realized in self verification of digital archive
CN113434485B (en) Data quality health degree analysis method and system based on multidimensional analysis technology
CN109934554A (en) A kind of method, electric terminal and storage medium for examining invoice
CN112800044B (en) Data quality judging and monitoring method, management system, storage medium and terminal
CN109598228A (en) Paper document electronization is recorded to the method and system of filing
CN113298497A (en) Drawing review method and system based on building information model
CN110046789B (en) Automatic generation method and system for student information literacy assessment test paper
CN111427928A (en) Data quality detection method and device
CN116012178B (en) Automatic financial accounting method based on artificial intelligence
CN111311120A (en) Self-evaluation method and system for enterprise declared science and technology project
CN107194659A (en) A kind of archival digitalization copy quality automated detection method
CN112486841A (en) Method and device for checking data collected by buried point
CN115795319A (en) Test item detection method and related device based on CNAS detection laboratory
CN113791980B (en) Conversion analysis method, device and equipment for test cases and storage medium
CN113220594B (en) Automatic test method, device, equipment and storage medium
CN115423421A (en) Method and device for automatically auditing process of inspection report, electronic equipment and medium
CN112465456A (en) Engineering evaluation data information management method, system and electronic equipment
CN114648256A (en) Data security check method, system and equipment
CN113327023A (en) Traversal test method and device, electronic equipment and computer readable storage medium
CN109446192B (en) Data testing method and device
CN116303104B (en) Automated process defect screening management method, system and readable storage medium
Avlokulov et al. GUIDELINES FOR THE APPLICATION OF SAMPLING METHODS IN THE GATHERING OF AUDIT EVIDENCE
CN117726300B (en) Automatic intelligent processing system for verifying bidding agency business data
CN118154113A (en) Merchant information examination method, device, electronic equipment and readable storage medium
Wittorf et al. Automated Image Metadata Verification

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20181211