Electronic archives information reorganization method and device
Technical field
The invention relates to archive information technology, particularly about a kind of electronic archives information reorganization method and device.
Background technology
The collection of file, qualification and reorganization are the basis accumulation stages of archives life cycle.Are Dialectical movement processes that a criticism is inherited from file to archives, file is the predecessor of archives, and archives are homes to return to of file, archives are while succession file primitiveness, also inherit the record of file, there is Historic Reappearance, be worth so only have archives just to have voucher.
Along with the high speed development of communication technology and network, e-file has been widely used in government offices, enterprises and institutions and social organization's activity.Meanwhile, electronic record has become with archives of paper quality and the information resources of depositing, and has become the informationalized archives main body of modern society.But along with improving constantly of file administration demand and making rapid progress of network technology, the artificial qualification of traditionally on paper archives and reorganization mode, not only waste a large amount of material resources, manpower and time, also brings hard work burden to Archives Workers.
The electronic record qualification of current main flow and reorganization method, adopt the mode of manual hand manipulation usually.The business such as qualification, group volume, mounted box, reorganization, Data Enter of archives, all operate computing machine by hand to complete, qualification, reorganization work numerous and diverse, easily there is mistake, this greatly limits archive information qualification and reorganization efficiency, also increase the weight of the work load of collection of documents housekeeping personnel simultaneously.
Summary of the invention
The invention provides a kind of electronic archives information reorganization method and device, with while guarantee File Identification and reorganization accuracy rate, effectively improve the qualification of electronic record, reorganization efficiency, reduce the duplication of labour, effectively reduce the labour intensity of Archives Workers.
For achieving the above object, the invention provides a kind of electronic archives information reorganization method, described electronic archives information reorganization method comprises:
E-file in each database is resolved to XML file according to the template of correspondence, and described XML file is circulated in loose library;
XML file in described loose library is mated according to the multiple file filter templates preset, XML file by coupling is sent to reorganization storehouse, wherein, the file filter template preset, make according to map file type " archive scope and retention period regulation " (being called for short " regulation "), each rule of this " regulation " is carried out Chinese word segmentation, and then obtain the file filter template that is made up of Chinese word segmentation, therefore, the matching process of filtering profile, is namely converted into: the matching process of keyword;
According to the classification of documents mode preset, classify to the XML file in described reorganization storehouse, luggage box of going forward side by side, generates the e-file after reorganization;
E-file after described reorganization is filed, enters electronic archives library, form electronic record.
In one embodiment, described electronic archives information reorganization method also comprises: will not be sent to documentation storehouse by the XML file of coupling.
In one embodiment, the XML file in described loose library is mated according to the multiple file filter templates preset, comprising:
Be multiple described file filter templates by default scope of archiving standard resolution;
XML file in described loose library is mated with file filter template described in each;
The number of the described file filter template that judgement is mated with the XML file in described loose library is more than or equal to 1;
If so, the XML file by coupling is sent to reorganization storehouse.
In one embodiment, according to the classification of documents mode preset, classify to the XML file in described reorganization storehouse, luggage box of going forward side by side, generates the e-file after reorganization, comprising:
Different mode classifications is preset to dissimilar archives;
Read the XML file in described reorganization storehouse successively, and judge the type belonging to XML file of current reading;
The type belonging to XML file of described current reading whether is there is in retrieval current box;
If so, the XML file of described current reading is loaded when in archives front cabinet;
If not, create the first new archive box, and the XML file of described current reading is loaded in described first new archive box.
In one embodiment, the XML file of described current reading is loaded in current filing container, comprising:
Obtain the number of pages filling the XML file of number of pages and described current reading of current filing container;
Judge whether the number of pages sum filling the XML file of number of pages and described current reading of current filing container is greater than the total page number of current filing container;
If not, the XML file of described current reading is loaded when in archives front cabinet;
If so, create the second new archive box, the XML file of described current reading is loaded in described second new archive box.
To achieve these goals, the present invention also provides a kind of electronic archives information to reorganize device, it is characterized in that, described electronic archives information reorganization device comprises:
Restoring files unit, for the e-file in each database is resolved to XML file according to the template of correspondence, and circulates in loose library by described XML file;
File qualification unit, for mating according to the multiple file filter templates preset the XML file in described loose library, is sent to reorganization storehouse by the XML file by coupling;
File reorganization unit, for according to the classification of documents mode preset, classifies to the XML file in described reorganization storehouse, and luggage box of going forward side by side, generates the e-file after reorganization;
Archive unit, for filing the e-file after described reorganization, enters electronic archives library, forms electronic record.
In one embodiment, described file qualification unit is not also for being sent to documentation storehouse by the XML file of coupling.
In one embodiment, described file qualification unit comprises:
Split module, for being multiple described file filter templates by default scope of archiving standard resolution;
File matching module, for mating the XML file in described loose library with file filter template described in each;
Judge module, for judging that the number of the described file filter template of mating with the XML file in described loose library is more than or equal to 1;
File send module, for being sent to reorganization storehouse by the XML file by coupling.
In one embodiment, described file reorganization unit, comprising:
Classification setting module, for presetting different mode classifications to dissimilar archives;
Type judging module, for reading the XML file in described reorganization storehouse successively, and judges the type belonging to XML file of current reading;
Retrieval module, for retrieving in current box the type belonging to the XML file that whether there is described current reading;
Mounted box module, for loading when in archives front cabinet by the XML file of described current reading;
New box creation module, for creating the first new archive box, and is loaded the XML file of described current reading in described first new archive box by described mounted box module.
In one embodiment, described mounted box module is specifically for the number of pages filling the XML file of number of pages and described current reading that obtains current filing container; Judge whether the number of pages sum filling the XML file of number of pages and described current reading of current filing container is greater than the total page number of current filing container; If not, the XML file of described current reading is loaded when in archives front cabinet; If so, described new box creation module creates the second new archive box, and the XML file of described current reading is loaded in described second new archive box by described mounted box module.
While the present invention can ensure File Identification and reorganization accuracy rate, effectively improve the qualification of electronic record, reorganization efficiency, reduce the duplication of labour, effectively reduce the labour intensity of Archives Workers.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, be briefly described to the accompanying drawing used required in embodiment or description of the prior art below, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to these accompanying drawings.
Fig. 1 is that embodiment of the present invention electronic record is automatically identified, reorganized schematic flow sheet;
Fig. 2 is the electronic archives information reorganization method flow diagram of the embodiment of the present invention;
Fig. 3 is that the heterogeneous databases integration layer of the embodiment of the present invention designs a model figure;
Fig. 4 is the process flow diagram of the S202 of Fig. 2;
Fig. 5 is the process flow diagram of the S203 of Fig. 2;
Fig. 6 is the process flow diagram of the S504 of Fig. 5;
Fig. 7 is the reorganization process flow diagram of the embodiment of the present invention;
Fig. 8 is the automatic mounted box structural representation of the e-file of the embodiment of the present invention;
Fig. 9 is the BoxStatus data dictionary schematic diagram of the embodiment of the present invention;
Figure 10 is the BoxStatus data representation intention of the embodiment of the present invention;
Figure 11 is the structured flowchart of the electronic archives information reorganization device of the embodiment of the present invention;
Figure 12 is the structured flowchart of the file qualification unit 1102 of Figure 11;
Figure 13 is the structured flowchart of the file reorganization unit 1103 of Figure 11.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, be clearly and completely described the technical scheme in the embodiment of the present invention, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to the scope of protection of the invention.
As shown in Figure 1, after unlatching Automation Reorganization switch, user needs to preset Automation Reorganization rule the reorganization process of archive information of the present invention, after having new file to arrive in loose library, will carry out Automation Reorganization according to the reorganization rule preset.In Automation Reorganization process, also manually can be adjusted by manual reorganization mode.As can be seen from Figure 1, the reorganization process of archive information of the present invention comprises: the qualification of restoring files, file, file reorganization and electronic records filing four steps, the following detailed description of the reorganization process of electronic archives information.
As shown in Figure 2, the embodiment of the present invention provides a kind of electronic archives information reorganization method, and described electronic archives information reorganization method comprises:
S201: the e-file in each database is resolved to XML file according to the template of correspondence, and described XML file is circulated in loose library;
S202: mate according to the multiple file filter templates preset the XML file in described loose library, is sent to reorganization storehouse by the XML file by coupling.The file filter template preset, make according to map file type " archive scope and retention period regulation " (being called for short " regulation "), each rule of this " regulation " is carried out Chinese word segmentation, and then obtain the file filter template that is made up of Chinese word segmentation, therefore, the matching process of filtering profile, is namely converted into: the matching process of keyword.
S203: according to the classification of documents mode preset, classifies to the XML file in described reorganization storehouse, luggage box of going forward side by side, and generates the e-file after reorganization;
S204: file the e-file after described reorganization, enter electronic archives library, forms electronic record.
From the flow process of Fig. 2, in order to realize the reorganization to electronic archives information, first the e-file in each database is resolved to XML file according to the template of correspondence by the present invention, and described XML file is circulated in loose library.Then, the XML file in described loose library is mated according to the multiple file filter templates preset, the XML file by coupling is sent to reorganization storehouse.Afterwards, according to the classification of documents mode preset, classify to the XML file in described reorganization storehouse, luggage box of going forward side by side, generates the e-file after reorganization.Finally, the e-file after described reorganization is filed, enters electronic archives library, form electronic record.By the above-mentioned automatic flow to e-file, automatically qualification, Automation Reorganization and filing, while File Identification and reorganization accuracy rate can be ensured, the qualification of effective raising electronic record, reorganization efficiency, reduce the duplication of labour, effectively reduce the labour intensity of Archives Workers.
Because e-file is distributed in different computing machines, operating system, infosystem and data, there is different data layouts, data standard and management method, these e-files are due to data class, data structure difference, cannot mutually exchange and utilize, define " information island ", cause the trouble waters of " data are set up a separatist regime by force of arms ", have impact on sharing and utilizing of e-file resource.
For the problems referred to above, in S201, select to take XML file as conventional data Interchange Format, unified by each heterogeneous database by XML analysis mode, the isomery completing e-file resource is integrated.By XML extend markup language (ExtensibleMarkupLanguage), arrange the e-file be distributed in each document data bank, Uniform data format, circulation enters loose library.Each document data bank can comprise MIS, OA, administrative examination and approval and other databases.
In order to realize the shielding to bottom heterogeneous database complicacy, reach good dirigibility, configurability, decoupling zero is fallen apart library and heterogeneous database layer, and ensures the high reusability of heterogeneous database layer, and heterogeneous database layer designs a model as shown in Figure 3.
During concrete enforcement, can according to the heterogeneous database structural information of each distribution and shared demand, dynamic configuration database is shared and is arranged, and resolves to XML file, generate Map Profile according to different templates (template that different databases is corresponding different); Then, XML file packaging is become message, sends it on central application server by Java messenger service (JMS); Finally, analyzing XML file, utilizes XML file data message to upgrade central shared data bank, finally realizes heterogeneous databases integration and shares and synchronous refresh real-time update.
As shown in Figure 4, in one embodiment, S202 can comprise the steps:
Default scope of archiving standard resolution is multiple described file filter templates by S401: preset scope of archiving standard.
S402: the XML file in described loose library is mated with file filter template described in each.
S403: the number of the described file filter template that judgement is mated with the XML file in described loose library is more than or equal to 1, if the number of the file filter template of mating with the XML file in loose library is greater than or equal to 1, carries out S404.
S404: the XML file by coupling is sent to reorganization storehouse.
During concrete enforcement, the automatic qualification of e-file such as can based on " Municipal Commission of Science and Technology's documents scope of archiving and secretarial document retention period regulation " (hereinafter referred to as " scope of archiving "), " scope of archiving " is disassembled the filter criteria (rule) into fixed qty, such as, can disassemble as Party Committees meeting minutes.Then the filter criteria disassembled is made the scope of archiving template that computing machine can identify.
When the e-file circulation of different information systems is come, by this scope of archiving template, every a e-file is filtered, eventually through the file of filtering profile, be the e-file needing to carry out filing, added reorganization storehouse; Not by the file of filtering profile, be the e-file not needing to carry out filing, added documentation storehouse, retain as documentation.
" scope of archiving " can be disassembled into N number of file filter template, suppose to be designated as: m
1, m
2, m
3... m
n.For m
1, m
2, m
3... m
nthe coupling of this N number of filtering profile, by Chinese information Keywords matching, completes each template m
ithe filter process of (1≤i≤N), if i.e.: Keywords matching success, is then considered as meeting set filter criteria, by filtering profile, otherwise, be then considered as not passing through filtering profile.
To filtering profile set { m
1, m
2, m
3... m
n, the filtering profile that e-file can pass through is designated as set { m
i(1≤i≤N), will { m be gathered
iradix be designated as X, so:
If i) X >=1, be then considered as e-file by filtering profile, joined in reorganization storehouse;
Ii) if X=0, be then considered as e-file not by filtering profile, will be sent in documentation storehouse by the XML e-file of coupling.
Be the reorganization process of entity archives in simulating reality to the Automation Reorganization of e-file in reorganization storehouse, automatically complete the classification of file, arrangement and mounted box by computing machine.In a database, there is not real filing container, so for every part archives, only need give a box number, just can pass through box number, the File reorganizing mounted box process in reality is embodied.
As shown in Figure 5, in one embodiment, S203 comprises:
S501: different mode classifications is preset to dissimilar archives.
S502: read the XML file in reorganization storehouse successively, and judge the type belonging to XML file of current reading;
S503: the type belonging to XML file that whether there is described current reading in retrieval current box; If there is the type belonging to XML file of described current reading in current box, carry out S504; If there is not the type belonging to XML file of described current reading in current box, carry out S505.
S504: the XML file of described current reading is loaded when in archives front cabinet;
S505: create the first new archive box, and the XML file of described current reading is loaded in described first new archive box.
As shown in Figure 6, in one embodiment, S504 comprises:
S601: the number of pages filling the XML file of number of pages and described current reading obtaining current filing container;
S602: judge whether the number of pages sum filling the XML file of number of pages and described current reading of current filing container is greater than the total page number of current filing container; If not, carry out S603: if carry out S604.
S603: the XML file of described current reading is loaded when in archives front cabinet;
S604: create the second new archive box, loads the XML file of described current reading in described second new archive box.
The Automation Reorganization Module flow process of the e-file in reorganization storehouse can be summarized as: according to the classification of documents mode preset, all records in traversal reorganization storehouse, determine which classification every bar record is mapped in, and All Files arrangement mounted box in storehouse will be reorganized, be inserted in file store, concrete reorganization flow process as shown in Figure 7.
As shown in Figure 7, reorganize flow process to comprise:
S701: read Article 1 reorganization storehouse record.This reorganization storehouse record can have multiple, and as rewarded archives, administer archive etc., are only described to reward archives below.
S702: obtain file type, year, mechanism, number of pages PageNum from the record of this reorganization storehouse.For above-mentioned award archives, sorting technique is: award-year-subject-grade-reel number, such as rewards prize such as-2014-chemical industry-1 grade-50.In S702, for award archives, need to obtain in the record of this this reorganization storehouse award-year-information such as subject-grade-reel number.
S703: retrieve in the ArchiveType field of table BoxStatus: file type-year-mechanism, for award archives, need retrieval award-year-subject.
S704: judge whether to exist in the ArchiveType field of table BoxStatus file type-year-mechanism's (award-year-subject), if not, carry out S705; If so, S706 is carried out.
S705: create new archive classification, new box: insert new record in table BoxStatus, record ArchiveType field be file type-year-mechanism's (award-year-subject), the current box number (CurBoxID) of new box is set to 1.
S706: this e-file is loaded new box, current number of pages CurBoxPage=file number of pages PageNum in box.
S707: obtain CurBoxPage (in current box number of pages), be added with PageNum, Sum=CurBoxPage+PageNum.
S708: each filing container can only hold N page at most, needs to judge whether Sum is greater than N, if so, carries out S710; If not, S709 is carried out.
S709: this e-file is loaded in current box, current number of pages CurBoxPage=CurBoxPage+PageNum in amendment current box.
S710: current box is full, creates new box: box CurBoxID=CurBoxID+1, and mounted box: current number of pages CurBoxPage=PageNum (file number of pages) in box.
S711: this reorganization storehouse record is inserted in file store, and adds archive information metadata tag, wherein box label: BoxID=CurBoxID.
S712: delete this record from reorganization storehouse.
S713: read next record of reorganization storehouse.
S714: judge whether next record is empty, if so, terminates reorganization process; If not, S702 is carried out.
As shown in Figure 7, when each archives mounted box is complete, last batch of filing container may not filled, because the number of pages of each mounted box can not be just the multiple of N (N represents the capacity of filing container), and during next mounted box, just need to find which filing container not fill, from the filing container that these are not filled, start to continue loading procedure.
As shown in Figure 8, for each classification of documents, last filing container may be all discontented, therefore, design a relation table BoxStatus, be used for representing the state of " current filing container " (filing container do not filled), the data dictionary design of relation table BoxStatus and tables of data are distinguished as shown in FIG. 9 and 10.
The artificial qualification of traditionally on paper archives and reorganization mode loaded down with trivial details, complicated and uninteresting, operative norm is very strict, is a heavy burden of file clerk.Artificial qualification and reorganization, waste a large amount of material resources, manpower and time.Electronic archives information reorganization method is according to setting file Automation Reorganization standard, journey, the electronic archives information gathered automatically is identified, reorganized, and carry out auxiliary adjustment manually to reorganize, while guarantee File Identification and reorganization accuracy rate, effectively improve the qualification of electronic record, reorganization efficiency, decrease the duplication of labour, effectively reduce the labour intensity of Archives Workers.
As shown in figure 11, the embodiment of the present invention provides a kind of electronic archives information to reorganize device, and this electronic archives information reorganization device comprises: restoring files unit 1101, file qualification unit 1102, file reorganization unit 1103 and archive unit 1104.
Described XML file for the e-file in each database is resolved to XML file according to the template of correspondence, and circulates in loose library by restoring files unit 1101.
XML file by coupling, for mating according to the multiple file filter templates preset the XML file in described loose library, is sent to reorganization storehouse by file qualification unit 1102.
File reorganization unit 1103, for according to the classification of documents mode preset, is classified to the XML file in described reorganization storehouse, and luggage box of going forward side by side, generates the e-file after reorganization.
Archive unit 1104, for filing the e-file after described reorganization, enters electronic archives library, forms electronic record.
In one embodiment, this file qualification unit 1102 is not also for being sent to documentation storehouse by the XML file of coupling.
In one embodiment, as shown in figure 12, this file qualification unit 1102 comprises: split module 1201, file matching module 1202, judge module 1203 and file send module 1204.
Split module 1201 for being multiple described file filter templates by default scope of archiving standard resolution.
File matching module 1202 is for mating the XML file in described loose library with file filter template described in each.
Judge module 1203 is for judging that the number of the described file filter template of mating with the XML file in described loose library is more than or equal to 1.
File send module 1204 is for being sent to reorganization storehouse by the XML file by coupling.
In one embodiment, as shown in figure 13, this file reorganization unit 1103, comprising: classification setting module 1301, type judging module 1302, retrieval module 1303, mounted box module 1304 and new box creation module 1305.
Classification setting module 1301 is for presetting different mode classifications to dissimilar archives.
Type judging module 1302 for reading the XML file in described reorganization storehouse successively, and judges the type belonging to XML file of current reading.
Retrieval module 1303 is for retrieving in current box the type belonging to the XML file that whether there is described current reading.
Mounted box module 1304 is for loading the XML file of described current reading when in archives front cabinet.
The XML file of described current reading for creating the first new archive box, and is loaded in described first new archive box by described mounted box module by new box creation module 1305.
In one embodiment, mounted box module 1304 may be used for: the number of pages filling the XML file of number of pages and described current reading obtaining current filing container; Judge whether the number of pages sum filling the XML file of number of pages and described current reading of current filing container is greater than the total page number of current filing container; If not, the XML file of described current reading is loaded when in archives front cabinet; If so, described new box creation module creates the second new archive box, and the XML file of described current reading is loaded in described second new archive box by described mounted box module.
The artificial qualification of traditionally on paper archives and reorganization mode loaded down with trivial details, complicated and uninteresting, operative norm is very strict, is a heavy burden of file clerk.Artificial qualification and reorganization, waste a large amount of material resources, manpower and time.Electronic archives information reorganization method is according to setting file Automation Reorganization standard, journey, the electronic archives information gathered automatically is identified, reorganized, and carry out auxiliary adjustment manually to reorganize, while guarantee File Identification and reorganization accuracy rate, effectively improve the qualification of electronic record, reorganization efficiency, decrease the duplication of labour, effectively reduce the labour intensity of Archives Workers.
Those skilled in the art should understand, embodiments of the invention can be provided as method, system or computer program.Therefore, the present invention can adopt the form of complete hardware embodiment, completely software implementation or the embodiment in conjunction with software and hardware aspect.And the present invention can adopt in one or more form wherein including the upper computer program implemented of computer-usable storage medium (including but not limited to magnetic disk memory, CD-ROM, optical memory etc.) of computer usable program code.
The present invention describes with reference to according to the process flow diagram of the method for the embodiment of the present invention, equipment (system) and computer program and/or block scheme.Should understand can by the combination of the flow process in each flow process in computer program instructions realization flow figure and/or block scheme and/or square frame and process flow diagram and/or block scheme and/or square frame.These computer program instructions can being provided to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, making the instruction performed by the processor of computing machine or other programmable data processing device produce device for realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
These computer program instructions also can be stored in can in the computer-readable memory that works in a specific way of vectoring computer or other programmable data processing device, the instruction making to be stored in this computer-readable memory produces the manufacture comprising command device, and this command device realizes the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
These computer program instructions also can be loaded in computing machine or other programmable data processing device, make on computing machine or other programmable devices, to perform sequence of operations step to produce computer implemented process, thus the instruction performed on computing machine or other programmable devices is provided for the step realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
Apply specific embodiment in the present invention to set forth principle of the present invention and embodiment, the explanation of above embodiment just understands method of the present invention and core concept thereof for helping; Meanwhile, for one of ordinary skill in the art, according to thought of the present invention, all will change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.