CN115878559A - Electronic file management system - Google Patents
- Publication number
- CN115878559A (application number CN202211541720.XA)
- Authority
- CN
- China
- Prior art keywords
- text
- archive
- application
- storage format
- storage
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to the technical field of electronic archive management, and discloses an electronic archive management system which comprises a target archive text archive statistics module, a text archive filing information acquisition module, a text archive historical application parameter extraction module, a text archive application value analysis module, a text archive storage value analysis module, a text archive typesetting requirement analysis module, an archive management library, a text archive adaptive storage format analysis module and a key text archive identification output terminal.
Description
Technical Field
The invention relates to the technical field of electronic archive management, in particular to a text archive management technology, and specifically relates to an electronic archive management system.
Background
Archives are original records, in the form of text, images, audio-visual material and other media, that have preservation value and are formed in the social activities of states, institutions, social organizations and individuals; they serve functions that other materials cannot replace. The traditional record carrier for archives is mainly paper. With the continuous development of science and technology, archive preservation has shifted from paper alone to electronic storage, producing electronic archives. Because electronic archives are easy to store, copy and transmit, they are better suited to present-day routine archive management.
Because paper has long been the dominant record carrier, text archives account for the largest share of the electronic archives held by an archive. Text can be stored in a computer system in various formats, such as txt, doc and pdf, each with its own advantages and disadvantages. An archive therefore currently has to choose one of several text storage formats when it stores a text archive.
However, in the prior art the storage format of a text archive is essentially chosen subjectively by the archive, or a single default format is chosen to save storage space. Lacking an objective, scientific basis, this choice is hard to match to the specific storage requirements of each text archive, so the chosen format fits poorly. This not only makes text archives inconvenient to search and use, but may also put their storage security at risk; the storage outcome is poor, which hinders the long-term safe use of text archives.
Disclosure of Invention
To solve the above technical problem, the invention adopts the following technical scheme: an electronic archive management system comprising: a target archive text archive statistics module, which takes the archive that is to undergo electronic archive management as the target archive, counts the text archives stored in the target archive, and numbers them in order of their filing time points.
A text archive filing information acquisition module, which acquires the filing information of each text archive.
A text archive historical application parameter extraction module, which sets an application time period based on the filing time points of the text archives and extracts the historical application parameters of each text archive within that period.
A text archive application value analysis module, which analyzes the application value degree of each text archive from its historical application parameters.
A text archive storage value analysis module, which analyzes the preservation value degree of each text archive from its filing information.
A text archive typesetting requirement analysis module, which extracts the application purpose of each application record from the historical application parameters of each text archive and analyzes the typesetting requirement degree of each text archive from those purposes.
An archive management library, which stores the characteristic parameters of the various storage formats of text archives, the source importance indexes of the various formers, and the content importance indexes of the various filing categories.
A text archive adaptive storage format analysis module, which analyzes the adapted storage format of each text archive from its application value degree, preservation value degree and typesetting requirement degree, together with the characteristic parameters of the various storage formats.
A key text archive identification output terminal, which acquires the actual storage format of each text archive and compares it with the adapted storage format of that text archive; if the two are inconsistent, the text archive is recorded as a key text archive, and its number and adapted storage format are output in the background for the archive manager to modify.
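The comparison performed by the key text archive identification output terminal can be illustrated with a minimal Python sketch. The function name `find_key_archives`, the dictionary layout and the sample formats are assumptions made for illustration and do not come from the patent text.

```python
from typing import Dict, List, Tuple


def find_key_archives(
    actual: Dict[int, str], adapted: Dict[int, str]
) -> List[Tuple[int, str]]:
    """Return (archive number, adapted format) for every text archive whose
    actual storage format differs from its adapted storage format."""
    return [
        (number, adapted[number])
        for number in sorted(actual)
        if number in adapted and actual[number] != adapted[number]
    ]


# Hypothetical sample data: archive number -> storage format.
actual_formats = {1: "txt", 2: "pdf", 3: "doc"}
adapted_formats = {1: "pdf", 2: "pdf", 3: "txt"}
key_archives = find_key_archives(actual_formats, adapted_formats)
```

In this sample, archives 1 and 3 would be reported together with the format they should be converted to, while archive 2 is left alone.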
Applied to the above embodiment, the filing information includes the former (that is, the party that created the archive) and the filing category.
The application time period is set from the filing time points of the text archives as follows: the filing time points of the text archives are compared and the earliest one is selected as the initial application time point; the application ending time point is then calculated from the initial application time point and a preset time interval; and the period between the initial application time point and the application ending time point is set as the application time period.
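As a minimal sketch of this rule, the following Python snippet takes the earliest filing time point as the start and adds a preset interval to obtain the end. The function name `application_period`, the sample dates and the 365-day interval are illustrative assumptions only.

```python
from datetime import datetime, timedelta
from typing import Iterable, Tuple


def application_period(
    filing_times: Iterable[datetime], preset_interval: timedelta
) -> Tuple[datetime, datetime]:
    """Start at the earliest filing time point; end one preset interval later."""
    start = min(filing_times)
    return start, start + preset_interval


# Hypothetical filing time points of three text archives.
filing_times = [datetime(2020, 5, 1), datetime(2019, 3, 15), datetime(2021, 1, 10)]
start, end = application_period(filing_times, timedelta(days=365))
```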
Applied to the above embodiment, the historical application parameters include the number of application records, the interval duration between adjacent application records, the application duration corresponding to each application record, and the application purpose, where the application purpose includes information lookup or text printing.
Applied to the above embodiment, the application value degree of each text archive is analyzed in the following steps: the number of application records and the interval durations between adjacent application records are extracted from the historical application parameters of each text archive, and the application frequency $FU_i$ of each text archive is calculated from them. In the formula, $i$ is the text archive number, $i = 1, 2, \ldots, n$; $\Delta d_i^j$ is the interval duration between the $(j+1)$-th and the $j$-th application records of the $i$-th text archive; and $j$ is the application record number, $j = 1, 2, \ldots$
The application duration of each application record is then extracted from the historical application parameters of each text archive and combined with the application frequency of that text archive to calculate its application value degree $AV_i$. In the formula, $t_i^j$ is the application duration of the $j$-th application record of the $i$-th text archive, and $m$ is the number of application records.
Applied to the above embodiment, the preservation value degree of each text archive is analyzed in the following steps: the former is extracted from the filing information of each text archive and looked up among the source importance indexes stored for the various formers in the archive management library, giving the source importance index of each text archive.
The filing category is extracted from the filing information of each text archive and looked up among the content importance indexes stored for the various filing categories in the archive management library, giving the content importance index of each text archive.
The source importance index and the content importance index of each text archive are substituted into the preservation value degree analysis formula to obtain the preservation value degree $PV_i$ of each text archive. In the formula, $\eta_i$ and $\xi_i$ are the source importance index and the content importance index of the $i$-th text archive, respectively, and $U$ is a preset constant with $U > 1$.
Applied to the above embodiment, the typesetting requirement degree of each text archive is analyzed as follows: the application purposes of the application records of each text archive are compared with one another, application records with the same application purpose are grouped together, and the number of application records whose application purpose is text printing is counted for each text archive.
The typesetting requirement degree of each text archive is calculated from the number of its application records whose application purpose is text printing. In the calculation formula, $TR_i$ is the typesetting requirement degree of the $i$-th text archive, $x_i$ is the number of application records of the $i$-th text archive whose application purpose is text printing, and $m_i$ is the total number of application records of the $i$-th text archive.
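A minimal Python sketch of this step follows. The formula itself is not legible in this text, so the sketch assumes the simplest form consistent with the symbol definitions, namely the share of text-printing records among all application records ($x_i / m_i$); that ratio, the function name and the purpose strings are assumptions, not the patent's stated formula.

```python
from typing import List


def typesetting_requirement_degree(purposes: List[str]) -> float:
    """Assumed form: x_i / m_i, the fraction of application records whose
    purpose is text printing. Returns 0.0 when there are no records."""
    if not purposes:
        return 0.0
    printing = sum(1 for purpose in purposes if purpose == "text printing")
    return printing / len(purposes)


# Hypothetical application purposes of one text archive.
records = ["information lookup", "text printing", "text printing", "information lookup"]
tr = typesetting_requirement_degree(records)
```

Under this assumption the degree lies between 0 and 1 and grows with the number of text-printing records.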
Applied to the above embodiment, the characteristic parameters include a loader compatibility, a storage loss rate, and a text layout support rate.
Applied to the above embodiment, the analyzing the adapted storage format corresponding to each text archive specifically includes: and comparing the application value degree corresponding to each text file with the application value degree interval corresponding to each set application value grade, and determining the application value grade corresponding to each text file.
The application value level of each text archive is then looked up among the required loader compatibilities set for the application value levels, giving the required loader compatibility of each text archive.
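The two lookups just described (application value degree to level, then level to required loader compatibility) can be sketched in Python with an interval search. The interval boundaries, level names and compatibility values below are hypothetical placeholders; the patent only says that such intervals and values are set in advance.

```python
from bisect import bisect_right
from typing import Dict, List

# Hypothetical interval boundaries: [0, 10) low, [10, 50) medium, [50, inf) high.
LEVEL_BOUNDS: List[float] = [10.0, 50.0]
LEVELS: List[str] = ["low", "medium", "high"]

# Hypothetical required loader compatibility for each application value level.
REQUIRED_COMPATIBILITY: Dict[str, float] = {"low": 0.5, "medium": 0.7, "high": 0.9}


def application_value_level(av: float) -> str:
    """Find the level whose interval contains the application value degree."""
    return LEVELS[bisect_right(LEVEL_BOUNDS, av)]


def required_loader_compatibility(av: float) -> float:
    """Map the application value degree to its required loader compatibility."""
    return REQUIRED_COMPATIBILITY[application_value_level(av)]
```

The same interval lookup pattern would apply to the preservation value levels and their storage loss rate thresholds.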
The loader compatibility is extracted from the characteristic parameters, the required loader compatibility of each text archive is compared with the loader compatibility of each storage format, and the loader compatibility similarity index $\delta_{k \to i}$ of each storage format relative to each text archive is calculated. In the formula, $\delta_{k \to i}$ is the loader compatibility similarity index of the $k$-th storage format relative to the $i$-th text archive; $\lambda_k$ is the loader compatibility of the $k$-th storage format; $\lambda_i$ is the required loader compatibility of the $i$-th text archive; $\Delta\lambda$ is a preset reference loader compatibility contrast difference; and $k$ is the storage format number, $k = 1, 2, \ldots$
Comparing the loader compatibility similarity indexes of various storage formats relative to the text files with a set similarity index threshold, and screening out the storage formats of which the loader compatibility similarity indexes are larger than the set similarity index threshold from the text files as compatible storage formats corresponding to the text files to form a compatible storage format set corresponding to the text files.
And comparing the preservation value degree corresponding to each text file with the preservation value degree intervals corresponding to various set preservation value levels, and determining the preservation value level corresponding to each text file.
And matching the storage value level corresponding to each text file with the storage loss rate lower limit threshold corresponding to each set storage value level, and matching the storage loss rate lower limit threshold corresponding to each text file.
And extracting the storage loss rate from the characteristic parameters, comparing the lower limit threshold of the storage loss rate corresponding to each text archive with the storage loss rates corresponding to various storage formats to which the text archive belongs, screening out the storage formats meeting the lower limit threshold of the storage loss rate corresponding to each text archive, and taking the storage formats as the stable consistent storage formats corresponding to each text archive to form a stable consistent storage format set corresponding to each text archive.
The text typesetting support rate is extracted from the characteristic parameters, and the text typesetting support rate of each storage format is looked up among the typesetting requirement degrees that the set text typesetting support rates can satisfy, giving the typesetting requirement degree that each storage format can satisfy.
Comparing the typesetting requirement degree corresponding to each text file with the typesetting requirement degree which can be met by each storage format, if the typesetting requirement degree which can be met by a certain storage format is more than or equal to the typesetting requirement degree corresponding to a certain text file, taking the storage format as the typesetting conforming storage format corresponding to the text file, extracting the typesetting conforming storage format corresponding to each text file, and forming a typesetting conforming storage format set corresponding to each text file.
And determining the adaptive storage format corresponding to each text archive based on the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to each text archive.
Applied to the above embodiment, the adapted storage format of each text archive is determined as follows: the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set of each text archive are compared; if some consistent storage format of a text archive appears in all three sets, the text archive is treated as a common text archive; otherwise it is treated as a special text archive.
For a common text archive, the consistent storage formats that appear in all three sets are taken as its adapted storage formats.
For a special text archive, identical storage formats across its compatible storage format set, stable consistent storage format set and typesetting consistent storage format set are grouped together, and for each consistent storage format the occurrence frequency is counted, together with the related set type and the degree of consistency of each occurrence.
The dominant saliency of each consistent storage format in the special text archive is calculated from its occurrence frequency and from the related set type and degree of consistency of each occurrence. In the formula, the dominant saliency is that of the $r$-th consistent storage format in the special text archive; $r$ is the consistent storage format number, $r = 1, 2, \ldots$; the occurrence frequency is that of the $r$-th consistent storage format in the special text archive; the weight factor of the related set type and the degree of consistency $\mu_r^f$ are those of the $f$-th occurrence of the $r$-th consistent storage format; and $f$ is the occurrence number, $f = 1, 2, \ldots$
And comparing the dominant saliency corresponding to each consistent storage format in the special text archive, and extracting the consistent storage format corresponding to the maximum dominant saliency from the dominant saliency to serve as the adaptive storage format corresponding to the special text archive.
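The decision procedure above can be sketched in Python as a set intersection with a scored fallback. The dominant saliency formula is not legible in this text, so the sketch assumes a simple form: for each format, sum over its occurrences the weight factor of the related set type multiplied by the degree of consistency. That form, the weight values in `SET_WEIGHTS`, the default consistency of 1.0 and all names are assumptions for illustration.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical weight factor per related set type (not given in the source).
SET_WEIGHTS: Dict[str, float] = {"compatible": 0.4, "stable": 0.4, "typesetting": 0.2}


def adapted_formats(
    compatible: Set[str],
    stable: Set[str],
    typesetting: Set[str],
    consistency: Dict[Tuple[str, str], float],
) -> Tuple[str, List[str]]:
    """Classify a text archive and return its adapted storage format(s)."""
    # Common text archive: at least one format appears in all three sets.
    common = compatible & stable & typesetting
    if common:
        return "common", sorted(common)

    # Special text archive: score every format by an assumed saliency,
    # the sum over its occurrences of weight * degree of consistency.
    sets = {"compatible": compatible, "stable": stable, "typesetting": typesetting}
    saliency: Dict[str, float] = {}
    for set_type, members in sets.items():
        for fmt in members:
            degree = consistency.get((fmt, set_type), 1.0)
            saliency[fmt] = saliency.get(fmt, 0.0) + SET_WEIGHTS[set_type] * degree
    if not saliency:
        return "special", []
    # Ties are broken alphabetically so the result is deterministic.
    best = max(sorted(saliency), key=lambda fmt: saliency[fmt])
    return "special", [best]


common_result = adapted_formats({"pdf", "doc"}, {"pdf", "txt"}, {"pdf"}, {})
special_result = adapted_formats({"doc", "txt"}, {"txt", "pdf"}, {"pdf", "doc"}, {})
```

In the second call no format is in all three sets, so the archive is treated as special; with the assumed weights, txt scores highest because it appears in the two more heavily weighted sets.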
Compared with the prior art, the invention has the following advantages: 1. The invention acquires the filing information and historical application parameters of each text archive stored in the target archive, analyzes the adapted storage format of each text archive from them, and flags for modification every text archive whose actual storage format is inconsistent with its adapted storage format.
2. When the adapted storage format of each text archive is analyzed from its filing information and historical application parameters, the application value, the preservation value and the typesetting requirement of each text archive are analyzed and combined with the characteristic parameters of the various storage formats. This gives a multi-dimensional intelligent analysis of the adapted storage format, which makes the analysis result more accurate and reliable, widens the range of application, and gives the method greater practical value.
Drawings
The invention is further illustrated by means of the attached drawings, but the embodiments in the drawings do not constitute any limitation to the invention, and for a person skilled in the art, other drawings can be obtained on the basis of the following drawings without inventive effort.
FIG. 1 is a schematic diagram of the system connection of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, the present invention provides an electronic archive management system comprising a target archive text archive statistics module, a text archive filing information acquisition module, a text archive historical application parameter extraction module, a text archive application value analysis module, a text archive storage value analysis module, a text archive typesetting requirement analysis module, an archive management library, a text archive adaptive storage format analysis module and a key text archive identification output terminal. The target archive text archive statistics module is connected to the text archive filing information acquisition module and to the text archive historical application parameter extraction module. The text archive historical application parameter extraction module is connected to the text archive application value analysis module and to the text archive typesetting requirement analysis module. The text archive filing information acquisition module is connected to the text archive storage value analysis module. The text archive application value analysis module, the text archive storage value analysis module and the text archive typesetting requirement analysis module are each connected to the text archive adaptive storage format analysis module, which in turn is connected to the key text archive identification output terminal. The archive management library is connected to the text archive storage value analysis module and to the text archive adaptive storage format analysis module.
The target archive text archive statistics module takes the archive that is to undergo electronic archive management as the target archive, counts the text archives stored in the target archive, and numbers them in order of their filing time points.
The text archive archiving information acquisition module is used for acquiring archiving information of each text archive, wherein the archiving information comprises a former and an archiving category.
The text archive historical application parameter extraction module is used for setting an application time period based on the archiving time point corresponding to each text archive, and further extracting historical application parameters corresponding to each text archive in the set application time period, wherein the historical application parameters comprise the number of application records, the interval duration of adjacent application records, the application duration corresponding to each application record and application purposes, and the application purposes comprise information lookup or text printing.
In a specific embodiment of the present invention, the specific setting manner of setting the application time period is to compare the filing time points corresponding to the text files, select the earliest filing time point from the filing time points, use the earliest filing time point as the initial application time point, and calculate the application ending time point according to the initial application time point and the preset time interval, thereby setting the time period between the initial application time point and the application ending time point as the application time period.
The text archive application value analysis module is used for analyzing the application value degree corresponding to each text archive according to the historical application parameters corresponding to each text archive, and the method specifically refers to the following steps:
The number of application records and the interval durations between adjacent application records are extracted from the historical application parameters of each text archive, and the application frequency $FU_i$ of each text archive is calculated from them. In the formula, $i$ is the text archive number, $i = 1, 2, \ldots, n$; $\Delta d_i^j$ is the interval duration between the $(j+1)$-th and the $j$-th application records of the $i$-th text archive; and $j$ is the application record number, $j = 1, 2, \ldots$
In the application frequency calculation formula, the longer the interval duration of the adjacent application records corresponding to a certain text file is, the smaller the application frequency corresponding to the text file is.
The application duration of each application record is extracted from the historical application parameters of each text archive and combined with the application frequency of that text archive to calculate its application value degree $AV_i$. In the formula, $t_i^j$ is the application duration of the $j$-th application record of the $i$-th text archive, and $m$ is the number of application records.
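The two calculations can be illustrated with a Python sketch. The formulas are not legible in this text, so both expressions below are assumptions chosen only to respect what the text states: the application frequency falls as the intervals between adjacent records grow, and the application value degree combines the application durations with the application frequency. Here the frequency is taken as the number of records divided by the total interval duration, and the value degree as the frequency multiplied by the total application duration.

```python
from typing import List


def application_frequency(intervals: List[float]) -> float:
    """Assumed form: number of application records / total interval duration.
    `intervals` holds the durations between adjacent application records."""
    total = sum(intervals)
    if total <= 0:
        return 0.0
    return (len(intervals) + 1) / total


def application_value(intervals: List[float], durations: List[float]) -> float:
    """Assumed form: application frequency * total application duration."""
    return application_frequency(intervals) * sum(durations)


# Hypothetical archive with 4 application records: intervals in days,
# application durations in hours.
fu = application_frequency([10.0, 20.0, 10.0])
av = application_value([10.0, 20.0, 10.0], [1.0, 2.0, 1.0, 4.0])
```

Any other decreasing function of the interval durations would serve equally well as a stand-in; the sketch only shows how the extracted parameters feed the two quantities.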
The text archive storage value analysis module is used for analyzing the storage value degree corresponding to each text archive according to the archiving information corresponding to each text archive, and the method comprises the following steps: and extracting the formers from the filing information corresponding to each text file, matching the formers corresponding to each text file with the source importance indexes corresponding to various formers stored in the file management library, and matching the source importance indexes corresponding to each text file.
Illustratively, the formers mentioned above include, but are not limited to, official agencies, semi-official agencies, unofficial agencies and individuals.
And extracting the filing type from the filing information corresponding to each text file, matching the filing type corresponding to each text file with the content importance index corresponding to each filing type stored in the file management library, and matching the content importance index corresponding to each text file.
Illustratively, the above mentioned archival categories include, but are not limited to, scientific archives, military archives, and economic archives.
The source importance index and the content importance index of each text archive are substituted into the preservation value degree analysis formula to obtain the preservation value degree $PV_i$ of each text archive. In the formula, $\eta_i$ and $\xi_i$ are the source importance index and the content importance index of the $i$-th text archive, respectively, and $U$ is a preset constant with $U > 1$.
The text archive typesetting requirement analysis module extracts the application purpose of each application record from the historical application parameters of each text archive and analyzes the typesetting requirement degree of each text archive from those purposes, specifically as follows: the application purposes of the application records of each text archive are compared with one another, application records with the same application purpose are grouped together, and the number of application records whose application purpose is text printing is counted for each text archive.
The typesetting requirement degree of each text archive is calculated from the number of its application records whose application purpose is text printing. In the calculation formula, $TR_i$ is the typesetting requirement degree of the $i$-th text archive, $x_i$ is the number of application records of the $i$-th text archive whose application purpose is text printing, and $m_i$ is the total number of application records of the $i$-th text archive. The more application records of a text archive have text printing as their purpose, the greater the typesetting requirement degree of that text archive.
The archive management library is used for storing characteristic parameters corresponding to various storage formats to which the text archives belong, storing source importance indexes corresponding to various formers, and storing content importance indexes corresponding to various archiving categories.
The text archive adaptive storage format analysis module is used for analyzing the adaptive storage format corresponding to each text archive based on the application value degree, the storage value degree and the typesetting requirement degree corresponding to each text archive and the characteristic parameters corresponding to various storage formats to which the text archive belongs, wherein the characteristic parameters comprise the loading program compatibility degree, the storage loss rate and the text typesetting support rate.
In a specific embodiment of the present invention, analyzing the adaptive storage format corresponding to each text archive specifically includes: and comparing the application value degree corresponding to each text file with the application value degree interval corresponding to each set application value grade, and determining the application value grade corresponding to each text file.
And matching the application value grade corresponding to each text file with the compatibility of the demand loader corresponding to each set application value grade, and matching the compatibility of the demand loader corresponding to each text file.
The loader compatibility is extracted from the characteristic parameters, the required loader compatibility of each text archive is compared with the loader compatibility of each storage format, and the loader compatibility similarity index $\delta_{k \to i}$ of each storage format relative to each text archive is calculated. In the formula, $\delta_{k \to i}$ is the loader compatibility similarity index of the $k$-th storage format relative to the $i$-th text archive; $\lambda_k$ is the loader compatibility of the $k$-th storage format; $\lambda_i$ is the required loader compatibility of the $i$-th text archive; $\Delta\lambda$ is a preset reference loader compatibility contrast difference; and $k$ is the storage format number, $k = 1, 2, \ldots$ The closer the loader compatibility of a storage format is to the required loader compatibility of the text archive, the greater the loader compatibility similarity index.
Comparing the loader compatibility similarity indexes of various storage formats relative to the text files with a set similarity index threshold, and screening out the storage formats of which the loader compatibility similarity indexes are larger than the set similarity index threshold from the text files as compatible storage formats corresponding to the text files to form a compatible storage format set corresponding to the text files.
And comparing the preservation value degree corresponding to each text file with the preservation value degree intervals corresponding to various set preservation value levels, and determining the preservation value level corresponding to each text file.
And matching the storage value level corresponding to each text file with the storage loss rate lower limit threshold corresponding to each set storage value level, and matching the storage loss rate lower limit threshold corresponding to each text file.
And extracting the storage loss rate from the characteristic parameters, comparing the lower limit threshold of the storage loss rate corresponding to each text archive with the storage loss rates corresponding to various storage formats to which the text archive belongs, screening out the storage formats meeting the lower limit threshold of the storage loss rate corresponding to each text archive, and taking the storage formats as the stable consistent storage formats corresponding to each text archive to form a stable consistent storage format set corresponding to each text archive.
The text typesetting support rate is extracted from the characteristic parameters. The text typesetting support rate corresponding to each storage format to which the text archive belongs is matched against the typesetting requirement degree that each set text typesetting support rate can satisfy, thereby obtaining the typesetting requirement degree that each storage format can satisfy.
The typesetting requirement degree corresponding to each text archive is compared with the typesetting requirement degree that each storage format can satisfy. If the typesetting requirement degree that a storage format can satisfy is greater than or equal to the typesetting requirement degree of a text archive, that storage format is taken as a typesetting consistent storage format of the text archive. The typesetting consistent storage formats of each text archive are collected to form the typesetting consistent storage format set corresponding to each text archive.
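The comparison above can be sketched as a "greater than or equal" filter. This is a minimal Python sketch; the function name `typesetting_consistent_formats`, the data layout, the format names and the numbers are hypothetical illustrations rather than values from the patent.

```python
def typesetting_consistent_formats(requirement, satisfiable):
    """Return, per text archive, the storage formats whose satisfiable
    typesetting requirement degree is >= the archive's requirement degree.

    requirement: {archive_id: typesetting requirement degree}
    satisfiable: {format_name: requirement degree the format can satisfy}
    """
    return {
        archive: {fmt for fmt, cap in satisfiable.items() if cap >= need}
        for archive, need in requirement.items()
    }


# Illustrative values only.
requirement = {1: 0.8, 2: 0.1}
satisfiable = {"pdf": 0.9, "docx": 0.8, "txt": 0.1}
typeset_sets = typesetting_consistent_formats(requirement, satisfiable)
```

Equality counts as a match here, so a format that can satisfy exactly the required degree is included.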
The adaptive storage format corresponding to each text archive is determined from the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to that text archive. The specific steps are as follows: the three sets corresponding to each text archive are compared; if some consistent storage format of a text archive appears in all of the sets, the text archive is taken as a common text archive; otherwise, the text archive is taken as a special text archive.
For a common text archive, the consistent storage format that appears in all of the sets is taken as the adaptive storage format corresponding to the common text archive.
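The split into common and special text archives can be sketched with a set intersection. This minimal Python sketch assumes that "appears in the sets" means a format is present in all three sets at once; the function name `classify_archive` and the example format names are hypothetical.

```python
def classify_archive(compatible, stable, typeset):
    """Classify one text archive from its three consistent-format sets.

    Returns ("common", formats found in all three sets) when the
    intersection is non-empty, otherwise ("special", empty set).
    Assumption: a 'common' archive requires presence in all three sets.
    """
    shared = compatible & stable & typeset
    if shared:
        return "common", shared
    return "special", set()


kind_a, formats_a = classify_archive({"pdf", "docx"}, {"pdf"}, {"pdf", "txt"})
kind_b, formats_b = classify_archive({"docx"}, {"pdf"}, {"pdf", "txt"})
```

In the first call one format is shared by all three sets, so the archive is common; in the second call no format is shared, so the archive is special and needs the saliency-based selection.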
For a special text archive, identical storage formats in the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to the special text archive are grouped together, and the number of occurrences of each consistent storage format, together with the related set type and the conformity of each occurrence, is counted. The related set type is the compatible storage format set, the stable consistent storage format set or the typesetting consistent storage format set.
The advantage saliency of each consistent storage format in the special text archive is then calculated from its number of occurrences and from the related set type and conformity of each occurrence, where $r$ denotes the consistent storage format number, $r = 1, 2, \ldots, w$; $y_r$ denotes the number of occurrences of the $r$-th consistent storage format in the special text archive; $\mu_r^f$ denotes the conformity of the $r$-th consistent storage format at its $f$-th occurrence, which is used together with the weight factor of the related set type of that occurrence; $f$ denotes the occurrence number, $f = 1, 2, \ldots, y_r$; and $\alpha$ denotes a preset correction factor.
In a further preferred embodiment, the conformity of each occurrence is obtained as follows. A basic value and a reference value are identified for each occurrence of each consistent storage format. For example, if a consistent storage format occurs in the compatible storage format set, the basic value of that occurrence is the loader compatibility similarity index, and the reference value is the set similarity index threshold.
As a further example, if a consistent storage format occurs in the stable consistent storage format set, the basic value of that occurrence is the storage loss rate, and the reference value is the storage loss rate lower limit threshold.
As a further example, if a consistent storage format occurs in the typesetting consistent storage format set, the basic value of that occurrence is the typesetting requirement degree that the format can satisfy, and the reference value is the typesetting requirement degree of the text archive.
The basic value and the reference value corresponding to each occurrence of each consistent storage format are substituted into the conformity calculation formula to obtain the conformity of each occurrence of each consistent storage format.
The advantage saliencies of the consistent storage formats in the special text archive are compared, and the consistent storage format with the maximum advantage saliency is taken as the adaptive storage format corresponding to the special text archive.
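The occurrence counting and the final selection for a special text archive can be sketched as follows. This is a minimal Python sketch: the saliency values are taken as given inputs rather than computed, and the function names `occurrence_counts` and `pick_adaptive_format`, the format names and the numbers are all hypothetical.

```python
from collections import Counter


def occurrence_counts(compatible, stable, typeset):
    """Count in how many of the three sets each storage format occurs."""
    counts = Counter()
    for format_set in (compatible, stable, typeset):
        counts.update(format_set)  # each format adds 1 per set it occurs in
    return counts


def pick_adaptive_format(saliency):
    """Return the storage format with the maximum advantage saliency.

    saliency: {format_name: advantage saliency}, assumed precomputed.
    """
    return max(saliency, key=saliency.get)


counts = occurrence_counts({"pdf", "docx"}, {"pdf"}, {"txt"})
chosen = pick_adaptive_format({"pdf": 1.4, "docx": 0.6, "txt": 0.5})
```

If two formats tie for the maximum, `max` returns whichever it encounters first; a real system would need an explicit tie-breaking rule, which the text does not specify.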
In this embodiment of the invention, the adaptive storage format corresponding to each text archive is analyzed from the filing information and the historical application parameters of the text archive: the application value, the preservation value and the typesetting requirement of each text archive are analyzed and combined with the characteristic parameters of the storage formats to which the text archive belongs. This gives a multi-dimensional analysis of the adaptive storage format, so that the analysis result is more accurate and reliable, has a wider range of application, and has higher practical value.
The key text archive identification output terminal acquires the actual storage format corresponding to each text archive and compares it with the adaptive storage format corresponding to that text archive. If the actual storage format of a text archive is inconsistent with its adaptive storage format, the text archive is recorded as a key text archive, and the number of the key text archive and its adaptive storage format are output in the background for modification by an archive manager.
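The comparison performed by the output terminal can be sketched as a dictionary filter. This minimal Python sketch assumes one adaptive storage format per text archive; the function name `key_archives`, the data layout and the sample values are hypothetical.

```python
def key_archives(actual, adaptive):
    """Return {archive_number: adaptive_format} for every text archive whose
    actual storage format differs from its adaptive storage format.

    actual, adaptive: {archive_number: format_name} (assumed layout)
    """
    return {
        number: adaptive[number]
        for number, fmt in actual.items()
        if fmt != adaptive[number]
    }


# Illustrative values only.
actual = {1: "docx", 2: "pdf", 3: "txt"}
adaptive = {1: "pdf", 2: "pdf", 3: "docx"}
flagged = key_archives(actual, adaptive)
```

The result pairs each flagged archive number with the format it should be converted to, which is the information the text says is presented to the archive manager.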
The invention acquires the filing information and the historical application parameters of each text archive stored in the target archive, analyzes the adaptive storage format corresponding to each text archive from the acquired filing information and historical application parameters, and flags for modification every text archive whose actual storage format is inconsistent with its adaptive storage format.
The foregoing is merely exemplary and illustrative of the present invention and various modifications, additions and substitutions may be made by those skilled in the art to the specific embodiments described without departing from the scope of the invention as defined in the following claims.
Claims (10)
1. An electronic archive management system, comprising:
the target archive text archive counting module is used for taking an archive to be subjected to electronic archive management as a target archive, further counting the number of text archives stored in the target archive, and numbering the text archives according to the sequence of filing time points;
the text archive filing information acquisition module is used for acquiring filing information of each text archive;
the text archive historical application parameter extraction module is used for setting an application time period based on the archiving time point corresponding to each text archive, and further extracting the historical application parameters corresponding to each text archive in the set application time period;
the text file application value analysis module is used for analyzing the application value degree corresponding to each text file according to the historical application parameters corresponding to each text file;
the text archive storage value analysis module is used for analyzing the storage value degree corresponding to each text archive according to the filing information corresponding to each text archive;
the text file typesetting requirement analysis module is used for extracting application purposes corresponding to each application from historical application parameters corresponding to each text file and analyzing the typesetting requirement degree corresponding to each text file according to the application purposes;
the archive management library is used for storing characteristic parameters corresponding to various storage formats to which the text archives belong, storing source importance indexes corresponding to various creators, and storing content importance indexes corresponding to various archiving categories;
the text archive adaptive storage format analysis module is used for analyzing the adaptive storage format corresponding to each text archive based on the application value degree, the storage value degree and the typesetting requirement degree corresponding to each text archive and the characteristic parameters corresponding to various storage formats to which the text archive belongs;
and the key text archive identification output terminal is used for acquiring the actual storage format corresponding to each text archive, comparing the actual storage format with the adaptive storage format corresponding to each text archive, recording the text archive as a key text archive if the actual storage format corresponding to a certain text archive is inconsistent with the adaptive storage format corresponding to the text archive, and further outputting the serial number of the key text archive and the adaptive storage format corresponding to the key text archive in a background for modification by an archive manager.
2. An electronic archive management system according to claim 1, characterized in that: the filing information includes a creator and an archiving category.
3. An electronic archive management system according to claim 1, characterized in that: the specific setting mode for setting the application time period based on the archiving time points corresponding to the text archives is to compare the archiving time points corresponding to the text archives, screen out the earliest archiving time point from the archiving time points, use the earliest archiving time point as the initial application time point, further calculate the application ending time point according to the initial application time point and the preset time interval, and set the time period between the initial application time point and the application ending time point as the application time period.
4. An electronic archive management system according to claim 1, characterized in that: the historical application parameters comprise the number of application records, the interval duration of adjacent application records, the application duration corresponding to each application record and an application purpose, wherein the application purpose comprises information lookup or text printing.
5. An electronic archive management system according to claim 4, characterized in that: the analysis of the application value degree corresponding to each text archive specifically refers to the following steps:
extracting the number of application records and the interval durations between adjacent application records from the historical application parameters corresponding to each text archive, and calculating therefrom the application frequency $FU_i$ corresponding to each text archive, where $i$ denotes the text archive number, $i = 1, 2, \ldots, n$; $\Delta d_i^j$ denotes the interval duration between the $(j+1)$-th application record and the $j$-th application record corresponding to the $i$-th text archive; $j$ denotes the application record number, $j = 1, 2, \ldots, m$; $T$ denotes the duration of the set application time period; and $e$ denotes the natural constant;
extracting the application duration corresponding to each application record from the historical application parameters corresponding to each text archive, and calculating the application value degree $AV_i$ corresponding to each text archive by combining these application durations with the application frequency corresponding to the text archive, where $t_i^j$ denotes the application duration of the $j$-th application record corresponding to the $i$-th text archive and $m$ denotes the number of application records.
6. An electronic archive management system according to claim 2, characterized in that: the analysis of the corresponding preservation value degree of each text archive refers to the following steps:
extracting the creator from the filing information corresponding to each text archive, and matching the creator corresponding to each text archive against the source importance indexes corresponding to the various creators stored in the archive management library, thereby obtaining the source importance index corresponding to each text archive;
extracting the archiving category from the filing information corresponding to each text archive, and matching the archiving category corresponding to each text archive against the content importance indexes corresponding to the various archiving categories stored in the archive management library, thereby obtaining the content importance index corresponding to each text archive;
substituting the source importance index and the content importance index corresponding to each text archive into the preservation value degree analysis formula to obtain the preservation value degree $PV_i$ corresponding to each text archive, where $\eta_i$ and $\xi_i$ denote the source importance index and the content importance index corresponding to the $i$-th text archive, respectively, and $U$ denotes a preset constant with $U > 1$.
7. An electronic archive management system according to claim 4, characterized in that: the analyzing of the typesetting requirement degree corresponding to each text archive specifically comprises:
comparing the application purposes of the application records corresponding to each text archive with each other, grouping the application records of each text archive that share the same application purpose, and counting the number of application records of each text archive whose application purpose is text printing;
calculating the typesetting requirement degree $TR_i$ corresponding to each text archive from the number of application records of that text archive whose application purpose is text printing, where $TR_i$ denotes the typesetting requirement degree corresponding to the $i$-th text archive, $x_i$ denotes the number of application records of the $i$-th text archive whose application purpose is text printing, and $m_i$ denotes the number of application records corresponding to the $i$-th text archive.
8. An electronic archive management system according to claim 1, characterized in that: the characteristic parameters comprise the compatibility of a loading program, the storage loss rate and the text typesetting support rate.
9. An electronic archive management system according to claim 8, characterized in that: the analysis of the adaptive storage format corresponding to each text archive specifically comprises:
comparing the application value degree corresponding to each text file with the application value degree interval corresponding to each set application value grade, and determining the application value grade corresponding to each text file;
matching the application value grade corresponding to each text archive against the required loader compatibility set for each application value grade, thereby obtaining the required loader compatibility corresponding to each text archive;
extracting the loader compatibility from the characteristic parameters, comparing the required loader compatibility corresponding to each text archive with the loader compatibility corresponding to each storage format to which the text archive belongs, and calculating the loader compatibility similarity index $\delta_{k\to i}$ of each storage format relative to each text archive, where $\delta_{k\to i}$ denotes the loader compatibility similarity index of the $k$-th storage format relative to the $i$-th text archive, $\lambda_k$ denotes the loader compatibility corresponding to the $k$-th storage format to which the text archive belongs, $\lambda_i$ denotes the required loader compatibility corresponding to the $i$-th text archive, $\Delta\lambda$ denotes a preset reference loader compatibility contrast difference, and $k$ denotes the storage format number, $k = 1, 2, \ldots, z$;
comparing the loader compatibility similarity indexes of various storage formats relative to the text files with a set similarity index threshold, and screening out the storage formats, in which the loader compatibility similarity indexes are greater than the set similarity index threshold, of the text files as compatible storage formats corresponding to the text files to form a compatible storage format set corresponding to the text files;
comparing the preservation value degree corresponding to each text file with the preservation value degree intervals corresponding to various set preservation value levels, and determining the preservation value level corresponding to each text file;
matching the preservation value level corresponding to each text archive against the storage loss rate lower limit threshold set for each preservation value level, thereby obtaining the storage loss rate lower limit threshold corresponding to each text archive;
extracting a storage loss rate from the characteristic parameters, comparing the lower limit threshold of the storage loss rate corresponding to each text archive with the storage loss rates corresponding to various storage formats to which the text archive belongs, screening out the storage formats meeting the lower limit threshold of the storage loss rate corresponding to each text archive, and taking the storage formats as the stable consistent storage formats corresponding to each text archive to form a stable consistent storage format set corresponding to each text archive;
extracting the text typesetting support rate from the characteristic parameters, and matching the text typesetting support rate corresponding to each storage format to which the text archive belongs against the typesetting requirement degree that each set text typesetting support rate can satisfy, thereby obtaining the typesetting requirement degree that each storage format can satisfy;
comparing the typesetting requirement degree corresponding to each text archive with the typesetting requirement degree that each storage format can satisfy, and, if the typesetting requirement degree that a storage format can satisfy is greater than or equal to the typesetting requirement degree corresponding to a text archive, taking that storage format as a typesetting consistent storage format of the text archive, so as to form the typesetting consistent storage format set corresponding to each text archive;
and determining the adaptive storage format corresponding to each text archive based on the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to each text archive.
10. An electronic archive management system according to claim 9, characterized by: the specific determination of the adaptive storage format corresponding to each text archive is as follows:
comparing a compatible storage format set, a stable consistent storage format set and a typesetting consistent storage format set corresponding to each text archive, wherein if a certain consistent storage format corresponding to a certain text archive appears in the sets, the text archive is used as a common text archive, otherwise, the text archive is used as a special text archive;
for the common text archives, the consistent storage formats appearing in the sets are used as the adaptive storage formats corresponding to the common text archives;
for the special text archives, classifying the same storage formats in the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to the special text archives, and counting the occurrence frequency corresponding to each consistent storage format and the related set type and the consistency corresponding to each occurrence;
calculating the advantage saliency of each consistent storage format in the special text archive from its number of occurrences and from the related set type and conformity of each occurrence, where $r$ denotes the consistent storage format number, $r = 1, 2, \ldots, w$; $y_r$ denotes the number of occurrences of the $r$-th consistent storage format in the special text archive; $\mu_r^f$ denotes the conformity of the $r$-th consistent storage format at its $f$-th occurrence, which is used together with the weight factor of the related set type of that occurrence; $f$ denotes the occurrence number, $f = 1, 2, \ldots, y_r$; and $\alpha$ denotes a preset correction factor;
and comparing the advantage saliency corresponding to each consistent storage format in the special text archive, and extracting the consistent storage format corresponding to the maximum advantage saliency from the comparison as the adaptive storage format corresponding to the special text archive.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211541720.XA CN115878559A (en) | 2022-12-02 | 2022-12-02 | Electronic file management system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115878559A true CN115878559A (en) | 2023-03-31 |
Family
ID=85765677
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211541720.XA Pending CN115878559A (en) | 2022-12-02 | 2022-12-02 | Electronic file management system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115878559A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116681261A (en) * | 2023-07-27 | 2023-09-01 | 山东创亿智慧信息科技发展有限责任公司 | Intelligent archive management control system |
CN116681261B (en) * | 2023-07-27 | 2023-10-17 | 山东创亿智慧信息科技发展有限责任公司 | Intelligent archive management control system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||