CN115878559A - Electronic file management system - Google Patents

Electronic file management system Download PDF

Info

Publication number
CN115878559A
CN115878559A CN202211541720.XA CN202211541720A CN115878559A CN 115878559 A CN115878559 A CN 115878559A CN 202211541720 A CN202211541720 A CN 202211541720A CN 115878559 A CN115878559 A CN 115878559A
Authority
CN
China
Prior art keywords
text
archive
application
storage format
storage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211541720.XA
Other languages
Chinese (zh)
Inventor
李娅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202211541720.XA priority Critical patent/CN115878559A/en
Publication of CN115878559A publication Critical patent/CN115878559A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the technical field of electronic archive management, and discloses an electronic archive management system which comprises a target archive text archive statistics module, a text archive filing information acquisition module, a text archive historical application parameter extraction module, a text archive application value analysis module, a text archive storage value analysis module, a text archive typesetting requirement analysis module, an archive management library, a text archive adaptive storage format analysis module and a key text archive identification output terminal.

Description

Electronic file management system
Technical Field
The invention relates to the technical field of electronic archive management, in particular to a text archive management technology, and specifically relates to an electronic archive management system.
Background
The archives are original records of various characters, images, sound images and other forms which have preservation value and are formed in social activities of countries, organizations, social organizations and individuals, and have functions and functions irreplaceable with other data. The record carrier of traditional archives is mainly paper, and under the continuous development of science and technology, the mode of archives preservation also changes the storage mode of electronization into from single paper preservation, forms electronic file, and electronic file relies on its advantage of being convenient for to store, duplicate, transmit, makes its daily file management who adapts to the archives of present era more.
The paper form dominates the presentation form of the whole record carrier, so that the existing electronic archives stored in archives have the largest text archive ratio, and the storage formats of texts in computer systems are various, such as txt, doc, pdf, etc., and the various storage formats have their own advantages and disadvantages. In such a situation, currently, an archive needs to select one of several text storage formats when storing the text archive.
However, in the prior art, the selection of the storage format of the text archive is basically determined subjectively by the archives or a unified default storage format is selected for saving the storage space, and due to the lack of objective scientific selection basis, the selection mode is difficult to adapt to the targeted storage requirement of the text archive, so that the adaptation degree of the selection result is not high, which not only brings inconvenience to the search and application of the text archive, but also may bring certain risk potential to the storage safety of the text archive, and the storage effect of the text archive is not good, which is not favorable for the permanent safety application of the text archive.
Disclosure of Invention
In order to solve the technical problems, the invention is realized by the following technical scheme: an electronic archive management system comprising: and the target archive text archive counting module is used for taking the archive to be subjected to electronic archive management as a target archive, counting the number of the text archives stored in the target archive, and numbering the text archives according to the sequence of the filing time points.
And the text archive filing information acquisition module is used for acquiring filing information of each text archive.
And the text archive historical application parameter extraction module is used for setting an application time period based on the filing time point corresponding to each text archive, and further extracting the historical application parameters corresponding to each text archive in the set application time period.
And the text file application value analysis module is used for analyzing the application value degree corresponding to each text file according to the historical application parameters corresponding to each text file.
And the text archive storage value analysis module is used for analyzing the storage value degree corresponding to each text archive according to the filing information corresponding to each text archive.
And the text file typesetting requirement analysis module is used for extracting the application purpose corresponding to each application from the historical application parameters corresponding to each text file and analyzing the typesetting requirement degree corresponding to each text file according to the application purpose.
The file management library is used for storing characteristic parameters corresponding to various storage formats to which the text files belong, storing source importance indexes corresponding to various formers, and storing content importance indexes corresponding to various filing types.
And the text archive adaptive storage format analysis module is used for analyzing the adaptive storage format corresponding to each text archive based on the application value degree, the storage value degree and the typesetting requirement degree corresponding to each text archive and the characteristic parameters corresponding to various storage formats to which the text archive belongs.
And the key text archive identification output terminal is used for acquiring the actual storage format corresponding to each text archive, comparing the actual storage format with the adaptive storage format corresponding to each text archive, recording the text archive as a key text archive if the actual storage format corresponding to a certain text archive is inconsistent with the adaptive storage format corresponding to the text archive, and further outputting the serial number of the key text archive and the adaptive storage format corresponding to the key text archive in a background for modification by an archive manager.
As applied to the above embodiment, the archive information includes a creator and an archive category.
The specific setting manner of setting the application time period based on the filing time points corresponding to the text files is to compare the filing time points corresponding to the text files, screen out the earliest filing time point from the comparison, use the earliest filing time point as the initial application time point, further calculate the application ending time point according to the initial application time point and the preset time interval, and set the time period between the initial application time point and the application ending time point as the application time period.
Applied to the above embodiment, the historical application parameters include the number of application records, the interval duration between adjacent application records, the application duration corresponding to each application record, and the application purpose, where the application purpose includes information lookup or text printing.
Applied to the above embodiment, the analyzing the application value degree corresponding to each text archive specifically refers to the following steps: extracting the application record quantity and the interval duration of adjacent application records from the historical application parameters corresponding to each text file, and calculating the application frequency FU corresponding to each text file according to the application record quantity and the interval duration i Where i is represented as a text archive number, i =1,2, · n,
Figure BDA0003977996280000031
Δd i j the time lengths of the interval between the j +1 th application record and the j th application record corresponding to the ith text file are respectively expressed, j is an application record number, j =1,2.
Extracting the application duration corresponding to each application record from the historical application parameters corresponding to each text file, and calculating the application value degree AV corresponding to each text file by combining the application duration corresponding to each text file with the application frequency degree corresponding to each text file i Wherein
Figure BDA0003977996280000041
t i j represents the application duration of the ith text file corresponding to the jth application record, and m represents the number of the application records.
For the embodiment, the analysis of the preservation value degree corresponding to each text archive refers to the following steps: and extracting the formers from the filing information corresponding to each text file, and further matching the formers corresponding to each text file with the source importance indexes corresponding to various formers stored in the file management library, so as to match the source importance indexes corresponding to each text file.
And extracting the filing type from the filing information corresponding to each text file, matching the filing type corresponding to each text file with the content importance index corresponding to each filing type stored in the file management library, and matching the content importance index corresponding to each text file.
Substituting the source importance index and the content importance index corresponding to each text file into a preservation value degree analysis formula
Figure BDA0003977996280000042
Analyzing the corresponding storage value PV of each text archive i ,η i 、ξ i Respectively expressed as a source importance index and a content importance index corresponding to the ith text file, U is expressed as a preset constant, and U is expressed as a>1。
Applied to the above embodiment, the analyzing the layout requirement degree corresponding to each text archive specifically includes: and comparing the application purposes of the application records corresponding to the text files with each other, classifying the application records corresponding to the same application purpose in the text files, and counting the number of the application records existing in the text printing in the text files.
Calculating the typesetting requirement degree corresponding to each text file according to the number of application records existing in the text printing in each text file, wherein the calculation formula is
Figure BDA0003977996280000051
Wherein TR i Expressed as the corresponding typesetting demand degree, x, of the ith text file i Number of application records, m, existing for printing of text in ith text file i Expressed as the number of application records corresponding to the ith text archive.
Applied to the above embodiment, the characteristic parameters include a loader compatibility, a storage loss rate, and a text layout support rate.
Applied to the above embodiment, the analyzing the adapted storage format corresponding to each text archive specifically includes: and comparing the application value degree corresponding to each text file with the application value degree interval corresponding to each set application value grade, and determining the application value grade corresponding to each text file.
And matching the application value grade corresponding to each text file with the compatibility of the demand loader corresponding to each set application value grade, and matching the compatibility of the demand loader corresponding to each text file.
Extracting the compatibility of the loader from the characteristic parameters, comparing the compatibility of the required loader corresponding to each text file with the compatibility of the loader corresponding to each storage format of the text file, and calculating the similarity index of the compatibility of the loader corresponding to each text file in each storage format
Figure BDA0003977996280000052
Wherein delta k→i Expressed as loader compatibility similarity index, lambda, of the kth storage format relative to the ith text file k Expressed as the loader compatibility, lambda, corresponding to the kth storage format to which the text file belongs i The required loader compatibility corresponding to the ith text file is represented, Δ λ is represented as a preset reference loader compatibility contrast difference, k is represented as the number of various storage formats to which the text file belongs, and k =1,2.
Comparing the loader compatibility similarity indexes of various storage formats relative to the text files with a set similarity index threshold, and screening out the storage formats of which the loader compatibility similarity indexes are larger than the set similarity index threshold from the text files as compatible storage formats corresponding to the text files to form a compatible storage format set corresponding to the text files.
And comparing the preservation value degree corresponding to each text file with the preservation value degree intervals corresponding to various set preservation value levels, and determining the preservation value level corresponding to each text file.
And matching the storage value level corresponding to each text file with the storage loss rate lower limit threshold corresponding to each set storage value level, and matching the storage loss rate lower limit threshold corresponding to each text file.
And extracting the storage loss rate from the characteristic parameters, comparing the lower limit threshold of the storage loss rate corresponding to each text archive with the storage loss rates corresponding to various storage formats to which the text archive belongs, screening out the storage formats meeting the lower limit threshold of the storage loss rate corresponding to each text archive, and taking the storage formats as the stable consistent storage formats corresponding to each text archive to form a stable consistent storage format set corresponding to each text archive.
And extracting the text typesetting support rate from the characteristic parameters, further matching the text typesetting support rate corresponding to each storage format to which the text archive belongs with the typesetting requirement degree which can be met by the set text typesetting support rate, and matching the typesetting requirement degree which can be met by each storage format from the text typesetting support rate.
Comparing the typesetting requirement degree corresponding to each text file with the typesetting requirement degree which can be met by each storage format, if the typesetting requirement degree which can be met by a certain storage format is more than or equal to the typesetting requirement degree corresponding to a certain text file, taking the storage format as the typesetting conforming storage format corresponding to the text file, extracting the typesetting conforming storage format corresponding to each text file, and forming a typesetting conforming storage format set corresponding to each text file.
And determining the adaptive storage format corresponding to each text archive based on the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to each text archive.
Applied to the above embodiment, the determining of the adapted storage format corresponding to each text archive is specifically as follows: comparing the compatible storage format set, the stable storage format set and the typesetting storage format set corresponding to each text archive, if a certain corresponding storage format corresponding to a certain text archive appears in the sets, using the text archive as a common text archive, otherwise, using the text archive as a special text archive.
For the common text archive, the corresponding storage formats appearing in the sets are used as the adaptive storage formats corresponding to the common text archive.
For the special text archives, the compatible storage format set, the stable storage format set and the typesetting storage format set corresponding to the special text archives are classified, and the occurrence frequency corresponding to each corresponding storage format and the related set type and the corresponding degree of conformity corresponding to each occurrence are counted.
Calculating the corresponding advantage saliency of each corresponding storage format in the special text archive according to the corresponding frequency of occurrence of each corresponding storage format and the related set type and the corresponding conformity of each occurrence
Figure BDA0003977996280000081
Wherein->
Figure BDA0003977996280000082
The storage format is expressed as the corresponding dominant saliency of the r-th consistent storage format in the special text file, r is expressed as the consistent storage format number, r =1,2 r Expressed as the occurrence frequency corresponding to the matching storage format in the r-th text file, is/are>
Figure BDA0003977996280000083
μ r f represents the weight factor and the consistency of the corresponding storage format of the r-th consistent storage format in the special text file corresponding to the related set type at the f-th occurrence, f represents the number of each occurrence, and f =1,2.
And comparing the dominant saliency corresponding to each consistent storage format in the special text archive, and extracting the consistent storage format corresponding to the maximum dominant saliency from the dominant saliency to serve as the adaptive storage format corresponding to the special text archive.
Compared with the prior art, the invention has the following advantages: 1. the method and the device have the advantages that the archiving information and the historical application parameters of each text archive stored in the target archive are acquired, the adaptive storage format corresponding to each text archive is analyzed according to the acquired archiving information and the historical application parameters, and the text archive with the actual storage format inconsistent with the adaptive storage format is subjected to key identification and modification.
2. When the adaptive storage format corresponding to each text archive is analyzed according to the text archive filing information and the historical application parameters, the multi-dimensional intelligent analysis of the adaptive storage format of the text archive is realized by analyzing the application value, the storage value and the typesetting requirement corresponding to each text archive and combining the characteristic parameters corresponding to various storage formats to which the text archive belongs, so that the analysis result is more accurate and reliable, the application range is wider, and the method has higher available value.
Drawings
The invention is further illustrated by means of the attached drawings, but the embodiments in the drawings do not constitute any limitation to the invention, and for a person skilled in the art, other drawings can be obtained on the basis of the following drawings without inventive effort.
FIG. 1 is a schematic diagram of the system connection of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, the present invention provides an electronic archive management system, which comprises a target archive text archive statistics module, a text archive archiving information acquisition module, a text archive historical application parameter extraction module, a text archive application value analysis module, a text archive storage value analysis module, a text archive typesetting requirement analysis module, an archive management library, a text archive adaptive storage format analysis module and a key text archive identification output terminal, wherein the target archive text archive statistics module is respectively connected with the text archive archiving information acquisition module and the text archive historical application parameter extraction module, the text archive historical application parameter extraction module is respectively connected with the text archive application value analysis module and the text archive typesetting requirement analysis module, the text archive historical application parameter extraction module is connected with the text archive storage value analysis module, the text archive application value analysis module, the text archive storage value analysis module and the text archive typesetting requirement analysis module, the text archive adaptive storage format analysis module is connected with the key text archive identification output terminal, and the archive management library is respectively connected with the text archive storage value analysis module and the text archive adaptive storage format analysis module.
The target archive text archive counting module is used for taking an archive to be subjected to electronic archive management as a target archive, further counting the number of the text archives stored in the target archive, and numbering the text archives according to the sequence of filing time points.
The text archive archiving information acquisition module is used for acquiring archiving information of each text archive, wherein the archiving information comprises a former and an archiving category.
The text archive historical application parameter extraction module is used for setting an application time period based on the archiving time point corresponding to each text archive, and further extracting historical application parameters corresponding to each text archive in the set application time period, wherein the historical application parameters comprise the number of application records, the interval duration of adjacent application records, the application duration corresponding to each application record and application purposes, and the application purposes comprise information lookup or text printing.
In a specific embodiment of the present invention, the specific setting manner of setting the application time period is to compare the filing time points corresponding to the text files, select the earliest filing time point from the filing time points, use the earliest filing time point as the initial application time point, and calculate the application ending time point according to the initial application time point and the preset time interval, thereby setting the time period between the initial application time point and the application ending time point as the application time period.
The text archive application value analysis module is used for analyzing the application value degree corresponding to each text archive according to the historical application parameters corresponding to each text archive, and the method specifically refers to the following steps:
extracting the application record quantity and the interval duration of adjacent application records from the historical application parameters corresponding to each text file, and calculating the application frequency FU corresponding to each text file according to the application record quantity and the interval duration i Where i is represented as a text archive number, i =1,2, · n,
Figure BDA0003977996280000111
Δd i j the time length of the interval between the j +1 th application record and the j th application record corresponding to the ith text file is respectively represented, j is represented as an application record number, j =1,2.
In the application frequency calculation formula, the longer the interval duration of the adjacent application records corresponding to a certain text file is, the smaller the application frequency corresponding to the text file is.
Extracting application duration corresponding to each application record from historical application parameters corresponding to each text file, and calculating application value degree AV corresponding to each text file by combining the application duration corresponding to each text file with application frequency corresponding to each text file i Wherein
Figure BDA0003977996280000112
tij represents the application duration of the jth application record corresponding to the ith text file, and m represents the number of the application records.
The text archive storage value analysis module is used for analyzing the storage value degree corresponding to each text archive according to the archiving information corresponding to each text archive, and the method comprises the following steps: and extracting the formers from the filing information corresponding to each text file, matching the formers corresponding to each text file with the source importance indexes corresponding to various formers stored in the file management library, and matching the source importance indexes corresponding to each text file.
Illustratively, the aforementioned formers include, but are not limited to, official agencies, semi-official agencies, un-official agencies, individuals.
And extracting the filing type from the filing information corresponding to each text file, matching the filing type corresponding to each text file with the content importance index corresponding to each filing type stored in the file management library, and matching the content importance index corresponding to each text file.
Illustratively, the above mentioned archival categories include, but are not limited to, scientific archives, military archives, and economic archives.
Substituting the source importance index and the content importance index corresponding to each text file into a preservation value degree analysis formula
Figure BDA0003977996280000121
Analyzing the corresponding storage value PV of each text archive i ,η i 、ξ i Respectively expressed as a source importance index and a content importance index corresponding to the ith text file, U is expressed as a preset constant, and U is expressed as>1。
The text archive typesetting requirement analysis module is used for extracting application purposes corresponding to each application from historical application parameters corresponding to each text archive, and analyzing the typesetting requirement degree corresponding to each text archive according to the application purposes, and specifically comprises the following steps: and comparing the application purposes of the application records corresponding to the text files with each other, classifying the application records corresponding to the same application purpose in the text files, and counting the number of the application records existing in the text printing in the text files.
Calculating the typesetting requirement degree corresponding to each text file according to the number of application records existing in the text printing in each text file, wherein the calculation formula is
Figure BDA0003977996280000131
Wherein TR i Expressed as the corresponding typesetting demand degree, x, of the ith text file i Number of application records, m, existing for printing of text in ith text file i The number of the application records corresponding to the ith text file is represented, wherein the more the number of the application records existing in the text printing in a certain text file is, the greater the typesetting requirement degree corresponding to the text file is.
The archive management library is used for storing characteristic parameters corresponding to various storage formats to which the text archives belong, storing source importance indexes corresponding to various formers, and storing content importance indexes corresponding to various archiving categories.
The text archive adaptive storage format analysis module is used for analyzing the adaptive storage format corresponding to each text archive based on the application value degree, the storage value degree and the typesetting requirement degree corresponding to each text archive and the characteristic parameters corresponding to various storage formats to which the text archive belongs, wherein the characteristic parameters comprise the loading program compatibility degree, the storage loss rate and the text typesetting support rate.
In a specific embodiment of the present invention, analyzing the adaptive storage format corresponding to each text archive specifically includes: and comparing the application value degree corresponding to each text file with the application value degree interval corresponding to each set application value grade, and determining the application value grade corresponding to each text file.
And matching the application value grade corresponding to each text file with the compatibility of the demand loader corresponding to each set application value grade, and matching the compatibility of the demand loader corresponding to each text file.
Extracting the compatibility of the loader from the characteristic parameters, comparing the compatibility of the required loader corresponding to each text file with the compatibility of the loader corresponding to each storage format of the text file, and calculating the similarity index of the compatibility of the loader corresponding to each text file in each storage format
Figure BDA0003977996280000141
Wherein delta k→i Expressed as loader compatibility similarity index, lambda, of the kth storage format relative to the ith text file k Expressed as the compatibility of the loader program corresponding to the k-th storage format of the text file, lambda i Expressed as the required loader compatibility corresponding to the ith text file, Δ λ is expressed as a preset reference loader compatibility contrast difference, k is expressed as the number of various storage formats to which the text file belongs, k =1,2The closer the compatibility is to the required loader compatibility of the text file, the greater the loader compatibility similarity index.
Comparing the loader compatibility similarity indexes of various storage formats relative to the text files with a set similarity index threshold, and screening out the storage formats of which the loader compatibility similarity indexes are larger than the set similarity index threshold from the text files as compatible storage formats corresponding to the text files to form a compatible storage format set corresponding to the text files.
And comparing the preservation value degree corresponding to each text file with the preservation value degree intervals corresponding to various set preservation value levels, and determining the preservation value level corresponding to each text file.
And matching the storage value level corresponding to each text file with the storage loss rate lower limit threshold corresponding to each set storage value level, and matching the storage loss rate lower limit threshold corresponding to each text file.
And extracting the storage loss rate from the characteristic parameters, comparing the lower limit threshold of the storage loss rate corresponding to each text archive with the storage loss rates corresponding to various storage formats to which the text archive belongs, screening out the storage formats meeting the lower limit threshold of the storage loss rate corresponding to each text archive, and taking the storage formats as the stable consistent storage formats corresponding to each text archive to form a stable consistent storage format set corresponding to each text archive.
And extracting the character typesetting support rate from the characteristic parameters, matching the character typesetting support rate corresponding to each storage format to which the text archive belongs with the typesetting requirement degree which can be met by the set character typesetting support rate, and matching the typesetting requirement degree which can be met by each storage format.
Comparing the typesetting requirement degree corresponding to each text file with the typesetting requirement degree which can be met by each storage format, if the typesetting requirement degree which can be met by a certain storage format is more than or equal to the typesetting requirement degree corresponding to a certain text file, taking the storage format as the typesetting conforming storage format corresponding to the text file, extracting the typesetting conforming storage format corresponding to each text file, and forming a typesetting conforming storage format set corresponding to each text file.
Determining an adaptive storage format corresponding to each text archive based on the compatible consistent storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to each text archive, wherein the specific determination steps are as follows: comparing the compatible storage format set, the stable storage format set and the typesetting storage format set corresponding to each text archive, if a certain corresponding storage format corresponding to a certain text archive appears in the sets, using the text archive as a common text archive, otherwise, using the text archive as a special text archive.
For the common text archive, the corresponding storage formats appearing in the sets are used as the adaptive storage formats corresponding to the common text archive.
For the special text archives, classifying the same storage formats in the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to the special text archives, and counting the occurrence frequency corresponding to each consistent storage format and the related set type and consistency corresponding to each occurrence, wherein the related set type comprises the compatible consistent storage format set, the stable consistent storage format set or the typesetting consistent storage format set;
calculating the corresponding advantage saliency of each corresponding storage format in the special text archive according to the corresponding frequency of occurrence of each corresponding storage format and the related set type and the corresponding conformity of each occurrence
Figure BDA0003977996280000161
Wherein->
Figure BDA0003977996280000162
The storage format is expressed as the corresponding dominant saliency of the r-th consistent storage format in the special text file, r is expressed as the consistent storage format number, r =1,2 r Expressed as the occurrence frequency corresponding to the matching storage format in the r-th text file, is/are>
Figure BDA0003977996280000163
μ r f represents the weight factor and the consistency of the corresponding storage format of the r-th consistent storage format in the special text file corresponding to the related set type at the f-th occurrence, f represents the number of each occurrence, and f =1,2.
In a further preferred embodiment, the above-mentioned correspondence for each occurrence is obtained by the following method: identifying a basic value and a reference value corresponding to each occurrence of each corresponding storage format, for example, if a corresponding storage format occurs in the set of compatible storage formats, the basic value corresponding to the occurrence of the corresponding storage format is a loader compatibility similarity index, and at this time, the reference value corresponding to the occurrence of the corresponding storage format is a set similarity index threshold.
Further exemplarily, if a corresponding storage format occurs in the stable corresponding storage format set, the basic value corresponding to the occurrence of the corresponding storage format at this time is the storage loss rate, and at this time, the reference value corresponding to the occurrence of the corresponding storage format at this time is the storage loss rate lower limit threshold.
Further exemplarily, if the corresponding storage format appears in the storage format set corresponding to the type setting, the basic value corresponding to the occurrence of the corresponding storage format at this time is the type setting requirement degree which can be met, and at this time, the reference value corresponding to the occurrence of the corresponding storage format at this time is the type setting requirement degree.
Substituting the basic value and the reference value corresponding to each occurrence of each consistent storage format into a consistency calculation formula
Figure BDA0003977996280000171
And calculating the corresponding conformity of each time of each conforming storage format.
And comparing the advantage saliency corresponding to each consistent storage format in the special text archive, and extracting the consistent storage format corresponding to the maximum advantage saliency from the comparison as the adaptive storage format corresponding to the special text archive.
According to the embodiment of the invention, when the adaptive storage format corresponding to each text archive is analyzed according to the text archive filing information and the historical application parameters, the application value, the storage value and the typesetting requirement corresponding to each text archive are analyzed, so that the multi-dimensional intelligent analysis of the adaptive storage format of the text archive is realized by combining the characteristic parameters corresponding to the various storage formats to which the text archive belongs, the analysis result is more accurate and reliable, the application range is wider, and the available value is higher.
The key text archive identification output terminal is used for acquiring the actual storage format corresponding to each text archive, comparing the actual storage format with the adaptive storage format corresponding to each text archive, recording a text archive as a key text archive if the actual storage format corresponding to a certain text archive is inconsistent with the adaptive storage format corresponding to the text archive, and then outputting the serial number of the key text archive and the adaptive storage format corresponding to the key text archive in a background for modification by an archive manager.
The method and the device have the advantages that the archiving information and the historical application parameters of each text archive stored in the target archive are acquired, the adaptive storage format corresponding to each text archive is analyzed according to the acquired archiving information and the historical application parameters, and the text archive with the actual storage format inconsistent with the adaptive storage format is subjected to key identification and modification.
The foregoing is merely exemplary and illustrative of the present invention and various modifications, additions and substitutions may be made by those skilled in the art to the specific embodiments described without departing from the scope of the invention as defined in the following claims.

Claims (10)

1. An electronic archive management system, comprising:
the target archive text archive counting module is used for taking an archive to be subjected to electronic archive management as a target archive, further counting the number of text archives stored in the target archive, and numbering the text archives according to the sequence of filing time points;
the text archive filing information acquisition module is used for acquiring filing information of each text archive;
the text archive historical application parameter extraction module is used for setting an application time period based on the archiving time point corresponding to each text archive, and further extracting the historical application parameters corresponding to each text archive in the set application time period;
the text file application value analysis module is used for analyzing the application value degree corresponding to each text file according to the historical application parameters corresponding to each text file;
the text archive storage value analysis module is used for analyzing the storage value degree corresponding to each text archive according to the filing information corresponding to each text archive;
the text file typesetting requirement analysis module is used for extracting application purposes corresponding to each application from historical application parameters corresponding to each text file and analyzing the typesetting requirement degree corresponding to each text file according to the application purposes;
the archive management library is used for storing characteristic parameters corresponding to various storage formats to which the text archives belong, storing source importance indexes corresponding to various formers, and storing content importance indexes corresponding to various archiving categories;
the text archive adaptive storage format analysis module is used for analyzing the adaptive storage format corresponding to each text archive based on the application value degree, the storage value degree and the typesetting requirement degree corresponding to each text archive and the characteristic parameters corresponding to various storage formats to which the text archive belongs;
and the key text archive identification output terminal is used for acquiring the actual storage format corresponding to each text archive, comparing the actual storage format with the adaptive storage format corresponding to each text archive, recording the text archive as a key text archive if the actual storage format corresponding to a certain text archive is inconsistent with the adaptive storage format corresponding to the text archive, and further outputting the serial number of the key text archive and the adaptive storage format corresponding to the key text archive in a background for modification by an archive manager.
2. An electronic archive management system according to claim 1, characterized by: the archive information includes a creator and an archive category.
3. An electronic archive management system according to claim 1, characterized in that: the specific setting mode for setting the application time period based on the archiving time points corresponding to the text archives is to compare the archiving time points corresponding to the text archives, screen out the earliest archiving time point from the archiving time points, use the earliest archiving time point as the initial application time point, further calculate the application ending time point according to the initial application time point and the preset time interval, and set the time period between the initial application time point and the application ending time point as the application time period.
4. An electronic archive management system according to claim 1, characterized in that: the historical application parameters comprise the number of application records, the interval duration of adjacent application records, the application duration corresponding to each application record and an application purpose, wherein the application purpose comprises information lookup or text printing.
5. An electronic archive management system according to claim 4, characterized in that: the analysis of the application value degree corresponding to each text archive specifically refers to the following steps:
extracting the application record quantity and the interval duration of adjacent application records from the historical application parameters corresponding to each text file, and calculating the application frequency FU corresponding to each text file according to the application record quantity and the interval duration i Where i is represented as a text archive number, i =1,2, ·, n,
Figure FDA0003977996270000031
Δd i j respectively representing the interval duration between the j +1 th application record and the j th application record corresponding to the ith text file, wherein j represents an application record number, j =1,2,.. Once, m, T represents the duration corresponding to the set application time period, and e represents a natural constant;
extracting application duration corresponding to each application record from historical application parameters corresponding to each text file, and calculating application value degree AV corresponding to each text file by combining the application duration corresponding to each text file with application frequency corresponding to each text file i Wherein
Figure FDA0003977996270000032
tij represents the application duration of the jth application record corresponding to the ith text file, and m represents the number of the application records.
6. An electronic archive management system according to claim 2, characterized in that: the analysis of the corresponding preservation value degree of each text archive refers to the following steps:
extracting formers from the filing information corresponding to each text file, and further matching the formers corresponding to each text file with the source importance indexes corresponding to various formers stored in the file management library, so as to obtain the source importance indexes corresponding to each text file;
extracting filing types from the filing information corresponding to the text archives, matching the filing types corresponding to the text archives with the content importance indexes corresponding to the filing types stored in the archive management library, and matching the content importance indexes corresponding to the text archives;
substituting the source importance index and the content importance index corresponding to each text file into a preservation value degree analysis formula
Figure FDA0003977996270000041
Analyzing the corresponding storage value PV of each text archive i ,η i 、ξ i Are respectively expressed as the ith textThe source importance index and the content importance index corresponding to the file, U is expressed as a preset constant, and U is expressed as a preset constant>1。
7. An electronic archive management system according to claim 4, characterized in that: the analyzing of the typesetting requirement degree corresponding to each text archive specifically comprises:
comparing the application purposes of the application records corresponding to the text files with each other, classifying the application records corresponding to the same application purpose in the text files, and counting the number of the application records existing in the text printing in the text files;
calculating the typesetting requirement degree corresponding to each text file according to the number of application records existing in the text printing in each text file, wherein the calculation formula is
Figure FDA0003977996270000042
Wherein TR i Expressed as the corresponding typesetting demand degree, x, of the ith text file i Number of application records, m, existing for printing of text in ith text file i The number of application records corresponding to the ith text file is expressed.
8. An electronic archive management system according to claim 1, characterized in that: the characteristic parameters comprise the compatibility of a loading program, the storage loss rate and the text typesetting support rate.
9. An electronic archive management system according to claim 8, characterized in that: the analysis of the adaptive storage format corresponding to each text archive specifically comprises:
comparing the application value degree corresponding to each text file with the application value degree interval corresponding to each set application value grade, and determining the application value grade corresponding to each text file;
matching the application value grade corresponding to each text file with the compatibility of the demand loader corresponding to each set application value grade, and matching the compatibility of the demand loader corresponding to each text file;
extracting loader compatibility from the characteristic parameters, comparing the required loader compatibility with the loading program compatibility of the text files in different storage formats, and calculating the loader compatibility similarity index of each storage format relative to each text file
Figure FDA0003977996270000051
Wherein delta k→i Expressed as loader compatibility similarity index, lambda, of the kth storage format relative to the ith text file k Expressed as the loader compatibility, lambda, corresponding to the kth storage format to which the text file belongs i Expressing the compatibility of a required loader corresponding to the ith text file, expressing delta lambda as a preset reference loader compatibility contrast difference value, expressing k as the number of various storage formats to which the text file belongs, and expressing k =1,2, …, z;
comparing the loader compatibility similarity indexes of various storage formats relative to the text files with a set similarity index threshold, and screening out the storage formats, in which the loader compatibility similarity indexes are greater than the set similarity index threshold, of the text files as compatible storage formats corresponding to the text files to form a compatible storage format set corresponding to the text files;
comparing the preservation value degree corresponding to each text file with the preservation value degree intervals corresponding to various set preservation value levels, and determining the preservation value level corresponding to each text file;
matching the storage value level corresponding to each text file with the storage loss rate lower limit threshold corresponding to each set storage value level, and matching the storage loss rate lower limit threshold corresponding to each text file;
extracting a storage loss rate from the characteristic parameters, comparing the lower limit threshold of the storage loss rate corresponding to each text archive with the storage loss rates corresponding to various storage formats to which the text archive belongs, screening out the storage formats meeting the lower limit threshold of the storage loss rate corresponding to each text archive, and taking the storage formats as the stable consistent storage formats corresponding to each text archive to form a stable consistent storage format set corresponding to each text archive;
extracting the character typesetting support rate from the characteristic parameters, further matching the character typesetting support rate corresponding to each storage format to which the text file belongs with the typesetting requirement degree which can be met by the set character typesetting support rate, and matching the typesetting requirement degree which can be met by each storage format;
comparing the typesetting requirement degree corresponding to each text file with the typesetting requirement degree which can be met by each storage format, if the typesetting requirement degree which can be met by a certain storage format is more than or equal to the typesetting requirement degree corresponding to a certain text file, taking the storage format as the typesetting conforming storage format corresponding to the text file, extracting the typesetting conforming storage format corresponding to each text file, and forming a typesetting conforming storage format set corresponding to each text file;
and determining the adaptive storage format corresponding to each text archive based on the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to each text archive.
10. An electronic archive management system according to claim 9, characterized by: the specific determination of the adaptive storage format corresponding to each text archive is as follows:
comparing a compatible storage format set, a stable consistent storage format set and a typesetting consistent storage format set corresponding to each text archive, wherein if a certain consistent storage format corresponding to a certain text archive appears in the sets, the text archive is used as a common text archive, otherwise, the text archive is used as a special text archive;
for the common text archives, the consistent storage formats appearing in the sets are used as the adaptive storage formats corresponding to the common text archives;
for the special text archives, classifying the same storage formats in the compatible storage format set, the stable consistent storage format set and the typesetting consistent storage format set corresponding to the special text archives, and counting the occurrence frequency corresponding to each consistent storage format and the related set type and the consistency corresponding to each occurrence;
calculating the corresponding advantage saliency of each corresponding storage format in the special text archive according to the corresponding frequency of occurrence of each corresponding storage format and the related set type and the corresponding conformity of each occurrence
Figure FDA0003977996270000071
Wherein->
Figure FDA0003977996270000072
The storage format is expressed as the corresponding dominant saliency of the r-th consistent storage format in the special text file, r is expressed as the consistent storage format number, r =1,2, …, w, y r Expressed as the frequency of occurrence corresponding to the r-th matching storage format in the special text file, and based on the storage format in the R-th matching storage format>
Figure FDA0003977996270000073
μ r f represents the weight factor and the consistency of the corresponding related set type of the r-th consistent storage format in the special text file at the f-th occurrence, f represents the number of each occurrence, and f =1,2, …, y and alpha represent preset correction factors;
and comparing the advantage saliency corresponding to each consistent storage format in the special text archive, and extracting the consistent storage format corresponding to the maximum advantage saliency from the comparison as the adaptive storage format corresponding to the special text archive.
CN202211541720.XA 2022-12-02 2022-12-02 Electronic file management system Pending CN115878559A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211541720.XA CN115878559A (en) 2022-12-02 2022-12-02 Electronic file management system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211541720.XA CN115878559A (en) 2022-12-02 2022-12-02 Electronic file management system

Publications (1)

Publication Number Publication Date
CN115878559A true CN115878559A (en) 2023-03-31

Family

ID=85765677

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211541720.XA Pending CN115878559A (en) 2022-12-02 2022-12-02 Electronic file management system

Country Status (1)

Country Link
CN (1) CN115878559A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116681261A (en) * 2023-07-27 2023-09-01 山东创亿智慧信息科技发展有限责任公司 Intelligent archive management control system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116681261A (en) * 2023-07-27 2023-09-01 山东创亿智慧信息科技发展有限责任公司 Intelligent archive management control system
CN116681261B (en) * 2023-07-27 2023-10-17 山东创亿智慧信息科技发展有限责任公司 Intelligent archive management control system

Similar Documents

Publication Publication Date Title
US7076503B2 (en) Managing media objects in a database
US8001113B2 (en) Query string matching method and apparatus
US20060282465A1 (en) System and method for searching media content
EP2186275B1 (en) Generating a fingerprint of a bit sequence
US8117528B2 (en) Information handling
CN110188077B (en) Intelligent classification method and device for electronic files, electronic equipment and storage medium
CN113486392B (en) Sensitive data identification and desensitization method based on big data platform
US20060294096A1 (en) Additive clustering of images into events using capture date-time information
CN115878559A (en) Electronic file management system
CN112506858B (en) File management method for intelligent brain of intelligent meeting room
CN115309963B (en) Intelligent archive management method, system and storage medium
CN111159763A (en) System and method for analyzing portrait of law-related personnel group
CN115935412A (en) Automatic classification and classification method and system for unstructured data
US20110179036A1 (en) Methods and Apparatuses For Abstract Representation of Financial Documents
CN109710628B (en) Information processing method, information processing device, information processing system, computer and readable storage medium
CN113254398A (en) Sample file management method, device, equipment and medium
CN114817518A (en) License handling method, system and medium based on big data archive identification
CN112733186A (en) User privacy data analysis method and device
CN115328863B (en) File processing method, equipment and storage medium
CN112559739A (en) Method for processing insulation state data of power equipment
CN107506398B (en) Method for adding label attribute to book
CN115510144B (en) Method and system for capturing real-time change data of database
CN110489378B (en) Method and system for file migration in Internet
US20230326222A1 (en) System and method for unsupervised document ontology generation
CN114328911A (en) Automatic document classifying and warehousing method based on content

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination