CN117220686B - Parasitic parameter compression and extraction system and method - Google Patents
Parasitic parameter compression and extraction system and method Download PDFInfo
- Publication number
- CN117220686B CN117220686B CN202311206468.1A CN202311206468A CN117220686B CN 117220686 B CN117220686 B CN 117220686B CN 202311206468 A CN202311206468 A CN 202311206468A CN 117220686 B CN117220686 B CN 117220686B
- Authority
- CN
- China
- Prior art keywords
- preset
- data
- extraction
- cpu
- interactive
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000605 extraction Methods 0.000 title claims abstract description 143
- 230000003071 parasitic effect Effects 0.000 title claims abstract description 33
- 238000007906 compression Methods 0.000 title claims abstract description 29
- 230000006835 compression Effects 0.000 title claims abstract description 29
- 238000000034 method Methods 0.000 title claims description 21
- 238000011156 evaluation Methods 0.000 claims abstract description 118
- 230000003993 interaction Effects 0.000 claims abstract description 117
- 238000007781 pre-processing Methods 0.000 claims abstract description 20
- 230000002452 interceptive effect Effects 0.000 claims description 138
- 238000005192 partition Methods 0.000 claims description 113
- 230000006837 decompression Effects 0.000 claims description 12
- 239000000284 extract Substances 0.000 claims description 4
- 238000012163 sequencing technique Methods 0.000 claims description 3
- 238000013075 data extraction Methods 0.000 abstract description 2
- 238000013144 data compression Methods 0.000 abstract 1
- 238000012545 processing Methods 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The application relates to the technical field of data compression and extraction, in particular to a parasitic parameter compression and extraction system and a parasitic parameter compression and extraction method, wherein the system comprises the following components: the acquisition module is used for acquiring the interaction data in the parasitic parameter extraction process and preprocessing the interaction data; the compression module is used for compressing the preprocessed interaction data based on an LZMA algorithm and storing the compressed interaction data in a memory; the extraction module is used for decompressing the interaction data and extracting preset interaction data, and is also used for calculating the use evaluation degree of the second CPU before extracting the preset interaction data, and determining feature extraction data in the preset interaction data according to the use evaluation degree. The invention solves the technical problems that the prior extraction system needs to frequently read in a hard disk and a memory when extracting compressed data, wastes resources and time very and has low efficiency.
Description
Technical Field
The present disclosure relates to the technical field, and in particular, to a system and a method for compressing and extracting parasitic parameters.
Background
During the parasitic parameter providing tool extraction process, there is a large amount of interaction data. However, these data are usually either very large in size or fragmented, and in the process of frequent reading, the parasitic data cannot be stored in the memory completely, only the hard disk can be stored partially, the memory can be stored partially, and frequent reading is performed between the hard disk and the memory through the processing from the hard disk to the memory to the cpu. And after the calculation is finished, the result is fed to the memory through the CPU, and finally stored to the hard disk. The extraction system is very wasteful of resources and time and inefficient.
Disclosure of Invention
In order to solve the technical problems, the application provides a parasitic parameter compression extraction method, which aims to solve the technical problems that the conventional extraction system needs to frequently read in a hard disk and a memory when extracting compressed data, so that resources and time are wasted and the efficiency is low.
In some embodiments of the present application, preprocessing is performed on the interaction data, where the preprocessing includes determining a data partition and a partition amount of the interaction data according to a data bit length of the interaction data, calculating an average data heat value in the data partition, storing the interaction data in the data partition to a corresponding storage space in a memory according to the average data heat value, and dividing the interaction data into a plurality of data partitions, so that execution efficiency of storing the data in the memory is improved, and storage space of the memory is saved.
In some embodiments of the present application, the interactive data is decompressed and the required preset interactive data is extracted, before the preset interactive data is extracted, the usage evaluation degree of the second CPU is calculated according to the usage condition and the extraction condition of the second CPU, the feature extraction data in the preset interactive data is determined according to the usage evaluation degree, and under the condition that the performance of the second CPU and other instruction operations are not affected, important content in the required preset interactive data is extracted, so that the usage efficiency and the extraction efficiency of the CPU are improved.
In some embodiments of the present application, a system and a method for extracting parasitic parameters by compression are provided, the system includes:
the acquisition module is used for acquiring the interaction data in the parasitic parameter extraction process and preprocessing the interaction data;
the compression module is used for compressing the preprocessed interaction data based on an LZMA algorithm and storing the compressed interaction data in a memory;
the extraction module is used for decompressing the interaction data and extracting preset interaction data, and is also used for calculating the use evaluation degree of the second CPU before extracting the preset interaction data, and determining feature extraction data in the preset interaction data according to the use evaluation degree.
In some embodiments of the present application, the extraction module includes:
the first extraction sub-module comprises a first CPU, wherein the first CPU is used for extracting and decompressing the compressed interaction data and setting an index identifier for the decompressed interaction data;
the second extraction sub-module comprises a second CPU, and the second CPU is used for extracting corresponding interaction data in the first extraction sub-module according to the index identifier;
and the computing sub-module is used for acquiring the extraction condition and the use condition when the second CPU extracts the interactive data, and computing the use evaluation degree of the second CPU according to the extraction condition and the use condition.
In some embodiments of the present application, extracting and decompressing the compressed interaction data, setting an index identifier for the decompressed interaction data, including:
acquiring interactive data to be decompressed, extracting the interactive data in the corresponding data partition and storage space from the memory according to the data bit length and the data heat value corresponding to the interactive data to be decompressed, and performing decompression operation on the interactive data to be decompressed and the interactive data extracted from the memory to obtain a decompression result;
converting the interactive data in the decompression result into an interactive text file, wherein the interactive text file comprises a plurality of interactive text segments, and each interactive text segment is provided with an index identifier;
the interactive text file comprises a search index of a plurality of index elements, each index element is associated with an index identifier in one or more interactive text segments, one or more interactive text segments in the interactive text file can be detected by using the index element of one search index, and preset contents of the interactive data can be positioned by using the index identifier in each interactive text segment.
In some embodiments of the present application, calculating the usage evaluation degree of the second CPU according to the extraction condition and the usage condition includes:
the extraction condition of the second CPU comprises an extraction rate K1 and an extraction integrity K2, the use condition of the second CPU comprises a second CPU occupancy rate N1 and a second CPU load value N2, and the use evaluation degree of the second CPU is calculated according to the extraction condition and the use condition;
the second CPU has a usage evaluation degree of:
;
wherein W is the usage evaluation degree of the second CPU, a1 is the weight coefficient in the case of extraction rate to the second CPU extraction, a2 is the weight coefficient in the case of extraction integrity to the second CPU extraction, and a1+a2=1; m1 is a weight coefficient in the use case of the second CPU by the second CPU occupancy, m2 is a weight coefficient in the use case of the second CPU by the second CPU load value, and m1+m2=1, q1 is a weight coefficient of the use evaluation degree by the extraction case, q2 is a weight coefficient of the use evaluation degree by the use case, and q1+q2=1.
In some embodiments of the present application, determining feature extraction data in the preset interaction data according to the usage evaluation degree includes:
extracting preset interaction data, determining feature data contents to be extracted in the preset interaction data, and sequencing the feature data contents according to the extraction importance degree to obtain a feature extraction data set;
each feature data content in the feature extraction data set is provided with a corresponding index identifier, the number of the index identifiers is determined according to the relation between the use evaluation degree and the preset use evaluation degree, and feature extraction data is determined according to the number of the index identifiers and the corresponding feature data content.
In some embodiments of the present application, determining the number of index identifiers according to the relationship between the usage evaluation degree and the preset usage evaluation degree includes:
the extraction module is preset with a first preset use evaluation degree, a second preset use evaluation degree and a third preset evaluation degree, wherein the first preset use evaluation degree is smaller than the second preset use evaluation degree, and the second preset use evaluation degree is smaller than the third preset use evaluation degree; the method comprises the steps of setting a first preset index identification number, a second preset index identification number, a third preset index identification number and a fourth preset index identification number in advance, wherein the first preset index identification number is smaller than the second preset index identification number, the second preset index identification number is smaller than the third preset index identification number, and the third preset index identification number is smaller than the fourth preset index identification number;
when the using evaluation degree is smaller than a first preset using evaluation degree, selecting the number of first preset index marks as the number of current index marks;
when the using evaluation degree is between a first preset using evaluation degree and a second preset using evaluation degree, selecting the number of second preset index marks as the number of current index marks;
when the using evaluation degree is between the first preset using evaluation degree and the second preset using evaluation degree, selecting the number of third preset index marks as the number of current index marks;
and when the use evaluation degree is larger than the third preset use evaluation degree, selecting the number of the fourth preset index marks as the number of the current index marks.
In some embodiments of the present application, preprocessing the interaction data includes:
the preprocessing comprises the steps of obtaining the data bit length of the interactive data, and determining the data partition and partition quantity of the interactive data according to the data bit length of the interactive data;
the method comprises the steps of presetting a first preset data bit length, a second preset data bit length and a third preset data bit length, wherein the first preset data bit length is smaller than the second preset data bit length, and the second preset data bit length is smaller than the third preset data bit length; the method comprises the steps of setting a first preset partition amount, a second preset partition amount, a third preset partition amount and a fourth preset partition amount in advance, wherein the first preset partition amount is smaller than the second preset partition amount, the second preset partition amount is smaller than the third preset partition amount, and the third preset partition amount is smaller than the fourth preset partition amount;
when the data bit length is smaller than a first preset data bit length, selecting the first preset partition amount as a current partition amount;
when the data bit length is between a first preset data bit length and a second preset data bit length, selecting the second preset partition amount as a current partition amount;
when the data bit length is between the second preset data bit length and the third preset data bit length, selecting the third preset partition amount as the current partition amount;
and when the data bit length is longer than the third preset data bit length, selecting the fourth preset partition amount as the current partition amount.
In some embodiments of the present application, calculating an average data heat value in the data partition, sequentially arranging the average data heat value, storing the data partition with the average data heat value greater than a preset data heat value threshold into a first storage space, and storing the data partition with the average data heat value less than the preset data heat value threshold into a second storage space, wherein the first storage space and the second storage space are arranged on a memory;
based on an LZMA algorithm, compressing the interactive data in the first storage space and the second storage space according to a preset compression algorithm, comparing the compressed interactive data with corresponding interactive data in the first storage space and the second storage space, and if the comparison is successful, storing the compressed interactive data in a third storage space.
In some embodiments of the present application, before storing the compressed interaction data in the third storage space, the method includes:
calculating the storage duty ratio of the current interaction data in real time, and determining whether the third storage space meets the storage condition according to the storage duty ratio;
if the storage condition is met, storing the current interaction data into the third storage space;
if the storage condition is not met, determining a supplementary space according to the storage duty ratio, comparing a second empty storage duty ratio of the second storage space with the supplementary space, if the second empty storage duty ratio is larger than the supplementary space, taking the second empty storage duty ratio as a first external storage space of the third storage space, if the second empty storage duty ratio is smaller than the supplementary space, calculating a difference space between the supplementary space and the empty storage duty ratio, and taking the first empty storage duty ratio of the first storage space as a second external storage space of the third storage space based on the difference space;
the storage condition is that the third space storage duty ratio of the third storage space is larger than the storage duty ratio of the current interaction data, and the storage quality of the current interaction data can be ensured.
In some embodiments of the present application, a parasitic parameter compression extraction method is further included:
acquiring interaction data in a parasitic parameter extraction process, and preprocessing the interaction data;
based on an LZMA algorithm, compressing the preprocessed interactive data, and storing the compressed interactive data in a memory;
decompressing the interaction data and extracting preset interaction data, and calculating the use evaluation degree of the second CPU before extracting the preset interaction data, and determining feature extraction data in the preset interaction data according to the use evaluation degree.
Compared with the prior art, the parasitic parameter compression and extraction system and the parasitic parameter compression and extraction method have the beneficial effects that:
preprocessing the interactive data, wherein the preprocessing comprises the steps of determining a data partition and partition quantity of the interactive data according to the data bit length of the interactive data, calculating an average data heat value in the data partition, storing the interactive data in the data partition into corresponding storage spaces in a memory according to the average data heat value, and dividing the interactive data into a plurality of data partitions, so that the execution efficiency of storing the data in the memory is improved, and the storage space of the memory is saved.
Decompressing the interaction data and extracting the required preset interaction data, calculating the use evaluation degree of the second CPU according to the use condition and the extraction condition of the second CPU before extracting the preset interaction data, determining the feature extraction data in the preset interaction data according to the use evaluation degree, and extracting important contents in the required preset interaction data under the condition that the performance and other instruction operation of the second CPU are not affected, thereby improving the use efficiency and the extraction efficiency of the CPU.
Drawings
FIG. 1 is a schematic diagram of a parasitic parameter compression extraction system in accordance with a preferred embodiment of the present application;
fig. 2 is a schematic flow chart of a parasitic parameter compression extraction method in a preferred embodiment of the present application.
Detailed Description
The detailed description of the present application is further described in detail below with reference to the drawings and examples. The following examples are illustrative of the present application, but are not intended to limit the scope of the present application.
In the description of the present application, it should be understood that the terms "center," "upper," "lower," "front," "rear," "left," "right," "vertical," "horizontal," "top," "bottom," "inner," "outer," and the like indicate orientations or positional relationships based on the orientation or positional relationships shown in the drawings, merely to facilitate description of the present application and simplify the description, and do not indicate or imply that the devices or elements referred to must have a specific orientation, be configured and operated in a specific orientation, and therefore should not be construed as limiting the present application.
The terms "first," "second," and the like, are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more such feature. In the description of the present application, unless otherwise indicated, the meaning of "a plurality" is two or more.
In the description of the present application, it should be noted that, unless explicitly specified and limited otherwise, the terms "mounted," "connected," and "connected" are to be construed broadly, and may be either fixedly connected, detachably connected, or integrally connected, for example; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the terms in this application will be understood by those of ordinary skill in the art in a specific context.
As shown in fig. 1, a parasitic parameter compression extraction method in a preferred embodiment of the present application includes:
the acquisition module is used for acquiring the interaction data in the parasitic parameter extraction process and preprocessing the interaction data;
the compression module is used for compressing the preprocessed interaction data based on an LZMA algorithm and storing the compressed interaction data in a memory;
the extraction module is used for decompressing the interaction data and extracting preset interaction data, and is also used for calculating the use evaluation degree of the second CPU before extracting the preset interaction data, and determining feature extraction data in the preset interaction data according to the use evaluation degree.
In this embodiment, the preset interaction data is required interaction data, the usage evaluation degree of the second CPU is calculated before the preset interaction data is extracted, the second CPU is a main CPU, and in order to prevent the operation of other instructions in the main CPU from being affected, the feature extraction data is data corresponding to necessary extraction content in the preset interaction data.
In some embodiments of the present application, the extraction module includes:
the first extraction sub-module comprises a first CPU, wherein the first CPU is used for extracting and decompressing the compressed interaction data and setting an index identifier for the decompressed interaction data;
the second extraction sub-module comprises a second CPU, and the second CPU is used for extracting corresponding interaction data in the first extraction sub-module according to the index identifier;
and the computing sub-module is used for acquiring the extraction condition and the use condition when the second CPU extracts the interactive data, and computing the use evaluation degree of the second CPU according to the extraction condition and the use condition.
In some embodiments of the present application, extracting and decompressing the compressed interaction data, setting an index identifier for the decompressed interaction data, including:
acquiring interactive data to be decompressed, extracting the interactive data in the corresponding data partition and storage space from the memory according to the data bit length and the data heat value corresponding to the interactive data to be decompressed, and performing decompression operation on the interactive data to be decompressed and the interactive data extracted from the memory to obtain a decompression result;
converting the interactive data in the decompression result into an interactive text file, wherein the interactive text file comprises a plurality of interactive text segments, and each interactive text segment is provided with an index identifier;
the interactive text file comprises a search index of a plurality of index elements, each index element is associated with an index identifier in one or more interactive text segments, one or more interactive text segments in the interactive text file can be detected by using the index element of one search index, and preset contents of the interactive data can be positioned by using the index identifier in each interactive text segment.
In this embodiment, the data bit length is the data length corresponding to the interactive data, the data popularity value is set according to the occurrence frequency and the occurrence mode of the interactive data, the decompressed interactive data is converted into the interactive text file through the text function, a plurality of index elements are determined according to the characteristic information of a plurality of interactive text segments in the interactive text file, the interactive text segments are provided with corresponding index identifiers, the preset content of the interactive data can be located by using the index identifiers in each interactive text segment, the main CPU can extract a certain item of data in the parasitic data from the first CPU through the index identifiers to participate in processing, the first CPU can liberate the calculation pressure of the main CPU, and meanwhile, the index identifiers can also reduce the data processing capacity of the main CPU.
In some embodiments of the present application, calculating the usage evaluation degree of the second CPU according to the extraction condition and the usage condition includes:
the extraction condition of the second CPU comprises an extraction rate K1 and an extraction integrity K2, the use condition of the second CPU comprises a second CPU occupancy rate N1 and a second CPU load value N2, and the use evaluation degree of the second CPU is calculated according to the extraction condition and the use condition;
the second CPU has a usage evaluation degree of:
;
wherein W is the usage evaluation degree of the second CPU, a1 is the weight coefficient in the case of extraction rate to the second CPU extraction, a2 is the weight coefficient in the case of extraction integrity to the second CPU extraction, and a1+a2=1; m1 is a weight coefficient in the use case of the second CPU by the second CPU occupancy, m2 is a weight coefficient in the use case of the second CPU by the second CPU load value, and m1+m2=1, q1 is a weight coefficient of the use evaluation degree by the extraction case, q2 is a weight coefficient of the use evaluation degree by the use case, and q1+q2=1.
In this embodiment, k1=v/t, V is the extraction speed, t is the extraction duration, the second CPU occupancy N1 is the remaining occupancy of the second CPU except for extracting the interaction data, the second CPU load value N2 is the difference between the second CPU real-time load value and the normal load value, if the difference is within the normal load difference interval, the second CPU load value N2 is set to 1, and if the difference is not within the normal load difference interval, the second CPU load value N2 is set to-1.
In some embodiments of the present application, determining feature extraction data in the preset interaction data according to the usage evaluation degree includes:
extracting preset interaction data, determining feature data contents to be extracted in the preset interaction data, and sequencing the feature data contents according to the extraction importance degree to obtain a feature extraction data set;
each feature data content in the feature extraction data set is provided with a corresponding index identifier, the number of the index identifiers is determined according to the relation between the use evaluation degree and the preset use evaluation degree, and feature extraction data is determined according to the number of the index identifiers and the corresponding feature data content.
In this embodiment, the feature data contents are sorted according to the extraction importance, the index identifiers corresponding to the feature data contents are also sorted according to the extraction importance, and after the number of the index identifiers is determined, if the number of the index identifiers is 4, the feature data contents corresponding to the first four index identifiers sorted from big to small according to the extraction importance are used as feature extraction data, so that the data processing capacity of the second CPU and the occupation rate of data extraction are greatly reduced.
In some embodiments of the present application, determining the number of index identifiers according to the relationship between the usage evaluation degree and the preset usage evaluation degree includes:
the extraction module is preset with a first preset use evaluation degree, a second preset use evaluation degree and a third preset evaluation degree, wherein the first preset use evaluation degree is smaller than the second preset use evaluation degree, and the second preset use evaluation degree is smaller than the third preset use evaluation degree; the method comprises the steps of setting a first preset index identification number, a second preset index identification number, a third preset index identification number and a fourth preset index identification number in advance, wherein the first preset index identification number is smaller than the second preset index identification number, the second preset index identification number is smaller than the third preset index identification number, and the third preset index identification number is smaller than the fourth preset index identification number;
when the using evaluation degree is smaller than a first preset using evaluation degree, selecting the number of first preset index marks as the number of current index marks;
when the using evaluation degree is between a first preset using evaluation degree and a second preset using evaluation degree, selecting the number of second preset index marks as the number of current index marks;
when the using evaluation degree is between the first preset using evaluation degree and the second preset using evaluation degree, selecting the number of third preset index marks as the number of current index marks;
and when the use evaluation degree is larger than the third preset use evaluation degree, selecting the number of the fourth preset index marks as the number of the current index marks.
In the present embodiment, when the usage evaluation degree is higher, it is indicated that the current second CPU operating pressure is smaller to satisfy the data processing amount.
In some embodiments of the present application, preprocessing the interaction data includes:
the preprocessing comprises the steps of obtaining the data bit length of the interactive data, and determining the data partition and partition quantity of the interactive data according to the data bit length of the interactive data;
the method comprises the steps of presetting a first preset data bit length, a second preset data bit length and a third preset data bit length, wherein the first preset data bit length is smaller than the second preset data bit length, and the second preset data bit length is smaller than the third preset data bit length; the method comprises the steps of setting a first preset partition amount, a second preset partition amount, a third preset partition amount and a fourth preset partition amount in advance, wherein the first preset partition amount is smaller than the second preset partition amount, the second preset partition amount is smaller than the third preset partition amount, and the third preset partition amount is smaller than the fourth preset partition amount;
when the data bit length is smaller than a first preset data bit length, selecting the first preset partition amount as a current partition amount;
when the data bit length is between a first preset data bit length and a second preset data bit length, selecting the second preset partition amount as a current partition amount;
when the data bit length is between the second preset data bit length and the third preset data bit length, selecting the third preset partition amount as the current partition amount;
and when the data bit length is longer than the third preset data bit length, selecting the fourth preset partition amount as the current partition amount.
In this embodiment, the interactive data is divided by obtaining the data bit length of the interactive data, if the current partition amount is 10, the interactive data is divided into 10 data partitions, each data partition includes summer-time, and the partition amount of the data partition is set according to the relationship between the data bit length of the interactive data and the preset data bit length, so that the interactive data is split, the execution efficiency of storing the data in the memory is improved, and the storage space of the memory is saved.
In some embodiments of the present application, calculating an average data heat value in the data partition, sequentially arranging the average data heat value, storing the data partition with the average data heat value greater than a preset data heat value threshold into a first storage space, and storing the data partition with the average data heat value less than the preset data heat value threshold into a second storage space, wherein the first storage space and the second storage space are arranged on a memory;
based on an LZMA algorithm, compressing the interactive data in the first storage space and the second storage space according to a preset compression algorithm, comparing the compressed interactive data with corresponding interactive data in the first storage space and the second storage space, and if the comparison is successful, storing the compressed interactive data in a third storage space.
In this embodiment, the memory is divided into a first storage space, a second storage space and a third storage space, and the data is stored in the corresponding storage spaces in a partitioned manner according to the sequency of the average data heat value, so as to ensure the data order.
In some embodiments of the present application, before storing the compressed interaction data in the third storage space, the method includes:
calculating the storage duty ratio of the current interaction data in real time, and determining whether the third storage space meets the storage condition according to the storage duty ratio;
if the storage condition is met, storing the current interaction data into the third storage space;
if the storage condition is not met, determining a supplementary space according to the storage duty ratio, comparing a second empty storage duty ratio of the second storage space with the supplementary space, if the second empty storage duty ratio is larger than the supplementary space, taking the second empty storage duty ratio as a first external storage space of the third storage space, if the second empty storage duty ratio is smaller than the supplementary space, calculating a difference space between the supplementary space and the empty storage duty ratio, and taking the first empty storage duty ratio of the first storage space as a second external storage space of the third storage space based on the difference space;
the storage condition is that the third space storage duty ratio of the third storage space is larger than the storage duty ratio of the current interaction data, and the storage quality of the current interaction data can be ensured.
In this embodiment, by allocating the space of the first storage space, the second storage space and the third storage space, the storage space of the interactive data in the memory is ensured to be enough, and when the extraction is performed, only the transfer between the memory and the CPU is performed, so that the extraction time is greatly shortened, and the storage and extraction efficiency is improved.
In some embodiments of the present application, as shown in fig. 2, a parasitic parameter compression extraction method is further included:
step S201: acquiring interaction data in a parasitic parameter extraction process, and preprocessing the interaction data;
step S202: based on an LZMA algorithm, compressing the preprocessed interactive data, and storing the compressed interactive data in a memory;
step S203: decompressing the interaction data and extracting preset interaction data, and calculating the use evaluation degree of the second CPU before extracting the preset interaction data, and determining feature extraction data in the preset interaction data according to the use evaluation degree.
In summary, the invention discloses a system and a method for extracting parasitic parameters by compression, wherein the system comprises: the acquisition module is used for acquiring the interaction data in the parasitic parameter extraction process and preprocessing the interaction data; the compression module is used for compressing the preprocessed interaction data based on an LZMA algorithm and storing the compressed interaction data in a memory; the extraction module is used for decompressing the interaction data and extracting preset interaction data, and is also used for calculating the use evaluation degree of the second CPU before extracting the preset interaction data, and determining feature extraction data in the preset interaction data according to the use evaluation degree.
According to the first conception, the interactive data is preprocessed, the data partition and partition quantity of the interactive data are determined according to the data bit length of the interactive data, the average data heat value in the data partition is calculated, the interactive data in the data partition are stored in corresponding storage spaces in the memory according to the average data heat value, the interactive data are divided into a plurality of data partitions, the execution efficiency of storing the data in the memory is improved, and the storage space of the memory is saved.
According to the second conception, the interactive data are decompressed and the required preset interactive data are extracted, the use evaluation degree of the second CPU is calculated according to the use condition and the extraction condition of the second CPU before the preset interactive data are extracted, the feature extraction data in the preset interactive data are determined according to the use evaluation degree, and important contents in the required preset interactive data are extracted under the condition that the performance of the second CPU and other instruction operation are not affected, so that the use efficiency and the extraction efficiency of the CPU are improved.
The foregoing is merely a preferred embodiment of the present application, and it should be noted that modifications and substitutions can be made by those skilled in the art without departing from the technical principles of the present application, and these modifications and substitutions should also be considered as being within the scope of the present application.
Claims (6)
1. A parasitic parameter compression extraction system, comprising:
the acquisition module is used for acquiring the interaction data in the parasitic parameter extraction process and preprocessing the interaction data;
the compression module is used for compressing the preprocessed interaction data based on an LZMA algorithm and storing the compressed interaction data in a memory;
the extraction module is used for decompressing the interaction data and extracting preset interaction data, and also used for calculating the use evaluation degree of the second CPU before extracting the preset interaction data, and determining feature extraction data in the preset interaction data according to the use evaluation degree;
the extraction module comprises:
the first extraction sub-module comprises a first CPU, wherein the first CPU is used for extracting and decompressing the compressed interaction data and setting an index identifier for the decompressed interaction data;
the second extraction sub-module comprises a second CPU, and the second CPU is used for extracting corresponding interaction data in the first extraction sub-module according to the index identifier;
the computing sub-module is used for acquiring the extraction condition and the use condition when the second CPU extracts the interaction data, and computing the use evaluation degree of the second CPU according to the extraction condition and the use condition;
extracting and decompressing the compressed interactive data, setting index identification for the decompressed interactive data, and comprising:
acquiring interactive data to be decompressed, extracting the interactive data in the corresponding data partition and storage space from the memory according to the data bit length and the data heat value corresponding to the interactive data to be decompressed, and performing decompression operation on the interactive data to be decompressed and the interactive data extracted from the memory to obtain a decompression result;
converting the interactive data in the decompression result into an interactive text file, wherein the interactive text file comprises a plurality of interactive text segments, and each interactive text segment is provided with an index identifier;
the interactive text file comprises search indexes of a plurality of index elements, each index element is associated with index identifiers in one or more interactive text segments, one or more interactive text segments in the interactive text file can be detected by using the index element of one search index, and preset contents of the interactive data can be positioned by using the index identifier in each interactive text segment;
calculating the use evaluation degree of the second CPU according to the extraction condition and the use condition, wherein the method comprises the following steps:
the extraction condition of the second CPU comprises an extraction rate K1 and an extraction integrity K2, the use condition of the second CPU comprises a second CPU occupancy rate N1 and a second CPU load value N2, and the use evaluation degree of the second CPU is calculated according to the extraction condition and the use condition;
the second CPU has a usage evaluation degree of:
;
wherein W is the usage evaluation degree of the second CPU, a1 is the weight coefficient in the case of extraction rate to the second CPU extraction, a2 is the weight coefficient in the case of extraction integrity to the second CPU extraction, and a1+a2=1; m1 is a weight coefficient in the use case of the second CPU by the second CPU occupancy, m2 is a weight coefficient in the use case of the second CPU by the second CPU load value, m1+m2=1, q1 is a weight coefficient of the use evaluation degree by the extraction case, q2 is a weight coefficient of the use evaluation degree by the use case, and q1+q2=1;
preprocessing the interaction data, including:
the preprocessing comprises the steps of obtaining the data bit length of the interactive data, and determining the data partition and partition quantity of the interactive data according to the data bit length of the interactive data;
the method comprises the steps of presetting a first preset data bit length, a second preset data bit length and a third preset data bit length, wherein the first preset data bit length is smaller than the second preset data bit length, and the second preset data bit length is smaller than the third preset data bit length; the method comprises the steps of setting a first preset partition amount, a second preset partition amount, a third preset partition amount and a fourth preset partition amount in advance, wherein the first preset partition amount is smaller than the second preset partition amount, the second preset partition amount is smaller than the third preset partition amount, and the third preset partition amount is smaller than the fourth preset partition amount;
when the data bit length is smaller than a first preset data bit length, selecting the first preset partition amount as a current partition amount;
when the data bit length is between a first preset data bit length and a second preset data bit length, selecting the second preset partition amount as a current partition amount;
when the data bit length is between the second preset data bit length and the third preset data bit length, selecting the third preset partition amount as the current partition amount;
and when the data bit length is longer than the third preset data bit length, selecting the fourth preset partition amount as the current partition amount.
2. The parasitic parameter compression and extraction system of claim 1, wherein determining feature extraction data in the preset interaction data according to the usage evaluation degree comprises:
extracting preset interaction data, determining feature data contents to be extracted in the preset interaction data, and sequencing the feature data contents according to the extraction importance degree to obtain a feature extraction data set;
each feature data content in the feature extraction data set is provided with a corresponding index identifier, the number of the index identifiers is determined according to the relation between the use evaluation degree and the preset use evaluation degree, and feature extraction data is determined according to the number of the index identifiers and the corresponding feature data content.
3. The parasitic parameter compression and extraction system of claim 2, wherein determining the number of index markers according to the relationship between the usage evaluation degree and a preset usage evaluation degree comprises:
the extraction module is preset with a first preset use evaluation degree, a second preset use evaluation degree and a third preset evaluation degree, wherein the first preset use evaluation degree is smaller than the second preset use evaluation degree, and the second preset use evaluation degree is smaller than the third preset use evaluation degree; the method comprises the steps of setting a first preset index identification number, a second preset index identification number, a third preset index identification number and a fourth preset index identification number in advance, wherein the first preset index identification number is smaller than the second preset index identification number, the second preset index identification number is smaller than the third preset index identification number, and the third preset index identification number is smaller than the fourth preset index identification number;
when the using evaluation degree is smaller than a first preset using evaluation degree, selecting the number of first preset index marks as the number of current index marks;
when the using evaluation degree is between a first preset using evaluation degree and a second preset using evaluation degree, selecting the number of second preset index marks as the number of current index marks;
when the using evaluation degree is between the first preset using evaluation degree and the second preset using evaluation degree, selecting the number of third preset index marks as the number of current index marks;
and when the use evaluation degree is larger than the third preset use evaluation degree, selecting the number of the fourth preset index marks as the number of the current index marks.
4. The parasitic parameter compression extraction system of claim 1,
calculating an average data heat value in the data partition, arranging the average data heat value according to sequence, storing the data partition with the average data heat value larger than a preset data heat value threshold into a first storage space, and storing the data partition with the average data heat value smaller than the preset data heat value threshold into a second storage space, wherein the first storage space and the second storage space are arranged on a memory;
based on an LZMA algorithm, compressing the interactive data in the first storage space and the second storage space according to a preset compression algorithm, comparing the compressed interactive data with corresponding interactive data in the first storage space and the second storage space, and if the comparison is successful, storing the compressed interactive data in a third storage space.
5. The parasitic parameter compression and extraction system of claim 4, comprising, prior to storing the compressed interaction data in the third storage space:
calculating the storage duty ratio of the current interaction data in real time, and determining whether the third storage space meets the storage condition according to the storage duty ratio;
if the storage condition is met, storing the current interaction data into the third storage space;
if the storage condition is not met, determining a supplementary space according to the storage duty ratio, comparing a second empty storage duty ratio of the second storage space with the supplementary space, if the second empty storage duty ratio is larger than the supplementary space, taking the second empty storage duty ratio as a first external storage space of the third storage space, if the second empty storage duty ratio is smaller than the supplementary space, calculating a difference space between the supplementary space and the empty storage duty ratio, and taking the first empty storage duty ratio of the first storage space as a second external storage space of the third storage space based on the difference space;
the storage condition is that the third space storage duty ratio of the third storage space is larger than the storage duty ratio of the current interaction data, and the storage quality of the current interaction data can be ensured.
6. A parasitic parameter compression extraction method applied to the parasitic parameter compression extraction system as claimed in any one of claims 1 to 5, comprising:
acquiring interaction data in a parasitic parameter extraction process, and preprocessing the interaction data;
based on an LZMA algorithm, compressing the preprocessed interactive data, and storing the compressed interactive data in a memory;
decompressing the interaction data and extracting preset interaction data, and further calculating a use evaluation degree of a second CPU before extracting the preset interaction data, and determining feature extraction data in the preset interaction data according to the use evaluation degree;
extracting and decompressing the compressed interactive data, and setting an index identifier for the decompressed interactive data;
extracting corresponding interaction data in the first extraction sub-module according to the index identifier;
acquiring the extraction condition and the use condition of the second CPU when the interactive data are extracted, and calculating the use evaluation degree of the second CPU according to the extraction condition and the use condition;
extracting and decompressing the compressed interactive data, setting index identification for the decompressed interactive data, and comprising:
acquiring interactive data to be decompressed, extracting the interactive data in the corresponding data partition and storage space from the memory according to the data bit length and the data heat value corresponding to the interactive data to be decompressed, and performing decompression operation on the interactive data to be decompressed and the interactive data extracted from the memory to obtain a decompression result;
converting the interactive data in the decompression result into an interactive text file, wherein the interactive text file comprises a plurality of interactive text segments, and each interactive text segment is provided with an index identifier;
the interactive text file comprises search indexes of a plurality of index elements, each index element is associated with index identifiers in one or more interactive text segments, one or more interactive text segments in the interactive text file can be detected by using the index element of one search index, and preset contents of the interactive data can be positioned by using the index identifier in each interactive text segment;
calculating the use evaluation degree of the second CPU according to the extraction condition and the use condition, wherein the method comprises the following steps:
the extraction condition of the second CPU comprises an extraction rate K1 and an extraction integrity K2, the use condition of the second CPU comprises a second CPU occupancy rate N1 and a second CPU load value N2, and the use evaluation degree of the second CPU is calculated according to the extraction condition and the use condition;
the second CPU has a usage evaluation degree of:
;
wherein W is the usage evaluation degree of the second CPU, a1 is the weight coefficient in the case of extraction rate to the second CPU extraction, a2 is the weight coefficient in the case of extraction integrity to the second CPU extraction, and a1+a2=1; m1 is a weight coefficient in the use case of the second CPU by the second CPU occupancy, m2 is a weight coefficient in the use case of the second CPU by the second CPU load value, m1+m2=1, q1 is a weight coefficient of the use evaluation degree by the extraction case, q2 is a weight coefficient of the use evaluation degree by the use case, and q1+q2=1;
preprocessing the interaction data, including:
the preprocessing comprises the steps of obtaining the data bit length of the interactive data, and determining the data partition and partition quantity of the interactive data according to the data bit length of the interactive data;
the method comprises the steps of presetting a first preset data bit length, a second preset data bit length and a third preset data bit length, wherein the first preset data bit length is smaller than the second preset data bit length, and the second preset data bit length is smaller than the third preset data bit length; the method comprises the steps of setting a first preset partition amount, a second preset partition amount, a third preset partition amount and a fourth preset partition amount in advance, wherein the first preset partition amount is smaller than the second preset partition amount, the second preset partition amount is smaller than the third preset partition amount, and the third preset partition amount is smaller than the fourth preset partition amount;
when the data bit length is smaller than a first preset data bit length, selecting the first preset partition amount as a current partition amount;
when the data bit length is between a first preset data bit length and a second preset data bit length, selecting the second preset partition amount as a current partition amount;
when the data bit length is between the second preset data bit length and the third preset data bit length, selecting the third preset partition amount as the current partition amount;
and when the data bit length is longer than the third preset data bit length, selecting the fourth preset partition amount as the current partition amount.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311206468.1A CN117220686B (en) | 2023-09-18 | 2023-09-18 | Parasitic parameter compression and extraction system and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311206468.1A CN117220686B (en) | 2023-09-18 | 2023-09-18 | Parasitic parameter compression and extraction system and method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117220686A CN117220686A (en) | 2023-12-12 |
CN117220686B true CN117220686B (en) | 2024-02-23 |
Family
ID=89045859
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311206468.1A Active CN117220686B (en) | 2023-09-18 | 2023-09-18 | Parasitic parameter compression and extraction system and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117220686B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113726341A (en) * | 2021-08-25 | 2021-11-30 | 杭州海康威视数字技术股份有限公司 | Data processing method and device, electronic equipment and storage medium |
CN114490547A (en) * | 2022-02-16 | 2022-05-13 | 平安科技(深圳)有限公司 | Data compression method, device, equipment and medium |
CN114610650A (en) * | 2022-03-11 | 2022-06-10 | Oppo广东移动通信有限公司 | Memory compression method and device, storage medium and electronic equipment |
CN115858163A (en) * | 2022-12-14 | 2023-03-28 | 天翼物联科技有限公司 | Method and device for storing mass equipment instruction data of Internet of things platform |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10789231B2 (en) * | 2016-05-19 | 2020-09-29 | Oracle International Corporation | Spatial indexing for distributed storage using local indexes |
-
2023
- 2023-09-18 CN CN202311206468.1A patent/CN117220686B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113726341A (en) * | 2021-08-25 | 2021-11-30 | 杭州海康威视数字技术股份有限公司 | Data processing method and device, electronic equipment and storage medium |
CN114490547A (en) * | 2022-02-16 | 2022-05-13 | 平安科技(深圳)有限公司 | Data compression method, device, equipment and medium |
CN114610650A (en) * | 2022-03-11 | 2022-06-10 | Oppo广东移动通信有限公司 | Memory compression method and device, storage medium and electronic equipment |
CN115858163A (en) * | 2022-12-14 | 2023-03-28 | 天翼物联科技有限公司 | Method and device for storing mass equipment instruction data of Internet of things platform |
Also Published As
Publication number | Publication date |
---|---|
CN117220686A (en) | 2023-12-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110399104B (en) | Data storage method, data storage device, electronic apparatus, and storage medium | |
US20140215170A1 (en) | Block Compression in a Key/Value Store | |
WO2021091489A1 (en) | Method and apparatus for storing time series data, and server and storage medium thereof | |
CN108108394B (en) | Compressed file recovery method and storage medium of APFS file system | |
CN102567522B (en) | Method and device for managing file system of intelligent card | |
CN101464910B (en) | Balance clustering compression method based on data similarity | |
CN111061752B (en) | Data processing method and device and electronic equipment | |
EP3422204A1 (en) | Method and apparatus for archiving database, and method and apparatus for searching archived database | |
US9519543B2 (en) | Method and apparatus for image search using feature point | |
CN104298736A (en) | Method and device for aggregating and connecting data as well as database system | |
CN111611250A (en) | Data storage device, data query method, data query device, server and storage medium | |
CN110442489B (en) | Method of data processing and storage medium | |
CN108446399B (en) | Dynamic storage optimization method for structured massive real-time data | |
CN116915259A (en) | Bin allocation data optimized storage method and system based on internet of things | |
CN117220686B (en) | Parasitic parameter compression and extraction system and method | |
CN107301203B (en) | Mass data comparison method and system | |
CN107330061B (en) | File deletion method and device based on distributed storage | |
US20180294819A1 (en) | Method and system for compressing data from smart meter | |
CN102479211A (en) | Mass data processing system and method on basis of database | |
CN111984651A (en) | Column type storage method, device and equipment based on persistent memory | |
CN108880559A (en) | Data compression method, uncompressing data, compression device and decompression apparatus | |
CN116301635A (en) | Data storage platform | |
CN116011403A (en) | Repeated data identification method for computer data storage | |
CN114255840B (en) | Intelligent data processing system based on clinical research data | |
CN108629356B (en) | Data storage method and device for classified application of electric loads |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |