CN114817831A - Computing auxiliary method for building engineering economy - Google Patents

Computing auxiliary method for building engineering economy Download PDF

Info

Publication number
CN114817831A
CN114817831A CN202210754358.8A CN202210754358A CN114817831A CN 114817831 A CN114817831 A CN 114817831A CN 202210754358 A CN202210754358 A CN 202210754358A CN 114817831 A CN114817831 A CN 114817831A
Authority
CN
China
Prior art keywords
data
data segment
processed
time sequence
engineering project
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210754358.8A
Other languages
Chinese (zh)
Other versions
CN114817831B (en
Inventor
邹金森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan Highway Engineering Consulting And Supervision Co ltd
Original Assignee
Sichuan Highway Engineering Consulting And Supervision Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan Highway Engineering Consulting And Supervision Co ltd filed Critical Sichuan Highway Engineering Consulting And Supervision Co ltd
Priority to CN202210754358.8A priority Critical patent/CN114817831B/en
Publication of CN114817831A publication Critical patent/CN114817831A/en
Application granted granted Critical
Publication of CN114817831B publication Critical patent/CN114817831B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/08Construction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Computational Mathematics (AREA)
  • Economics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Mathematical Analysis (AREA)
  • Pure & Applied Mathematics (AREA)
  • Fuzzy Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Algebra (AREA)
  • Mathematical Optimization (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention relates to the technical field of electronic digital data processing, in particular to a calculation auxiliary method for construction engineering economy, which comprises the following steps: acquiring engineering project time sequence data to be processed, determining each data segment corresponding to the engineering project time sequence data to be processed by utilizing an electronic digital data processing technology, and further determining binary codes corresponding to each data in each data segment; the method comprises the steps of obtaining data which are not read frequently and data which are read frequently in each data section corresponding to engineering project time sequence data to be processed, determining a single bit value corresponding to the data which are not read frequently and a plurality of bit values corresponding to the data which are read frequently according to a read data index and a write data index corresponding to the data which are read frequently in each data section and binary codes corresponding to the data in each data section, and performing compression coding processing on each data section corresponding to the engineering project time sequence data to be processed. The invention improves the reading speed of the compressed engineering project time sequence data.

Description

Computing auxiliary method for construction engineering economy
Technical Field
The invention relates to the technical field of electronic digital data processing, in particular to a calculation auxiliary method for construction engineering economy.
Background
The construction engineering economy is usually accompanied by a large amount of time series data, a large amount of data can be generated along with the advance of an engineering project, such as capital expenditure, profitability expenditure, non-business expenditure, profit allocation expenditure, profit income and other asset expense data, the data volume of the data is large, the storage of the data needs to occupy a large memory, meanwhile, account clearing needs to be carried out when the engineering project is finished, and the efficient and reliable storage and retrieval become the current prominent problems in the face of explosive growth of time series data.
The traditional database provides a solution for data safety, reliability and convenience in query, but in the face of massive time series data, the traditional storage solution is heavy and inefficient. For example, when data is stored, the situation that the system is jammed and occupies too much system resources due to too much data writing amount often occurs; when reading data, the data is often too large to be read, and the reading speed is often slow. With the development of storage compression technology, a NoSQL (Not only SQL, a non-relational database) database storage scheme has appeared, which alleviates the pressure of reading data to some extent, but the storage scheme is still Not mature for compression of massive time series data, and when decoding compressed data, the storage scheme often needs to decode from the beginning, resulting in a slow speed of reading data.
Disclosure of Invention
In order to solve the problem that the conventional compressed storage has low data reading speed, the invention aims to provide a calculation auxiliary method for building engineering economy.
The invention provides a calculation auxiliary method for construction engineering economy, which comprises the following steps:
acquiring engineering project time sequence data to be processed, determining each data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, and further determining binary codes corresponding to each data in each data segment;
acquiring infrequently read data in each data segment corresponding to engineering project time sequence data to be processed, determining a bit value range corresponding to each data in each data segment according to binary codes corresponding to each data in each data segment, and further determining a single bit value corresponding to the infrequently read data in each data segment;
acquiring data which is read frequently in each data segment corresponding to engineering project time sequence data to be processed, a read data index and a write data index corresponding to the data which is read frequently, and determining a plurality of bit values corresponding to the data which is read frequently in each data segment according to a single bit value corresponding to the data which is not read frequently in each data segment, a binary code corresponding to the data which is read frequently in each data segment, a read data index and a write data index corresponding to the data which is read frequently in each data segment;
and performing compression coding processing on each data segment corresponding to the engineering project time sequence data to be processed according to the binary codes corresponding to the data in each data segment, the single bit values corresponding to the data which is not read frequently in each data segment and the multiple bit values corresponding to the data which is read frequently, so as to obtain the engineering project time sequence data to be processed after the compression coding.
Further, the step of determining a single bit value corresponding to the data that is not frequently read in each data segment includes:
determining a plurality of byte numbers corresponding to each data segment under different bit values according to a binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed and a bit value range corresponding to each data in each data segment, and further determining a minimum byte number corresponding to each data segment;
determining a bit value corresponding to the minimum byte number corresponding to each data segment according to the minimum byte number corresponding to each data segment corresponding to the engineering project time sequence data to be processed, and taking the bit value corresponding to the minimum byte number corresponding to each data segment as a single bit value corresponding to the data which is not frequently read in each data segment.
Further, the calculation formula for determining the number of bytes corresponding to each data segment under different bit values is as follows:
Figure 100002_DEST_PATH_IMAGE002
Figure 100002_DEST_PATH_IMAGE004
wherein the content of the first and second substances,
Figure 100002_DEST_PATH_IMAGE005
the number of bytes corresponding to each data segment corresponding to the engineering project time sequence data to be processed under any bit value,
Figure 100002_DEST_PATH_IMAGE006
a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqAny bit value of the bit layer is,
Figure 100002_DEST_PATH_IMAGE007
a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqThe number of bit blocks corresponding to an arbitrary bit value of the bit layer,Qthe number of bit layers corresponding to any bit value of the DACTS codes of each data segment corresponding to the engineering project time sequence data to be processed,Bin order to preset the minimum bit value,
Figure 100002_DEST_PATH_IMAGE008
the maximum binary code length corresponding to each data segment corresponding to the engineering project time sequence data to be processed,
Figure 100002_DEST_PATH_IMAGE009
in order to find the function of the maximum value,xand the code length of the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed is determined.
Further, the step of determining a plurality of bit values corresponding to data that is frequently read in each data segment includes:
determining the number of bytes when each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment under the single bit value corresponding to the data which is read infrequently and the number of bytes when each bit layer corresponding to the single bit value corresponding to the data which is read infrequently in each data segment is not complemented;
determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the byte number when each bit layer is not complemented and the byte number when the byte number is complemented, the read data index and the write data index corresponding to the data which is read frequently, which correspond to the single bit value corresponding to the data which is not read frequently of the DACTS code of the data which is read frequently in each data segment;
and determining the corrected bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment, thereby determining a plurality of bit values corresponding to the data which is read frequently in each data segment.
Further, a calculation formula for determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment is as follows:
Figure 100002_DEST_PATH_IMAGE011
wherein the content of the first and second substances,
Figure 100002_DEST_PATH_IMAGE012
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,
Figure 100002_DEST_PATH_IMAGE013
DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is zero-padded,
Figure 100002_DEST_PATH_IMAGE014
DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is not zero padded,Rthe data reading indexes corresponding to the data which is read frequently in each data segment,Wwriting data indexes corresponding to data which are read frequently in each data segment;
the calculation formula for determining the modified bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment is as follows:
Figure 100002_DEST_PATH_IMAGE016
wherein the content of the first and second substances,
Figure 130779DEST_PATH_IMAGE012
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,
Figure 100002_DEST_PATH_IMAGE017
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified bit values of the individual bit layers,
Figure 100002_DEST_PATH_IMAGE018
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe number of bit blocks corresponding to the modified bit values of the individual bit layers.
Further, the step of determining each data segment corresponding to the engineering project time sequence data to be processed includes:
determining each initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed;
determining modification necessity indexes of all initial data segments according to all initial data segments corresponding to the engineering project time sequence data to be processed;
and modifying each initial data segment corresponding to the engineering project time sequence data to be processed according to the modification necessity index of each initial data segment, thereby determining each data segment corresponding to the engineering project time sequence data to be processed.
Further, the step of determining each initial data segment corresponding to the engineering project time sequence data to be processed includes:
determining a first initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, wherein the determining step of the first initial data segment is as follows: determining data segment balance corresponding to a preliminary initial data segment composed of the first 2 data according to the first 2 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 2 data is not less than a preset data segment balance threshold value, determining data segment balance corresponding to the preliminary initial data segment composed of the first 3 data according to the first 3 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 3 data is not less than the preset data segment balance threshold value, determining data segment balance corresponding to the preliminary initial data segment composed of the first 4 data according to the first 4 data in the engineering project time sequence data to be processed, and repeating the steps until the first data in the engineering project time sequence data to be processedmThe data segment balance corresponding to the preliminary initial data segment composed of the data is smaller than the preset data segment balance threshold value, and the front part in the engineering project time sequence data to be processed is processed
Figure 100002_DEST_PATH_IMAGE019
A preliminary initial data segment composed of the data is used as a first initial data segment corresponding to the engineering project time sequence data to be processed;
and according to the time sequence data of the engineering project to be processed after the first initial data segment, obtaining a second initial data segment corresponding to the time sequence data of the engineering project to be processed by referring to the determining step of the first initial data segment, according to the time sequence data of the engineering project to be processed after the second initial data segment, obtaining a third initial data segment corresponding to the time sequence data of the engineering project to be processed by referring to the determining step of the first initial data segment, and continuously repeating the step of determining the initial data segments until the time sequence data of the engineering project to be processed all have the initial data segments corresponding to the time sequence data of the engineering project to be processed.
Further, determining the front in the engineering project time sequence data to be processedmThe calculation formula of the data segment equilibrium corresponding to the preliminary initial data segment composed of the data is as follows:
Figure 100002_DEST_PATH_IMAGE021
wherein the content of the first and second substances,Efor pre-in-time sequence data of engineering project to be processedmThe data segment balance corresponding to the preliminary initial data segment formed by the data,
Figure 100002_DEST_PATH_IMAGE022
for pre-in-time sequence data of engineering project to be processedmIn a preliminary initial data section consisting of individual datajData, β is a hyperparameter.
Further, the step of determining the modification necessity indicators for each initial data segment includes:
determining the data average value of each initial data segment corresponding to the engineering project time sequence data to be processed according to each initial data segment corresponding to the engineering project time sequence data to be processed;
calculating the difference value between each data in each initial data segment corresponding to the engineering project time sequence data to be processed and the data average value of each initial data segment, thereby obtaining the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed;
and determining the modification necessity index of each initial data segment according to the binary code of each initial data segment corresponding to the engineering project time sequence data to be processed and the binary code of the differential data segment of each initial data segment.
Further, the step of modifying each initial data segment corresponding to the engineering project time sequence data to be processed includes:
and if the modification necessity index of any initial data segment corresponding to the engineering project time sequence data to be processed is larger than the preset modification necessity index threshold, modifying the initial data segment corresponding to the engineering project time sequence data to be processed, and taking the differential data segment of the initial data segment as the data segment corresponding to the engineering project time sequence data to be processed.
The invention has the following beneficial effects:
according to the method and the device, the data segments corresponding to the engineering project time sequence data to be processed are determined by acquiring the engineering project time sequence data to be processed, and the compression rate of the engineering project time sequence data to be processed is improved. And determining a single bit value corresponding to the data which is not read frequently and a plurality of bit values corresponding to the data which is read frequently in each data segment through each data segment corresponding to the engineering project time sequence data to be processed. And obtaining the compressed and encoded engineering project time sequence data to be processed according to each data segment corresponding to the engineering project time sequence data to be processed, a single bit value corresponding to the data which is not read frequently in each data segment and a plurality of bit values corresponding to the data which is read frequently.
The invention utilizes the single bit value corresponding to the infrequently read data to perform compression storage processing on the infrequently read data in the engineering project time sequence data to be processed, and utilizes the multiple bit values corresponding to the frequently read data to perform compression storage processing on the frequently read data in the engineering project time sequence data to be processed.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions and advantages of the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
Fig. 1 is a flowchart of a calculation assistance method for construction economy in an embodiment of the present invention.
Detailed Description
To further explain the technical means and effects of the present invention adopted to achieve the predetermined objects, the following detailed description of the embodiments, structures, features and effects of the technical solutions according to the present invention will be given with reference to the accompanying drawings and preferred embodiments. In the following description, different references to "one embodiment" or "another embodiment" do not necessarily refer to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
In this embodiment, by acquiring the engineering project time series data to be processed, determining the bit values and the number of bit layers corresponding to different types of data in each data segment corresponding to the engineering project time series data to be processed, and compressing and storing binary codes corresponding to each data in each data segment corresponding to the engineering project time series data to be processed by using the bit values and the number of bit layers corresponding to the different types of data in each data segment, the purpose of quickly and accurately acquiring data is achieved, and calculation and verification of accounts are assisted to be completed. The embodiment provides a calculation assisting method for building engineering economy, which comprises the following steps as shown in figure 1:
(1) the method comprises the steps of obtaining engineering project time sequence data to be processed, determining each data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, and further determining binary codes corresponding to the data in each data segment.
In the advancing process of the construction project, the project time series data to be processed are obtained, the project time series data to be processed comprise various asset cost data such as capital expenditure, profitable expenditure, non-business expenditure, profit allocation expenditure and profit corresponding to each timestamp, and the project time series data to be processed are large in data volume.
After the engineering project time sequence data to be processed is obtained, determining each data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, wherein the steps comprise:
and (1-1) determining each initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed.
It should be noted that, for a single piece of data in the engineering project time series data to be processed, if the numerical value of the single piece of data is large, a large number of index values are required when compressing the binary code corresponding to the single piece of data, resulting in a low data compression rate. In order to improve the data compression rate, this embodiment divides the data with similar storage time into each initial data segment, that is, determines each initial data segment corresponding to the engineering project time series data to be processed, and the steps include:
(1-1-1) determining a first initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, wherein the determining step of the first initial data segment is as follows: determining data segment balance corresponding to a preliminary initial data segment composed of the first 2 data according to the first 2 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 2 data is not less than a preset data segment balance threshold, and in this embodiment, setting the preset data segment balance threshold to 0.7, determining data segment balance corresponding to a preliminary initial data segment composed of the first 3 data according to the first 3 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 3 data is not less than the preset data segment balance threshold 0.7, determining data segment balance corresponding to a preliminary initial data segment composed of the first 4 data according to the first 4 data in the engineering project time sequence data to be processed, and continuously repeating the above stepsStep, until the front in the engineering project time sequence data to be processedmThe data segment balance corresponding to the preliminary initial data segment composed of the data is smaller than the preset data segment balance threshold value, and the front part in the engineering project time sequence data to be processed is processed
Figure 625084DEST_PATH_IMAGE019
A preliminary initial data segment composed of the data is used as a first initial data segment corresponding to the engineering project time sequence data to be processed, and the first initial data segment comprises the front part in the engineering project time sequence data to be processed
Figure 857351DEST_PATH_IMAGE019
And (4) data.
A preliminary initial data segment consisting of the first 2 data, a preliminary initial data segment consisting of the first 3 data, a preliminary initial data segment consisting of the first 4 data and a preliminary initial data segment of the first 2 data in the engineering project time sequence data to be processedmThe calculation process of the data segment balance corresponding to the preliminary initial data segment composed of the data is kept consistent, and the embodiment is used for determining the front part in the time sequence data of the engineering project to be processedmData segment equalization corresponding to preliminary initial data segment of individual data components is taken as an examplemThe calculation formula of the data segment equilibrium corresponding to the preliminary initial data segment composed of the data is as follows:
Figure 190243DEST_PATH_IMAGE021
wherein the content of the first and second substances,Efor pre-in-time sequence data of engineering project to be processedmThe data segment balance corresponding to the preliminary initial data segment formed by the data,
Figure 457145DEST_PATH_IMAGE022
for the front in the time series data of the project to be processedmIn a preliminary initial data section consisting of individual datajData, beta is a hyper-parameter, empirical values are taken
Figure DEST_PATH_IMAGE023
It should be noted that, the front in the time series data of the project to be processedmThe greater the data segment balance corresponding to the preliminary initial data segment composed of the data is, the higher the similarity of the data in the preliminary initial data segment is, that is, the smaller the numerical difference between the data in the preliminary initial data segment is, the greater the possibility that the preliminary initial data segment is the initial data segment corresponding to the time series data of the engineering project to be processed is.
(1-1-2) according to the engineering project time sequence data to be processed after the first initial data segment, referring to the determining step of the first initial data segment to obtain a second initial data segment corresponding to the engineering project time sequence data to be processed, according to the engineering project time sequence data to be processed after the second initial data segment, referring to the determining step of the first initial data segment to obtain a third initial data segment corresponding to the engineering project time sequence data to be processed, and continuously repeating the step of determining the initial data segments until the engineering project time sequence data to be processed all have the initial data segments corresponding to the engineering project time sequence data to be processed.
In this embodiment, the data that is not attributed in the engineering project time series data to be processed, that is, the previous data in the engineering project time series data to be processed is determined through each data in the first initial data segment corresponding to the engineering project time series data to be processed obtained in step (1-1-1)
Figure 404241DEST_PATH_IMAGE019
The data after the data is according to the front part in the time sequence data of the engineering project to be processed
Figure 608958DEST_PATH_IMAGE019
And the data after the data is obtained by referring to the determination process of the first initial data segment corresponding to the engineering project time sequence data to be processed to obtain a second initial data segment corresponding to the engineering project time sequence data to be processed, and continuously repeating the step of determining the initial data segments until the engineering project time sequence data to be processed all have the corresponding initial data segments.
And (1-2) determining the modification necessity index of each initial data segment according to each initial data segment corresponding to the engineering project time sequence data to be processed.
It should be noted that the modification necessity index of each initial data segment calculated in this embodiment is convenient for modifying data in some initial data segments corresponding to the engineering project time series data to be subsequently processed, and the step of determining the modification necessity index of each initial data segment includes:
and (1-2-1) determining the data average value of each initial data segment corresponding to the engineering project time sequence data to be processed according to each initial data segment corresponding to the engineering project time sequence data to be processed.
In the embodiment, the first time sequence data corresponding to the engineering project time sequence data to be processed is determinedaTaking the average value of the initial data segment as an example, and according to the first time sequence data corresponding to the engineering project time sequence data to be processedaEach data in the initial data segment, theaAn initial data segment is marked asM
Figure DEST_PATH_IMAGE024
={
Figure DEST_PATH_IMAGE025
Is calculated to beaThe average value of the data of the initial data segments is calculated by the formula:
Figure DEST_PATH_IMAGE027
wherein the content of the first and second substances,
Figure DEST_PATH_IMAGE028
the first one corresponding to the engineering project time sequence data to be processedaThe average value of the data of the initial data segments,mthe first one corresponding to the engineering project time sequence data to be processedaThe number of data of each initial data segment,xthe first one corresponding to the engineering project time sequence data to be processedaEach data of the initial data segment.
It should be noted that reference to the second corresponding to the engineering project time series data to be processedaInitial number ofAnd determining the data average value of the data segments, and determining the data average value of each initial data segment corresponding to the engineering project time sequence data to be processed according to each data in each initial data segment corresponding to the engineering project time sequence data to be processed.
And (1-2-2) calculating the difference value between each data in each initial data segment corresponding to the engineering project time sequence data to be processed and the data average value of each initial data segment, so as to obtain the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed.
In the embodiment, the first time sequence data corresponding to the engineering project time sequence data to be processed is determinedaTaking the differential data segment of the initial data segment as an example, according to the second corresponding to the engineering project time sequence data to be processedaEach data in the initial data segment and the second data obtained in the step (1-2-1)aThe average value of the data corresponding to the initial data segment respectively calculates the first time sequence data of the engineering project to be processedaEach data in the initial data segment and the second data segmentaThe difference value of the average values of the data corresponding to the initial data segments is calculated by the following formula:
Figure DEST_PATH_IMAGE029
={
Figure DEST_PATH_IMAGE030
taking each difference as the secondaA differential data segment of the initial data segments.
It should be noted that reference to the second corresponding to the engineering project time series data to be processedaAnd determining the differential data segment of each initial data segment to obtain the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed. At this time, the differential data segment of each initial data segment can be represented by a shorter encoding code length by increasing the index value, which helps to improve the compression rate of the project engineering time series data to be processed.
For example, the firstaAn initial data segment
Figure 97271DEST_PATH_IMAGE024
={
Figure DEST_PATH_IMAGE031
}, then the firstaData mean of initial data segment
Figure DEST_PATH_IMAGE032
Of 1 ataA differential data segment of the initial data segment
Figure DEST_PATH_IMAGE033
={
Figure DEST_PATH_IMAGE034
At this time, the secondaThe binary code length of each data in the initial data segment is 7 bits, and the second oneaThe maximum code length of binary coding of each data in the differential data segment of the initial data segment is only 2 bits.
And (1-2-3) determining the modification necessity index of each initial data segment according to the binary code of each initial data segment corresponding to the engineering project time sequence data to be processed and the binary code of the differential data segment of each initial data segment.
It should be noted that, the code length of each data in the differential data segment of each initial data segment corresponding to the engineering project time series data to be processed may be reduced to different degrees, but an index needs to be added when each data in the differential data segment of each initial data segment is processed, the index is added to facilitate data differentiation, when the number of data in the differential data segment of the initial data segment is too small, the code length of each data reduced in the differential data segment will be smaller than the length of the index added to each data in the differential data segment, and at this time, the necessity of modifying the initial data segment into the differential data segment will be low, that is, the index of necessity of modifying the initial data segment is low.
In this embodiment, according to the binary code of each data in each initial data segment corresponding to the engineering project time series data to be processed and the binary code of each data in the differential data segment of each initial data segment, the code length of each initial data segment and the code length of the differential data segment of each initial data segment are obtained, and the modification necessity index of each initial data segment is obtained by calculating the difference between the code length of each initial data segment and the code length of the differential data segment of each initial data segment, in which the calculated difference is used as the modification necessity index of the initial data segment, and the calculation formula of the modification necessity index is as follows:
Figure DEST_PATH_IMAGE036
wherein the content of the first and second substances,afor the modification necessity index of each initial data segment corresponding to the engineering project time sequence data to be processed,
Figure DEST_PATH_IMAGE037
for the code length of each initial data segment corresponding to the engineering project time sequence data to be processed,
Figure DEST_PATH_IMAGE038
and the code length of the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed is determined.
It should be noted that, if the modification necessity index of a certain initial data segment corresponding to the engineering project time sequence data to be processed is providedaIf the data segment is positive, the differential data segment of the initial data segment replaces the initial data segment to become a data segment corresponding to the time sequence data of the engineering project to be processed, and if the data segment is the modification necessity index of a certain initial data segment corresponding to the time sequence data of the engineering project to be processedaAnd if the data segment is negative or 0, discarding the differential data segment of the initial data segment, and taking the initial data segment as the data segment corresponding to the engineering project time sequence data to be processed.
And (1-3) modifying each initial data segment corresponding to the engineering project time sequence data to be processed according to the modification necessity index of each initial data segment, thereby determining each data segment corresponding to the engineering project time sequence data to be processed.
In this embodiment, each initial data segment corresponding to the engineering project time sequence data to be processed is modified according to the modification necessity index of each initial data segment obtained in step (1-2-3), and if the modification necessity index of any one initial data segment corresponding to the engineering project time sequence data to be processed is greater than the preset modification necessity index threshold, the preset modification necessity index threshold is set to 0, and the differential data segment of the initial data segment is used as the data segment corresponding to the engineering project time sequence data to be processed.
The modification necessity index of each initial data segment corresponding to the engineering project time sequence data to be processed is compared with a preset modification necessity index threshold value 0, so that modification of some initial data segments corresponding to the engineering project time sequence data to be processed is realized, namely, a differential data segment of the initial data segment which is larger than the preset modification necessity index threshold value 0 replaces the corresponding initial data segment to be a data segment corresponding to the engineering project time sequence data to be processed, and each data segment corresponding to the engineering project time sequence data to be processed is determined.
At this time, each data segment corresponding to the engineering project time sequence data to be processed is obtained in this embodiment, many floating points generally exist in each data segment corresponding to the engineering project time sequence data to be processed, and the floating points are important data types of the engineering project time sequence data.
For example, data 25.125 in each data segment corresponding to the engineering project time series data to be processed converts data 25.125 into a binary coding format by using a conversion format of a scientific counting method, and the conversion process is as follows:
25.125(D) = 11001.001(B) = 1.1001001*2^4(B)
it should be noted that D represents decimal and B represents binary in the process of converting the data 25.125 into binary code format. The conversion format of the scientific counting method is the prior art, that is, the process of acquiring the binary code corresponding to the data is the prior art, which is not in the protection scope of the present invention, and the detailed description is not repeated here.
(2) The method comprises the steps of obtaining data which are not read frequently in each data segment corresponding to engineering project time sequence data to be processed, determining a bit value range corresponding to each data in each data segment according to binary codes corresponding to each data in each data segment, and further determining a single bit value corresponding to the data which are not read frequently in each data segment.
Firstly, it should be noted that, when time series data is compressed and encoded, the commonly used compression encoding mode is fixed length encoding and variable length encoding, the fixed length encoding takes the maximum encoding bit number in the current time series data as the reference, the encoding bit numbers of all the time series data are unified into the maximum encoding bit number, the compression encoding mode can cause the waste of storage space, and the data compression ratio is low; variable length coding is based on the minimum number of coding bits in current time series data, and since the coding length of each time series data is variable, the position of target data cannot be obtained during decoding, decoding must be performed sequentially from the first time series data, and decoding efficiency and data reading speed are low. In order to simultaneously take Compression rate and decoding efficiency into account, a DACTS (Direct Access Compression of time series) encoding algorithm is proposed, but the encoding algorithm relies on artificially set parameters, which are bit values, and the size of the bit values affects the Compression rate and decoding efficiency of time series data. In the embodiment, the bit value is obtained through data information self-adaptation, and is adjusted, so that the bit value is suitable for auxiliary calculation of the engineering project time sequence data to be processed.
The traditional DACTS coding algorithm for artificially setting parameter bit values operates in the following mode:
the coding mode during coding is as follows: given time series data sequence
Figure DEST_PATH_IMAGE039
={
Figure DEST_PATH_IMAGE040
Carrying out binary coding processing on the time sequence data sequence to obtain binary coding of the time sequence data sequence, and determining a bit value according to human experiencebThe binary code of the time sequence data sequence is divided into bit blocks with the size of the bit block
Figure DEST_PATH_IMAGE041
The high order of each bit block is an identifier which indicates whether the bit block is data or not
Figure DEST_PATH_IMAGE042
The bit block slicing is performed from the last bit to the first bit of the binary code corresponding to the data.
For example, given a time series of data
Figure 723075DEST_PATH_IMAGE039
={
Figure DEST_PATH_IMAGE043
Value of bitbTo 2, the encoding process is shown in table 1 below:
TABLE 1
Serial number
Figure DEST_PATH_IMAGE045
Binary coding DACTS
1 4 100 001 100
2 2 10 010
3 10 1010 010 110
4 1 1 001
5 21 10101 001 101 101
6 5 101 001 101
7 19 10011 001 100 111
In Table 1, the bit value is 2, and the timing sequence dataData 4 in the sequence is DACTS encoded, the binary encoding of data 4 is 100, and binary encoding 100 is divided into 1 and 00. The slicing sequence of the sliced bit block is from back to front, that is, 2 bits are taken from 0 in the binary code 100 to front, when the last code in the binary code is less than 2 bits, 0 is complemented, and after the binary code 100 of the data 4 is complemented by zero, 0100 is obtained. At this time, when looking from back to front, the binary code segment where the binary code 0100, 00 after zero padding is located is preceded by the binary code segment 01, so the identifier of the binary code segment 00 is 1, the DACTS coded bit block of the binary code segment is 100, and the binary code segment where 01 is located is not preceded by the binary code segment, so the identifier of the binary code segment 01 is 0, and the DACTS coded bit block of the binary code segment is 001, at this time, the DACTS code of data 4 in the time series data sequence is obtained. Similarly, a DACTS code of each data in the time series data sequence is obtained, that is, the time series data sequence is obtained
Figure 28023DEST_PATH_IMAGE039
={
Figure 454456DEST_PATH_IMAGE043
DACTS encoding of (1), the encoding results are shown in the following Table 2:
TABLE 2
Figure DEST_PATH_IMAGE047
In the above Table 2
Figure DEST_PATH_IMAGE048
Is shown asnA layer of one bit or more bits,
Figure DEST_PATH_IMAGE049
is shown asnA binary code segment corresponding to a time series data sequence in a bit layer,
Figure DEST_PATH_IMAGE050
is shown asnAnd identifiers of binary coding segments corresponding to time sequence data sequences in the bit layers.
The decoding mode during decoding is as follows: for example, the 5 th data in Table 2 is decoded, the 1 st bit layer in Table 2
Figure DEST_PATH_IMAGE051
Find the binary code segment 01 corresponding to the 5 th data and the identifier corresponding to the binary code segment 01
Figure DEST_PATH_IMAGE052
Identification of
Figure 955582DEST_PATH_IMAGE052
A value of 1 indicates that there is a binary segment before the binary segment 01, because of the 1 st bit layer in table 2
Figure 683496DEST_PATH_IMAGE051
Corresponding identifier of middle and first 5 data binary coding segments
Figure 51024DEST_PATH_IMAGE052
Of 1, there are 3, the 2 nd bit layer in Table 2
Figure DEST_PATH_IMAGE053
Continuously searching for binary code segment 01 corresponding to the 3 rd data and identifier corresponding to binary code segment 01
Figure 417283DEST_PATH_IMAGE052
Identification of
Figure 879357DEST_PATH_IMAGE052
To be 1, it is indicated that there is a binary segment before the binary segment 01, because of the 3 rd bit layer in table 2
Figure DEST_PATH_IMAGE054
Corresponding identifier of middle and first 3 data binary coding segments
Figure DEST_PATH_IMAGE055
1 for 1, bit layer 3 in Table 2
Figure 585145DEST_PATH_IMAGE054
Continuously searching for the binary code segment 01 corresponding to the 1 st data and the identifier corresponding to the binary code segment 01
Figure 107262DEST_PATH_IMAGE052
Identification of
Figure 773867DEST_PATH_IMAGE052
At this point, since the decoding is finished, the DACTS code corresponding to the 5 th data in table 2 is 001101101, the DACTS code corresponding to the 5 th data is converted into the binary code 10101, and the decoding is successful to obtain the data 21.
And (2-1) acquiring data which are not read frequently in each data segment corresponding to the engineering project time sequence data to be processed, and determining a bit numerical range corresponding to each data in each data segment according to a binary code corresponding to each data in each data segment.
In this embodiment, according to each data segment corresponding to the engineering project time series data to be processed and the read record of each data in each data segment, the data that is not frequently read in each data segment can be obtained. According to the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed, namely according to the code length corresponding to each data in each data segment, obtaining the value range of the bit value, wherein the value range of the bit value is
Figure DEST_PATH_IMAGE056
Wherein, in the step (A),bfor the bit value corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed,Bin order to preset the minimum bit value,
Figure DEST_PATH_IMAGE057
and the maximum code length corresponding to each data segment corresponding to the engineering project time sequence data to be processed is obtained. According to each bit value in the value range, the number of bit layers and the number of bit blocks corresponding to each bit value can be obtained, if a certain bit valueThe larger the bit value is, the smaller the number of bit layers and the number of bit blocks corresponding to the bit value are.
It should be noted that, by determining each data segment corresponding to the engineering project time series data to be processed, the compression rate of the engineering project time series data to be processed is effectively improved, and at this time, bit values corresponding to different types of data in each data segment need to be determined. The conventional bit values are determined according to human experience, and different bit values need to be selected for different engineering project time sequence data, so that the universal effectiveness and robustness of the artificially determined bit values are poor, so that the bit values corresponding to different types of data in each data segment are determined according to the data distribution condition of the engineering project time sequence data to be processed, and the different types of data are divided into: data that is not read frequently and data that is read frequently.
(2-2) determining a single bit value corresponding to the data which is not frequently read in each data segment according to the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed and the bit value range corresponding to each data in each data segment, wherein the steps comprise:
(2-2-1) determining a plurality of byte numbers corresponding to each data segment under different bit values according to the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed and the bit value range corresponding to each data in each data segment, and further determining the minimum byte number corresponding to each data segment.
According to binary codes corresponding to all data in all data sections corresponding to engineering project time sequence data to be processed, bit values in a bit value range, bit layer numbers corresponding to the bit values and bit block number bytes corresponding to the bit values, a gradient descent method is adopted for solving, the number of bytes corresponding to all the data sections corresponding to all the bit values under different bit values is calculated, namely each bit value has the number of bytes corresponding to the bit value, all the data sections corresponding to the engineering project time sequence data to be processed have a plurality of bytes, and a calculation formula for determining the number of bytes corresponding to each data section under different bit values is as follows:
Figure DEST_PATH_IMAGE058
Figure 898687DEST_PATH_IMAGE004
wherein the content of the first and second substances,
Figure DEST_PATH_IMAGE059
the number of bytes corresponding to each data segment corresponding to the engineering project time sequence data to be processed under any bit value,
Figure DEST_PATH_IMAGE060
a second code corresponding to DACTS codes of all data segments corresponding to the time sequence data of the engineering project to be processedqAny bit value of the bit layer is,
Figure 990139DEST_PATH_IMAGE007
a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqThe number of bit blocks corresponding to an arbitrary bit value of the bit layer,Qthe number of bit layers corresponding to any bit value of the DACTS codes of each data segment corresponding to the engineering project time sequence data to be processed,Bin order to preset the minimum bit value,
Figure 699469DEST_PATH_IMAGE057
the maximum binary code length corresponding to each data segment corresponding to the engineering project time sequence data to be processed,
Figure DEST_PATH_IMAGE061
in order to find the function of the maximum value,xand the code length of the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed is determined.
It should be noted that, for the bit value of each bit layer corresponding to the DACTS code of each data segment corresponding to the engineering project time series data to be processedbBit values of different bit layersbBit values of the same bit layer may differbMust be consistent, usually in terms of bit valuesbIn the range of 10 or less, the most common bit valuesbMay be equal to 2, 3 or 4. In addition, it should be noted that the process of solving by the gradient descent method is prior art and is not within the scope of the present invention, and the details are not described herein.
In this embodiment, after determining the number of bytes corresponding to each data segment, the number of bytes corresponding to each data segment under different bit values is arranged according to a certain arrangement order, and the minimum number of bytes is selected from the number of bytes corresponding to each arranged data segment under different bit values, so as to obtain the minimum number of bytes corresponding to each data segment.
(2-2-2) determining a bit value corresponding to the minimum byte number corresponding to each data segment according to the minimum byte number corresponding to each data segment corresponding to the engineering project time sequence data to be processed, and taking the bit value corresponding to the minimum byte number corresponding to each data segment as a single bit value corresponding to the data which is not read frequently in each data segment.
It should be noted that, according to the minimum byte number corresponding to each data segment obtained in the step (2-2-1), a bit value corresponding to the minimum byte number corresponding to each data segment is determined, and the bit value corresponding to the minimum byte number corresponding to each data segment only considers the compression ratio, at this time, the obtained compression result is not optimal, because a large number of read-write operations often exist in the engineering project time series data to be processed, this embodiment performs different compression and storage operations on different types of data in each data segment, so as to achieve both the compression ratio and the reading efficiency, and the reading efficiency is the decompression efficiency. The different types of data include data that is not frequently read and data that is frequently read.
For the data that is not frequently read in each data segment, the reading frequency of the data that is not frequently read is low, so the decoding efficiency of the data that is not frequently read will not be considered, in this embodiment, the bit value corresponding to the minimum byte number corresponding to each data segment is taken as the bit value corresponding to the data that is not frequently read in each data segment, that is, the bit value corresponding to the minimum byte number corresponding to each data segment is taken as the bit value of each bit layer corresponding to the DACTS encoding of the data that is not frequently read, and the bit values of each bit layer are kept consistent, so that a single bit value corresponding to the data that is not frequently read in each data segment is obtained.
(3) The method comprises the steps of obtaining data which are read frequently in each data section and correspond to engineering project time sequence data to be processed, and data reading indexes and data writing indexes which correspond to the data which are read frequently, and determining a plurality of bit values which correspond to the data which are read frequently in each data section according to a single bit value which corresponds to the data which are not read frequently in each data section, a binary code which corresponds to the data which are read frequently in each data section, and the data reading indexes and the data writing indexes which correspond to the data which are read frequently.
And (3-1) acquiring data which is read frequently in each data section corresponding to the engineering project time sequence data to be processed, and a read data index and a write data index corresponding to the data which is read frequently.
According to the data in each data segment corresponding to the engineering project time sequence data to be processed and the reading record and the writing record of the data in each data segment, the data which is read frequently in each data segment corresponding to the engineering project time sequence data to be processed, the reading data index and the writing data index corresponding to the data which is read frequently in each data segment can be obtained, and the reading data index and the writing data index of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment are consistent. The process of obtaining the binary code corresponding to the data that is read frequently, the read data index corresponding to the data that is read frequently, and the write data index corresponding to the data that is read frequently in each data segment is the prior art, and is not within the protection scope of the present invention, and is not elaborated here.
(3-2) determining a plurality of bit values corresponding to the data which is read frequently in each data segment according to a single bit value corresponding to the data which is not read frequently in each data segment, a binary code corresponding to the data which is read frequently in each data segment, a read data index and a write data index corresponding to the data which is read frequently in each data segment, wherein the steps comprise:
(3-2-1) determining the number of bytes when each bit layer of the DACTS code of the data which is read frequently in each data segment is not complemented and the number of bytes when the bits are complemented according to the single bit value corresponding to the data which is read infrequently in each data segment and the binary code corresponding to the data which is read frequently in each data segment.
In this embodiment, when performing DACTS encoding processing on binary codes corresponding to data that is read frequently in each data segment corresponding to engineering project time series data to be processed, zero padding processing needs to be performed on binary codes corresponding to some data that is read frequently, and according to a zero padding processing result in a DACTS encoding processing process corresponding to the data that is read frequently under a single bit value corresponding to the data that is not read frequently, the number of bytes when each bit layer corresponding to the DACTS encoding of the data that is read frequently in each data segment under the single bit value corresponding to the data that is not read frequently is determined, and the number of bytes when zero padding is performed.
(3-2-2) determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the byte number when each bit layer corresponding to the single bit value of the data which is read infrequently is not complemented and the byte number when the byte number is complemented, the read data index and the write data index corresponding to the data which is read frequently in each data segment.
In an embodiment, the second data segment is encoded by a DACTS corresponding to the data which is determined to be read frequently in each data segmentuTaking the corrected byte number of each bit layer as an example, according to the DACTS code of the data which is read frequently in each data segment, the corresponding second bit value under the single bit value corresponding to the data which is not read frequentlyuDetermining the number of bytes when the bit layer is not compensated, the number of bytes when the bit layer is compensated, the reading data index and the writing data index corresponding to the data which is read frequently, and determining the second byte corresponding to the DACTS code of the data which is read frequently in each data segmentuThe calculation formula of the corrected byte number of each bit layer is as follows:
Figure 46180DEST_PATH_IMAGE011
wherein the content of the first and second substances,
Figure DEST_PATH_IMAGE062
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,
Figure DEST_PATH_IMAGE063
DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is zero-padded,
Figure DEST_PATH_IMAGE064
DACTS encoding of frequently read data within respective data segments at respective single bit values of less frequently read datauThe number of bytes when the bit layer is not zero-padded,Rthe data reading indexes corresponding to the data which is read frequently in each data segment,Wand writing data indexes corresponding to the data which is read frequently in each data segment.
It should be noted that the second one corresponds to the DACTS code referring to the data that is often read in each data segmentuAnd determining the corrected byte number of each bit layer, and obtaining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data section.
(3-2-3) determining the corrected bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment, thereby determining a plurality of bit values corresponding to the data which is read frequently in each data segment.
In the present embodiment, the second data corresponding to the DACTS code of the data which is read frequently in each data segment is determineduModified bit values of individual bit layers, for example, are determined from the respective dataA second one corresponding to DACTS coding of data frequently read in a segmentuThe corrected byte number of each bit layer determines the second byte number corresponding to the DACTS code of the data which is read frequently in each data segmentuThe bit value of each bit layer is calculated by the following formula:
Figure DEST_PATH_IMAGE065
wherein the content of the first and second substances,
Figure 896062DEST_PATH_IMAGE062
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,
Figure DEST_PATH_IMAGE066
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified bit values of the individual bit layers,
Figure DEST_PATH_IMAGE067
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe number of bit blocks corresponding to the modified bit values of the individual bit layers.
It should be noted that the second one corresponds to the DACTS code referring to the data that is often read in each data segmentuAnd determining the corrected bit number value of each bit layer, wherein the corrected bit number value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment can be obtained.
(4) And performing compression coding processing on each data segment corresponding to the engineering project time sequence data to be processed according to the binary codes corresponding to the data in each data segment, the single bit values corresponding to the data which is not frequently read in each data segment and the plurality of bit values corresponding to the data which is frequently read, so as to obtain the engineering project time sequence data to be processed after the compression coding.
In this embodiment, the binary codes corresponding to the infrequently read data in each data segment are compressed and encoded by using the bit values corresponding to the infrequently read data in each data segment, and the binary codes corresponding to the frequently read data in each data segment are compressed and encoded by using a plurality of bit values corresponding to the frequently read data, so that the time series data of the engineering project to be processed after being compressed and encoded is obtained, and the data required by an implementer can be conveniently and quickly and accurately extracted subsequently, which is beneficial to assisting in the calculation of the building economic account.
The method determines the single bit value corresponding to the data which is not read frequently and the plurality of bit values corresponding to the data which is read frequently in each data section corresponding to the engineering project time sequence data to be processed through the electronic digital data processing technology, further determines the engineering project time sequence data to be processed after the compression coding, and effectively improves the compression ratio of the engineering project time sequence data to be processed and the decoding efficiency of the compressed data.
The above-mentioned embodiments are only used to illustrate the technical solutions of the present application, and not to limit the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications and substitutions do not substantially depart from the spirit and scope of the embodiments of the present application and are intended to be included within the scope of the present application.

Claims (10)

1. A calculation assistance method for construction engineering economy is characterized by comprising the following steps:
acquiring engineering project time sequence data to be processed, determining each data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, and further determining binary codes corresponding to each data in each data segment;
acquiring infrequently read data in each data segment corresponding to engineering project time sequence data to be processed, determining a bit value range corresponding to each data in each data segment according to binary codes corresponding to each data in each data segment, and further determining a single bit value corresponding to the infrequently read data in each data segment;
acquiring data which is read frequently in each data segment corresponding to engineering project time sequence data to be processed, a read data index and a write data index corresponding to the data which is read frequently, and determining a plurality of bit values corresponding to the data which is read frequently in each data segment according to a single bit value corresponding to the data which is not read frequently in each data segment, a binary code corresponding to the data which is read frequently in each data segment, a read data index and a write data index corresponding to the data which is read frequently in each data segment;
and performing compression coding processing on each data segment corresponding to the engineering project time sequence data to be processed according to the binary codes corresponding to the data in each data segment, the single bit values corresponding to the data which is not frequently read in each data segment and the plurality of bit values corresponding to the data which is frequently read, so as to obtain the engineering project time sequence data to be processed after the compression coding.
2. A computational assistance method for construction economics according to claim 1 wherein the step of further determining the individual bit values corresponding to infrequently read data within each data segment comprises:
determining a plurality of byte numbers corresponding to each data segment under different bit values according to a binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed and a bit value range corresponding to each data in each data segment, and further determining a minimum byte number corresponding to each data segment;
determining a bit value corresponding to the minimum byte number corresponding to each data segment according to the minimum byte number corresponding to each data segment corresponding to the engineering project time sequence data to be processed, and taking the bit value corresponding to the minimum byte number corresponding to each data segment as a single bit value corresponding to the data which is not frequently read in each data segment.
3. A calculation aid method for construction engineering economics according to claim 2 wherein the calculation formula for determining the number of bytes corresponding to each data segment at different bit values is:
Figure DEST_PATH_IMAGE002
Figure DEST_PATH_IMAGE004
wherein the content of the first and second substances,
Figure DEST_PATH_IMAGE005
the number of bytes corresponding to each data segment corresponding to the engineering project time sequence data to be processed under any bit value,
Figure DEST_PATH_IMAGE006
a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqAny bit value of the bit layer is,
Figure DEST_PATH_IMAGE007
a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqThe number of bit blocks corresponding to an arbitrary bit value of the bit layer,Qthe number of bit layers corresponding to any bit value of the DACTS codes of each data segment corresponding to the engineering project time sequence data to be processed,Bin order to preset the minimum bit value,
Figure DEST_PATH_IMAGE008
the maximum binary code length corresponding to each data segment corresponding to the engineering project time sequence data to be processed,
Figure DEST_PATH_IMAGE009
in order to find the function of the maximum value,xmapping of time series data for a project to be processedThe length of the binary code corresponding to each data in each data segment.
4. The method of claim 1, wherein the step of determining a plurality of bit values corresponding to data that is frequently read in each data segment comprises:
determining the number of bytes when each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment under the single bit value corresponding to the data which is read infrequently and the number of bytes when each bit layer corresponding to the single bit value corresponding to the data which is read infrequently in each data segment is not complemented;
determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the byte number when each bit layer is not complemented and the byte number when the byte number is complemented, the read data index and the write data index corresponding to the data which is read frequently, which correspond to the single bit value corresponding to the data which is not read frequently of the DACTS code of the data which is read frequently in each data segment;
and determining the corrected bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment, thereby determining a plurality of bit values corresponding to the data which is read frequently in each data segment.
5. The calculation assisting method for construction engineering economy as claimed in claim 4, wherein the calculation formula for determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is frequently read in each data segment is:
Figure DEST_PATH_IMAGE011
wherein the content of the first and second substances,
Figure DEST_PATH_IMAGE012
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,
Figure DEST_PATH_IMAGE013
DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is zero-padded,
Figure DEST_PATH_IMAGE014
DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is not zero-padded,Rthe data reading indexes corresponding to the data which is read frequently in each data segment,Wwriting data indexes corresponding to data which are read frequently in each data segment;
the calculation formula for determining the modified bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment is as follows:
Figure DEST_PATH_IMAGE016
wherein the content of the first and second substances,
Figure 462703DEST_PATH_IMAGE012
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,
Figure DEST_PATH_IMAGE017
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified bit values of the individual bit layers,
Figure DEST_PATH_IMAGE018
a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe number of bit blocks corresponding to the modified bit values of the individual bit layers.
6. The method of claim 1, wherein the step of determining the data segments corresponding to the time series data of the construction project comprises:
determining each initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed;
determining modification necessity indexes of all initial data segments according to all initial data segments corresponding to the engineering project time sequence data to be processed;
and modifying each initial data segment corresponding to the engineering project time sequence data to be processed according to the modification necessity index of each initial data segment, thereby determining each data segment corresponding to the engineering project time sequence data to be processed.
7. The calculation assistance method for construction engineering economy as claimed in claim 6, wherein the step of determining each initial data segment corresponding to the engineering project time series data to be processed comprises:
determining a first initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, wherein the determining step of the first initial data segment is as follows: determining data segment balance corresponding to a preliminary initial data segment composed of the first 2 data according to the first 2 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 2 data is not less than a preset data segment balance threshold value, determining data segment balance corresponding to the preliminary initial data segment composed of the first 3 data according to the first 3 data in the engineering project time sequence data to be processed, and if the data segment balance corresponding to the preliminary initial data segment composed of the first 3 data is not less than a preset data segment balance threshold value, determining data segment balance corresponding to the preliminary initial data segment composed of the first 3 data according to the first 4 data in the engineering project time sequence data to be processedThe data segment balance corresponding to the preliminary initial data segment formed by the first 4 data is continuously repeated until the front part in the engineering project time sequence data to be processedmThe data segment balance corresponding to the preliminary initial data segment composed of the data is smaller than the preset data segment balance threshold value, and the front part in the engineering project time sequence data to be processed is processed
Figure DEST_PATH_IMAGE019
A preliminary initial data segment composed of the data is used as a first initial data segment corresponding to the engineering project time sequence data to be processed;
and according to the engineering project time sequence data to be processed after the first initial data segment, obtaining a second initial data segment corresponding to the engineering project time sequence data to be processed by referring to the determining step of the first initial data segment, according to the engineering project time sequence data to be processed after the second initial data segment, obtaining a third initial data segment corresponding to the engineering project time sequence data to be processed by referring to the determining step of the first initial data segment, and continuously repeating the step of determining the initial data segments until the engineering project time sequence data to be processed all have the initial data segments corresponding to the engineering project time sequence data to be processed.
8. A computational assistance method for construction engineering economics according to claim 7 wherein the pre-determination in the project time series data to be processed is mademThe calculation formula of the data segment equilibrium corresponding to the preliminary initial data segment composed of the data is as follows:
Figure DEST_PATH_IMAGE021
wherein the content of the first and second substances,Efor pre-in-time sequence data of engineering project to be processedmThe data segment balance corresponding to the preliminary initial data segment formed by the data,
Figure DEST_PATH_IMAGE022
for pre-in-time sequence data of engineering project to be processedmIn a preliminary initial data section consisting of individual datajData, β is a hyperparameter.
9. A computational assistance method for construction economy as claimed in claim 6 wherein the step of determining a modification necessity indicator for each initial data segment comprises:
determining the data average value of each initial data segment corresponding to the engineering project time sequence data to be processed according to each initial data segment corresponding to the engineering project time sequence data to be processed;
calculating the difference value between each data in each initial data segment corresponding to the engineering project time sequence data to be processed and the data average value of each initial data segment, thereby obtaining the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed;
and determining the modification necessity index of each initial data segment according to the binary code of each initial data segment corresponding to the engineering project time sequence data to be processed and the binary code of the differential data segment of each initial data segment.
10. The calculation assistance method for construction engineering economy as claimed in claim 6, wherein the step of modifying each initial data segment corresponding to the time series data of the engineering project to be processed comprises:
and if the modification necessity index of any initial data segment corresponding to the engineering project time sequence data to be processed is greater than the preset modification necessity index threshold, modifying the initial data segment corresponding to the engineering project time sequence data to be processed, and taking the differential data segment of the initial data segment as the data segment corresponding to the engineering project time sequence data to be processed.
CN202210754358.8A 2022-06-30 2022-06-30 Computing auxiliary method for building engineering economy Active CN114817831B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210754358.8A CN114817831B (en) 2022-06-30 2022-06-30 Computing auxiliary method for building engineering economy

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210754358.8A CN114817831B (en) 2022-06-30 2022-06-30 Computing auxiliary method for building engineering economy

Publications (2)

Publication Number Publication Date
CN114817831A true CN114817831A (en) 2022-07-29
CN114817831B CN114817831B (en) 2022-09-23

Family

ID=82523339

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210754358.8A Active CN114817831B (en) 2022-06-30 2022-06-30 Computing auxiliary method for building engineering economy

Country Status (1)

Country Link
CN (1) CN114817831B (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6128413A (en) * 1997-12-04 2000-10-03 Agfa Corporation Method and apparatus for data compression
JP2012134858A (en) * 2010-12-22 2012-07-12 Mitsubishi Electric Corp Data compression apparatus, data compression method and data compression program
CN103688539A (en) * 2011-09-19 2014-03-26 松下电器产业株式会社 Image encoding method, image decoding method, image encoding device, image decoding device, and image encoding-decoding device
US20170155404A1 (en) * 2014-06-27 2017-06-01 Gurulogic Microsystems Oy Encoder and decoder
CN110545106A (en) * 2019-08-06 2019-12-06 清华大学 Method and device for coding time series data
CN111201519A (en) * 2017-08-11 2020-05-26 Altr解决方案公司 Immutable data storage for low latency reading and writing of large data sets
CN113094346A (en) * 2021-03-10 2021-07-09 北京四达时代软件技术股份有限公司 Big data coding and decoding method and device based on time sequence
CN113746485A (en) * 2021-08-16 2021-12-03 深圳数联天下智能科技有限公司 Data compression method, electronic equipment and storage medium
CN113868206A (en) * 2021-10-08 2021-12-31 八十一赞科技发展(重庆)有限公司 Data compression method, decompression method, device and storage medium
CN114124106A (en) * 2022-01-28 2022-03-01 苏州浪潮智能科技有限公司 LZ4 decompression method, system, storage medium and equipment
CN114665884A (en) * 2022-03-29 2022-06-24 北京诺司时空科技有限公司 Time sequence database self-adaptive lossy compression method, system and medium

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6128413A (en) * 1997-12-04 2000-10-03 Agfa Corporation Method and apparatus for data compression
JP2012134858A (en) * 2010-12-22 2012-07-12 Mitsubishi Electric Corp Data compression apparatus, data compression method and data compression program
CN103688539A (en) * 2011-09-19 2014-03-26 松下电器产业株式会社 Image encoding method, image decoding method, image encoding device, image decoding device, and image encoding-decoding device
US20170155404A1 (en) * 2014-06-27 2017-06-01 Gurulogic Microsystems Oy Encoder and decoder
CN111201519A (en) * 2017-08-11 2020-05-26 Altr解决方案公司 Immutable data storage for low latency reading and writing of large data sets
CN110545106A (en) * 2019-08-06 2019-12-06 清华大学 Method and device for coding time series data
CN113094346A (en) * 2021-03-10 2021-07-09 北京四达时代软件技术股份有限公司 Big data coding and decoding method and device based on time sequence
CN113746485A (en) * 2021-08-16 2021-12-03 深圳数联天下智能科技有限公司 Data compression method, electronic equipment and storage medium
CN113868206A (en) * 2021-10-08 2021-12-31 八十一赞科技发展(重庆)有限公司 Data compression method, decompression method, device and storage medium
CN114124106A (en) * 2022-01-28 2022-03-01 苏州浪潮智能科技有限公司 LZ4 decompression method, system, storage medium and equipment
CN114665884A (en) * 2022-03-29 2022-06-24 北京诺司时空科技有限公司 Time sequence database self-adaptive lossy compression method, system and medium

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ADRIÁN GÓMEZ-BRANDÓN 等: "Lossless compression of industrial time series with direct access", 《COMPUTERS IN INDUSTRY》 *
邹同春: "海量时序数据的压缩存储方法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
韩骅宇: "时序数据的高效存储与检索", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Also Published As

Publication number Publication date
CN114817831B (en) 2022-09-23

Similar Documents

Publication Publication Date Title
CN116303374B (en) Multi-dimensional report data optimization compression method based on SQL database
CN112953550B (en) Data compression method, electronic device and storage medium
CN116016606B (en) Sewage treatment operation and maintenance data efficient management system based on intelligent cloud
CN108416427A (en) Convolution kernel accumulates data flow, compressed encoding and deep learning algorithm
WO2012033498A1 (en) Systems and methods for data compression
CN109871362A (en) A kind of data compression method towards streaming time series data
CN103995887A (en) Bitmap index compressing method and bitmap index decompressing method
CN106549673A (en) A kind of data compression method and device
CN115269526B (en) Method and system for processing semiconductor production data
CN114782148B (en) Agricultural product purchase management platform and business data compression method thereof
CN117743870B (en) Water conservancy data management system based on big data
CN116361256A (en) Data synchronization method and system based on log analysis
CN114442954B (en) LZ4 coding compression device
CN114268323B (en) Data compression coding method, device and time sequence database supporting line memory
CN114817831B (en) Computing auxiliary method for building engineering economy
CN117040542B (en) Intelligent comprehensive distribution box energy consumption data processing method
US20210125063A1 (en) Apparatus and method for generating binary neural network
CN111061428B (en) Data compression method and device
CN105515586A (en) Rapid delta compression method
CN115882867B (en) Data compression storage method based on big data
CN110310709B (en) Reference sequence-based gene compression method
CN108416425A (en) A kind of convolution method and device
CN202931290U (en) Compression hardware system based on GZIP
CN112506876B (en) Lossless compression query method supporting SQL query
CN115913248A (en) Live broadcast software development data intelligent management system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant