CN114817831A - Computing auxiliary method for building engineering economy - Google Patents
Computing auxiliary method for building engineering economy Download PDFInfo
- Publication number
- CN114817831A CN114817831A CN202210754358.8A CN202210754358A CN114817831A CN 114817831 A CN114817831 A CN 114817831A CN 202210754358 A CN202210754358 A CN 202210754358A CN 114817831 A CN114817831 A CN 114817831A
- Authority
- CN
- China
- Prior art keywords
- data
- data segment
- processed
- time sequence
- engineering project
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 36
- 238000004364 calculation method Methods 0.000 claims abstract description 27
- 238000007906 compression Methods 0.000 claims abstract description 25
- 230000006835 compression Effects 0.000 claims abstract description 25
- 238000010276 construction Methods 0.000 claims abstract description 16
- 238000012545 processing Methods 0.000 claims abstract description 15
- 230000004048 modification Effects 0.000 claims description 32
- 238000012986 modification Methods 0.000 claims description 32
- 239000000126 substance Substances 0.000 claims description 14
- 238000013507 mapping Methods 0.000 claims 1
- 238000005516 engineering process Methods 0.000 abstract description 3
- 230000008569 process Effects 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000013144 data compression Methods 0.000 description 3
- 101100380328 Dictyostelium discoideum asns gene Proteins 0.000 description 2
- 241000623377 Terminalia elliptica Species 0.000 description 2
- 101150062095 asnA gene Proteins 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000007667 floating Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000011478 gradient descent method Methods 0.000 description 2
- 101150083667 ataA gene Proteins 0.000 description 1
- 230000006837 decompression Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000002360 explosive Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2474—Sequence data queries, e.g. querying versioned data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/126—Character encoding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/08—Construction
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Computational Mathematics (AREA)
- Economics (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Mathematical Analysis (AREA)
- Pure & Applied Mathematics (AREA)
- Fuzzy Systems (AREA)
- Probability & Statistics with Applications (AREA)
- Algebra (AREA)
- Mathematical Optimization (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The invention relates to the technical field of electronic digital data processing, in particular to a calculation auxiliary method for construction engineering economy, which comprises the following steps: acquiring engineering project time sequence data to be processed, determining each data segment corresponding to the engineering project time sequence data to be processed by utilizing an electronic digital data processing technology, and further determining binary codes corresponding to each data in each data segment; the method comprises the steps of obtaining data which are not read frequently and data which are read frequently in each data section corresponding to engineering project time sequence data to be processed, determining a single bit value corresponding to the data which are not read frequently and a plurality of bit values corresponding to the data which are read frequently according to a read data index and a write data index corresponding to the data which are read frequently in each data section and binary codes corresponding to the data in each data section, and performing compression coding processing on each data section corresponding to the engineering project time sequence data to be processed. The invention improves the reading speed of the compressed engineering project time sequence data.
Description
Technical Field
The invention relates to the technical field of electronic digital data processing, in particular to a calculation auxiliary method for construction engineering economy.
Background
The construction engineering economy is usually accompanied by a large amount of time series data, a large amount of data can be generated along with the advance of an engineering project, such as capital expenditure, profitability expenditure, non-business expenditure, profit allocation expenditure, profit income and other asset expense data, the data volume of the data is large, the storage of the data needs to occupy a large memory, meanwhile, account clearing needs to be carried out when the engineering project is finished, and the efficient and reliable storage and retrieval become the current prominent problems in the face of explosive growth of time series data.
The traditional database provides a solution for data safety, reliability and convenience in query, but in the face of massive time series data, the traditional storage solution is heavy and inefficient. For example, when data is stored, the situation that the system is jammed and occupies too much system resources due to too much data writing amount often occurs; when reading data, the data is often too large to be read, and the reading speed is often slow. With the development of storage compression technology, a NoSQL (Not only SQL, a non-relational database) database storage scheme has appeared, which alleviates the pressure of reading data to some extent, but the storage scheme is still Not mature for compression of massive time series data, and when decoding compressed data, the storage scheme often needs to decode from the beginning, resulting in a slow speed of reading data.
Disclosure of Invention
In order to solve the problem that the conventional compressed storage has low data reading speed, the invention aims to provide a calculation auxiliary method for building engineering economy.
The invention provides a calculation auxiliary method for construction engineering economy, which comprises the following steps:
acquiring engineering project time sequence data to be processed, determining each data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, and further determining binary codes corresponding to each data in each data segment;
acquiring infrequently read data in each data segment corresponding to engineering project time sequence data to be processed, determining a bit value range corresponding to each data in each data segment according to binary codes corresponding to each data in each data segment, and further determining a single bit value corresponding to the infrequently read data in each data segment;
acquiring data which is read frequently in each data segment corresponding to engineering project time sequence data to be processed, a read data index and a write data index corresponding to the data which is read frequently, and determining a plurality of bit values corresponding to the data which is read frequently in each data segment according to a single bit value corresponding to the data which is not read frequently in each data segment, a binary code corresponding to the data which is read frequently in each data segment, a read data index and a write data index corresponding to the data which is read frequently in each data segment;
and performing compression coding processing on each data segment corresponding to the engineering project time sequence data to be processed according to the binary codes corresponding to the data in each data segment, the single bit values corresponding to the data which is not read frequently in each data segment and the multiple bit values corresponding to the data which is read frequently, so as to obtain the engineering project time sequence data to be processed after the compression coding.
Further, the step of determining a single bit value corresponding to the data that is not frequently read in each data segment includes:
determining a plurality of byte numbers corresponding to each data segment under different bit values according to a binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed and a bit value range corresponding to each data in each data segment, and further determining a minimum byte number corresponding to each data segment;
determining a bit value corresponding to the minimum byte number corresponding to each data segment according to the minimum byte number corresponding to each data segment corresponding to the engineering project time sequence data to be processed, and taking the bit value corresponding to the minimum byte number corresponding to each data segment as a single bit value corresponding to the data which is not frequently read in each data segment.
Further, the calculation formula for determining the number of bytes corresponding to each data segment under different bit values is as follows:
wherein the content of the first and second substances,the number of bytes corresponding to each data segment corresponding to the engineering project time sequence data to be processed under any bit value,a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqAny bit value of the bit layer is,a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqThe number of bit blocks corresponding to an arbitrary bit value of the bit layer,Qthe number of bit layers corresponding to any bit value of the DACTS codes of each data segment corresponding to the engineering project time sequence data to be processed,Bin order to preset the minimum bit value,the maximum binary code length corresponding to each data segment corresponding to the engineering project time sequence data to be processed,in order to find the function of the maximum value,xand the code length of the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed is determined.
Further, the step of determining a plurality of bit values corresponding to data that is frequently read in each data segment includes:
determining the number of bytes when each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment under the single bit value corresponding to the data which is read infrequently and the number of bytes when each bit layer corresponding to the single bit value corresponding to the data which is read infrequently in each data segment is not complemented;
determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the byte number when each bit layer is not complemented and the byte number when the byte number is complemented, the read data index and the write data index corresponding to the data which is read frequently, which correspond to the single bit value corresponding to the data which is not read frequently of the DACTS code of the data which is read frequently in each data segment;
and determining the corrected bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment, thereby determining a plurality of bit values corresponding to the data which is read frequently in each data segment.
Further, a calculation formula for determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment is as follows:
wherein the content of the first and second substances,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is zero-padded,DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is not zero padded,Rthe data reading indexes corresponding to the data which is read frequently in each data segment,Wwriting data indexes corresponding to data which are read frequently in each data segment;
the calculation formula for determining the modified bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment is as follows:
wherein the content of the first and second substances,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified bit values of the individual bit layers,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe number of bit blocks corresponding to the modified bit values of the individual bit layers.
Further, the step of determining each data segment corresponding to the engineering project time sequence data to be processed includes:
determining each initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed;
determining modification necessity indexes of all initial data segments according to all initial data segments corresponding to the engineering project time sequence data to be processed;
and modifying each initial data segment corresponding to the engineering project time sequence data to be processed according to the modification necessity index of each initial data segment, thereby determining each data segment corresponding to the engineering project time sequence data to be processed.
Further, the step of determining each initial data segment corresponding to the engineering project time sequence data to be processed includes:
determining a first initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, wherein the determining step of the first initial data segment is as follows: determining data segment balance corresponding to a preliminary initial data segment composed of the first 2 data according to the first 2 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 2 data is not less than a preset data segment balance threshold value, determining data segment balance corresponding to the preliminary initial data segment composed of the first 3 data according to the first 3 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 3 data is not less than the preset data segment balance threshold value, determining data segment balance corresponding to the preliminary initial data segment composed of the first 4 data according to the first 4 data in the engineering project time sequence data to be processed, and repeating the steps until the first data in the engineering project time sequence data to be processedmThe data segment balance corresponding to the preliminary initial data segment composed of the data is smaller than the preset data segment balance threshold value, and the front part in the engineering project time sequence data to be processed is processedA preliminary initial data segment composed of the data is used as a first initial data segment corresponding to the engineering project time sequence data to be processed;
and according to the time sequence data of the engineering project to be processed after the first initial data segment, obtaining a second initial data segment corresponding to the time sequence data of the engineering project to be processed by referring to the determining step of the first initial data segment, according to the time sequence data of the engineering project to be processed after the second initial data segment, obtaining a third initial data segment corresponding to the time sequence data of the engineering project to be processed by referring to the determining step of the first initial data segment, and continuously repeating the step of determining the initial data segments until the time sequence data of the engineering project to be processed all have the initial data segments corresponding to the time sequence data of the engineering project to be processed.
Further, determining the front in the engineering project time sequence data to be processedmThe calculation formula of the data segment equilibrium corresponding to the preliminary initial data segment composed of the data is as follows:
wherein the content of the first and second substances,Efor pre-in-time sequence data of engineering project to be processedmThe data segment balance corresponding to the preliminary initial data segment formed by the data,for pre-in-time sequence data of engineering project to be processedmIn a preliminary initial data section consisting of individual datajData, β is a hyperparameter.
Further, the step of determining the modification necessity indicators for each initial data segment includes:
determining the data average value of each initial data segment corresponding to the engineering project time sequence data to be processed according to each initial data segment corresponding to the engineering project time sequence data to be processed;
calculating the difference value between each data in each initial data segment corresponding to the engineering project time sequence data to be processed and the data average value of each initial data segment, thereby obtaining the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed;
and determining the modification necessity index of each initial data segment according to the binary code of each initial data segment corresponding to the engineering project time sequence data to be processed and the binary code of the differential data segment of each initial data segment.
Further, the step of modifying each initial data segment corresponding to the engineering project time sequence data to be processed includes:
and if the modification necessity index of any initial data segment corresponding to the engineering project time sequence data to be processed is larger than the preset modification necessity index threshold, modifying the initial data segment corresponding to the engineering project time sequence data to be processed, and taking the differential data segment of the initial data segment as the data segment corresponding to the engineering project time sequence data to be processed.
The invention has the following beneficial effects:
according to the method and the device, the data segments corresponding to the engineering project time sequence data to be processed are determined by acquiring the engineering project time sequence data to be processed, and the compression rate of the engineering project time sequence data to be processed is improved. And determining a single bit value corresponding to the data which is not read frequently and a plurality of bit values corresponding to the data which is read frequently in each data segment through each data segment corresponding to the engineering project time sequence data to be processed. And obtaining the compressed and encoded engineering project time sequence data to be processed according to each data segment corresponding to the engineering project time sequence data to be processed, a single bit value corresponding to the data which is not read frequently in each data segment and a plurality of bit values corresponding to the data which is read frequently.
The invention utilizes the single bit value corresponding to the infrequently read data to perform compression storage processing on the infrequently read data in the engineering project time sequence data to be processed, and utilizes the multiple bit values corresponding to the frequently read data to perform compression storage processing on the frequently read data in the engineering project time sequence data to be processed.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions and advantages of the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
Fig. 1 is a flowchart of a calculation assistance method for construction economy in an embodiment of the present invention.
Detailed Description
To further explain the technical means and effects of the present invention adopted to achieve the predetermined objects, the following detailed description of the embodiments, structures, features and effects of the technical solutions according to the present invention will be given with reference to the accompanying drawings and preferred embodiments. In the following description, different references to "one embodiment" or "another embodiment" do not necessarily refer to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
In this embodiment, by acquiring the engineering project time series data to be processed, determining the bit values and the number of bit layers corresponding to different types of data in each data segment corresponding to the engineering project time series data to be processed, and compressing and storing binary codes corresponding to each data in each data segment corresponding to the engineering project time series data to be processed by using the bit values and the number of bit layers corresponding to the different types of data in each data segment, the purpose of quickly and accurately acquiring data is achieved, and calculation and verification of accounts are assisted to be completed. The embodiment provides a calculation assisting method for building engineering economy, which comprises the following steps as shown in figure 1:
(1) the method comprises the steps of obtaining engineering project time sequence data to be processed, determining each data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, and further determining binary codes corresponding to the data in each data segment.
In the advancing process of the construction project, the project time series data to be processed are obtained, the project time series data to be processed comprise various asset cost data such as capital expenditure, profitable expenditure, non-business expenditure, profit allocation expenditure and profit corresponding to each timestamp, and the project time series data to be processed are large in data volume.
After the engineering project time sequence data to be processed is obtained, determining each data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, wherein the steps comprise:
and (1-1) determining each initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed.
It should be noted that, for a single piece of data in the engineering project time series data to be processed, if the numerical value of the single piece of data is large, a large number of index values are required when compressing the binary code corresponding to the single piece of data, resulting in a low data compression rate. In order to improve the data compression rate, this embodiment divides the data with similar storage time into each initial data segment, that is, determines each initial data segment corresponding to the engineering project time series data to be processed, and the steps include:
(1-1-1) determining a first initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, wherein the determining step of the first initial data segment is as follows: determining data segment balance corresponding to a preliminary initial data segment composed of the first 2 data according to the first 2 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 2 data is not less than a preset data segment balance threshold, and in this embodiment, setting the preset data segment balance threshold to 0.7, determining data segment balance corresponding to a preliminary initial data segment composed of the first 3 data according to the first 3 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 3 data is not less than the preset data segment balance threshold 0.7, determining data segment balance corresponding to a preliminary initial data segment composed of the first 4 data according to the first 4 data in the engineering project time sequence data to be processed, and continuously repeating the above stepsStep, until the front in the engineering project time sequence data to be processedmThe data segment balance corresponding to the preliminary initial data segment composed of the data is smaller than the preset data segment balance threshold value, and the front part in the engineering project time sequence data to be processed is processedA preliminary initial data segment composed of the data is used as a first initial data segment corresponding to the engineering project time sequence data to be processed, and the first initial data segment comprises the front part in the engineering project time sequence data to be processedAnd (4) data.
A preliminary initial data segment consisting of the first 2 data, a preliminary initial data segment consisting of the first 3 data, a preliminary initial data segment consisting of the first 4 data and a preliminary initial data segment of the first 2 data in the engineering project time sequence data to be processedmThe calculation process of the data segment balance corresponding to the preliminary initial data segment composed of the data is kept consistent, and the embodiment is used for determining the front part in the time sequence data of the engineering project to be processedmData segment equalization corresponding to preliminary initial data segment of individual data components is taken as an examplemThe calculation formula of the data segment equilibrium corresponding to the preliminary initial data segment composed of the data is as follows:
wherein the content of the first and second substances,Efor pre-in-time sequence data of engineering project to be processedmThe data segment balance corresponding to the preliminary initial data segment formed by the data,for the front in the time series data of the project to be processedmIn a preliminary initial data section consisting of individual datajData, beta is a hyper-parameter, empirical values are taken。
It should be noted that, the front in the time series data of the project to be processedmThe greater the data segment balance corresponding to the preliminary initial data segment composed of the data is, the higher the similarity of the data in the preliminary initial data segment is, that is, the smaller the numerical difference between the data in the preliminary initial data segment is, the greater the possibility that the preliminary initial data segment is the initial data segment corresponding to the time series data of the engineering project to be processed is.
(1-1-2) according to the engineering project time sequence data to be processed after the first initial data segment, referring to the determining step of the first initial data segment to obtain a second initial data segment corresponding to the engineering project time sequence data to be processed, according to the engineering project time sequence data to be processed after the second initial data segment, referring to the determining step of the first initial data segment to obtain a third initial data segment corresponding to the engineering project time sequence data to be processed, and continuously repeating the step of determining the initial data segments until the engineering project time sequence data to be processed all have the initial data segments corresponding to the engineering project time sequence data to be processed.
In this embodiment, the data that is not attributed in the engineering project time series data to be processed, that is, the previous data in the engineering project time series data to be processed is determined through each data in the first initial data segment corresponding to the engineering project time series data to be processed obtained in step (1-1-1)The data after the data is according to the front part in the time sequence data of the engineering project to be processedAnd the data after the data is obtained by referring to the determination process of the first initial data segment corresponding to the engineering project time sequence data to be processed to obtain a second initial data segment corresponding to the engineering project time sequence data to be processed, and continuously repeating the step of determining the initial data segments until the engineering project time sequence data to be processed all have the corresponding initial data segments.
And (1-2) determining the modification necessity index of each initial data segment according to each initial data segment corresponding to the engineering project time sequence data to be processed.
It should be noted that the modification necessity index of each initial data segment calculated in this embodiment is convenient for modifying data in some initial data segments corresponding to the engineering project time series data to be subsequently processed, and the step of determining the modification necessity index of each initial data segment includes:
and (1-2-1) determining the data average value of each initial data segment corresponding to the engineering project time sequence data to be processed according to each initial data segment corresponding to the engineering project time sequence data to be processed.
In the embodiment, the first time sequence data corresponding to the engineering project time sequence data to be processed is determinedaTaking the average value of the initial data segment as an example, and according to the first time sequence data corresponding to the engineering project time sequence data to be processedaEach data in the initial data segment, theaAn initial data segment is marked asM,={Is calculated to beaThe average value of the data of the initial data segments is calculated by the formula:
wherein the content of the first and second substances,the first one corresponding to the engineering project time sequence data to be processedaThe average value of the data of the initial data segments,mthe first one corresponding to the engineering project time sequence data to be processedaThe number of data of each initial data segment,xthe first one corresponding to the engineering project time sequence data to be processedaEach data of the initial data segment.
It should be noted that reference to the second corresponding to the engineering project time series data to be processedaInitial number ofAnd determining the data average value of the data segments, and determining the data average value of each initial data segment corresponding to the engineering project time sequence data to be processed according to each data in each initial data segment corresponding to the engineering project time sequence data to be processed.
And (1-2-2) calculating the difference value between each data in each initial data segment corresponding to the engineering project time sequence data to be processed and the data average value of each initial data segment, so as to obtain the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed.
In the embodiment, the first time sequence data corresponding to the engineering project time sequence data to be processed is determinedaTaking the differential data segment of the initial data segment as an example, according to the second corresponding to the engineering project time sequence data to be processedaEach data in the initial data segment and the second data obtained in the step (1-2-1)aThe average value of the data corresponding to the initial data segment respectively calculates the first time sequence data of the engineering project to be processedaEach data in the initial data segment and the second data segmentaThe difference value of the average values of the data corresponding to the initial data segments is calculated by the following formula:={taking each difference as the secondaA differential data segment of the initial data segments.
It should be noted that reference to the second corresponding to the engineering project time series data to be processedaAnd determining the differential data segment of each initial data segment to obtain the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed. At this time, the differential data segment of each initial data segment can be represented by a shorter encoding code length by increasing the index value, which helps to improve the compression rate of the project engineering time series data to be processed.
For example, the firstaAn initial data segment={}, then the firstaData mean of initial data segmentOf 1 ataA differential data segment of the initial data segment={At this time, the secondaThe binary code length of each data in the initial data segment is 7 bits, and the second oneaThe maximum code length of binary coding of each data in the differential data segment of the initial data segment is only 2 bits.
And (1-2-3) determining the modification necessity index of each initial data segment according to the binary code of each initial data segment corresponding to the engineering project time sequence data to be processed and the binary code of the differential data segment of each initial data segment.
It should be noted that, the code length of each data in the differential data segment of each initial data segment corresponding to the engineering project time series data to be processed may be reduced to different degrees, but an index needs to be added when each data in the differential data segment of each initial data segment is processed, the index is added to facilitate data differentiation, when the number of data in the differential data segment of the initial data segment is too small, the code length of each data reduced in the differential data segment will be smaller than the length of the index added to each data in the differential data segment, and at this time, the necessity of modifying the initial data segment into the differential data segment will be low, that is, the index of necessity of modifying the initial data segment is low.
In this embodiment, according to the binary code of each data in each initial data segment corresponding to the engineering project time series data to be processed and the binary code of each data in the differential data segment of each initial data segment, the code length of each initial data segment and the code length of the differential data segment of each initial data segment are obtained, and the modification necessity index of each initial data segment is obtained by calculating the difference between the code length of each initial data segment and the code length of the differential data segment of each initial data segment, in which the calculated difference is used as the modification necessity index of the initial data segment, and the calculation formula of the modification necessity index is as follows:
wherein the content of the first and second substances,afor the modification necessity index of each initial data segment corresponding to the engineering project time sequence data to be processed,for the code length of each initial data segment corresponding to the engineering project time sequence data to be processed,and the code length of the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed is determined.
It should be noted that, if the modification necessity index of a certain initial data segment corresponding to the engineering project time sequence data to be processed is providedaIf the data segment is positive, the differential data segment of the initial data segment replaces the initial data segment to become a data segment corresponding to the time sequence data of the engineering project to be processed, and if the data segment is the modification necessity index of a certain initial data segment corresponding to the time sequence data of the engineering project to be processedaAnd if the data segment is negative or 0, discarding the differential data segment of the initial data segment, and taking the initial data segment as the data segment corresponding to the engineering project time sequence data to be processed.
And (1-3) modifying each initial data segment corresponding to the engineering project time sequence data to be processed according to the modification necessity index of each initial data segment, thereby determining each data segment corresponding to the engineering project time sequence data to be processed.
In this embodiment, each initial data segment corresponding to the engineering project time sequence data to be processed is modified according to the modification necessity index of each initial data segment obtained in step (1-2-3), and if the modification necessity index of any one initial data segment corresponding to the engineering project time sequence data to be processed is greater than the preset modification necessity index threshold, the preset modification necessity index threshold is set to 0, and the differential data segment of the initial data segment is used as the data segment corresponding to the engineering project time sequence data to be processed.
The modification necessity index of each initial data segment corresponding to the engineering project time sequence data to be processed is compared with a preset modification necessity index threshold value 0, so that modification of some initial data segments corresponding to the engineering project time sequence data to be processed is realized, namely, a differential data segment of the initial data segment which is larger than the preset modification necessity index threshold value 0 replaces the corresponding initial data segment to be a data segment corresponding to the engineering project time sequence data to be processed, and each data segment corresponding to the engineering project time sequence data to be processed is determined.
At this time, each data segment corresponding to the engineering project time sequence data to be processed is obtained in this embodiment, many floating points generally exist in each data segment corresponding to the engineering project time sequence data to be processed, and the floating points are important data types of the engineering project time sequence data.
For example, data 25.125 in each data segment corresponding to the engineering project time series data to be processed converts data 25.125 into a binary coding format by using a conversion format of a scientific counting method, and the conversion process is as follows:
25.125(D) = 11001.001(B) = 1.1001001*2^4(B)
it should be noted that D represents decimal and B represents binary in the process of converting the data 25.125 into binary code format. The conversion format of the scientific counting method is the prior art, that is, the process of acquiring the binary code corresponding to the data is the prior art, which is not in the protection scope of the present invention, and the detailed description is not repeated here.
(2) The method comprises the steps of obtaining data which are not read frequently in each data segment corresponding to engineering project time sequence data to be processed, determining a bit value range corresponding to each data in each data segment according to binary codes corresponding to each data in each data segment, and further determining a single bit value corresponding to the data which are not read frequently in each data segment.
Firstly, it should be noted that, when time series data is compressed and encoded, the commonly used compression encoding mode is fixed length encoding and variable length encoding, the fixed length encoding takes the maximum encoding bit number in the current time series data as the reference, the encoding bit numbers of all the time series data are unified into the maximum encoding bit number, the compression encoding mode can cause the waste of storage space, and the data compression ratio is low; variable length coding is based on the minimum number of coding bits in current time series data, and since the coding length of each time series data is variable, the position of target data cannot be obtained during decoding, decoding must be performed sequentially from the first time series data, and decoding efficiency and data reading speed are low. In order to simultaneously take Compression rate and decoding efficiency into account, a DACTS (Direct Access Compression of time series) encoding algorithm is proposed, but the encoding algorithm relies on artificially set parameters, which are bit values, and the size of the bit values affects the Compression rate and decoding efficiency of time series data. In the embodiment, the bit value is obtained through data information self-adaptation, and is adjusted, so that the bit value is suitable for auxiliary calculation of the engineering project time sequence data to be processed.
The traditional DACTS coding algorithm for artificially setting parameter bit values operates in the following mode:
the coding mode during coding is as follows: given time series data sequence={Carrying out binary coding processing on the time sequence data sequence to obtain binary coding of the time sequence data sequence, and determining a bit value according to human experiencebThe binary code of the time sequence data sequence is divided into bit blocks with the size of the bit blockThe high order of each bit block is an identifier which indicates whether the bit block is data or notThe bit block slicing is performed from the last bit to the first bit of the binary code corresponding to the data.
For example, given a time series of data={Value of bitbTo 2, the encoding process is shown in table 1 below:
TABLE 1
Serial number | Binary coding | DACTS | |
1 | 4 | 100 | 001 100 |
2 | 2 | 10 | 010 |
3 | 10 | 1010 | 010 110 |
4 | 1 | 1 | 001 |
5 | 21 | 10101 | 001 101 101 |
6 | 5 | 101 | 001 101 |
7 | 19 | 10011 | 001 100 111 |
In Table 1, the bit value is 2, and the timing sequence dataData 4 in the sequence is DACTS encoded, the binary encoding of data 4 is 100, and binary encoding 100 is divided into 1 and 00. The slicing sequence of the sliced bit block is from back to front, that is, 2 bits are taken from 0 in the binary code 100 to front, when the last code in the binary code is less than 2 bits, 0 is complemented, and after the binary code 100 of the data 4 is complemented by zero, 0100 is obtained. At this time, when looking from back to front, the binary code segment where the binary code 0100, 00 after zero padding is located is preceded by the binary code segment 01, so the identifier of the binary code segment 00 is 1, the DACTS coded bit block of the binary code segment is 100, and the binary code segment where 01 is located is not preceded by the binary code segment, so the identifier of the binary code segment 01 is 0, and the DACTS coded bit block of the binary code segment is 001, at this time, the DACTS code of data 4 in the time series data sequence is obtained. Similarly, a DACTS code of each data in the time series data sequence is obtained, that is, the time series data sequence is obtained={DACTS encoding of (1), the encoding results are shown in the following Table 2:
TABLE 2
In the above Table 2Is shown asnA layer of one bit or more bits,is shown asnA binary code segment corresponding to a time series data sequence in a bit layer,is shown asnAnd identifiers of binary coding segments corresponding to time sequence data sequences in the bit layers.
The decoding mode during decoding is as follows: for example, the 5 th data in Table 2 is decoded, the 1 st bit layer in Table 2Find the binary code segment 01 corresponding to the 5 th data and the identifier corresponding to the binary code segment 01Identification ofA value of 1 indicates that there is a binary segment before the binary segment 01, because of the 1 st bit layer in table 2Corresponding identifier of middle and first 5 data binary coding segmentsOf 1, there are 3, the 2 nd bit layer in Table 2Continuously searching for binary code segment 01 corresponding to the 3 rd data and identifier corresponding to binary code segment 01Identification ofTo be 1, it is indicated that there is a binary segment before the binary segment 01, because of the 3 rd bit layer in table 2Corresponding identifier of middle and first 3 data binary coding segments1 for 1, bit layer 3 in Table 2Continuously searching for the binary code segment 01 corresponding to the 1 st data and the identifier corresponding to the binary code segment 01Identification ofAt this point, since the decoding is finished, the DACTS code corresponding to the 5 th data in table 2 is 001101101, the DACTS code corresponding to the 5 th data is converted into the binary code 10101, and the decoding is successful to obtain the data 21.
And (2-1) acquiring data which are not read frequently in each data segment corresponding to the engineering project time sequence data to be processed, and determining a bit numerical range corresponding to each data in each data segment according to a binary code corresponding to each data in each data segment.
In this embodiment, according to each data segment corresponding to the engineering project time series data to be processed and the read record of each data in each data segment, the data that is not frequently read in each data segment can be obtained. According to the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed, namely according to the code length corresponding to each data in each data segment, obtaining the value range of the bit value, wherein the value range of the bit value isWherein, in the step (A),bfor the bit value corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed,Bin order to preset the minimum bit value,and the maximum code length corresponding to each data segment corresponding to the engineering project time sequence data to be processed is obtained. According to each bit value in the value range, the number of bit layers and the number of bit blocks corresponding to each bit value can be obtained, if a certain bit valueThe larger the bit value is, the smaller the number of bit layers and the number of bit blocks corresponding to the bit value are.
It should be noted that, by determining each data segment corresponding to the engineering project time series data to be processed, the compression rate of the engineering project time series data to be processed is effectively improved, and at this time, bit values corresponding to different types of data in each data segment need to be determined. The conventional bit values are determined according to human experience, and different bit values need to be selected for different engineering project time sequence data, so that the universal effectiveness and robustness of the artificially determined bit values are poor, so that the bit values corresponding to different types of data in each data segment are determined according to the data distribution condition of the engineering project time sequence data to be processed, and the different types of data are divided into: data that is not read frequently and data that is read frequently.
(2-2) determining a single bit value corresponding to the data which is not frequently read in each data segment according to the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed and the bit value range corresponding to each data in each data segment, wherein the steps comprise:
(2-2-1) determining a plurality of byte numbers corresponding to each data segment under different bit values according to the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed and the bit value range corresponding to each data in each data segment, and further determining the minimum byte number corresponding to each data segment.
According to binary codes corresponding to all data in all data sections corresponding to engineering project time sequence data to be processed, bit values in a bit value range, bit layer numbers corresponding to the bit values and bit block number bytes corresponding to the bit values, a gradient descent method is adopted for solving, the number of bytes corresponding to all the data sections corresponding to all the bit values under different bit values is calculated, namely each bit value has the number of bytes corresponding to the bit value, all the data sections corresponding to the engineering project time sequence data to be processed have a plurality of bytes, and a calculation formula for determining the number of bytes corresponding to each data section under different bit values is as follows:
wherein the content of the first and second substances,the number of bytes corresponding to each data segment corresponding to the engineering project time sequence data to be processed under any bit value,a second code corresponding to DACTS codes of all data segments corresponding to the time sequence data of the engineering project to be processedqAny bit value of the bit layer is,a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqThe number of bit blocks corresponding to an arbitrary bit value of the bit layer,Qthe number of bit layers corresponding to any bit value of the DACTS codes of each data segment corresponding to the engineering project time sequence data to be processed,Bin order to preset the minimum bit value,the maximum binary code length corresponding to each data segment corresponding to the engineering project time sequence data to be processed,in order to find the function of the maximum value,xand the code length of the binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed is determined.
It should be noted that, for the bit value of each bit layer corresponding to the DACTS code of each data segment corresponding to the engineering project time series data to be processedbBit values of different bit layersbBit values of the same bit layer may differbMust be consistent, usually in terms of bit valuesbIn the range of 10 or less, the most common bit valuesbMay be equal to 2, 3 or 4. In addition, it should be noted that the process of solving by the gradient descent method is prior art and is not within the scope of the present invention, and the details are not described herein.
In this embodiment, after determining the number of bytes corresponding to each data segment, the number of bytes corresponding to each data segment under different bit values is arranged according to a certain arrangement order, and the minimum number of bytes is selected from the number of bytes corresponding to each arranged data segment under different bit values, so as to obtain the minimum number of bytes corresponding to each data segment.
(2-2-2) determining a bit value corresponding to the minimum byte number corresponding to each data segment according to the minimum byte number corresponding to each data segment corresponding to the engineering project time sequence data to be processed, and taking the bit value corresponding to the minimum byte number corresponding to each data segment as a single bit value corresponding to the data which is not read frequently in each data segment.
It should be noted that, according to the minimum byte number corresponding to each data segment obtained in the step (2-2-1), a bit value corresponding to the minimum byte number corresponding to each data segment is determined, and the bit value corresponding to the minimum byte number corresponding to each data segment only considers the compression ratio, at this time, the obtained compression result is not optimal, because a large number of read-write operations often exist in the engineering project time series data to be processed, this embodiment performs different compression and storage operations on different types of data in each data segment, so as to achieve both the compression ratio and the reading efficiency, and the reading efficiency is the decompression efficiency. The different types of data include data that is not frequently read and data that is frequently read.
For the data that is not frequently read in each data segment, the reading frequency of the data that is not frequently read is low, so the decoding efficiency of the data that is not frequently read will not be considered, in this embodiment, the bit value corresponding to the minimum byte number corresponding to each data segment is taken as the bit value corresponding to the data that is not frequently read in each data segment, that is, the bit value corresponding to the minimum byte number corresponding to each data segment is taken as the bit value of each bit layer corresponding to the DACTS encoding of the data that is not frequently read, and the bit values of each bit layer are kept consistent, so that a single bit value corresponding to the data that is not frequently read in each data segment is obtained.
(3) The method comprises the steps of obtaining data which are read frequently in each data section and correspond to engineering project time sequence data to be processed, and data reading indexes and data writing indexes which correspond to the data which are read frequently, and determining a plurality of bit values which correspond to the data which are read frequently in each data section according to a single bit value which corresponds to the data which are not read frequently in each data section, a binary code which corresponds to the data which are read frequently in each data section, and the data reading indexes and the data writing indexes which correspond to the data which are read frequently.
And (3-1) acquiring data which is read frequently in each data section corresponding to the engineering project time sequence data to be processed, and a read data index and a write data index corresponding to the data which is read frequently.
According to the data in each data segment corresponding to the engineering project time sequence data to be processed and the reading record and the writing record of the data in each data segment, the data which is read frequently in each data segment corresponding to the engineering project time sequence data to be processed, the reading data index and the writing data index corresponding to the data which is read frequently in each data segment can be obtained, and the reading data index and the writing data index of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment are consistent. The process of obtaining the binary code corresponding to the data that is read frequently, the read data index corresponding to the data that is read frequently, and the write data index corresponding to the data that is read frequently in each data segment is the prior art, and is not within the protection scope of the present invention, and is not elaborated here.
(3-2) determining a plurality of bit values corresponding to the data which is read frequently in each data segment according to a single bit value corresponding to the data which is not read frequently in each data segment, a binary code corresponding to the data which is read frequently in each data segment, a read data index and a write data index corresponding to the data which is read frequently in each data segment, wherein the steps comprise:
(3-2-1) determining the number of bytes when each bit layer of the DACTS code of the data which is read frequently in each data segment is not complemented and the number of bytes when the bits are complemented according to the single bit value corresponding to the data which is read infrequently in each data segment and the binary code corresponding to the data which is read frequently in each data segment.
In this embodiment, when performing DACTS encoding processing on binary codes corresponding to data that is read frequently in each data segment corresponding to engineering project time series data to be processed, zero padding processing needs to be performed on binary codes corresponding to some data that is read frequently, and according to a zero padding processing result in a DACTS encoding processing process corresponding to the data that is read frequently under a single bit value corresponding to the data that is not read frequently, the number of bytes when each bit layer corresponding to the DACTS encoding of the data that is read frequently in each data segment under the single bit value corresponding to the data that is not read frequently is determined, and the number of bytes when zero padding is performed.
(3-2-2) determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the byte number when each bit layer corresponding to the single bit value of the data which is read infrequently is not complemented and the byte number when the byte number is complemented, the read data index and the write data index corresponding to the data which is read frequently in each data segment.
In an embodiment, the second data segment is encoded by a DACTS corresponding to the data which is determined to be read frequently in each data segmentuTaking the corrected byte number of each bit layer as an example, according to the DACTS code of the data which is read frequently in each data segment, the corresponding second bit value under the single bit value corresponding to the data which is not read frequentlyuDetermining the number of bytes when the bit layer is not compensated, the number of bytes when the bit layer is compensated, the reading data index and the writing data index corresponding to the data which is read frequently, and determining the second byte corresponding to the DACTS code of the data which is read frequently in each data segmentuThe calculation formula of the corrected byte number of each bit layer is as follows:
wherein the content of the first and second substances,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is zero-padded,DACTS encoding of frequently read data within respective data segments at respective single bit values of less frequently read datauThe number of bytes when the bit layer is not zero-padded,Rthe data reading indexes corresponding to the data which is read frequently in each data segment,Wand writing data indexes corresponding to the data which is read frequently in each data segment.
It should be noted that the second one corresponds to the DACTS code referring to the data that is often read in each data segmentuAnd determining the corrected byte number of each bit layer, and obtaining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data section.
(3-2-3) determining the corrected bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment, thereby determining a plurality of bit values corresponding to the data which is read frequently in each data segment.
In the present embodiment, the second data corresponding to the DACTS code of the data which is read frequently in each data segment is determineduModified bit values of individual bit layers, for example, are determined from the respective dataA second one corresponding to DACTS coding of data frequently read in a segmentuThe corrected byte number of each bit layer determines the second byte number corresponding to the DACTS code of the data which is read frequently in each data segmentuThe bit value of each bit layer is calculated by the following formula:
wherein the content of the first and second substances,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified bit values of the individual bit layers,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe number of bit blocks corresponding to the modified bit values of the individual bit layers.
It should be noted that the second one corresponds to the DACTS code referring to the data that is often read in each data segmentuAnd determining the corrected bit number value of each bit layer, wherein the corrected bit number value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment can be obtained.
(4) And performing compression coding processing on each data segment corresponding to the engineering project time sequence data to be processed according to the binary codes corresponding to the data in each data segment, the single bit values corresponding to the data which is not frequently read in each data segment and the plurality of bit values corresponding to the data which is frequently read, so as to obtain the engineering project time sequence data to be processed after the compression coding.
In this embodiment, the binary codes corresponding to the infrequently read data in each data segment are compressed and encoded by using the bit values corresponding to the infrequently read data in each data segment, and the binary codes corresponding to the frequently read data in each data segment are compressed and encoded by using a plurality of bit values corresponding to the frequently read data, so that the time series data of the engineering project to be processed after being compressed and encoded is obtained, and the data required by an implementer can be conveniently and quickly and accurately extracted subsequently, which is beneficial to assisting in the calculation of the building economic account.
The method determines the single bit value corresponding to the data which is not read frequently and the plurality of bit values corresponding to the data which is read frequently in each data section corresponding to the engineering project time sequence data to be processed through the electronic digital data processing technology, further determines the engineering project time sequence data to be processed after the compression coding, and effectively improves the compression ratio of the engineering project time sequence data to be processed and the decoding efficiency of the compressed data.
The above-mentioned embodiments are only used to illustrate the technical solutions of the present application, and not to limit the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications and substitutions do not substantially depart from the spirit and scope of the embodiments of the present application and are intended to be included within the scope of the present application.
Claims (10)
1. A calculation assistance method for construction engineering economy is characterized by comprising the following steps:
acquiring engineering project time sequence data to be processed, determining each data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, and further determining binary codes corresponding to each data in each data segment;
acquiring infrequently read data in each data segment corresponding to engineering project time sequence data to be processed, determining a bit value range corresponding to each data in each data segment according to binary codes corresponding to each data in each data segment, and further determining a single bit value corresponding to the infrequently read data in each data segment;
acquiring data which is read frequently in each data segment corresponding to engineering project time sequence data to be processed, a read data index and a write data index corresponding to the data which is read frequently, and determining a plurality of bit values corresponding to the data which is read frequently in each data segment according to a single bit value corresponding to the data which is not read frequently in each data segment, a binary code corresponding to the data which is read frequently in each data segment, a read data index and a write data index corresponding to the data which is read frequently in each data segment;
and performing compression coding processing on each data segment corresponding to the engineering project time sequence data to be processed according to the binary codes corresponding to the data in each data segment, the single bit values corresponding to the data which is not frequently read in each data segment and the plurality of bit values corresponding to the data which is frequently read, so as to obtain the engineering project time sequence data to be processed after the compression coding.
2. A computational assistance method for construction economics according to claim 1 wherein the step of further determining the individual bit values corresponding to infrequently read data within each data segment comprises:
determining a plurality of byte numbers corresponding to each data segment under different bit values according to a binary code corresponding to each data in each data segment corresponding to the engineering project time sequence data to be processed and a bit value range corresponding to each data in each data segment, and further determining a minimum byte number corresponding to each data segment;
determining a bit value corresponding to the minimum byte number corresponding to each data segment according to the minimum byte number corresponding to each data segment corresponding to the engineering project time sequence data to be processed, and taking the bit value corresponding to the minimum byte number corresponding to each data segment as a single bit value corresponding to the data which is not frequently read in each data segment.
3. A calculation aid method for construction engineering economics according to claim 2 wherein the calculation formula for determining the number of bytes corresponding to each data segment at different bit values is:
wherein the content of the first and second substances,the number of bytes corresponding to each data segment corresponding to the engineering project time sequence data to be processed under any bit value,a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqAny bit value of the bit layer is,a second one corresponding to the DACTS code of each data segment corresponding to the engineering project time sequence data to be processedqThe number of bit blocks corresponding to an arbitrary bit value of the bit layer,Qthe number of bit layers corresponding to any bit value of the DACTS codes of each data segment corresponding to the engineering project time sequence data to be processed,Bin order to preset the minimum bit value,the maximum binary code length corresponding to each data segment corresponding to the engineering project time sequence data to be processed,in order to find the function of the maximum value,xmapping of time series data for a project to be processedThe length of the binary code corresponding to each data in each data segment.
4. The method of claim 1, wherein the step of determining a plurality of bit values corresponding to data that is frequently read in each data segment comprises:
determining the number of bytes when each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment under the single bit value corresponding to the data which is read infrequently and the number of bytes when each bit layer corresponding to the single bit value corresponding to the data which is read infrequently in each data segment is not complemented;
determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the byte number when each bit layer is not complemented and the byte number when the byte number is complemented, the read data index and the write data index corresponding to the data which is read frequently, which correspond to the single bit value corresponding to the data which is not read frequently of the DACTS code of the data which is read frequently in each data segment;
and determining the corrected bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment according to the corrected byte number of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment, thereby determining a plurality of bit values corresponding to the data which is read frequently in each data segment.
5. The calculation assisting method for construction engineering economy as claimed in claim 4, wherein the calculation formula for determining the corrected byte number of each bit layer corresponding to the DACTS code of the data which is frequently read in each data segment is:
wherein the content of the first and second substances,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is zero-padded,DACTS encoding of frequently read data within respective data segments corresponds to a second bit value of the infrequently read datauThe number of bytes when the bit layer is not zero-padded,Rthe data reading indexes corresponding to the data which is read frequently in each data segment,Wwriting data indexes corresponding to data which are read frequently in each data segment;
the calculation formula for determining the modified bit value of each bit layer corresponding to the DACTS code of the data which is read frequently in each data segment is as follows:
wherein the content of the first and second substances,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified number of bytes for the bit layer,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe modified bit values of the individual bit layers,a second code corresponding to the DACTS code of the data which is read frequently in each data segmentuThe number of bit blocks corresponding to the modified bit values of the individual bit layers.
6. The method of claim 1, wherein the step of determining the data segments corresponding to the time series data of the construction project comprises:
determining each initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed;
determining modification necessity indexes of all initial data segments according to all initial data segments corresponding to the engineering project time sequence data to be processed;
and modifying each initial data segment corresponding to the engineering project time sequence data to be processed according to the modification necessity index of each initial data segment, thereby determining each data segment corresponding to the engineering project time sequence data to be processed.
7. The calculation assistance method for construction engineering economy as claimed in claim 6, wherein the step of determining each initial data segment corresponding to the engineering project time series data to be processed comprises:
determining a first initial data segment corresponding to the engineering project time sequence data to be processed according to the engineering project time sequence data to be processed, wherein the determining step of the first initial data segment is as follows: determining data segment balance corresponding to a preliminary initial data segment composed of the first 2 data according to the first 2 data in the engineering project time sequence data to be processed, if the data segment balance corresponding to the preliminary initial data segment composed of the first 2 data is not less than a preset data segment balance threshold value, determining data segment balance corresponding to the preliminary initial data segment composed of the first 3 data according to the first 3 data in the engineering project time sequence data to be processed, and if the data segment balance corresponding to the preliminary initial data segment composed of the first 3 data is not less than a preset data segment balance threshold value, determining data segment balance corresponding to the preliminary initial data segment composed of the first 3 data according to the first 4 data in the engineering project time sequence data to be processedThe data segment balance corresponding to the preliminary initial data segment formed by the first 4 data is continuously repeated until the front part in the engineering project time sequence data to be processedmThe data segment balance corresponding to the preliminary initial data segment composed of the data is smaller than the preset data segment balance threshold value, and the front part in the engineering project time sequence data to be processed is processedA preliminary initial data segment composed of the data is used as a first initial data segment corresponding to the engineering project time sequence data to be processed;
and according to the engineering project time sequence data to be processed after the first initial data segment, obtaining a second initial data segment corresponding to the engineering project time sequence data to be processed by referring to the determining step of the first initial data segment, according to the engineering project time sequence data to be processed after the second initial data segment, obtaining a third initial data segment corresponding to the engineering project time sequence data to be processed by referring to the determining step of the first initial data segment, and continuously repeating the step of determining the initial data segments until the engineering project time sequence data to be processed all have the initial data segments corresponding to the engineering project time sequence data to be processed.
8. A computational assistance method for construction engineering economics according to claim 7 wherein the pre-determination in the project time series data to be processed is mademThe calculation formula of the data segment equilibrium corresponding to the preliminary initial data segment composed of the data is as follows:
wherein the content of the first and second substances,Efor pre-in-time sequence data of engineering project to be processedmThe data segment balance corresponding to the preliminary initial data segment formed by the data,for pre-in-time sequence data of engineering project to be processedmIn a preliminary initial data section consisting of individual datajData, β is a hyperparameter.
9. A computational assistance method for construction economy as claimed in claim 6 wherein the step of determining a modification necessity indicator for each initial data segment comprises:
determining the data average value of each initial data segment corresponding to the engineering project time sequence data to be processed according to each initial data segment corresponding to the engineering project time sequence data to be processed;
calculating the difference value between each data in each initial data segment corresponding to the engineering project time sequence data to be processed and the data average value of each initial data segment, thereby obtaining the differential data segment of each initial data segment corresponding to the engineering project time sequence data to be processed;
and determining the modification necessity index of each initial data segment according to the binary code of each initial data segment corresponding to the engineering project time sequence data to be processed and the binary code of the differential data segment of each initial data segment.
10. The calculation assistance method for construction engineering economy as claimed in claim 6, wherein the step of modifying each initial data segment corresponding to the time series data of the engineering project to be processed comprises:
and if the modification necessity index of any initial data segment corresponding to the engineering project time sequence data to be processed is greater than the preset modification necessity index threshold, modifying the initial data segment corresponding to the engineering project time sequence data to be processed, and taking the differential data segment of the initial data segment as the data segment corresponding to the engineering project time sequence data to be processed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210754358.8A CN114817831B (en) | 2022-06-30 | 2022-06-30 | Computing auxiliary method for building engineering economy |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210754358.8A CN114817831B (en) | 2022-06-30 | 2022-06-30 | Computing auxiliary method for building engineering economy |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114817831A true CN114817831A (en) | 2022-07-29 |
CN114817831B CN114817831B (en) | 2022-09-23 |
Family
ID=82523339
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210754358.8A Active CN114817831B (en) | 2022-06-30 | 2022-06-30 | Computing auxiliary method for building engineering economy |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114817831B (en) |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6128413A (en) * | 1997-12-04 | 2000-10-03 | Agfa Corporation | Method and apparatus for data compression |
JP2012134858A (en) * | 2010-12-22 | 2012-07-12 | Mitsubishi Electric Corp | Data compression apparatus, data compression method and data compression program |
CN103688539A (en) * | 2011-09-19 | 2014-03-26 | 松下电器产业株式会社 | Image encoding method, image decoding method, image encoding device, image decoding device, and image encoding-decoding device |
US20170155404A1 (en) * | 2014-06-27 | 2017-06-01 | Gurulogic Microsystems Oy | Encoder and decoder |
CN110545106A (en) * | 2019-08-06 | 2019-12-06 | 清华大学 | Method and device for coding time series data |
CN111201519A (en) * | 2017-08-11 | 2020-05-26 | Altr解决方案公司 | Immutable data storage for low latency reading and writing of large data sets |
CN113094346A (en) * | 2021-03-10 | 2021-07-09 | 北京四达时代软件技术股份有限公司 | Big data coding and decoding method and device based on time sequence |
CN113746485A (en) * | 2021-08-16 | 2021-12-03 | 深圳数联天下智能科技有限公司 | Data compression method, electronic equipment and storage medium |
CN113868206A (en) * | 2021-10-08 | 2021-12-31 | 八十一赞科技发展(重庆)有限公司 | Data compression method, decompression method, device and storage medium |
CN114124106A (en) * | 2022-01-28 | 2022-03-01 | 苏州浪潮智能科技有限公司 | LZ4 decompression method, system, storage medium and equipment |
CN114665884A (en) * | 2022-03-29 | 2022-06-24 | 北京诺司时空科技有限公司 | Time sequence database self-adaptive lossy compression method, system and medium |
-
2022
- 2022-06-30 CN CN202210754358.8A patent/CN114817831B/en active Active
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6128413A (en) * | 1997-12-04 | 2000-10-03 | Agfa Corporation | Method and apparatus for data compression |
JP2012134858A (en) * | 2010-12-22 | 2012-07-12 | Mitsubishi Electric Corp | Data compression apparatus, data compression method and data compression program |
CN103688539A (en) * | 2011-09-19 | 2014-03-26 | 松下电器产业株式会社 | Image encoding method, image decoding method, image encoding device, image decoding device, and image encoding-decoding device |
US20170155404A1 (en) * | 2014-06-27 | 2017-06-01 | Gurulogic Microsystems Oy | Encoder and decoder |
CN111201519A (en) * | 2017-08-11 | 2020-05-26 | Altr解决方案公司 | Immutable data storage for low latency reading and writing of large data sets |
CN110545106A (en) * | 2019-08-06 | 2019-12-06 | 清华大学 | Method and device for coding time series data |
CN113094346A (en) * | 2021-03-10 | 2021-07-09 | 北京四达时代软件技术股份有限公司 | Big data coding and decoding method and device based on time sequence |
CN113746485A (en) * | 2021-08-16 | 2021-12-03 | 深圳数联天下智能科技有限公司 | Data compression method, electronic equipment and storage medium |
CN113868206A (en) * | 2021-10-08 | 2021-12-31 | 八十一赞科技发展(重庆)有限公司 | Data compression method, decompression method, device and storage medium |
CN114124106A (en) * | 2022-01-28 | 2022-03-01 | 苏州浪潮智能科技有限公司 | LZ4 decompression method, system, storage medium and equipment |
CN114665884A (en) * | 2022-03-29 | 2022-06-24 | 北京诺司时空科技有限公司 | Time sequence database self-adaptive lossy compression method, system and medium |
Non-Patent Citations (3)
Title |
---|
ADRIÁN GÓMEZ-BRANDÓN 等: "Lossless compression of industrial time series with direct access", 《COMPUTERS IN INDUSTRY》 * |
邹同春: "海量时序数据的压缩存储方法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
韩骅宇: "时序数据的高效存储与检索", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Also Published As
Publication number | Publication date |
---|---|
CN114817831B (en) | 2022-09-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN116303374B (en) | Multi-dimensional report data optimization compression method based on SQL database | |
CN112953550B (en) | Data compression method, electronic device and storage medium | |
CN116016606B (en) | Sewage treatment operation and maintenance data efficient management system based on intelligent cloud | |
CN108416427A (en) | Convolution kernel accumulates data flow, compressed encoding and deep learning algorithm | |
WO2012033498A1 (en) | Systems and methods for data compression | |
CN109871362A (en) | A kind of data compression method towards streaming time series data | |
CN103995887A (en) | Bitmap index compressing method and bitmap index decompressing method | |
CN106549673A (en) | A kind of data compression method and device | |
CN115269526B (en) | Method and system for processing semiconductor production data | |
CN114782148B (en) | Agricultural product purchase management platform and business data compression method thereof | |
CN117743870B (en) | Water conservancy data management system based on big data | |
CN116361256A (en) | Data synchronization method and system based on log analysis | |
CN114442954B (en) | LZ4 coding compression device | |
CN114268323B (en) | Data compression coding method, device and time sequence database supporting line memory | |
CN114817831B (en) | Computing auxiliary method for building engineering economy | |
CN117040542B (en) | Intelligent comprehensive distribution box energy consumption data processing method | |
US20210125063A1 (en) | Apparatus and method for generating binary neural network | |
CN111061428B (en) | Data compression method and device | |
CN105515586A (en) | Rapid delta compression method | |
CN115882867B (en) | Data compression storage method based on big data | |
CN110310709B (en) | Reference sequence-based gene compression method | |
CN108416425A (en) | A kind of convolution method and device | |
CN202931290U (en) | Compression hardware system based on GZIP | |
CN112506876B (en) | Lossless compression query method supporting SQL query | |
CN115913248A (en) | Live broadcast software development data intelligent management system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |