CN110008236A - Distributed data self-increment coding method, system, device and medium - Google Patents

Distributed data self-increment coding method, system, device and medium

Info

Publication number
CN110008236A
Authority
CN
China
Prior art keywords
data
field
duplicate removal
code
newly
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910301360.8A
Other languages
Chinese (zh)
Other versions
CN110008236B (en)
Inventor
周孝文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing Tianpeng Network Co Ltd
Original Assignee
Chongqing Tianpeng Network Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing Tianpeng Network Co Ltd
Priority to CN201910301360.8A
Publication of CN110008236A
Application granted
Publication of CN110008236B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing

Abstract

The invention discloses a distributed data self-increment coding method, system, electronic device and medium. The method comprises: obtaining external source data and generating a first set; obtaining processed data and generating a second set; computing the union of the first set and the second set to obtain a total data set; deduplicating the total data set to obtain a deduplicated data set; and obtaining the newly added data in the deduplicated data set and encoding the newly added data, thereby completing self-increment coding of the data. The invention implements self-increment coding of Hive data through SQL, without writing UDF functions, so that fields with known names can be encoded; this effectively reduces development cost and improves development efficiency.

Description

Distributed data self-increment coding method, system, device and medium
Technical field
The present invention relates to the technical field of big data, and in particular to a distributed data self-increment coding method, system, device and medium.
Background
With the growth of computer storage capacity and the development of complex algorithms, the volume of network data has grown exponentially in recent years, and applications with massive data requirements, such as scientific data processing and business intelligence analysis, have become increasingly common. Mainstream big data processing relies on Hive and distributed processing, and self-increment coding of data is an important step in it. In the prior art, self-increment coding is implemented by writing UDF functions or similar means, which depends on Java development, costs more to develop and is less efficient. As big data processing demand keeps growing, the drawbacks of traditional self-increment coding are becoming more prominent, so a self-increment coding technique that reduces development difficulty and improves development efficiency is urgently needed.
Summary of the invention
In view of the above problems, the present invention provides a distributed data self-increment coding method, system, device and medium that implement self-increment of data in Hive through SQL, without writing UDF functions, so that fields with known names can be encoded.
Specifically, the present invention provides:
A distributed data self-increment coding method, comprising:
obtaining external source data and generating a first set;
obtaining processed data and generating a second set;
computing the union of the first set and the second set to obtain a total data set;
deduplicating the total data set to obtain a deduplicated data set;
obtaining the newly added data in the deduplicated data set and encoding the newly added data, thereby completing self-increment coding of the data.
Further, obtaining external source data and generating a first set specifically comprises:
obtaining external source data and constructing a coding field and a first field to obtain the first set;
and obtaining processed data and generating a second set specifically comprises:
obtaining the data produced by the immediately preceding processing run and creating a second field to obtain the second set, the default value of the second field being the value of the coding field.
Further, deduplicating the total data set to obtain a deduplicated data set specifically comprises:
grouping the data in the total data set by the field to be encoded;
sorting the data in the total data set by the coding field;
taking the corresponding data from the sorted data according to a preset rule to form the deduplicated data set.
Further, obtaining the newly added data in the deduplicated data set and encoding the newly added data to complete self-increment coding specifically comprises:
sorting the data in the deduplicated data set by the second field, finding in the sorted data the records that meet a preset condition to obtain the newly added data, and encoding the newly added data according to the coding field to obtain new coding field values, thereby completing self-increment coding of the data. This process encodes the newly added data without affecting data that has already been encoded: the numbers of already-encoded content remain unchanged.
The above method implements self-increment coding of Hive data through SQL, without writing UDF functions.
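The steps of the method can be sketched end to end as follows. This is a minimal pure-Python illustration of the logic only, not the SQL the invention runs on Hive. The field names `c_n`, `c_id` and `c_id2` follow the example used in the embodiment; the function name `auto_increment_encode`, the dictionary of prior codes, and the tie-break on `c_n` among new records are assumptions made for the illustration.

```python
def auto_increment_encode(source_names, coded):
    """Return {c_n: c_id}, keeping existing codes and numbering new names.

    source_names: values of the field to encode (c_n) from the external source.
    coded: {c_n: c_id} produced by the previous run (hypothetical container).
    """
    # First set: c_id defaults to 0, c_id2 to None (null).
    # Second set: c_id2 takes the value of c_id.
    total = [(n, 0, None) for n in source_names]
    total += [(n, c, c) for n, c in coded.items()]

    # Deduplicate: per c_n keep the record with the largest c_id, so an
    # already-coded record wins over a fresh source record (c_id = 0).
    best = {}
    for c_n, c_id, c_id2 in total:
        if c_n not in best or c_id > best[c_n][0]:
            best[c_n] = (c_id, c_id2)

    # Sort by c_id2 descending with nulls last; ties among new records are
    # broken by c_n (an assumption, to make the result deterministic).
    ordered = sorted(
        best.items(),
        key=lambda kv: (kv[1][1] is None, -(kv[1][1] or 0), kv[0]),
    )

    # Existing records keep c_id2; new records take their row number rn2.
    result = {}
    for rn2, (c_n, (_, c_id2)) in enumerate(ordered, start=1):
        result[c_n] = c_id2 if c_id2 is not None else rn2
    return result


run1 = auto_increment_encode(["banana", "apple"], {})
run2 = auto_increment_encode(["cherry", "apple"], run1)
print(run1)
print(run2)
```

In the second run, `apple` and `banana` keep the codes from the first run and only `cherry` receives a new number, which is the stability property the method describes. The sketch relies on the previous codes being the contiguous range 1..N, which holds when every run carries all earlier records forward.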
A distributed data self-increment coding system, comprising:
an external source data processing module, configured to obtain external source data and generate a first set;
a processed data processing module, configured to obtain processed data and generate a second set;
a data merging module, configured to compute the union of the first set and the second set to obtain a total data set;
a data deduplication module, configured to deduplicate the total data set to obtain a deduplicated data set;
a data self-increment coding module, configured to obtain the newly added data in the deduplicated data set and encode the newly added data, thereby completing self-increment coding of the data.
Further, the external source data processing module is specifically configured to:
obtain external source data and construct a coding field and a first field to obtain the first set;
and the processed data processing module is specifically configured to:
obtain the data produced by the immediately preceding processing run and create a second field to obtain the second set, the default value of the second field being the value of the coding field.
Further, the data deduplication module is specifically configured to:
group the data in the total data set by the field to be encoded;
sort the data in the total data set by the coding field;
take the corresponding data from the sorted data according to a preset rule to form the deduplicated data set.
Further, the data self-increment coding module is specifically configured to:
sort the data in the deduplicated data set by the second field, find in the sorted data the records that meet a preset condition to obtain the newly added data, and encode the newly added data according to the coding field to obtain new coding field values, thereby completing self-increment coding of the data. This process encodes the newly added data without affecting data that has already been encoded: the numbers of already-encoded content remain unchanged.
The above system implements self-increment coding of Hive data through SQL, without writing UDF functions.
An electronic device, comprising a housing, a processor, a memory, a circuit board and a power supply circuit, wherein the circuit board is placed inside the space enclosed by the housing, and the processor and the memory are arranged on the circuit board; the power supply circuit supplies power to each circuit or component of the electronic device; the memory stores executable program code; and the processor reads the executable program code stored in the memory and runs the corresponding program in order to execute the aforementioned distributed data self-increment coding method.
A computer-readable storage medium storing one or more programs, the one or more programs being executable by one or more processors to implement the aforementioned distributed data self-increment coding method.
The beneficial effects of the present invention are as follows:
The present invention implements self-increment coding of Hive data through SQL, without writing UDF functions, so that fields with known names can be encoded; this effectively reduces development cost and improves development efficiency. Unlike the traditional approach of simply applying a sequence to encode the data once, the present invention applies self-increment coding to newly added records while the numbers of already-encoded content remain unchanged.
Brief description of the drawings
In order to explain the specific embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings needed in the description of the specific embodiments or the prior art are briefly introduced below. In all the drawings, similar elements or parts are generally identified by similar reference numerals. In the drawings, elements or parts are not necessarily drawn to scale.
Fig. 1 is a flow chart of a distributed data self-increment coding method according to an embodiment of the present invention;
Fig. 2 is a structural diagram of a distributed data self-increment coding system according to an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed description of the embodiments
Embodiments of the technical solution of the present invention are described in detail below with reference to the drawings. The following embodiments are only used to illustrate the technical solution of the present invention more clearly; they serve only as examples and cannot be used to limit the scope of protection of the present invention.
It should be noted that, unless otherwise stated, the technical or scientific terms used in this application have the ordinary meaning understood by those skilled in the art to which the present invention belongs.
As shown in Fig. 1, an embodiment of the distributed data self-increment coding method of the present invention comprises:
S11: obtaining external source data and generating a first set;
S12: obtaining processed data and generating a second set;
S13: computing the union of the first set and the second set to obtain a total data set;
S14: deduplicating the total data set to obtain a deduplicated data set;
S15: obtaining the newly added data in the deduplicated data set and encoding the newly added data, thereby completing self-increment coding of the data.
Preferably, obtaining external source data and generating a first set specifically comprises:
obtaining external source data and constructing a coding field and a first field to obtain the first set; for example, a coding field (c_id) is constructed with a default value of 0, and a further field (c_id2) is constructed with a default value of null;
and obtaining processed data and generating a second set specifically comprises:
obtaining the data produced by the immediately preceding processing run and creating a second field to obtain the second set, the default value of the second field being the value of the coding field; for example, the most recently processed data is obtained and a field (c_id2) is constructed whose default value is the value of the coding field (c_id).
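The construction of the two sets can be illustrated with a short sketch. It assumes each record is a plain Python dictionary and that the field to be encoded is called `c_n`, as in the deduplication example of this embodiment; the helper names `build_first_set` and `build_second_set` are hypothetical.

```python
def build_first_set(source_names):
    """External source data: c_id defaults to 0 and c_id2 to None (null)."""
    return [{"c_n": name, "c_id": 0, "c_id2": None} for name in source_names]


def build_second_set(previous_run):
    """Previously processed data: c_id2 takes the value of c_id.

    previous_run is an iterable of (c_n, c_id) pairs from the last run.
    """
    return [{"c_n": name, "c_id": code, "c_id2": code}
            for name, code in previous_run]


first_set = build_first_set(["banana", "cherry"])
second_set = build_second_set([("apple", 1), ("banana", 2)])

# The total data set is the union of both sets (kept here as a simple
# concatenation; duplicates are removed in the next step).
total_set = first_set + second_set
print(total_set)
```

Giving source records a `c_id` of 0 and a null `c_id2` is what later lets the method tell new records apart from records that already carry a code.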
Preferably, deduplicating the total data set to obtain a deduplicated data set specifically comprises:
grouping the data in the total data set by the field to be encoded;
sorting the data in the total data set by the coding field;
taking the corresponding data from the sorted data according to a preset rule to form the deduplicated data set;
for example, the data in the total data set is grouped by the field to be encoded (c_n), each group is sorted in descending order by the coding field (c_id), and the record with sequence number 1 is taken. The main purpose of this step is to deduplicate so that each value stays unique: if the same value of the field to be encoded (c_n) exists in both the first set and the second set, the record from the second set is taken, which ensures that the same record receives the same code on every run.
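The deduplication rule can be sketched as below, again as an illustration rather than the actual Hive SQL. Records are assumed to be dictionaries with the fields `c_n`, `c_id` and `c_id2`; the function name `deduplicate` is hypothetical.

```python
from itertools import groupby
from operator import itemgetter


def deduplicate(total_set):
    """Keep one record per c_n: the one with the largest c_id."""
    # groupby needs its input sorted by the grouping key.
    by_name = sorted(total_set, key=itemgetter("c_n"))
    result = []
    for _, group in groupby(by_name, key=itemgetter("c_n")):
        # Descending sort by c_id, then take the record with sequence number 1.
        ranked = sorted(group, key=itemgetter("c_id"), reverse=True)
        result.append(ranked[0])
    return result


total_set = [
    {"c_n": "apple", "c_id": 0, "c_id2": None},   # from the source
    {"c_n": "apple", "c_id": 1, "c_id2": 1},      # already coded
    {"c_n": "cherry", "c_id": 0, "c_id2": None},  # new, listed twice
    {"c_n": "cherry", "c_id": 0, "c_id2": None},
]
dedup_set = deduplicate(total_set)
print(dedup_set)
```

Because source records carry `c_id = 0` and coded records carry a positive code, the descending sort always places the already-coded record first, so it is the one that survives.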
Preferably, obtaining the newly added data in the deduplicated data set and encoding the newly added data to complete self-increment coding specifically comprises:
sorting the data in the deduplicated data set by the second field, finding in the sorted data the records that meet a preset condition to obtain the newly added data, and encoding the newly added data according to the coding field to obtain new coding field values, thereby completing self-increment coding of the data. For example, the data in the deduplicated data set is sorted in descending order by the constructed field (c_id2) to obtain a sequence number rn2; after the descending sort, null values come last. It is then checked whether the constructed field (c_id2) is null: if it is not null, the value of c_id2 is taken; if it is null, the sequence number rn2 is taken as the code. This gives the value of the new coding field (c_id). This process encodes the newly added data without affecting data that has already been encoded: the numbers of already-encoded content remain unchanged.
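The numbering step can be sketched as follows. The sketch assumes dictionary records with the fields `c_n` and `c_id2`, and it breaks ties among new records by `c_n`, which the source does not specify; the function name `assign_codes` is hypothetical.

```python
def assign_codes(dedup_set):
    """Give each record its final c_id.

    Records are sorted by c_id2 in descending order with nulls last and
    numbered rn2 = 1, 2, ...; a record keeps c_id2 if it has one, otherwise
    it takes rn2 as its new code.
    """
    ordered = sorted(
        dedup_set,
        # Non-null first, then larger c_id2 first, then c_n as a tie-break
        # (the tie-break is an assumption made for determinism).
        key=lambda r: (r["c_id2"] is None, -(r["c_id2"] or 0), r["c_n"]),
    )
    coded = []
    for rn2, row in enumerate(ordered, start=1):
        code = row["c_id2"] if row["c_id2"] is not None else rn2
        coded.append({"c_n": row["c_n"], "c_id": code})
    return coded


dedup_set = [
    {"c_n": "cherry", "c_id2": None},
    {"c_n": "apple", "c_id2": 1},
    {"c_n": "date", "c_id2": None},
    {"c_n": "banana", "c_id2": 2},
]
coded = assign_codes(dedup_set)
print(coded)
```

With N already-coded records sorted ahead of the nulls, the first new record receives rn2 = N + 1, so new codes continue after the existing ones as long as the existing codes are exactly 1..N.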
The above method implements self-increment coding of Hive data through SQL, without writing UDF functions.
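The source does not give the SQL itself. The following is one possible formulation of steps S11 to S15 as a single query, run here through Python's built-in `sqlite3` module as a stand-in for Hive so that it can be executed locally. The table names `source` and `coded`, the use of `UNION ALL`, the tie-break on `c_n` and the final `ORDER BY` are assumptions; the query needs SQLite 3.25 or later for window functions and relies on NULLs sorting last in descending order, which is also the behaviour the embodiment describes.

```python
import sqlite3

ENCODE_SQL = """
WITH total AS (                      -- S11-S13: build both sets and merge them
    SELECT c_n, 0 AS c_id, NULL AS c_id2 FROM source
    UNION ALL
    SELECT c_n, c_id, c_id AS c_id2 FROM coded
),
ranked AS (                          -- S14: group by c_n, sort by c_id desc
    SELECT c_n, c_id2,
           ROW_NUMBER() OVER (PARTITION BY c_n ORDER BY c_id DESC) AS rn
    FROM total
),
numbered AS (                        -- S15: number rows, nulls come last
    SELECT c_n, c_id2,
           ROW_NUMBER() OVER (ORDER BY c_id2 DESC, c_n) AS rn2
    FROM ranked
    WHERE rn = 1
)
SELECT c_n, COALESCE(c_id2, rn2) AS c_id
FROM numbered
ORDER BY c_n
"""


def encode(source_names, coded_rows):
    """Run the query on an in-memory database and return (c_n, c_id) rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE source (c_n TEXT)")
    conn.execute("CREATE TABLE coded (c_n TEXT, c_id INTEGER)")
    conn.executemany("INSERT INTO source VALUES (?)",
                     [(n,) for n in source_names])
    conn.executemany("INSERT INTO coded VALUES (?, ?)", coded_rows)
    rows = conn.execute(ENCODE_SQL).fetchall()
    conn.close()
    return rows


rows = encode(["banana", "cherry", "apple", "date", "cherry"],
              [("apple", 1), ("banana", 2)])
print(rows)
```

The query uses only grouping, sorting, row numbering and a null check, which is why no user-defined function is required.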
As shown in Fig. 2, an embodiment of the distributed data self-increment coding system of the present invention comprises:
an external source data processing module 21, configured to obtain external source data and generate a first set;
a processed data processing module 22, configured to obtain processed data and generate a second set;
a data merging module 23, configured to compute the union of the first set and the second set to obtain a total data set;
a data deduplication module 24, configured to deduplicate the total data set to obtain a deduplicated data set;
a data self-increment coding module 25, configured to obtain the newly added data in the deduplicated data set and encode the newly added data, thereby completing self-increment coding of the data.
Preferably, the external source data processing module 21 is specifically configured to:
obtain external source data and construct a coding field and a first field to obtain the first set; for example, a coding field (c_id) is constructed with a default value of 0, and a further field (c_id2) is constructed with a default value of null;
and the processed data processing module 22 is specifically configured to:
obtain the data produced by the immediately preceding processing run and create a second field to obtain the second set, the default value of the second field being the value of the coding field; for example, the most recently processed data is obtained and a field (c_id2) is constructed whose default value is the value of the coding field (c_id).
Preferably, the data deduplication module 24 is specifically configured to:
group the data in the total data set by the field to be encoded;
sort the data in the total data set by the coding field;
take the corresponding data from the sorted data according to a preset rule to form the deduplicated data set;
for example, the data in the total data set is grouped by the field to be encoded (c_n), each group is sorted in descending order by the coding field (c_id), and the record with sequence number 1 is taken. The main purpose of this step is to deduplicate so that each value stays unique: if the same value of the field to be encoded (c_n) exists in both the first set and the second set, the record from the second set is taken, which ensures that the same record receives the same code on every run.
Preferably, the data self-increment coding module 25 is specifically configured to:
sort the data in the deduplicated data set by the second field, find in the sorted data the records that meet a preset condition to obtain the newly added data, and encode the newly added data according to the coding field to obtain new coding field values, thereby completing self-increment coding of the data. For example, the data in the deduplicated data set is sorted in descending order by the constructed field (c_id2) to obtain a sequence number rn2; after the descending sort, null values come last. It is then checked whether the constructed field (c_id2) is null: if it is not null, the value of c_id2 is taken; if it is null, the sequence number rn2 is taken as the code. This gives the value of the new coding field (c_id). This process encodes the newly added data without affecting data that has already been encoded: the numbers of already-encoded content remain unchanged.
The above system implements self-increment coding of Hive data through SQL, without writing UDF functions.
An embodiment of the present invention also provides an electronic device, shown in Fig. 3, which can implement the process of the embodiment shown in Fig. 1. The electronic device may comprise a housing 31, a processor 32, a memory 33, a circuit board 34 and a power supply circuit 35, wherein the circuit board 34 is placed inside the space enclosed by the housing 31, and the processor 32 and the memory 33 are arranged on the circuit board 34; the power supply circuit 35 supplies power to each circuit or component of the electronic device; the memory 33 stores executable program code; and the processor 32 reads the executable program code stored in the memory 33 and runs the corresponding program in order to execute the aforementioned distributed data self-increment coding method.
For the specific execution of the above steps by the processor 32, and for the steps the processor 32 further executes by running the executable program code, refer to the description of the embodiment shown in Fig. 1; details are not repeated here.
The electronic device exists in a variety of forms, including but not limited to:
(1) Server: a device that provides computing services. A server comprises a processor, hard disk, memory, system bus and so on; its architecture is similar to that of a general-purpose computer, but because it must provide highly reliable services, it has higher requirements for processing capacity, stability, reliability, security, scalability and manageability;
(2) Other electronic devices with data interaction functions.
An embodiment of the present invention also provides a computer-readable storage medium storing one or more programs, the one or more programs being executable by one or more processors to implement the aforementioned distributed data self-increment coding method.
The present invention implements self-increment coding of Hive data through SQL, without writing UDF functions, so that fields with known names can be encoded; this effectively reduces development cost and improves development efficiency. Unlike the traditional approach of simply applying a sequence to encode the data once, the present invention applies self-increment coding to newly added records while the numbers of already-encoded content remain unchanged.
Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention and do not limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments can still be modified, or some or all of their technical features can be equivalently replaced; such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention, and they shall all be covered by the claims and the description of the present invention.

Claims (10)

1. A distributed data self-increment coding method, characterized by comprising:
obtaining external source data and generating a first set;
obtaining processed data and generating a second set;
computing the union of the first set and the second set to obtain a total data set;
deduplicating the total data set to obtain a deduplicated data set;
obtaining the newly added data in the deduplicated data set and encoding the newly added data, thereby completing self-increment coding of the data.
2. The method according to claim 1, characterized in that obtaining external source data and generating a first set specifically comprises:
obtaining external source data and constructing a coding field and a first field to obtain the first set;
and obtaining processed data and generating a second set specifically comprises:
obtaining the data produced by the immediately preceding processing run and creating a second field to obtain the second set, the default value of the second field being the value of the coding field.
3. The method according to claim 2, characterized in that deduplicating the total data set to obtain a deduplicated data set specifically comprises:
grouping the data in the total data set by the field to be encoded;
sorting the data in the total data set by the coding field;
taking the corresponding data from the sorted data according to a preset rule to form the deduplicated data set.
4. The method according to claim 3, characterized in that obtaining the newly added data in the deduplicated data set and encoding the newly added data to complete self-increment coding specifically comprises:
sorting the data in the deduplicated data set by the second field, finding in the sorted data the records that meet a preset condition to obtain the newly added data, and encoding the newly added data according to the coding field to obtain new coding field values, thereby completing self-increment coding of the data.
5. A distributed data self-increment coding system, characterized by comprising:
an external source data processing module, configured to obtain external source data and generate a first set;
a processed data processing module, configured to obtain processed data and generate a second set;
a data merging module, configured to compute the union of the first set and the second set to obtain a total data set;
a data deduplication module, configured to deduplicate the total data set to obtain a deduplicated data set;
a data self-increment coding module, configured to obtain the newly added data in the deduplicated data set and encode the newly added data, thereby completing self-increment coding of the data.
6. The system according to claim 5, characterized in that the external source data processing module is specifically configured to:
obtain external source data and construct a coding field and a first field to obtain the first set;
and the processed data processing module is specifically configured to:
obtain the data produced by the immediately preceding processing run and create a second field to obtain the second set, the default value of the second field being the value of the coding field.
7. The system according to claim 6, characterized in that the data deduplication module is specifically configured to:
group the data in the total data set by the field to be encoded;
sort the data in the total data set by the coding field;
take the corresponding data from the sorted data according to a preset rule to form the deduplicated data set.
8. The system according to claim 7, characterized in that the data self-increment coding module is specifically configured to:
sort the data in the deduplicated data set by the second field, find in the sorted data the records that meet a preset condition to obtain the newly added data, and encode the newly added data according to the coding field to obtain new coding field values, thereby completing self-increment coding of the data.
9. An electronic device, characterized in that the electronic device comprises a housing, a processor, a memory, a circuit board and a power supply circuit, wherein the circuit board is placed inside the space enclosed by the housing, and the processor and the memory are arranged on the circuit board; the power supply circuit supplies power to each circuit or component of the electronic device; the memory stores executable program code; and the processor reads the executable program code stored in the memory and runs the corresponding program in order to execute the method according to any one of claims 1-4.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores one or more programs, the one or more programs being executable by one or more processors to implement the method according to any one of claims 1-4.
CN201910301360.8A 2019-04-15 2019-04-15 Data distributed type self-increment coding method, system, equipment and medium Active CN110008236B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910301360.8A CN110008236B (en) 2019-04-15 2019-04-15 Data distributed type self-increment coding method, system, equipment and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910301360.8A CN110008236B (en) 2019-04-15 2019-04-15 Data distributed type self-increment coding method, system, equipment and medium

Publications (2)

Publication Number Publication Date
CN110008236A (en) 2019-07-12
CN110008236B (en) 2020-08-04

Family

ID=67172027

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910301360.8A Active CN110008236B (en) 2019-04-15 2019-04-15 Data distributed type self-increment coding method, system, equipment and medium

Country Status (1)

Country Link
CN (1) CN110008236B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101741880A (en) * 2008-11-10 2010-06-16 江苏省电力公司南京供电公司 Power service mobile service system-based method for interacting wireless remote data
CN103064908A (en) * 2012-12-18 2013-04-24 北京讯鸟软件有限公司 Method for rapidly removing repeated list through a memory
US20140019776A1 (en) * 2012-07-01 2014-01-16 Jerzy Lewak Methods of providing fast search, analysis, and data retrieval of encrypted data without decryption
CN103559323A (en) * 2013-11-22 2014-02-05 盛杰 Database implementation method
US20150066875A1 (en) * 2013-08-29 2015-03-05 Cleversafe, Inc. Updating de-duplication tracking data for a dispersed storage network
CN106802817A (en) * 2016-12-29 2017-06-06 杭州迪普科技股份有限公司 The upgrade method and device of SQLite databases
CN107544984A (en) * 2016-06-27 2018-01-05 北京京东尚科信息技术有限公司 A kind of method and apparatus of data processing
EP3547145A2 (en) * 2018-03-30 2019-10-02 Atlassian Pty Ltd Systems and methods for reducing storage required for code coverage results

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113220783A (en) * 2021-05-07 2021-08-06 深圳市粤睦信息科技有限公司 Data processing method and device, electronic equipment and storage medium
CN113220783B (en) * 2021-05-07 2024-03-26 深圳市粤睦信息科技有限公司 Data processing method, device, electronic equipment and storage medium
CN113641520A (en) * 2021-08-20 2021-11-12 北京百度网讯科技有限公司 Data processing method, system, device and storage medium
CN113641520B (en) * 2021-08-20 2024-04-05 北京百度网讯科技有限公司 Data processing method, system, device and storage medium

Also Published As

Publication number Publication date
CN110008236B (en) 2020-08-04

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant