KR101727508B1 - Apparatus and method for accelerating hardware compression based on hadoop - Google Patents
Apparatus and method for accelerating hardware compression based on hadoop Download PDFInfo
- Publication number
- KR101727508B1 KR101727508B1 KR1020150106457A KR20150106457A KR101727508B1 KR 101727508 B1 KR101727508 B1 KR 101727508B1 KR 1020150106457 A KR1020150106457 A KR 1020150106457A KR 20150106457 A KR20150106457 A KR 20150106457A KR 101727508 B1 KR101727508 B1 KR 101727508B1
- Authority
- KR
- South Korea
- Prior art keywords
- hadoop
- compression
- data block
- control module
- size
- Prior art date
Links
Images
Classifications
-
- G06F17/30318—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G06F17/30194—
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
Abstract
The present invention relates to a Hadoop-based hardware compression and acceleration apparatus. The present invention complements the performance of a low-power CPU by performing compression and decompression processes performed by Hadoop middleware in a low-power Hadoop storage appliance through hardware. To this end, the Hadoop-based hardware compression accelerating apparatus according to the present invention performs pre-registration and search with an input buffer for receiving a data block to be compressed or decompressed, and compresses the data block through a window And a control module for controlling the input buffer, the dictionary module, and the output buffer based on a dictionary module, an output buffer for outputting the result of performing the compression, and Hadoop storage appliance information.
Description
The present invention relates to an apparatus and method for accelerating hardware compression for high-speed processing of low power Hadoop storage appliances.
Recently, Hadoop clusters have been used as a method for efficiently distributing Big Data.
Even with the Hadoop cluster, as the amount of data to be processed increases, a larger number of servers are required for data storage and analysis. The expansion of such a server causes a lot of power consumption in cluster operation, and the cost of cluster management is high.
Therefore, the need for low-power Hadoop storage appliances is emerging. The use of these low-power Hadoop storage appliances requires the use of low-power CPUs. However, there is no provision of an apparatus and method for compensating for the poor computing power of such a low-power CPU.
SUMMARY OF THE INVENTION The present invention has been made to solve the above problems, and an object of the present invention is to provide a Hadoop-based hardware compression and acceleration apparatus to an appliance.
More specifically, the present invention provides an apparatus and method for minimizing the time consumed in distributing and analyzing big data by performing data compression of the Hadoop system through the hardware compression / speedup device.
The Hadoop-based hardware compression and acceleration apparatus according to an embodiment of the present invention performs pre-registration and search with an input buffer for receiving a data block to be compressed or decompressed, and performs compression on the data block through a window And a control module for controlling the input buffer, the dictionary module, and the output buffer based on a dictionary module, an output buffer for outputting the result of performing the compression, and Hadoop storage appliance information.
The Hadoop-based hardware compression acceleration apparatus and method according to the present invention can utilize hardware parallelism to accelerate the pre-retrieval and registration process of the compression algorithm.
More specifically, Hadoop-based hardware compression accelerators and methods improve throughput over existing software compression through acceleration of pre-retrieval and registration processes.
In addition, according to the Hadoop-based hardware compression speed increasing apparatus and method according to the present invention, it is possible to manage a Hadoop cluster with a low-power Hadoop storage appliance at low cost by supplementing the insufficient computing power of a low-power CPU.
BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is an illustration of a structure of a Hadoop-based hardware compression accelerator according to the present invention. FIG.
2 is a flowchart illustrating an operation of a Hadoop-based hardware compression accelerator according to the present invention;
Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. BRIEF DESCRIPTION OF THE DRAWINGS The advantages and features of the present invention, and the manner of achieving them, will be apparent from and elucidated with reference to the embodiments described hereinafter in conjunction with the accompanying drawings. The present invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art. To fully disclose the scope of the invention to those skilled in the art, and the invention is only defined by the scope of the claims. Like reference numerals refer to like elements throughout the specification.
Unless defined otherwise, all terms (including technical and scientific terms) used herein may be used in a sense commonly understood by one of ordinary skill in the art to which this invention belongs. Also, commonly used predefined terms are not ideally or excessively interpreted unless explicitly defined otherwise.
The terminology used herein is for the purpose of illustrating embodiments and is not intended to be limiting of the present invention. In the present specification, the singular form includes plural forms unless otherwise specified in the specification. The terms " comprises "and / or" comprising "used in the specification do not exclude the presence or addition of one or more other elements in addition to the stated element.
In this specification, an appliance means hardware such as a server or storage. The appliance may be an information device that is pre-installed with software and sold in a state optimized for a specific task. The user can use the appliance by connecting the power supply at the time of purchase without installing a separate program such as installation or setting of the integrated equipment operating system or application software.
In particular, the Hadoop storage appliance refers to an appliance that performs distributed data storage based on Hadoop.
BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is an illustration of a Hadoop-based hardware compression and acceleration device in accordance with the present invention; FIG. Referring to FIG. 1, a Hadoop-based hardware compression and
The hardware compression and
In addition, the Hadoop-based hardware compression /
When the Hadoop-based hardware compression /
The
The endianness may be determined according to the usage environment of the hardware compression / In other words, it can be determined according to the kind of CPU applied to the Hadoop cluster. Alternatively, the endianness may be determined according to a setting of a user or a manufacturer.
The
The
The
The
The
Since each step of the compression process is sequential, the
The
In addition, the
That is, the
In addition, the
The
In addition, the
2 is a flowchart illustrating an operation of a Hadoop-based hardware compression accelerator according to the present invention. The compression operation of the Hadoop-based hardware compression /
Referring to FIG. 2, the Hadoop-based hardware compression and
The Hadoop-based hardware compression and
Next, the Hadoop-based hardware compression /
On the other hand, if the calculated computation amount does not satisfy the predefined computation amount, the
Through this process, the Hadoop-based hardware compression /
On the other hand, since the constituent elements in Fig. 1 are merely classified according to function or operation, they may be classified according to other criteria. In addition, since the illustrated elements are not essential elements, they may not include some elements or may further include additional elements.
While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. , And are not intended to limit the scope of the present invention. It will be apparent to those skilled in the art that other modifications based on the technical idea of the present invention may be practiced without departing from the scope of the invention disclosed herein.
Claims (9)
The control module pre-stores a plurality of compression algorithms including an LZ 4 compression algorithm for compressing the data block based on the Hadoop storage appliance information included in the Hadoop cluster, and determines a second step;
A dictionary module including a memory for storing a dictionary value of an offset and a hash function for controlling a memory address simultaneously performs a pre-registration and a search in parallel by the size of a window and stores the data block received from the input buffer through a window A third step of performing compression according to the compression algorithm determined by the control module;
A fourth step of the control module measuring a computation amount for compression performed through the pre-module;
A fifth step of determining whether the computation amount measured by the control module satisfies a predetermined computation amount; And
And outputting the result of the compression in accordance with the bus width and endian of the Hadoop storage appliance according to FIFO (First In First Out)
The control module changes the compression algorithm on the basis of the Hadoop storage appliance information and the measured computation amount, and if the size of the data block and the size of the window are less than the predetermined computation amount, Size, and the pre-module compresses the data block with the modified compression algorithm and the changed size so that the control module performs the fifth step again,
Wherein the sixth step is performed after the pre-module in the third step completes performing compression on the data block when the calculated amount of computation in the fifth step satisfies the preset amount of computation,
The Hadoop storage appliance information includes total storage space information of the number of Hadoop storage appliances included in the Hadoop cluster, the available storage space of each of the Hadoop storage appliances, and the storage space of the Hadoop storage appliance
Wherein the control module determines a size of a data block to which the input buffer should be compressed or decompressed from the MapReduce task,
The amount of computation for compression is adjusted according to the compression algorithm change
Hadoop based hardware acceleration method.
The pre-
Performing compression on the data block according to the LZ 4 compression algorithm,
Hadoop based hardware acceleration method.
Wherein the size of the data block received by the input buffer is 256 KB,
Hadoop based hardware acceleration method.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150106457A KR101727508B1 (en) | 2015-07-28 | 2015-07-28 | Apparatus and method for accelerating hardware compression based on hadoop |
PCT/KR2015/008449 WO2017018567A1 (en) | 2015-07-28 | 2015-08-12 | Hadoop-based hardware compression acceleration device and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150106457A KR101727508B1 (en) | 2015-07-28 | 2015-07-28 | Apparatus and method for accelerating hardware compression based on hadoop |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20170014042A KR20170014042A (en) | 2017-02-08 |
KR101727508B1 true KR101727508B1 (en) | 2017-04-18 |
Family
ID=57886830
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020150106457A KR101727508B1 (en) | 2015-07-28 | 2015-07-28 | Apparatus and method for accelerating hardware compression based on hadoop |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR101727508B1 (en) |
WO (1) | WO2017018567A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102195239B1 (en) | 2019-11-29 | 2020-12-24 | 숭실대학교산학협력단 | Method for data compression transmission considering bandwidth in hadoop cluster, recording medium and device for performing the method |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102592785B1 (en) * | 2021-06-02 | 2023-10-23 | 네이버 주식회사 | Method, computer device, and computer program to provide individual data retrieval service |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014105323A1 (en) | 2012-12-28 | 2014-07-03 | Apple Inc. | Methods and apparatus for compressed and compacted virtual memory |
US20140258650A1 (en) | 2013-03-06 | 2014-09-11 | Ab Initio Technology Llc | Managing operations on stored data units |
US20150172209A1 (en) | 2013-12-12 | 2015-06-18 | International Business Machines Corporation | Resource over-subscription |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103020205B (en) * | 2012-12-05 | 2018-07-31 | 中科天玑数据科技股份有限公司 | Compression/decompression method based on hardware accelerator card in a kind of distributed file system |
US9342557B2 (en) * | 2013-03-13 | 2016-05-17 | Cloudera, Inc. | Low latency query engine for Apache Hadoop |
CN103729429A (en) * | 2013-12-26 | 2014-04-16 | 浪潮电子信息产业股份有限公司 | Hbase based compression method |
-
2015
- 2015-07-28 KR KR1020150106457A patent/KR101727508B1/en active IP Right Grant
- 2015-08-12 WO PCT/KR2015/008449 patent/WO2017018567A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014105323A1 (en) | 2012-12-28 | 2014-07-03 | Apple Inc. | Methods and apparatus for compressed and compacted virtual memory |
US20140258650A1 (en) | 2013-03-06 | 2014-09-11 | Ab Initio Technology Llc | Managing operations on stored data units |
US20150172209A1 (en) | 2013-12-12 | 2015-06-18 | International Business Machines Corporation | Resource over-subscription |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102195239B1 (en) | 2019-11-29 | 2020-12-24 | 숭실대학교산학협력단 | Method for data compression transmission considering bandwidth in hadoop cluster, recording medium and device for performing the method |
Also Published As
Publication number | Publication date |
---|---|
WO2017018567A1 (en) | 2017-02-02 |
KR20170014042A (en) | 2017-02-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107273331A (en) | A kind of heterogeneous computing system and method based on CPU+GPU+FPGA frameworks | |
CN112514264A (en) | Data compression method, data decompression method, related device, electronic equipment and system | |
CN102694554A (en) | Data compression devices, operating methods thereof, and data processing apparatuses including the same | |
CN113572479B (en) | Method and system for generating finite state entropy coding table | |
US10891082B2 (en) | Methods for accelerating compression and apparatuses using the same | |
EP3330866A1 (en) | Methods and apparatus for programmable integrated circuit coprocessor sector management | |
US10218358B2 (en) | Methods and apparatus for unloading data from a configurable integrated circuit | |
KR101727508B1 (en) | Apparatus and method for accelerating hardware compression based on hadoop | |
CN109075798B (en) | Variable size symbol entropy-based data compression | |
US9319040B2 (en) | Distributing multiplexing logic to remove multiplexor latency on the output path for variable clock cycle, delayed signals | |
JP5674954B2 (en) | Stream data abnormality detection method and apparatus | |
US20220253668A1 (en) | Data processing method and device, storage medium and electronic device | |
CN115941598A (en) | Flow table semi-uninstalling method, device and medium | |
CN110688160B (en) | Instruction pipeline processing method, system, equipment and computer storage medium | |
US20030005189A1 (en) | Method for improving inline compression bandwidth for high speed buses | |
US11604738B2 (en) | Device and method for data compression using a metadata cache | |
Shcherbakov et al. | A parallel adaptive range coding compressor: algorithm, FPGA prototype, evaluation | |
US9197243B2 (en) | Compression ratio for a compression engine | |
Daoud et al. | Real-time Bitstream Decompression Scheme for FPGAs Reconfiguration | |
CN115904488A (en) | Data transmission method, system, device and equipment | |
US9495304B2 (en) | Address compression method, address decompression method, compressor, and decompressor | |
WO2017044128A1 (en) | Averaging modules | |
US20240119022A1 (en) | Hardware distributed architecture in a data transform accelerator | |
Choi et al. | Energy efficient and low-cost server architecture for hadoop storage appliance | |
US20230325230A1 (en) | Network functions virtualization platforms with function chaining capabilities |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
N231 | Notification of change of applicant | ||
GRNT | Written decision to grant |