KR102824624B1 - Semi-sorting compression with encoding and decoding tables - Google Patents
Semi-sorting compression with encoding and decoding tables
- Publication number
- KR102824624B1 (application KR1020227007999A)
- Authority
- KR
- South Korea
- Prior art keywords
- prefix
- data
- prefixes
- data items
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0638—Organizing or formatting or addressing of data
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/74—Selecting or encoding within a word the position of one or more bits having a specified value, e.g. most or least significant one or zero detection, priority encoders
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0602—Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
- G06F3/0608—Saving storage space on storage systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0655—Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
- G06F3/0661—Format or protocol conversion arrangements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/0671—In-line storage system
- G06F3/0673—Single storage device
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/0671—In-line storage system
- G06F3/0673—Single storage device
- G06F3/0679—Non-volatile semiconductor memory device, e.g. flash memory, one time programmable memory [OTP]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/06—Arrangements for sorting, selecting, merging, or comparing data on individual record carriers
- G06F7/08—Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/4031—Fixed length to variable length coding
- H03M7/4037—Prefix coding
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/4031—Fixed length to variable length coding
- H03M7/4037—Prefix coding
- H03M7/4043—Adaptive prefix coding
- H03M7/4056—Coding table selection
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/4031—Fixed length to variable length coding
- H03M7/4037—Prefix coding
- H03M7/4043—Adaptive prefix coding
- H03M7/4062—Coding table adaptation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/542,872 US11309911B2 (en) | 2019-08-16 | 2019-08-16 | Semi-sorting compression with encoding and decoding tables |
| US16/542,872 | 2019-08-16 | | |
| PCT/US2020/045903 WO2021034565A1 (en) | 2019-08-16 | 2020-08-12 | Semi-sorting compression with encoding and decoding tables |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR20220049540A (ko) | 2022-04-21 |
| KR102824624B1 (ko) | 2025-06-24 |
Family
ID=74567451
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020227007999A (Active; KR102824624B1, ko) | 2019-08-16 | 2020-08-12 | Semi-sorting compression with encoding and decoding tables |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US11309911B2 |
| EP (1) | EP4014128A4 |
| JP (1) | JP7631308B2 |
| KR (1) | KR102824624B1 |
| CN (1) | CN114258521B |
| WO (1) | WO2021034565A1 |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11309911B2 (en) * | 2019-08-16 | 2022-04-19 | Advanced Micro Devices, Inc. | Semi-sorting compression with encoding and decoding tables |
| US20240185021A1 (en) * | 2022-12-01 | 2024-06-06 | Western Digital Technologies, Inc. | Pre-encoding method for dna storage |
Family Cites Families (22)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5442630A (en) * | 1991-05-24 | 1995-08-15 | Gagliardi; Ugo O. | ISDN interfacing of local area networks |
| US5704060A (en) | 1995-05-22 | 1997-12-30 | Del Monte; Michael G. | Text storage and retrieval system and method |
| US6489902B2 (en) | 1997-12-02 | 2002-12-03 | Hughes Electronics Corporation | Data compression for use with a communications channel |
| US6121903A (en) | 1998-01-27 | 2000-09-19 | Infit Communications Ltd. | On-the-fly data re-compression |
| US6650259B1 (en) | 2002-05-06 | 2003-11-18 | Unisys Corporation | Character table implemented data decompression method and apparatus |
| US8238923B2 (en) | 2004-12-22 | 2012-08-07 | Qualcomm Incorporated | Method of using shared resources in a communication system |
| WO2007108395A1 (ja) * | 2006-03-23 | 2007-09-27 | Nec Corporation | Variable-length code decoding device and decoding method |
| US8458457B2 (en) * | 2007-02-02 | 2013-06-04 | Red Hat, Inc. | Method and system for certificate revocation list pre-compression encoding |
| US20080317364A1 (en) * | 2007-06-25 | 2008-12-25 | Augusta Technology, Inc. | Methods for determining neighboring locations for partitions of a video stream |
| US7609179B2 (en) | 2008-01-08 | 2009-10-27 | International Business Machines Corporation | Method for compressed data with reduced dictionary sizes by coding value prefixes |
| EP2164176A1 (en) * | 2008-09-12 | 2010-03-17 | Thomson Licensing | Method for lossless compressing prefix-suffix-codes, method for decompressing a bit sequence representing integers or symbols encoded in compressed prefix-suffix-codes and storage medium or signal carrying compressed prefix-suffix-codes |
| TW201141081A (en) * | 2009-12-29 | 2011-11-16 | Ibm | Prefix-offset encoding method for data compression |
| US20120110025A1 (en) | 2010-10-28 | 2012-05-03 | Qualcomm Incorporated | Coding order-independent collections of words |
| US8606772B1 (en) | 2011-01-26 | 2013-12-10 | Trend Micro Incorporated | Efficient multiple-keyword match technique with large dictionaries |
| KR101672107B1 (ko) | 2011-11-08 | 2016-11-02 | 구글 테크놀로지 홀딩스 엘엘씨 | 변환 계수들에 대한 이진 코드워드들을 결정하는 방법 |
| US10235377B2 (en) | 2013-12-23 | 2019-03-19 | Sap Se | Adaptive dictionary compression/decompression for column-store databases |
| GB2531005A (en) * | 2014-10-06 | 2016-04-13 | Canon Kk | Improved encoding process using a palette mode |
| US9647684B2 (en) | 2014-10-21 | 2017-05-09 | Huawei Technologies Co., Ltd. | Memory-based history search |
| US10191682B2 (en) * | 2016-09-08 | 2019-01-29 | Qualcomm Incorporated | Providing efficient lossless compression for small data blocks in processor-based systems |
| US10613791B2 (en) | 2017-06-12 | 2020-04-07 | Pure Storage, Inc. | Portable snapshot replication between storage systems |
| US10838961B2 (en) * | 2017-09-29 | 2020-11-17 | Oracle International Corporation | Prefix compression |
| US11309911B2 (en) * | 2019-08-16 | 2022-04-19 | Advanced Micro Devices, Inc. | Semi-sorting compression with encoding and decoding tables |
2019
- 2019-08-16: US application US16/542,872, publication US11309911B2 (en), status Active
2020
- 2020-08-12: CN application CN202080058010.4A, publication CN114258521B (zh), status Active
- 2020-08-12: KR application KR1020227007999A, publication KR102824624B1 (ko), status Active
- 2020-08-12: EP application EP20854389.2A, publication EP4014128A4 (en), status Pending
- 2020-08-12: JP application JP2022508996A, publication JP7631308B2 (ja), status Active
- 2020-08-12: WO application PCT/US2020/045903, publication WO2021034565A1 (en), status Ceased
2022
- 2022-04-18: US application US17/722,931, publication US11736119B2 (en), status Active
Also Published As
| Publication number | Publication date |
|---|---|
| JP7631308B2 (ja) | 2025-02-18 |
| US11736119B2 (en) | 2023-08-22 |
| CN114258521B (zh) | 2025-09-02 |
| WO2021034565A1 (en) | 2021-02-25 |
| US11309911B2 (en) | 2022-04-19 |
| EP4014128A1 (en) | 2022-06-22 |
| KR20220049540A (ko) | 2022-04-21 |
| CN114258521A (zh) | 2022-03-29 |
| JP2022545644A (ja) | 2022-10-28 |
| EP4014128A4 (en) | 2023-08-09 |
| US20220239315A1 (en) | 2022-07-28 |
| US20210050864A1 (en) | 2021-02-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Pibiri et al. | | Techniques for inverted index compression |
| US20200387488A1 (en) | | Space efficient vector for columnar data storage |
| US11762828B2 (en) | | Cuckoo filters and cuckoo hash tables with biasing, compression, and decoupled logical sparsity |
| CN101937448B (zh) | | Dictionary-based order-preserving string compression for main memory column stores |
| US11669521B2 (en) | | Accelerated filtering, grouping and aggregation in a database system |
| JP2014534486A (ja) | | Method, system, and computer program for scalable data duplication |
| RU2633178C2 (ru) | | Method and database system for indexing references to database documents |
| Klein et al. | | Parallel lempel ziv coding |
| CN103914483B (zh) | | File storage method and device, and file reading method and device |
| Qiao et al. | | An FPGA-based BWT accelerator for Bzip2 data compression |
| KR102824624B1 (ko) | | Semi-sorting compression with encoding and decoding tables |
| Jansson et al. | | CRAM: Compressed random access memory |
| US12001339B2 (en) | | Dictionary compression device and memory system |
| US10749545B1 (en) | | Compressing tags in software and hardware semi-sorted caches |
| Bertola et al. | | A Class of Heuristics for Reducing the Number of BWT-Runs in the String Ordering Problem |
| Culpepper et al. | | Revisiting bounded context block‐sorting transformations |
| CN115495479A (zh) | | Data query method and device with variable fields |
| US12489461B2 (en) | | Compression device and compression method |
| Díaz-Domínguez et al. | | Algorithms for Computing Very Large BWTs: a Short Survey |
| US12298952B1 (en) | | Multiple pass sort with subset splitting |
| Liu et al. | | An Efficient Direct Access Algorithm for Integer Compression |
| JP5419069B2 (ja) | | Database device, database management method, database data structure, database management program, and computer-readable recording medium recording the same |
| Pavlo | | Lecture# 03: Data Formats & Encoding II |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PA0105 | International application | Patent event date: 20220310; Patent event code: PA01051R01D; Comment text: International Patent Application |
| | PG1501 | Laying open of application | |
| | E902 | Notification of reason for refusal | |
| | PE0902 | Notice of grounds for rejection | Comment text: Notification of reason for refusal; Patent event date: 20240719; Patent event code: PE09021S01D |
| | E701 | Decision to grant or registration of patent right | |
| | PE0701 | Decision of registration | Patent event code: PE07011S01D; Comment text: Decision to Grant Registration; Patent event date: 20250412 |
| | GRNT | Written decision to grant | |
| | PR0701 | Registration of establishment | Comment text: Registration of Establishment; Patent event date: 20250619; Patent event code: PR07011E01D |
| | PR1002 | Payment of registration fee | Payment date: 20250620; End annual number: 3; Start annual number: 1 |
| | PG1601 | Publication of registration |