CN114258521B - 使用编码及解码表的半分类压缩 - Google Patents
使用编码及解码表的半分类压缩Info
- Publication number
- CN114258521B CN114258521B CN202080058010.4A CN202080058010A CN114258521B CN 114258521 B CN114258521 B CN 114258521B CN 202080058010 A CN202080058010 A CN 202080058010A CN 114258521 B CN114258521 B CN 114258521B
- Authority
- CN
- China
- Prior art keywords
- prefix
- data
- codeword
- prefixes
- encoding
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0638—Organizing or formatting or addressing of data
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/74—Selecting or encoding within a word the position of one or more bits having a specified value, e.g. most or least significant one or zero detection, priority encoders
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0602—Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
- G06F3/0608—Saving storage space on storage systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0655—Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
- G06F3/0661—Format or protocol conversion arrangements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/0671—In-line storage system
- G06F3/0673—Single storage device
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/0671—In-line storage system
- G06F3/0673—Single storage device
- G06F3/0679—Non-volatile semiconductor memory device, e.g. flash memory, one time programmable memory [OTP]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/06—Arrangements for sorting, selecting, merging, or comparing data on individual record carriers
- G06F7/08—Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/4031—Fixed length to variable length coding
- H03M7/4037—Prefix coding
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/4031—Fixed length to variable length coding
- H03M7/4037—Prefix coding
- H03M7/4043—Adaptive prefix coding
- H03M7/4056—Coding table selection
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/4031—Fixed length to variable length coding
- H03M7/4037—Prefix coding
- H03M7/4043—Adaptive prefix coding
- H03M7/4062—Coding table adaptation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/542,872 US11309911B2 (en) | 2019-08-16 | 2019-08-16 | Semi-sorting compression with encoding and decoding tables |
| US16/542,872 | 2019-08-16 | ||
| PCT/US2020/045903 WO2021034565A1 (en) | 2019-08-16 | 2020-08-12 | Semi-sorting compression with encoding and decoding tables |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN114258521A CN114258521A (zh) | 2022-03-29 |
| CN114258521B true CN114258521B (zh) | 2025-09-02 |
Family
ID=74567451
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202080058010.4A Active CN114258521B (zh) | 2019-08-16 | 2020-08-12 | 使用编码及解码表的半分类压缩 |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US11309911B2 (https=) |
| EP (1) | EP4014128A4 (https=) |
| JP (1) | JP7631308B2 (https=) |
| KR (1) | KR102824624B1 (https=) |
| CN (1) | CN114258521B (https=) |
| WO (1) | WO2021034565A1 (https=) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11309911B2 (en) * | 2019-08-16 | 2022-04-19 | Advanced Micro Devices, Inc. | Semi-sorting compression with encoding and decoding tables |
| US20240185021A1 (en) * | 2022-12-01 | 2024-06-06 | Western Digital Technologies, Inc. | Pre-encoding method for dna storage |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109661780A (zh) * | 2016-09-08 | 2019-04-19 | 高通股份有限公司 | 实现对基于处理器的系统中的小数据块的高效无损压缩 |
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5442630A (en) * | 1991-05-24 | 1995-08-15 | Gagliardi; Ugo O. | ISDN interfacing of local area networks |
| US5704060A (en) | 1995-05-22 | 1997-12-30 | Del Monte; Michael G. | Text storage and retrieval system and method |
| US6489902B2 (en) | 1997-12-02 | 2002-12-03 | Hughes Electronics Corporation | Data compression for use with a communications channel |
| US6121903A (en) | 1998-01-27 | 2000-09-19 | Infit Communications Ltd. | On-the-fly data re-compression |
| US6650259B1 (en) | 2002-05-06 | 2003-11-18 | Unisys Corporation | Character table implemented data decompression method and apparatus |
| US8238923B2 (en) | 2004-12-22 | 2012-08-07 | Qualcomm Incorporated | Method of using shared resources in a communication system |
| WO2007108395A1 (ja) * | 2006-03-23 | 2007-09-27 | Nec Corporation | 可変長符号の復号装置および復号方法 |
| US8458457B2 (en) * | 2007-02-02 | 2013-06-04 | Red Hat, Inc. | Method and system for certificate revocation list pre-compression encoding |
| US20080317364A1 (en) * | 2007-06-25 | 2008-12-25 | Augusta Technology, Inc. | Methods for determining neighboring locations for partitions of a video stream |
| US7609179B2 (en) | 2008-01-08 | 2009-10-27 | International Business Machines Corporation | Method for compressed data with reduced dictionary sizes by coding value prefixes |
| EP2164176A1 (en) * | 2008-09-12 | 2010-03-17 | Thomson Licensing | Method for lossless compressing prefix-suffix-codes, method for decompressing a bit sequence representing integers or symbols encoded in compressed prefix-suffix-codes and storage medium or signal carrying compressed prefix-suffix-codes |
| TW201141081A (en) * | 2009-12-29 | 2011-11-16 | Ibm | Prefix-offset encoding method for data compression |
| US20120110025A1 (en) | 2010-10-28 | 2012-05-03 | Qualcomm Incorporated | Coding order-independent collections of words |
| US8606772B1 (en) | 2011-01-26 | 2013-12-10 | Trend Micro Incorporated | Efficient multiple-keyword match technique with large dictionaries |
| KR101672107B1 (ko) | 2011-11-08 | 2016-11-02 | 구글 테크놀로지 홀딩스 엘엘씨 | 변환 계수들에 대한 이진 코드워드들을 결정하는 방법 |
| US10235377B2 (en) | 2013-12-23 | 2019-03-19 | Sap Se | Adaptive dictionary compression/decompression for column-store databases |
| GB2531005A (en) * | 2014-10-06 | 2016-04-13 | Canon Kk | Improved encoding process using a palette mode |
| US9647684B2 (en) | 2014-10-21 | 2017-05-09 | Huawei Technologies Co., Ltd. | Memory-based history search |
| US10613791B2 (en) | 2017-06-12 | 2020-04-07 | Pure Storage, Inc. | Portable snapshot replication between storage systems |
| US10838961B2 (en) * | 2017-09-29 | 2020-11-17 | Oracle International Corporation | Prefix compression |
| US11309911B2 (en) * | 2019-08-16 | 2022-04-19 | Advanced Micro Devices, Inc. | Semi-sorting compression with encoding and decoding tables |
-
2019
- 2019-08-16 US US16/542,872 patent/US11309911B2/en active Active
-
2020
- 2020-08-12 CN CN202080058010.4A patent/CN114258521B/zh active Active
- 2020-08-12 KR KR1020227007999A patent/KR102824624B1/ko active Active
- 2020-08-12 EP EP20854389.2A patent/EP4014128A4/en active Pending
- 2020-08-12 JP JP2022508996A patent/JP7631308B2/ja active Active
- 2020-08-12 WO PCT/US2020/045903 patent/WO2021034565A1/en not_active Ceased
-
2022
- 2022-04-18 US US17/722,931 patent/US11736119B2/en active Active
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109661780A (zh) * | 2016-09-08 | 2019-04-19 | 高通股份有限公司 | 实现对基于处理器的系统中的小数据块的高效无损压缩 |
Also Published As
| Publication number | Publication date |
|---|---|
| JP7631308B2 (ja) | 2025-02-18 |
| US11736119B2 (en) | 2023-08-22 |
| WO2021034565A1 (en) | 2021-02-25 |
| US11309911B2 (en) | 2022-04-19 |
| EP4014128A1 (en) | 2022-06-22 |
| KR20220049540A (ko) | 2022-04-21 |
| CN114258521A (zh) | 2022-03-29 |
| JP2022545644A (ja) | 2022-10-28 |
| EP4014128A4 (en) | 2023-08-09 |
| KR102824624B1 (ko) | 2025-06-24 |
| US20220239315A1 (en) | 2022-07-28 |
| US20210050864A1 (en) | 2021-02-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11669521B2 (en) | Accelerated filtering, grouping and aggregation in a database system | |
| RU2633178C2 (ru) | Способ и система базы данных для индексирования ссылок на документы базы данных | |
| CN114258521B (zh) | 使用编码及解码表的半分类压缩 | |
| CN114730295B (zh) | 基于模式的高速缓存块压缩 | |
| US11139828B2 (en) | Memory compression method and apparatus | |
| JP2026010210A (ja) | 受信したデータを処理する装置 | |
| JP2009512099A (ja) | トライでの再始動可能なハッシュの方法及び装置 | |
| US8976048B2 (en) | Efficient processing of Huffman encoded data | |
| US10749545B1 (en) | Compressing tags in software and hardware semi-sorted caches | |
| US8010510B1 (en) | Method and system for tokenized stream compression | |
| Bertola et al. | A Class of Heuristics for Reducing the Number of BWT-Runs in the String Ordering Problem | |
| Culpepper et al. | Revisiting bounded context block‐sorting transformations | |
| US7254689B1 (en) | Decompression of block-sorted data | |
| US12489461B2 (en) | Compression device and compression method | |
| Díaz-Domínguez et al. | Algorithms for Computing Very Large BWTs: a Short Survey | |
| JP7849406B2 (ja) | ストレージシステム | |
| Safieh | The Parallel Dictionary LZW Algorithm for Flash Memory Controllers | |
| Waidyasooriya et al. | An fpga architecture for text search using a wavelet-tree-based succinct-data-structure | |
| Liu et al. | An Efficient Direct Access Algorithm for Integer Compression | |
| Zhang | Efficient Parallel Text Compression on GPUs |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant |