CN114258521B - Semi-sorting compression with encoding and decoding tables - Google Patents

Semi-sorting compression with encoding and decoding tables

Info

Publication number
CN114258521B
Authority
CN
China
Prior art keywords
prefix
data
codeword
prefixes
encoding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202080058010.4A
Other languages
English (en)
Chinese (zh)
Other versions
CN114258521A (zh)
Inventor
Alexander D. Breslow
Nuwan Jayasena
John Kalamatianos
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced Micro Devices Inc
Original Assignee
Advanced Micro Devices Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advanced Micro Devices Inc
Publication of CN114258521A
Application granted
Publication of CN114258521B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0628 Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638 Organizing or formatting or addressing of data
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00 Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/74 Selecting or encoding within a word the position of one or more bits having a specified value, e.g. most or least significant one or zero detection, priority encoders
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0602 Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0608 Saving storage space on storage systems
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0628 Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655 Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0661 Format or protocol conversion arrangements
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0668 Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671 In-line storage system
    • G06F3/0673 Single storage device
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0668 Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671 In-line storage system
    • G06F3/0673 Single storage device
    • G06F3/0679 Non-volatile semiconductor memory device, e.g. flash memory, one time programmable memory [OTP]
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00 Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/06 Arrangements for sorting, selecting, merging, or comparing data on individual record carriers
    • G06F7/08 Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40 Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4031 Fixed length to variable length coding
    • H03M7/4037 Prefix coding
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40 Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4031 Fixed length to variable length coding
    • H03M7/4037 Prefix coding
    • H03M7/4043 Adaptive prefix coding
    • H03M7/4056 Coding table selection
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40 Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4031 Fixed length to variable length coding
    • H03M7/4037 Prefix coding
    • H03M7/4043 Adaptive prefix coding
    • H03M7/4062 Coding table adaptation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
CN202080058010.4A 2019-08-16 2020-08-12 Semi-sorting compression with encoding and decoding tables Active CN114258521B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/542,872 US11309911B2 (en) 2019-08-16 2019-08-16 Semi-sorting compression with encoding and decoding tables
US16/542,872 2019-08-16
PCT/US2020/045903 WO2021034565A1 (en) 2019-08-16 2020-08-12 Semi-sorting compression with encoding and decoding tables

Publications (2)

Publication Number Publication Date
CN114258521A CN114258521A (zh) 2022-03-29
CN114258521B CN114258521B (zh) 2025-09-02

Family

ID=74567451

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080058010.4A Active CN114258521B (zh) 2019-08-16 2020-08-12 Semi-sorting compression with encoding and decoding tables

Country Status (6)

Country Link
US (2) US11309911B2
EP (1) EP4014128A4
JP (1) JP7631308B2
KR (1) KR102824624B1
CN (1) CN114258521B
WO (1) WO2021034565A1

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11309911B2 (en) * 2019-08-16 2022-04-19 Advanced Micro Devices, Inc. Semi-sorting compression with encoding and decoding tables
US20240185021A1 (en) * 2022-12-01 2024-06-06 Western Digital Technologies, Inc. Pre-encoding method for dna storage

Citations (1)

Publication number Priority date Publication date Assignee Title
CN109661780A (zh) * 2016-09-08 2019-04-19 Qualcomm Incorporated Implementing efficient lossless compression for small data blocks in processor-based systems

Family Cites Families (21)

Publication number Priority date Publication date Assignee Title
US5442630A (en) * 1991-05-24 1995-08-15 Gagliardi; Ugo O. ISDN interfacing of local area networks
US5704060A (en) 1995-05-22 1997-12-30 Del Monte; Michael G. Text storage and retrieval system and method
US6489902B2 (en) 1997-12-02 2002-12-03 Hughes Electronics Corporation Data compression for use with a communications channel
US6121903A (en) 1998-01-27 2000-09-19 Infit Communications Ltd. On-the-fly data re-compression
US6650259B1 (en) 2002-05-06 2003-11-18 Unisys Corporation Character table implemented data decompression method and apparatus
US8238923B2 (en) 2004-12-22 2012-08-07 Qualcomm Incorporated Method of using shared resources in a communication system
WO2007108395A1 (ja) * 2006-03-23 2007-09-27 Nec Corporation Variable-length code decoding device and decoding method
US8458457B2 (en) * 2007-02-02 2013-06-04 Red Hat, Inc. Method and system for certificate revocation list pre-compression encoding
US20080317364A1 (en) * 2007-06-25 2008-12-25 Augusta Technology, Inc. Methods for determining neighboring locations for partitions of a video stream
US7609179B2 (en) 2008-01-08 2009-10-27 International Business Machines Corporation Method for compressed data with reduced dictionary sizes by coding value prefixes
EP2164176A1 (en) * 2008-09-12 2010-03-17 Thomson Licensing Method for lossless compressing prefix-suffix-codes, method for decompressing a bit sequence representing integers or symbols encoded in compressed prefix-suffix-codes and storage medium or signal carrying compressed prefix-suffix-codes
TW201141081A (en) * 2009-12-29 2011-11-16 Ibm Prefix-offset encoding method for data compression
US20120110025A1 (en) 2010-10-28 2012-05-03 Qualcomm Incorporated Coding order-independent collections of words
US8606772B1 (en) 2011-01-26 2013-12-10 Trend Micro Incorporated Efficient multiple-keyword match technique with large dictionaries
KR101672107B1 (ko) 2011-11-08 2016-11-02 Google Technology Holdings LLC Method of determining binary codewords for transform coefficients
US10235377B2 (en) 2013-12-23 2019-03-19 Sap Se Adaptive dictionary compression/decompression for column-store databases
GB2531005A (en) * 2014-10-06 2016-04-13 Canon Kk Improved encoding process using a palette mode
US9647684B2 (en) 2014-10-21 2017-05-09 Huawei Technologies Co., Ltd. Memory-based history search
US10613791B2 (en) 2017-06-12 2020-04-07 Pure Storage, Inc. Portable snapshot replication between storage systems
US10838961B2 (en) * 2017-09-29 2020-11-17 Oracle International Corporation Prefix compression
US11309911B2 (en) * 2019-08-16 2022-04-19 Advanced Micro Devices, Inc. Semi-sorting compression with encoding and decoding tables

Patent Citations (1)

Publication number Priority date Publication date Assignee Title
CN109661780A (zh) * 2016-09-08 2019-04-19 Qualcomm Incorporated Implementing efficient lossless compression for small data blocks in processor-based systems

Also Published As

Publication number Publication date
JP7631308B2 (ja) 2025-02-18
US11736119B2 (en) 2023-08-22
WO2021034565A1 (en) 2021-02-25
US11309911B2 (en) 2022-04-19
EP4014128A1 (en) 2022-06-22
KR20220049540A (ko) 2022-04-21
CN114258521A (zh) 2022-03-29
JP2022545644A (ja) 2022-10-28
EP4014128A4 (en) 2023-08-09
KR102824624B1 (ko) 2025-06-24
US20220239315A1 (en) 2022-07-28
US20210050864A1 (en) 2021-02-18

Similar Documents

Publication Publication Date Title
US11669521B2 (en) Accelerated filtering, grouping and aggregation in a database system
RU2633178C2 (ru) Method and database system for indexing references to database documents
CN114258521B (zh) Semi-sorting compression with encoding and decoding tables
CN114730295B (zh) Pattern-based cache block compression
US11139828B2 (en) Memory compression method and apparatus
JP2026010210A (ja) Device for processing received data
JP2009512099A (ja) Method and apparatus for restartable hashing in a trie
US8976048B2 (en) Efficient processing of Huffman encoded data
US10749545B1 (en) Compressing tags in software and hardware semi-sorted caches
US8010510B1 (en) Method and system for tokenized stream compression
Bertola et al. A Class of Heuristics for Reducing the Number of BWT-Runs in the String Ordering Problem
Culpepper et al. Revisiting bounded context block‐sorting transformations
US7254689B1 (en) Decompression of block-sorted data
US12489461B2 (en) Compression device and compression method
Díaz-Domínguez et al. Algorithms for Computing Very Large BWTs: a Short Survey
JP7849406B2 (ja) Storage system
Safieh The Parallel Dictionary LZW Algorithm for Flash Memory Controllers
Waidyasooriya et al. An FPGA architecture for text search using a wavelet-tree-based succinct-data-structure
Liu et al. An Efficient Direct Access Algorithm for Integer Compression
Zhang Efficient Parallel Text Compression on GPUs

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant